6,264 Matching Annotations
  1. Last 7 days
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      _Below we address all the comments by the reviewers. However, the figures that were used in our response are unfortunately not displayed in this format. _

      Reviewer #1

      Evidence, reproducibility and clarity

      Thanks to the development of Ribo-Seq, translational buffering has been reported in the database of published Ribo-Seq and matched RNA-Seq, Rao et al. attempt to understand the mechanism underlying translational buffering of mRNA variation across diverse materials. Although the authors' report provides a step forward in our understanding of translational buffering, this reviewer found a series of concerns in this paper. These points could be tackled to improve the reliability of their findings, the strength of their main message, and the global understandability of the paper.

      Major comments: 1. This paper heavily relies on the reference 18. However, this paper was not properly stated (no page or journal number); the study in Bioinformatics is nowhere to be found on the website, despite being out in 2024 apparently. Either title is wrong (yet a biorxiv can be found). This reviewer guessed that the reference 18 may be accepted. However, without a proper reference, this paper could not be judged since nearly all the parts of this work have been based on the reference 18. Also, the Ribobase data used in this manuscript comes from this reference, so it had better be well defined, especially when another Ribobase data set seems to be available online: http://www.bioinf.uni-freiburg.de/~ribobase/index.html

      We apologize for the citation issue. This citation by Liu et al , 2024 (18) was a preprint from BioRxiv. This manuscript is now published in Nature Biotechnology. The reference has been updated in the revised version of the manuscript. The reference number in revised manuscript is Liu et al, 2025 (23).

      In the Discussion, the authors mentioned "TE is based on a compositional regression model (18) rather than the commonly applied approach of using a logarithmic ratio of ribosome occupancy to mRNA abundance." This important information should be mentioned in the early section of the manuscript. Related to this, there are other published methods for exploring change in translation efficiency (e.g., 10.1093/bioinformatics/btw585; 10.1093/nar/gkz223) that could also be suitable in this context. It is not entirely clear if their approach is better than before. Again, the improper reference to 18 made our assessment of this work difficult.

      We apologize and acknowledge the impact of the citation issue on this point. In Liu et al (2025), we have provided a comparison between our approach and the log-ratio strategy. We also agree that additional context was needed within the current study. Hence, we have now included more detailed information about the TE calculations in the initial results section (line 94).

      As noted by the reviewer, several other methods have been developed previously for measuring changes in translation efficiency. These methods are designed to be used in cases of paired designs where there is a treatment or manipulation that is assayed along with controls. While these methods are highly valuable in assessing differential TE, they are unable to accommodate the type of meta-analyses described in our study. In particular, we do not report changes/differential TE with respect to a control sample but instead focus on the coordinated patterns of TE across experiments. We now note this important distinction in the manuscript in the discussion section (line 494).

      The paper mainly relies on detecting a set of buffered genes using mRNA-TE correlation and MAD ratios (Ribo-Seq/RNA-Seq). While the concept seems sound, the authors should ensure that this method is reliable. Several controls could be used to confirm this. First, if any studies in humans or mice have described a set of genes as buffered, it would be worth checking for overlap between the authors' set of 'TB high' genes and the previously established list. Furthermore, the authors could use packages explicitly developed for translational buffering detection, such as annota2seq (https://academic.oup.com/nar/article/47/12/e70/5423604?login=true). Not all of the data used by the authors may be suitable for such packages, but the authors could at least partially use them on some of their datasets and see whether the buffered genes reported by these packages match their predictions.

      We thank the reviewer for this constructive suggestion. To the best of our knowledge, no prior study in humans or mice has systematically analyzed translational buffering across a wide range of conditions. As a result, defining a gold-standard set for benchmarking is currently not feasible.

      While packages such as anota2seq have proven highly valuable for identifying buffering effects in controlled experimental designs (e.g., comparing a treatment to a matched control), they are not readily applicable to the type of large-scale meta-analysis we present here.Our study integrates ribosome profiling and RNA-seq data across diverse datasets and conditions, which lies outside the design scope of such tools.

      The most relevant point of comparison to our work is Wang et al. 2020 Nature, which examined a related but distinct form of translational buffering across species for a given tissue. We now present the overlap of genes identified as buffered in our study vs Wang et al. 2020. The details are presented in the reviewer's comment 5-2.

      The threshold of 'TB high' or 'TB low' (top and bottom 250) is somewhat arbitrary. Why not top 100 or 500? The authors should provide a rationale for this choice. Also, they could include a numeric measure of buffering (the sum of the two rankings is probably suitable for this purpose). Several of the authors' explorations are suitable for numerical quantification (GO enrichment can be turned into GSEA, and the boxplot can be shown as correlations)

      Thanks for these suggestions. We agree that the threshold used to define TB high and low are somewhat subjective. We ensure that changing this cutoff as suggested is easily achievable with the provided R script. These can be used to reproduce all of the reported analyses of translational buffering with different cutoffs.

      To further assess whether our conclusions are robust to the selection of these thresholds, we tested several different values to define the TB high and TB low groups. As an example, we show here that the effect on protein variation and association of intrinsic features like the UTR lengths with the buffering potential of genes for different thresholds (i.e. if the TB high = top 100 or TB high = top 200) remain similar to the current cutoff of 250. However, if we increase the cutoff of TB high to 2000 and TB low to top 2000-4000 , the difference between the various features is diminished (Figure A& B). Further, protein variation (human cancer cell line and tissue) also becomes more similar across the three categories, possibly indicating a reduced regulatory potential of genes as their rank increases (Figure C& D).Our analyses reveal that highly ranked genes show associations with particular features, indicating an underlying hierarchy in translational buffering potential. This point is now discussed in the manuscript (line 177).

      Legend: Effect of different thresholds on . A. Length features B. Median RNA expression C. Protein variation in human cancer cell line and D. on Primary human tissues

      In response to the reviewer's suggestion of presenting data using numerical quantitation, we incorporated several additional inclusions in the manuscript.

      1. We now report association of CDS / UTR length with translational buffering as a function of their translational buffering rank with highly ranked genes showing associations with particular features, indicating an underlying hierarchy in translational buffering potential (Sup Fig 3 A-B) Ii. We now include scatter plots which show that highly ranked genes have lower variation at the protein level in both cancer cell line and primary tissues (Sup Fig. 6 A-C).

      Iii. We have now carried out modified GO enrichment analyses. Specifically, Gene Ontology enrichment analysis was performed for the TB high genes in humans and mouse using the clusterProfiler R package. Lists of TB high genes in human or mouse were analyzed against the Gene Ontology (GO) database using the enrichGO() function, with the organism-specific annotation database (org.Hs.eg.db for human or org.Mm.eg.db for mouse) as reference. Gene identifiers were supplied as gene symbols, and all genes in the current study were used as the background universe. Enrichment was carried out for the Biological Process (BP) ontology, with significance assessed by the hypergeometric test. P-values were adjusted for multiple testing using the Benjamini–Hochberg method, and terms with an adjusted p-value Legend: Gene Ontology (GO) enrichment analysis of the TB high gene set, performed with the clusterProfiler R package. Enriched GO Biological Process terms are shown after redundancy reduction using clusterProfiler::simplify. Each dot represents a GO term, with dot size indicating the number of genes associated with the term and color reflecting the adjusted p-value (Benjamini–Hochberg correction). Only the top non-redundant terms are displayed.

      • *

      Additionally, we performed Gene set enrichment analysis using the list of genes ordered according to their RNA-TE correlation. Hence lower ranks have lower RNA-TE correlations. The GSEA plots show significantly enriched Gene Ontology Biological Process (GO:BP) terms at the lower ranks of the ordered gene list. Together, these analyses further emphasize the observation that genes involved in macromolecular complexes are translationally buffered.

      • *

      Legend: Curves represent the enrichment score (ES) across the ranked gene list, with vertical bars indicating the positions of pathway-associated genes. The enrichment was identified using the gseGO() function from clusterProfiler.

      Several of the statements of the authors in the Introduction or Discussion sections are not entirely true regarding the literature on the topics, or lack major papers on the topic, and therefore, they are a bit misleading. Among others, here are some:

      We thank the reviewer for the suggestions and now have been incorporated in the revised manuscript, accordingly.

      5-1 "In addition, genetic differences arising from aneuploidy, cell type differences or variability observed in the natural population can further determine the amplitude of variation (4-7). The effect of mRNA variation under these conditions is mostly reflected at the protein levels (2, 4-8).". Several recent or more ancient papers suggest that mRNA variation coming from aneuploidy, natural genetic variation, or CNV is buffered or not well reflected at the protein level: DOI: 10.1038/s41586-024-07442-9 DOI: 10.1073/pnas.2319211121 DOI: 10.1016/j.cels.2017.08.013 DOI: 10.15252/msb.20177548

      We agree that mRNA variation coming from aneuploidy, natural genetic variation, or CNV is buffered or not well reflected at the protein level for some genes. This point has now been revised in the introduction. We have incorporated all the suggested literature into the revised manuscript (line 38).

      5-2: The authors should also consider mentioning these studies and softening their initial statement. "Similarly, translational buffering of certain genes have been reported in mammalian cells, specifically under estrogen receptor alpha (ERα) depletion conditions (16).". Translational buffering has been deeply explored in mammalian tissues and even across several mammalian species in this study (DOI: 10.1038/s41586-020-2899-z). In this, the authors also provide a nice exploration of the gene characteristics that are associated with translational buffering. The authors should mention it and compare the study's findings to theirs ultimately.

      We thank the reviewer for this suggestion. We have now cited the recommended study in the revised manuscript (line 65). Here, we provide a comparison of its findings with ours. While this related work offers important insights into translational buffering, its focus is on buffering across species within a given tissue, whereas our study emphasizes buffering across conditions, cell types, and treatments within a species. Despite this difference in focus, the comparison is highly informative, and we now highlight both the similarities and distinctions between the two studies in the relevant section of the revised manuscript.

      Wang et al. calculate the variation at the transcriptome level vs at the translatome level and is represented as delta ∆ value for each gene. A lower value represents lower variation at the ribosome occupancy level than at the mRNA levels across various species. We classified the genes in the Wang et al study as TB high, TB low genes or others as identified in the current study while indicating the calculated delta ∆ from Wang et al. Many of the genes with a lower delta value (are delta ∆ Legend: A. Dot plot to highlight the delta value of all genes in the Wang et al study (also present in RiboBase) which are further grouped as TB high, low or others in (A) brain and (B) liver.

      5-3: "Differences in species evaluated and statistical methods have resulted in conflicting interpretations (13, 28).". These conflicting results have been previously discussed in reviews on the topic that would be worth mentioning: DOI: 10.1016/j.cell.2016.03.014 DOI: 10.1038/s41576-020-0258-4

      We have added these reviews at the appropriate location of the manuscript.

      1. In addition to the p-values stated in the main text, the authors should annotate their plots when they find significant differences between groups to greatly facilitate the visual interpretation of the graphs.

      We have now annotated many of the relevant graphs with p-values to facilitate visual interpretation, adding them where space and figure design allow.

      Based on the data of Figure 4D, apparently, ribosome occupancy was not buffered even in high TB sets. The authors may argue that translational buffering may not cope with such a strong mRNA reduction. In that case, how big a difference in mRNA level does the buffering system adjust in protein synthesis? The authors should test gradual gene knockdown and/or overexpression and conduct Ribo-Seq/RNA-Seq to survey the buffering range.

      We appreciate the reviewer’s suggestion regarding the experiment to determine the buffering range.To understand this for multiple genes, we attempted a series of knockdowns using CRISPR/gRNA approach using a MutiCas12a approach. We targeted 8 buffered and 2 non-buffered genes using a 10-plex crRNA along with 10-plex gRNA serving as a negative control (Figure below). The fold change at the mRNA level of the targeted gene was within the variation range observed in replicates for other non-targeted genes. The challenge in performing a gradual knockdown is the subtle changes in RNA expression falls within the margin of error of estimation, making it difficult to understand the clear implications of the mRNA levels on buffering. Hence, the precise experimental manipulation of mRNA expression levels that would be conducive to translational buffering remains highly technically challenging. As noted in our manuscript (Figure 4D), the conventional approaches for manipulation of transcript abundance lead to larger changes than typically observed as a result of natural variation.

      *Legend: Validation of translational buffering by targeted knockdown of genes. A. The scatter plot shows the coefficient of variation of mRNA and ribosome occupancy between HEK293T cells targeted with sgRNA of different efficiencies. The genes indicated in blue are buffered and those in green are non buffered genes. B. The plot shows the fold change in mRNA abundance and ribosome occupancy as compared to cells that were infected with non-targeting crRNA array control (ratio of cpm in test vs control). Each color represents a gene and each point of a gene represents cells targeted by one of the four CRISPR arrays. *

      "differential transcript accessibility model" could not be functional if mRNA is reduced beyond the accessible pool (i.e., less than the threshold, all the mRNAs are translated without buffering). The authors should carefully reconsider this model and the effective range of mRNAs.

      We agree with the reviewer that according to the 'differential transcript accessibility model,' transcripts with abundances below a certain threshold should be completely accessible to the translational pool. Further, this could also be true for the other model, wherein initiation rate cannot increase beyond a particular threshold for transcripts of very low abundance. However, our observation from our haploinsufficiency analysis (Figure 4 B& C) and siRNA knockdown analysis from RiboBase (Figure 4 D) suggests that buffering might be possible within a given range of transcript abundance. Testing the buffering range by serial knockdowns might help in determining the threshold at which transcripts exhibit buffering. However, due to the challenges of serial knockdown as discussed above, makes this analysis difficult with Ribosome profiling and matched RNA-seq approach. An alternative approach could involve imaging translating and non-translating mRNA of buffered genes in different cells, which may help distinguish the two models. However, this falls outside the scope of the manuscript.

      Minor comments:

      1. Some figures are of poor quality as they seem to have points outside of the panel representations... Like Figure 3C, one point is out of the square, same for Figure 4E. Similarly, on figure 5F, some outliers seem to be clearly cut from the figure (maybe not, but then the author should put a larger space between the end of the figure and the max y points). Same for panel S2D and S6D, this does not sound so rigorous.

      We agree and apologize for this issue. The axes of the figures have been annotated appropriately to indicate the presence of outliers in the figures.

      1. There are several typos or weird sentences. Here are some (but maybe not all): 2-1: [...]with lower sums corresponding to higher final ranks. "two rankings". Based on these final ranks[...] 2-2: For each dataset, median absolute deviation (MAD) "i" protein abundance was calculated across samples 2-3: [...]neighbor method implemented in the MatchIT package (38) Differences in protein[...] a point is missing here. 2-4: Additionally a second dataset providing predictions of haploinsufficiency (pHaplo score) and triplosensitivity (pTriplo score) for all autosomal genes (25) was used to asses the distribution of these score"S" across buffered and non-buffered gene sets . There is a missing "s" at "score" and there is a space between the last word and the final point.

      The necessary corrections have been incorporated in the revised version of the manuscript.

      1. In the "Lymphoblastoid cell line data analysis:" section, this reviewer wonders why the authors used a different method to calculate buffering compared to before.

      The main reason is the limited sample of the lymphoblastoid cell line data. In our larger analyses, we could use median absolute deviation as a robust metric of dispersion across heterogeneous samples. However, given the smaller dataset in that study we decided CV would be a better indicator of dispersion. To evaluate the potential for translational buffering of genes from RiboBase, we used two metrics. The first was the negative correlation between translation efficiency and RNA abundance across samples. The second metric relied on the ratio of variation in ribosome occupancy to variation in RNA levels. Given the limited sample size of the lymphoblastoid cell line dataset, we used the coefficient of variation (CV) instead of the median absolute deviation (MAD), as the data in this study were normalized using counts per million (CPM) rather than the centered log-ratio (clr) normalization used in RiboBase. This CV ratio allowed us to assess the effect of natural variation in RNA abundance on ribosome occupancy.

      1. "Samples which had R2 less than 0.2 were removed as the residuals calculated for these samples could be unreliable". These samples for which the correspondence between RNA-Seq and Ribo-Seq is low wouldn't be the ones most impacted by translational buffering? Is it sure that the authors are not missing something here?

      We agree with the reviewer that genes that show translational buffering may not conform to linear relationships between the two parameters. However, the proportion of genes exhibiting this buffering effect is not expected to significantly influence the overall regression fit. Instead, we hypothesized that low quality samples or truly different relationships between the two parameters can make this relationship nonlinear, rendering it unsuitable for linear regression analysis for calculation of TE.

      To address these possibilities, we first analysed a commonly used proxy for data quality. Given the characteristic movement of ribosomes across mRNAs, periodicity of sequencing reads is a useful metric to assess whether reads are randomly fragmented, as in RNA-seq, or specifically represent ribosome-protected footprints. For this, we compared two groups: samples that were removed (~30) and those retained for analysis. We plotted the distribution of periodicity scores for all samples in both groups. For the calculation of periodicity scores, first the percentage of reads mapped to the dominant frame position across the dynamic ribosome footprint read length range was calculated for each sample. The periodicity score was calculated by taking the weighted sum of these dominant percentages, with weights based on the total read counts at each length.

      The results indicate that the removed samples did not have lower periodicity scores, suggesting that their quality in terms of periodicity was comparable to the retained samples.

              To assess the second possibility, we checked if the study involved major perturbations, which may skew the relationship towards non linearity. The 30 samples that were removed came from 14 unique studies, 18 of which involved perturbation which possibly affected either of the two parameters. In addition to the genetic/pharmacological perturbations specific to the study, the overall conditions of the cells during an experiment could influence this relationship. Another point to note is that many of the filtered-out samples are HeLa and HEK293T cells, which show a normal relationship between ribosome occupancy and RNA abundance for the majority of cases.
      
              These considerations suggest that removing these samples is most appropriate, as their inclusion could bias the TE calculations.
      

      For Figure 4B and 4C, the authors should provide statistical tests and p-values to confirm the observed trends.

      The haploinsufficiency and triplosensitivity analyses are now supported by a chi-squared test. The details of the statistical test are now mentioned in the text and the p-values have been noted on the respective figures.

      In Figure 2A, the "all genes" color doesn't correspond to the point color.

      The color in the figure has been modified in the revised version of the manuscript.

      1. "To understand if codon usage patterns are[...]". This comes slightly out of the blue. The authors could maybe explain why codon usage should be explored for translational buffering. The authors should cite recent key works in the fields: DOI: 10.1016/j.celrep.2023.113413 DOI: 10.1101/2023.11.27.568910

      We would like to thank the reviewer for their suggestion. The references have been incorporated in the revised version of the manuscript. We have now explained why codon usage could be a contributor in determining the translational buffering potential (line 190).

      "The change in each metric was calculated by subtracting the mean value in the control samples from that in the knockdown samples. This yielded the differential mRNA abundance and ribosome occupancy resulting from gene knockdown.". This looks statistically weak. The authors should consider using more robust methods like DESeq.

      We thank the reviewer for the suggestion. We reanalyzed the selected studies using edgeR and the modified figure is included in the revised version of the manuscript (Figure 4D). The conclusion after this analysis remains essentially the same. In particular, translational buffering is ineffective when mRNA abundance is perturbed drastically. Additionally, the limited number of experiments with direct perturbation of buffered genes limit the generalizability of this observation. This limitation is included in the result section (line 342).

      Legend: Scatter plot represents log2 fold change in RNA abundance and ribosome occupancy. Each point represents a gene and the fold change in its RNA and ribosome occupancy with respect to their controls. The line represents the line of equivalence. Buffered genes do not show less change in ribosome occupancy upon reduction in their RNA levels than other genes.

      1. "Genes in the buffered gene set had a higher codon adaptation index than the non-buffered set, indicating that candidates in the buffered gene set are relatively well expressed due to the presence of a higher proportion of the codons observed in highly expressed genes". What do the authors mean by "relatively well expressed"? Abundantly expressed? This sentence and the causality under it is unclear and should be modified or better explained.

      We thank the reviewer for pointing out the lack of clarity in the sentence. We have now quantitatively measured the CAI in the three categories and modified the sentence to better explain the rationale in the revised version (line 183). “To understand if codon usage patterns are associated with translational buffering, we next analyzed codon properties across buffered and non-buffered human gene sets. The codon adaptation index quantifies how closely a gene’s codon usage aligns with that of highly expressed genes. Genes in the buffered gene set had a higher codon adaptation index than the non-buffered set. Specifically, 28.4% of TB high genes, 14% of TB low genes and 9.3% of genes in the other category fall within the top decile (>90th percentile) of codon adaptation index.”

      The panel 4D is unclear. Is one point associated with one gene? Or is it the average of several genes? If it's one point for one gene, it is important to clearly state it because the number of cases is therefore quite low, especially for the TB high and low.

      Each point and line are associated with a single gene. This is now clarified in the legend of the figure (line 364). The number of genes in this analysis is limited to the available ribosome profiling data with gene knockdown experiments.

      1. In Figure 2J, GGU (Gly), AAG (Lys), and ACU (Arg) provide negative effects on prediction, although these were enriched in the high TB set (Figure 2E). This contradiction should be explained.

      While this appears to be a seeming contradiction, it is in line with what we expected. In particular, the objective of Figure 2J is to illustrate the features that predict the mRNA–TE correlation of genes, as identified using a LGBM model. The Spearman correlation shown reflects the relationship between each feature and the mRNA–TE correlation values. A negative correlation for codons such as GGU (Gly), AAG (Lys), and ACU (Thr) suggests that enrichment of these codons is associated with lower mRNA–TE correlation. This is in agreement with our observation in Figure 2E which suggests that high TB genes are enriched in these codons. In contrast, transcript size exhibits a positive correlation, indicating that shorter transcripts tend to have lower mRNA–TE correlation values.

      Given that the choice of colors is a potential source of confusion, we have revised the text (line 230) and the figure (& legend) to try to clarify this relationship.

      The subtitle of "Translationally buffered genes exhibit variable association kinetics with the translational machinery in response to mRNA variation" sounds unfair to this reviewer. Since the authors did not work on kinetics directly, the use of this word is misleading.

      We agree and revised the subtitle to “The association of translationally buffered genes with the translational machinery varies in response to changes in mRNA abundance"

      1. The explanation of Figure 5A "We next explored the potential mechanisms that may give rise to translational buffering. Specifically, we considered two non-mutually exclusive models by which mRNA abundance might be decoupled from ribosome occupancy. In the first, the "differential transcript accessibility model", mRNA abundance determines the fraction of transcripts that are accessible to the translational pool. In this scenario, an increase in mRNA abundance would be accompanied by a proportionally smaller increase in the fraction of transcripts entering the translating pool for buffered genes, compared to non-buffered genes. In the second, the "initiation rate model", the rate of translation initiation per transcript scales inversely with mRNA abundance. Under this model, the proportion of mRNA entering the translational pool would be comparable across buffered and non-buffered genes (Fig 5A)." is hard to understand. The authors should rewrite for a better understanding of the readers.

      This section has been rewritten in the revised version of the manuscript. The text now reads as

      “We next explored the potential mechanisms that may give rise to translational buffering. Specifically, we considered two non-mutually exclusive models by which mRNA abundance might be decoupled from ribosome occupancy. In the first, the “differential transcript accessibility model”, mRNA abundance determines the fraction of transcripts that are accessible to the translational pool. In this scenario, an increase in mRNA abundance would be accompanied by a proportionally smaller increase in the fraction of transcripts entering the translating pool for buffered genes, compared to non-buffered genes. In the second, the “initiation rate model”, the rate of translation initiation per transcript scales inversely with mRNA abundance. Under this model, as mRNA abundance increases, translation initiation on each transcript is reduced, thereby lowering the number of ribosomes per transcript. However, this mechanism allows a proportional increase in transcripts entering the translational pool for buffered genes, similar to non-buffered genes”

      Significance

      Thanks to the development of Ribo-Seq, translational buffering has been reported in various works. However, the systematic investigation has remained challenging. Employing the database of published Ribo-Seq and matched RNA-Seq, Rao et al. attempt to understand the mechanism underlying translational buffering of mRNA variation across diverse materials. A group of mRNAs whose expression variance is buffered at the translation level was comprehensively surveyed in humans and mice. The authors found a series of features in the translationally buffered genes, including high GC contents in the 5′ UTR, optimal codon usage, and mRNA length. The depletion or increase of one allele of the genes in the group may be particularly detrimental to cells. The authors' report provides a step forward in our understanding of translational buffering, appealing to the broad scientific community in basic and applied biology. However, this reviewer found a series of concerns in this paper, including clarity in the methods, experimental validation, referring the earlier works, etc. These points could be tackled to improve the reliability of their findings, the strength of their main message, and the global understandability of the paper.

      We thank the reviewer for noting the significance of the work and for their constructive feedback.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Rao and colleagues present a comprehensive analysis of translational buffering in human and mouse by mining 1515 matched ribosome profiling and RNAseq datasets from diverse tissues and cell lines. They define translational buffering as genes whose TE is negatively correlated with mRNA abundance across conditions, and further identify candidates by comparing median absolute deviations of ribosome occupancy versus mRNA levels. The authors find a conserved set of buffered genes enriched for components of multiprotein complexes, demonstrate that buffered genes exhibit lower protein variability and greater dosage sensitivity, and propose two non-mutually exclusive mechanistic models (differential accessibility and initiation rate modulation). Finally, they perform complementary fractionation experiments in HEK293T cells to support these models.

      These findings propose a novel, conserved mechanism of translational buffering that tunes gene expression in mouse and human, showing how intrinsic sequence features and cellular context cooperate to stabilize protein output across diverse conditions. However, further evidence is required to fully support the authors conclusions, particularly direct validation of the proposed models of buffering.

      We thank the reviewer for their positive assessment and thoughtful suggestions that we address below.

      Below are my main concerns:

      1. The choice of the top 250 genes by spearman correlation and MAD ratio as "TB high" seems arbitrary. The authors should justify these cut offs (via permutation analysis or FDR control) and show that conclusions are robust to different thresholds.

      We agree that the threshold used to define TB high and TB low is somewhat subjective, and we now clearly acknowledge this in the discussion section (line 485). We now provide an R script that reproduces all analyses of translational buffering, where changing this cutoff to higher or lower values is straightforward.

      To ensure the robustness of our conclusions, we evaluated several thresholds for defining TB high and TB low. We observed that the conclusions hold within a reasonable range of values (100-250). For example, the effects on protein variation and the association of intrinsic features such as UTR lengths with buffering potential remain consistent when TB high is defined as the top 100 or the top 200 genes, compared with the current cutoff of 250. In contrast, when we define TB high as the top 2000 and TB low as ranks 2000–4000, the difference between the various features is diminished (Figure A& B). Further, protein variation (human cancer cell line and tissue) also becomes more similar across the three categories, possibly indicating a reduced regulatory potential of genes as their rank increases (Figure C& D). Our results show that highly ranked genes consistently associate with specific features, suggesting an underlying hierarchy in translational buffering potential.

      Legend: Effect of different thresholds on . A. Length features B. Median RNA expression C. Protein variation in human cancer cell line and D. on Primary human tissues

      The modified compositional regression approach for TE and imputation of missing values are central to the study, but details are relegated to supplemental methods. The manuscript would benefit from a clear comparison of this method against standard log-ratio TE estimates, including sensitivity analyses to missing-data imputation strategies

      We thank the reviewer for the feedback. We have now added further description of the modified compositional regression and the imputation strategy in the results section (line 94). Comparison to standard log-ratio TE estimates and their limitations has already been detailed in Liu et al. 2025, Nature Biotechnology. Therefore, in the current manuscript we specifically focus on the effect of the imputation strategy.

              Specifically, the modified imputation slightly improved concordance between the set of genes that are identified to be translationally buffered using the negative RNA-TE relationship or using RNA -Ribosome occupancy correlation (0.91 to 0.94). Further, we assessed the correlation between TE and protein abundance as measured by mass spectrometry from seven human cell lines (A549, HEK293, HeLa, HepG2, K562, MCF7 and U2OS). The protein measurements were obtained from PaxDb. The new imputation strategy slightly increased mean correlation between the TE and proteome abundance as compared to naive strategy. It specifically showed improved correlation for HepG2, A549 and HeLa cell lines. 3507 genes were used for this analysis that were common between PaxDb, Liu et al., 2005 and the current study.
      

      Legend: Proteomics vs TE correlation of cell types without or with imputation strategy. Spearman correlation between compositional TE calculated as calculated by Liu et al., 2025 from 68 samples from 11 studies (HEK293), 86 samples from 10 studies (HeLa), 58 samples from four studies (U2OS), 29 samples from five studies (A549), five samples from two studies (MCF7), seven samples from two studies (K562) and 10 samples from two studies (HepG2) or from the current study. 57 samples from 10 studies (HEK293), 82 samples from 9 studies (HeLa), 58 samples from four studies (U2OS), 29 samples from five studies (A549), 5 samples from two studies (MCF7), one samples from one studies (K562) and 9 samples from two studies (HepG2) . 3507 genes were used for this analysis that were common between Paxdb, Liu et al., 2005 and the current study.

      Human data are derived mainly from immortalized cell lines, whereas mouse data are from primary tissues. Pooling these heterogeneous sources may conflate cell type-specific regulation with intrinsic buffering. The authors should either stratify analyses by context or demonstrate buffering signatures remain consistent within more homogeneous subsets

      We thank the reviewer for the suggestion and agree that heterogeneity could potentially mask cell type-specific buffering effects. The TB-high genes we report are those that show consistent and robust expression across diverse contexts. However, unlike RNA-seq datasets, the current number of ribosome profiling samples per cell type is still limited, and a more comprehensive assessment of context-specific buffering will require larger datasets that will accumulate over time.

      Nonetheless, we have stratified the analysis by cellular context. Specifically, we grouped samples of the same cell-type and repeated the buffering analysis. We provide a new table listing TB-ranks of genes for the five cell types with the largest sample sizes as a table in github.

      https://github.com/CenikLab/Translational-buffering/blob/Translational-Buffering/combined_tables.xlsx

      As an additional control, we compared buffering patterns between related and unrelated cell lines. For example, the correlation of TB ranks between related cell lines HEK293T (n = 98) and HEK293 (n = 57) is higher (0.46) than between either and an unrelated cell line, HeLa (n = 82). Similarly, the correlation between two liver cell lines, Huh7 (n = 39) and HepG2 (n = 9), is higher (0.20) than between Huh7 and a similarly sampled but unrelated lymphoblastoid cell line (LCL, n = 9; correlation = 0.05). While these analyses suggest that cell type-specific patterns may exist, their exploration is currently limited by sample size, as detecting buffering requires substantial variability in mRNA expression. We now highlight this as a limitation in the Discussion section (line 573).

      *Legend: Spearman correlation between TB ranks of different pairs of cell lines. The first set indicates comparison with HEK293T. The second set indicates comparison between liver cells (HepG2 and Huh 7). *

      The HEK293T fractionation experiments offer preliminary support for both the "accessibility" and "initiation" models, but only slope analyses are shown. To validate these models, the authors should perform targeted reporter assays (dual luciferase constructs with 5′UTR swaps) or manipulations of initiation factors (eIF4E knockdown) to directly test how transcript abundance alters initiation rates versus pool entry

      We thank the reviewer for suggesting experiments to validate the proposed models. In the luciferase reporter experiments, constructs bearing the endogenous UTRs from non-buffered genes would be expected to result in expression that is proportional to transcript abundance. In contrast, swapping a 5’ UTR from buffered genes would mitigate this effect of translation buffering via “initiation rate model” depending on the 5 UTR sequence of transcript. However, as outlined below, this experiment has important caveats:

      1. Role of coding sequence: Such assays primarily test the contribution of the 5′UTR and do not address potential cooperative effects between the 5′UTR and the coding sequence (CDS). Thus, if 5′UTRs fails to recapitulate translational buffering, it would be unclear whether the buffering requires coordinated action of the 5′UTR and CDS or whether the gene in question simply does not conform to the initiation-rate model.
      2. Sensitivity of measurements: Reporter-based measurements often rely on RT-qPCR to quantify expression changes. While suitable for large fold-changes, small shifts may fall within the assay’s technical margin of error, limiting the interpretability of the results. iii. Gene-to-gene variability: Buffered and non-buffered transcripts likely span a wide range of intrinsic initiation rates. Selecting only a few “representative” transcripts for 5′UTR swapping could yield results that are not broadly generalizable.

      Similarly, knockdown of general initiation factors will likely impact on both buffered and non-buffered genes, which could limit the ability to distinguish the effect of transcript abundance on translational buffering via either of the proposed models. We envision an alternative future approach that would involve single molecule imaging translating and non-translating mRNAs of buffered and non-buffered genes under varying abundance conditions in a physiological context. Such experiments are likely the most suitable for disentangling the contributions of accessibility versus initiation. While we find this an exciting direction for future work, it lies beyond the scope of the present manuscript.

      The conclusion that buffering reduces protein variability relies on mass-spec comparisons, but ribosome occupancy does not always reflect functional protein output (due to elongation stalling or co-translational degradation). Incorporating orthogonal measures, such as pulse-labeling or western blots for key buffered versus non-buffered genes, would strengthen the link between buffering and proteome stability

      We agree with the reviewer’s concern and have been acknowledged as a limitation in the discussion section. To address this with orthogonal approaches, we carried out several additional experiments. Specifically, we identified a study from RiboBase (GSE132703) that exhibited significant variation in FUS transcript (a translationally buffered gene) abundance across conditions—namely HEK293T wild type, LARP1A single knockout (SKO), and LARP1A/B double knockout (DKO) using their RNA-seq data. We reached out to the authors of the study and obtained these knockout cell lines. We reanalyzed RNA abundance under the different conditions by RT-qPCR and assessed protein levels by Western blot. Despite observing differences in RNA abundance, FUS protein levels did not exhibit corresponding change at the protein level.

      We also selected a non-buffered gene; DNAJC6, that also showed RNA-level differences. However, the change in RNA expression was not consistent at the protein level. Some caveats of Western blot is its limited sensitivity which may prevent detection of subtle changes and that the measurements are steady-state protein levels which cannot resolve whether differences arise from altered synthesis or degradation.

      *Legend : Validation of buffering gene by western blot: A. Plot showing the RNA abundance and ribosome occupancy of buffered gene ; FUS and non buffered genes; DNAJC6 with variation in HEK293T-wild type, LARP1A single knockout and LARP1A/B double knockout. B. Validation of the RNA seq data by qPCR. C. Western Blot showing the FUS, DNAJC6 and Actin in wild type and different mutants. D. Bar plot showing the quantification of western blot. *

              In addition to this targeted analysis , we performed quantitative mass spectrometry to evaluate the effect of mRNA variation at the protein level at global scale.
      

      LC MS/MS analysis was performed on the above samples in triplicates at the Proteomics facility of the University of Texas. A total of 4,048 proteins were identified using a peptide confidence threshold of 95% and a protein confidence threshold of 99%, with a minimum of two peptides required for identification. Total precursor intensities for all peptides of a protein was summed and was used for protein quantification using DEP (Differential Enrichment of Proteomics Analysis) Package, in Bioconductor, R (https://rdrr.io/bioc/DEP/man/DEP.html). DEP was used for variance normalization and statistical testing of differentially expressed proteins. As expected LARP1 protein was identified in the control cells but not in the single or double knockouts.

      We then plotted the fold change in RNA as determined by edgeR analysis of RNA-seq from (Philippe et al. 2020) and the fold change in protein abundance from our mass spectrometry data. We observed that genes in the TB high group show reduced changes at the protein level compared to TB low or others as determined by the linear regression analysis in both single and double LARP1 KO mutants. This finding is consistent with our findings that buffered genes show lower variation in the protein abundance in response to change in mRNA expression.

      Legend: Scatter plot showing the log2fold change in the RNA and protein levels as determined by RNA seq from (Philippe et al. 2020) or mass spectroscopy. Differential analysis of RNA was done using the edgeR package and the DEP (Differential Enrichment of Proteomics Analysis) Package *was used for mass spectrometry analysis. Only genes with an FDR We have not included this data in the manuscript given the deviation of the approach from our original analysis, but we are happy to reconsider the inclusion of this data to supplement our proteomic analysis.

      While the LGBM modeling shows modest predictive power of sequence features alone, the manuscript stops short of exploring what cellular factors might drive context dependence. Integrating public datasets on RNA-binding protein expression or mTOR pathway activity across samples could illuminate trans-acting determinants of buffering and move beyond correlative sequence analyses,

      We thank the reviewer for this suggestion. To investigate potential trans-acting determinants of buffering, we focused on 1,394 human RBPs as classified by Hentze et al. (2018), reasoning that some of these factors may facilitate translational buffering. Specifically, we examined correlations between the RNA expression of each RBP and the TE of all other genes across samples. p-values were corrected using the Bonferroni procedure. For each RBP, we then performed a Fisher’s exact test to assess whether the number of significant correlations was enriched among buffered versus non-buffered genes.

      This analysis revealed that the expression levels of many RBPs are significantly enriched for either positive or negative correlations with the TE of buffered genes. In particular, we note that RNA expression of many buffered RBPs is enriched for negative correlations with the TE of other buffered transcripts. These results suggest that, rather than considering translational buffering in isolation for each transcript, buffering effects may be coordinated at the translational level and influenced by shared trans-acting factors such as RBPs. Network-based approaches have been valuable for RNA co-expression and are only now being applied to TE covariation. However, the correlative nature of these analyses limits causal inference. For example, although many ribosomal proteins appear to influence the buffering of other ribosomal proteins, they themselves may be regulated by a non-ribosomal RBP—so the apparent effects could reflect upstream regulatory influences. This analysis is now included as a supplementary figure (Sup. Fig. 5) of the revised manuscript.

      Legend: A scatter plot of odds ratio log of number of significant correlations (RNA abundance of RBPs ::TE of genes) and the p value from fisher test. The vertical dashed line represents the threshold odds ratio, above which RBPs exhibit a higher number of significant correlations with buffered genes. P values were corrected using Bonferroni procedure* and the horizontal dashed line represents the adjusted p value cutoff. *

      Reviewer #2 (Significance (Required)):

      Overall, this manuscript leverages an unprecedented compendium of matched ribosome profiling and RNAseq datasets across human cell lines and mouse tissues, combined with improved TE estimation, to robustly catalog genes exhibiting translational buffering, a clear methodological and conceptual strength. The main limitations stem from heterogeneous sample sources, largely correlative analyses, and a lack of targeted mechanistic validation. Compared to prior yeast focused studies, it fills a key gap by demonstrating conservation of buffering in mammals and linking it to dosage sensitivity and protein stability, representing a conceptual advance in understanding post-transcriptional homeostasis and a methodological step forward in TE analysis. This work will interest researchers in RNA biology, gene expression regulation, systems biology, and cancer proteomics, as well as those studying dosage-sensitive pathways and translational control. My expertise is on translational control in cancer.

      We thank the reviewer for noting the broader significance of the work and for their constructive feedback.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Rao and colleagues present a comprehensive analysis of translational buffering in human and mouse by mining 1515 matched ribosome profiling and RNAseq datasets from diverse tissues and cell lines. They define translational buffering as genes whose TE is negatively correlated with mRNA abundance across conditions, and further identify candidates by comparing median absolute deviations of ribosome occupancy versus mRNA levels. The authors find a conserved set of buffered genes enriched for components of multiprotein complexes, demonstrate that buffered genes exhibit lower protein variability and greater dosage sensitivity, and propose two non-mutually exclusive mechanistic models (differential accessibility and initiation rate modulation). Finally, they perform complementary fractionation experiments in HEK293T cells to support these models.

      These findings propose a novel, conserved mechanism of translational buffering that tunes gene expression in mouse and human, showing how intrinsic sequence features and cellular context cooperate to stabilize protein output across diverse conditions. However, further evidence is required to fully support the authors conclusions, particularly direct validation of the proposed models of buffering. Below are my main concerns:

      1. The choice of the top 250 genes by spearman correlation and MAD ratio as "TB high" seems arbitrary. The authors should justify these cut offs (via permutation analysis or FDR control) and show that conclusions are robust to different thresholds
      2. The modified compositional regression approach for TE and imputation of missing values are central to the study, but details are relegated to supplemental methods. The manuscript would benefit from a clear comparison of this method against standard log-ratio TE estimates, including sensitivity analyses to missing-data imputation strategies
      3. Human data are derived mainly from immortalized cell lines, whereas mouse data are from primary tissues. Pooling these heterogeneous sources may conflate cell type-specific regulation with intrinsic buffering. The authors should either stratify analyses by context or demonstrate buffering signatures remain consistent within more homogeneous subsets
      4. The HEK293T fractionation experiments offer preliminary support for both the "accessibility" and "initiation" models, but only slope analyses are shown. To validate these models, the authors should perform targeted reporter assays (dual luciferase constructs with 5′UTR swaps) or manipulations of initiation factors (eIF4E knockdown) to directly test how transcript abundance alters initiation rates versus pool entry
      5. The conclusion that buffering reduces protein variability relies on mass-spec comparisons, but ribosome occupancy does not always reflect functional protein output (due to elongation stalling or co-translational degradation). Incorporating orthogonal measures, such as pulse-labeling or western blots for key buffered versus non-buffered genes, would strengthen the link between buffering and proteome stability
      6. While the LGBM modeling shows modest predictive power of sequence features alone, the manuscript stops short of exploring what cellular factors might drive context dependence. Integrating public datasets on RNA-binding protein expression or mTOR pathway activity across samples could illuminate trans-acting determinants of buffering and move beyond correlative sequence analyses

      Significance

      Overall, this manuscript leverages an unprecedented compendium of matched ribosome profiling and RNAseq datasets across human cell lines and mouse tissues, combined with improved TE estimation, to robustly catalog genes exhibiting translational buffering, a clear methodological and conceptual strength. The main limitations stem from heterogeneous sample sources, largely correlative analyses, and a lack of targeted mechanistic validation. Compared to prior yeast focused studies, it fills a key gap by demonstrating conservation of buffering in mammals and linking it to dosage sensitivity and protein stability, representing a conceptual advance in understanding post-transcriptional homeostasis and a methodological step forward in TE analysis. This work will interest researchers in RNA biology, gene expression regulation, systems biology, and cancer proteomics, as well as those studying dosage-sensitive pathways and translational control. My expertise is on translational control in cancer.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Thanks to the development of Ribo-Seq, translational buffering has been reported in various works. However, the systematic investigation has remained challenging. Employing the database of published Ribo-Seq and matched RNA-Seq, Rao et al. attempt to understand the mechanism underlying translational buffering of mRNA variation across diverse materials. Although the authors' report provides a step forward in our understanding of translational buffering, this reviewer found a series of concerns in this paper. These points could be tackled to improve the reliability of their findings, the strength of their main message, and the global understandability of the paper.

      Major comments:

      1. This paper heavily relies on the reference 18. However, this paper was not properly stated (no page or journal number); the study in Bioinformatics is nowhere to be found on the website, despite being out in 2024 apparently. Either title is wrong (yet a biorxiv can be found). This reviewer guessed that the reference 18 may be accepted. However, without a proper reference, this paper could not be judged since nearly all the parts of this work have been based on the reference 18. Also, the Ribobase data used in this manuscript comes from this reference, so it had better be well defined, especially when another Ribobase data set seems to be available online: http://www.bioinf.uni-freiburg.de/~ribobase/index.html
      2. In the Discussion, the authors mentioned "TE is based on a compositional regression model (18) rather than the commonly applied approach of using a logarithmic ratio of ribosome occupancy to mRNA abundance." This important information should be mentioned early section of the manuscript. Related to this, there are other published methods for exploring change in translation efficiency (e.g., 10.1093/bioinformatics/btw585; 10.1093/nar/gkz223) that could also be suitable in this context. It is not entirely clear if their approach is better than before. Again, the improper reference to 18 made our assessment of this work difficult.
      3. The paper mainly relies on detecting a set of buffered genes using mRNA-TE correlation and MAD ratios (Ribo-Seq/RNA-Seq). While the concept seems sound, the authors should ensure that this method is reliable. Several controls could be used to confirm this. First, if any studies in humans or mice have described a set of genes as buffered, it would be worth checking for overlap between the authors' set of 'TB high' genes and the previously established list. Furthermore, the authors could use packages explicitly developed for translational buffering detection, such as annota2seq (https://academic.oup.com/nar/article/47/12/e70/5423604?login=true). Not all of the data used by the authors may be suitable for such packages, but the authors could at least partially use them on some of their datasets and see whether the buffered genes reported by these packages match their predictions.
      4. The threshold of 'TB high' or 'TB low' (top and bottom 250) is somewhat arbitrary. Why not top 100 or 500? The authors should provide a rationale for this choice. Also, they could include a numeric measure of buffering (the sum of the two rankings is probably suitable for this purpose). Several of the authors' explorations are suitable for numerical quantification (GO enrichment can be turned into GSEA, and the boxplot can be shown as correlations)
      5. Several of the statements of the authors in the Introduction or Discussion sections are not entirely true regarding the literature on the topics, or lack major papers on the topic, and therefore, they are a bit misleading. Among others, here are some:

      5-1 "In addition, genetic differences arising from aneuploidy, cell type differences or variability observed in the natural population can further determine the amplitude of variation (4-7). The effect of mRNA variation under these conditions is mostly reflected at the protein levels (2, 4-8).". Several recent or more ancient papers suggest that mRNA variation coming from aneuploidy, natural genetic variation, or CNV is buffered or not well reflected at the protein level:

      DOI: 10.1038/s41586-024-07442-9 DOI: 10.1073/pnas.2319211121 DOI: 10.1016/j.cels.2017.08.013 DOI: 10.15252/msb.20177548

      5-2: The authors should also consider mentioning these studies and softening their initial statement. "Similarly, translational buffering of certain genes have been reported in mammalian cells, specifically under estrogen receptor alpha (ERα) depletion conditions (16).". Translational buffering has been deeply explored in mammalian tissues and even across several mammalian species in this study (DOI: 10.1038/s41586-020-2899-z). In this, the authors also provide a nice exploration of the gene characteristics that are associated with translational buffering. The authors should mention it and compare the study's findings to theirs ultimately.

      5-3: "Differences in species evaluated and statistical methods have resulted in conflicting interpretations (13, 28).". These conflicting results have been previously discussed in reviews on the topic that would be worth mentioning: DOI: 10.1016/j.cell.2016.03.014 DOI: 10.1038/s41576-020-0258-4 6. In addition to the p-values stated in the main text, the authors should annotate their plots when they find significant differences between groups to greatly facilitate the visual interpretation of the graphs. 7. Based on the data of Figure 4D, apparently, ribosome occupancy was not buffered even in high TB sets. The authors may argue that translational buffering may not cope with such a strong mRNA reduction. In that case, how big a difference in mRNA level does the buffering system adjust in protein synthesis? The authors should test gradual gene knockdown and/or overexpression and conduct Ribo-Seq/RNA-Seq to survey the buffering range. 8. "differential transcript accessibility model" could not be functional if mRNA is reduced beyond the accessible pool (i.e., less than the threshold, all the mRNAs are translated without buffering). The authors should carefully reconsider this model and the effective range of mRNAs.

      Minor comments:

      1. Some figures are of poor quality as they seem to have points outside of the panel representations... Like Figure 3C, one point is out of the square, same for Figure 4E. Similarly, on figure 5F, some outliers seem to be clearly cut from the figure (maybe not, but then the author should put a larger space between the end of the figure and the max y points). Same for panel S2D and S6D, this does not sound so rigorous.
      2. There are several typos or weird sentences. Here are some (but maybe not all):

      2-1: [...]with lower sums corresponding to higher final ranks. "two rankings". Based on these final ranks[...]

      2-2: For each dataset, median absolute deviation (MAD) "i" protein abundance was calculated across samples

      2-3: [...]neighbor method implemented in the MatchIT package (38) Differences in protein[...] a point is missing here.

      2-4: Additionally a second dataset providing predictions of haploinsufficiency (pHaplo score) and triplosensitivity (pTriplo score) for all autosomal genes (25) was used to asses the distribution of these score"S" across buffered and non-buffered gene sets . There is a missing "s" at "score" and there is a space between the last word and the final point. 3. In the "Lymphoblastoid cell line data analysis:" section, this reviewer wonders why the authors used a different method to calculate buffering compared to before. 4. "Samples which had R2 less than 0.2 were removed as the residuals calculated for these samples could be unreliable". These samples for which the correspondence between RNA-Seq and Ribo-Seq is low wouldn't be the ones most impacted by translational buffering? Is it sure that the authors are not missing something here? 5. For Figure 4B and 4C, the authors should provide statistical tests and p-values to confirm the observed trends. 6. In Figure 2A, the "all genes" color doesn't correspond to the point color. 7. "To understand if codon usage patterns are[...]". This comes slightly out of the blue. The authors could maybe explain why codon usage should be explored for translational buffering. The authors should cite recent key works in the fields: DOI: 10.1016/j.celrep.2023.113413 DOI: 10.1101/2023.11.27.568910 8. "The change in each metric was calculated by subtracting the mean value in the control samples from that in the knockdown samples. This yielded the differential mRNA abundance and ribosome occupancy resulting from gene knockdown.". This looks statistically weak. The authors should consider using more robust methods like DESeq. 9. "Genes in the buffered gene set had a higher codon adaptation index than the non-buffered set, indicating that candidates in the buffered gene set are relatively well expressed due to the presence of a higher proportion of the codons observed in highly expressed genes". What do the authors mean by "relatively well expressed"? Abundantly expressed? This sentence and the causality under it is unclear and should be modified or better explained. 10. The panel 4D is unclear. Is one point associated with one gene? Or is it the average of several genes? If it's one point for one gene, it is important to clearly state it because the number of cases is therefore quite low, especially for the TB high and low. 11. In Figure 2J, GGU (Gly), AAG (Lys), and ACU (Arg) provide negative effects on prediction, although these were enriched in the high TB set (Figure 2E). This contradiction should be explained. 12. The subtitle of "Translationally buffered genes exhibit variable association kinetics with the translational machinery in response to mRNA variation" sounds unfair to this reviewer. Since the authors did not work on kinetics directly, the use of this word is misleading. 13. The explanation of Figure 5A "We next explored the potential mechanisms that may give rise to translational buffering. Specifically, we considered two non-mutually exclusive models by which mRNA abundance might be decoupled from ribosome occupancy. In the first, the "differential transcript accessibility model", mRNA abundance determines the fraction of transcripts that are accessible to the translational pool. In this scenario, an increase in mRNA abundance would be accompanied by a proportionally smaller increase in the fraction of transcripts entering the translating pool for buffered genes, compared to non-buffered genes. In the second, the "initiation rate model", the rate of translation initiation per transcript scales inversely with mRNA abundance. Under this model, the proportion of mRNA entering the translational pool would be comparable across buffered and non-buffered genes (Fig 5A)." is hard to understand. The authors should rewrite for a better understanding of the readers.

      Significance

      Thanks to the development of Ribo-Seq, translational buffering has been reported in various works. However, the systematic investigation has remained challenging. Employing the database of published Ribo-Seq and matched RNA-Seq, Rao et al. attempt to understand the mechanism underlying translational buffering of mRNA variation across diverse materials. A group of mRNAs whose expression variance is buffered at the translation level was comprehensively surveyed in humans and mice. The authors found a series of features in the translationally buffered genes, including high GC contents in the 5′ UTR, optimal codon usage, and mRNA length. The depletion or increase of one allele of the genes in the group may be particularly detrimental to cells. The authors' report provides a step forward in our understanding of translational buffering, appealing to the broad scientific community in basic and applied biology. However, this reviewer found a series of concerns in this paper, including clarity in the methods, experimental validation, referring the earlier works, etc. These points could be tackled to improve the reliability of their findings, the strength of their main message, and the global understandability of the paper.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      SECTION A - Evidence, Reproducibility, and Clarity Summary The study investigates the neurodevelopmental impact of trisomy 21 on human cortical excitatory neurons derived from induced pluripotent stem cells (hiPSCs). Key findings include a modest reduction in spontaneous firing, a marked deficit in synchronized bursting, decreased neuronal connectivity, and altered ion channel expression-particularly a downregulation of voltage‐gated potassium channels and HCN1. These conclusions are supported by a combination of in vitro calcium imaging, electrophysiological recordings, viral monosynaptic tracing, RNA sequencing, and in vivo transplantation with two‐photon imaging.

      Major Comments • Convincing Nature of Key Conclusions: The study's conclusions are generally well supported by a diverse set of experimental approaches. However, certain claims regarding the intrinsic properties of the excitatory network would benefit from further qualification. In particular, the assertion that reduced synchronization is solely attributable to altered ion channel expression might be considered somewhat preliminary without additional corroborative experiments.

      1.1) We agree with the reviewer and now write in the abstract: 'Together, these findings demonstrate long-lasting impairments in human cortical excitatory neuron network function associated with Trisomy 21 .' And in the Introduction: 'Collectively, the observed changes in ion channel expression, neuronal connectivity, and network activity synchronization may contribute to functional differences relevant to the cognitive and intellectual features associated with Down syndrome.'

      • One major limitation of the current experimental design is the reliance on predominantly excitatory neuronal cultures derived from hiPSCs. Although the authors convincingly demonstrate differences in network synchronization and connectivity between trisomic (TS21) and control neurons, the almost exclusive focus on excitatory cells limits the physiological relevance of the in vitro network. In the developing cortex, interneurons and astrocytes play crucial roles in modulating network excitability, synaptogenesis, and plasticity. Therefore, incorporating these cell types-either through co-culture systems or through directed differentiation protocols that yield a more heterogeneous neuronal population-could help to determine whether the observed deficits are intrinsic to excitatory neurons or are compounded by a lack of proper inhibitory regulation and glial support. 1.2) Thank you for this thoughtful comment. We agree that interneurons and astrocytes are crucial for network function. To clarify, astrocytes are generated in this culture system, as we previously reported in our characterisation of the timecourse of network development using this approach (Kirwan et al., Development 2025). However, our primary goal was to first isolate and define the cell-autonomous defects intrinsic to TS21 excitatory neurons, minimizing the complexity introduced by additional neuronal types. This focused approach was chosen also because engineering a stable co-culture system with reproducible excitatory/inhibitory (E/I) proportions is a significant undertaking that extends beyond the scope of this initial investigation, and has proven challenging to date for the field. By establishing this foundational phenotype, our work complements prior studies on interneuron and glial contributions. Future studies building on this work will be essential to dissect the more complex, non-cell-autonomous effects within a heterogeneous network. Importantly, since our initial submission, two highly relevant preprints have emerged-including a notable study from the Geschwind laboratory at UCLA (Vuong et al., bioRxiv, 2025; Risgaard et al., bioRxiv, 2025), as well as our own complementary study Lattke et al, under revision, that highlight widespread transcriptional changes in excitatory cells of the human fetal DS cortex, providing strong validation for our central findings. This convergence of results from multiple groups underscores the timeliness and importance of our work.

      • Furthermore, the assessment of neuronal connectivity via pseudotyped rabies virus tracing, while innovative, has inherent limitations. The quantification of connectivity as a ratio of red-to-green fluorescence pixels may be influenced by differential viral infection efficiencies, variations in the expression levels of the TVA receptor, or even by the lower basal activity levels observed in TS21 cultures. Complementary approaches-such as electron microscopy for synaptic density analysis or functional connectivity measurements using multi-electrode arrays (MEAs)-could provide additional structural and functional insights that would validate the rabies tracing data. 1.3) Thank you for this constructive feedback. While we cannot formally exclude that TS21 cells might express the TVA receptor at lower levels due to generalized gene dysregulation, we infected all WT and TS21 cultures in parallel using identical virus preparations and titers to minimize technical variability. Crucially, we also addressed the potential confound of differential basal activity by performing the rabies tracing under TTX incubation (see Suppl. Fig. 7), which blocks network activity and ensures that viral spread reflects structural connectivity alone.

      While complementary methods like EM or MEA could provide additional insight, they fall outside the scope of the current study. We are confident that our rigorous controls validate our use of the rabies tracing method to assess structural connectivity.

      • Qualification of Claims: Some conclusions, particularly those linking specific ion channel dysregulation (e.g., HCN1 loss) directly to network deficits, might be better presented as preliminary. The authors could temper their language to indicate that while the evidence is suggestive, the mechanistic link remains to be fully established. 1.4) We have revised the text to more clearly indicate that the link between HCN1 dysregulation and network deficits is correlative and remains to be fully established. While our ex vivo recordings suggest altered Ih-like currents consistent with reduced HCN1 expression, we now present these findings as preliminary and hypothesis-generating, pending further functional validation. We write in the discussion: However, further targeted functional validation will be needed to confirm a causal link.

      • Need for Additional Experiments: Additional experiments that could further consolidate the current findings include: o Inclusion of Inhibitory Neurons or Co-culture Systems: Incorporating interneurons or astrocytes would help determine whether the observed deficits are solely intrinsic to excitatory neurons. See 1.2 o Alternative Connectivity Assessments: Complementing the rabies virus tracing with electron microscopy or multi-electrode array (MEA) recordings would add structural and functional validation of the connectivity differences. See 1.3 o Extended Temporal Profiling: Monitoring network activity over a longer developmental window would clarify whether the observed deficits represent a delay or a permanent alteration in network maturation. 1.5) In vivo we were able to track the cells for up to five months post-transplantation supporting the interpretation of a permanent alteration.

      • Reproducibility and Statistical Rigor: The methods and data presentation are largely clear, with adequate replication and appropriate statistical analyses. Nonetheless, a more detailed description of the experimental replicates, particularly regarding the viral tracing and in vivo transplantation studies, would enhance reproducibility. The availability of raw data and scripts for calcium imaging analysis would also further support independent verification. We thank the reviewer for these suggestions and we now provide a more detailed description of replicates. We also add the raw data.

      Minor Comments • Experimental Details: Minor revisions could include clarifying the infection efficiency and expression levels of the viral constructs used in connectivity assays to rule out technical variability.

      See 1.3

      • Literature Context: The authors reference prior studies appropriately; however, integrating a brief discussion comparing their findings with alternative DS models (e.g., organoids or other hiPSC-derived systems) would improve contextual clarity. We thank the reviewer for this helpful suggestion. We have now added a brief discussion comparing our findings with those reported in alternative Down syndrome models, including brain organoids and other hiPSC-derived systems. This addition helps to contextualize our results within the broader field and highlights the unique strengths and limitations of our in vitro and in vivo xenograft approach. We write: 'Our findings align with and extend previous studies using alternative Down syndrome models, such as brain organoids and other hiPSC-derived systems. Organoid models have provided valuable insights into early neurodevelopmental phenotypes in DS, including altered interneuron proportions (Xu et al Cell Stem Cell 2019) but also suggest that variability across isogenic lines can overshadow subtle trisomy 21 neurodevelopmental phenotypes (Czerminski et al Front in Neurosci 2023). However, these systems often lack the structural complexity, vascularization, and long-term maturation achievable in vivo. By using a xenotransplantation model, we were able to assess the maturation and functional properties of human neurons within a physiologically relevant environment over extended time frames, offering complementary insights into DS-associated circuit dysfunction (Huo et al Stem Cell Reports 2018; Real et al., 2018).

      • Presentation and Clarity: Figures are generally clear,.But the manuscript contains a minor labeling error. On page 13, the figure is erroneously labeled as "Fig6A", whereas, based on the context and corresponding data, it should be "Fig5A". I recommend that the authors correct this mistake to ensure consistency and avoid potential confusion for readers. Thank you for pointing this out. This has been corrected in the revised manuscript.

      Reviewer #1 (Significance (Required)):

      SECTION B - Significance • Nature and Significance of the Advance: The work offers a substantial conceptual advance by providing a mechanistic link between trisomy 21 and impaired neuronal network synchronization. Technically, the study integrates state-of-the-art imaging, electrophysiology, and transcriptomic profiling, thereby offering a multifaceted view of DS-related neural dysfunction. Clinically, the findings have the potential to inform future therapeutic strategies targeting network connectivity and ion channel function in Down syndrome.

      We thank the reviewer for this very supportive comment.

      • Context in the Existing Literature: The study builds on previous observations of altered network activity in DS patients and DS mouse models (e.g., altered EEG synchronization and reduced synaptic connectivity). It extends these findings to human-derived neuronal models, thus bridging a gap between clinical observations and molecular/cellular mechanisms. Relevant literature includes studies on DS neurodevelopment and the role of ion channels in synaptic maturation. • Target Audience: The reported findings will be of interest to researchers in neurodevelopmental disorders, Down syndrome, and ion channel physiology. Additionally, the study may attract the attention of those working on hiPSC-derived models of neurological diseases, as well as clinicians interested in the pathophysiology of DS. • Keywords and Field Contextualization: Keywords: Down syndrome, trisomy 21, neuronal connectivity, synchronized network activity, hiPSC-derived cortical neurons, ion channel dysregulation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary The manuscript by Peter et al., reports on the neuronal activity and connectivity of iPSC-derived human cortical neurons from Down syndrome (DS) that is caused by caused by trisomy of the human chromosome 21 (TS21). Major points: Although the manuscript is potentially interesting, the results appear somehow preliminary and need to be corroborated by control experiments and quantifications of effects to fully sustain the conclusions. (1) The authors have not assessed the percentage of WT and TS21 cells that acquire a neuronal or glia identity in their cultures. Indeed, the origin of alterations in network activity and connectivity observed in TS21 neurons could simply derive from reduced number of neurons arising from TS21 iPSC. Alternatively, the same alteration in network activity and connectivity could derive from a multitude of other factors including deficits in neuronal development, neurite extension, or intrinsic electrophysiological properties. In the current version of the manuscript, none of these has been investigated. 2.1) We thank the reviewer for this thoughtful comment. In response, we included an in vivo characterization of cell-type proportions at the same time points where we observed network activity defects using in vivo calcium imaging (see Supplementary Fig. 6).

      Previous work has identified several cellular and molecular phenotypes in human cells, postmortem tissue, and mouse models-including those mentioned by the reviewer. In this study, our focus was on investigating neural network activity, intrinsic electrophysiological properties both in vitro and in vivo, and preliminary bulk RNA sequencing. We have also independently measured cell proportions in the human fetal cortex and conducted a more extensive transcriptomic analysis of Ts21 versus control cells in a separate study (Lattke et al., under revision). We observed a reduction of RORB/FOXP1-expressing Layer 4 neurons in the human fetal cortex at midgestation, as well as increased GFAP+ cells, reduced progenitors and a non significant reduction of Cux2+ cells in late stage DS human cell transplants, along with a gene network dysregulation specifically affecting excitatory neurons (Lattke et al., under revision). Here, we provide complementary findings, demonstrating reduced excitatory neuron network connectivity in vitro and decreased neural network synchronised activity in both in vitro and in vivo models (see also 2.8). We agree with the reviewer that this could be for a number of reasons, both cell autonomous (channel expression and/or function) or non-autonomous (connectivity and/or network composition - as reflected in differences in proportions of SATB2+ neurons generated in TS21 cortical differentiations).

      (2) Electrophysiological properties of TS21 and WT neurons at day 53/54 in vitro indicate an extremely immature stage of development (i.e. RMP between -36 and -27 mV with most of the cells firing a single action potential after current injection) in the utilized culture conditions: This is far from ideal for in vitro neuronal-network studies. Finally, reduced activity of HCN1 channels should be confirmed by specific recordings isolating or blocking the related current.

      2.2) Thank you for this thoughtful comment. We have also conducted ex vivo electrophysiological recordings and found that the neurons exhibit relatively immature properties, consistent with the known slow developmental trajectory of human neuron cultures. In light of this and the absence of direct confirmatory evidence, we now refer to the observed reduction in HCN1 as preliminary.

      Main points highlighting the preliminary character of the study. 1) In Figure 1 immunofluorescence images of the neuronal differentiation markers (Tbr1, Ctip2 and Tuj1) are showed. However, no quantification of the percentage of cells expressing these markers for WT and TS21 neurons is reported. On the other hand, simple inspection of the representative images clearly seams to indicate a difference between the two genotypes, with TS21 cultures showing lower number of cells expressing neuronal markers. This quantification should be corroborated by a similar staining for an astrocyte marker (GFAP, but not S100b since is triplicated in DS). This is an extremely important point since it is obvious that any change in the percentage of neurons (or the neuron/astrocyte ratio) in the cultures will strongly affect the resulting network activity (shown in Figure 2) and the connectivity (showed in Figure 4). Possibly, the quantification should be done at the same time points of the calcium imaging experiments.

      2.3) See 2.1. We included an in vivo characterization of cell-type proportions at the same time points where we observed network activity defects using in vivo calcium imaging. (see Supplementary Fig. 6).

      2) In Figure 2 the authors show some calcium imaging traces of WT and TS21 cultures at different time points. However, they again do not show any quantification of neuronal activity. A power spectra analysis is shown in Supplementary Figure 2, but only for WT cultures, while in Supplementary Figure 3 a comparison between WT and Ts21 power spectra is done, but only at the 50 day time point, while difference in synchrony are assessed at 60 days. At minimum, the author should include in main Figure 2 the quantification of the mean calcium event rate and mean event amplitude at the different time points and the power spectra analysis for both WT and TS21 cultures at the same timepoints.

      2.4) We thank the reviewer for this comment. We now add the power spectra analysis in the main Figure 2 and quantification of the mean calcium burst rate and mean event amplitude in SuppFig. 4.

      Of note, the synchronized neuronal activity is present in WT cultures at day 60, but totally lost at subsequent time-points (70 and 80 days). The results of this later time points are different from previous data from the same lab (Kirwan et al., 2015). How might these data be explained? It would be important to rule out any potential issues with the health of the culture that could explain the loss of neuronal activity.It would be beneficial to check cell viability at the different time points to exclude possible confounding factors ? A propidium staining or a MTT assay would strongly improve the soundness of the calcium data.

      2.5) We thank the reviewer for this important observation. The difference from the findings reported in Kirwan et al., 2015 is due to the use of a different neuronal differentiation medium in the current study (BrainPhys versus N2B27). BrainPhys medium supports robust early network activity compared to N2B27 (onset before day 60 in BrainPhys, post-day 60 in N2B27), resulting in an earlier decline in synchrony at later stages (day 70-80 in BrainPhys, compared with day 90-100 in N2B27). Importantly, in our in vivo xenograft model, burst activity is sustained up to at least 5 months post-transplantation (mpt), indicating that the neurons retain the capacity for network activity over extended periods in a more physiological environment. We adapted the text accordingly.

      3) In Figure 3 there is no quantification of the number and/or density of transplanted neurons for WT and TS21, but only representative images. As above, inspection of the representative images seems to show a decrease in cells labeled by the Tbr1 neuronal marker for TS21 cells. Moreover, the in vivo calcium imaging of transplanted WT and TS21 cells lacks most of the quantification normally done in calcium imaging experiments. Are the event rate and event amplitude different between WT and TS21 neurons ? The measure of neuronal synchrony by mean pixel correlation is not well explained, but it looks somehow simplistic. Neuronal synchrony can be more precisely measured by cross-correlation analysis or spike time tiling coefficients on the traces from single-neuron ROI rather than on all pixels in the field of view, as apparently was done here.

      2.6) We thank the reviewer for these valuable points. We now include quantification of the number and density of transplanted neurons for both WT and Ts21 grafts in Extended Data Figure 5 (see 2.1).

      Regarding the in vivo calcium imaging, we appreciate the reviewer's suggestion to include additional standard metrics. We have quantified the event rate in Real et al 2018. These analyses reveal that Ts21 neurons show a reduction in event rate.

      We agree that our initial description of the synchrony analysis using mean pixel correlation was not sufficiently detailed. We have now clarified this in the Methods and Results, and we acknowledge its limitations. Importantly, we note that the reduced synchronisation is a highly consistent phenotype, observed across at least six independent donor pairs, different differentiation protocols, and both in vitro (and in two independent labs) and in vivo settings. As suggested, future studies using ROI-based approaches-such as cross-correlation or spike-time tiling coefficients-would provide a more refined characterization of synchrony at the single-neuron level (Sintes et al, in preparation). We now include this point in the discussion.

      4) The results on reduced neuronal connectivity in Figure 3 look very striking. However, these results should be accompanied by control experiments to verify the number of neuronal cells and neurite extension in WT and Ts21 cultures. These two parameters could indeed strongly influence the results. As the cultures appear to grow in clusters, bright-field images and TuJ1 staining of the cultures will also greatly help to understand the degree of morphological interconnection between the clusters.

      We now add Tuj1 staining in Supplementary figure 10.

      5) The authors performed RNA-seq experiments on day 50 cultures. Why the authors do not show the complete differential gene expression analysis, but only a small subset of genes? A comprehensive volcano plot and the complete list of identified genes with logFC and FDR values would be helpful. If possible, comparison of the present data (particularly on KCN and HCN expression changes) with published and publicly available expression datasets of other human or human Down syndrome iPSC-derived neurons or human Down syndrome brains will greatly increase the soundness of the present findings. In addition, the gene ontology (GO) results are mentioned in the text, but are not presented. Showing the complete GO analysis for both up and downregulated genes will help the reader to better understand the RNA-seq results. Notably, the results shown in Supplementary Figure on GRIN2A and GRIN2B expression (with values of 300-700 counts versus 2000-4000 counts, respectively) clearly indicate that in both WT and TS21 cultures the NMDA developmental switch has not occurred yet at the 50 days timepoint.

      We now show volcano plots in Supplementary Fig. 11.

      6) The measure of hyperpolarization-activated currents shown in Figure 5 lack proper control experiments. First, the hyperpolarizing current in TS21 cells do not reach a steady-state as the controls. The two curves are therefore hard to compare. To exclude possible difference in kinetic activation, the authors should have prolonged the current injection period (1-2 seconds). Second, to ultimately prove that such currents are mediated by HCN channels in WT cells the authors should perform some control experiments with a specific HCN blocker. A good example of a suitable protocol, with also current blockers to exclude all other possible current contributions, is the one reported in Matt et al Cell. Mol. Life Sci. 68, 125-137 (2011).

      2.7) We thank the reviewer for this detailed and helpful comment. We agree that to definitively identify the recorded currents as Ih, it would be necessary to isolate them pharmacologically using specific HCN channel blockers and appropriate controls, such as those described in Matt et al., Cell. Mol. Life Sci. Unfortunately, due to current constraints, we no longer have access to the animals used in this study and cannot allocate the necessary time or resources, we are unable to perform the additional experiments at this stage.

      However, our goal here was to use electrophysiological recordings as an indication of altered HCN channel activity, which we then support with molecular evidence. We now emphasize this point more clearly in the revised manuscript.

      7) The manuscript lacks information on the statistical analysis used. Also, the numerosity of samples is not clear. Were the dots shown in some graph technical replicates from a single neuronal induction or were all independent neuronal inductions or a mix of the two ? Please clarify.

      We now clarify the numbers in the Figure legend.

      8) The method section lacks important information to guarantee reproducibility. Just a few examples: • Only electrophysiology methods for slice are reported, but not for in vitro culture.

      We now clarify these details in the methods.

      • Details on Laminin coating is lacking. What concentration was used ? Was poly-ornithine or poly-lysine used before Laminin coating ? We now clarify these details in the methods.

      • How long cells were switched to BrainPhys medium before calcium imaging ? We now clarify these details in the methods.

      Minor point/typos etc.

      Introduction • Page 4 line 6: in the line "Trisomy 21 in humans commonly results in a range in developmental and morphological changes in the forebrain ..." "in" could be replaced by "of". We have fixed this. • Page 5 line 2: please remove "an" before the word "another". We have fixed this. • Page 5 line 2: please replace "ecitatory" with "excitatory". We have fixed this typo.

      Results • Page 10 line 25: The concept of "pixel-wise" appears for the first time in this section and could be better introduced to facilitate the understanding of the experiment. • In the "results" section, page 11 line 1 and 4, references are made to "Figure 4D" and "4F," but these figures do not appear to be present in the figure section. Upon reviewing the rest of the section, the data seem to refer to "Figure 3D" and "3E." We have fixed this. Discussion • Page 15 line 20: please replace "synchronised" with "synchronized". We have fixed this typo. • Page 16 line 11: please replace "T21" with "TS21". We have fixed this typo. Methods • Page 19 line 12: "Pens/Strep" has to be replaced by Pen/Strep. We have fixed this typo. • Page 20 line 20: "Tocris Biocience" has to be replaced by "Tocris Bioscience". We have fixed this typo. • Page 21 line 2: "Addegene" has to be replaced by "Addgene". We have fixed this typo. Figures • Figure 3: the schematic experimental design (Fig. 3A) could be enlarged to match the width of the images/graphs below. We have fixed this. • Figure 5: the reviewer suggests resizing/repositioning the graphs in Fig. 1A so that they match the width of those below. We have fixed this. • Figure S1D: In all the figures of the paper, the respective controls for the TS21 1 and TS21 2 lines are labelled as "WT1/WT2," while in these graphs, they are called "Ctrl1" and "Ctrl2." To ensure consistency throughout the paper, it is suggested to change the names in these graphs. We have fixed this. • Figure S4L: The graph is not very clear, especially regarding the significance reported at -50 pA, please modify the graphical visualization and/or add a legend in the caption. We have fixed this.

      Reviewer #2 (Significance (Required)):

      Nature and significance of the advance for the field. The results presented in the manuscript are potentially interesting and useful, but not completely novel (currents deregulation has already been highlighted in mouse models of Down Syndrome).

      2.8) We thank the reviewer for this comment. While we agree that current deregulation has been observed in mouse models of Down syndrome, the novelty and significance of our study lie in demonstrating these alterations directly in human neurons using both in vitro and in vivo xenograft models.

      This is a critical advance because the human cortex has distinct developmental and functional properties not fully recapitulated in mice. In fact, three recent studies have already highlighted significant defects mainly in excitatory neurons within the fetal human DS cortex (Vuong et al., bioRxiv, 2025; Risgaard et al., bioRxiv, 2025; Lattke et al, under revision). Our work builds directly on these observations by providing, for the first time, an electrophysiological and network-level characterization of these human-specific deficits.

      Our findings thus provide translationally relevant insight that is not merely confirmatory but extends previous work by grounding it in a human cellular context.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Peter et al., reports on the neuronal activity and connectivity of iPSC-derived human cortical neurons from Down syndrome (DS) that is caused by caused by trisomy of the human chromosome 21 (TS21).

      Major points:

      Although the manuscript is potentially interesting, the results appear somehow preliminary and need to be corroborated by control experiments and quantifications of effects to fully sustain the conclusions.

      (1) The authors have not assessed the percentage of WT and TS21 cells that acquire a neuronal or glia identity in their cultures. Indeed, the origin of alterations in network activity and connectivity observed in TS21 neurons could simply derive from reduced number of neurons arising from TS21 iPSC. Alternatively, the same alteration in network activity and connectivity could derive from a multitude of other factors including deficits in neuronal development, neurite extension, or intrinsic electrophysiological properties. In the current version of the manuscript, none of these has been investigated.

      (2) Electrophysiological properties of TS21 and WT neurons at day 53/54 in vitro indicate an extremely immature stage of development (i.e. RMP between -36 and -27 mV with most of the cells firing a single action potential after current injection) in the utilized culture conditions: This is far from ideal for in vitro neuronal-network studies. Finally, reduced activity of HCN1 channels should be confirmed by specific recordings isolating or blocking the related current.

      Main points highlighting the preliminary character of the study.

      1) In Figure 1 immunofluorescence images of the neuronal differentiation markers (Tbr1, Ctip2 and Tuj1) are showed. However, no quantification of the percentage of cells expressing these markers for WT and TS21 neurons is reported. On the other hand, simple inspection of the representative images clearly seams to indicate a difference between the two genotypes, with TS21 cultures showing lower number of cells expressing neuronal markers. This quantification should be corroborated by a similar staining for an astrocyte marker (GFAP, but not S100b since is triplicated in DS). This is an extremely important point since it is obvious that any change in the percentage of neurons (or the neuron/astrocyte ratio) in the cultures will strongly affect the resulting network activity (shown in Figure 2) and the connectivity (showed in Figure 4). Possibly, the quantification should be done at the same time points of the calcium imaging experiments.

      2) In Figure 2 the authors show some calcium imaging traces of WT and TS21 cultures at different time points. However, they again do not show any quantification of neuronal activity. A power spectra analysis is shown in Supplementary Figure 2, but only for WT cultures, while in Supplementary Figure 3 a comparison between WT and Ts21 power spectra is done, but only at the 50 day time point, while difference in synchrony are assessed at 60 days. At minimum, the author should include in main Figure 2 the quantification of the mean calcium event rate and mean event amplitude at the different time points and the power spectra analysis for both WT and TS21 cultures at the same timepoints.

      Of note, the synchronized neuronal activity is present in WT cultures at day 60, but totally lost at subsequent time-points (70 and 80 days). The results of this later time points are different from previous data from the same lab (Kirwan et al., 2015). How might these data be explained? It would be important to rule out any potential issues with the health of the culture that could explain the loss of neuronal activity.It would be beneficial to check cell viability at the different time points to exclude possible confounding factors ? A propidium staining or a MTT assay would strongly improve the soundness of the calcium data.

      3) In Figure 3 there is no quantification of the number and/or density of transplanted neurons for WT and TS21, but only representative images. As above, inspection of the representative images seems to show a decrease in cells labeled by the Tbr1 neuronal marker for TS21 cells. Moreover, the in vivo calcium imaging of transplanted WT and TS21 cells lacks most of the quantification normally done in calcium imaging experiments. Are the event rate and event amplitude different between WT and TS21 neurons ? The measure of neuronal synchrony by mean pixel correlation is not well explained, but it looks somehow simplistic. Neuronal synchrony can be more precisely measured by cross-correlation analysis or spike time tiling coefficients on the traces from single-neuron ROI rather than on all pixels in the field of view, as apparently was done here.

      4) The results on reduced neuronal connectivity in Figure 3 look very striking. However, these results should be accompanied by control experiments to verify the number of neuronal cells and neurite extension in WT and Ts21 cultures. These two parameters could indeed strongly influence the results. As the cultures appear to grow in clusters, bright-field images and TuJ1 staining of the cultures will also greatly help to understand the degree of morphological interconnection between the clusters.

      5) The authors performed RNA-seq experiments on day 50 cultures. Why the authors do not show the complete differential gene expression analysis, but only a small subset of genes? A comprehensive volcano plot and the complete list of identified genes with logFC and FDR values would be helpful. If possible, comparison of the present data (particularly on KCN and HCN expression changes) with published and publicly available expression datasets of other human or human Down syndrome iPSC-derived neurons or human Down syndrome brains will greatly increase the soundness of the present findings. In addition, the gene ontology (GO) results are mentioned in the text, but are not presented. Showing the complete GO analysis for both up and downregulated genes will help the reader to better understand the RNA-seq results. Notably, the results shown in Supplementary Figure on GRIN2A and GRIN2B expression (with values of 300-700 counts versus 2000-4000 counts, respectively) clearly indicate that in both WT and TS21 cultures the NMDA developmental switch has not occurred yet at the 50 days timepoint.

      6) The measure of hyperpolarization-activated currents shown in Figure 5 lack proper control experiments. First, the hyperpolarizing current in TS21 cells do not reach a steady-state as the controls. The two curves are therefore hard to compare. To exclude possible difference in kinetic activation, the authors should have prolonged the current injection period (1-2 seconds). Second, to ultimately prove that such currents are mediated by HCN channels in WT cells the authors should perform some control experiments with a specific HCN blocker. A good example of a suitable protocol, with also current blockers to exclude all other possible current contributions, is the one reported in Matt et al Cell. Mol. Life Sci. 68, 125-137 (2011).

      7) The manuscript lacks information on the statistical analysis used. Also, the numerosity of samples is not clear. Were the dots shown in some graph technical replicates from a single neuronal induction or were all independent neuronal inductions or a mix of the two ? Please clarify.

      8) The method section lacks important information to guarantee reproducibility. Just a few examples: - Only electrophysiology methods for slice are reported, but not for in vitro culture. - Details on Laminin coating is lacking. What concentration was used ? Was poly-ornithine or poly-lysine used before Laminin coating ? - How long cells were switched to BrainPhys medium before calcium imaging ?

      Minor point/typos etc.

      Introduction

      • Page 4 line 6: in the line "Trisomy 21 in humans commonly results in a range in developmental and morphological changes in the forebrain ..." "in" could be replaced by "of".
      • Page 5 line 2: please remove "an" before the word "another".
      • Page 5 line 2: please replace "ecitatory" with "excitatory"

      Results

      • Page 10 line 25: The concept of "pixel-wise" appears for the first time in this section and could be better introduced to facilitate the understanding of the experiment.
      • In the "results" section, page 11 line 1 and 4, references are made to "Figure 4D" and "4F," but these figures do not appear to be present in the figure section. Upon reviewing the rest of the section, the data seem to refer to "Figure 3D" and "3E."

      Discussion

      • Page 15 line 20: please replace "synchronised" with "synchronized".
      • Page 16 line 11: please replace "T21" with "TS21".

      Methods

      • Page 19 line 12: "Pens/Strep" has to be replaced by Pen/Strep.
      • Page 20 line 20: "Tocris Biocience" has to be replaced by "Tocris Bioscience".
      • Page 21 line 2: "Addegene" has to be replaced by "Addgene".

      Figures

      • Figure 3: the schematic experimental design (Fig. 3A) could be enlarged to match the width of the images/graphs below.
      • Figure 5: the reviewer suggests resizing/repositioning the graphs in Fig. 1A so that they match the width of those below.
      • Figure S1D: In all the figures of the paper, the respective controls for the TS21 1 and TS21 2 lines are labelled as "WT1/WT2," while in these graphs, they are called "Ctrl1" and "Ctrl2." To ensure consistency throughout the paper, it is suggested to change the names in these graphs.
      • Figure S4L: The graph is not very clear, especially regarding the significance reported at -50 pA, please modify the graphical visualization and/or add a legend in the caption.

      Significance

      Nature and significance of the advance for the field. The results presented in the manuscript are potentially interesting and useful, but not completely novel (currents deregulation has already been highlighted in mouse models of Down Syndrome).

      Work in the context of the existing literature. This work follows the line of evidence that characterizes Down Syndrome in human neurons (Huo, H.-Q. et al. Stem Cell Rep. 10, 1251-1266 (2018); Briggs, J. A. et al. Etiology. Stem Cells 31, 467-478 (2013)), both in vitro and in xenotransplanted mice, by corrborating some important findings already found in animal models (Stern, S., Segal, M. & Moses, E. EBioMedicine 2, 1048-1062 (2015); Cramer, N. P., Xu, X., F. Haydar, T. & Galdzicki, Z. Physiol. Rep. 3, e12655 (2015); Stern, S., Keren, R., Kim, Y. & Moses, E. http://biorxiv.org/lookup/doi/10.1101/467522 (2018) doi:10.1101/467522.

      Audience. Scientists in the field of pre-clinical biomedical research, especially those working on neurodevelopmental disorders and iPSC-based non-animal models.

      Field of expertise. In vitro electrophysiology, Neurodevelopmental disorders, Down Syndrome, ips cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The study investigates the neurodevelopmental impact of trisomy 21 on human cortical excitatory neurons derived from induced pluripotent stem cells (hiPSCs). Key findings include a modest reduction in spontaneous firing, a marked deficit in synchronized bursting, decreased neuronal connectivity, and altered ion channel expression-particularly a downregulation of voltage‐gated potassium channels and HCN1. These conclusions are supported by a combination of in vitro calcium imaging, electrophysiological recordings, viral monosynaptic tracing, RNA sequencing, and in vivo transplantation with two‐photon imaging.

      Major Comments

      • Convincing Nature of Key Conclusions: The study's conclusions are generally well supported by a diverse set of experimental approaches. However, certain claims regarding the intrinsic properties of the excitatory network would benefit from further qualification. In particular, the assertion that reduced synchronization is solely attributable to altered ion channel expression might be considered somewhat preliminary without additional corroborative experiments.
      • One major limitation of the current experimental design is the reliance on predominantly excitatory neuronal cultures derived from hiPSCs. Although the authors convincingly demonstrate differences in network synchronization and connectivity between trisomic (TS21) and control neurons, the almost exclusive focus on excitatory cells limits the physiological relevance of the in vitro network. In the developing cortex, interneurons and astrocytes play crucial roles in modulating network excitability, synaptogenesis, and plasticity. Therefore, incorporating these cell types-either through co-culture systems or through directed differentiation protocols that yield a more heterogeneous neuronal population-could help to determine whether the observed deficits are intrinsic to excitatory neurons or are compounded by a lack of proper inhibitory regulation and glial support.
      • Furthermore, the assessment of neuronal connectivity via pseudotyped rabies virus tracing, while innovative, has inherent limitations. The quantification of connectivity as a ratio of red-to-green fluorescence pixels may be influenced by differential viral infection efficiencies, variations in the expression levels of the TVA receptor, or even by the lower basal activity levels observed in TS21 cultures. Complementary approaches-such as electron microscopy for synaptic density analysis or functional connectivity measurements using multi-electrode arrays (MEAs)-could provide additional structural and functional insights that would validate the rabies tracing data.
      • Qualification of Claims: Some conclusions, particularly those linking specific ion channel dysregulation (e.g., HCN1 loss) directly to network deficits, might be better presented as preliminary. The authors could temper their language to indicate that while the evidence is suggestive, the mechanistic link remains to be fully established.
      • Need for Additional Experiments: Additional experiments that could further consolidate the current findings include:
        • Inclusion of Inhibitory Neurons or Co-culture Systems: Incorporating interneurons or astrocytes would help determine whether the observed deficits are solely intrinsic to excitatory neurons.
        • Alternative Connectivity Assessments: Complementing the rabies virus tracing with electron microscopy or multi-electrode array (MEA) recordings would add structural and functional validation of the connectivity differences.
        • Extended Temporal Profiling: Monitoring network activity over a longer developmental window would clarify whether the observed deficits represent a delay or a permanent alteration in network maturation.
      • Reproducibility and Statistical Rigor: The methods and data presentation are largely clear, with adequate replication and appropriate statistical analyses. Nonetheless, a more detailed description of the experimental replicates, particularly regarding the viral tracing and in vivo transplantation studies, would enhance reproducibility. The availability of raw data and scripts for calcium imaging analysis would also further support independent verification.

      Minor Comments

      • Experimental Details:

      Minor revisions could include clarifying the infection efficiency and expression levels of the viral constructs used in connectivity assays to rule out technical variability. - Literature Context:

      The authors reference prior studies appropriately; however, integrating a brief discussion comparing their findings with alternative DS models (e.g., organoids or other hiPSC-derived systems) would improve contextual clarity. - Presentation and Clarity:

      Figures are generally clear,.But the manuscript contains a minor labeling error. On page 13, the figure is erroneously labeled as "Fig6A", whereas, based on the context and corresponding data, it should be "Fig5A". I recommend that the authors correct this mistake to ensure consistency and avoid potential confusion for readers.

      Significance

      • Nature and Significance of the Advance:

      The work offers a substantial conceptual advance by providing a mechanistic link between trisomy 21 and impaired neuronal network synchronization. Technically, the study integrates state-of-the-art imaging, electrophysiology, and transcriptomic profiling, thereby offering a multifaceted view of DS-related neural dysfunction. Clinically, the findings have the potential to inform future therapeutic strategies targeting network connectivity and ion channel function in Down syndrome. - Context in the Existing Literature:

      The study builds on previous observations of altered network activity in DS patients and DS mouse models (e.g., altered EEG synchronization and reduced synaptic connectivity). It extends these findings to human-derived neuronal models, thus bridging a gap between clinical observations and molecular/cellular mechanisms. Relevant literature includes studies on DS neurodevelopment and the role of ion channels in synaptic maturation. - Target Audience:

      The reported findings will be of interest to researchers in neurodevelopmental disorders, Down syndrome, and ion channel physiology. Additionally, the study may attract the attention of those working on hiPSC-derived models of neurological diseases, as well as clinicians interested in the pathophysiology of DS. - Keywords and Field Contextualization:

      Keywords: Down syndrome, trisomy 21, neuronal connectivity, synchronized network activity, hiPSC-derived cortical neurons, ion channel dysregulation.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the referees for taking time to review our manuscript. These reviews are positive, highlighting the novelty of our findings. The majority of comments are cosmetic, and we have added data in response to some technical points. We feel that some of the additional experiments proposed would not add significant methodological depth, and cross-commenting suggests that our referees agree. At present we are attempting antibody staining to quantify Tk peptide retention in the midgut, as per suggestion by reviewer #2.

      We enclose our point-by-point response to each referee's points, below.



      __Reviewer #1 __

      • Can the authors state in the figure legends the numbers of flies used for each lifespan and whether replicates have been done?
      • We have incorporated the requested information into legends for lifespan experiments.

      • Do the interventions shorten lifespan relative to the axenic cohort? Or do they prevent lifespan extension by axenic conditions? Both statements are valid, and the authors need to be consistent in which one they use to avoid confusing the reader.

      • We read these statements differently. The only experiment in which a genetic intervention prevented lifespan extension by axenic conditions is neuronal TkR86C knockdown (Figure 6B-C). Otherwise, microbiota shortened lifespan relative to axenic conditions, and genetic knockdowns extend blocked this effect (e.g. see lines 131-133). We have ensured that the framing is consistent throughout, with text edited at lines 198-199, 298-299, 311-312, 345-347, 408-409, 424-425, 450, 497-503.

      • TkRNAi consistently reduces lipid levels in axenic flies (Figs 2E, 3D), essentially phenocopying the loss of lipid stores seen in control conventionally reared (CR) flies relative to control axenic. This suggests that the previously reported role of Tk in lipid storage - demonstrated through increased lipid levels in TkRNAi flies (Song et al (2014) Cell Rep 9(1): 40) - is dependent on the microbiota. In the absence of the microbiota TkRNAi reduces lipid levels. The lack of acknowledgement of this in the text is confusing

      • We have added text at lines 219-222 to address this point. We agree that this effect is hard to interpret biologically, since expressing RNAi in axenics has no additional effect on Tk expression (Figure S7). Consequently we can only interpret this unexpected effect as a possible off-target effect of RU feeding on TAG, specific to axenic flies. However, this possibility does not void our conclusion, because an off-target dimunition of TAG cannot explain why CR flies accumulate TAG following TkRNAi We hope that our added text clarifies.

      • *I have struggled to follow the authors logic in ablating the IPCs and feel a clear statement on what they expected the outcome to be would help the reader. *

      • We have added the requested statement at lines 423-424, explaining that we expected the IPC ablation to render flies constitutively long-lived and non-responsive to A pomorum.

      • *Can the authors clarify their logic in concluding a role for insulin signalling, and qualify this conclusion with appropriate consideration of alternative hypotheses? *

      • We have added our logic at lines 449-454. In brief, we conclude involvement for insulin signalling because FoxO mutant lifespan does not respond to TkRNAi, and diminishes the lifespan-shortening effect of * pomorum*. However, we cannot state that the effects are direct because we do not have data that mechanistically connects Tk/TkR99D signalling directly in insulin-producing cells. The current evidence is most consistent with insulin signalling priming responses to microbiota/Tk/TkR99D, as per the newly-added text.

      • Typographical errors

      • We have remedied the highlighted errors, at lines 128-140.

      • I'd encourage the authors to provide lifespan plots that enable comparison between all conditions

      • We have plotted our figures in faceted boxes, because the number of survival curves that would need to be presented on the same axis (e.g. 16 for Figure 5) would not be intellegible. However we have ensured that axes on faceted plots are equivalent and with grid lines for comparison. Moreover, our approach using statistical coefficients (EMMs) enables direct quantitative comparison of the differences among conditions.

      Reviewer #2

      • Not…essential for publication…is it possible to look at Tk protein levels?
      • We have acquired a small amount of anti-TK antibody and we will attempt to immunostain guts associated with * pomorum and L. brevis*. We are also attempting the equivalent experiment in mouse colon reared with/without a defined microbiota. These experiments are ongoing, but we note that the referee feels that the manuscript is a publishable unit whether these stainings succeed or not.

      • it would be good to show that the bacterial levels are not impacted [by TkRNAi]

      • We have quantified CFUs in CR flies upon ubiquitous TkRNAi (Figure S5), finding that the RNAi does not affect bacterial load. New text at lines 138-139 articulates this point.

      • The effect of Tk RNAi on TAG is opposite in CR and Ax or CR and Ap flies, and the knockdown shows an effect in either case (Figure 2E, Figure 3D). Why is this?

      • As per response to Reviewer #1, we have added text at lines 219-222 to address this point.

      • Is it possible to perform at least one lifespan repeat with the other Tk RNAi line mentioned?

      • We have added another experiment showing longevity upon knockdown in conventional flies, using an independent TkRNAi line (Figure S3).

      • Is it possible that this driver is simply not resulting in an efficient KD of the receptor? I would be inclined to check this

      • This comment relates to Figure 7G. We do see an effect of the knockdown in this experiment, so we believe that the knockdown is effective. However the direction of response is not consistent with our hypothesis so the experiment is not informative about the role of these cells. We therefore feel there is little to be gained by testing efficacy of knockdown, which would also be technically challenging because the cells are a small population in a larger tissue which expresses the same transcripts elsewhere (i.e. necessitating FISH).

      • Would it be possible to use antibodies for acetylated histones?

      • The comment relates to Figure 4C-E. The proposed studies would be a significant amount of work because, to our knowledge, the specific histone marks which drive activation in TK+ cells remain unknown. On the other hand, we do not see how this information would enrich the present story, rather such experiments would appear to be the beginning of something new. We therefore agree with Reviewer #1 (in cross-commenting) that this additional work is not justified.

      Reviewer #3

      • *In Line243, the manuscript states that the reporter activity was not increased in the posterior midgut. However, based on the presented results in Fig4E, there is seemingly not apparent regional specificity. A more detailed explanation is necessary. *
      • We thank the reviewer sincerely for their keen eye, which has highlighted an error in the previous version of the figure. In revisiting this figure we have noticed, to our dismay, that the figures for GFP quantification were actually re-plots of the figures for (ac)K quantification. This error led to the discrepancy between statistics and graphics, which thankfully the reviewer noticed. We have revised the figure to remedy our error, and the statistics now match the boxplots and results text.

      • Fig1C uses Adh for normalization. Given the high variability of the result, the authors should (1) check whether Adh expression levels changed via bacterial association

      • We selected Adh on the basis of our RNAseq analysis, which showed it was not different between AX and CV guts, whereas many commonly-used “housekeeping” genes were. We have now added a plot to demonstrate (Figure S2).

      • The statement in Line 82 that EEs express 14 peptide hormones should be supported with an appropriate reference

      • We have added the requested reference (Hung et al, 2020) at line 86.

      • Tk+ EEC activity should be assessed directly, rather than relying solely on transcript levels. Approaches such as CaLexA or GCaMP could be used.

      • We agree with reviewers 1-2 (in cross-commenting) that this proposal is non-trivial and not justified by the additional insight that would be gained. As described above, we are attempting to immunostain Tk, which if successful will provide a third line of evidence for regulation of Tk+ cells. However we note that we already have the strongest possible evidence for a role of these cells via genetic analysis (Figure 5).

      • While the difficulty of maintaining lifelong axenic conditions is understandable, it may still be feasible to assess the induction of Tk (ie. Tk transcription or EE activity upregulation) by the microbiome on males.

      • As the reviewer recognises, maintaining axenic experiments for months on end is not trivial. Given the tendency for males either to simply mirror female responses to lifespan-extending interventions, or to not respond at all, we made the decision in our work to only study females. We have instead emphasised in the manuscript that results are from female flies.

      • TkR86C, in addition to TkR99D, may be involved in the A. pomorum-lifespan interaction. Consider revising the title to refer more generally to the "tachykinin receptor" rather than only TkR99D.

      • We disagree with this interpretation: the results do not show that TkR86C-RNAi recapitulates the effect of enteric Tk-RNAi. A potentially interesting interaction is apparent, but the data do not support a causal role for TkR86C. A causal role is supported only for TkR99D, knockdown of which recapitulates the longevity of axenic flies and TkRNAi flies. Therefore we feel that our current title is therefore justified by the data, and a more generic version would misrepresent our findings.

      • The difference between "aging" and "lifespan" should also be addressed.

      • The smurf phenotype is a well-established metric of healthspan. Moreover, lifespan is the leading aggregate measure of ageing. We therefore feel that the use of “ageing” in the title is appropriate.

      • If feasible, assessing foxo activation would add mechanistic depth. This could be done by monitoring foxo nuclear localization or measuring the expression levels of downstream target genes.

      • Foxo nuclear localisation has already been shown in axenic flies (Shin et al, 2011). We have added text and citation at lines 402-403.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Marcu et al. demonstrate a gut-neuron axis that is required for the lifespan-shortening effects mediated by gut bacteria. They show that the presence of commensal bacteria-particularly Acetobacter pomorum-promotes Tk expression in the gut, which then binds to neuronal tachykinin receptors to shorten lifespan. Tk has also recently been reported to extend lifespan via adipokinetic hormone (Akh) signaling (Ahrentløv et al., Nat Metab 7, 2025), but the mechanism here appears distinct. The lifespan shortening by Ap via Tk seems to be partially dependent on foxo and independent of both insulin signaling and Akh-mediated lipid mobilization. Although the detailed mechanistic link to lifespan is not fully resolved, the experiment and its results clearly shows the involvement of the molecules tested. This work adds a valuable dimension to our growing understanding of how gut bacteria influence host longevity. However, there are some points that should be addressed.

      1. Tk+ EEC activity should be assessed directly, rather than relying solely on transcript levels. Approaches such as CaLexA or GCaMP could be used.
      2. In Line243, the manuscript states that the reporter activity was not increased in the posterior midgut. However, based on the presented results in Fig4E, there is seemingly not apparent regional specificity. A more detailed explanation is necessary.
      3. If feasible, assessing foxo activation would add mechanistic depth. This could be done by monitoring foxo nuclear localization or measuring the expression levels of downstream target genes.
      4. Fig1C uses Adh for normalization. Given the high variability of the result, the authors should (1) check whether Adh expression levels changed via bacterial association and/or (2) compare the results using different genes as internal standard.
      5. While the difficulty of maintaining lifelong axenic conditions is understandable, it may still be feasible to assess the induction of Tk (ie. Tk transcription or EE activity upregulation) by the microbiome on males.
      6. We also had some concerns regarding the wording of the title. Fig6B and C suggests that TkR86C, in addition to TkR99D, may be involved in the A. pomorum-lifespan interaction. Consider revising the title to refer more generally to the "tachykinin receptor" rather than only TkR99D. The difference between "aging" and "lifespan" should also be addressed. While the study shows a role for Tk in lifespan, assessment of aging phenotypes (eg. Climbing assay, ISC proliferation) beyond the smurf assay is required to make conclusions about aging.
      7. The statement in Line 82 that EEs express 14 peptide hormones should be supported with an appropriate reference, if available.

      Referees cross-commenting

      I agree with the other reviewers that the study has been done very well and hence additional experiments are not mandatory to be published such as calcium imaging. However, I still believe that testing Tk's elevation by the Ap in males should greatly increase the generality of the finding, no matter what the outcome would be. Too many studies use only females.

      Significance

      General assessment

      The main strength of this study is the careful and extensive lifespan analyses, which convincingly demonstrate the role of gut microbiota in regulating longevity. The authors clarify an important aspect of how microbial factors contribute to lifespan control. The main limitation is that the study primarily confirms the involvement of previously reported signaling pathways, without identifying novel molecular players or previously unrecognized mechanisms of lifespan regulation.

      Advance

      The lifespan-shortening effect of Acetobacter pomorum (Ap) has been reported previously, as has the lifespan-shortening effect of Tachykinin (Tk). However, this study is the first to link these two factors mechanistically, which represents a significant and original contribution to the field. The advance is primarily mechanistic, providing new insight into how microbial cues converge on host signaling pathways to influence ageing.

      Audience

      This work will be of particular interest to a specialized audience of basic researchers in ageing biology. It will also attract interest from microbiome researchers who are investigating host-microbe interactions and their physiological consequences. The findings will be useful as a conceptual framework for future mechanistic studies in this area.

      Field of expertise

      Drosophila ageing, lifespan, microbiome, metabolism

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The main finding of this work is that microbiota impacts lifespan though regulating the expression of a gut hormone (Tk) which in turn acts on its receptor expressed on neurons. This conclusion is robust and based on a number of experimental observation, carefully using techniques in fly genetics and physiology: 1) microbiota regulates Tk expression, 2) lifespan reduction by microbiota is absent when Tk is knocked down in gut (specifically in the EEs), 3) Tk knockdown extends lifespan and this is recapitulated by knockdown of a Tk receptor in neurons. These key conclusions are very convincing. Additional data are presented detailing the relationship between Tk and insulin/IGF signalling and Akh in this context. These are two other important endocrine signalling pathways in flies. The presentation and analysis of the data are excellent.

      There are only a few experiments or edits that I would suggest as important to confirm or refine the conclusions of this manuscript. These are:

      1. When comparing the effects of microbiota (or single bacterial species) in different genetic backgrounds or experimental conditions, I think it would be good to show that the bacterial levels are not impacted by the other intervention(s). For example, the lifespan results observed in Figure 2A are consistent with Tk acting downstream of the microbes but also with Tk RNAi having an impact on the microbiota itself. I think this simple, additional control could be done for a few key experiments. Similarly, the authors could compare the two bacterial species to see if the differences in their effects come from different ability to colonise the flies.
      2. The effect of Tk RNAi on TAG is opposite in CR and Ax or CR and Ap flies, and the knockdown shows an effect in either case (Figure 2E, Figure 3D). Why is this? Better clarification is required.
      3. With respect to insulin signalling, all the experiments bar one indicate that insulin is mediating the effects of Tk. The one experiment that does not is using dilpGS to knock down TkR99D. Is it possible that this driver is simply not resulting in an efficient KD of the receptor? I would be inclined to check this, but as a minimum I would be a bit more cautious with the interpretation of these data.
      4. Is it possible to perform at least one lifespan repeat with the other Tk RNAi line mentioned? This would further clarify that there are no off-target effects that can account for the phenotypes.

      There are a few other experiments that I could suggest as I think they could enrich the current manuscript, but I do not believe they are essential for publication: 5. The manuscript could be extended with a little more biochemical/cell biology analysis. For example, is it possible to look at Tk protein levels, Tk levels in circulation, or even TkR receptor activation or activation of its downstream signalling pathways? Comparing Ax and CR or Ap and CR one would expect to find differences consistent with the model proposed. This would add depth to the genetic analysis already conducted. Similarly, for insulin signalling - would it be possible to use some readout of the pathway activity and compare between Ax and CR or Ap and CR? 6. The authors use a pan-acetyl-K antibody but are specifically interested in acetylated histones. Would it be possible to use antibodies for acetylated histones? This would have the added benefit that one can confirm the changes are not in the levels of histones themselves. 7. I think the presentation of the results could be tightened a bit, with fewer sections and one figure per section.

      Referees cross-commenting

      Reviewer 1

      I generally agree with this reviewer but for

      "I'm convinced by the data showing that FOXO is required for TkRNAi to prevent lifespan shortening by Ap, but FOXO doesn't only respond to insulin signalling and can't be taken by itself to indicate a role for insulin signalling which the authors appear to do here."

      To the best of my knowledge, Foxo has only been shown to be required for lifespan extension/modulation by a reduction in insulin-like signalling. I.e. it does respond to other pathways but this is the only one where Foxo activity is known to modulate lifespan.

      Reviewer 3

      I agree with reviewer 1 that point raised under (1) does not appear strictly required for the conclusions of the manuscript.

      Both reviewers 1 and 3:

      I have a different take on the results of experiments where IPCs are manipulated. To me, Figure 7D and E show that ablating the IPCs removes the difference between Ax and Ap i.e. the IPCs are involved and insulin-like signalling is likely involved. The fact that RNAi against the TKR99D receptor does not have the same effect, does not matter (the sensing could happen in different neurons). Similarly, dilp expression is only a minor readout of what is happening with insulin-like signalling - dilps are controlled at the level of secretion.

      However, I would be happy for the authors to present different arguments and make a reasonable conclusion, which may differ from mine. But I think the arguments I present above should be taken into account.

      Significance

      The main contribution of this manuscript is the identification of a mechanism that links the microbiota to lifespan. This is very exciting and topical for several reasons:

      1) The microbiota is very important for overall health but it is still unclear how. Studying the interaction between microbiota and health is an emerging, growing field, and one that has attracted a lot of interest, but one that is often lacking in mechanistic insight. Identifying mechanisms provides opportunities for therapies. The main impact of this study comes from using the fruit fly to identify a mechanism.

      2) It is very interesting that the authors focus on an endocrine mechanism, especially with the clear clinical relevance of gut hormones to human health recently demonstrated with new, effective therapies (e.g. Wegovy).

      3) Tk is emerging as an important fly hormone and this study adds a new and interesting dimension by placing TK between microbiota and lifespan.

      I think the manuscript will be of great interest to researchers in ageing, human and animal physiology and in gut endocrinology and gut function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study the authors use a Drosophila model to demonstrate that Tachykinin (Tk) expression is regulated by the microbiota. In Drosophila conventionally reared (CR) flies are typically shorter lived than those raised without a microbiota (axenic). Here, knockdown of Tk expression is found to prevent lifespan shortening by the microbiota and the reduction of lipid stores typically seen in CR flies when compared to axenic counterparts. It does so without reducing food intake or fecundity which are often seen as necessary trade-offs for lifespan extension. Further, the strength of the interaction between Tk and the microbiota is found to be bacteria specific and is stronger in Acetobacter pomorum (Ap) monoassociated flies compared to Levilactobacillus brevis (Lb) monoassociation. The impact on lipid storage was also only apparent in Ap-flies. Building on these findings the authors show that gut specific knockdown is largely sufficient to explain these phenotypes. Knockdown of the Tk receptor, TkR99D, in neurons recapitulates the lifespan phenotype of intestinal Tk knockdown supporting a model whereby Tk from the gut signals to TkR99D expressing neurons to regulate lifespan. In addition, the authors show that FOXO may have a role in lifespan regulation by the Tk-microbiota interaction. However, they rule out a role for insulin producing cells or Akh-producing cells suggesting the microbiota-Tk interaction regulates lifespan through other, yet unidentified, mechanisms.

      Major comments:

      Overall, I find the key conclusions of the paper convincing. The authors present an extensive amount of experimental work, and their conclusions are well founded in the data. In particular, the impact of TkRNAi on lifespan and lipid levels, the central finding in this study, has been demonstrated multiple times in different experiments and using different genetic tools. As a result, I don't feel that additional experimental work is necessary to support the current conclusions. However, I find it hard to assess the robustness of the lifespan data from the other manipulations used (TkR99DRNAi, TkRNAi in dFoxo mutants etc.) because information on the population size and whether these experiments have been replicated is lacking. Can the authors state in the figure legends the numbers of flies used for each lifespan and whether replicates have been done? For all other data it is clear how many replicates have been done, and the methods give enough detail for all experiments to be reproduced.

      Minor comments:

      While I feel the conclusions of this study are well supported by the data I found this to be a complex read and in places hard to follow. I feel some work is necessary in the writing to help the reader follow the authors logic. Below I describe some of the issues that confused me and provide some suggestions that I hope the authors will find helpful.

      Survival curves The authors state that the lifespan difference between CR and axenic flies disappears with TkRNAi because TkRNAi CR flies are longer lived, rather than because TkRNAi axenic flies are shorter lived. Is this consistent in every TkRNAi experiment? It's hard for the reader to assess this because the relevant lifespan curves are presented on separate plots. I'd encourage the authors to provide lifespan plots that enable comparison between all conditions. For example, in figures 2 and 6 the reader wants to directly compare between RU- and RU+ but can't easily do so. Additional plots could be made available in the supplementary figures showing the comparisons that are not easy to make on the main figures.

      Consistent framing of the data Do the interventions shorten lifespan relative to the axenic cohort? Or do they prevent lifespan extension by axenic conditions? Both statements are valid, and the authors need to be consistent in which one they use to avoid confusing the reader. For example, line 325 says TkR86CRNAi prevents lifespan extension in axenic flies. Given the framing in the previous sections, it might be clearer to say that TkR86CRNAi shortens the lifespan of axenic flies to that of CR flies in contrast to TkRNAi and TkR99DRNAi which don't.

      The impact of TkRNAi on lipid levels in axenic flies TkRNAi consistently reduces lipid levels in axenic flies (Figs 2E, 3D), essentially phenocopying the loss of lipid stores seen in control conventionally reared (CR) flies relative to control axenic. This suggests that the previously reported role of Tk in lipid storage - demonstrated through increased lipid levels in TkRNAi flies (Song et al (2014) Cell Rep 9(1): 40) - is dependent on the microbiota. In the absence of the microbiota TkRNAi reduces lipid levels. The lack of acknowledgement of this in the text is confusing for the reader because it is inconsistent with the microbiota driving both higher Tk expression and higher lipid storage. If the microbiota increases Tk expression and this results in reduced lipid storage, why does reduced Tk expression also result in reduced lipid storage in axenic flies? This could further highlight the unique impact that the interaction between TkRNAi and the microbiota has on lipid storage, given it reverses both the impact of the microbiota alone and TkRNAi alone. I feel this aspect of the data should be given more attention in the text both for clarity and because it may be telling us something important about the function of Tk. The current framing around pleiotropic effects is valid, and the impact of Tk on lipid storage is clearly independent of its impact on lifespan and so is not central to this study. However, I feel a short additional paragraph to acknowledge this nuance of the data is needed. It can be made clear in the text that further exploration is beyond the scope of the current study.

      Role of insulin signalling and insulin producing cells I'm convinced by the data showing that FOXO is required for TkRNAi to prevent lifespan shortening by Ap, but FOXO doesn't only respond to insulin signalling and can't be taken by itself to indicate a role for insulin signalling which the authors appear to do here.

      I would expect ablation of IPCs to have the opposite effect to foxo mutation and to increase FOXO activity throughout the organism due to a reduction in Dilp levels and so reduced insulin signalling. So, I have struggled to follow the authors logic in ablating the IPCs and feel a clear statement on what they expected the outcome to be would help the reader. They find that TkRNAi still prevents lifespan shortening by Ap when IPCs are ablated and that TkR99DRNAi in IPCs also doesn't block lifespan shortening by Ap despite reducing the expression of dilp3 and dilp5. To me these data rule out a role for insulin signalling despite the requirement for FOXO and yet the authors conclude that insulin signalling is involved in the response to Ap and TkRNAi, although not obligately (lines 420 - 422 and 511 - 512). Can the authors clarify their logic in concluding a role for insulin signalling, and qualify this conclusion with appropriate consideration of alternative hypotheses? The potential involvement of other signalling inputs to FOXO activity, e.g. immune signalling and JNK, should be acknowledged and warrants some discussion.

      Typographical errors:

      Incomplete sentence line 121 to 122 - starting "Cox proportional hazards.... and posthoc tests (Fig 2b).

      Line 123 "EMMs" - define abbreviation on first use

      References to Fig 2b (first given on line 122), should be capitalised to Fig 2B for consistency.

      Lines 231 and 317 - the phrase "steady state (microbiota independent) expression" in reference to flyATLAS 2 data could be misleading. The term "microbiota independent" could suggest that expression levels have been shown not to be regulated by the microbiota and this is not the case. The authors should change this to simply state they are referring to steady state expression in conventionally reared flies.

      Referees cross-commenting

      Below are brief comments on the revision suggestions that reviewers 2 and 3 have requested.

      Reviewer 2

      1. I agree that confirmation that TkRNAi doesn't impact microbial levels could be helpful and would be straightforward for the authors to do. However, I don't feel it's essential to support the central claims of the paper.
      2. I agree.
      3. I don't feel that any of these experiments supports a role for insulin signalling, so I don't feel that this additional control is needed.
      4. It would be a good addition to have lifespan data from a separate knockdown line for corroboration. However, this has already been done in several different genetic backgrounds through crosses with different driver lines in multiple tissues, so I feel it's unnecessary given the time and resources that lifespan experiments take. There's also the caveat that different RNAi lines can knockdown to different extents so that would have to be assessed as well and if there's a difference it may mean that ultimately not much can be concluded from this additional experiment.
      5. A good suggestion, but not straightforward and depends on the availability of the necessary tools, or possibly the generation of new tools. One for a follow up study.
      6. I feel this is not important enough to the central findings of the study to warrant the extra work.
      7. I agree.

      Reviewer 3 1. Imaging calcium signalling is not straightforward unless a lab already has the tools available and optimised. If Tk+ EEs show changes in calcium signalling I'm not convinced that this tells us anything specific to the Tk-microbiota interaction. The point is the role of Tk itself, not the broader activity of the cells that express it. 2. I agree this needs clarification. 3. I agree that this would add depth, if feasible, but feel it's not essential to support the current conclusions. 4. This is a minor point and given the RT-qPCR data and the RNAseq data corroborate each other I'm convinced that Tk levels are elevated. 5. I feel exploring this in males is opening an additional line of enquiry beyond the scope of the current study. Either the phenotypes are the same - in which case what is added? - or they are different but there's no scope to assess why. A good suggestion for a follow up study. 6. No comment. 7. Agreed.

      One final comment. It's true that FOXO has only been shown to regulate lifespan in the context of insulin signalling. However, as far as I'm aware it hasn't been shown not to regulate lifespan downstream of it's other activators, this simply hasn't been explored due to the historical focus on insulin signalling in this field. In the context of host-microbiota interactions considering other pathways the activate FOXO, such as immune and JNK signals, would make sense.

      Reviewed by Dr Rebecca Clark, Department of Biosciences, Durham University

      Significance

      Overall, I find the key conclusions of the paper convincing. The authors present an extensive amount of experimental work, and their conclusions are well founded in the data. We have known that the microbiota influence lifespan for some time but the mechanisms by which they do so have remained elusive. This study identifies one such mechanism and as a result opens several avenues for further research. The Tk-microbiota interaction is shown to be important for both lifespan and lipid homeostasis, although it's clear these are independent phenotypes. The fact that the outcome of the Tk-microbiota interaction depends on the bacterial species is of particular interest because it supports the idea that manipulation of the microbiota, or specific aspects of the host-microbiota interaction, may have therapeutic potential.<br /> These findings will be of interest to a broad readership spanning host-microbiota interactions and their influence on host health. They move forward the study of microbial regulation of host longevity and have relevance to our understanding of microbial regulation of host lipid homeostasis. They will also be of significant interest to those studying the mechanisms of action and physiological roles of Tachykinins.

      Field of expertise: Drosophila, gut, ageing, microbiota, innate immunity

  2. Oct 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for providing thoughtful and constructive feedback, which will help us improve the clarity and rigor of the paper. On balance, the reviews were positive. Reviewer 1 mentioned that “This is a strong manuscript with few problems and all important findings well justified, indeed this is a nicely polished…..high-quality manuscript,” and that “this paper makes a major breakthrough, showing that cell autonomous defects in hTSCs are very likely at the heart of the pathology observed in GIN-prone murine mutants.” Reviewer 3 stated that “The study is well designed, and the manuscript is very well written. The conclusions are supported by the evidence presented.” Reviewer 2 was less enthusiastic, with main concerns being that “The paper is mostly descriptive and often quite confusing leaving one not much closer to understanding the mechanistic basis for the interesting sex-biased semi-lethal phenotype.” and felt that figure titles/section headers overstated the results, and finally recommended to improve some technical aspects and tempering conclusions. The proposed edits we think address most issues raised by the reviewers either with re-writing or adding data as described below.

      In response to reviewer #1 comments:

      Major comments:

      • I am confused as to the basis of the sex-skewing phenomenon? Is the problem that lack of maternally loaded WT Mcm4 worsens the phenotype, or is the issue that Mcm4C3/C3 dams are less able to retain pregnancies, perhaps being a more inflammatory environment? Also, while there quite consistent evidence for reduced viability of Mcm4C3/C3McmGt/+ progeny, especially for female progeny, how confident can we be that the genotype of the dam vs. sire is important? Notably on a Ddx58 background, the progeny of the Mcm4C3/C3 sire included seven live male Mcm4C3/C3McmGt/+ but no female.

      Regarding the first point (sex skewing only when female is C3/C3), we also suspected either: 1) the maternal uterine environment, or 2) reduced oocyte quality. Although not reported in this manuscript, we tested #1 by performing embryo transfer experiments. Transferring 2-cell stage embryos from sex-skewing mating to WT females did not rescue the sex-bias. We then examined oocytes from C3/C3 females. We found evidence for compromised mitochondria and transcriptome disruption. However, we are not sure why this happens (poor follicle support? Oocyte intrinsic phenomenon?). We are reserving these results and additional experiments for another paper, especially since this one mainly deals with GIN and placenta development. If the reviewers feel strongly that the embryo transfer data is crucial, we can include it.

      Regarding how confident we are that the genotype of the dam vs. sire is important, this stems from our previous paper by McNairn et al 2019 (the percentage of female C3/C3 M2/+ from sex-skewing mating is 20% compared to 60% from the reciprocal mating), which was quite dramatic. Consistent with this, MCM levels were significantly reduced in the placentae only when the dam was C3/C3 and the sire C3/+ M2/+, but not in the reciprocal cross. The reviewer makes a good observation about the Ddx58 cross; we can only hypothesize that the mutation somehow sensitizes females in this scenario and will make mention of it in the revision. We also realize that we neglected to write in Methods that the Ddx58 allele was coisogenic in the C3H background.

      • I'm not sure what Supplementary Figure 6 is showing (faster differentiation of C3 but less TGC?). Regardless, it's hard to draw too much conclusion from one not-very-pretty Western blot. This figure requires both additional replicates and a better explanation of how it fits with the other conclusions of the paper..

      We hypothesized that the JZ defect observed in the semi-lethal genotype placentas could arise either from impaired maintenance of the progenitor pool or from reduced capacity of mutant trophoblast progenitors to differentiate into the JZ lineage. The blot in Supplementary Figure 6 was intended as a qualitative demonstration that mutant trophoblast stem cells can differentiate into JZ lineages. We recognize that the figure is not definitive and will revise the text to clarify its purpose. A replicate(s) of the Western will be performed as suggested.

      • Supplementary Figure 7F-G is puzzling. Half of the mESCs have gamma-H2AX at all times, including most in S or G2 phase? In Figure S7E, do the quadrants correspond to being negative or positive for gamma-H2AX? At very least, IF images showing clear gamma-H2AX foci would be much more convincing.

      The gates for γH2AX FACS analysis were established using negative controls lacking primary antibody. As reported previously, embryonic stem cells display high basal levels of γH2AX staining (Chuykin et al., Cell Cycle 2008; Turinetto et al., Stem Cells 2012; Ahuja et al., Nat Comm 2016), which likely explains the broad signal observed across cell cycle phases. Regardless, we will provide immunofluorescence staining of γH2Ax and foci count in our revision.

      • The methods section is well detailed, but it would be ideal to clarify how many replicates each Western Blot or flow cytometry experiment is representative of.

      Thanks for the suggestion. We will update this for Fig4 and Fig5.

      Minor comments:

      • Is it possible that cGAS-STING and RIG pathways act redundantly to cause inflammation and lethality, or that other innate immune components are involved? I don't expect the authors to make compound mutants to test this but at least this possibility should be discussed textually.

      We appreciate the reviewer’s point, and had the same suspicion. Supporting this, we will add new RNA-seq analysis of Tmem173 KO placentas revealed elevated inflammatory gene expression compared to C3/C3 M2/+ controls, consistent with potential redundancy or feedback regulation. We will update in supplementary figures to reflect this.

      In response to reviewer #2 comments:

      Major comments:

      A major concern throughout the paper is that conclusions are often overstating their data. The title of figure 2 is "placentae with replication stress have smaller junctional and labyrinth zones". However, there is no measure of replication stress in this figure, just a histological evaluation of the placentae from the different mutants. The title of figure 3 is "Impact of GIN on LZ is less than JZ," but there is no measure of GIN, but instead measurement of number of cells in cell cycle and some bulk RNA-seq analysis. Title of figure 4 is "TSCs with increased genomic instability exhibit abnormal phenotypes." Again there is no measure of GIN, but instead staining of derived TSCs for proliferation, cell death, and a TSC marker. Title of figure 5 is "DNA damage responses and G2/M checkpoint activation drive premature TSC differentiation." However, there does not appear to be a difference in gH2AX between the two mutant genotypes. Checkpoint proteins might be up, but need quantification and reproduction. > 4C is the only marker of differentiation. Importantly, all the analyses here are associations, not connections, so cannot use the word "drive". Similar issues can be raised with a number of the supplementary figures.

      The Chaos3 (chromosome aberrations occurring spontaneously 3) model is a well-established system of intrinsic chronic replication stress and GIN. It is characterized by ~20 fold elevation of blood micronuclei (Shima et al., Nature 2007), a hallmark of GIN (Soxena et al., Mol Cell 2022); a destabilized MCM2-7 helicase prone to replication fork collapse (Bai et al., PLoS Genet 2016); and increased mitotic chromosome abnormalities and decreased dormant origins (Kawabata et al., Mol Cell 2011; Chuang et al., Nucleic Acid Res 2012) that are known to cause GIN and replication stress (Ibarra et al., PNAS 2008 ). Also, in our previous work (McNairn et al Nature 2019), we showed that placentae from C3/C3 dams exhibit significantly elevated γH2Ax as well as reduced MCM2 and MCM4 protein levels. In our current study, we also observe elevated γH2Ax in mutant TSCs (C3/C3 and C3/C3 M2/+), consistent with genomic instability. Nevertheless, we acknowledge that in TSCs, we did not formally demonstrate replications stress(RS), so where appropriate, we will advise figure titles, for example to say that “cells/placentae with a GIN or RS genotype.”

      We acknowledge the reviewers concern regarding western blots. We will provide quantification and statistics in our revision.

      1) A deeper analysis of the cell lines is likely to be the most fruitful path to reveal interesting mechanisms. It is very surprising that there is no phenotype in ESCs. Authors should check for increased apoptosis. Maybe the phenotypic cells are lost. Or do ESCs use different MCMs/mechanisms of DNA replication or are they better able to handle replication stress and GIN? How many passages were the TSCs and ESCs cultured for? Does GIN (i.e. aneuploidy, CNVs) develop in TSCs and ESCs with passaging? How do the MCM mutations impact the molecular identity of the ESC and TSC cells including their heterogeneity in the population.

      We assessed apoptosis using cleaved caspase 3 flow cytometry in mutant ESCs and observed no difference compared to controls (we will add this data as Supplementary Fig. 7).

      We believe there are intrinsic differences in TSCs and ESCs in their ability to respond to and counteract replication stress and DNA damage. ESCs are known to license more replication origins than somatic cells at a higher rate, which protects them from short G1-induced replication stress (Ahuja et al., Nat Comm 2016; Ge et al., Stem Cell Rep 2015; Matson et al., eLife 2017). Human placental cells physiologically exhibit high levels of mutation rate and chromosomal instability in vivo (Coorens et al., Nature 2021). Supporting this, Wang, D., et al (Nat Comm 2025) reported that several cell cycle and DDR regulators are differentially expressed in human TSCs vs human pluripotent stem cells. Whether such transcriptional differences directly contribute to functional outcomes remains to be determined.

      All experiments in this study were conducted using early-passage ESCs and TSCs (i.e. Finally, we showed that close to 90% mutant ESCs are KLF4+ (a naive pluripotency marker) whereas EOMES+ cells were significantly reduced in TSCs carrying the GIN genotype (Fig. 4E–F and Supplementary Fig. 7), highlighting lineage-specific differences.

      Minor Comments:

      1) There is a lack of quantification and repeats for all Westerns. At minimum there should be three repeats for each experiment, quantification including normalization to a reference protein, and stats confirming any proposed differences between conditions.

      We will update our revision with quantification and statistics for western blots.

      2) I would recommend moving the results in supp table 1 to figure 1. While negative, they are the newer results. The results shown in current figure 1 are essentially a reproduction of their previous work.

      The placental observations presented in Fig.1 are new. In particular, the placental and embryonic weight measurements graphed in Fig1B and C have not been published by our group. Fig1A reproduces our previous observation on embryo viability in GIN mutants (McNairn et al., Nature 2019), while the schematic was provided for better flow and readability given the complex mating schemes. We are agnostic on the Suppl Table 1. It could be changed to a new Table 1 in the main section depending on the journal.

      In response to reviewer #3 comments:

      Major Comments

      While the inclusion of bulk RNAseq data of whole placental tissue is appreciated, the interpretation of the results is somewhat problematic, as it is acknowledged that the cell type composition of the placentas is drastically different between groups. Making conclusions based upon GSEA analysis of two different groups with drastically different cell type composition is somewhat misleading, as based on the results, it is a direct reflection of the cell types present. It would be more helpful to perform cell type deconvolution of the RNAseq data to estimate the proportion of each cell type within the bulk samples and compare that to what is seen histologically and not dive too deeply into the pathways since the results could just be a reflection of the cell types e.g. angiogenesis pathways from more endothelial cells. Additionally, the RNAseq data can be leveraged to look at expression of inflammatory genes by sex, which may show interesting patterns based on the other results.

      We agree that the representation of cell types in the placenta is problematic especially for underrepresented genes. We propose to use the BayesPrism tool (Chu et al., Nat Cancer 2022) to deconvolute bulk RNA-seq for better representation of transcriptional changes in the placenta.

      Section: GIN impairs trophoblast stem cell establishment and maintenance. To support the assertion in the first paragraph, beyond measuring apoptosis, it would be helpful at this stage to look at RNA expression levels indicative of the activation of DNA damage checkpoint genes

      We have performed RNA-seq on mutant ESC and TSCs and are in the process of data analysis. We will update these results in the revision.

      Please include additional methodological details in the methods section on the statistical analysis done for differential expression analysis. Specifically, what type of normalization was used, if lowly expressed genes were filtered out and at what cutoff, what statistical model was used (did you include covariates?), what comparisons were made? Did you stratify by sex? What cutoff was used for statistical significance? Did you perform multiple testing correction?

      We will update RNA-Seq data analysis methods in our full revision.

      2. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1 comments:

      • Supplementary Table 1. would be enhanced greatly showing comparable tables for Mcm4C3/C3 x Mcm4C3/+McmGt/+ in mice without the Tmem173 or Ddx58 mutations. It is fine to recycle data from McNairn 2019 here, as long as the source is indicated, but a comparison is needed.

      Thanks for pointing this out. We have updated this suggestion in Supp table 1.

      • In Figure S3E-F, is the box above each graph supposed to show the genotype of the dam?

      Yes. Thanks for pointing this out. We have added a description in the figure legend to make it clear.

      • "Indeed, the placenta and embryo weights of E13.5 Mcm4C3/C3 Mcm2Gt/+ Mcm3Gt/+ animals were significantly improved vs. Mcm4C3/C3 Mcm2Gt/+ animals, rendering them similar to Mcm4C3/C3 littermates (Fig. 6A-C). The JZ (but not LZ) area in Mcm4C3/C3 Mcm2Gt/+ Mcm3Gt/+ placentae also increased to the level of Mcm4C3/C3 littermates (Fig. 6D-H)." There are two problems here. First, the figure calls are wrong. Second, the description of the data is not quite right, it looks like the C3/C3 and C3/C3 M2/+ M3/+ LZs are a similar size to each and are statistically indistinguishable.

      Thanks for catching this. We have updated these in the main text.

      *Reviewer #2 comments: *

      Minor comment

      • Need to review citations to figures. For example, no citations are made to figure 4a and 4c.

      Thanks for catching this. We have updated the text.

      Reviewer #3 comments:

      Define the first use of >4C DNA content to help readers understand this potentially unfamiliar term.

      We have edited this part to indicate cells with more than 4C DNA content for better clarity.

      iDEP tool - please include citation to manuscript instead of link

      We have updated this citation.

      Check citations. Some citations to BioRxiv that are now published e.g. 13.

      We have updated this citation.

      3. Description of analyses that authors prefer not to carry out

      Reviewer 2

      2) Along similar lines, most of the in vivo phenotypic analyses are performed at E13.5, long after defects are likely beginning to express themselves especially given that they see phenotypes in the TSCs, which represent the polar TE of a E4.5. To understand the primary defects of the in vivo phenotype, they should be looking much earlier. Supplemental figure 5 is a start but represents a rather superficial analysis.

      The peri-implantation period, namely E4.5, represents a “black box” of embryonic development given that this is a critical stage for implantation. Aside from being an extremely difficult stage to analyze technically, we don’t think it is essential to the conclusions (or doable in a timely manner), especially given the use of TSCs. If we complete EdU studies on E6.5 embryos, we will include them.

      3) Fig. 6 would benefit from evidence that MCM3 mutant is rescuing MCM4 levels in the chromatin fraction of cells and the DNA damage phenotype.

      The genetic evidence presented is strong, and although we didn’t do the suggested experiment, we feel that our previous studies (McNairn et al., Nature 2019 and Chuang et al., PLoS Genet 2010) on the effects of MCM3 as a nuclear export factor (as it is in yeast (Liku et al., Mol Biol Cell 2005)) are a reasonable basis for not repeating such experiments. Furthermore, we are no longer maintaining the Mcm3 line and it would take over a year to reconstitute and rebreed triple mutants.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript examines chronic replication stress-mediated genomic instability in placental development and concludes that it disrupts placental development in mice. The study is well designed and the manuscript is very well written. The conclusions are supported by the evidence presented. The manuscript would be improved by addressing the comments below.

      Major Comments:

      • While the inclusion of bulk RNAseq data of whole placental tissue is appreciated, the interpretation of the results is somewhat problematic, as it is acknowledged that the cell type composition of the placentas is drastically different between groups. Making conclusions based upon GSEA analysis of two different groups with drastically different cell type composition is somewhat misleading, as based on the results, it is a direct reflection of the cell types present. It would be more helpful to perform cell type deconvolution of the RNAseq data to estimate the proportion of each cell type within the bulk samples and compare that to what is seen histologically and not dive too deeply into the pathways since the results could just be a reflection of the cell types e.g. angiogenesis pathways from more endothelial cells. Additionally, the RNAseq data can be leveraged to look at expression of inflammatory genes by sex, which may show interesting patterns based on the other results.

      • Section: GIN impairs trophoblast stem cell establishment and maintenance. To support the assertion in the first paragraph, beyond measuring apoptosis, it would be helpful at this stage to look at RNA expression levels indicative of the activation of DNA damage checkpoint genes

      Minor Comments:

      • Define the first use of >4C DNA content to help readers understand this potentially unfamiliar term.

      • Please include additional methodological details in the methods section on the statistical analysis done for differential expression analysis. Specifically, what type of normalization was used, if lowly expressed genes were filtered out and at what cutoff, what statistical model was used (did you include covariates?), what comparisons were made? Did you stratify by sex? What cutoff was used for statistical significance? Did you perform multiple testing correction?

      • iDEP tool - please include citation to manuscript instead of link

      • Check citations. Some citations to BioRxiv that are now published e.g. 13.

      Significance

      The manuscript concludes that replication-stress induced genomic instability impairs placental development in mice. This is a significant advance in the field, as it mechanistically links genomic instability to placental development with further study needed in human trophoblast to establish clinical relevance. Strengths of this manuscript include solid study design, interpretation and presentation (both writing and figures). Weakness of the manuscript reside primarily in the RNAseq analysis results, methods and interpretation. The manuscript is of interest to audiences with interests in genome maintenance, development and placental biology. To contextualize this reviewer's point of view, this review is based on expertise in genomics, computational biology and placental biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript, "Chronic replication stress-mediated genomic instability disrupts placenta development in mice" by Munisha et al follows up a 2019 paper in Nature by the same group where they show that mutations to the MCM genes lead to a sex-skewed semi-lethal phenotype starting after embryonic day 9.5 and extending to birth. In the paper, they hypothesized that the semi-lethality is secondary to genomic instability (GIN) driven inflammation due to activation of the innate immune pathways sensing cytoplasmic DNA. In this paper, they start by disproving that hypothesis and then go on to present data arguing lethality is due to a placental development defect rather than inflammation. The paper is mostly descriptive and often quite confusing leaving one not much closer to understanding the mechanistic basis for the interesting sex-biased semi-lethal phenotype that was described in their original paper. The most interesting aspect of the paper is the derivation of TSC and ESCs and initial analysis suggesting that the TSCs are more sensitive to the MCM mutations, but the analysis is rather shallow. Importantly it is unclear how the phenotype explains the sex-skewing of the phenotype. Are the TSC phenotypes sex-skewed and if so why? Also, why is the JZ and especially GlyTCs most effected?

      A major concern throughout the paper is that conclusions are often overstating their data. The title of figure 2 is "placentae with replication stress have smaller junctional and labyrinth zones". However, there is no measure of replication stress in this figure, just a histological evaluation of the placentae from the different mutants. The title of figure 3 is "Impact of GIN on LZ is less than JZ," but there is no measure of GIN, but instead measurement of number of cells in cell cycle and some bulk RNA-seq analysis. Title of figure 4 is "TSCs with increased genomic instability exhibit abnormal phenotypes." Again there is no measure of GIN, but instead staining of derived TSCs for proliferation, cell death, and a TSC marker. Title of figure 5 is "DNA damage responses and G2/M checkpoint activation drive premature TSC differentiation." However, there does not appear to be a difference in gH2AX between the two mutant genotypes. Checkpoint proteins might be up, but need quantification and reproduction. > 4C is the only marker of differentiation. Importantly, all the analyses here are associations, not connections, so cannot use the word "drive". Similar issues can be raised with a number of the supplementary figures.

      Major Comments:

      1) A deeper analysis of the cell lines is likely to be the most fruitful path to reveal interesting mechanisms. It is very surprising that there is no phenotype in ESCs. Authors should check for increased apoptosis. Maybe the phenotypic cells are lost. Or do ESCs use different MCMs/mechanisms of DNA replication or are they better able to handle replication stress and GIN? How many passages were the TSCs and ESCs cultured for? Does GIN (i.e. aneuploidy, CNVs) develop in TSCs and ESCs with passaging? How do the MCM mutations impact the molecular identity of the ESC and TSC cells including their heterogeneity in the population.

      2) Along similar lines, most of the in vivo phenotypic analyses are performed at E13.5, long after defects are likely beginning to express themselves especially given that they see phenotypes in the TSCs, which represent the polar TE of a E4.5. To understand the primary defects of the in vivo phenotype, they should be looking much earlier. Supplemental figure 5 is a start but represents a rather superficial analysis.

      3) Fig. 6 would benefit from evidence that MCM3 mutant is rescuing MCM4 levels in the chromatin fraction of cells and the DNA damage phenotype.

      Minor Comments:

      1) There is a lack of quantification and repeats for all Westerns. At minimum there should be three repeats for each experiment, quantification including normalization to a reference protein, and stats confirming any proposed differences between conditions.

      2) I would recommend moving the results in supp table 1 to figure 1. While negative, they are the newer results. The results shown in current figure 1 are essentially a reproduction of their previous work.

      3) Need to review citations to figures. For example, no citations are made to figure 4a and 4c.

      Significance

      As is, the study does not provide much new insight or understanding of how the MCM mutants are driving the sex-skewed semi-lethal phenotype. It would likely take much effort (months) to reach such a goal. However, without such effort, it is unclear what the significance of the story is. It does make the observation that the placenta appears to be impacted more severely and earlier than then the embryo, and that within the placenta, certain zones and cell types are more vulnerable. The reasons for these differential impacts are unclear though.

      If the authors choose not to dig deeper as suggested in the major comments, then at a minimum it would be important to soften their conclusions as raised in the summary and at least perform experiments/edits proposed in minor comments.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In a previous paper (McNairn et al. 2019 "Female-biased embryonic death from inflammation induced by genomic instability" Science), the Schimenti lab demonstrated that mouse embryos with hypomorphic mutations of the heterohexameric minichromosome maintenance complex, mutations that cause increased genomic instability (GIN), show reduced embryonic viability, with greater loss of female embryos and some parent-of-origin effect. Treatment with immunosuppressants, including ibuprofen and testosterone, partially rescued the observed lethality.

      In this new manuscript, the Schimenti lab demonstrates that these GIN-prone mutants feature smaller placentas with fewer cells. Mutations that interfere with the ability of the innate immune system to respond to micronuclei (a consequence of GIN) have no protective effect. Munisha and colleagues then demonstrate that MCM-mutant TSCs are harder to derive and show elevated apoptosis and a greater propensity for differentiation. The mutant TSCs show CHK1 phosphorylation, P53 phosphorylation and higher P21 levels, all consistent with a response to DNA damage. Downstream of this, they also show loss and inhibition of CDK1, which is already established to cause G2/M arrest (generally) and endoreduplication (specifically in trophoblast). The authors advance a model in which GIN results in loss of the TSC pool by apoptosis, cell cycle arrest and premature differentiation, resulting in smaller placentas and particularly fewer junctional zone cells. How this causes inflammation is less clear, but inflammation appears to be a downstream effect rather than cause of poor placentation.

      Major comments:

      This is a strong manuscript with few problems and all important findings well justified, indeed this is a nicely polished manuscript for something just entering peer review. There are a few unclear points textually and a couple places in the supplementary figures where better data quality would help, but generally it is a high-quality manuscript.

      • I am confused as to the basis of the sex-skewing phenomenon? Is the problem that lack of maternally loaded WT Mcm4 worsens the phenotype, or is the issue that Mcm4C3/C3 dams are less able to retain pregnancies, perhaps being a more inflammatory environment? Also, while there quite consistent evidence for reduced viability of Mcm4C3/C3McmGt/+ progeny, especially for female progeny, how confident can we be that the genotype of the dam vs. sire is important? Notably on a Ddx58 background, the progeny of the Mcm4C3/C3 sire included seven live male Mcm4C3/C3McmGt/+ but no female.

      • I'm not sure what Supplementary Figure 6 is showing (faster differentiation of C3 but less TGC?). Regardless, it's hard to draw too much conclusion from one not-very-pretty Western blot. This figure requires both additional replicates and a better explanation of how it fits with the other conclusions of the paper..

      • Supplementary Figure 7F-G is puzzling. Half of the mESCs have gamma-H2AX at all times, including most in S or G2 phase? In Figure S7E, do the quadrants correspond to being negative or positive for gamma-H2AX? At very least, IF images showing clear gamma-H2AX foci would be much more convincing.

      • The methods section is well detailed, but it would be ideal to clarify how many replicates each Western Blot or flow cytometry experiment is representative of.

      The required additional experiments re: Supplementary Figure 6 and 7 could be conducted in a couple of months.

      Minor comments:

      • Supplementary Table 1. would be enhanced greatly showing comparable tables for Mcm4C3/C3 x Mcm4C3/+McmGt/+ in mice without the Tmem173 or Ddx58 mutations. It is fine to recycle data from McNairn 2019 here, as long as the source is indicated, but a comparison is needed.

      • Is it possible that cGAS-STING and RIG pathways act redundantly to cause inflammation and lethality, or that other innate immune components are involved? I don't expect the authors to make compound mutants to test this but at least this possibility should be discussed textually.

      • In Figure S3E-F, is the box above each graph supposed to show the genotype of the dam?

      • "Indeed, the placenta and embryo weights of E13.5 Mcm4C3/C3 Mcm2Gt/+ Mcm3Gt/+ animals were significantly improved vs. Mcm4C3/C3 Mcm2Gt/+ animals, rendering them similar to Mcm4C3/C3 littermates (Fig. 6A-C). The JZ (but not LZ) area in Mcm4C3/C3 Mcm2Gt/+ Mcm3Gt/+ placentae also increased to the level of Mcm4C3/C3 littermates (Fig. 6D-H)." There are two problems here. First, the figure calls are wrong. Second, the description of the data is not quite right, it looks like the C3/C3 and C3/C3 M2/+ M3/+ LZs are a similar size to each and are statistically indistinguishable.

      Significance

      I partially discussed the above in the summary, but this paper makes a major breakthrough, showing that cell autonomous defects in hTSCs are very likely at the heart of the pathology observed in GIN-prone murine mutants.

      Some questions go unsolved. Why are TSCs more prone to die in response to GIN than mESCs, particularly in light of the general observation that karyotypic abnormality is more common in placental lineage? How does the placental abnormality give rise to inflammation? No manuscript can answer every question, and I think this is a mature manuscript that can be published in a good journal with limited modifications.

      I am an expert on gene regulation in placental development, with somewhat less expertise in the DNA damage field. The placenta field will find this paper interesting, as will the DNA damage field. There are also ramifications for cancer research. The question of why some cells tolerate high levels of DNA damage and others die is very relevant to cancer.

  3. Sep 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      SUMMARY

      In this study, Fernandes and colleagues addressed the question of the role of micro-RNAs in regulating the coupling between organ growth and developmental timing. Using Drosophila, they identified the conserved micro-RNA miR-184 as a regulator of the developmental transition between juvenile larval stages and metamorphosis. This transition is under the control of the steroid hormone Ecdysone, and has been shown to be modulated in case of abnormal tissue growth to adjust the duration of larval growth in response to developmental perturbations. The relaxin-like hormone Dilp8 has been identified as a key secreted factor involved in this coupling. Here, the authors show that miR-184 is involved in the regulation of Dilp8 expression both in physiological conditions and upon growth perturbation. They propose that this function is carried out in imaginal tissues, where miR-184 levels are modulated by tissue stress. While several factors have already been involved in triggering sharp dilp8 induction at the transcriptional level, this study adds another level of complexity to the regulation of Dilp8 by proposing that its expression is fine-tunned post-transcriptionally through repression by miR-184.

      __MAJOR COMMENTS______

      Overall, the manuscript is well organized, and the logics of the experimental plan well presented. The results are clear, and I appreciate the quality of the pupariation curves. However, I believe that two main conclusions of the paper are not fully supported by the results presented in the figures: the direct regulation of dilp8 3'UTR by miR-184, and the specificity of this regulation in imaginal discs. Here I develop in more details these two aspects.

      Comment 1) The strategy of the 3'UTR sensor is not fully optimized. Indeed, in most experiments, qRT-PCR is used to assess dilp8 expression levels, although it reflects both transcriptional and post-transcriptional. Importantly, to show that post-transcriptional regulation is involved in the response to tissue damage, the levels of the 3'UTR sensor should be analyzed in discs expressing RAcs (showing at the same time that the response is cell-autonomous in the discs). The expected upregulation of the sensor should be prevented by simultaneous expression of miR-184. This approach would shed light on the relative contribution of transcriptional versus post-transcriptional regulation of dilp8 in response to growth perturbation.

      Response: We thank the reviewer for this comment. We agree that qRT-PCRs do not distinguish between transcriptional and post-transcriptional changes of dilp8 levels, in response to changes in miR-184 levels and tissue damage. In addition to the qRT-PCR data we have looked at dilp8-3’UTR-GFP reporter in response to overexpression of miR-184 in the wingdisc using patched-Gal4 driver, which show downregulation of the GFP reporter in the ptc domain (Fig 4C-D’). This suggests that dilp8 mRNA is a direct target of miR-184 by post-transcriptional regulation through its 3’UTR. Further, to confirm the specificity of the effect of miR-184 on dilp8-3’UTR, we generated a dilp8-3’UTR mutant in which the single target site for miR-184 was mutated. We show that the mutated dilp8-3’UTR reporter doesn’t show any regulation in response to miR-184 overexpression in the ptc domain of the wingdisc (Fig. 4E, E’, F, F’). This experiment confirms the specificity of the dilp8-3’UTR regulation by miR-184.

      As suggested by the reviewer we analysed dilp8-3’UTR-GFP reporter expression by overexpressing RicinA using ptcGAL4 driver in the wing imaginal disc (Fig. S6F-G’). We observed a slight but consistent increase in the dilp8-3’UTR-GFP reporter expression, indicating post-transcriptional regulation of dilp8 expression in response to tissue damage. However, the increase of reporter GFP levels observed in this experiment in response to tissue damage is mild (Fig. S6F-G’) than expected based on the qRT-PCR results (Fig S6A and B). We have added this new data to the manuscript (Fig. S6F-G’).

      We propose the following reasons to explain this result:

      a) both transcriptional and post-transcriptional regulation of dilp8 mRNA in response to developmental perturbations

      b) the data on 3’UTR reporter GFP is specifically from the ptc domain expression of RicinA, whereas for dilp8 transcript levels we have expressed RicinA in all larval imaginal tissues, or in the entire wing imaginal disc, which could be one of the reasons for the stronger effect seen on dilp8 mRNA levels

      c) we are not certain if the tubulin-promoter driven dilp8-3’UTR GFP reporter reflects post-transcriptional regulation of dilp8 by miR-184 efficiently in comparison to qRT-PCR. This is especially as the reporter-GFP-3’UTR will be expressed at very high levels due to the tubulin promoter, a majority of this reporter-GFP mRNA may not be relieved from degradation due to the moderate suppression of miR-184 in response to RicinA overexpression.

      Thus, our experiments suggest that dilp8 levels are regulated post-transcriptionally by miR-184 which contributes to pupariation delays in response to tissue damage. In support of this, we could rescue pupariation delays and dilp8 induction caused by RicinA expression using overexpression of miR-184 (Figs 5B, C). Thus, we confirm that the effect of post-transcriptional regulation by miR-184 during developmental perturbations also contributes to dilp8 induction and pupariation delays. Unfortunately, due to experimental limitations we could not perform simultaneous expression of RicinA and miR-184 to evaluate the rescue of dilp8-3’UTR-GFP sensor expression. The levels of dilp8-3’UTR sensor GFP is reduced efficiently by miR-184 overexpression (Fig 4D), which prevented us from attempting the rescue of the moderate increase of dilp8-3’UTR GFP levels in response to RicinA.

      Comment 2) In my opinion, the use of a 3'UTR sensor is not sufficient to conclude that the regulation by miR-184 is direct, as miR-184 could also regulate an intermediate factor that acts on dilp8 post-transcriptional regulation. To solve this issue, a common strategy is to generate a 3'UTR sensor with mutated binding sites that should abolish the regulation by miR-184. This mutated 3'UTR might also respond differently to tissue damage, which would strongly support the conclusions of the study.

      Response: We couldn’t agree more with the reviewer, this comment is addressed in the response to comment 1. We have confirmed the specificity of regulation of dilp8-3’UTR by miR-184 using target site mutated dilp8-3’UTR (new figures added to the manuscript Fig. 4E, E’, F, F’). We tested if the changes in dilp8 mRNA levels in response to tissue damage is post-transcriptional mediated by miR-184. We observe that there is a slight, but consistent increase of dilp8-3’UTR GFP reporter levels in the ptc domain of wingdisc in response to RicinA expression, suggesting a role for miR-184 mediated post-translational regulation of dilp8. However, we have not yet tested the mutated dilp8-3’UTR GFP reporter in response to tissue damage.

      Comment 3) Concerning the tissue-specific regulation of Dilp8 by miR-184, these results need to be strengthened. Indeed, this comes mostly from phenotypes observed with rn-GAL4. Although this is a classical tool for driving expression in imaginal discs, rn-GAL4 also drives strong expression in other tissues that could contribute to triggering a delay, such as the CNS and part of the gut (proventriculus). In our hands, some growth phenotypes in the wing obtained with rn-GAL4 could be fully reverted by blocking GAL4 in the CNS indicating that the phenotype was not wing-specific. Importantly, miR-184 seems to be highly expressed in the CNS according to FlyBase, reinforcing the possibility that it plays a role in this organ. Here I propose approaches to confirm that miR-184 mediated regulation of dilp8 and developmental timing indeed occur in the discs:

      - Another driver with less secondary expression sites could be used (pdmR11F02-GAL4), or rn-GAL4 could be combined with an elav-GAL80 to prevent expression in most neurons. - The authors could identify the source of Dilp8 upregulation in miR-184 mutants using tissue-specific qRT-PCR instead of whole larvae expression like in Fig 4A-B. - This tissue-specific upregulation could be functionally tested using a rescue experiment, in which the delay observed in miR-184 mutants could be rescued by disc-specific downregulation of Dilp8 (using pdm2-GAL4 for instance).

      Response: We are thankful to the reviewer, and agree that it is important to show that the effects that we see using rn-Gal4 are specific to imaginal discs, and not due to an effect in CNS. We tested this by expressing miR-184 sponge in the CNS. Though miR-184 is highly expressed in the larval CNS, downregulation of miR-184 specifically in the pan-neuronal background using elav-GAL4 led to no effects on pupariation timepoint. We have added this as supplementary data Figure S4. Therefore, we believe that the miR-184 downregulation phenotype in the rnGAL4 background can be mainly attributed to its role in the imaginal discs. In addition, as suggested by the reviewer we have also demonstrated that downregulation of miR-184 in the imaginal discs using rnGAL4 driver leads to an increase in dilp8 expression (Fig S5B). Thus confirming that dilp8 mRNA is enhanced in the imaginal discs by blocking miR-184.

      OPTIONAL: Because it is known that dilp8 is strongly regulated at the transcriptional level, the relative input from post-transcriptional upregulation is an important question arising from this study. Although it might be a more long-term approach, I believe that generating a Dilp8 mutant lacking its 3'UTR or, even better, with mutated miR-184 binding sites, would shed light on the role of this regulation for the response to growth perturbation and/or developmental stability (fluctuating asymmetry).

      Response: We thank the reviewer for the suggestion. This would have been an interesting experiment to carry out especially in the context of fluctuating asymmetry.

      MINOR COMMENTS

      1. __ I think that a number of results could be moved to SI as they are either controls, or reproduce published data without bringing novelty. For instance, results in Fig 5A-D are similar to data published by Sanchez et al, as stated in the text. Fig6A as well.__

      __Response: __We thank the reviewer for this suggestion, Fig. 5A-D, and F has been moved to Fig. S6A-E. We have also moved data from Fig. 6 to Fig. 5, as a result Fig 6 A-D has become Fig. 5 B-D.

      __ Fig 6D is quite mysterious, as it suggests that basal JNK activation regulates miR-184, which is different from a context of tissue damage. I think that this result could be removed. Alternatively, if the authors want to dig in that direction, more experiments should be provided, such as bskDN expression in an RAcs context and the effects on miR-184 levels and the 3'UTR sensor (since transcript levels are already published).__

      Response: We would like to clarify that our experiments suggest that endogenous JNK signalling negatively regulates miR-184, as blocking basal JNK signalling using bskDN increased the levels of miR-184 (changed to Fig 5D). Enhanced JNK signalling has been reported to be involved in tissue damage responses, and we propose that RicinA mediated increase in JNK signalling leads to the reduction of miR-184 (changed to Fig 5A, S6D-E). However, we are not strongly implying this as we did not co-express RicinA and bskDN to show that JNK signalling is responsible for the drop in miR-184 levels in response to tissue damage. We thank the reviewer for seeking this explanation, we have rewritten the results section to improve clarity.

      __ The references related to Dilp8 should be checked more in detail in the intro and discussion. About Dilp8 and developmental stability: remove the ref to Colombani et al 2012, instead put Boone et al 2016 and add Blanco-Obregon et al 2022 (in addition to Garelli et al 2012 who initially identified this phenotype. About Lgr3 as the receptor for Dilp8: add Colombani et al, Current Biology 2015, and cite here Vallejo et al 2015, Garelli et al 2015. Among the important transcriptional regulators of Dilp8, Xrp1 could be mentioned (Boulan et al 2019, Destefanis et al 2022) as it plays a complementary function to JNK depending on the type of tissue stress.__

      __Response: __We are really sorry for the glaring errors in citing appropriate references. We thank the reviewer for correcting this for us. We have made necessary changes to the text.

      Significance

      GENERAL ASSESSMENT This study provides convincing data showing that the conserved microRNA miR-184 plays a role in regulating developmental timing in Drosophila through modulating the levels of Dilp8, a key factor in the coupling between tissue growth and developmental transitions. The results are convincing, but the general conclusions of the paper need to be strengthened regarding the direct regulation of dilp8 by miR-184 and the tissue-specificity of this interaction.

      ADVANCE Dilp8 is a key factor that modulates growth and timing in response to developmental perturbations and contributes to developmental precision in physiological conditions. As such, its regulation has been studied by different groups in the last decade, leading to the identification of several inputs for its transcriptional regulation. Here, the authors uncover a post-transcriptional regulation by miR-184, adding another level of regulation of Dilp8 that contribute to ensuring proper regulation of developmental timing, and opening the possibility that miR-184 might play similar roles in other species.

      AUDIENCE This study is of interest for researchers in the field of basic science, with a focus on developmental timing, tissue damage and biological function of microRNAs.

      REVIEWER EXPERTISE Drosophila, growth control, developmental timing, Dilp8.

      Reviewer #2

      Evidence, reproducibility and clarity

      Drosophila has helped to characterize the mechanisms that coordinate tissue growth with developmental timing. The insulin/relaxin-like peptide Dilp8 has been identified as a key factor that communicates the abnormal growth status of larval imaginal discs to neuroendocrine neurons responsible for regulating the timing of metamorphosis. Dilp8, derived from imaginal discs, targets four Lgr3-positive neurons in the central nervous system, activating cyclic-AMP signaling in an Lgr3-dependent manner. This signaling pathway reduces the production of the molting hormone, ecdysone, delaying the onset of metamorphosis. Simultaneously, the growth rates of healthy imaginal tissues slow down, enabling the development of proportionate individuals.

      In this manuscript "miR-184 modulates dilp8 to control developmental timing during normal growth conditions and in response to developmental perturbations" by Dr. Varghese and colleagues, the authors identify a new post transcriptional regulator of Dilp8. The authors show that miR-184 plays a pivotal role in tissue damage responses by inducing dilp8 expression, which in turn delays pupariation to allow sufficient time for damage repair mechanisms to take effect.

      Major points:

      Comment 1) In most of the experiments for percentage of pupariation, the 50% pupariation in control is around 110 hours AED in figures 1, 2 and 3. In figures 5 and 6 using the UAS Ricin, the controls are more around 90 hours AED. Why this discrepancy?

      Response: We thank the reviewer for asking for this clarification. The former experiments for Figs 1-3 were carried out at 25oC while the latter experiments with a cold sensitive version of RicinA (UAS-RAcs), Figs 5 and 6 (now changed to Figs. 5 and S6 as suggested by reviewer #1) were carried out at 29oC (permissive temperature). This difference in temperature has led to alterations in pupariation timing. We apologise for not having mentioned this in the text, now we have made necessary corrections to the methods section clearly indicating this.

      Comment 2) What is the mechanism behind the expression of miR-184 in stress conditions? Is miR-184 also implicated in other conditions giving rise to a developmental delay (X-rays irradiation or animal bearing rasV12, scrib-/- tumors)?

      Response: We thank the reviewer for these questions.

      a) In response to developmental perturbations by RicinA, we believe that activation of JNK signalling controls miR-184 expression. We propose this as our experiments show that imaginal disc damage leads to enhancement of JNK signalling and increase in dilp8 mRNA levels (as reported earlier by Colombani et al 2012; Sánchez et al 2019), and a simultaneous reduction of miR-184 (Figs. S6A, D, E). We also have performed new experiments to show that in response to RicinA expression in the wingdisc there is moderate increase in the dilp8-3’UTR-GFP sensor expression (Figs. S6F-G’), indicating a post-transcriptional regulation of dilp8 expression in response to tissue stress. We also show that RicinA induced dilp8 expression and pupariation delay can be rescued by increasing miR-184 levels (Fig 5B and C), suggesting that the reduction of miR-184 in response to tissue damage contributes to the damage responses. In a separate experiment we show that blocking the endogenous JNK pathway by the expression of bskDN enhances miR-184 levels, suggesting that miR-184 is under the regulation of JNK signalling (Fig 5D). Hence, we speculate that during tissue stress, activation of JNK signalling leads to a reduction of miR-184 levels which contributes to regulating the levels of dilp8 post-transcriptionally and resulting in pupariation delays. The text has been modified to explain this better.

      b) In a previous paper by Shu et al., 2017 (https://doi.org/10.18632/oncotarget.22226) decreased expression of miR-184 was observed in a lglRNAi; RasV12 tumor background. Apart from this various studies have shown that dilp8 levels increase in response to tumour, radiation stress, apoptosis, and tissue damage (Yeom et al 2021, Ray et al 2019, Demay et al 2014, Katsuyama et al 2015, Colombani et al 2012, Garelli et al 2012). Whether the regulation of dilp8 by miR-184, occurs in these backgrounds is yet to be tested. We have now discussed this possibility in the manuscript.

      Comment 3) dilp8 mutant animals have also been shown to be more resistant to starvation or desiccation (https://doi.org/10.3389/fendo.2020.00461). Is miR-184 implicated in this answer?

      Response: We thank the reviewer for this question. In our earlier experiments miR-184 has been demonstrated to be regulated by nutrition in the larval stages and lack of miR-184 led to enhanced larval death in response to diet restriction (Fernandes et al., 2022). miR-184 was also demonstrated to play a role in the insulin producing cells (IPCs) in regulating lifespan (Fernandes & Varghese., 2022). In the current work, we propose miR-184 to act upstream of dilp8 in response to stress stimuli. Hence, it is possible that miR-184 might be involved in responses to starvation and desiccation stress in the adult female flies, by regulating dilp8 levels post-transcriptionally. However, it has not been tested yet if the miR-184 regulation of dilp8 plays a role in resistance to starvation or desiccation in adult females, as this was not within the scope of the current study. We have now added this reference in the discussion section.

      Comment 4) dilp8 expression has been also shown to be regulated by Xrp1 in response to ribosome stress (https://doi.org/10.1016/j.devcel.2019.03.016). This paper should be included in the manuscript. Is it possible that the expression levels of miR184 are regulated by Xrp1?

      Response: We thank the reviewer for the suggestion and have incorporated the reference into the paper. During ribosome stress in the larval imaginal discs the stress-response transcription factor Xrp1 acts through dilp8 in regulating systemic growth. We agree with the reviewer, it is possible that expression of miR-184 is regulated by Xrp1. Currently we have not explored this possibility. We have now added this to the discussion section.

      Minor points:

      1. __ Does the overexpression of miR184 induce an increased fluctuating asymmetry?__

      Response: We thank the reviewer for asking this question. The role of dilp8 in the fluctuation asymmetry is only observed in the dilp8 hypomorphic mutant background. To replicate this we would have to overexpress miR-184 in either the whole larvae or in the wing discs. Unfortunately overexpression of miR-184 in the wing discs (using rnGAL4) leads to pupal lethality while as overexpression of miR-184 in the whole larvae leads to embryonic lethality and therefore we were not be able to conclude from our experiments if miR-184 overexpression induces increased fluctuating asymmetry.

      2. There are 2 references Colombani et al. (2012 for Dilp8 and 2015 for Lgr3). Can you double check that they are used accordingly

      Response: We thank the reviewer for pointing these errors out and we have incorporated these changes into the paper.

      Significance

      Altogether, the paper present compiling lines of evidence supporting the proposed model. The experiments are well designed and are convincing. The papers is interesting and relevant for a broad audience.

      __Reviewer #3 __

      Evidence, reproducibility and clarity (Required):

      This is an interesting study demonstrating an interaction between miR-184 and the Drosophila insulin-like peptide 8 (dilp8) in the tissue damage response. The authors show that Dilp8 activity is negatively regulated by miR-184, apparently through direct interaction between miR-184 and the dilp8-3'UTR, which leads to lower dilp8 mRNA transcript levels, via an undetermined mechanism, supposedly its degradation? Furthermore, the authors show that during aberrant tissue growth, miR-184 levels are very slightly downregulated (see comment below), and based on other experiments, imply causation of this with the increased dilp8 mRNA levels that occur in these tissues, again via an unclear mechanism: upregulation or stabilization of dilp8 mRNA. The authors present evidence that the JNK pathway, which had been known to be critical for dilp8 mRNA upregulation upon tissue damage, does so via miR-184.

      Major Comments:

      __Comment 1: The data showing the direct regulation of dilp8-3'UTR by miR-184 are not very strong and would require more controls to strengthen the claim, as described below. __

      Response: We have performed new experiments to validate that dilp8-3’UTR is regulated by miR-184. Please see the detailed responses to comments 10-12 below.

      __Comment 2: The miR-184 effects are also very small (less than 2-fold reduction with tissue damage; or less than 2-fold induction with JNK-pathway inhibition via bskDN). These two points are the weakest part of the manuscript and model. __

      Response: We agree with the reviewers on this point. The reduction in miR-184 levels in response to RicinA expression is modest (25–30%), and the induction of miR-184 in response to bskDN expression is less than two-fold (Figs. 5A and D). In contrast, dilp8 transcript levels increase several-fold in response to RicinA expression (Fig. 5C, S6A and B). Since we measure dilp8 transcript levels by qPCR, we detect both transcriptional and post-transcriptional contributions to dilp8 regulation. In addition, we have performed a new experiment to check the post-transcriptional regulation of dilp8, in response to tissue damage. Though the change in the dilp8-3′UTR GFP reporter upon RicinA expression in the ptc domain of the wingdisc is mild (Figs. S6F-G’), this strongly suggests a post-transcriptional outcome of the reduction of miR-184 levels on dilp8. Hence, we propose that tissue damage induces strong transcriptional activation of dilp8, while the reduction of miR-184, despite its smaller magnitude, contributes to dilp8 upregulation via post-transcriptional regulation. In support of this, our experiments demonstrate direct regulation of the dilp8-3′UTR by miR-184 (Figs. 4C-F’), and show strong dilp8 mRNA upregulation in miR-184 deficient conditions (Fig. 4A and B), suggesting the role of miR-184 in maintaining dilp8 levels. We also show that RicinA induced effects on dilp8 and pupariation delay are reversed by co-expression of miR-184 (Fig. 5C). We do not claim that regulation by miR-184 is the sole mechanism for driving dilp8 induction during tissue damage, but suggest that miR-184-mediated post-transcriptional regulation acts in a complementary manner to transcriptional responses. Furthermore, we believe that the mild effect of JNK signaling on miR-184 (as shown by the bskDN experiment) is sufficient for the moderate reduction of miR-184 in response to tissue damage.

      Comment 3: ____Regarding the expression levels, it does not help that the authors show bar graphs with standard errors of the mean instead of the actual data points to allow reliable appreciation of the data dispersion.

      Response: We have modified our figures and have performed statistical analysis according to the suggestions of the reviewers, please see responses to comments 1-9, and 13-19.

      Comment 4: It is difficult to understand how minute changes in miR-184 levels can lead to over an order of magnitude differences (in some cases) in dilp8 mRNA levels considering that it is a stoichiometric relationship. Maybe ?miR-184-Dicer1? complexes are highly stable and re-used for multiple dilp8 transcripts - the authors could discuss how they understand this occurring in their manuscript.

      On the same line, discussion is also rather weak on what regards the mechanism of control of dilp8 mRNA levels by miR-184. Please discuss eg, the evidence for mRNA degradation induction by microRNAs with this UTR binding profile (imperfect UTR binding Fig S4) and-if appropriate-how other possible regulatory models (direct and indirect) could explain the findings.

      Response: We accept the reviewers comment that 25-30% reduction of miR-184 is low in comparison to the many fold increase in dilp8 levels. We believe that both post-transcriptional and transcriptional changes are responsible for the induction of dilp8 in response to tissue damage. However, our experiments suggest the role of post-transcriptional regulation by miR-184, as pupariation delay is rescued by miR-184 overexpression (also please see the response to the previous comment). We are not ruling out the possibility of transcriptional regulation of dilp8 mRNA, rather we are suggesting the possibility that both transcriptional and post-transcriptional means are responsible for changes in dilp8. Moreover, we have not performed absolute measurement of miR-184 in the imaginal discs (what we show is a comparison between control and RicinA expression), hence we do not have an exact estimate of how many miR-184 molecules are reduced and if they would be greatly equal or more in comparison to the dilp8 mRNA molecules that are upregulated, as again while measuring dilp8 mRNA we are not checking how many molecules of dilp8 exactly are increased. As the reviewer suggests, it is possible that miR-184-RISC could be stable to handle multiple dilp8 molecules one after the other, hence it is not a 1:1 relationship between miR-184:dilp8. We have included this in the manuscript. It is also known that imperfect 3’UTR binding as seen in most animal microRNAs leads to translational repression and mRNA deadenylation, which eventually results in mRNA degradation.

      Comment 5: ____We suggest the authors carefully revise their citations to cite appropriate work that supports the claims, and also to avoid missing the seminal studies that report the claims they cite.

      Response: We are really apologetic for the errors citing the key references. We are grateful to the reviewers for correcting this for us. We have made changes to the text to include and correct the references.

      We have the suggestions below which we hope will help the authors improve their manuscript. If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

      Detailed Comments:

      Comment 1. Results 1st paragraph: please describe the screen in more detail. As written, one only discovers it was a miRNA loss-of-function screen when reading the legend of Table S1. Please show the original data of the screen - with dispersion if possible.

      Response: We thank the reviewers for these suggestions, we have now included the data from the screen with SEM, and p-values.

      Comment 2. Results 1st paragraph, Fourth line, "While several miRNAs caused delays in pupariation by 12 hours or more..". Please correct, as actually loss of miRNAs caused delays.

      Response: We thank the reviewer for pointing out this error, we have corrected the text accordingly.

      Comment 3. ____Results (Figure 1) - It says that data from three independent experiments are shown. However there is no dispersion in the data. Could the authors please explain this? Are the results of the three experiments summed and presented as one? or is this one of the three?

      Response: We thank the reviewers for these suggestions and have plotted data with the SEM values.

      Comment 4. It is reported in the legend of Figure S2 that LogRank test was performed to determine statistical significance. However, no statistical data is presented. Please show the results.

      __Response: __We thank the reviewers for these suggestions to improve the data presentation, we have incorporated the p-value as suggested.

      Comment 5. Fig2A and B. Please show the data points in the bar graphs (as in Figure. 2C), or choose another data representation. ____Please consider redoing statistical analysis with a simple t-test. ____It is not clear to me why ANOVA was used to compare two samples. Please state that data are normalized also to control (tub-GAL4>UAS-scramble). Please ____state____ the h post-hatching from which the RNA samples were collected (as in Fig 2C for 20HE quantification).

      __Response: __We thank the reviewers for these suggestions to improve the data presentation, we have incorporated all changes as suggested. Similar changes have been incorporated to the rest of the figures of the manuscript as well. Hours post-hatching information for each figure is now added to the figure legends. __ __

      Comment 6. Fig2C. Fig legend states the bar graphs are "absolute values". Please specify if the bar represents the average, median or something else.

      Response: We thank the reviewer for pointing this out, we have made the suggested changes.

      Comment 7. Throughout the manuscript: please use GAL4 in capital letters or at least standardize it throughout the ms. Currently there are GAL4s and Gal4s.. eg compare Fig 2 and 3 legends.

      Response: We thank the reviewer for pointing this out, we have incorporated all changes as recommended.

      Comment 8. FigS3A and B. Please revise as Fig2A and B above. and apply the same criteria in the respective figure legend.

      __Response: __We thank the reviewer for pointing this out, we have made the changes as recommended.

      Comment 9. Fig. 4 - please indicate on the figures what is whole larvae and what is wing imaginal discs. This will facilitate understanding of the figure.

      __Response: __We thank the reviewers for these suggestions and have included this information in all the figures.

      Comment 10. Fig 4 - Data - Authors do not show that rn-GAL4>miR-184-sponge causes up regulation of dilp8 mRNA levels, hence the model is weakened. Doing this experiment would significantly strengthen the study whatever the result is.

      Response: We thank the reviewer for pointing this out and we have included this in the manuscript (Fig S5B).

      Comment 11. The dilp8-3'UTR experiment is weak especially because its generation is not sufficiently well described in the manuscript. "The dilp8 3'UTR-GFP reporter line was created as described in (Vargheese & Cohen, 2007)" is not sufficient. Please describe the construct generation in sufficient detail so that the experiments can be reproduced by others.

      Response: We thank the reviewer for pointing this out and we have elaborated in the methods section on how we generated the dilp8 3'UTR-GFP reporter and dilp8 3'UTR mutant GFP reporter lines. The plasmid was originally created in Steve Cohen’s lab at EMBL, by modifying pCasper4 plasmid, by introducing a tubulin promoter, EGFP and a multiple cloning site, which allows one to clone 3’UTRs of target genes into this plasmid. Not1 and Xho1 sites were used to clone the dilp8-3’UTR and mut-3’UTR. We hope this explains our strategy sufficiently.

      Comment 12. Making assumptions, if the construct is as described in Vargheese & Cohen, 2007 and contains all of the dilp8 3'UTR - it should be a Tubulin-driven GFP gene with a dilp8-3'UTR "Tub-GFP-(dilp8 3'UTR)". In this case the authors need to rule out the alternative interpretation of the result in Fig. 4D by showing that the expression of miR-184 does not down regulate Tub-GFP expression itself. The best scenario would be to have a mutated dilp8 3'UTR for the miR-184 recognition site. This experiment would significantly strengthen the study and model.

      Response: We thank the reviewer for pointing this out. We agree with the reviewers that this experiment is needed to prove direct regulation of the dilp8-3’UTR by miR-184. We have mutated the sequences complementary to the seed region of miR-184 in the dilp8-3’UTR, and demonstrated that overexpression of miR-184 does not regulate the mutated tub-GFP-(dilp8 3'UTR) expression. This confirms that the dilp8 gene is a direct target of miR-184. This data is added to the manuscript as Figs 4E-F’.

      Comment 13. Figure 4C-D please separate dilp8 from 3'UTR with a space or hyphen.

      Response: We thank the reviewer for pointing this out and have separated dilp8 from 3’UTR with a hyphen.

      Comment 14. Figure 4E. Please name the dilp8 allele as MI00727 as it is not a KO, but rather a hypomorphic mutation (fully WT dilp8 transcripts are still generated, albeit at a much lower level).

      Response: We thank the reviewer for pointing this out and we have made the necessary changes.

      Comment ____15. Figure 6D: please add UAS to bskDN/+. All figures have rn-GAL4 alone or with UAS-GFP as control. This finding would be strengthened with this other control, especially because the size effect is small.____ This being said a general comment for all experiments is that hemi-controls are generally missing for all figures. eg, in Fig 3. One would typically include controls such as A. Phm>+ and +>miR.184; B. aug21>+ and +>miR.184; C. ptth>+ and +>miR.184; D. rn>+ and +>miR.184

      Response: We thank the reviewer for pointing this out. We have added UAS to bskDN, now Fig 5D and have also added the rnGAL4/+ control. We have also performed various hemi-control experiments as suggested by the reviewer to our best capabilities. We have added a separate graph with the hemicontrols in the as a Reviewer Response Figure 1.

      Comment 16. Figure 7: Are IPCs necessary for the model? If not, I suggest removing them and placing the Lgr3 neuron cell bodies much more anterior in this scheme. Their cell bodies are as anterior and rostral as it gets, approximately where the IPCs are depicted in this type of view of the CNS.

      Response: We thank the reviewer for pointing this out and have removed IPCs from the figure, this figure is now labelled as Fig. 6.

      Comment ____17. Table S1- It would be preferable to see the data of these experiments, but if the authors prefer to show this data in a table, please at least add the dispersion analyses (eg standard deviation.. OR median+-quartiles OR Confidence intervals..), N of animals analysed, and statistics against controls.

      Response: We thank the reviewer for pointing this out, we have added the number of larvae analysed, SEM values and statistics against the control condition.

      Comment ____18. In all figures with pupariation time: please also indicate significant findings in the graphs (with an asterisk, for instance) and adjust figure legends accordingly. This could facilitate understanding the data.

      __Response: __Thanks for the suggestion. We have incorporated this information into figure legends.

      Comment ____19. Please revise Figure legends for punctuation.

      __Response: __We have rectified all the errors in punctuation. We thank the reviewers for suggesting this.

      __Comment ____20. __

      a) Abstract:

      Line 10: What is the evidence to call Dilp8 a "paracrine" factor?

      Response: We thank the reviewer for pointing this out, we have changed the text to ‘secreted factor’.

      b) Introduction:

      4th paragraph, 3rd sentence " Dilp8... buffers developmental noise and delays pupariation..." Buffering of developmental noise was first shown in Garelli et al., Science 2012, so this publication should be cited. ____4th paragraph, 5th sentence: please include Jaszczak et al., Genetics 2016. This paper was published together with the 2015 papers, just a matter of timing that it got a 2016 date. Moreover, I do not think Katsuyama et al., 2015 is well cited to back up the statement in this sentence, hence I recommend removing that citation in this sentence.

      Response: We thank the reviewer for pointing this out and have made necessary changes.

      c) 6th paragraph: 5th line "targeting dilp8" : please specify if you mean the gene or the mRNA, or both. Same for line 7.

      Response: We thank the reviewer for pointing this out and have made necessary changes.

      d) Results Page 10, 1st paragraph, 1st sentence: the works cited are not the appropriate studies that demonstrated what is being stated. This was shown in Garelli et al., Science 2012 and Colombani et al., Science 2012. Results Page 10, 1st paragraph, line 11: Please also cite Colombani et al., Science 2012, who first showed that JNK is required for dilp8 regulation.

      Response: We thank the reviewer for pointing this out and are extremely apologetic for this oversight. We have made necessary changes to the manuscript.

      e) Discussion, 2nd paragraph, line 4: again, please indicate the rationale for using "paracrine" to describe Dilp8's activities. The current widely accepted model is that Dilp8 acts on interneurons in the brain ____(eg, reviewed in Juarez-Carreno et al., Cell Stress, 2018; Gontijo and Garelli, Mech Dev, 2018; Mirth and Shingleton, Front Cell Dev Biol, 2019; Texada et al., Genetics 2020; Boulan and Leopold, 2021).____ In order to reach the brain, Dilp8 has to be secreted from the discs and travel to the brain. This is as an endocrine mechanism as it gets for a small larva, considering that some discs can be on the opposite side of the larva (eg, genital discs). While this does not exclude that Dilp8 could also act paracrinally, the only evidence that I am aware of comes from other contexts such as during transdetermination (where Dilp8 has been proposed to work in an autocrine or paracrine fashion, via Drl in imaginal discs (Nemoto et al., Genes to Cells, 2023), however, this is not cited appropriately in this manuscript and is less related to the Lgr3-dependent pathway being studied here.

      Response: We totally agree with the reviewer and appreciate clarifying this for us. We have made necessary changes to the text.

      f) Discussion Page 13, 1st paragraph, This claim is supported by data presented in Garelli et al., Science 2012, not the other two papers. Garelli et al., 2015 shows that the Lgr3 receptor also participates in buffering developmental noise. Other studies have corroborated the Garelli et al., 2012 finding: eg, Colombani et al., Curr Biol 2015; Boone et al., Nat Commun 2016; Blanco-Obregon et al., Nat Commun 2022). Many other studies have shown that Dilp8 promotes developmental stability under tissue stress and challenges.

      Discussion Page 12, 3rd paragraph, 2nd sentence: "The Lgr3 neurons directly interact with ... PTTH ...and insulin-producing neurons" Please cite Colombani et al., 2015 and Vallejo et al., Science 2015. Vallejo et al., propose that circuit with insulin-producing neurons. In the 3rd sentence, only Jaszczak et al., 2016 is cited, whereas this claim/model comes from many studies, such as Halme et al., Curr Biol, 2010; Hackney et al., PLoS One 2012; Garelli et al. Science 2012; Colombani et al., Science, 2012; and the Lgr3 papers from 2015). Jaszczak et al., actually propose that Lgr3 is also required in the ring gland in addition to neurons.

      Discussion page 14 last paragraph,10 line, "In Aedes aegypti ....regulates ilp8 (Ling et al., 2017)". As far as I understand mosquitoes do not have a dilp8 orthologue (see for instance Gontijo and Gontijo, Mech Dev 2018; and Jan Veenstra's work). ilp nomenclature (numbering) does not follow that of Drosophila, so ilp8 is probably a typical Insulin/IGF-like peptide and is NOT an orthologue of Dilp8, a relaxin, so this citation needs to be removed or placed into the broader context of microRNA regulation of ilps.

      Response: We are really sorry for the numerous glaring errors in the references. We thank the reviewers for correcting this for us. We have made necessary changes to the text.

      Thank you for the opportunity to review your interesting work,

      Alisson Gontijo and Rebeca Zanini

      Reviewer #3 (Significance (Required)):

      If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

      __Author’s concluding response: __

      We thank all the reviewers for the overall positive comments and suggestions that we believe have helped us to improve our manuscript. We have incorporated all the changes suggested, especially regarding errors in citing key references. We have performed most of the experimental suggestions. Also, we have modified the way in which graphs are presented, including statistical tests as suggested by the reviewers. Several controls have been performed to strengthen the manuscript further. We believe that this review process aided in significantly improving this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is an interesting study demonstrating an interaction between miR-184 and the Drosophila insulin-like peptide 8 (dilp8) in the tissue damage response. The authors show that Dilp8 activity is negatively regulated by miR-184, apparently through direct interaction between miR-184 and the dilp8-3'UTR, which leads to lower dilp8 mRNA transcript levels, via an undetermined mechanism, supposedly its degradation? Furthermore, the authors show that during aberrant tissue growth, miR-184 levels are very slightly downregulated (see comment below), and based on other experiments, imply causation of this with the increased dilp8 mRNA levels that occur in these tissues, again via an unclear mechanism: upregulation or stabilization of dilp8 mRNA. The authors present evidence that the JNK pathway, which had been known to be critical for dilp8 mRNA upregulation upon tissue damage, does so via miR-184. The data showing the direct regulation of dilp8-3'UTR by miR-184 are not very strong and would require more controls to strengthen the claim, as described below. The miR-184 effects are also very small (less than 2-fold reduction with tissue damage; or less than 2-fold induction with JNK-pathway inhibition via bsk-DN). These two points are the weakest part of the manuscript and model. Regarding the expression levels, it does not help that the authors show bar graphs with standard errors of the mean instead of the actual datapoints to allow reliable appreciation of the data dispersion. It is difficult to understand how minute changes in miR-184 levels can lead to over an order of magnitude differences (in some cases) in dilp8 mRNA levels considering that it is a stoichiometric relationship. Maybe ?miR-184-Dicer1? complexes are highly stable and re-used for multiple dilp8 transcripts - the authors could discuss how they understand this occurring in their manuscript. On the same line, discussion is also rather weak on what regards the mechanism of control of dilp8 mRNA levels by miR-184. Please discuss eg, the evidence for mRNA degradation induction by microRNAs with this UTR binding profile (imperfect UTR binding Fig S4) and-if appropriate-how other possible regulatory models (direct and indirect) could explain the findings. We suggest the authors carefully revise their citations to cite appropriate work that supports the claims, and also to avoid missing the seminal studies that report the claims they cite. We have the suggestions below which we hope will help the authors improve their manuscript. If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

      Comments:

      Results 1st paragraph: please describe the screen in more detail. As written, one only discovers it was a miRNA loss-of-function screen when reading the legend of Table S1. Please show the original data of the screen - with dispersion if possible.

      Results 1st paragraph, Fourth line, "While several miRNAs caused delays in pupariation by 12 hours or more..". Please correct, as actually loss of miRNAs caused delays.

      Results (Figure 1) - It says that data from three independent experiments are shown. However there is no dispersion in the data. Could the authors please explain this? Are the results of the three experiments summed and presented as one? or is this one of the three?

      It is reported in the legend of Figure S2 that LogRank test was performed to determine statistical significance. However, no statistical data is presented. Please show the results.

      Fig2A and B. Please show the data points in the bar graphs (as in Figure. 2C), or choose another data representation. Please consider redoing statistical analysis with a simple t-test. It is not clear to me why ANOVA was used to compare two samples. Please state that data are normalized also to control (tub-GAL4>UAS-scramble). Please state the h post-hatching from which the RNA samples were collected (as in Fig 2C for 20HE quantification).

      Fig2C. Fig legend states the bar graphs are "absolute values". Please specify if the bar represents the average, median or something else.

      Throughout the manuscript: please use GAL4 in capital letters or at least standardize it throughout the ms. Currently there are GAL4s and Gal4s.. eg compare Fig 2 and 3 legends.

      FigS3A and B. Please revise as Fig2A and B above. and apply the same criteria in the respective figure legend.

      Fig. 4 - please indicate on the figures what is whole larvae and what is wing imaginal discs. This will facilitate understanding of the figure.

      Fig 4 - Data - Authors do not show that rn-GAL4>miR-184-sponge causes up regulation of dilp8 mRNA levels, hence the model is weakened. Doing this experiment would significantly strengthen the study whatever the result is.

      The dilp8-3'UTR experiment is weak especially because its generation is not sufficiently well described in the manuscript. "The dilp8 3'UTR-GFP reporter line was created as described in (Vargheese & Cohen, 2007)" is not sufficient. Please describe the construct generation in sufficient detail so that the experiments can be reproduced by others.

      Making assumptions, if the construct is as described in Vargheese & Cohen, 2007 and contains all of the dilp8 3'UTR - it should be a Tubulin-driven GFP gene with a dilp8-3'UTR "Tub-GFP-(dilp8 3'UTR)". In this case the authors need to rule out the alternative interpretation of the result in Fig. 4D by showing that the expression of miR-184 does not down regulate Tub-GFP expression itself. The best scenario would be to have a mutated dilp8 3'UTR for the miR-184 recognition site. This experiment would significantly strengthen the study and model.

      Figure 4C-D please separate dilp8 from 3'UTR with a space or hyphen.

      Figure 4E. Please name the dilp8 allele as MI00727 as it is not a KO, but rather a hypomorphic mutation (fully WT dilp8 transcripts are still generated, albeit at a much lower level).

      Figure 6D: please add UAS to bskDN/+. All figures have rn-GAL4 alone or with UAS-GFP as control. This finding would be strengthened with this other control, especially because the size effect is small. This being said a general comment for all experiments is that hemi-controls are generally missing for all figures. eg, in Fig 3. One would typically include controls such as A. Phm>+ and +>miR.184; B. aug21>+ and +>miR.184; C. ptth>+ and +>miR.184; D. rn>+ and +>miR.184

      Figure 7: Are IPCs necessary for the model? If not, I suggest removing them and placing the Lgr3 neuron cell bodies much more anterior in this scheme. Their cell bodies are as anterior and rostral as it gets, approximately where the IPCs are depicted in this type of view of the CNS.

      Table S1- It would be preferable to see the data of these experiments, but if the authors prefer to show this data in a table, please at least add the dispersion analyses (eg standard deviation.. OR median+-quartiles OR Confidence intervals..), N of animals analysed, and statistics against controls.

      In all figures with pupariation time: please also indicate significant findings in the graphs (with an asterisk, for instance) and adjust figure legends accordingly. This could facilitate understanding the data.

      Please revise Figure legends for punctuation.

      Abstract: Line 10: What is the evidence to call Dilp8 a "paracrine" factor?

      Introduction:

      4th paragraph, 3rd sentence " Dilp8... buffers developmental noise and delays pupariation..." Buffering of developmental noise was first shown in Garelli et al., Science 2012, so this publication should be cited.

      4th paragraph, 5th sentence: please include Jaszczak et al., Genetics 2016. This paper was published together with the 2015 papers, just a mater of timing that it got a 2016 date. Moreover, I do not think Katsuyama et al., 2015 is well cited to back up the statement in this sentence, hence I recommend removing that citation in this sentence.

      6th paragraph: 5th line "targeting dilp8" : please specify if you mean the gene or the mRNA, or both. Same for line 7.

      Results Page 10, 1st paragraph, 1st sentence: the works cited are not the appropriate studies that demonstrated what is being stated. This was shown in Garelli et al., Science 2012 and Colombani et al., Science 2012.

      Results Page 10, 1st pagragraph, line 11: Please also cite Colombani et al., Science 2012, who first showed that JNK is required for dilp8 regulation.

      Discussion, 2nd paragraph, line 4: again, please indicate the rationale for using "paracrine" to describe Dilp8's activities. The current widely accepted model is that Dilp8 acts on interneurons in the brain (eg, reviewed in Juarez-Carreno et al., Cell Stress, 2018; Gontijo and Garelli, Mech Dev, 2018; Mirth and Shingleton, Front Cell Dev Biol, 2019; Texada et al., Genetics 2020; Boulan and Leopold, 2021). In order to reach the brain, Dilp8 has to be secreted from the discs and travel to the brain. This is as an endocrine mechanism as it gets for a small larva, considering that some discs can be in the opposite side of the larva (eg, genital discs). While this does not exclude that Dilp8 could also act paracrinally, the only evidence that I am aware of comes from other contexts such as during transdetermination (where Dilp8 has been proposed to work in an autocrine or paracrine fashion, via Drl in imaginal discs (Nemoto et al., Genes to Cells, 2023), however, this is not cited appropriately in this manuscript and is less related to the Lgr3-dependent pathway being studied here.

      Discussion Page 13, 1st paragraph, This claim is supported by data presented in Garelli et al., Science 2012, not the other two papers. Garelli et al., 2015 shows that the Lgr3 receptor also participates in buffering developmental noise. Other studies have corroborated the Garelli et al., 2012 finding: eg, Colombani et al., Curr Biol 2015; Boone et al., Nat Commun 2016; Blanco-Obregon et al., Nat Commun 2022). Many other studies have shown that Dilp8 promotes developmental stability under tissue stress and challenges.

      Discussion Page 12, 3rd paragraph, 2nd sentence: "The Lgr3 neurons directly interact with ... PTTH ...and insulin-producing neurons" Please cite Colombani et al., 2015 and Vallejo et al., Science 2015. Vallejo et al., propose that circuit with insulin-producing neurons. In the 3rd sentence, only Jaszczak et al., 2016 is cited, whereas this claim/model comes from many studies, such as Halme et al., Curr Biol, 2010; Hackney et al., PLoS One 2012; Garelli et al. Science 2012; Colombani et al., Science, 2012; and the Lgr3 papers from 2015). Jaszczak et al., actually propose that Lgr3 is also required in the ring gland in addition to neurons.

      Discussion page 14 last paragraph,10 line, "In Aedes aegypti ....regulates ilp8 (Ling et al., 2017)". As far as I understand mosquitoes do not have a dilp8 orthologue (see for instance Gontijo and Gontijo, Mech Dev 2018; and Jan Veenstra's work). ilp nomenclature (numbering) does not follow that of Drosophila, so ilp8 is probably a typical Insulin/IGF-like peptide and is NOT an orthologue of Dilp8, a relaxin, so this citation needs to be removed or placed into the broader context of microRNA regulation of ilps.

      Thank you for the opportunity to review your interesting work, Alisson Gontijo and Rebeca Zanini

      Significance

      If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Drosophila has helped to characterize the mechanisms that coordinate tissue growth with developmental timing. The insulin/relaxin-like peptide Dilp8 has been identified as a key factor that communicates the abnormal growth status of larval imaginal discs to neuroendocrine neurons responsible for regulating the timing of metamorphosis. Dilp8, derived from imaginal discs, targets four Lgr3-positive neurons in the central nervous system, activating cyclic-AMP signaling in an Lgr3-dependent manner. This signaling pathway reduces the production of the molting hormone, ecdysone, delaying the onset of metamorphosis. Simultaneously, the growth rates of healthy imaginal tissues slow down, enabling the development of proportionate individuals. In this manuscript "miR-184 modulates dilp8 to control developmental timing during normal growth conditions and in response to developmental perturbations" by Dr. Varghese and colleagues, the authors identify a new post transcriptional regulator of Dilp8. The authors show that miR-184 plays a pivotal role in tissue damage responses by inducing dilp8 expression, which in turn delays pupariation to allow sufficient time for damage repair mechanisms to take effect.

      Major points:

      • In most of the experiments for percentage of pupariation, the 50% pupariation in control is around 110 hours AED in figures 1, 2 and 3. In figures 5 and 6 using the UAS Ricin, the controls are more around 90 hours AED. Why this discrepancy?
      • What is the mechanism behind the expression of miR-184 in stress conditions? Does miR-184 also implicated in other conditions giving rise to a developmental delay (X-rays irradiation or animal bearing rasV12, scrib-/- tumors)?
      • dilp8 mutant animals have also been shown to be more resistant to starvation or desiccation (https://doi.org/10.3389/fendo.2020.00461 ). Is miR-184 implicated in this answer?
      • dilp8 expression has been also shown to be regulated by Xrp1 in response to ribosome stress (https://doi.org/10.1016/j.devcel.2019.03.016). This paper should be included in the manuscript Is it possible that the expression levels of miR184 are regulated by Xrp1?

      Minor points:

      • Does the overexpression of miR184 induce an increased fluctuating asymmetry?
      • There are 2 references Colombani et al. (2012 for Dilp8 and 2015 for Lgr3). Can you double check that they are used accordingly

      Significance

      Altogether, the paper present compiling lines of evidence supporting the proposed model. The experiments are well designed and are convincing. The papers is interesting and relevant for a broad audience.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this study, Fernandes and colleagues addressed the question of the role of micro-RNAs in regulating the coupling between organ growth and developmental timing. Using Drosophila, they identified the conserved micro-RNA miR-184 as a regulator of the developmental transition between juvenile larval stages and metamorphosis. This transition is under the control of the steroid hormone Ecdysone, and has been shown to be modulated in case of abnormal tissue growth to adjust the duration of larval growth in response to developmental perturbations. The relaxin-like hormone Dilp8 has been identified as a key secreted factor involved in this coupling. Here, the authors show that miR-184 is involved in the regulation of Dilp8 expression both in physiological conditions and upon growth perturbation. They propose that this function is carried out in imaginal tissues, where miR-184 levels are modulated by tissue stress. While several factors have already been involved in triggering sharp dilp8 induction at the transcriptional level, this study adds another level of complexity to the regulation of Dilp8 by proposing that its expression is fine-tunned post-transcriptionally through repression by miR-184.

      Major Comments

      Overall, the manuscript is well organized, and the logics of the experimental plan well presented. The results are clear, and I appreciate the quality of the pupariation curves. However, I believe that two main conclusions of the paper are not fully supported by the results presented in the figures: the direct regulation of dilp8 3'UTR by miR-184, and the specificity of this regulation in imaginal discs. Here I develop in more details these two aspects. 1. The strategy of the 3'UTR sensor is not fully optimized. Indeed, in most experiments, qRT-PCR is used to assess dilp8 expression levels, although it reflects both transcriptional and post-transcriptional. Importantly, to show that post-transcriptional regulation is involved in the response to tissue damage, the levels of the 3'UTR sensor should be analyzed in discs expressing RAcs (showing at the same time that the response is cell-autonomous in the discs). The expected upregulation of the sensor should be prevented by simultaneous expression of miR-184. This approach would shed light on the relative contribution of transcriptional versus post-transcriptional regulation of dilp8 in response to growth perturbation. 2. In my opinion, the use of a 3'UTR sensor is not sufficient to conclude that the regulation by miR-184 is direct, as miR-184 could also regulate an intermediate factor that acts on dilp8 post-transcriptional regulation. To solve this issue, a common strategy is to generate a 3'UTR sensor with mutated binding sites that should abolish the regulation by miR-184. This mutated 3'UTR might also respond differently to tissue damage, which would strongly support the conclusions of the study. 3. Concerning the tissue-specific regulation of Dilp8 by miR-184, these results need to be strengthened. Indeed, this comes mostly from phenotypes observed with rn-GAL4. Although this is a classical tool for driving expression in imaginal discs, rn-GAL4 also drives strong expression in other tissues that could contribute to triggering a delay, such as the CNS and part of the gut (proventriculus). In our hands, some growth phenotypes in the wing obtained with rn-GAL4 could be fully reverted by blocking GAL4 in the CNS indicating that the phenotype was not wing-specific. Importantly, miR-184 seems to be highly expressed in the CNS according to FlyBase, reinforcing the possibility that it plays a role in this organ. Here I propose approaches to confirm that miR-184 mediated regulation of dilp8 and developmental timing indeed occur in the discs: - Another driver with less secondary expression sites could be used (pdmR11F02-GAL4), or rn-GAL4 could be combined with an elav-GAL80 to prevent expression in most neurons. - The authors could identify the source of Dilp8 upregulation in miR-184 mutants using tissue-specific qRT-PCR instead of whole larvae expression like in Fig 4A-B. - This tissue-specific upregulation could be functionally tested using a rescue experiment, in which the delay observed in miR-184 mutants could be rescued by disc-specific downregulation of Dilp8 (using pdm2-GAL4 for instance).

      Optional: Because it is known that dilp8 is strongly regulated at the transcriptional level, the relative input from post-transcriptional upregulation is an important question arising from this study. Although it might be a more long-term approach, I believe that generating a Dilp8 mutant lacking its 3'UTR or, even better, with mutated miR-184 binding sites, would shed light on the role of this regulation for the response to growth perturbation and/or developmental stability (fluctuating asymmetry).

      Minor Comments

      • I think that a number of results could be moved to SI as they are either controls, or reproduce published data without bringing novelty. For instance, results in Fig 5A-D are similar to data published by Sanchez et al, as stated in the text. Fig6A as well.
      • Fig 6D is quite mysterious, as it suggests that basal JNK activation regulates miR-184, which is different from a context of tissue damage. I think that this result could be removed. Alternatively, if the authors want to dig in that direction, more experiments should be provided, such as bskDN expression in an RAcs context and the effects on miR-184 levels and the 3'UTR sensor (since transcript levels are already published).
      • The references related to Dilp8 should be checked more in detail in the intro and discussion. About Dilp8 and developmental stability: remove the ref to Colombani et al 2012, instead put Boone et al 2016 and add Blanco-Obregon et al 2022 (in addition to Garelli et al 2012 who initially identified this phenotype. About Lgr3 as the receptor for Dilp8: add Colombani et al, Current Biology 2015, and cite here Vallejo et al 2015, Garelli et al 2015. Among the important transcriptional regulators of Dilp8, Xrp1 could be mentioned (Boulan et al 2019, Destefanis et al 2022) as it plays a complementary function to JNK depending on the type of tissue stress.

      Significance

      General Assessment

      This study provides convincing data showing that the conserved microRNA miR-184 plays a role in regulating developmental timing in Drosophila through modulating the levels of Dilp8, a key factor in the coupling between tissue growth and developmental transitions. The results are convincing, but the general conclusions of the paper need to be strengthened regarding the direct regulation of dilp8 by miR-184 and the tissue-specificity of this interaction.

      Advance

      Dilp8 is a key factor that modulates growth and timing in response to developmental perturbations and contributes to developmental precision in physiological conditions. As such, its regulation has been studied by different groups in the last decade, leading to the identification of several inputs for its transcriptional regulation. Here, the authors uncover a post-transcriptional regulation by miR-184, adding another level of regulation of Dilp8 that contribute to ensuring proper regulation of developmental timing, and opening the possibility that miR-184 might play similar roles in other species.

      Audience

      This study is of interest for researchers in the field of basic science, with a focus on developmental timing, tissue damage and biological function of microRNAs.

      Reviewer expertise

      Drosophila, growth control, developmental timing, Dilp8.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewer for their positive comments regarding the research article titled "The Ketogenic Diet Metabolite 1 β-Hydroxybutyrate Promotes Mitochondrial Elongation via Deacetylation and Improves Autism-like Behaviour in Zebrafish" by Uddin GM and colleagues. We appreciate your input, and we will address these comments as indicated below with specific responses to each point raised by reviewers.

      The main changes in the updated manuscript are as follows:

      We have revised the introduction to now incorporate additional background information on mitochondria, NAD, and mitochondrial dynamics and function. This addition aims to provide readers with a broader understanding of the mitochondrial context in relation to our study.

      Furthermore, we recognize that previous studies have explored mitochondrial function in the context of the ketogenic diet. While our specific investigation centered on mitochondrial morphology, we acknowledge the importance of comprehensively investigating mitochondrial function. To this end, we have added new data showing how BHB impacts mitochondrial oxidative phosphorylation in HeLa cells (Sup Fig 2), and how both BHB and NMN impact oxygen consumption/glycolysis in zebrafish (Fig 7).

      We have also added new behaviour analysis of the zebrafish (Fig 6), and have re-framed the discussion around neurodevelopment generally, rather than ASD specifically.

      Finally, we have now included a section in our manuscript that discusses the limitations of our study. These limitations can be further investigated to explore and characterize the full mechanistic potential behind the effects of the ketogenic diet and/or NMN on mitochondrial dynamics.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Uddin GM and colleagues presented a research article entitled 'The Ketogenic Diet Metabolite 1 β-Hydroxybutyrate Promotes Mitochondrial Elongation via Deacetylation and Improves Autism-like Behaviour in Zebrafish'. Roles of ketogenic diet (KD) and NAD+ precursors in health promotion and longevity, as well as on the alleviation of a broad range of diseases are evident. However, their roles in autism are not well done, which is the novelty of the current study. Addressing below questions will improve the quality of the paper.

      Major concerns 1. In the introduction section, a broad overview of the roles of ketogenic diet (KD) in neurodegenerative disease (and ageing, if possible) should be provided. E.g., the authors should summarize exciting progress on the use of KD to treat Alzheimer's disease in animal models (PMID: 23276384). *

      Response: Thank you for your valuable suggestion. While it is true that the KD appears to be beneficial in neurodegenerative (and other disease) models, our focus in this paper is looking at neurodevelopment, rather than all potential benefits of the KD. Nonetheless, we have addressed this comment by incorporating a brief overview of the roles of the KD in neurodegenerative diseases, including Alzheimer's disease (AD), in the introduction section of the manuscript. Specifically, we have summarized the exciting progress made in utilizing KD to treat AD in animal models, as highlighted in the suggested study. This addition helps to provide a better overview of the potential therapeutic effects of KD in neurodegenerative diseases and strengthens the introduction section of the manuscript.

      • Roles of high fat diet to treat diseases could be extended to rare premature ageing diseases. In such scenario, high fat and NAD+ boosting shared some joint mechanisms (PMID: 25440059 ). *

      Response: This information and the reference are now added to the discussion.

      *In the introduction, a more detailed introduction of NAD+ and its roles in mitochondrial homeostasis (especially mitophagy and the mitochondrial fusion-fission balance) should be included (PMID: 24813611; PMID: 30742114; PMID: 31577933). *

      Response: Although our paper focused primarily on mitochondrial fission and fusion, we have incorporated a new paragraph in the introduction to provide a more detailed introduction detailing NAD+ and its roles in mitochondrial homeostasis, specifically highlighting mitophagy. We have included the suggested references.

      • In regarding to the statement of KD increases NAD+, was it due to increased generation (to check protein levels and activities of different NAD+ synthetic enzymes, such as iNAMPT, NMNAT1-3, and NRK) and/or reduced consumption (in addition to reduced glycolysis, does KD inhibit the activities of CD38 and PARPs? In this paper, Sirtuins' activities is (are increased)). Detailed exploration of the activities of these proteins will unveil a clear molecular mechanisms on how KD affects/regulates NAD+. *

      Response: Thank you for the comment. We agree that exploring the detailed mechanism of how the ketogenic diet (KD) affects NAD+ is an interesting question that will have important implications once answered. However, fully elucidating the mechanism of action would require a more comprehensive investigation, which is beyond the scope of this current project. We have now added this as a future direction in the manuscript.

      *Fig. 1: in the NAD+ field, the normal used NR/NMN concentrations are normally high like to use 500 µM to 2-5 mM (as the NAD+ levels in cells are high). In addition to use 50 µM, the authors are strongly to have a dose-dependent study (50 µM, 500µM, 1, 2, 5 mM), and see changes of mitochondrial funciton and parameters. In this condition, NAD+ levels should be also checked. *

      Response: We have added new supplemental data showing the initial dose response of the effects of BHB and NMN on mitochondrial morphology, which led us to choosing the relevant doses for the remainder of the paper. Our objective was not to investigate the broad impacts of different NMN concentrations on mitochondrial function and parameters, or NAD+ levels. As such, we have only focused on doses where we see effects on mitochondrial morphology.

      *Fig. 2: a comprehensive characterization of mitochondrial fusion-fission should be performed. In addition to the protein evaluated, changes on other key fusion-fission proteins, like Bax, Bak, Mfn-1, Mfn-2, etc should be performed (PMID: 17035996; PMID: 24813611). *

      Response: We agree that looking at other key proteins involved in mediating mitochondrial fission and fusion could provide additional insight. Indeed, given the changes in global acetylation that we see, it is expected that some other proteins may also be regulated in this way. However, there are at least a dozen proteins involved in mediating mitochondrial fusion and fission, not to mention many more proteins that regulate these proteins. Unfortunately, it is not feasible to analyze all the proteins involved in mitochondrial fusion-fission. Moreover, looking only at protein levels, doesn't necessarily inform about the activity of any protein. Instead, we concentrated in this paper on investigating known links between protein acetylation and mitochondrial dynamics, particularly focusing on the proteins that have known links to acetylation (i.e., DRP1, OPA1, MFNs). We have added a note in the discussion acknowledging that other means of regulation could also be occurring in parallel.

      *Figs. 1-5 were focused on mitochondrial morphology, whether KD and NMN changed mitochondrial funciton should be explored, such as to use seahorse to check ECR and OCR. *

      Response: Although our question was focused on morphology, we agree that mitochondrial function is important. We have added new data showing that BHB increases basal oxygen consumption in HeLa cells (Sup Fig 2), as well as new data showing that BHB and NMN influence oxygen consumption and glycolysis in our zebrafish model (Fig 7)

      • Fig. 6: NR/NMN used in animal studies (via gavage or in drinking water in mice, and on plate for worms and flies) are normally high (e.g., in drinking water for mice could be 4-12 mM; for worms and flies are normally 1-5 mM); for zebrafish, while they are swimming in water, this reviewer concerned whether it was true that 50 µM of NMN was sufficient to show the benefit presented.*

      Response: Our data show that these doses are indeed sufficient. We did look at some higher doses for NMN, but these were toxic, leading to poor survival and were not studied further.

      *Minor concerns 1. Line 26: For 'a growing list of neurological disorders, including autism spectrum disorder (ASD)', please add AD in. *

      Response: Line 26 is part of the abstract, which we feel should be focused more on the main message of the paper, which does not involve AD. As addressed above, we have added AD as an example in the introduction.

      *Line 57: For 'with side effects such as gastrointestinal disturbances, nausea/vomiting, diarrhea, constipation, and hypertriglyceridemia being reported', rate of frequency shall be provided if any. *

      Response: We have modified the statement to indicate the relative percent of patients suffering the various side effects.

      *Reviewer #1 (Significance (Required)):

      The novelty of the current study was to investigate effects of KD and NAD+ on autism. This investigation was not performed before and thus is the novelty.

      Weakness, effects of KD and NAD+/NMN on mitochondrial function were not well-investigated and should be done. Introduction was not well done, many key information in the fields were not provided which may mislead the readers an over-evaluation of the novelty of the current study.*

      Response: As outlined above, we have edited the introduction to include additional information requested by the reviewer. Moreover, our focus in this manuscript was to look at the mechanisms underlying changes in mitochondrial morphology, not mitochondrial function per se, though this is clearly important and related. Nonetheless, as discussed above, we have also added new data showing how BHB impacts mitochondrial function.

      *My expertise lies in NAD+, mitochondria, and brain health.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The study examined the effect of beta-hydroxybutyrate and nicotinamide nucleotide on mitochondrial morphology and the molecular pathways which mitigate this effect as well as the effect of these treatments on behavior in zebrafish. The study is well done and well written. The only thing I think that could be improved are the bar in the graph some the significant comparisons. It is sometimes difficult to see which groups are being compared.*

      Response: We're happy to adjust how the data is displayed in the relevant bar graphs, but it is not clear exactly what changes the reviewer would like. To some degree this will depend on the specific guideline of the final journal where we hope the manuscript will be published. As such, we have not made changes at this point.

      ***Referees cross-commenting**

      The other reviewers do have some fair comments. Multiple doses would be helpful and showing bioenergetic data would complement the morphological measurements. Additionally, behavioral assays showing changes in social behavior in the Zebrafish would provide a stronger link to ASD. *

      Response: As discussed above, we have added new information on doses and mitochondrial bioenergetics. With respect to behaviour, we have added thigmotaxis data and reworked the discussion around behaviour and neurodevelopment so that it is less specific to ASD.

      *Reviewer #2 (Significance (Required)):

      As beta-hydroxybutyrate is an important substrate for the ketogenic diet, this study helps explain the potential mechanisms in which the ketogenic diet may enhance mitochondrial function.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this paper, Uddin and colleagues have investigated components of the ketogenic diet to understand changes in both mitochondrial morphology and protein expression, and zebrafish locomotor behaviour. They investigate whether beta-hydroxybutyrate (BHB) or nicotinamide nucleotide (NMN) application can later human mitochondria in HeLA cell lines, and also recue a locomotion defect in shank3b+/- zebrafish larvae that have previously been proposed as a model for autism. This study is strengthened by showing data from two species; however the link between the HeLA cell line data and larval zebrafish is not strong. The study would be improved by assessing zebrafish mitochondrial changes after drug application, and testing more than one concentration of BH and NMN in the behavioural assay. This is an interesting study, and it is nicely written and presented. I have made some comments to strengthen the study below.

      Major comments My expertise is in modelling some aspects of autism in zebrafish. To this end I have focussed on the zebrafish part of this manuscript more fully. I have several comments related to the zebrafish experiments. 1. The changes in mitochondrial morphology, peroxisome number and mitochondrial protein levels were measured in HeLA cells and not comparable data is shown for zebrafish. The same experiments should be repeated using larval zebrafish or a zebrafish cell line. *

      Response: We chose to use HeLa cells for the mechanistic studies due to practical reasons. Cell lines offer a controlled and well-established system for investigating cellular processes and molecular mechanisms. Measuring these parameters in tissues is significantly more challenging and requires different reagents (e.g., antibodies) and methodology (electron microscopy) that are not feasible in the current study.

      On the other hand, zebrafish larvae were employed for the behavior studies, which cannot be conducted using cell lines. By utilizing zebrafish, we were able to examine the effects of beta-hydroxybutyrate (BHB) and nicotinamide nucleotide (NMN) on locomotor behavior, providing valuable insights into potential therapeutic implications for autism.

      While we acknowledge the limitations of not directly measuring mitochondrial morphology, peroxisome number, and mitochondrial protein levels in zebrafish, we believe that our study provides significant contributions to understanding the effects of BHB and NMN in zebrafish behavior. Future studies could certainly consider incorporating zebrafish-specific experiments to complement the findings in HeLa cells.

      • How did you choose the concentration of BHB and NMN to use in behavioural experiments? And the timing of application - I don't really understand why you waited 3 days after drug application to measure locomotion. *

      Response: These doses chosen initially as they were similar the doses that induced mitochondrial elongation in HeLa cells and were tolerated by the fish larvae. As we saw promising effects at these initial doses, we decided to explore them in more detail. While we agree that it would be worth comparing the effects of additional doses, as well as looking at their effects at other timepoints, such work would be a major endeavour and is beyond the scope of our initial investigations, which we feel are worth reporting in their current state.

      With respect to the treatment paradigm, fish larvae were treated 10-48 hours post fertilization, as this is a critical neurogenic developmental timepoint that is often used for exposure studies. Fish do not fully hatch until 3-4 days post fertilization, and display only minimal movement before 5 days, which is why we waited until 5 days to look at movement.

      • Do the shank3b+/- larvae show any morphological deficits? Their decrease in locomotion is striking. Is the morphology also rescued by drug application? Can you tie this to the mitochondrial changes that you observed in HeLA cells?*

      Response: We do not observe any gross changes in fish morphology that might explain a decrease in locomotion. Unfortunately, it is not feasible to look at mitochondrial morphology in the fish at this time. However, based on previous published work showing that the ketogenic diet promotes mitochondrial elongation in mouse brains (PMID:32380723), we would expect mitochondrial morphology also to be changed in the fish. Nonetheless, as we have not examined this directly in fish, we are not making this specific claim in this manuscript.

      • In figure 6A you use time spent swimming as a readout of distance. This doesn't really make sense, because without also showing speed of swimming it is not possible to know whether time and distance correlate in the same way across genotypes. This figure could be improved by showing more detail - speed of swimming, time spent immobile etc. This can easily be extracted from the films that you have already made using the ViewPoint software. *

      Response: As requested, we have reanalyzed the zebrafish movement data for a more refined analysis. In the revised version (Fig 6), we include analysis of both speed and distance travelled within a defined time. Importantly, these findings still support differences between WT and shank3b+/- fish that are restored by BHB and NMN to varying degrees.

      • Showing a change in locomotion is not enough to claim that a model is autism-like. At a minimum I think that you need to show changes in social behaviour - likely using older fish (more than three weeks) that interact with each other. Changes in locomotion can be caused by so many factors, many of which are not indicative of autism. It is important that as a field we do not simply claim that locomotion can be used as a proxy for more complex disease phenotypes. This recent review may help you with this point:* https://www.frontiersin.org/articles/10.3389/fnmol.2020.575575/full.

      Response: The reviewer makes an important point that the movement behaviour phenotypes that we see do not necessarily represent classic ASD phenotypes (i.e., repetitive behaviour, reduced sociability, and reduced communication). To begin to address this issue, we analyzed thigmotaxis, which can be a measure of anxiety. Notably, we also see differences that are reversed by BHB and NMN. However, we cannot model all ASD behaviours in a fish model, and we are not set up to look at social behaviour, especially in the young fish that we were studying. As such, even though Shank3 is a recognized ASD gene, and the shank3b+/- model we are studying is a validated ASD model (PMID: 29619162), we have re-phrased the manuscript in the context of neurodevelopment generally, rather than with respect to ASD specifically. As such, we ascribe the movement and thigmotaxis phenotypes as neurodevelopmental phenotypes that are improved by BHB and NMN.

      *For the statistics, as far as I can tell, all of the data should be analysed by ANOVA or the non-parametric equivalent followed by a post-hoc test. Please check this and add information about normality in. *

      Response: As requested, we have clarified our statistical methodology throughout the manuscript.

      For the mechanistic data, we used t-tests for direct comparisons between two groups (e.g., vehicle vs. treatment). While multiple conditions such as vehicles, NMN, BHB, or etomoxir were tested, statistical comparisons were only conducted comparisons between the vehicle and each treatment group individually. As we are not also making comparisons between treatments this is not a multiple comparison, and ANOVA is not applicable in this context. We have clarified this rationale in the manuscript to avoid any confusion.

      For the zebrafish study, where multiple factors were involved (e.g., treatments across different time points or conditions), we performed a two-way ANOVA followed by Tukey's post-hoc test to identify specific group differences. This approach was appropriate for analyzing these datasets and ensures robust conclusion.

      With respect to normality testing, all datasets were assessed for normality using the Shapiro-Wilk test, and no violations of normality were observed. The updated text now includes these details.

      *Minor comments

      1. Make sure that you refer to the fish line as shank3b+/- throughout - see abstract.*

      This has bee corrected.

      • Please add a space between all numbers and units (e.g. 5 Mm). *

      This has bee corrected.

      • There is a spelling error on line 340 page 16: finings instead of findings. *

      This has bee corrected.

      • In figure 1, if each dot represents a different sample, then there appear to be many fewer samples analysed in 1D compared to 1B. Can you comment upon this please*

      __Response: __A total of 80-150 cells were counted per condition, and the analyses were performed on 3 independent replicates with 2 independent technical replicates for each treatment condition. The quantification of mean mitochondrial branch length in Figure 1B was measured using Image-J and the MiNA plugin. The measurements were taken from three independent replicates using a standard region of interest (ROI) and randomly selected areas from each image.

      In Figure 1D, NAD+ levels were measured 24 hours after treatment of vehicle, βHB, NMN, or Eto+βHB in HeLa cells (n=3-6/group). Each sample lysate represents an independent experimental dish from which coverslips were collected for image analysis.

      The difference in sample numbers between Figure 1B and 1D arises because image analysis involves individual cells fixed and stained on coverslips, whereas the NAD assay requires the whole lysate from the entire cell culture dish. Therefore, the higher cell count in Figure 1B represents the number of cells analyzed on coverslips, while Figure 1D represents NAD levels from the lysate normalized to the protein concentration.

      *Reviewer #3 (Significance (Required)):

      I think that this will be interesting to autism researchers and it could lead to more investigation of the ketogenic diet. Some more work is needed, likely in other model organisms, before this research can be translated to human patients. *

      __Response: __We agree that the findings of our study could be of interest to autism researchers and have implications for further investigation of the ketogenic diet (KD). It is important to note that further work, including studies in other model organisms, would be beneficial before translating this research to human patients.

      Our study aimed to provide mechanistic insights into the effects of the KD on mitochondrial morphology and behavior. We recognize that the translation of research findings to human patients requires rigorous investigation, including preclinical and clinical studies. Our study contributes to the understanding of the underlying mechanisms involved in the KD's effects, laying the groundwork for future research and potential therapeutic avenues.

      We appreciate your perspective and emphasize that our intention is to provide valuable insights into the mechanisms underlying the KD's effects rather than suggesting immediate translation to human patients. Further investigation and validation in diverse models and clinical settings will be necessary before considering clinical applications.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this paper, Uddin and colleagues have investigated components of the ketogenic diet to understand changes in both mitochondrial morphology and protein expression, and zebrafish locomotor behaviour. They investigate whether beta-hydroxybutyrate (BHB) or nicotinamide nucleotide (NMN) application can later human mitochondria in HeLA cell lines, and also recue a locomotion defect in shank3b+/- zebrafish larvae that have previously been proposed as a model for autism. This study is strengthened by showing data from two species; however the link between the HeLA cell line data and larval zebrafish is not strong. The study would be improved by assessing zebrafish mitochondrial changes after drug application, and testing more than one concentration of BH and NMN in the behavioural assay.

      This is an interesting study, and it is nicely written and presented. I have made some comments to strengthen the study below.

      Major comments

      My expertise is in modelling some aspects of autism in zebrafish. To this end I have focussed on the zebrafish part of this manuscript more fully. I have several comments related to the zebrafish experiments.

      1. The changes in mitochondrial morphology, peroxisome number and mitochondrial protein levels were measured in HeLA cells and not comparable data is shown for zebrafish. The same experiments should be repeated using larval zebrafish or a zebrafish cell line.
      2. How did you choose the concentration of BHB and NMN to use in behavioural experiments? And the timing of application - I don't really understand why you waited 3 days after drug application to measure locomotion.
      3. Do the shank3b+/- larvae show any morphological deficits? Their decrease in locomotion is striking. Is the morphology also rescued by drug application? Can you tie this to the mitochondrial changes that you observed in HeLA cells?
      4. In figure 6A you use time spent swimming as a readout of distance. This doesn't really make sense, because without also showing speed of swimming it is not possible to know whether time and distance correlate in the same way across genotypes. This figure could be improved by showing more detail - speed of swimming, time spent immobile etc. This can easily be extracted from the films that you have already made using the ViewPoint software.
      5. Showing a change in locomotion is not enough to claim that a model is autism-like. At a minimum I think that you need to show changes in social behaviour - likely using older fish (more than three weeks) that interact with each other. Changes in locomotion can be caused by so many factors, many of which are not indicative of autism. It is important that as a field we do not simply claim that locomotion can be used as a proxy for more complex disease phenotypes. This recent review may help you with this point: https://www.frontiersin.org/articles/10.3389/fnmol.2020.575575/full.
      6. For the statistics, as far as I can tell, all of the data should be analysed by ANOVA or the non-parametric equivalent followed by a post-hoc test. Please check this and add information about normality in.

      Minor comments

      1. Make sure that you refer to the fish line as shank3b+/- throughout - see abstract.
      2. Please add a space between all numbers and units (e.g. 5 Mm).
      3. There is a spelling error on line 340 page 16: finings instead of findings.
      4. In figure 1, if each dot represents a different sample, then there appear to be many fewer samples analysed in 1D compared to 1B. Can you comment upon this please?

      Significance

      I think that this will be interesting to autism researchers and it could lead to more investigation of the ketogenic diet. Some more work is needed, likely in other model organisms, before this research can be translated to human patients.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study examined the effect of beta-hydroxybutyrate and nicotinamide nucleotide on mitochondrial morphology and the molecular pathways which mitigate this effect as well as the effect of these treatments on behavior in zebrafish. The study is well done and well written. The only thing I think that could be improved are the bar in the graph some the significant comparisons. It is sometimes difficult to see which groups are being compared.

      Referees cross-commenting

      The other reviewers do have some fair comments. Multiple doses would be helpful and showing bioenergetic data would complement the morphological measurements. Additionally, behavioral assays showing changes in social behavior in the Zebrafish would provide a stronger link to ASD.

      Significance

      As beta-hydroxybutyrate is an important substrate for the ketogenic diet, this study helps explain the potential mechanisms in which the ketogenic diet may enhance mitochondrial function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Uddin GM and colleagues presented a research article entitled 'The Ketogenic Diet Metabolite 1 β-Hydroxybutyrate Promotes Mitochondrial Elongation via Deacetylation and Improves Autism-like Behaviour in Zebrafish'. Roles of ketogenic diet (KD) and NAD+ precursors in health promotion and longevity, as well as on the alleviation of a broad range of diseases are evident. However, their roles in autism are not well done, which is the novelty of the current study. Addressing below questions will improve the quality of the paper.

      Major concerns

      1. In the introduction section, a broad overview of the roles of ketogenic diet (KD) in neurodegenerative disease (and ageing, if possible) should be provided. E.g., the authors should summarize exciting progress on the use of KD to treat Alzheimer's disease in animal models (PMID: 23276384).
      2. Roles of high fat diet to treat diseases could be extended to rare premature ageing diseases. In such scenario, high fat and NAD+ boosting shared some joint mechanisms (PMID: 25440059 ).
      3. In the introduction, a more detailed introduction of NAD+ and its roles in mitochondrial homeostasis (especially mitophagy and the mitochondrial fusion-fission balance) should be included (PMID: 24813611; PMID: 30742114; PMID: 31577933).
      4. In regarding to the statement of KD increases NAD+, was it due to increased generation (to check protein levels and activities of different NAD+ synthetic enzymes, such as iNAMPT, NMNAT1-3, and NRK) and/or reduced consumption (in addition to reduced glycolysis, does KD inhibit the activities of CD38 and PARPs? In this paper, Sirtuins' activities is (are increased)). Detailed exploration of the activities of these proteins will unveil a clear molecular mechanisms on how KD affects/regulates NAD+.
      5. Fig. 1: in the NAD+ field, the normal used NR/NMN concentrations are normally high like to use 500 µM to 2-5 mM (as the NAD+ levels in cells are high). In addition to use 50 µM, the authors are strongly to have a dose-dependent study (50 µM, 500µM, 1, 2, 5 mM), and see changes of mitochondrial funciton and parameters. In this condition, NAD+ levels should be also checked.
      6. Fig. 2: a comprehensive characterization of mitochondrial fusion-fission should be performed. In addition to the protein evaluated, changes on other key fusion-fission proteins, like Bax, Bak, Mfn-1, Mfn-2, etc should be performed (PMID: 17035996; PMID: 24813611).
      7. Figs. 1-5 were focused on mitochondrial morphology, whether KD and NMN changed mitochondrial funciton should be explored, such as to use seahorse to check ECR and OCR.
      8. Fig. 6: NR/NMN used in animal studies (via gavage or in drinking water in mice, and on plate for worms and flies) are normally high (e.g., in drinking water for mice could be 4-12 mM; for worms and flies are normally 1-5 mM); for zebra fish, while they are swimming in water, this reviewer concerned whether it was true that 50 µM of NMN was sufficient to show the benefit presented.

      Minor concerns

      1. Line 26: For 'a growing list of neurological disorders, including autism spectrum disorder (ASD)', please add AD in.
      2. Line 57: For 'with side effects such as gastrointestinal disturbances, nausea/vomiting, diarrhea, constipation, and hypertriglyceridemia being reported', rate of frequency shall be provided if any.

      Significance

      The novelty of the current study was to investigate effects of KD and NAD+ on autism. This investigation was not performed before and thus is the novelty.

      Weakness, effects of KD and NAD+/NMN on mitochondrial function were not well-investigated and should be done. Introduction was not well done, many key information in the fields were not provided which may mislead the readers an over-evaluation of the novelty of the current study.

      My expertise lies in NAD+, mitochondria, and brain health.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments and overall very supportive feedback.

      Reviewer #1 writes: "The study is very thorough and the experiments contain the appropriate controls. (...) The findings of the study can have relevance for human conditions involving disrupted mitochondrial dynamics, caused for example by mutations in mitofusins." Reviewer #2 writes: "The dataset is rich and the time-resolved approach strong." Reviewer #3 writes: "I admire the philosophy of the research, acknowledging an attempt to control for the many possible confounding influences. (...) This is a powerful and thoughtful study that provides a collection of new mechanistic insights into the link between physical and genetic properties of mitochondria in yeast."

      We address all points below. We have not yet updated our text and figures since we expect substantial additions from new experiments. But we have included Figure R1 with some additional analyses of existing data at the bottom of the manuscript.

      Reviewer1

      1.1 Statistical comparisons are missing throughout the manuscript (with the exception of Fig. 2c). Appropriate statistical tests, along with p-values, should be used and reported where different gorups are compared, for example (but not limited to) Fig. 3d and most panels of Fig. 4.

      We initially decided not to add too many extra labels to the already very busy plots, given that the magnitude of change mostly speaks for itself. However, we will try to find meaningful statistical tests together with a sensible graphical representation for all of the figures. For one example see Figure R1A.

      1.2. I do not agree with the use of Atp6 protein as a direct read-out of mtDNA content. While Atp6 protein levels will decrease with decreasing mtDNA content, the inverse is not necessarily true: decreased Atp6 protein levels do not necessarily indicate decreased mtDNA levels, because they could alternatively or additionally be caused by decreased transcription and/or translation. Therefore, please do not equate Atp6 protein levels to mtDNA levels, and instead rephrase the text referencing the Atp6 experiments in the Results and Discussion sections to measure "mtDNA expression" or "mt-encoded protein" or similar. For example, on p. 14 line 431 should read "mtDNA expression" rather than "decreased synthesis of mtDNA", and line 440 on the same page "mean mtDNA levels" should be "mtDNA expression" or similar.

      All three reviewers agree that using Atp6-NG as a direct proxy for mtDNA requires more validation, or at least rephrasing of the text. We agree that this is the most important point to address. We had previously tried using the mtDNA LacO array (Osman et al. 2015) to directly assess the amount of nucleoids per cell. However, the altered mitochondrial morphology of the Fzo1 depleted cells combined with the LacI-GFP which is still in mitochondria even when mtDNA is gone, increases the noise level to a point that we cannot interpret the signal. However, as this manuscript was in the submission process, the Schmoller lab (co-authors #2 and #7) adapted the HI-NESS system to label mtDNA in live yeast cells(Deng et al. 2025). This system promises much better signal to noise and we expect we can address all concerns regarding the actual count of nucleoids per cell. Should this unexpectedly fail for technical reasons, we will try to calibrate the Atp6-levels with DAPI staining at defined time points and will rephrase the text as the reviewer suggests.

      1.3. In Fig. 3, the authors use the fluorescence intensity of a mitochondrially-targeted mCardinal as a read-out of mitochondrial mass. Please provide evidence that this is not affected by MMP, either with relevant references or by control experiments (e.g. comparing it to N-acridine orange or other MMP-independent dyes or methods).

      Whether or not the import of any mitochondrial protein is dependent on the MMP depends largely on the signal sequence. The preSu9-signaling sequence was previously characterized as largely independent of the MMP compared to other presequences (Martin, Mahlke, and Pfanner 1991), which is why Vowinckel (Vowinckel et al. 2015) and others (Di Bartolomeo et al. 2020; Perić et al. 2016; Ebert et al. 2025) have previously used this as a neutral reference to the strongly MMP-dependent pre-Cox4 signal to estimate MMP. As one control in our own data, we consider that the population-averaged mitochondrial fluorescent signal Figure S3C stays constant in the first few hours, in agreement with the total averaged mitochondrial proteome (Fig R1E). As additional controls, we plan to compare the signal to an MMP independent dye as the reviewer suggests.

      1.4. In Fig. 2e-f, the authors use a promoter reporter with Neongreen to answer whether the reduced levels of the nuclear-encoded mitochondrial proteins Mrps5 and Qcr7 are due to decreased expression or to protein degradation, and find no evidence of degradation of the Neongreen reporter protein. However, subcellular localization might affect the availability of the protein to proteases. Although not absolutely required, it would be relevant to know if the Neongreen fusion protein is found in the same subcellular compartment as Mrps5 and Qcr7 at 0h and 9h after Fzo1 depletion.

      Here, it seems we need to explain the set-up and interpretation of the data better. The key point we are trying to make with the promoter-Neongreen construct is that the regulation is not mainly at the level of transcription. We are showing that the reduction in the levels of the actual protein (orange bars) is not (mainly) explained by a reduction in expression, since the promoter is similarly active at 0 and at 9 hours (grey bars). If expression from the promoter were strongly reduced, the Neongreen would be diluted with growth and would also decrease, but this is not the case. The fluorophore itself is just floating around in the cytosol and is not subject to the same post-translational regulation as Mrps5 and Qcr7, so there is no reason to expect degradation.

      1.5. Fzo1 depletion leads to a very rapid drop in MMP during the first hour of depletion. In the Discussion, can the authors speculate on the possible mechanism of this rapid MMP drop that occurs well before mtDNA or mt-encoded proteins are decreased in level?

      This is indeed an interesting point. We think there are likely three reasons causing this initial drop: Firstly, due to the fragmentation the mixing of mitochondrial content is disturbed and smaller fragments may have suboptimal stoichiometry of components (see also (Khan et al. 2024) who look at this in detail including the Fzo1 deletion); secondly, already fairly early, some mitochondrial fragments may not contain any mtDNA and therefore will be unable to synthesize ETC proteins; thirdly, altered morphological features like changes in the surface-to-volume ratios may play a role. Sadly, mechanistically following up on this is not possible with the tools in our hands and therefore outside of the scope of this manuscript. But we are happy to include these speculations in our discussion.

      1.6. In Fig. 2a, the mtDNA copy number of Fzo1-depleted cells is ca 1.3-fold of the control cells at the 0h timepoint. Why might this be? Is it an impact of one of the inducers? If so, we might be looking at the combination of two different processes when measuring copy number: one that is an induction caused by the inducer(s), and the other a consequence of Fzo1 depletion itself.

      We believe that this 30% increase is within the noise of the experiment rather than an effect of the induction. Since we normalize to t=0 uninduced, the first black data point does not have error bars, emphasizing this difference. None of the protein data suggests that there is an increase in mtDNA encoded proteins (see e.g. 2B, or Atp6 fluorescence data). In the planned HI-NESS experiment, we will see in our single cell data whether there is an actual increase in mtDNA upon TIR induction. Additionally, we will run a qPCR to carefully determine mtDNA levels of untreated wild-type cells, tetracycline treated wild-type cells and tetracycline induced TIR expressing cells to exclude effects of tetracycline as well as the expression of TIR on mtDNA.

      Minor comments:

      1.7. p. 3, line 71: "ten thousands of dividing cells.." should be "tens of thousands of dividing cells".

      Thank you, will correct.

      1.8.-p.4, line 116: please be even more clear with what the "depleted" cells and controls are treated with: are depleted cells treated with both inducers, and controls with neither?

      We will make this more clear. Depleted cells are treated with both inducers, the control cells are not. However, in Figure 1A and in S1 we do controls to show that inducing TIR per se or adding aTC per se does not change growth rate or mitochondrial morphology. We will make this more clear.

      1.9. -p.5, lines 147-148: the authors write "the rate with which the abundance of Cox2 and Var1 proteins decreases was similar to the rate of mtDNA loss" though the actual rate is not shown. Please calculate and show rates for these processes side by side to make comparison possible, or alternatively rephrase the statement.

      Indeed this was not phrased well. We will call it dynamics rather than rates.

      1.10. -Fig. 2d: changing the y-axis numbering to match those in panels a and b would facilitate comparisons.

      Makes sense, we will change this.

      1.11. Fig. 2e: it is recommended to label the western blot panels to indicate what protein is being imaged in each (Neongree,, Mrps5, Qcr7).

      We will adapt the labelling to make it more clear.

      1.12. -p.9, line 262: I suggest referencing Fig. 4e at the end of the first sentence for clarity.

      We will modify the sentence as suggested.

      1.13. -In the sections related to Fig. 3a and Fig. 5a as well as the connected supplemental data, the authors discuss both the median and the mean of mitochondrial mass and Atp6 protein, respectively. For purposes of clarity, I suggest decreasing the focus on the mean (that is provided only in the supplemental data) and focusing the text mainly on the median. The two show differing trends and it is very good that both are shown, but the clarity of the text can be improved by focusing more on the median where possible.

      We will check the phrasing and simplify.

      1.14. -p. 14, line 435: the statement that mt mass is maintained over the first 9h of depletion is only true for the mean mt mass, not for the median. Please make this clear or rephrase.

      We will check phrasing, make it more clear and also point out the extended proteomics data (see Fig R1), which corresponds to the mean of the populations

      1.15.-p.14, line 452: "mitofusions" should be "mitofusins".

      Thanks for catching this.

      Reviewer 2:

      2.1. While inducible TIR is used to reduce background, the manuscript should rigorously exclude auxin/TIR off-targets (growth, mitochondrial phenotypes, gene expression). Please include full matched controls: (plus minus)auxin, (plus minus)TIR, epitope tag alone, and a degron control on an unrelated mitochondrial membrane protein.

      We agree that rigorous controls are crucial for the interpretation of the results. However, we think we have already included most of the controls the reviewer is asking for, but we might have not pointed this out clearly enough. For example, in Fig 1A, we could make it more clear by adding more labels in which samples we added aTC, which is only described in the figure legend.

      Here is a list of all the controls:

      • Each depletion experiment is always matched with an experiment of the same strain without induction. So the genetic background as well as effects such as light exposure, time spent in the microfluidics systems, etc are controlled for.
      • Figure S1D shows that the growth rate is wildtype like in a strain containing either the AID tag or the TIR protein AND upon addition of both chemicals. It also shows that the final genetic background (AID-tag and TIR) also grows like wildtype if the inducers are not added. This conclusively shows that neither the tags/constructs nor the chemicals per se affect growth rate
      • In Figure S1C we show the mitochondrial morphology of the same controls. We will make sure to label them more consistently to match panel D, and include an actual wildtype and a FLAG-AID-Fzo1 strain without TIR treated with both aTC and 5-Ph-IAA as direct comparison
      • In figure 1A we compare the Fzo1 protein levels of a strain with and without TIR. We show that in absence of TIR, adding either aTC or Auxin does not change Fzo1 levels and that the levels are comparable in the strain that is able to deplete Fzo1 directly before addition of 5-Ph-IAA (after 2 h of induction of TIR through addition of tetracycline)
      • Additionally, in Figure S2C we show that two hours after adding aTC, the entire proteome does not change significantly apart from a strong induction of TIR. We can also make this more clear in the figure legend.
      • Additionally, we will run a qPCR to carefully determine mtDNA levels of untreated wild-type cells, tetracycline treated wild-type cells and tetracycline induced TIR expressing cells to exclude effects of tetracycline as well as the expression of TIR on mtDNA. (also in response to 1.6.) In summary, we think we have controlled sufficiently for all confounding parameters and most importantly showed that addition of either aTC or Auxin as well as the FLAG-AID tag per se does not disturb mitochondria or cell growth. We do not see what a degron control on an unrelated protein will tell us. Depending on the nature of the protein, it may or may not have a phenotype that may or may not be related to morphology changes etc.

      2.2. The Mitoloc preSu9 vs Cox4 import ratio is only a proxy of mitochondrial membrane potential (ΔΨm) and itself depends on mitochondrial mass, protein expression, matrix ATP, and import saturation. The authors need to calibrate ΔΨm with orthogonal dyes (TMRE/TMRM) and pharmacologic titrations (FCCP/antimycin/oligomycin) to generate a response curve; show that Mitoloc tracks dye-based ΔΨm across the relevant range and corrects for mass/photobleaching. Report single-cell ΔΨm vs mass residuals.

      We completely agree that the MitoLoc system is only a rough proxy for the actual membrane potential. That is why we make no quantitative claims on the absolute value or absolute difference between groups of cells. We also make very clear in Fig 3B what we are actually measuring and can emphasize again in the text that this is only a proxy. We agree that it is a good idea to compare MitoLoc values to TMRE staining as the reviewer suggests, we will do these experiments in depleted and control cells at different timepoints. Please note though that also dye staining has its caveats, especially in dynamic live cell experiments. TMRM for example is not compatible with the acidic pH 5 medium that is typically used for yeast and subjecting cells to washing steps and higher pH may change both morphology of mitochondria and the MMP, especially in cells that are already “stressed”. We prefer not to complete elaborate pharmacological titration experiments because firstly, this was extensively done in the original MitoLoc paper by the Ralser lab ((Vowinckel et al. 2015), cited 120 times); secondly, the value of the MMP is not the most critical claim of the manuscript. See also 3.12. Please note that in Figure S4D we had already plotted MMP vs mitochondrial concentration.

      2.3. To use Atp6-mNeon as a proxy for mtDNA is an assumption. Interpreting Atp6 intensity as "functional mtDNA" could be confounded by translation, turnover, or assembly. Please (i) report mtDNA copy number time courses (you have qPCR), nucleoid counts (DAPI/PicoGreen or TFAM/Abf2 tagging), and (ii) assess translation (e.g., 35S-labeling or puromycin proxies) and turnover (proteasome/AAA protease inhibition, mitophagy mutants -some data are alluded to- plus mRNA levels for mtDNA-encoded genes). This will support the "reduced synthesis" versus "increased degradation" conclusion.

      We agree with all three reviewers that Atp6 is only a proxy for mtDNA (Jakubke et al. 2021; Roussou et al. 2024) and the correlation should be checked more carefully. We will use the very recently established Hi-NESS system to follow nucleoids/ mtDNA during depletion experiments. See detailed reply to 1.2.

      (ii) in Figure 2C we inhibit mitochondrial translation and show that in this case control and depleted cells have the same level of Cox2, at least suggesting that degradation is not the key mechanism controlling the levels of mtDNA encoded proteins. We cannot do proteasome inhibitor assays since the nature of the AID-TIR systems requires an active proteasome. In figure S5C we show that the Atp6 depletion is similar in an atg32 deletion. This does not completely exclude a contribution of mitophagy to the observed phenotype, but does confirm that mitophagy is not the primary reason for cells becoming petite.

      2.4. The promoter-NeonGreen reporters argue against transcriptional down-regulation of nuclear OXPHOS. Please add mRNA (RT-qPCR/RNA-seq) for representative genes and a pulse-chase or degradation-pathway dependency (e.g., proteasome/mitophagy/autophagy mutants) to firmly assign active degradation. The authors need to normalize proteomics to mitochondrial mass (e.g., citrate synthase/porin) to separate organelle abundance from protein turnover.

      While we are happy to perform qPCR experiments for selected genes, a full RNA-seq experiment seems outside the scope of this study. As explained above, a proteasome inhibitor experiment is not possible in this set-up. Bulk mitophagy/autophagy seems unlikely to be the cause of the decrease of the nuclear-encoded OXPHOS proteins, since most other mitochondrial proteins do not decrease on average on population level in the first hours. This data is now plotted as additional figure (see below) and will be included in the supplementary of the revised manuscript (Fig R1E).

      2.5. Using preSu9-mCardinal intensity as "mitochondrial concentration" is sensitive to expression, import competence, and morphology/segmentation. The authors should provide validation that this metric tracks 3D volume across fragmentation states (e.g., correlation with mito-GFP volumetrics; detergent-free CS activity; TOMM20/Por1 immunoblot per cell).

      We agree that this is an important point and the co-authors discussed this point quite intensively. In figure S3A and B we show (using confocal data) that there is a very strong correlation between the total fluorescence signal and the 3D volume reconstruction. However, the slope of the correlation is different between tubular and fragmented mitochondria (compare panels A and B) and see figure legend. Since we are dealing with diffraction-limited objects it is likely that the 3D reconstruction is sensitive to morphology, especially if mitochondria are “clumping”. We therefore think that the total fluorescence signal is actually a better estimate of mitochondrial mass per cell than the 3D volume reconstruction (especially for our data obtained with a conventional epifluorescence microscope). The mean of the total mitochondrial fluorescence also better matches the population average mitochondrial proteome (Fig R1E). To consolidate this assumption, we will additionally compare our data to a strain with Tom70-Neongreen and to MMP independent dyes.

      Notably, since the morphology is similarly altered in mothers and buds this is of minor impact for our main point – the unequal distribution between mother and buds.

      2.6. The unequal mother-daughter distribution is compelling, but causality remains inferred. Test whether modulating inheritance machinery (actin cables/Myo2, Num1, Mmr1) or altering fission (Dnm1 inhibition) modifies segregation defects and rescues mtDNA/Atp6 decline. Complementation with Fzo1 re-expression at defined times would help order the phenotype cascade.

      We agree that rescue experiments would be very useful. We have some preliminary data for tether experiments, for example with Num1. The general problem is that the fragmented mitochondria clump together. We have not found a method to restore an equal distribution between mother and daughter cells. We will try to optimize the assay, but are not overly confident it will work. Mmr1 deletion aggravates the Fzo1 phenotype, likely also because the distribution becomes even more heterogeneous, but we have not rigorously analyzed this.

      We like the idea of the Fzo1 re-expression and will run such experiments. This will be especially powerful in combination with the new HI-NESS mtDNA reporter. We may be able to track exactly when cells reach the point-of-no return and become petite. This will also help connecting our mathematical model more directly to the data.

      2.7. The model is useful but should include parameter sensitivity (segregation variance, synthesis slopes, initial nucleoid number) and prospective validation (e.g., predict rescue upon partial restoration of synthesis or inheritance, then test experimentally).

      We will refine our model to include the to-be-measured nucleoids/mtDNA values. We will include a parameter sensitivity analysis with the updated model.

      Reviewer 3:

      3.1. About the use of Atp6 as a good proxy for mtDNA content. This is assumed from l285 onwards, based on a previous publication. As the link is fairly central to part of the paper's arguments, and the system in this study is being perturbed in several different ways, a stronger argument or demonstration that this link remains intact (and unchanged, as it is used in comparisons) would seem important.

      We agree, see 1.2.

      3.2. About confounding variables and processes. The study does an admirable job of being transparent and attempting to control for the many different influences involved in the physical-genetic link. But some remain less clearly unpacked, including some I think could be quite important. For example, there is a lot of focus on mito concentration -- but given the phenotypes are changing the sizes of cells, do concentration changes come from volume changes, mito changes, or both? In "ruling out" mitophagy -- a potentially important (and intuitive) influence, the argument is not presented as directly as it could be and it's not completely clear that it can in fact be ruled out in this way. There are a couple of other instances which I've put in the smaller points below.

      Thank you for acknowledging our efforts to show transparent and well-controlled experiments! We address each of the specific points below.

      3.3. full genus name when it first appears

      We will add the full name.

      3.4. I may be wrong here, but I thought the petite phenotype more classically arises from mtDNA deletion mutations, not loss? The way this is phrased implies that mtDNA loss is [always] the cause. Whether I'm wrong on that point or not, the petite phenotype should be described and referenced.

      We can expand the text and cite additional relevant papers. The term “petite” refers to any strain that is respiratory incompetent and leads to small colonies (not necessarily small cells!) (Seel et al. 2023). This can be mutations or gene loss (fragments) on the mtDNA (these are called cytoplasmic petite), or chemically induced loss of mtDNA (e.g. EtBr), or mutations of nuclear genes required for respiration (these are termed nuclear petite; some nuclear petites show loss of mtDNA in addition to the mutation in the nuclear genome) (Contamine and Picard 2000).

      3.5. para starting l59 -- should mention for context that mitochondria in (healthy, wildtype) yeast are generally much more fused than in other organisms

      ok.

      3.6. Fig 1C -- very odd choice of y-axis range! either start at zero or ensure that the data fill as much vertical space of the plot as possible

      True, this was probably some formatting relic. We will adapt the axis to fill the full space. Most of our axes start at 0, but that doesn’t make so much sense here, since we consider the solidity in the control as “baseline”.

      3.7. "wild-type like more tubular mitochondria" reads rather awkwardly. "more tubular mitochondria (as in the wild-type)"?

      Thank you, sounds better.

      3.8. l106 -- imaging artefacts? are mitos fragmenting because of photo stress? -- this is mentioned in l577-8 in the Methods, but the data from the growth rate and MMP comparison isn't given -- an SI figure would be helpful here. It would be reassuring to know that mito morphology wasn't changing in response to phototoxicity too.

      In the methods we just briefly point out that we have done all our “due diligence” controls to check that we do not generate phototoxicity, something that we highlight in the cited review. We do not explicitly have a figure for this, but figure S1A shows that the solidity of the mitochondrial network in control cells stays the same over 9 hours, even though these cells are exposed to the same cultivation and imaging regime as the depleted cells. We will also add a picture of control cells after 9 h. In S1B we show that control cells containing TIR but no AID tag treated with both chemicals imaged over 9 hours also show the same solidity (~mitochondrial morphology) as untreated control. Also, the doubling times of cells grown in our imaging system (Fig R1B) are very similar to the shake flask (Fig R1A). All in all, we are very confident that our imaging settings did not impact our reported phenotypes.

      3.9. para l146 -- so this suggests mtDNA-encoded proteins have a very rapid turnover, O(hours) -- is this known/reasonable?

      Reference (Christiano et al. 2014) suggests that respiratory chain proteins are shorter lived than the average yeast protein. However, based on Figure 2C we think the dynamics mostly speak for a dilution by growth.

      3.10. section l189 -- it's hard to reason fully about these statistics of mitochondrial concentration given that the petite phenotype is fundamentally affecting overall cell volume. can we have details on the cell size distribution in parallel with these results? to put it another way -- how does mitochondrial *amount* per cell change?

      This is a good point. We report mostly on mitochondrial “concentrations” because we think this is what the cell actually cares about (mitochondrial activity in relationship to cytosolic activity). But we will include additional graphs on mitochondrial amount as well as size distributions (Fig R1C, related to Fig 4F). We can already point out that the size distribution of the population does not change much in the first hours. The “petite” phenotype refers to small colonies on growth medium with limited supply of a fermentable carbon source, not to smaller size of single cells.

      3.11. l199 the mean in Fig S3C certainly does change -- it increases, clearly relative both to control and to its initial value. rather than sweeping this under the carpet we should look in more detail to understand it (a consequence of the increased skew of the distribution)?

      This relates somewhat to the previous point. The increase in average concentration is not due to an increased amount in the population, but due to the fact that it is the small buds that get a very high amount of the mitochondria which “exaggerates” the asymmetric/heterogenous distribution. This will be clarified by the figures we mention in the point above.

      3.12. para line 206 -- this doesn't make it clear whether your MMP signal is integrated over all mitochondria in the cell, or normalised by mitochondrial content? this matters quite a lot for the interpretation if the distributions of mitochondrial content are changing. reading on, this is even more important for para line 222. Reading further on, there is an equation on l612 that gives a definition, but it doesn't really clarify (apologies if I'm misunderstanding).

      For each cell, we basically calculate the relative mitochondrial enrichment of the MMP sensitive vs the MMP insensitive pre-sequence.

      So, MMP= (total intensity of mitochondrial pre-Cox4 Neongreen/ total intensity of mitochondrial pre-Su9 Cardinal) / (total cytosolic pre-Cox4 Neongreen/ total cytosolic pre-Su9 Cardinal).

      We calculate this value for each cell, but we do not have the optical resolution to calculate it for individual mitochondrial fragments.

      Both constructs are driven by the same strong promoter, so transcription of the fluorophore should never limit the uptake. Also, in Figure 3D we compare control and depleted cells with similar total mitochondrial concentration, so the difference must be due to a different import of the two fluorophores, see also Fig S4D. The calculated “MMP” value is of course only a crude proxy for the actual membrane potential in millivolts and we do not want to make any claims on absolute values or quantitative differences. But essentially what we are interested in is “mitochondrial health/activity” and we think the system is good at reporting this. See also 2.2.

      3.13. l230 -- a point of personal interest -- low mito concentrations are connected to low "function" (MMP) and give extended division times -- this is interestingly exactly the model needed to reproduce observations in HeLa cells (https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002416). That model went on to predict several aspects of downstream cellular behaviour -- it would be very interesting to see how compatible that picture (parameterised using HeLa observations) is with yeast!

      Thank you for pointing out your interesting paper, which we will include in our discussion. Another recent preprint about fission yeast (Chacko et al. 2025) also fits into this picture. Since you were kind enough to disclose your identity, we would be happy to discuss this further with you in person if we can maybe follow-up on this.

      3.14. l239 "less mitochondria" -- a bit tricky but I'd say "fewer mitochondria" or "less mitochondrial content"

      Thanks, we will think about how to best rephrase this, probably less mitochondrial content.

      3.15. Section l234 So here (and in Fig 4) the focus is on overall distributions of mitochondrial concentration in different cells (mother-to-be, mother, bud; gen 1, gen >1). But we've just seen that one effect of fzo1 is to broader the distribution of mitochondrial concentration across cells. Can't we look in more depth at the implications of this heterogeneity? For example in Fig 4F (which is cool) we look at the distribution of all fzo1 mothers-to-be, mothers, and buds. But this loses information about the provenance. For example, do mothers-to-be with extremely low mito concentrations just push everything to the bud, while mothers-to-be with high mito concentrations distribute things more evenly? It would seem very easy and very interesting to somehow subset the distribution of mothers-to-be by concentration and see how different subsets behave

      This is a good point. When analyzing the data, we pretty much plotted everything against everything and then chose the graphs that we think will best guide the reader through the story-line. We can make additional supplementary plots where we show the starting concentrations/amounts of the mother in relationship to the resulting split ratio at the end of the cycle (Fig R1D).

      3.16. l285 -- experimental design -- do we know that Atp6 will continue to be a good proxy for functional mtDNA in the face of the perturbations provided by Fzo1 depletion? Especially if there is impact on the expression of mitoribosomes, the relationship between mtDNA and Atp6 may look rather different in the mutant?

      This is actually our top-priority experiment now. We will use the HI-NESS system and possibly DAPI staining to make a more direct link to mtDNA/ nucleoid numbers, see 1.2.

      3.17. l290 -- ruled out mitophagy. This message could be much clearer. Comparing Fig S5C and Fig 3A side-by-side is a needlessly difficult task -- put Fig 3A into Fig S5. Then we see that when mitophagy is compromised, the distribution of mitochondrial concentration has a lower median and much lower upper quartile than in the mitophagy-equipped Fzo1 mutant? What is going on here? For a paper motivated by disentangling coupled mechanisms, this should be made clearer!

      Thanks for pointing this out. We can of course easily include the control in the corresponding figure. Compromising mitophagy is likely to generally affect mitochondrial health and turnover a little bit, independent of what is going on with Fzo1. The second evidence that speaks against large-scale mitophagy is the proteomics data: On population level the dynamics of the respiratory chain proteins are very different from those of other (nuclear encoded) mitochondrial proteins. We will add additional supplementary figures to make this more clear, see Fig R1E. Most mitochondrial proteins in the proteomics experiment stay constant in the first few hours, consistent with the imaging data showing that the mean mitochondrial content of the population does not change initially. This again highlights that it is the unequal distribution which is the problem and not massive degradation of mitochondria.

      3.18. With the Atp6 signal, how do we know that fluorescence from different cells is comparable? Buds will be smaller than mother cells for example, potentially leading to less occlusion of the fluorescent signal by other content in the cytoplasm

      This is of course a general problem that anyone faces doing quantitative fluorescence microscopy. From the technical side, we have done the best we could by taking a reasonable amount of z-slices and by choosing fluorophores that are in a range with little cellular background fluorescence (e.g. Neongreen is much better than GFP). From a practical standpoint, we are always comparing to the control, which is subject to the same technical limitations as the depleted cells and the cell sizes are very similar. So, even if we are systematically overestimating the Atp6 concentration in the bud by a few %, the difference to the control would still be qualitatively true. We therefore do not think that any of our conclusions are affected by this.

      3.19. l343 -- maintenance of mtDNA -- here the point about l285 (is the Atp6-mtDNA relationship the same in the Fzo1 mutant) is particularly important, as we're directly tying findings about the protein product to implications about the mtDNA

      We will carefully address this, see above.

      3.20. l367 -- on a first read this description of the model feels like lots of choices have been made without being fully justified. Why a log-normal distribution (when the fit to the data looks rather flawed); why the choice of 5 groups for nucleoid number (why not 3? or 8?); the process used for parameter fitting is very unclear (after reading the methods I think some of these values are read directly from the data, but the shapes of the distributions remain unexplained). l705 -- presumably the ratio was drawn from a log-normal distribution and then the corresponding nucleoid numbers were rounded to integers? the ratio itself wasn't rounded? (also l367) How were the log-normal distributions fitted to experiments (Figs. S7A,B)? Just by eye?

      We will update our model based on measured nucleoid counts and then explain more stringently the choices we make/ parameters we select.

      3.21. l711 by random selection -- just at random? ("selection" could be confusing) Overall, it feels like the model may be too complicated for what it needs to show. Either (a) the model should show qualitatively that unequal inheritance and reduced production leads to rapid loss -- which a much simpler model, probably just involving a couple of lines of algebra, could show. Or (b) the model should quantitatively reproduce the particular numerical observations from the experiments -- it's not totally clear that it does this (do the cell-cycle-based decay timescales in Fig 7 correspond to the hour-based decay timescales in other plots, for example). At the moment the model is at a (b) level of detail but it's only clear that it's reporting the (a) level of results.

      If the HI-NESS and Fzo1 re-addition experiments work as explained above, all parameters will have direct experimental data, and we should get much closer to (a).

      3.22. A lot of the discussion repeats the results; depending on editorial preferences some of this text could probably be pared back to focus on the literature connections and context.

      We will think about streamlining the discussion once some of the additional material alluded to above has been added.

      3.23. Data availability -- it looks like much of the data required to reproduce the results is not going to be made available. Images and proteomic data are promised, but the data associated with mitochondrial concentration and other features are not mentioned. For FAIR purposes all the data (including statistics from analysis of the images) should be published.

      We maybe didn’t phrase this clearly. All data will be made available. Where technically feasible, this will be directly accessible in a repository, otherwise by request to the corresponding author.

      On our OMERO server, we have deposited many TB of raw images as well as all the intermediate steps such as segmentation masks, and the csv files with all the extracted data for each cell (including background corrections etc). Additionally, we can include csvs with the data grouped in a way that we used to generate all the box blots etc. As of now, the OMERO data is unfortunately only available by requesting a personal guest login from our bioinformatics facility, but we were promised that with the next technical update there will be a public link available. The proteomics data and the model are already fully accessible. The raw western blot images with corresponding ponceau staining will be included with the final publication either as additional supplementary material or in whatever format matches the journal requirements.

      3.24 l660 -- can an overview of the EM protocol be given, to avoid having to buy the Mayer 2024 article?

      The cited paper is open access. But we can also include more details in our method section.

      References:

      Chacko, L. A., H. Nakaoka, R. Morris, W. Marshall, and V. Ananthanarayanan. 2025. 'Mitochondrial function regulates cell growth kinetics to actively maintain mitochondrial homeostasis', bioRxiv.

      Christiano, R., N. Nagaraj, F. Frohlich, and T. C. Walther. 2014. 'Global proteome turnover analyses of the Yeasts S. cerevisiae and S. pombe', Cell Rep, 9: 1959-65.

      Contamine, V., and M. Picard. 2000. 'Maintenance and integrity of the mitochondrial genome: a plethora of nuclear genes in the budding yeast', Microbiol Mol Biol Rev, 64: 281-315.

      Deng, Jingti, Lucy Swift, Mashiat Zaman, Fatemeh Shahhosseini, Abhishek Sharma, Daniela Bureik, Francesco Padovani, Alissa Benedikt, Amit Jaiswal, Craig Brideau, Savraj Grewal, Kurt M. Schmoller, Pina Colarusso, and Timothy E. Shutt. 2025. 'A novel genetic fluorescent reporter to visualize mitochondrial nucleoids', bioRxiv: 2023.10.23.563667.

      Di Bartolomeo, F., C. Malina, K. Campbell, M. Mormino, J. Fuchs, E. Vorontsov, C. M. Gustafsson, and J. Nielsen. 2020. 'Absolute yeast mitochondrial proteome quantification reveals trade-off between biosynthesis and energy generation during diauxic shift', Proc Natl Acad Sci U S A, 117: 7524-35.

      Ebert, A. C., N. L. Hepowit, T. A. Martinez, H. Vollmer, H. L. Singkhek, K. D. Frazier, S. A. Kantejeva, M. R. Patel, and J. A. MacGurn. 2025. 'Sphingolipid metabolism drives mitochondria remodeling during aging and oxidative stress', bioRxiv.

      Jakubke, C., R. Roussou, A. Maiser, C. Schug, F. Thoma, R. Bunk, D. Horl, H. Leonhardt, P. Walter, T. Klecker, and C. Osman. 2021. 'Cristae-dependent quality control of the mitochondrial genome', Sci Adv, 7: eabi8886.

      Khan, Abdul Haseeb, Xuefang Gu, Rutvik J. Patel, Prabha Chuphal, Matheus P. Viana, Aidan I. Brown, Brian M. Zid, and Tatsuhisa Tsuboi. 2024. 'Mitochondrial protein heterogeneity stems from the stochastic nature of co-translational protein targeting in cell senescence', Nature Communications, 15: 8274.

      Martin, J., K. Mahlke, and N. Pfanner. 1991. 'Role of an energized inner membrane in mitochondrial protein import. Delta psi drives the movement of presequences', J Biol Chem, 266: 18051-7.

      Osman, C., T. R. Noriega, V. Okreglak, J. C. Fung, and P. Walter. 2015. 'Integrity of the yeast mitochondrial genome, but not its distribution and inheritance, relies on mitochondrial fission and fusion', Proc Natl Acad Sci U S A, 112: E947-56.

      Perić, Matea, Peter Bou Dib, Sven Dennerlein, Marina Musa, Marina Rudan, Anita Lovrić, Andrea Nikolić, Ana Šarić, Sandra Sobočanec, Željka Mačak, Nuno Raimundo, and Anita Kriško. 2016. 'Crosstalk between cellular compartments protects against proteotoxicity and extends lifespan', Scientific Reports, 6: 28751.

      Roussou, Rodaria, Dirk Metzler, Francesco Padovani, Felix Thoma, Rebecca Schwarz, Boris Shraiman, Kurt M. Schmoller, and Christof Osman. 2024. 'Real-time assessment of mitochondrial DNA heteroplasmy dynamics at the single-cell level', The EMBO Journal, 43: 5340-59-59.

      Seel, A., F. Padovani, M. Mayer, A. Finster, D. Bureik, F. Thoma, C. Osman, T. Klecker, and K. M. Schmoller. 2023. 'Regulation with cell size ensures mitochondrial DNA homeostasis during cell growth', Nat Struct Mol Biol, 30: 1549-60.

      Vowinckel, J., J. Hartl, R. Butler, and M. Ralser. 2015. 'MitoLoc: A method for the simultaneous quantification of mitochondrial network morphology and membrane potential in single cells', Mitochondrion, 24: 77-86.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This article addresses the connection between perturbed mitochondrial structure and genetics in yeast. When mitochondrial fusion is compromised, what is the chain of causality -- the mechanism -- that leads to mtDNA populations becoming depleted? This is a fascinating question, linking physical cell biology to population genetics. I admire the philosophy of the research, acknowledging and attempt to control for the many possible confounding influences. The manuscript describes the context and the research tightly and digestibly; the figures illustrate the results in a clear and natural way.

      For transparency, I am Iain Johnston and I am happy for this review to be treated as public domain. To my eyes my most important shortcoming as a review is my relative lack of familiarity with the yeast fzo1 mutant; while I am familiar with analysis of yeast mito morphology and mtDNA segregation, a reviewer familiar with the nuances of this strain and its culture would be a useful complement.

      I have a few more general points and a collection of smaller points below that I believe might help make the story more robust.

      General points

      1. About the use of Atp6 as a good proxy for mtDNA content. This is assumed from l285 onwards, based on a previous publication. As the link is fairly central to part of the paper's arguments, and the system in this study is being perturbed in several different ways, a stronger argument or demonstration that this link remains intact (and unchanged, as it is used in comparisons) would seem important.
      2. About confounding variables and processes. The study does an admirable job of being transparent and attempting to control for the many different influences involved in the physical-genetic link. But some remain less clearly unpacked, including some I think could be quite important. For example, there is a lot of focus on mito concentration -- but given the phenotypes are changing the sizes of cells, do concentration changes come from volume changes, mito changes, or both? In "ruling out" mitophagy -- a potentially important (and intuitive) influence, the argument is not presented as directly as it could be and it's not completely clear that it can in fact be ruled out in this way. There are a couple of other instances which I've put in the smaller points below.

      Smaller points

      l47 full genus name when it first appears

      l58 I may be wrong here, but I thought the petite phenotype more classically arises from mtDNA deletion mutations, not loss? The way this is phrased implies that mtDNA loss is [always] the cause. Whether I'm wrong on that point or not, the petite phenotype should be described and referenced.

      para starting l59 -- should mention for context that mitochondria in (healthy, wildtype) yeast are generally much more fused than in other organisms

      Fig 1C -- very odd choice of y-axis range! either start at zero or ensure that the data fill as much vertical space of the plot as possible

      l105 "wild-type like more tubular mitochondria" reads rather awkwardly. "more tubular mitochondria (as in the wild-type)"?

      l106 -- imaging artefacts? are mitos fragmenting because of photo stress? -- this is mentioned in l577-8 in the Methods, but the data from the growth rate and MMP comparison isn't given -- an SI figure would be helpful here. It would be reassuring to know that mito morphology wasn't changing in response to phototoxicity too.

      para l146 -- so this suggests mtDNA-encoded proteins have a very rapid turnover, O(hours) -- is this known/reasonable?

      section l189 -- it's hard to reason fully about these statistics of mitochondrial concentration given that the petite phenotype is fundamentally affecting overall cell volume. can we have details on the cell size distribution in parallel with these results? to put it another way -- how does mitochondrial amount per cell change?

      l199 the mean in Fig S3C certainly does change -- it increases, clearly relative both to control and to its initial value. rather than sweeping this under the carpet we should look in more detail to understand it (a consequence of the increased skew of the distribution)?

      para line 206 -- this doesn't make it clear whether your MMP signal is integrated over all mitochondria in the cell, or normalised by mitochondrial content? this matters quite a lot for the intepretation if the distributions of mitochondrial content are changing. reading on, this is even more important for para line 222. Reading further on, there is an equation on l612 that gives a definition, but it doesn't really clarify (apologies if I'm misunderstanding).

      l230 -- a point of personal interest -- low mito concentrations are connected to low "function" (MMP) and give extended division times -- this is interestingly exactly the model needed to reproduce observations in HeLa cells (https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002416). That model went on to predict several aspects of downstream cellular behaviour -- it would be very interesting to see how compatible that picture (parameterised using HeLa observations) is with yeast!

      l239 "less mitochondria" -- a bit tricky but I'd say "fewer mitochondria" or "less mitochondrial content"

      Section l234 So here (and in Fig 4) the focus is on overall distributions of mitochondrial concentration in different cells (mother-to-be, mother, bud; gen 1, gen >1). But we've just seen that one effect of fzo1 is to broader the distribution of mitochondrial concentration across cells. Can't we look in more depth at the implications of this heterogeneity? For example in Fig 4F (which is cool) we look at the distribution of all fzo1 mothers-to-be, mothers, and buds. But this loses information about the provenance. For example, do mothers-to-be with extremely low mito concentrations just push everything to the bud, while mothers-to-be with high mito concentrations distribute things more evenly? It would seem very easy and very interesting to somehow subset the distribution of mothers-to-be by concentration and see how different subsets behave

      l285 -- experimental design -- do we know that Atp6 will continue to be a good proxy for functional mtDNA in the face of the perturbations provided by Fzo1 depletion? Especially if there is impact on the expression of mitoribosomes, the relationship between mtDNA and Atp6 may look rather different in the mutant?

      l290 -- ruled out mitophagy. This message could be much clearer. Comparing Fig S5C and Fig 3A side-by-side is a needlessly difficult task -- put Fig 3A into Fig S5. Then we see that when mitophagy is compromised, the distribution of mitochondrial concentration has a lower median and much lower upper quartile than in the mitophagy-equipped Fzo1 mutant? What is going on here? For a paper motivated by disentagling coupled mechanisms, this should be made clearer!

      With the Atp6 signal, how do we know that fluorescence from different cells is comparable? Buds will be smaller than mother cells for example, potentially leading to less occlusion of the fluorescent signal by other content in the cytoplasm

      l336 -- similar to the Jajoo et al. mechanism in fission yeast -- but are you talking about feedback control of the mtDNA or the protein (or mRNA) product?

      l343 -- maintenance of mtDNA -- here the point about l285 (is the Atp6-mtDNA relationship the same in the Fzo1 mutant) is particularly important, as we're directly tying findings about the protein product to implications about the mtDNA

      l367 -- on a first read this description of the model feels like lots of choices have been made without being fully justified. Why a log-normal distribution (when the fit to the data looks rather flawed); why the choice of 5 groups for nucleoid number (why not 3? or 8?); the process used for parameter fitting is very unclear (after reading the methods I think some of these values are read directly from the data, but the shapes of the distributions remain unexplained). l705 -- presumably the ratio was drawn from a log-normal distribution and then the corresponding nucleoid numbers were rounded to integers? the ratio itself wasn't rounded? (also l367) How were the log-normal distributions fitted to experiments (Figs. S7A,B)? Just by eye? l711 by random selection -- just at random? ("selection" could be confusing) Overall, it feels like the model may be too complicated for what it needs to show. Either (a) the model should show qualitatively that unequal inheritance and reduced production leads to rapid loss -- which a much simpler model, probably just involving a couple of lines of algebra, could show. Or (b) the model should quantitatively reproduce the particular numerical observations from the experiments -- it's not totally clear that it does this (do the cell-cycle-based decay timescales in Fig 7 correspond to the hour-based decay timescales in other plots, for example). At the moment the model is at a (b) level of detail but it's only clear that it's reporting the (a) level of results.

      A lot of the discussion repeats the results; depending on editorial preferences some of this text could probably be pared back to focus on the literature connections and context.

      Data availability -- it looks like much of the data required to reproduce the results is not going to be made available. Images and proteomic data are promised, but the data associated with mitochondrial concentration and other features are not mentioned. For FAIR purposes all the data (including statistics from analysis of the images) should be published.

      l660 -- can an overview of the EM protocol be given, to avoid having to buy the Mayer 2024 article?

      Significance

      This is a powerful and thoughtful study that provides a collection of new mechanistic insights into the link between physical and genetic properties of mitochondria in yeast. Cell biologists, geneticists, and the mitochondrial field will find this of potentially deep interest. Because of the mode and dynamics of inheritance in budding yeast, findings here may not be directly transferrable to other eukaryotes, but these insights are still of interest for researchers outside of yeast for their insight into how this well-studied system manages its mitochondrial populations.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Dengler and colleagues use an AID-based acute depletion of Fzo1 in budding yeast, coupling microfluidics live imaging, single-cell quantification (>30k cells), proteomics, an mtDNA-encoded Atp6 reporter, and simple modeling to argue that fusion loss causes (i) rapid fragmentation and ΔΨm decline, (ii) progressive mtDNA/RC depletion, and (iii) unequal mother-daughter mitochondrial inheritance; together with a failure of compensatory synthesis, these changes drive petite formation. The time-resolved design is valuable, but several readouts are indirect, and some claims (particularly those regarding membrane potential, synthesis "failure," and causality) appear over-interpreted without additional controls.

      Major points

      1. While inducible TIR is used to reduce background, the manuscript should rigorously exclude auxin/TIR off-targets (growth, mitochondrial phenotypes, gene expression). Please include full matched controls: {plus minus}auxin, {plus minus}TIR, epitope tag alone, and a degron control on an unrelated mitochondrial membrane protein.
      2. The Mitoloc preSu9 vs Cox4 import ratio is only a proxy of mitochondrial membrane potential (ΔΨm) and itself depends on mitochondrial mass, protein expression, matrix ATP, and import saturation. The authors need to calibrate ΔΨm with orthogonal dyes (TMRE/TMRM) and pharmacologic titrations (FCCP/antimycin/oligomycin) to generate a response curve; show that Mitoloc tracks dye-based ΔΨm across the relevant range and corrects for mass/photobleaching. Report single-cell ΔΨm vs mass residuals.
      3. To use Atp6-mNeon as a proxy for mtDNA is an assumption. Interpreting Atp6 intensity as "functional mtDNA" could be confounded by translation, turnover, or assembly. Please (i) report mtDNA copy number time courses (you have qPCR), nucleoid counts (DAPI/PicoGreen or TFAM/Abf2 tagging), and (ii) assess translation (e.g., 35S-labeling or puromycin proxies) and turnover (proteasome/AAA protease inhibition, mitophagy mutants -some data are alluded to- plus mRNA levels for mtDNA-encoded genes). This will support the "reduced synthesis" versus "increased degradation" conclusion.
      4. The promoter-NeonGreen reporters argue against transcriptional down-regulation of nuclear OXPHOS. Please add mRNA (RT-qPCR/RNA-seq) for representative genes and a pulse-chase or degradation-pathway dependency (e.g., proteasome/mitophagy/autophagy mutants) to firmly assign active degradation. The authors need to normalize proteomics to mitochondrial mass (e.g., citrate synthase/porin) to separate organelle abundance from protein turnover.
      5. Using preSu9-mCardinal intensity as "mitochondrial concentration" is sensitive to expression, import competence, and morphology/segmentation. The authors should provide validation that this metric tracks 3D volume across fragmentation states (e.g., correlation with mito-GFP volumetrics; detergent-free CS activity; TOMM20/Por1 immunoblot per cell).
      6. The unequal mother-daughter distribution is compelling, but causality remains inferred. Test whether modulating inheritance machinery (actin cables/Myo2, Num1, Mmr1) or altering fission (Dnm1 inhibition) modifies segregation defects and rescues mtDNA/Atp6 decline. Complementation with Fzo1 re-expression at defined times would help order the phenotype cascade.
      7. The model is useful but should include parameter sensitivity (segregation variance, synthesis slopes, initial nucleoid number) and prospective validation (e.g., predict rescue upon partial restoration of synthesis or inheritance, then test experimentally).

      Significance

      The dataset is rich and the time-resolved approach strong, but key conclusions rely on indirect proxies and need orthogonal validation and at least one causal rescue experiment to avoid over-interpretation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by Dengler et al examines the mechanisms underlying the mtDNA depletion observed in cells where mitochondrial fusion is disrupted by depletion of the fusion factor Fzo1. In Saccharomyces cerevisiae, the authors deplete Fzo1 and use live-cell imaging of thousands of cells to follow the effects and their dynamic following Fzo1 depletion. They find that Fzo1-depleted cells show very rapid mitochondrial fragmentation (within 1h of Fzo1 depletion), and also an immediate drop in mitochondrial membrane potential (MMP). MtDNA is lost by 15h, and along with it the expression of mitochondrially-encoded proteins. Nuclear-encoded mitochondrial proteins are also decreased though somewhat later, and the authors find that this is largely due to their degradation (probably a consequence of lack of mitochondrial import into low-MMP cells). Most importantly, the study identifies two separate mechanisms that together contribute to the loss of mt-encoded proteins in Fzo1-depleted cells: unequal distribution of mitochondria during cell division and the reduction of a fusion-dependent compensatory synthesis of mt-encoded proteins. Unexpectedly, Fzo1-depleted cells end up passing an increased (rather than decreased) amount of mitochondria and mitochondria-encoded proteins to their daughters. Over several generations, and combined with the loss of the compensatory synthesis of more mt-encoded proteins, this leads to the progressive loss of mtDNA and mtDNA-encoded proteins in the population.

      The study is very thorough and the experiments contain the appropriate controls. The conclusions are convincing and largely supported by the experimental data that has been appropriately replicated. The data presentation is generally clear although the text could benefit from some streamlining.

      However, addressing the following major comments is required:

      1. Statistical comparisons are missing throughout the manuscript (with the exception of Fig. 2c). Appropriate statistical tests, along with p-values, should be used and reported where different gorups are compared, for example (but not limited to) Fig. 3d and most panels of Fig. 4.
      2. I do not agree with the use of Atp6 protein as a direct read-out of mtDNA content. While Atp6 protein levels will decrease with decreasing mtDNA content, the inverse is not necessarily true: decreased Atp6 protein levels do not necessarily indicate decreased mtDNA levels, because they could alternatively or additionally be caused by decreased transcription and/or translation. Therefore, please do not equate Atp6 protein levels to mtDNA levels, and instead rephrase the text referencing the Atp6 experiments in the Results and Discussion sections to measure "mtDNA expression" or "mt-encoded protein" or similar. For example, on p. 14 line 431 should read "mtDNA expression" rather than "decreased synthesis of mtDNA", and line 440 on the same page "mean mtDNA levels" should be "mtDNA expression" or similar.
      3. In Fig. 3, the authors use the fluorescence intensity of a mitochondrially-targeted mCardinal as a read-out of mitochondrial mass. Please provide evidence that this is not affected by MMP, either with relevant references or by control experiments (e.g. comparing it to N-acridine orange or other MMP-independent dyes or methods).
      4. In Fig. 2e-f, the authors use a promoter reporter with Neongreen to answer whether the reduced levels of the nuclear-encoded mitochondrial proteins Mrps5 and Qcr7 are due to decreased expression or to protein degradation, and find no evidence of degradation of the Neongreen reporter protein. However, subcellular localization might affect the availability of the protein to proteases. Although not absolutely required, it would be relevant to know if the Neongreen fusion protein is found in the same subcellular compartment as Mrps5 and Qcr7 at 0h and 9h after Fzo1 depletion.
      5. Fzo1 depletion leads to a very rapid drop in MMP during the first hour of depletion. In the Discussion, can the authors speculate on the possible mechanism of this rapid MMP drop that occurs well before mtDNA or mt-encoded proteins are decreased in level?
      6. In Fig. 2a, the mtDNA copy number of Fzo1-depleted cells is ca 1.3-fold of the control cells at the 0h timepoint. Why might this be? Is it an impact of one of the inducers? If so, we might be looking at the combination of two different processes when measuring copy number: one that is an induction caused by the inducer(s), and the other a consequence of Fzo1 depletion itself.

      Minor comments:

      • p. 3, line 71: "ten thousands of dividing cells.." should be "tens of thousands of dividing cells".
      • p.4, line 116: please be even more clear with what the "depleted" cells and controls are treated with: are depleted cells treated with both inducers, and controls with neither?
      • p.5, lines 147-148: the authors write "the rate with which the abundance of Cox2 and Var1 proteins decreases was similar to the rate of mtDNA loss" though the actual rate is not shown. Please calculate and show rates for these processes side by side to make comparison possible, or alternatively rephrase the statement.
      • Fig. 2d: changing the y-axis numbering to match those in panels a and b would facilitate comparisons.
      • Fig. 2e: it is recommended to label the western blot panels to indicate what protein is being imaged in each (Neongree,, Mrps5, Qcr7).
      • p.9, line 262: I suggest referencing Fig. 4e at the end of the first sentence for clarity.
      • In the sections related to Fig. 3a and Fig. 5a as well as the connected supplemental data, the authors discuss both the median and the mean of mitochondrial mass and Atp6 protein, respectively. For purposes of clarity, I suggest decreasing the focus on the mean (that is provided only in the supplemental data) and focusing the text mainly on the median. The two show differing trends and it is very good that both are shown, but the clarity of the text can be improved by focusing more on the median where possible.
      • p. 14, line 435: the statement that mt mass is maintained over the first 9h of depletion is only true for the mean mt mass, not for the median. Please make this clear or rephrase.
      • p.14, line 452: "mitofusions" should be "mitofusins".

      Referees cross-commenting

      I think that the reviews of the other two reviewers are both insightful and constructive. Especially the rescue experiment suggested by Reviewer 2 could provide strong support for the interpretations of the study. Note that all three reviewers ask for validation of the use of Atp6p as a read-out of mtDNA function, and that all agree the data is powerful and the study of value to the field.

      Significance

      The fact that disruption of mt fusion leads to mtDNA loss has been known for some time, but the mechanism behind this phenomenon has remained unknown to date. This thorough and precise study by Dengler et al uses state-of-the-art single-cell analysis to dissect the mechanisms underlying the mtDNA loss following the disruption of mt fusion, and convincingly reveal that it is caused by two different mechanisms: i) the inequal inheritance of mitochondria between mother and bud, and ii) the loss of a compensatory mechanism that normally maintains homeostatic mt protein levels. In the process, the authors shed light on the dynamics of the events following Fzo1 depletion, revealing dramatically fast mt fragmentation and a loss of MMP, which in turn can be expected to act as a stress signal and influence a number of cellular processes.

      The findings of the study can have relevance for human conditions involving disrupted mitochondrial dynamics, caused for example by mutations in mitofusins. The study will be of interest to researchers in mitochondrial biology ranging from dynamics and mtDNA maintenance to mitochondrial medicine.

      The field of expertise of this reviewer: mtDNA maintenance. I am not able to properly evaluate the modelling in Fig. 7.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to the reviewers

      We thank the reviewers for recognizing the importance of study, and how it “addresses a long-standing question in the heterogeneity of cellular responses to stressors”, “makes a conceptual advance by identifying transcription factors as the limiting determinant of IFN-β induction in KSHV-infected cells”, and “serves as a crucial starting point for understanding cellular heterogeneity”. We agree that our findings appeal to a broad audience interested in virology, immunology, cell biology, and gene transcription.

      We also thank the reviewers for their insightful suggestions that will greatly strengthen our study. Below we detail how we plan to address their comments experimentally and how we have already edited the text to respond to them.

      Referee #1

      One experiment that may provide some insight into the selective RelA activation is to quantify viral genomes within the high and low IFN producing cells. Perhaps, the genome as a PAMP, is more abundant in the inducing cells.

      We have added a note in the Discussion section (line 417) that we have evidence that the cGAS PAMP in our system is mitochondrial DNA, not viral DNA. Moreover, our results suggest that the variation in PAMP levels are not the source of heterogeneity, as this would cause heterogeneous activation of the cGAS-STING-TBK1-IRF3 axis. Instead, we have discovered that TBK1 and IRF3 are activated even in cells without interferon-β induction.

      Referee #2

      1) While the study presents intriguing evidence for AP-1 involvement in regulating IFN-β responses, the reliance on total c-Jun levels as a readout is limiting. Because c-Jun activity depends on phosphorylation and promoter binding, additional experiments (i.e., phospho-c-Jun analysis or ChIP at the IFNB1 promoter) would strengthen the link between AP-1 activity and the observed reporter outcomes.

      We agree that that a stronger link between AP-1 activity and IFN induction would improve our study, so we have cloned interferon-β reporter constructs that contain mutations in the AP-1 binding sites. We plan to use this reporter, as well as IFN-β reporter constructs that contain mutations in either the AP-1, IRF3, or NF-κB binding sites, to mechanistically test the connection between AP-1 and activation of the IFN promoter. As a control, we will test that the mutations block reporter induction after stimulation with a well characterized agonist of the IFN induction pathway such as poly(I:C). We have previously investigated c-Jun and ATF2 phosphorylation during KSHV reactivation and caspase inhibition. Surprisingly, in preliminary experiments we did not detect phosphorylation of either AP-1 subunit. We will confirm this result and add these data to the manuscript.

      2) The data presented demonstrating that Serine 386 phosphorylation does not distinguish first responder cells is strong. Including complementary data on Ser396 phosphorylation would strengthen the conclusion, as this well-established activation marker is readily detectable with available reagents and would help confirm that the potentiation of IRF3 activity is not the driver of the observed heterogeneity.

      We will complement the Ser386 results with Ser396 staining.

      3) Consider updating the title to more directly reflect the findings (e.g. "Interferon-β induction heterogeneity during KSHV infection is correlated to expression of ATF2 and RelA")

      We have updated the title to “Interferon-β induction heterogeneity during KSHV infection is correlated to levels and activation of the transcription factors ATF2 and RelA, and not IRF3”

      *4) To ease the interpretation of data, indicate what the black and white circles indicate in the figure legends. *

      We have updated the figures to be more intuitive, using + and -.

      5) IE ORF50 is used to show no differences between first responders and non-responders, but showing early and late genes across tdTomato positive and negative cells would rule out potential differences in progression through reactivation.

      We added a clarification in the Results section (line 195), explaining that we have examined the progression of viral reactivation through single-cell transcriptomics in our previous publication, and that the results indicate that viral gene expression plays a small role in interferon-β heterogeneity. We favor the scRNA-seq dataset for this conclusion, because the tdTomato negative cell population represents a mix of non-reactivating cells, which would not be expected to make IFN, and reactivating cells that fail to induce IFN expression.

      6) The data in Figure 5D (quantified in E and F) show a compelling trend. This could be further clarified by plotting a trend line that connects the results of independent experiments, rather than only showing individual data points. Such visualization would make the consistency of the observed trend across experiments more apparent.

      We have added lines in the graphs in Figure 5 to ease visualization.

      Referee #3

      A major worry comes from using lentiviral transduction to insert the reporter promoter into cells without selecting for clones. Lentiviral transduction introduces heterogeneity due to random insertion of their vector. This results in different copy numbers for the reporter construct, leading to heterogeneity in the reporter expression. Additionally, the expression of foreign proteins, particularly in immune cells, can be perceived as danger signals (10.1007/s12015-016-9670-8) and occasionally trigger p65 activation. To control for this, the authors could validate their reporter results by including a non-IFNb promoter (e.g., constitutive) expressing tdTomato and verifying that these cell populations do not also express endogenous IFNb mRNAs.

      We did not select clonal cell lines because different cells may have different reactivation propensity. Moreover, we did not want the tdTomato signal to reflect specific regulation of a single genomic region. We have now added an explanation as to why we did not clonally select that cell lines in the Results section (line 157). Our control conditions that do not result in IFN-β induction show that lentiviral insertion is not sufficient to cause IFN induction, as we did not detect IFN-β mRNA in the untreated reporter cells (first bar in Figure 1C). We also clarified in the Results section (line 184) that the selective enrichment of both IFN-β and tdTomato mRNA in the sorted tdTomato+ cells demonstrates that tdTomato is a faithful reporter for rare IFN-β expression, regardless of heterogeneous lentiviral transduction in the population. To further verify that lentiviral transduction does not play a role in introducing heterogeneity in induction of our tdTomato reporter and of IFN-β, we will measure IFN-β levels in BC-3 cells constitutively expressing tdTomato, which we have already created. We may also sort BC-3 cells constitutively expressing tdTomato and check that the tdTomato signal is not predictive of IFN expression in these cells. However, the expectation is that all or most cells will be tdTomato positive, which may make sorting for tdTomato negative cells impossible.

      Regarding AP1 and NF-kB activation, the authors could investigate downstream genes such as GADD45B, HSPA1A, and ATF-3 (for AP1), and IL-6, TNFAIP3 (A20) (for both AP1 and NFkB). It would be interesting to determine if these genes are exclusively expressed in tdTomato-expressing cells.

      We will quantify the mRNA levels of these genes by performing qPCR on our cDNA from sort experiments. So far, we have detected IL-6 induction but no enrichment of this transcript in the sorted tdTomato+ samples.

      While the authors observed no direct correlation between c-Jun alone and IFN-b production, it is conceivable that TPA-induced c-Jun primes the cells that become fully transcriptionally active upon a stimulus like viral reactivation. I propose that the authors attempt to inhibit c-Jun activation during KSHV reactivation (TPA + caspase inhibitor) using inhibitors like SP600125 and subsequently assess whether this blockade reduces the proportion of IFNb+ cells.

      We have tried using the suggested inhibitor (SP600125), but found that it inhibits KSHV reactivation, making any result on IFN levels difficult to interpret. Currently, we are testing a dual AP-1 and NF-κB inhibitor (SP100030) and may add these data to the results if we do not encounter similar issues.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Kaku and Gaglia's study provides one step further in the ongoing debate surrounding viral versus innate immune heterogeneity. They addressed this question by creating a reporter B cell line (BC-3) that expresses tdTomato under the IFNb promoter. This particular cell line is known to be latently infected with KSHV, and lytic infection can be induced by treatment with the protein kinase C activator TPA.

      Through FACS sorting upon KSHV lytic infection, the authors observed a correlation between the promoter reporter activation and endogenous IFNb and IFNl mRNA levels. However, this correlation did not extend to viral replication, indicating that only a fraction of virus-infected cells produce IFN. They identified these cells as "first responders" upon viral replication by treating them with anti-IFNAR, confirming that IFN production is triggered by the cGAS-STING pathway sensing lytic virus infection.

      Surprisingly, p-IRF3 activation was not limited to IFN-producing cells, suggesting the involvement of other key transcription factors. Indeed, they found a correlation between NF-kB and AP1 activation and IFN production. The study concludes that the combined action of NF-kB, AP1, and IRF3 is crucial for robust IFN production.

      Major comment

      The author effectively dissects the necessary components for IFNb activation, despite acknowledging the limitations of their findings. All my potential anecdotal queries, such as the role of other viruses or agonists and the treatment of cells with NF-kB inhibitors, are thoroughly addressed in their discussion.

      However, a major worry comes from using lentiviral transduction to insert the reporter promoter into cells without selecting for clones. Lentiviral transduction introduces heterogeneity due to random insertion of their vector. This results in different copy numbers for the reporter construct, leading to heterogeneity in the reporter expression. Additionally, the expression of foreign proteins, particularly in immune cells, can be perceived as danger signals (10.1007/s12015-016-9670-8) and occasionally trigger p65 activation. To control for this, the authors could validate their reporter results by including a non-IFNb promoter (e.g., constitutive) expressing tdTomato and verifying that these cell populations do not also express endogenous IFNb mRNAs.

      Minor comments

      Regarding AP1 and NF-kB activation, the authors could investigate downstream genes such as GADD45B, HSPA1A, and ATF-3 (for AP1), and IL-6, TNFAIP3 (A20) (for both AP1 and NFkB). It would be interesting to determine if these genes are exclusively expressed in tdTomato-expressing cells.

      While the authors observed no direct correlation between c-Jun alone and IFN-b production, it is conceivable that TPA-induced c-Jun primes the cells that become fully transcriptionally active upon a stimulus like viral reactivation. I propose that the authors attempt to inhibit c-Jun activation during KSHV reactivation (TPA + caspase inhibitor) using inhibitors like SP600125 and subsequently assess whether this blockade reduces the proportion of IFNb+ cells.

      Significance

      The study presents a valuable dataset and serves as a crucial starting point for understanding cellular heterogeneity, particularly regarding the known concept of IRF+NFkB in IFNb production. While this mechanism isn't novel (10.1074/jbc.273.5.2714), the authors demonstrated the difference in activation in a cellular level. This finding can be the basis of future research utilizing more physiologically relevant models, such as primary cells or tissues, to identify factors contributing to varying cellular responses.

      However, the authors acknowledge that these findings should be interpreted with caution and require further validation through additional studies across different models and viral infections. This research will be particularly relevant to those in basic research seeking to deepen their understanding of the dynamic differences in innate immune responses and viral infections.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript "Interferon-β induction heterogeneity during KSHV infection is correlated to expression and activation of enhanceosome transcription factors other than IRF3", Kaku and Gaglia address a long-standing question in the initiation of host antiviral responses: what drives the heterogeneity in the initiation of IFN responses within a presumably homogenous population of cells. In this study, the authors focus on host factors that contribute to the heterogeneity in IFN induction. They use a KSHV lytic reactivation model (TPA + Caspase inhibitor treated BC-3 cells) and FACS-based reporter assays to enhance the resolution in the detection of molecular drivers of "first responder" cells that make IFN. They find that IRF3 activation alone does not predict IFN expression; rather, the expression of ATF2 and RelA is predictive of IFN-β induction. The authors carefully control for off-target effects of TPA treatment in BJAB cells and paracrine signaling through the inclusion of IFN-neutralizing antibodies. Overall, the manuscript is well-written and easy to follow, and the data compellingly support their conclusion that cell-specific transcription factor activity limits IFN production to cell subsets. Demonstrating coordinated occupancy or functional interplay of these factors would increase confidence in the proposed model and broaden the impact for readers interested in virology, immunology, and transcriptional regulation.

      Comments:

      1. While the study presents intriguing evidence for AP-1 involvement in regulating IFN-β responses, the reliance on total c-Jun levels as a readout is limiting. Because c-Jun activity depends on phosphorylation and promoter binding, additional experiments (i.e., phospho-c-Jun analysis or ChIP at the IFNB1 promoter) would strengthen the link between AP-1 activity and the observed reporter outcomes.
      2. The data presented demonstrating that Serine 386 phosphorylation does not distinguish first responder cells is strong. Including complementary data on Ser396 phosphorylation would strengthen the conclusion, as this well-established activation marker is readily detectable with available reagents and would help confirm that the potentiation of IRF3 activity is not the driver of the observed heterogeneity.

      Minor Comments:

      1. Consider updating the title to more directly reflect the findings (e.g. "Interferon-β induction heterogeneity during KSHV infection is correlated to expression of ATF2 and RelA".
      2. To ease the interpretation of data, indicate what the black and white circles indicate in the figure legends.
      3. The authors predominantly rely on the IE gene ORF50 as a marker of KSHV reactivation and show no differences in expression between first responder cells and those that don't. Measurement of early and late genes across TdTomato-positive and negative cells would rule out potential differences in progression through reactivation that might influence IFN production.
      4. The data in Figure 5D (quantified in E and F) show a compelling trend. This could be further clarified by plotting a trend line that connects the results of independent experiments, rather than only showing individual data points. Such visualization would make the consistency of the observed trend across experiments more apparent.

      Significance

      This study addresses an important and long-stading question in the heterogeneity of cellular responses to stressors, such as viruses. The study is well designed and presented, making it appealling to a broad audience interested in virology, immunology, cell biology, and gene transcription.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript discusses why only a small proportion of KSHV infected cells produce high level of IFN during lytic reactivation in the presence of caspase inhibition, highlighting cellular heterogeneity as a key factor in innate immune regulation in KSHV infection. Using a dt-TOMATO reporter fused to the IFN promoter, the author generated a stable cell line in BC3 cells (a primary effusion lymphoma line). The authors observed that while IRF3 and TBK1 were activated in nearly all infected cells, only those with both high baseline levels of the AP-1 component ATF2 and activated NF-κB (phosphorylated RelA) produced robust FN. These findings suggest that AP-1 and NF-B, rather than IRF3, are the limiting factors for IFN induction in individual cells.

      Overall the findings are interesting and important. While there remain many unknowns, such as why RelA is activated in only a subset of cells, this manuscript takes us one step close to determining how IFN is ultimately induced in KSHV infection.

      One experiment that may provide some insight into the selective RelA activation is to quantify viral genomes within the high and low IFN producing cells. Perhaps, the genome as a PAMP, is more abundant in the inducing cells.

      Significance

      The study makes a conceptual advance by identifying transcription factors as the limiting determinant of IFN-β induction in KSHV-infected cells, highlighting how innate immune responses are regulated during herpesvirus infection and how the regulation influences viral persistence and immune evasion.

      The above findings will be of important to researchers studying herpesvirus biology, innate immunity (IFN signaling), and host-pathogen interaction.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the editor and the reviewers for their positive and constructive comments. Below is our point-by-point responses.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Metabolic dysfunction-associated steatotic liver disease (MASLD) ranges from simple steatosis, steatohepatitis, fibrosis/cirrhosis, and hepatocellular carcinoma. In the current study, the authors aimed to determine the early molecular signatures differentiating patients with MASLD associated fibrosis from those patients with early MASLD but no symptoms. The authors recruited 109 obese individuals before bariatric surgery. They separated the cohorts as no MASLD (without histological abnormalities) and MASLD. The liver samples were then subjected to transcriptomic and metabolomic analysis. The serum samples were subjected to metabolomic analysis. The authors identified dysregulated lipid metabolism, including glyceride lipids, in the liver samples of MASLD patients compared to the no MASLD ones. Circulating metabolomic changes in lipid profiles slightly correlated with MASLD, possibly due to the no MASLD samples derived from obese patients. Several genes involved in lipid droplet formation were also found elevated in MASLD patients. Besides, elevated levels of amino acids, which are possibly related to collagen synthesis, were observed in MASLD patients. Several antioxidant metabolites were increased in MASLD patients. Furthermore, dysregulated genes involved in mitochondrial function and autophagy were identified in MASLD patients, likely linking oxidative stress to MASLD progression. The authors then determined the representative gene signatures in the development of fibrosis by comparing this cohort with the other two published cohorts. Top enriched pathways in fibrotic patients included GTPase signaling and innate immune responses, suggesting the involvement of GTPase in MASLD progression to fibrosis. The authors then challenged human patient derived 3D spheroid system with a dual PPARa/d agonist and found that this treatment restored the expression levels of GTPase-related genes in MASLD 3D spheroids. In conclusion, the authors suggested the involvement of upregulated GTPase-related genes during fibrosis initiation. Overall, the current study might provide some resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. However, several concerns should be carefully addressed.

      1. A recent study, via proteomic and transcriptomic analysis, revealed that four proteins (ADAMTSL2, AKR1B10, CFHR4 and TREM2) could be used to identify MASLD patients at risk of steatohepatitis (PMID: 37037945). It is not clear why the authors did not include this study in their comparison. Thank you for the suggestion. The RNA sequencing dataset (GSE135251) from study PMID 37037945 is the same dataset we used as an external benchmark in our study, referred to as the EU cohort on page 4 in the manuscript. In addition to PMID 37037945, we have cited the original transcriptomic study (PMID 33268509) for the EU cohort. In the revised manuscript, we discussed this proteome-transcriptome paper in the Discussion section and highlighted the potential of AKR1B10 as a biomarker in early MASLD.

      The authors recruited 109 patients but only performed transcriptomic and metabolomic analysis in 94 liver samples. Why did the authors exclude other samples?

      We thank the reviewer for their question and we understand the confusion. The discrepancy in sample size between liver and plasma cohorts is due to the fact that, for certain cases, we were unable to get sufficient liver tissue slices (“Exclusion criteria included: age The authors mentioned clinical data in Table 1 but did not present the table in this manuscript.

      Table 1 (key patient characteristics) was included in the main document after the Methods section, and Table S1 (additional patient characteristics) was provided as a supplemental file in our original submission.

      The generated metabolomic data could be a very useful resource to the MASLD community. However, it is very confusing how the data was generated in those supplemental tables. There is no clear labeling of human clinical information in those tables. Also, what do those values mean in columns 47-154? This reviewer assumed that they are the raw data of metabolomic analysis in plasma samples. However, without clear clinical information in these patients, it is impossible that any scientist can use the data to reproduce the authors' findings.

      We appreciate this suggestion. To ensure accessibility of the data resources, we created a GitHub repository for both data and code, available at https://github.com/SLINGhub/MASLD_dual_omics____.

      The GitHub repository includes clinical data for all 109 participants with patient characteristics and histological gradings, as well as processed omics data (log₂-transformed). We have generated artificial IDs for each patient so that we can include all the requested data in an organized manner. A code template is also provided to replicate the main statistical results from this study. In addition, for readers interested in conducting analyses from the raw data, we have deposited the raw sequencing files and mass spectrometry data in GEO and Zenodo, as detailed in the ‘Data Availability’ section.

      In Fig. 5B, the authors excluded the steatosis and fibrosis overlapped genes. Steatosis and fibrosis specific genes could simply reflect the outcomes rather than causes. In this case, the obtained results might not identify the gene signatures related to fibrosis initiation.

      We appreciate this comment, but we do not fully understand the reviewer’s point since we did not exclude overlapped genes in our analysis, and it was unclear to us whether excluding overlapping genes has anything to do with causality of both processes.

      In Figure 5B, we identified the gene signatures associated with steatosis and fibrosis after adjusting for potential confounders such as age, sex, BMI and diabetes status. Our results showed that these signatures were relatively independent, sharing a limited number of genes. We then examined genes uniquely associated with each process by additional adjustment (e.g., adjusting steatosis models for fibrosis grades). To us this was not an unreasonable approach, given that steatosis precedes fibrosis in most cases, especially in morbid obesity.

      We nevertheless agree with the reviewer’s point that the gene expression changes we identified represent statistical associations without warranting causality. To specifically address fibrosis initiation mechanisms within the limitation of the current study design, we performed a separate comparative analysis between patients with fibrosis+steatosis versus those with steatosis alone (Table S11), which still identified GTPase regulation as a potential key mechanism in fibrosis initiation (Figure 6B).

      In Fig. 6D, the authors used 3D liver spheroid to validate their findings. However, there is no images showing the 3D liver spheroid formation before and after PPARa/d agonist treatment. It is not clear whether the 3D liver spheroid was successfully established.

      There is extensive literature (>40 papers) from the Lauschke lab on 3D liver spheroid culture, including but not limited to PMIDs 27143246, 28264975, 32775153, 37870288 and 39605182. Images of the spheroids can be seen in Figure 1c of Adv. Sci. 2024, 2407572 and elafibrinor treatment did not affect the morphology of the spheroids.

      The authors suggested that targeting LX-2 cells with Rac1 and Cdc42 inhibitors could reduce collagen production. Did the authors observe these two genes upregulated in mRNA and protein expression levels in their cohort when compared MASLD patients with and without fibrosis? Did the authors observe that the expression levels of Rac1 and Cdc42 are correlated with fibrosis progression in MASLD patients?

      Regarding comments 7 and 8, we targeted Rac1 and Cdc42 in the LX-2 cell experiment as they are common and major GTPases. Protein-level data are not available in our dataset, but we examined their transcript-level expression. RAC1 and CDC42 expression levels were positively associated with fibrosis progression, with coefficients of 0.362 (q = 0.027) and 0.342 (q = 0.031), respectively. These results are presented in Table S5, and the corresponding boxplots are shown here.

      Figure R1. RAC1 and CDC42 expression levels in individuals with different fibrosis *levels. *

      Other studies have revealed several metabolite changes related to MASLD progression (PMID: 35434590, PMID: 22364559). However, the authors did not discuss the discrepancies between their findings with the previous studies.

      Thank you for the suggestion. We have incorporated a discussion of the two studies into the Discussion section, highlighting the consistencies and discrepancies between our plasma metabolomic results and previous findings. The main differences may stem from variations in MASLD spectrum and the degree of obesity in the cohorts.

      Reviewer #1 (Significance (Required)):

      Overall, the current study might provide some new resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. The MASLD research community will be interested in the resource data.

      We thank this reviewer for the positive and constructive evaluation of our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this paper, Kaldis and collaborators investigate the molecular heterogeneity of a 109 morbidly obese patient cohort, focusing on liver transcriptomics and metabolomics analysis from liver and serum. The main finding (i.e. upregulation of GTPase-coding genes) was validated in spheroids and a human HSC cell line. As these proteins are involved in critical cellular functions related to metabolism and cytoskeleton dynamics, these findings shed light on their involvement in human liver pathology which so far has been poorly (or even not) documented to date. This is an interesting addition to the current knowledge about chronic liver pathology. However the manuscript suffers from the lack of a clear-cut definition of patient subgroups and the seemingly indistinct use of generic (MASLD, NAS score) and more granular terms (MASH, fibrosis) across the various analysis they performed.

      We thank this reviewer of highlighting the novelty of our manuscript. We agree that mixing generic and granular terms can be confusing and we tried to use of terms consistently throughout, which has been further improved in the revised version.

      Figure 1 and Table 1 provide comprehensive information regarding histological phenotypes, NAS scores, and patient characteristics. From Figure 2 onward, we specifically focused on steatosis and fibrosis as distinct histological features, identifying molecular signatures associated with each process.

      The term ‘MASH’ was used only when referring to the ex vivo 3D spheroids derived from histologically confirmed MASH patients for validation purposes. As our primary cohort represents early disease stages, we did not characterize molecular features of MASH in that data set.

      In this cohort, the term 'NAS' was mentioned only in Section 1 to characterize the disease spectrum. Additionally, in Figures 3A and 6A, we illustrated the association between gene expression levels and NAS in two external cohorts. This was due to the absence of steatosis grades in the two datasets. NAS is an additive measure of multiple scores (steatosis, inflammation and ballooning), but does not account for fibrosis grades.

      Our study focuses on the molecular features of steatosis grades and fibrosis grades as the main histological processes, with all terminology aligned with this stated objective. This allows us to map the transcriptome and metabolome to pathologist-defined steatosis/fibrosis severity (i.e., 0,1,2,3) and identify genes/metabolites that are correlated with increasing steatosis/fibrosis score.

      Major comments:

      • Are the key conclusions convincing?

      The conclusions are generally consistent with findings from numerous previous studies, as many of the genes identified and their associations with disease states have been previously reported. However, I found it difficult to discern which specific disease stages the authors are referring to throughout the manuscript. Terms such as MASLD (Fig. 1F), steatosis (Fig. 4A), MASH, fibrosis (Fig. 6), and the composite NAS score (Fig. 1G) are used interchangeably, without clearly explaining whether or how the patient cohort was stratified to distinguish between isolated steatosis, MASH, and MASH with or without fibrosis. It is also unclear whether subgroups were propensity score-matched.

      As explained in our previous point, we believe that we did not carelessly use the terms interchangeably, but rather used them as they were available or pertinent to the comparisons in discussion. We have provided a comprehensive cohort description in the first section (Table 1, including all histological features and NAS scores), then focused specifically on steatosis and fibrosis in subsequent analyses. We identified distinct molecular processes underlying these two histological features and validated key fibrosis-related pathways.

      Regarding the comment of ‘propensity score-matched subgroups’, we would like to clarify that the only “sub”-group analysis performed in this paper is the transition from steatosis to steatosis with fibrosis. We have consistently used linear regression as the association analysis framework, without binarization of outcomes. We recall that this is a cross-sectional study with challenging recruitment situation from a bariatric surgery clinic that naturally represents the spectrum of MASLD in obesity. We acknowledge that the sampling can always be biased in such a study. However, given the invasiveness of liver resection, the study is also limited by the reality that not all patients would agree to the study, nor it is feasible to form a perfect subgroup meeting 1:1 ratio as in large-scale epidemiology studies based on plasma samples.

      In a related point, the authors mention that 76% of patients are non-fibrotic, introducing a marked imbalance between fibrotic (n=26) and non-fibrotic (n=83) samples. Given this disparity and potential inter-individual variability, it would be helpful to include observed fold changes or effect sizes to give readers a sense of the magnitude of the biological dysregulations being reported.

      As explained in our previous response, our study design examines associations between histological and molecular features rather than using a case-control approach. For effect size quantification, we report standardized linear regression coefficients, i.e. the change in gene expression Z-score per one-point increase in steatosis or fibrosis grade. We also provided fold changes in our comparative analysis of steatosis+fibrosis versus fibrosis-free steatosis. These effect sizes were fully documented in the Supplemental Tables.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      • The authors seem pretty enthusiastic about elafibranor, despite a failed phase 3 clinical trial. I would qualify elafibranor as a useful tool in preclinical model. We agree with the reviewer and indeed used elafibranor as a research tool for PPARa/d modulation rather than a clinically promising prospect. Discussion regarding elafibranor has been updated.

      • The authors should make clearly the pronounced sex bias in their study, which includes mostly women (and btw refer to sex and not gender in the manuscript). Thank you for this important point. We added "Notably, the cohort was predominantly female (76.1%)" to the 'Overview of the study' section in the manuscript. We also replaced all 'gender' with 'sex' throughout the manuscript. In this cohorts, individuals with previous gender reassignment were excluded (see Materials and Methods).

      • The "MASH" status of the spheroid model is overstated. As described in the text it is much closer to a lipotoxicity model (and even glucotoxicity as Glc concentration is 2g/L). The 3D cultures were established from cells isolated from patients with histologically confirmed MASH. Besides steatosis, we observe increased secretion of pro-inflammatory cytokines, activation of hepatic stellate cells and increased deposition of collagen, thus phenocopying the critical disease hallmarks. Additionally, unbiased omics profiling (transcriptomics, proteomics and lipidomics) reveals significant increases in collagen biosynthesis, inflammatory signaling and cholesterol biosynthesis in MASH patient-derived cultures compared to controls. These differences largely overlapped with the results from analyses of six MASH case-control cohort studies. All of these results have been published previously (PMID 39605182).

      This is confusing with panel D in which the authors establish a relationship between fibrotic patients (F2/F3 vs F0/S0, so I guess "no MASLD liver?) and this model. Is the relationship maintained for steatotic-only patients?

      In Figure 6D, we compared GTPase-related gene expression between patients with fibrosis grade 2/3 (n = 26) and those without fibrosis and steatosis (n = 24). Principal component regression resulted in a positive correlation (β = 9.97) between log2 fold changes in 3D spheroids and human fibrosis samples, indicating consistent directional changes in both systems.

      To answer the question from the reviewer, we compared the expression levels of GTPase-related genes in patients with steatosis but no fibrosis (n = 18) to those without fibrosis and steatosis (n = 24), we observed a negative correlation (β = -10.91). This indicates that GTPase-related gene changes in our 3D spheroids do not align with steatosis-related changes in humans.

      Therefore, under the assumption that fibrosis follows steatosis in the majority of the cases of MASLD progression, the result indicates that the alterations in GTPase-related gene expression in the 3D spheroid model specifically is reflective of fibrosis rather than steatosis.

      Figure R2. Comparison of expression level changes in GTPase-related genes between this human cohort and an independent 3D spheroid system: (A) positive correlation with fibrosis grade 2/3 patients versus controls (left), and (B) negative correlation with steatosis-only patients versus controls (right).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I am not convinced that HSC and LX2 cells express significant levels of PPARα. However, did the authors check for this parameter in their LX2 cell line and assessed whether PPARα/b activation by elafibranor (and/or pemafibrate as it is PPARα selective) alter GTPase expression? Whether negative or positive, this could give a clue about possible intercellular crosstalk in the spheroid model.

      We thank this reviewer to point this out. In response, we analysed the mRNA expression of all PPARs in LX-2 cells with and without Elafibranor treatment, respectively (see Figure R3, same as Figure S8G in the Supplemental Material). We confirmed PPARs are expressed in LX-2 cells at the mRNA level (Figure R3A). Elafibranor does not affect their mRNA levels, which is consistent with previous reports that its primary mechanism is through binding and altering the activity of PPAR proteins, not gene expression (PMID 33326461 and PMID 37627519).

      *Figure R3. Gene signatures in LX-2 cells with and without Elafibranor treatment (n = 3). *

      In addition, we assessed mRNA levels of selected GTPase-related genes in LX-2 cells with and without Elafibranor treatment (Figure R3B). Although statistical power was limited, we observed a consistent trend toward reduced RHOU, DOCK2, and RAC1 expression with Elafibranor. this preliminary signal suggests that Elafibranor may counter the elevated GTPase levels seen in MASH patient spheroids, potentially via crosstalk among hepatic cell types, including HSCs.

      To further investigate intercellular crosstalk in GTPase regulation among hepatic cell types, we evaluated signature GTPase-related genes in LX-2 cells, spheroid co-cultures (hepatocytes, HSCs, Kupffer cells), and hepatocyte monocultures. As shown in Figure R4 (same as Figure S10 in the supplemental material), TGFB1 served as a positive control, exhibiting the most pronounced induction upon TGF-β1 treatment in hepatocytes. Despite varied alterations across the selected GTPase-related genes, TGF-β1 treatment produced a trend toward increased VAV1 and DOCK2 expression in co-culture, hepatocytes, and LX-2 cells, and this was reversed by the TGF-β inhibitor in co-culture and hepatocytes. Other GTPase genes, including RAC1, RAB32, and RHOU, displayed cell type–specific responses to TGF-β1. These observations suggest that the regulation of GTPases is mediated by multiple hepatic cell types, supporting the importance of intercellular crosstalk.

      Figure R4. Expression of GTPase-related genes in spheroid co-culture, hepatocyte monoculture, and LX-2 cells (n = 3). Controls for each gene and experiment were normalized to 1 to enable comparison across treatment groups.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The experiment mentioned above is cheap (cell culture, RT-QPCR) and can be performed within a couple of weeks.

      • Are the data and the methods presented in such a way that they can be reproduced? Yes

      • Are the experiments adequately replicated and statistical analysis adequate? There is no indication of group size, number of replicates for in vitro experiments

      Thank you for this suggestion. We have added the sample sizes to all relevant sections: ‘n = 4’ in the figure legends for 3D spheroid experiments and ‘n = 8–10’ for the LX-2 experiments. This information has also been incorporated into the corresponding experimental descriptions in the Methods section.

      **Referees cross-commenting**

      I believe there is a general consensus on this potentially interesting contribution to the field, with three main points: (1) the need for a careful group-by-group comparison that accounts for potential confounders, (2) a more rigorous exploitation/characterization of the spheroid system, and (3) the need to benchmark the authors' findings against the available literature.

      Thank you for summarizing the main points. Our responses are as follows:

      • We adjusted for key confounders (sex, gender, age, BMI, diabetes) in all statistical analysis to minimize potential bias, mostly using linear regression (rather than group-to-group comparison). In response to Reviewer 3, comment 1, we also conducted additional statistical analyses exploring molecular changes in diabetic vs. non-diabetic individuals.
      • We provided detailed characterization of the spheroid model (response to Reviewer 3, comment 3) and we have done additional experiments in LX-2 cells.
      • We benchmarked our findings using external human cohorts, mouse models, and single cell spheroid systems:
      • Compared our liver transcriptomics data with two published liver RNA-seq datasets (EU cohort, PMID 31467298; VA cohort, PMID 33268509) as shown in Figure 1G. In Figures 3A and 6A, we also included sidebars indicating gene alterations in these cohorts, showing consistent trends. Moreover, we examined the expression alterations of GTPase-related genes in these datasets in response to Reviewer 3’s comment 2.
      • Assessed genes linked to fibrosis progression in hepatic stellate cells from a murine liver fibrosis model (PMID 34839349), confirming differential expression of GTPases and their regulators during fibrosis initiation (Figure S9A).
      • Examined GTPase-related genes in an independent single-cell human spheroid system (PMID 37962490). This enabled cell-type-specific information of GTPase regulation in response to TGF-β (Figure S9C). We also expanded the discussion section on both the consistencies and discrepancies between our findings and previously published studies.

      Reviewer #2 (Significance (Required)):

      The authors identified GTPases as players in the progression of MASLD. This is an interesting preliminary report warranting further molecular investigations (in which liver cell types, which GTPase pathway(s) are involved, which functions are controlled through this pathway...)

      • State what audience might be interested in and influenced by the reported findings.

      This paper will have an impact in the hepatology field

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I have expertise in the analysis of "MASLD" human cohorts and in the molecular biology of chronic liver diseases.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Metabolic dysfunction associated liver disease (MASLD) describes a spectrum of progressive liver pathologies linked to life style-associated metabolic alterations (such as increased body weight and elevated blood sugar levels), reaching from steatosis over steatohepatitis to fibrosis and finally end stage complications, such as liver failure and hepatocellular carcinoma. Treatment options for MASLD include diet adjustments, weight loss, and the receptor-β (THR-β) agonist resmetirom, but remain limited at this stage, motivating further studies to elucidate molecular disease mechanisms to identify novel therapeutic targets. In their present study, the authors aim to identify early molecular changes in MASLD linked to obesity. To this end, they study a cohort of 109 obese individuals with no or early-stage MASLD combining measurements from two anatomic sides: 1. bulk RNA-sequencing and metabolomics of liver biopsies, and 2. metabolomics from patient blood. Their major finding is that GTPase-related genes are transcriptionally altered in livers of individuals with steatosis with fibrosis compared to steatosis without fibrosis.

      Major comments:

      1. Confounders (such as (pre-)diabetes) The patient table shows significant differences in non-MASLD vs. MASLD individuals, with the latter suffering more often from diabetes or hypertriglyceridemia.

      Rather than just stating corrections, subgroup analyses should be performed (accompanied with designated statistical power analyses) to infer the degree to which these conditions contribute to the observations. I.e., major findings stating MASLD-associated changes should hold true in the subgroup of MASLD patients without diabetes/of female sex and so forth (testing for each of the significant differences between groups).

      Our original statistical analysis employed linear regression to examine associations between molecular variables (genes/metabolites) and histological progression (steatosis and fibrosis), with adjustment for potential confounders including diabetic status, age, sex, and BMI. We specifically focused on these two histological features to elucidate the disturbed molecular processes during their progression. Regression coefficients represent the expected change in abundance levels (in units of standard deviation of the corresponding molecule) per one-unit increase in histological grades.

      To address the reviewer's question, we conducted additional subgroup analyses to determine whether our major findings remain consistent in individuals with and without diabetes. We assessed linear associations between gene signatures and histological features separately in non-diabetic (n = 71) and diabetic individuals (n = 23). Statistical power was estimated by comparing the variance explained by the full regression model (y ~ x + a + b + c) against the reduced model (y ~ a + b + c), converting the incremental for x into Cohen's , and applying pwr.f2.test with the corresponding degrees of freedom and sample size at α = 0.05.

      For both steatosis and fibrosis, the results in the non-diabetic subgroup (n = 71) showed high consistency with findings in our original analysis (n = 94, adjusted for diabetes), indicating that our originally reported gene signatures, after correction for diabetic status, remain valid in non-diabetic individuals.

      In contrast, for diabetic individuals (n = 23), associations between genes and histological features did not closely replicate our original findings. Notably, we observed larger estimate effects for fibrosis-associated genes in diabetic individuals, suggesting a potential interaction between diabetes and fibrosis progression.

      Figure R5. Subgroup analysis of the association between gene expressions and steatosis grades

      Figure R6. Subgroup analysis of the association between gene expressions and fibrosis grades

      On the comment "degree to which these conditions contribute to the observations," our original analysis adjusted for diabetes status to identify molecular signatures independently associated with fibrosis without the confounding of diabetes status. Consequently, the reported gene signatures in the original analysis more closely reflect patterns in the non-diabetes group, as demonstrated in our subgroup analysis plots. We also comment that, unfortunately, we did not adjust for the interaction of fibrosis and diabetes in the original analysis.

      Furthermore, our additional analyses revealed a close relationship between diabetes and liver fibrosis. Consistent with Figure 1C, hepatic fibrosis is significantly correlated with insulin resistance parameters in clinical assays, including blood insulin levels and HOMA2-IR. To explore this association further, we compared gene expression profiles between diabetic MASLD patients (n = 21) and non-diabetic MASLD patients (n = 43). Although few genes reached significance after multiple testing correction, 166 genes showed differential expression (p 0.32) between these groups.

      We identified 55 genes as potential "diabetic markers" that both showed differential expression between diabetic and non-diabetic MASLD patients and were significantly associated with steatosis or fibrosis progression. These genes are predominantly downregulated metabolic genes (e.g., BAAT, G6PC1, SULT2A1, MAT1A), suggesting that diabetes may exacerbate metabolic suppression as fibrosis advances. Given the high prevalence of diabetes in the MASLD population, our analysis supports the hypothesis that diabetes worsens MASLD outcomes, likely through impaired metabolic capability during fibrosis progression.

      Regarding the comment on the "subgroup of female sex," our original analysis also adjusted for sex as a potential confounder. Since our cohort is predominantly female (>76%), the majority of our findings likely holds true in the female sub-population, similar to what we observed in our diabetes subgroup analysis.

      External validation

      Additionally, to back up the major GTPase signature findings, it would be desirable to analyze an external dataset of (pre)diabetes patients (other biased groups) for alternations in these genes. It would be important to know if this signature also shows in non-MASLD diabetic patients vs. healthy patients or is a feature specific to MASLD. Also, could the matched metabolic data be used to validate metabolite alterations that would be expected under GTPase-associated protein dysregulation?

      We appreciate the comments regarding the validation GTPase as a unique MASLD signature by external datasets. As shown in our previous analysis, after adjusting for diabetes status, the gene signatures remained largely preserved in the non-diabetes subgroup. Before we respond further, we also preface that publicly available liver tissue data, with appropriate and full-scale clinical metadata and sufficient sample sizes, are extremely rare. To the best of our knowledge, the public data sets we brought into our paper were the most prominent data of reliable quality.

      In the paper, we benchmarked our RNAseq dataset against two datasets: the VA cohort and EU cohort (Figure 1). Our cohort focused primarily on early MASLD patients with obesity, which aligns more closely with the disease spectrum represented in the VA cohort (Figure 1G). Notably, in the published paper for the VA cohort, Hoang et al. highlighted Rho GTPase signaling as one of the top pathways in the fibrosis PPI network (Figure 1B from publication PMID 31467298).

      We interrogated GTPase-related genes in both the VA and EU cohorts. As shown in Figure R7 (below), GTPase-related genes demonstrated a strong association with fibrosis grades in the VA cohort, as expected. The EU cohort comprises more advanced MASLD cases with higher fibrosis grades, and our re-analysis in this cohort specifically focused on MASH patients (as designated by the authors). In those MASH patients, GTPase-related genes did not show significant positive associations with fibrosis progression. This finding is consistent with our hypothesis that GTPase regulation is triggered more prominent during the early progression of fibrosis rather than at later stages.

      Unfortunately, diabetes status was not available in the GEO repository for the VA cohort. Available liver tissue sequencing datasets with balanced representation of diabetic and nondiabetic patients are rare, especially those derived from obese individuals and reflecting the early-to-middle stages of MASLD. In our own cohort, for instance, only two diabetic patients without MASLD were recruited (Table 1). While we cannot rule out a role for insulin resistance in GTPase regulation, we will plan future experiments using mouse models to examine GTPase-mediated fibrosis under diabetic and nondiabetic conditions.

      Regarding the comment ‘validate metabolite alterations that would be expected under GTPase-associated protein dysregulation,’ we note that GTPases are primarily involved in cytoskeletal organization, vesicle trafficking, and other cellular processes, with few well-established links to specific metabolite signatures. Nevertheless, in our partial correlation network integrating hepatic genes and metabolites, we observed co-regulated metabolites associated with GTPase-related genes (Figure R8). These included palmitoleoyl ethanolamide (N-acylethanolamine, an anti-inflammatory metabolite and PPARα ligand), phenylacetic acid (a phenylalanine metabolite), biotin (a coenzyme), arginine, lysine, melatonin (a tryptophan metabolite), and several lipid species such as PC 32:0 and CAR 20:1. While causal relationships cannot be inferred from this dataset, our integrative network highlights potential connections related to the trafficking of these metabolites that warrant further investigation.

      Figure R7. Associations between GTPase-related genes to fibrosis in this study and two external cohorts. Asterisks denote significant associations with q value Figure R8. Integrative subnetwork of GTPase-related genes. Blue squares represent GTPase-related genes, red circles indicate metabolites connected to these genes, and the purple diamond denotes fibrosis, which is connected to RHOU.*

      3D liver spheroid MASH model, Fig. 6D/E

      This 3D experiment is technically not an external validation of GTPase-related genes being involved in MASLD, since patient-derived cells may only retain changes that have happened in vivo. To demonstrate that the GTPase expression signature is specifically invoked by fibrosis the LX-2 set up is more convincing, however, the up-regulation of the GTPase-related genes upon fibrosis induction with TGF-beta, in concordance with the patient data, needs to be shown first (qPCR or RNA-seq).

      We agree with the reviewer that experiments in LX-2 (HSC) cells are important and as we have described under ‘Reviewer #2’ we have done this (Figure R3 and Figure R4). Because HSCs only comprise a minor cell population of liver cells, the signals observed in patient bulk RNA data are likely driven primarily by hepatocytes. Nevertheless, we have highlighted the importance of hepatic cell crosstalk in Figure R4 and in our response to Reviewer #2. Additionally, in Supplementary Figure S9B, we identify the potential cell types of origin for the GTPase signals (predominantly hepatocytes and HSCs) using a single-cell dataset from an independent study (PMID: 37962490).

      Additionally, the description of the 3D model is too uncritical. The maintenance of functional human PHHs in 3D has only become available this year (PMID: 40240606) marking a break-through in the field. Since the authors did not use this system, I would strongly assume their findings are largely attributable to the mesenchymal cells in the 3D culture, and these limitations need to be stated.

      We humbly disagree with the reviewer on the 3D liver spheroids. The paper that the reviewer is referencing is related to the proliferation of hepatocytes in organoids, not – at least not directly – their functional maintenance. Here, we use a spheroid model of mature fully differentiated cells, which is conceptually different from the organoid approach. Maintenance of such functional human hepatocytes for multiple weeks in culture has been possible for close to a decade (PMID 27143246). Moreover, particularly for the modeling of chronic liver disease, such as MASH, it is important to use directly patient-derived cells as short induction cycles (typically 1-2 weeks) of disease phenotypes in organoid models do not faithfully reproduce the molecular signatures that stem from chronic exposures in vivo.

      The 3D liver spheroid model we used here is derived from livers from patients with a histologically confirmed diagnosis of MASH. The isolated cells are fully mature and thus do not require in vitro differentiation. There are no MSCs in the 3D cultures; rather the spheroids contain hepatocytes, stellate cells, Kupffer cells as well as various other immune cell types present in the liver at the time of isolation (T cells, B cells, NK cells). Furthermore, the model is extensively characterized at the transcriptomic, proteomic and lipidomic level (PMID 39605182).

      Novelty / references

      Similar studies that also combined liver and blood lipidomics/metabolomics in obese individuals with and without MASLD (e.g. PMID 39731853, 39653777) should be cited. Additionally, it would benefit the quality of the discussion to state how findings in this study add new insights over previous studies, if their findings/insights differ, and if so, why.

      Thank you for the suggestion. We added the two papers into the discussion section. Specifically, we discussed the consistent findings (such as AKR1B10 in PMID 37037945 and mitochondrial dysfunction in PMID 39731853) and discrepancies (such as limited plasma metabolomic changes and circulating sphingolipid alterations in multiple human and mouse models) in comparison with previously published omics studies in MASLD patients. Also, we thoroughly discussed our findings (e.g., lipid dysregulation, dysregulated tryptophan metabolism, GTPase regulation) and potential mechanisms with extensive literature supports from of human, animal, and cell studies.

      Minor comments:

      1. The quality of Supplementary Figures (e.g. S7) makes is impossible to read the labels Thank you for this feedback. The resolution of the figures was impaired in the initial upload. We will provide all supplementary figures with high resolution in our revised submission and ensure all labels are clearly readable.

      For Figure S7C, we presented the correlation matrix of more than 200 GTPase-related genes along with the TGF-β genes TGFB1 and TGFB3. This illustrates the overall co-expression patterns of GTPase-related genes rather than displaying individual gene labels, with arrows now included to highlight TGFB1 and TGFB3.

      Reviewer #3 (Significance (Required)):

      The authors provide an overall sound study on the hepatic transcriptomic and metabolomic signatures in an Australian cohort of 109 obese non-to-early stage MASLD patients. They perform thorough analyses of metabolome and transcriptome in liver biopsies and metabolome in blood, using standard technologies such as RNA sequencing and mass spectrometry. Their key finding is a GTPase-associated gene signature related to fibrosis onset. Limitations of the study include potential cohort confounders (raising the need for expanded control experiments), limited discussion of similar studies, and limits in cell-type resolution, the latter of which is related to the molecular read out, and has in parts been started to be addressed by in vitro experiments in an immortalized HSC lines. Taken together, given additional control analyses will be performed, the results could be of interest to an expert community in the field of molecular hepatology and, while still descriptive, hold the potential to prompt mechanistic follow-up studies.

      We thank this reviewer for a balanced, positive, and constructive evaluation of our manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Metabolic dysfunction associated liver disease (MASLD) describes a spectrum of progressive liver pathologies linked to life style-associated metabolic alterations (such as increased body weight and elevated blood sugar levels), reaching from steatosis over steatohepatitis to fibrosis and finally end stage complications, such as liver failure and hepatocellular carcinoma. Treatment options for MASLD include diet adjustments, weight loss, and the receptor-β (THR-β) agonist resmetirom, but remain limited at this stage, motivating further studies to elucidate molecular disease mechanisms to identify novel therapeutic targets.

      In their present study, the authors aim to identify early molecular changes in MASLD linked to obesity. To this end, they study a cohort of 109 obese individuals with no or early-stage MASLD combining measurements from two anatomic sides: 1. bulk RNA-sequencing and metabolomics of liver biopsies, and 2. metabolomics from patient blood. Their major finding is that GTPase-related genes are transcriptionally altered in livers of individuals with steatosis with fibrosis compared to steatosis without fibrosis.

      Major comments:

      1. Confounders (such as (pre-)diabetes) The patient table shows significant differences in non-MASLD vs. MASLD individuals, with the latter suffering more often from diabetes or hypertriglyceridemia. Rather than just stating corrections, subgroup analyses should be performed (accompanied with designated statistical power analyses) to infer the degree to which these conditions contribute to the observations. I.e., major findings stating MASLD-associated changes should hold true in the subgroup of MASLD patients without diabetes/of female sex and so forth (testing for each of the significant differences between groups).
      2. External validation Additionally, to back up the major GTPase signature findings, it would be desirable to analyze an external dataset of (pre)diabetes patients (other biased groups) for alternations in these genes. It would be important to know if this signature also shows in non-MASLD diabetic patients vs. healthy patients or is a feature specific to MASLD. Also, could the matched metabolic data be used to validate metabolite alterations that would be expected under GTPase-associated protein dysregulation?
      3. 3D liver spheroid MASH model, Fig. 6D/E This 3D experiment is technically not an external validation of GTPase-related genes being involved in MASLD, since patient-derived cells may only retain changes that have happened in vivo. To demonstrate that the GTPase expression signature is specifically invoked by fibrosis the LX-2 set up is more convincing, however, the up-regulation of the GTPase-related genes upon fibrosis induction with TGF-beta, in concordance with the patient data, needs to be shown first (qPCR or RNA-seq). Additionally, the description of the 3D model is too uncritical. The maintenance of functional human PHHs in 3D has only become available this year (PMID: 40240606) marking a break-through in the field. Since the authors did not use this system, I would strongly assume their findings are largely attributable to the mesenchymal cells in the 3D culture, and these limitations need to be stated.
      4. Novelty / references Similar studies that also combined liver and blood lipidomics/metabolomics in obese individuals with and without MASLD (e.g. PMID 39731853, 39653777) should becited. Additionally, it would benefit the quality of the discussion to state how findings in this study add new insights over previous studies, if their findings/insights differ, and if so, why.

      Minor comments:

      1. The quality of Supplementary Figures (e.g. S7) makes is impossible to read the labels

      Significance

      The authors provide an overall sound study on the hepatic transcriptomic and metabolomic signatures in an Australian cohort of 109 obese non-to-early stage MASLD patients. They perform thorough analyses of metabolome and transcriptome in liver biopsies and metabolome in blood, using standard technologies such as RNA-sequencing and mass spectrometry. Their key finding is a GTPase-associated gene signature related to fibrosis onset. Limitations of the study include potential cohort confounders (raising the need for expanded control experiments), limited discussion of similar studies, and limits in cell-type resolution, the latter of which is related to the molecular read out, and has in parts been started to be addressed by in vitro experiments in an immortalized HSC lines. Taken together, given additional control analyses will be performed, the results could be of interest to an expert community in the field of molecular hepatology and, while still descriptive, hold the potential to prompt mechanistic follow-up studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In this paper, Kaldis and collaborators investigate the molecular heterogeneity of a 109 morbidly obese patient cohort, focusing on liver transcriptomics and metabolomics analysis from liver and serum. The main finding (ie upregulation of GTPase-coding genes) was validated in spheroids and a human HSC cell line. As these proteins are involved in critical cellular functions related to metabolism and cytoskeleton dynamics, these findings shed light on their involvement in human liver pathology which so far has been poorly (or even not) documented to date. This is an interesting addition to the current knowledge about chronic liver pathology. However the manuscript suffers from the lack of a clear-cut definition of patient subgroups and the seemingly indistinct use of generic (MASLD, NAS score) and more granular terms (MASH, fibrosis) across the various analysis they performed.

      Major comments:

      • Are the key conclusions convincing?

      The conclusions are generally consistent with findings from numerous previous studies, as many of the genes identified and their associations with disease states have been previously reported. However, I found it difficult to discern which specific disease stages the authors are referring to throughout the manuscript. Terms such as MASLD (Fig. 1F), steatosis (Fig. 4A), MASH, fibrosis (Fig. 6), and the composite NAS score (Fig. 1G) are used interchangeably, without clearly explaining whether or how the patient cohort was stratified to distinguish between isolated steatosis, MASH, and MASH with or without fibrosis. It is also unclear whether subgroups were propensity score-matched.

      In a related point, the authors mention that 76% of patients are non-fibrotic, introducing a marked imbalance between fibrotic (n=26) and non-fibrotic (n=83) samples. Given this disparity and potential inter-individual variability, it would be helpful to include observed fold changes or effect sizes to give readers a sense of the magnitude of the biological dysregulations being reported. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? - The authors seem pretty enthusiastic about elafibranor, despite a failed phase 3 clinical trial. I would qualify elafibranor as a useful tool in preclinical model.<br /> - The authors should make clearly the pronounced sex bias in their study, which includes mostly women (and btw refer to sex and not gender in the manuscript).<br /> - The "MASH" status of the spheroid model is overstated. As described in the text it is much closer to a lipotoxicity model (and even glucotoxicity as Glc concentration is 2g/L). This is confusing with panel D in which the authors establish a relationship between fibrotic patients (F2/F3 vs F0/S0, so I guess "no MASLD liver?) and this model. Is the relationship maintained for steatotic-only patients?<br /> - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. I am not convinced that HSC and LX2 cells express significant levels of PPARα. However, did the authors check for this parameter in their LX2 cell line and assessed whether PPARα/b activation by elafibranor (and/or pemafibrate as it is PPARα selective) alter GTPase expression? Whether negative or positive, this could give a clue about possible intercellular crosstalk in the spheroid model.<br /> - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The experiment mentioned above is cheap (cell culture, RT-QPCR) and can be performed within a couple of weeks. - Are the data and the methods presented in such a way that they can be reproduced?

      Yes - Are the experiments adequately replicated and statistical analysis adequate?

      There is no indication of group size, number of replicates for in vitro experiments

      Referees cross-commenting

      I believe there is a general consensus on this potentially interesting contribution to the field, with three main points: (1) the need for a careful group-by-group comparison that accounts for potential confounders, (2) a more rigorous exploitation/characterization of the spheroid system, and (3) the need to benchmark the authors' findings against the available literature.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. The authors identified GTPases as players in the progression of MASLD. This is an interesting preliminary report warranting further molecular investigations (in which liver cell types, which GTPase pathway(s) are involved, which functions are controlled through this pathway...)
      • State what audience might be interested in and influenced by the reported findings. This paper will have an impact in the hepatology field
      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. I have expertise in the analysis of "MASLD" human cohorts and in the molecular biology of chronic liver diseases.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Metabolic dysfunction-associated steatotic liver disease (MASLD) ranges from simple steatosis, steatohepatitis, fibrosis/cirrhosis, and hepatocellular carcinoma. In the current study, the authors aimed to determine the early molecular signatures differentiating patients with MASLD associated fibrosis from those patients with early MASLD but no symptoms. The authors recruited 109 obese individuals before bariatric surgery. They separated the cohorts as no MASLD (without histological abnormalities) and MASLD. The liver samples were then subjected to transcriptomic and metabolomic analysis. The serum samples were subjected to metabolomic analysis. The authors identified dysregulated lipid metabolism, including glyceride lipids, in the liver samples of MASLD patients compared to the no MASLD ones. Circulating metabolomic changes in lipid profiles slightly correlated with MASLD, possibly due to the no MASLD samples derived from obese patients. Several genes involved in lipid droplet formation were also found elevated in MASLD patients. Besides, elevated levels of amino acids, which are possibly related to collagen synthesis, were observed in MASLD patients. Several antioxidant metabolites were increased in MASLD patients. Furthermore, dysregulated genes involved in mitochondrial function and autophagy were identified in MASLD patients, likely linking oxidative stress to MASLD progression. The authors then determined the representative gene signatures in the development of fibrosis by comparing this cohort with the other two published cohorts. Top enriched pathways in fibrotic patients included GTPas signaling and innate immune responses, suggesting the involvement of GTPas in MASLD progression to fibrosis. The authors then challenged human patient derived 3D spheroid system with a dual PPARa/d agonist and found that this treatment restored the expression levels of GTPase-related genes in MASLD 3D spheroids. In conclusion, the authors suggested the involvement of upregulated GTPase-related genes during fibrosis initiation. Overall, the current study might provide some resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. However, several concerns should be carefully addressed.

      1. A recent study, via proteomic and transcriptomic analysis, revealed that four proteins (ADAMTSL2, AKR1B10, CFHR4 and TREM2) could be used to identify MASLD patients at risk of steatohepatitis (PMID: 37037945). It is not clear why the authors did not include this study in their comparison.
      2. The authors recruited 109 patients but only performed transcriptomic and metabolomic analysis in 94 liver samples. Why did the authors exclude other samples?
      3. The authors mentioned clinical data in Table 1 but did not present the table in this manuscript.
      4. The generated metabolomic data could be a very useful resource to the MASLD community. However, it is very confusing how the data was generated in those supplemental tables. There is no clear labeling of human clinical information in those tables. Also, what do those values mean in columns 47-154? This reviewer assumed that they are the raw data of metabolomic analysis in plasma samples. However, without clear clinical information in these patients, it is impossible that any scientist can use the data to reproduce the authors' findings.
      5. In Fig. 5B, the authors excluded the steatosis and fibrosis overlapped genes. Steatosis and fibrosis specific genes could simply reflect the outcomes rather than causes. In this case, the obtained results might not identify the gene signatures related to fibrosis initiation.
      6. In Fig. 6D, the authors used 3D liver spheroid to validate their findings. However, there is no images showing the 3D liver spheroid formation before and after PPARa/d agonist treatment. It is not clear whether the 3D liver spheroid was successfully established.
      7. The authors suggested that targeting LX-2 cells with Rac1 and Cdc42 inhibitors could reduce collagen production. Did the authors observe these two genes upregulated in mRNA and protein expression levels in their cohort when compared MASLD patients with and without fibrosis?
      8. Did the authors observe that the expression levels of Rac1 and Cdc42 are correlated with fibrosis progression in MASLD patients?
      9. Other studies have revealed several metabolite changes related to MASLD progression (PMID: 35434590, PMID: 22364559). However, the authors did not discuss the discrepancies between their findings with the previous studies.

      Significance

      Overall, the current study might provide some new resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. The MASLD research community will be interested in the resource data.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Xu et al. investigated split gene drive systems by targeting multiple female essential genes involved in fertility and viability in Drosophila. The authors evaluate the suppression efficiency through individual corsses and cage trials. Resistance allele formation and fitness costs are explored by examining the sterility and fertility of each line. Overall, the experimental design is sound and methods are feasible. The work is comprehensive, and conclusions are well supported by the data. This work offers informative insights that could guide the design of suppression gene drive systems in other invasive disease vectors or agricultural pests.

      However, several points requiring clarification or improvement:

      1 Methodological clarity: Some experimental details are indufficiently described, for example, regarding the setup of genetic crosses involving different Cas9 derivatives. In line 197-198, "the mated females, together with females that were mated with Cas9 only males", it is unclear whether the latter group refers to gRNA-females.

      -We thank the reviewer for pointing out this ambiguity. The latter group refers to Cas9 females crossed to Cas9 males. We have clarified this both in the methods (line 207) and results (line 505-509).

      2.Regarding the inheritance rates, you included the reverse orientation of CG4415-Cas9, as I understood, it means this component is in reverse orientation with fluorescent marker. Since it is standard to design adjacent components in opposite direction to avoid transcriptional interference, the rationale for including this comparison should be better justified.

      • In our construct, ‘CG4415 (reverse orientation)’ indicates that Cas9 was oriented in the same direction as the fluorescent marker, while the other Cas9 constructs (nanos-Cas9 and CG4415-Cas9) places them in opposite directions. “reverse” just indicates a change from a “standard” in another study. Our previous publication showed that Cas9 orientation relative to the marker had little apparent effect on drive performance at the yellow-G locus. In this study, we compared both orientations in a fertility gene and again observed similar results, suggesting that orientation relative to the marker does not substantially affect drive efficiency in our system. We have clarified this in the figure legend text.

      Embryo resistance is inferred from the percentage of sterile drive females derived from drive mothers. How many female individuals were analysed per line and why deep sequencing was not employed to directly detect resistance alleles.

      -Embryo resistance can mean slightly different things for different applications. The most important is probably the fraction of females that have little to no fertility due to embryo resistance. Some of these may not have complete embryo resistance alleles, but instead, have mosaicism, with a sufficient level of resistance to still cause sterility. It is unclear exactly what proportion of resistance to wild-type may cause this, and thus, proportions from pooled sequencing, which could include both complete and all levels of mosaicism, may not be sufficient to measure this parameter. Another relevant parameter that we did not measure is the fraction of males rendered unable to do drive conversion (this value should be closer to the complete resistance rate, but probably still lower because of the multiple gRNAs). Even in this case, deep sequencing would not allow us to determine exactly what is happening in males, making individual sequencing a preferred approached. It is very nice, of course, for characterizing which resistance alleles are present overall, but in this study, we wanted to put a bit more emphasis on the effect of resistance, rather than its sequence characterizing.

      We analyzed 30 females per line for lines targeting nox, oct, dec and stl, 9 females for ndl and 276 individuals for line tra-v2 (Data Set S4). We believe such individual analyses sufficiently detected embryo resistance causing sterility within reasonable error. Note that we did also randomly genotype several sterile females and found mutations at target sites that disrupted gene functions.

      In response to this comment, we have added some text to justify our measurement of resistance alleles and include some of this discussion:

      “Note also that this defines embryo resistance as sufficient to induce sterility, but these may be mosaic rather than complete resistance. Further, note that the multiplex gRNA design in males may allow for continued drive conversion with a complete (as opposed to mosaic) embryo resistance allele, if some sites remain wild-type.”

      Masculinisation phenotypes were observed upon disruption of tra gene. How strong intersexes were distinguished from males? What molecular markers were used to determine genetic sex. This information should be clearly provided.

      -We observed two types of strong masculinisation phenotypes (Figure S2), one with bigger body size than wildtype males, and the other was identical to wildtype males. The homozygosity of the drive allele could be assessed by the brightness of red fluorescence in the eyes. However, we also randomly genotyped these masculinized females (as part of a batch that included males) to confirm their sex using primers for the Y-linked gene PP1Y2. A specific band was detected in wild-type males but not in masculinized females, confirming their genetic sex. This information has been added to the manuscript (lines 477-480).

      It would be more appropriate to use "hatchability"rather than "fertility" when referring to egg-to-larva viability.

      -Thank you for the suggestion. We used egg-to-adult survival rates as a proxy for the fertility of their parents because they usually laid similar number of eggs. However, it still technically incorrect language. We have fixed this in line 582 and elsewhere in the section.

      In cage trials, a complete gene drive is mimicked by introducing Cas9 to the background population, but this differs from actual complete gene drive, due to potential effects from separate insertion sites (different chromosome or loci). These difference could impact the system's performance and should be discussed.

      -We appreciate this point and have added discussion on the limitations of mimicking a complete gene drive using split components (line 766-779).

      7.Given the large amount of data presented, it would improve readability and interpretation if each result section concluded with a concise summary highlighting the key findings and implications.

      -Thank you for the suggestion. We have added brief summaries at the end of each results section to highlight the key findings and their significance.

      Reviewer #1 (Significance (Required)):

      The authors evaluate suppression efficiency through individual courses and cage trials. Resistance allele formation and fitness costs are explored by examining the sterility and fertility of each line. Overall, the experimental design is sound and methods are feasible. The work is comprehensive, and conclusions are well supported by the data. This work offers informative insights that could guide the design of suppression gene drive systems in other invasive disease vectors or agricultural pests.

      -We appreciate the reviewer’s positive assessment of our work.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Paper summary

      The manuscript by Xu. et al presents an insightful and valuable contribution to the field of gene drive research. The manuscript by Xu et al. presents an insightful and valuable contribution to the field of gene drive research. The strategy of targeting and disrupting female fertility genes using selfish homing genetic elements was first proposed by Burt in 2003. However, for this approach to be effective, the phenotypic constraints associated with gene disruption have meant that the pool of suitable target genes remains relatively small - notwithstanding the significant expansion in accessible targets enabled by CRISPR-based genome editing nucleases. Population suppression gene drives are well developed as proof-of-principle systems, with some now in the late stages of development as genetic control strains. However, advancing the pipeline will require a broader set of validated target genes - both to ensure effectiveness across diverse species and to build redundancy into control strategies, reducing reliance on any single genetic target.

      In their paper, the authors conduct a systematic review of nine female fertility genes in Drosophila melanogaster to assess their potential as targets for homing-based suppression gene drives. The authors first conduct a thorough bioinformatic review to select candidate target genes before empirically testing candidates through microinjection and subsequent in vivo analyses of drive efficiency, population dynamics, and fitness costs relating to fecundity and fertility. After finalising their results, the authors identify two promising candidate target genes - oct and stl - which both demonstrate high gene conversion rates and, regarding the latter, can successfully suppress a cage population at a high release frequency. However, the manuscript suffers from a lack of in-depth discussion of a key limitation in its experimental design - namely, that the authors utilise a split-drive design to assess population dynamics and fitness effects when such a drive will not reflect release scenarios in the field. The review below highlights some major strengths and weaknesses of the paper, with suggestions for improvement.

      Key strengths

      The study's most significant strength is in its systematic selection and empirical testing of nine distinct genes as targets for homing-based gene drive, hence providing a valuable resource that substantially expands the pool of potential targets beyond the more commonly studied target genes (e.g. nudel, doublesex, among others). The identification of suitable target genes presents a significant bottleneck in the development of gene drives and the work presented here provides a foundational dataset for future research. The authors bolster the utility of their results by assessing the conservation of candidate genes across a range of pest species, suggesting the potential for broader application.

      A key finding in the paper is the successful suppression of a cage population using a stl-targeting gene drive (albeit at a high release frequency). This provides a critical proof-of-principal result demonstrating that stl is a viable target for a suppression drive. While in the paper suppression was not possible at lower release frequencies, together, the results provide evidence for complex population dynamics and threshold effects that may govern the success or failure of a gene drive release strategy - hence moving the conversation from a technical perspective ("can it work") to how a gene drive may be implemented. Moreover, the authors also employ a multiplexed gRNA strategy for all their gene drive designs and in particular their population suppressive gene drive targeting stl. This provides further proof-of-principal evidence for multiplexed gRNAs in order to combat the evolution of functional resistance following gene drive deployment.

      Finally, a further strength of this paper is in the clever dissection of fitness effects resulting from maternal Cas9 deposition. The authors design and perform a robust set of crosses to elucidate the parental source of fitness effects (i.e. maternally, paternally, or biparentally derived Cas9), finding (as they and others have before) that embryonic fitness was significantly reduced when Cas9 was inherited from a maternal source. As discussed, the authors conclude that maternal deposition is particularly pronounced in the context of split drives as opposed to complete drives, with the implication being that a complete drive might succeed where a split-drive has failed; thus providing a key directive for future study.

      Concerns

      The manuscript's central weakness lies in its interpretation of the results from the cage experiments - namely that a split-drive system was used to "mimic the release of a complete drive". In the study, mosquitoes carrying the drive element (i.e. the gRNA) were introduced into a population homozygous for the Cas9 element over several generations. This design is likely not representative of a real-world scenario and, as the authors state, likely exaggerates fitness costs. This is because the females carrying Cas9 will maternally deposit Cas9 protein into her eggs, with activity spanning several generations. When mated with a drive-carrying male the gRNA will immediately co-exist with maternally deposited Cas9, leading to early somatic cleavage and significant fitness costs (reflected in the author's own fertility crosses). This is fundamentally different to how a complete drive would function in a real-world release, where complete-drive males would mate with wild-type females not carrying Cas9. Their offspring would carry the drive element but would not be exposed to maternally deposited cas9, thus deleterious maternal effects would only begin to appear in the subsequent generation from females carrying the drive. Fitness costs measured from split-drive designs are therefore likely substantially overestimated compared to what would occur during the initial but critical release phase of a complete drive. This flaw weakens the paper's ability to predict the failure or success of the screened targets in a complete drive design, thus weakening the interpretation of the results from the cage trials. As a suggestion for improvement, the authors should explicitly and more prominently discuss the limitations of their split-drive model compared to complete drive models, both in the Results and Discussion. It is also recommended to include a schematic for both strategies that contrasts the experimental setup design (i.e. release of the drive into a Cas9 homozygous background) with a complete-drive release, clearly illustrating differences in maternal deposition pathways. This will not only contextualise the results and support the author's conclusion that observed fitness costs are likely an overestimate but will further strengthen the arguments that the candidate target genes found in this study may still be viable in a complete-drive system.

      -We sincerely appreciate the thoughtful review and the valuable comments and suggestions provided, which have helped improve both the clarity and readability of this study. We have revised several parts in the discussion of the manuscript and hope that these changes adequately address the concerns raised. We have also made Figure S5 to illustrate the differences between two release strategies (biparental-Cas9 split drive in our study and complete drive in real release).

      Please note that this type of fitness cost may have partially undermined our cage study (the fitness effect is notable, but still small compared to total fitness costs), but this is also among the first studies to propose and investigate this phenomenon in the first place (it is also noted in another preprint from our lab but to our knowledge not proposed elsewhere). Thus, part of the impact of our manuscript is showing that this is important, which may inform future cage studies in our lab and elsewhere.

      A second weakness in the manuscript relates to its limited explanation and discussion of key concepts. For example, the manuscript reports a stark difference in outcome of the two stl-targeting drives, where a high initial release in cage 1 led to population elimination versus a failure of the drive to spread in cage 2. The authors attribute this to vague "allele effects" and stochastic factors such as larval competition; however the results appear reminiscent of the Allee effect, which is a well-characterised phenomenon describing the correlation of population size (or density) and individual fitness (or per capita population growth rate). Using their results as an example, is it plausible that the high-frequency initial release in cage 1 imposed enough genetic load to quickly drive the population density below the Allee threshold thus quickly leading to population eradication. In cage 2, the low-frequency at initial release was insufficient to cross the Allee threshold. Omitting mention of this ecological principal greatly weakens the Discussion, and further presents a missed opportunity to discuss one of the more crucial strengths of the paper - that is, in providing a deeper insight into the practical requirements for successful field implementation.

      -While we do indeed mention this Allee effect (the “allele effect” noted above is a misspelling that we have corrected), we were hesitant to give it much discussion, considering that the specific Allee effect in our cages is likely of a very different nature than one would find in nature (we explain that it is likely due to bacterial growth that occurs when fewer larvae are present). However, it is perhaps still a good excuse to cover it in the discussion, while still noting that the specific Allee effect in our cage may not be representative. We have added the following text: “Nonetheless, the successful result in the cage with high release study may point to a potential field strategy for a drive that is less efficient (perhaps even one found to be less efficient in initial field tests compared to laboratory tests). If the initial release frequency of the drive is sufficiently high and widespread, then short-term high genetic load may substantially reduce the population, perhaps enough for Allee effects to become important. At this point, even if average genetic load is slowly declining without additional drive releases, persistent moderate genetic load coupled with the Allee effect may be sufficient to ensure population elimination.”

      In a similar vein, the authors provide only a superficial mechanistic discussion into the fitness costs associated with drives targeting key candidate genes. The paper would benefit from a deeper discussion regarding the specific molecular functions of top-performing genes (stl, oct, nox) and how unintended Cas9 activity could disrupt their activity, integrating known molecular functions with observed fitness costs. For instance, oct encodes a G-protein coupled receptor essential for ovulation and oviduct muscle relaxation, thus disruption to the oct gene would directly impair egg-laying which would account for the observed phenotypic effects. A deeper discussion linking unintended Cas9 activity to the specific, sensitive functions of target genes would elevate the paper from a descriptive screen to a more insightful mechanistic study.

      -We appreciate the reviewer’s comment. We have added a discussion to further explain fitness cost caused by unintended Cas9 activity disrupting target gene functions. However, keep in mind that the exact timing of Cas9 cleavage and the exact timing of these gene’s essential functions is still somewhat uncertain, which may limit insights from this line of analysis compared to a situation where ideal, high quality data is available for both of these. Here is the new material in the discussion:

      “The functions of the top-performing genes suggests a mechanistic basis for the observed fitness costs. Aside from germline cells, nanos has expression in other ovary cells as well. CG4415 lacks this expression, but our Cas9 construct with this promoter may have a different expression pattern that the native gene, as evidenced by its support for good drive conversion in females. stl is essential for ovarian follicle development, and its disruption likely in non-germline ovary cells could compromise egg chamber development and fertility. oct encodes the octopamine β2 receptor, a G-protein coupled receptor critical for ovulation and fertilization, so if it were similarly lost, egg-laying would be directly impaired. nox, which encodes NADPH oxidase, contributes to calcium flux and smooth muscle contraction during ovulation, so its disruption may prevent egg laying. tra is needed in the whole body for sexual development, but may also play an important role in ovary function. Thus, unintended Cas9 activity at these non-germline ovary cells can directly interfere with sensitive reproductive functions, potentially explaining the fertility costs observed in drive carriers. This issue could potentially be overcome if promoters were available that were truly restricted to germline cells rather than other reproductive cells, though it remains unclear if such promoters both exist and would retain their expression pattern at a non-native locus.”

      It is curious that the authors chose two genes on the X chromosome as targets. In insects (such as Drosophila here) that have heterogametic sex chromosomes, homing is not possible in the heterogametic sex as there is no chromosome to home to - so there will be no homing in males. On top of that, there is usually some fitness effect in carrier (heterozygous) females, so in a population these are nearly always bad targets for drives - unless there is some other compelling reason to choose that target?

      -Our rationale for testing X-linked targets is twofold. First, these genes are likely to play important roles in sex-specific functions and may have a different expression pattern (which is why specifically Dec was included), potentially reducing fitness costs. Although homing cannot occur in males, if drive conversion at these sites in females is very high and fitness costs are minimal, the resulting genetic load could still be sufficient to suppress populations (thus, such candidates could be superior even in diploids if they happen to have a lower fitness costs). Second, X-linked targets may have broader relevance for suppression drives in haplodiploid pests (e.g., fire ants), which has the same population dynamics as an X-linked target in a diploid populations. Our results therefore could have provided useful insights for such scenarios (such as for fire ants: Liu et al., bioRxiv 2025) if drive performance was sufficient for followup testing.

      Minor comments

      • Enhanced clarity in the Figures and data presentation would greatly improve readability. For example, Figure 5 is critical yet difficult to interpret; consider changing x-axis labels from icons to explicit text (e.g. "biparental Cas9", "maternal cas9", "paternal Cas9"). Similarly, Figure 4 is difficult to read and the y-axis label "population size" is ambiguous; consider adding shapes or dashes (rather than relying solely on colour) and clarifying the y-axis (e.g. no. adults collected) in the legend.

      -We appreciate the reviewer’s comment and have revised Figure 4 as suggested. Regarding Figure 5, we attempted to replace the icons with text labels; however, this was not possible because there is very little horizontal space and two generations to specify. Instead, we have revised the figure legend to provide a clearer explanation, which can hopefully improve clarity..

      • Expand on or include a schematic to show the differences in construction between the tra-v1 and tra-v2 constructs to better contextualise the discrepancies in results (e.g. inheritance rates of 61%-66% for tra-v1 and 81%-83% for tra-v2 between the two.

      -We have expanded Figure 2 to compare the constructs of tra-v1 and tra-v2. The further explanation of these two constructs was added into the result section: ‘When targeting tra, we originally tested the 4-gRNA construct tra-v1. However, the drive inheritance rate was relatively low (61%-66%), and sequencing revealed that only the middle two gRNAs were active (Table S3). Lack of cleavage at the outmost sites is particularly detrimental to achieving high drive conversion. Therefore, a second construct tra-v2 was tested that retained the two active gRNAs and included two new gRNAs. It showed substantially improved drive inheritance (81%-83%). ’

      • Minor typos e.g.:

      o Line 87: "form" to "from"

      o Line 484: "expended" to "expanded

      o Line 560: "foor" to "for"

      o Line 732: "conversed" to "conserved

      -We have revised these typos.

      • Clarify the split drive system: the authors introduce split drive for the first time in Line 118. They should at least give a clear definition and explanation of split drive and complete drive in the introduction.

      -We have included an introduction of split drive and complete drive in the introduction (line 47-53).

      • Line 237-238., The fitness evaluation lacks a clear description of controls. How were non-drive flies generated and validated as controls?

      -Drive heterozygotes were crossed with Cas9 homozygotes to generate the flies used for fitness evaluation. From the same cross, non-drive progeny were obtained and used as controls, ensuring they shared a comparable genetic background and rearing conditions with the drive-carrying individuals. We have now clarified in the manuscript results that “these served as the controls because they had the same environment and parents as the drive flies”.

      • Line 409-412.,line 423.,The high inheritance rates of stl and oct drives are impressive; however, variation in results across Cas9 promoters should be explained further in the discussion.

      -In the discussion section (lines 751-765), we included a dedicated paragraph addressing the variation observed between the nanos and CG4415 promoters. We have now expanded it to briefly note some differences:

      “Our previous works showed that both nanos and CG4415 have high drive conversion rates8, but nanos failed to suppress target populations in a homing drive targeting the female fertility gene yellow-G due to its fitness cost in drive females27. CG4415 had much lower maternal deposition, which allowed the elimination of cage populations by targeting yellow-G8. Here, we tested both promoters with drives targeting oct and stl, with both showing slightly higher drive efficiency than the drive targeting yellow-G in small-scale crosses. CG4415 has slightly worse though still good performance in females, likely due to male-biased expression compared to nanos.”

      • Line 414: The CG4415 promoter yielded reduced drive conversion rates in females, yet is still referred to as a promising promoter. This conclusion seems optimistic and should be clarified/more justified.

      -Based on our previous study cited in this context, CG4415 shows relatively lower germline conversion rates compared to nanos, although still remaining at a high level. Importantly, CG4415 also exhibits reduced maternal deposition relative to nanos, which could help mitigate fitness costs associated with maternal deposition—an important consideration for suppression systems. Taken together, while its conversion efficiency is lower (but only slightly), the potential benefits of reduced maternal deposition and perhaps even fitness costs provide a rationale for regarding CG4415 as a promising promoter. We state this when first introducing the promoter in the “Drive efficiency assessment” results subsection.

      • Specify the number of flies released, sex ratio, and cage size per generation (Line 466). This is essential for reproducibility.

      -We appreciate the reviewer’s comment and have revised the text to clarify our release approach, which differed from that used in other studies (which tend to have substantial fitness differences between lines in the first generation that can complicate analysis and change results). Rather than directly releasing drive males or females into cages, we first crossed drive males with non-drive females and then mixed them with non-drive females mated to non-drive males. The offspring (including males and females) from these crosses were recorded as the G0 generation, and their ratios were recorded as release frequency. We have specified the release ratio adult numbers in the following paragraph and supplementary file.

      Reviewer #2 (Significance (Required)):

      Overall the manuscript presents a valuable and timely resource for gene drive research, in particular for its systematic appraisal of potential target genes for population suppression drives and its rigorous assessment of the impact of maternal Cas9 deposition. The value in the generation and empirical testing of a novel multiplexed stl-targeting gene drive that led to population eradication in a cage trial should not be understated. While several key aspects of the discussion of the manuscript should be strengthened, the study presents a meaningful contribution to the field, extending previous work and and outlines important considerations for the design and implementation of effective gene drive systems.

      -We thank the reviewer for their encouraging and constructive comments. We are pleased that the systematic evaluation of target genes, the analysis of maternal Cas9 deposition, and the multiplexed stl-targeting drive were recognized as valuable contributions. We have strengthened the discussion as suggested, and we believe these revisions further enhance the manuscript as an aid for the design and implementation of future gene drive systems.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this study, Xu and colleagues explored how CRISPR-based homing gene drives could be used to suppress insect populations by targeting female fertility genes in Drosophila melanogaster. They engineered split gene drives with multiplexed guide RNAs to target nine candidate genes, seeking to prevent functional resistance and achieve high drive conversion with minimal fitness costs.

      Here my comments about this work:

      Abstract: While the stated aim of the study on line 16 is to "maintain high drive conversion efficiency with low fitness costs in female drive carriers," the conclusion in lines 29-31 shifts focus toward the broader challenges and future optimization of gene drive systems. This conclusion does not clearly highlight the specific results of the study or how they relate directly to the original objective. It would be more effective to emphasize the actual findings, such as which target genes performed best and under what conditions, and how these findings support or contradict the stated goals. The study primarily aimed to assess the efficiency of specific female fertility genes and to evaluate strategies for minimizing the formation of functional resistance alleles, rather than proposing a protocol for optimization. Therefore, better alignment is needed between the study's aim, experimental design, and concluding statements. Clarifying this alignment would also help refine the paper's focus and more accurately communicate its contribution, including whether it is exploratory, comparative, or methodologically driven.

      -We have revised the abstract to clarify the alignment as suggested by the reviewer. We note that this discrepancy is due to the initial aim of our study being different than some of the important lessons learned along the way regarding fitness effects from Cas9 deposition in split drives. Still, we agree that it would be better to be more consistent in our wording and conclusions.

      Introduction: One of the key design elements in this study is the use of multiplexed gRNAs. It is reasonable to assume that this strategy may influence fitness costs, potentially in more than one way. Given that assessing fitness cost is a major focus of the study, it would be helpful to include a brief discussion of previous research examining how multiplexed gRNAs may impact fitness in gene drive systems. A short review of relevant studies, if available, would provide important context for interpreting the results and could help clarify whether any observed fitness costs might be attributed, at least in part, to the multiplexing strategy itself. This addition could be appropriately placed around line 102, where gRNA design is discussed.

      -We have added an explanation in the Discussion to mention this. However, it has not been conclusively shown that multiplexed gRNAs have any effect on fitness. Indeed, there have been some multiplexed constructs that seem to have no fitness effect, and some that have high fitness costs. This doesn’t rule out the potential for multiplexed gRNAs to influence fitness itself, but it means that the mechanism may be complex. The new text reads:

      “Another potential though unconfirmed source of fitness cost arises from increased cleavage events associated with multiplexed gRNAs, where the greater number of gRNAs can enhance the overall cut rate compared to single-gRNA designs.”

      Line 42: Cas12a also showed efficacy using gene drives in yeast and Drosophila.

      -We now mention Cas12a at the beginning of the introduction.

      Line 133: The paragraph begins by stating that homologs of the target genes were identified and aligned. To improve clarity, especially for readers who are new to gene drive research, it would be helpful to begin the paragraph with a brief introductory sentence explaining the purpose of this step. For example, you could state the importance of identifying and aligning homologs to assess the conservation of target sites across species, which is critical for evaluating the broader applicability of gene drive strategies. This context would guide the reader and clarify the relevance of the analysis.

      -We have added the explanation as suggested.

      Lines 144-145: You mention that "the exception was tra, for which two constructs containing different gRNA sets were generated." For clarity, it would be helpful to provide a brief explanation of why two different gRNA sets were used for tra, and whether this differs from the approach taken with the other target genes. It's currently unclear whether all other genes were targeted using a single, standardized set of gRNAs, and this should be explicitly stated here for consistency, even though it is mentioned later in the plasmid construction section. Additionally, I suggest combining the sections on gRNA target design and plasmid construction. Since these components are closely related and sequential in the experimental workflow, presenting them together would improve the logical flow and help readers follow the methodology more smoothly.

      -We have combined both the gRNA target design and plasmid construction sections. We also discuss the two tra constructs early in the results section (see response to reviewer 2).

      Line 210: The analysis of the cage experiments was based on models from previous studies that used a simplified assumption of a single gRNA at the target site. While I understand this approach has precedent, it raises important questions about potential limitations. Specifically, could simplifying the analysis to one gRNA affect the conclusions of this study, given that the experimental design involves multiplexed gRNAs with four distinct target sites? The implications of using this simplified model should be clearly addressed, as the dynamics of drive efficiency, resistance formation, and fitness effects may differ when multiple gRNAs are employed. Additionally, while I am not a statistician, it is worth asking whether more sophisticated modeling approaches could be applied to account for all four gRNAs, rather than reducing the system to a single-gRNA framework. A discussion of the modeling choices and their potential consequences would strengthen the interpretation of the results.

      -We have clarified this. While we have modeled multiple gRNAs with high fidelity in SLiM, the maximum likelihood method is not very amenable to such treatment. It may cause our fitness estimate to be a small overestimate, but give the low fitness inferences, would certainly not have a large enough effect to fundamentally change any conclusion (and should be of a consistent level across all cages). We now discuss this in the methods section.

      Lines 297-300: Your results show that the expression of all target genes was higher in females, except for oct, which had higher expression in males. Additionally, oct expression decreased in adults. Given that oct is functionally important for ovulation and fertilization, processes that are primarily required in adult females, this pattern is somewhat unexpected. Could there be a possible explanation for the lower expression of oct, particularly in females and especially in adults, where its function would presumably be most critical? A brief discussion or hypothesis addressing this discrepancy would help clarify the biological relevance and interpretation of the expression data.

      -Based on transcriptome data from FlyBase, derived from Graveley et al. (2011), Oct is indeed expressed slightly higher in adult males than in adult females. This difference may be attributed to the fact that the female flies used in the study were virgins; Oct expression could be upregulated post-mating to mediate ovulation. Additionally, Oct is expressed not only in reproductive tissues but also in other organs such as the nervous system, where sex-specific differences in cell type composition or neural activity may contribute to the observed expression bias. However, high expression does not necessarily correlate with essential expression. Though Oct could have multiple functions, it’s still possible that the only apparent phenotype upon knockout is female sterility. We have added the following text: “This male-biased expression may result from the use of virgin females in the dataset, as oct is likely upregulated after mating. Moreover, oct is also expressed in non-reproductive tissues such as the nervous system, which may contribute to sex-specific differences in expression38. While oct may have multiple functions, it is possible that it is only essential for female fertility.”

      Lines 346-347: What is the distance between the gRNA target sites within each gene? Are all of the gRNAs confirmed to be active? It would be valuable to include a table summarizing the distance between target sites for each gene, the activity levels of the individual gRNAs, and the corresponding homing rates. This would help determine whether there is a correlation between gRNA spacing and drive efficiency. For example, Lopez del Amo et al. (Nature Communications, 2020) demonstrated that even a 20-nucleotide mismatch at each homology arm can significantly reduce drive conversion. Including such a comparative analysis in your study could provide important insights into how gRNA arrangement influences overall drive performance and would be incredibly helpful for future multiplexing designs.

      -We have showed previously that close spacing of gRNAs should help maintain high drive conversion efficiency, and this is alluded to indirectly in the introduction (we now mention it more directly). In our study, gRNAs were positioned in close proximity without overlap, with the general distance between the outermost cut sites within each gene being We have added a summary table (Table S3) presenting the sequencing results, which also showed gRNA activity levels. Notably, most but not all gRNAs were active, at least for embryo resistance (low to moderate activity may still be present in the germline). Coupled with varying activity levels for those that were active, this likely contributed to reduced drive conversion due to mismatches at the homology arms. This observation supports the notion that drive performance could be optimized by selecting and arranging more active gRNAs. Consistent with this, our second construct targeting tra (tra-v2) exhibited a higher inheritance rate than the original construct, suggesting that gRNA arrangement and activity critically influence drive efficiency. Testing the activity of every single gRNA requires the construction of multiple gRNA lines, since in vitro or ex vivo tests will not be accurate as in vivo transformation test. However, in our study, as long as drive conversion rates were reasonably high, further optimization was not needed. Therefore, the multiplexing gRNA design can not only maximize drive conversion, but also reduce labor filtering an increased number of 1-gRNA designs with lower performance.

      Line 434: I was not able to find any sequencing data. This is important to evaluate gRNA activities and establish correlations with drive efficiency.

      -We have added a summary of the sequencing results in Table S3, though these are for embryo resistance alleles. Note that while high gRNA activity is correlated with high drive inheritance, these are not directly related. For suppression drives, germline resistance rates are usually of low importance compared to drive inheritance, so we did not assess these in detail (and pessimistically assumed complete germline resistance in our cage models).

      Line 482: Did the authors test Cas9-only individuals (without the drive) against a wild-type population? This would help determine whether Cas9 alone has any unintended fitness effects. Additionally, is Cas9 expression stable over time and across generations? It would be helpful to include any observations or thoughts on the long-term stability and potential fitness impact of Cas9 in the absence of the drive element.

      -We did not perform a direct comparison of Cas9-only individuals and wild-type flies in this study. However, previous studies (Champer et al., Nature Communications, 2020 - Langmuller et al., eLife 2022), which we now cite in the discussion, found no significant fitness difference between very similar Cas9-expressing lines and wild type in the absence of a drive element, indicating no significant fitness impact from Cas9 alone (though we cannot exclude a small effect, it certainly could not come close to explaining our results). In our experiments, Cas9 expression was generally stable across generations, as indicated by consistent drive inheritance and fertility test results obtained from independent batches. Separate from this study, we did observe rare instability in one nanos-Cas9 line, which had remained stable for over five years but recently became inactive (low population maintenance size may have caused stochastic removal of the functional allele). It is something to watch out for, but probably not on the timescale of a single study.

      Discussion: I would appreciate a more direct and clearly stated conclusion that summarizes the key findings of the study. While the discussion addresses the main outcomes in depth, presenting a concise concluding paragraph, either at the end of the discussion or as a standalone conclusion section, would provide a stronger and more definitive closing statement. This would help reinforce what the study ultimately achieved and ensure the main takeaways are clearly communicated to the reader.

      -We have revised and expanded the last paragraph of the discussion section to make our findings more direct and clear.

      Overall, I believe this is an important study that offers valuable insights for advancing the design of CRISPR-based gene drives. The findings contribute to the development of more efficient and practical gene drive prototypes, bringing the field closer to real-world applications.

      Reviewer #3 (Significance (Required)):

      In this study, Xu and colleagues explored how CRISPR-based homing gene drives could be used to suppress insect populations by targeting female fertility genes in Drosophila melanogaster. They engineered split gene drives with multiplexed guide RNAs to target nine candidate genes, seeking to prevent functional resistance and achieve high drive conversion with minimal fitness costs. Among the targets, the stall (stl) and octopamine β2 receptor (oct) genes performed better, showing the highest inheritance rates in lab crosses. When tested in population cages, the stl drive was able to completely eliminate a fly population, but only when released at a high enough frequency, while other cages failed. These failures were traced and explained by fitness cost in drive-carrying females, caused largely by maternally deposited Cas9, which led to embryo resistance and reduced fertility. Through additional fertility assays and modeling, the team confirmed that the origin and timing of Cas9 expression, particularly from mothers, significantly impacted drive success. Surprisingly, even when Cas9 was driven by promoters with supposedly low somatic activity, such as nanos, fitness still persisted. The study revealed that while gene drives can be powerful, their effectiveness relies on finely balanced factors like promoter choice, drive architecture, and gene function. Overall, the research offers valuable lessons for designing robust, next-generation gene drives aimed at ecological pest control.

      -We sincerely appreciate the reviewer’s positive and thoughtful comments. We agree that the points raised highlight the importance of our findings and hope that our revisions have further improved both the clarity and overall content of the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Xu and colleagues explored how CRISPR-based homing gene drives could be used to suppress insect populations by targeting female fertility genes in Drosophila melanogaster. They engineered split gene drives with multiplexed guide RNAs to target nine candidate genes, seeking to prevent functional resistance and achieve high drive conversion with minimal fitness costs.

      Here my comments about this work:

      Abstract: While the stated aim of the study on line 16 is to "maintain high drive conversion efficiency with low fitness costs in female drive carriers," the conclusion in lines 29-31 shifts focus toward the broader challenges and future optimization of gene drive systems. This conclusion does not clearly highlight the specific results of the study or how they relate directly to the original objective. It would be more effective to emphasize the actual findings, such as which target genes performed best and under what conditions, and how these findings support or contradict the stated goals. The study primarily aimed to assess the efficiency of specific female fertility genes and to evaluate strategies for minimizing the formation of functional resistance alleles, rather than proposing a protocol for optimization. Therefore, better alignment is needed between the study's aim, experimental design, and concluding statements. Clarifying this alignment would also help refine the paper's focus and more accurately communicate its contribution, including whether it is exploratory, comparative, or methodologically driven.

      Introduction: One of the key design elements in this study is the use of multiplexed gRNAs. It is reasonable to assume that this strategy may influence fitness costs, potentially in more than one way. Given that assessing fitness cost is a major focus of the study, it would be helpful to include a brief discussion of previous research examining how multiplexed gRNAs may impact fitness in gene drive systems. A short review of relevant studies, if available, would provide important context for interpreting the results and could help clarify whether any observed fitness costs might be attributed, at least in part, to the multiplexing strategy itself. This addition could be appropriately placed around line 102, where gRNA design is discussed.

      Line 42: Cas12a also showed efficacy using gene drives in yeast and Drosophila.

      Line 133: The paragraph begins by stating that homologs of the target genes were identified and aligned. To improve clarity, especially for readers who are new to gene drive research, it would be helpful to begin the paragraph with a brief introductory sentence explaining the purpose of this step. For example, you could state the importance of identifying and aligning homologs to assess the conservation of target sites across species, which is critical for evaluating the broader applicability of gene drive strategies. This context would guide the reader and clarify the relevance of the analysis.

      Lines 144-145: You mention that "the exception was tra, for which two constructs containing different gRNA sets were generated." For clarity, it would be helpful to provide a brief explanation of why two different gRNA sets were used for tra, and whether this differs from the approach taken with the other target genes. It's currently unclear whether all other genes were targeted using a single, standardized set of gRNAs, and this should be explicitly stated here for consistency, even though it is mentioned later in the plasmid construction section. Additionally, I suggest combining the sections on gRNA target design and plasmid construction. Since these components are closely related and sequential in the experimental workflow, presenting them together would improve the logical flow and help readers follow the methodology more smoothly.

      Line 210: The analysis of the cage experiments was based on models from previous studies that used a simplified assumption of a single gRNA at the target site. While I understand this approach has precedent, it raises important questions about potential limitations. Specifically, could simplifying the analysis to one gRNA affect the conclusions of this study, given that the experimental design involves multiplexed gRNAs with four distinct target sites? The implications of using this simplified model should be clearly addressed, as the dynamics of drive efficiency, resistance formation, and fitness effects may differ when multiple gRNAs are employed. Additionally, while I am not a statistician, it is worth asking whether more sophisticated modeling approaches could be applied to account for all four gRNAs, rather than reducing the system to a single-gRNA framework. A discussion of the modeling choices and their potential consequences would strengthen the interpretation of the results.

      Lines 297-300: Your results show that the expression of all target genes was higher in females, except for oct, which had higher expression in males. Additionally, oct expression decreased in adults. Given that oct is functionally important for ovulation and fertilization, processes that are primarily required in adult females, this pattern is somewhat unexpected. Could there be a possible explanation for the lower expression of oct, particularly in females and especially in adults, where its function would presumably be most critical? A brief discussion or hypothesis addressing this discrepancy would help clarify the biological relevance and interpretation of the expression data.

      Lines 346-347: What is the distance between the gRNA target sites within each gene? Are all of the gRNAs confirmed to be active? It would be valuable to include a table summarizing the distance between target sites for each gene, the activity levels of the individual gRNAs, and the corresponding homing rates. This would help determine whether there is a correlation between gRNA spacing and drive efficiency. For example, Lopez del Amo et al. (Nature Communications, 2020) demonstrated that even a 20-nucleotide mismatch at each homology arm can significantly reduce drive conversion. Including such a comparative analysis in your study could provide important insights into how gRNA arrangement influences overall drive performance and would be incredibly helpful for future multiplexing designs.

      Line 434: I was not able to find any sequencing data. This is important to evaluate gRNA activities and establish correlations with drive efficiency.

      Line 482: Did the authors test Cas9-only individuals (without the drive) against a wild-type population? This would help determine whether Cas9 alone has any unintended fitness effects. Additionally, is Cas9 expression stable over time and across generations? It would be helpful to include any observations or thoughts on the long-term stability and potential fitness impact of Cas9 in the absence of the drive element.

      Discussion: I would appreciate a more direct and clearly stated conclusion that summarizes the key findings of the study. While the discussion addresses the main outcomes in depth, presenting a concise concluding paragraph, either at the end of the discussion or as a standalone conclusion section, would provide a stronger and more definitive closing statement. This would help reinforce what the study ultimately achieved and ensure the main takeaways are clearly communicated to the reader.

      Overall, I believe this is an important study that offers valuable insights for advancing the design of CRISPR-based gene drives. The findings contribute to the development of more efficient and practical gene drive prototypes, bringing the field closer to real-world applications.

      Significance

      In this study, Xu and colleagues explored how CRISPR-based homing gene drives could be used to suppress insect populations by targeting female fertility genes in Drosophila melanogaster. They engineered split gene drives with multiplexed guide RNAs to target nine candidate genes, seeking to prevent functional resistance and achieve high drive conversion with minimal fitness costs. Among the targets, the stall (stl) and octopamine β2 receptor (oct) genes performed better, showing the highest inheritance rates in lab crosses. When tested in population cages, the stl drive was able to completely eliminate a fly population, but only when released at a high enough frequency, while other cages failed. These failures were traced and explained by fitness cost in drive-carrying females, caused largely by maternally deposited Cas9, which led to embryo resistance and reduced fertility. Through additional fertility assays and modeling, the team confirmed that the origin and timing of Cas9 expression, particularly from mothers, significantly impacted drive success. Surprisingly, even when Cas9 was driven by promoters with supposedly low somatic activity, such as nanos, fitness still persisted. The study revealed that while gene drives can be powerful, their effectiveness relies on finely balanced factors like promoter choice, drive architecture, and gene function. Overall, the research offers valuable lessons for designing robust, next-generation gene drives aimed at ecological pest control.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Paper summary

      The manuscript by Xu. et al presents an insightful and valuable contribution to the field of gene drive research. The manuscript by Xu et al. presents an insightful and valuable contribution to the field of gene drive research. The strategy of targeting and disrupting female fertility genes using selfish homing genetic elements was first proposed by Burt in 2003. However, for this approach to be effective, the phenotypic constraints associated with gene disruption have meant that the pool of suitable target genes remains relatively small - notwithstanding the significant expansion in accessible targets enabled by CRISPR-based genome editing nucleases. Population suppression gene drives are well developed as proof-of-principle systems, with some now in the late stages of development as genetic control strains. However, advancing the pipeline will require a broader set of validated target genes - both to ensure effectiveness across diverse species and to build redundancy into control strategies, reducing reliance on any single genetic target. In their paper, the authors conduct a systematic review of nine female fertility genes in Drosophila melanogaster to assess their potential as targets for homing-based suppression gene drives. The authors first conduct a thorough bioinformatic review to select candidate target genes before empirically testing candidates through microinjection and subsequent in vivo analyses of drive efficiency, population dynamics, and fitness costs relating to fecundity and fertility. After finalising their results, the authors identify two promising candidate target genes - oct and stl - which both demonstrate high gene conversion rates and, regarding the latter, can successfully suppress a cage population at a high release frequency. However, the manuscript suffers from a lack of in-depth discussion of a key limitation in its experimental design - namely, that the authors utilise a split-drive design to assess population dynamics and fitness effects when such a drive will not reflect release scenarios in the field. The review below highlights some major strengths and weaknesses of the paper, with suggestions for improvement.

      Key strengths

      The study's most significant strength is in its systematic selection and empirical testing of nine distinct genes as targets for homing-based gene drive, hence providing a valuable resource that substantially expands the pool of potential targets beyond the more commonly studied target genes (e.g. nudel, doublesex, among others). The identification of suitable target genes presents a significant bottleneck in the development of gene drives and the work presented here provides a foundational dataset for future research. The authors bolster the utility of their results by assessing the conservation of candidate genes across a range of pest species, suggesting the potential for broader application. A key finding in the paper is the successful suppression of a cage population using a stl-targeting gene drive (albeit at a high release frequency). This provides a critical proof-of-principal result demonstrating that stl is a viable target for a suppression drive. While in the paper suppression was not possible at lower release frequencies, together, the results provide evidence for complex population dynamics and threshold effects that may govern the success or failure of a gene drive release strategy - hence moving the conversation from a technical perspective ("can it work") to how a gene drive may be implemented. Moreover, the authors also employ a multiplexed gRNA strategy for all their gene drive designs and in particular their population suppressive gene drive targeting stl. This provides further proof-of-principal evidence for multiplexed gRNAs in order to combat the evolution of functional resistance following gene drive deployment. Finally, a further strength of this paper is in the clever dissection of fitness effects resulting from maternal Cas9 deposition. The authors design and perform a robust set of crosses to elucidate the parental source of fitness effects (i.e. maternally, paternally, or biparentally derived Cas9), finding (as they and others have before) that embryonic fitness was significantly reduced when Cas9 was inherited from a maternal source. As discussed, the authors conclude that maternal deposition is particularly pronounced in the context of split drives as opposed to complete drives, with the implication being that a complete drive might succeed where a split-drive has failed; thus providing a key directive for future study.

      Concerns

      The manuscript's central weakness lies in its interpretation of the results from the cage experiments - namely that a split-drive system was used to "mimic the release of a complete drive". In the study, mosquitoes carrying the drive element (i.e. the gRNA) were introduced into a population homozygous for the Cas9 element over several generations. This design is likely not representative of a real-world scenario and, as the authors state, likely exaggerates fitness costs. This is because the females carrying Cas9 will maternally deposit Cas9 protein into her eggs, with activity spanning several generations. When mated with a drive-carrying male the gRNA will immediately co-exist with maternally deposited Cas9, leading to early somatic cleavage and significant fitness costs (reflected in the author's own fertility crosses). This is fundamentally different to how a complete drive would function in a real-world release, where complete-drive males would mate with wild-type females not carrying Cas9. Their offspring would carry the drive element but would not be exposed to maternally deposited cas9, thus deleterious maternal effects would only begin to appear in the subsequent generation from females carrying the drive. Fitness costs measured from split-drive designs are therefore likely substantially overestimated compared to what would occur during the initial but critical release phase of a complete drive. This flaw weakens the paper's ability to predict the failure or success of the screened targets in a complete drive design, thus weakening the interpretation of the results from the cage trials. As a suggestion for improvement, the authors should explicitly and more prominently discuss the limitations of their split-drive model compared to complete drive models, both in the Results and Discussion. It is also recommended to include a schematic for both strategies that contrasts the experimental setup design (i.e. release of the drive into a Cas9 homozygous background) with a complete-drive release, clearly illustrating differences in maternal deposition pathways. This will not only contextualise the results and support the author's conclusion that observed fitness costs are likely an overestimate but will further strengthen the arguments that the candidate target genes found in this study may still be viable in a complete-drive system.

      A second weakness in the manuscript relates to its limited explanation and discussion of key concepts. For example, the manuscript reports a stark difference in outcome of the two stl-targeting drives, where a high initial release in cage 1 led to population elimination versus a failure of the drive to spread in cage 2. The authors attribute this to vague "allele effects" and stochastic factors such as larval competition; however the results appear reminiscent of the Allee effect, which is a well-characterised phenomenon describing the correlation of population size (or density) and individual fitness (or per capita population growth rate). Using their results as an example, is it plausible that the high-frequency initial release in cage 1 imposed enough genetic load to quickly drive the population density below the Allee threshold thus quickly leading to population eradication. In cage 2, the low-frequency at initial release was insufficient to cross the Allee threshold. Omitting mention of this ecological principal greatly weakens the Discussion, and further presents a missed opportunity to discuss one of the more crucial strengths of the paper - that is, in providing a deeper insight into the practical requirements for successful field implementation. In a similar vein, the authors provide only a superficial mechanistic discussion into the fitness costs associated with drives targeting key candidate genes. The paper would benefit from a deeper discussion regarding the specific molecular functions of top-performing genes (stl, oct, nox) and how unintended Cas9 activity could disrupt their activity, integrating known molecular functions with observed fitness costs. For instance, oct encodes a G-protein coupled receptor essential for ovulation and oviduct muscle relaxation, thus disruption to the oct gene would directly impair egg-laying which would account for the observed phenotypic effects. A deeper discussion linking unintended Cas9 activity to the specific, sensitive functions of target genes would elevate the paper from a descriptive screen to a more insightful mechanistic study.

      It is curious that the authors chose two genes on the X chromosome as targets. In insects (such as Drosophila here) that have heterogametic sex chromosomes, homing is not possible in the heterogametic sex as there is no chromosome to home to - so there will be no homing in males. On top of that, there is usually some fitness effect in carrier (heterozygous) females, so in a population these are nearly always bad targets for drives - unless there is some other compelling reason to choose that target?

      Minor comments

      • Enhanced clarity in the Figures and data presentation would greatly improve readability. For example, Figure 5 is critical yet difficult to interpret; consider changing x-axis labels from icons to explicit text (e.g. "biparental Cas9", "maternal cas9", "paternal Cas9"). Similarly, Figure 4 is difficult to read and the y-axis label "population size" is ambiguous; consider adding shapes or dashes (rather than relying solely on colour) and clarifying the y-axis (e.g. no. adults collected) in the legend.
      • Expand on or include a schematic to show the differences in construction between the tra-v1 and tra-v2 constructs to better contextualise the discrepancies in results (e.g. inheritance rates of 61%-66% for tra-v1 and 81%-83% for tra-v2 between the two.
      • Minor typos e.g.:
        • Line 87: "form" to "from"
        • Line 484: "expended" to "expanded
        • Line 560: "foor" to "for"
        • Line 732: "conversed" to "conserved
      • Clarify the split drive system: the authors introduce split drive for the first time in Line 118. They should at least give a clear definition and explanation of split drive and complete drive in the introduction.
      • Line 237-238., The fitness evaluation lacks a clear description of controls. How were non-drive flies generated and validated as controls?
      • Line 409-412.,line 423.,The high inheritance rates of stl and oct drives are impressive; however, variation in results across Cas9 promoters should be explained further in the discussion.
      • Line 414: The CG4415 promoter yielded reduced drive conversion rates in females, yet is still referred to as a promising promoter. This conclusion seems optimistic and should be clarified/more justified.
      • Specify the number of flies released, sex ratio, and cage size per generation (Line 466). This is essential for reproducibility.

      Significance

      Overall the manuscript presents a valuable and timely resource for gene drive research, in particular for its systematic appraisal of potential target genes for population suppression drives and its rigorous assessment of the impact of maternal Cas9 deposition. The value in the generation and empirical testing of a novel multiplexed stl-targeting gene drive that led to population eradication in a cage trial should not be understated. While several key aspects of the discussion of the manuscript should be strengthened, the study presents a meaningful contribution to the field, extending previous work and and outlines important considerations for the design and implementation of effective gene drive systems.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Xu et al. investigated split gene drive systems by targeting multiple female essential genes involved in fertility and viability in Drosophila. The authors evaluate the suppression efficiency through individual corsses and cage trials. Resistance allele formation and fitness costs are explored by examining the sterility and fertility of each line. Overall, the experimental design is sound and methods are feasible. The work is comprehensive, and conclusions are well supported by the data. This work offers informative insights that could guide the design of suppression gene drive systems in other invasive disease vectors or agricultural pests.

      However, several points requiring clarification or improvement:

      1. Methodological clarity: Some experimental details are indufficiently described, for example, regarding the setup of genetic crosses involving different Cas9 derivatives. In line 197-198, "the mated females, together with females that were mated with Cas9 only males", it is unclear whether the latter group refers to gRNA-females.
      2. Regarding the inheritance rates, you included the reverse orientation of CG4415-Cas9, as I understood, it means this component is in reverse orientation with fluorescent marker. Since it is standard to design adjacent components in opposite direction to avoid transcriptional interference, the rationale for including this comparison should be better justified.
      3. Embryo resistance is inferred from the percentage of sterile drive females derived from drive mothers. How many female individuals were analysed per line and why deep sequencing was not employed to directly detect resistance alleles.
      4. Masculinisation phenotypes were observed upon disruption of tra gene. How strong intersexes were distinguished from males? What molecular markers were used to determine genetic sex. This information should be clearly provided.
      5. It would be more appropriate to use "hatchability"rather than "fertility" when referring to egg-to-larva viability.
      6. In cage trials, a complete gene drive is mimicked by introducing Cas9 to the background population, but this differs from actual complete gene drive, due to potential effects from separate insertion sites (different chromosome or loci). These difference could impact the system's performance and should be discussed.
      7. Given the large amount of data presented, it would improve readability and interpretation if each result section concluded with a concise summary highlighting the key findings and implications.

      Significance

      The authors evaluate suppression efficiency through individual courses and cage trials. Resistance allele formation and fitness costs are explored by examining the sterility and fertility of each line. Overall, the experimental design is sound and methods are feasible. The work is comprehensive, and conclusions are well supported by the data. This work offers informative insights that could guide the design of suppression gene drive systems in other invasive disease vectors or agricultural pests.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      • *

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      • *We appreciate the reviewers' assessment of the significance of our work and would like to highlight where we believe the novelty of this study lies. Our findings identify E4BP4 as a key transcription factor that maintains mitochondrial homeostasis by restraining the overactivation of biological pathways - such as de novo ceramide synthesis - that are known to drive mitochondrial oxidative dysfunction in the context of obesity. We fully acknowledge that the link between C16:0 ceramide and mitochondrial fragmentation has been previously established. However, to our knowledge, our study is the first to connect this phenomenon to a transcriptional safeguard mechanism, thereby providing a new layer of understanding of how transcription factors preserve mitochondrial integrity and function in brown adipocytes. We believe this conceptual advance adds significant value to the field by framing E4BP4 as a transcriptional "guardian" of mitochondrial homeostasis.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      • *Reviewer #1 comment:

      Figures B: Sample size of EE experiments is too low to draw any meaningful conclusions or to know for certain if the data are reproducible. Small sample sizes, likely coming from one litter and one batch of AAV are prone to type I error.

      Response: We agree with reviewer observation that increasing sample size is essential to confirm reproducibility and robustness. We have therefore planned to repeat the EE experiments with a larger number of mice per group, derived from independent litters and AAV preparations, in order to strengthen the statistical power and validate the phenotype observed in the current study.

      Reviewer #1 comment:

      Figure 3I: Why do cells (none of the groups) show no response to NE stimulation? Please clarify or provide potential mechanistic insight. Perhaps the cells were not differentiated well.

      __ ____Response:__ We agree that the absence of a robust NE response in Figure 3I requires further clarification. To address this, we have planned to repeat the in vitro oxygen consumption assay to confirm the phenotype presented in the study.

      Reviewer #1 comment: Figures 3I vs 5N. There is a striking discrepancy between these panels. In both, cells were treated with palmitate for 6 h, yet the NE and CCCP responses differ significantly. Are these the same cell types and conditions? Please reconcile the differences.

      Response: We would like to clarify that Figures 3I and 5N represent different experimental systems: Figure 3I shows data from primary brown adipocytes with E4bp4 transgene overexpression, whereas Figure 5N shows data from immortalized brown adipocytes with Cas9-mediated mutation of a 65 kb Cers6 enhancer site. Given the distinct cell types and genetic manipulations, a direct comparison between these two panels is not appropriate. Nevertheless, we agree that confirming the consistency of the phenotype across systems is important. To address this, we have planned to repeat oxygen consumption assays in both models to further validate the reproducibility of the observed effects.

      Reviewer #2 comment: A key experiment is missing: does adding C16:0 block the mitochondrial benefits of E4BP4-OE?

      Response: We thank the reviewer for this excellent suggestion. We agree that a rescue experiment is important to directly test whether C16:0 affects the mitochondrial benefits of E4BP4. To address this, we have planned to perform a co-overexpression of E4bp4 and Cers6 in brown adipocytes. The readouts will include mitochondrial morphology and oxygen consumption, enabling us to determine whether restoration of C16:0 production mitigates the protective mitochondria effects of E4BP4 overexpression. This experiment will provide direct mechanistic confirmation of the proposed model.__ __

      __Reviewer #2 comment: __Whether PRDM16-OE mimics the effects of E4BP4 to induce p-Drp1 is not shown.

      __Response: __We thank the reviewer for this valuable suggestion. We agree that testing whether PRDM16 overexpression mimics the effects of E4BP4 on p-Drp1 is important to strengthen the mechanistic link between these transcription factors in terms of regulation of mitochondrial fragmentation. To address this, we have planned to include a Western blot analysis of p-Drp1 in the PRDM16-OE in brown adipocytes.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      • *Reviewer #1 comment:

      Figure 1F: There is an unexpected dip in gene expression at cold exposure days 3 and 7, followed by a rebound at day 14. Is this fluctuation biologically meaningful or technical?

      Response: We thank the reviewer for this thoughtful observation. A previous study demonstrated that E4bp4 (Nfil3) expression displays an early increase (at 2 hours), followed by a decrease in magnitude - while still remaining significantly higher than control - during beige adipocyte differentiation in response to forskolin treatment (DOI: 10.1016/j.molmet.2022.101619). The authors of that study suggested that E4bp4 may contribute to a second wave of cAMP-driven beige adipocyte differentiation. However, in the context of our work, further discussion on whether the fluctuations in BAT E4bp4 expression observed during cold exposure reflect biological regulation would be speculative. Importantly, despite these oscillations across time points, E4bp4 expression remained statistically significant compared with control, supporting the robustness of our findings. We have now introduced this observation in the Results section of the revised manuscript.

      Reviewer #1 comment: Figures 2H and 2I (GTT): How was the AUC calculated? The GTT and ITT curves appear largely parallel aside from fasting differences. If total AUC was used instead of incremental AUC, it may overstate group differences. The recommended method is outlined in [DOI: 10.1038/s42255-021-00414-7]. Also, since insulin's half-life is ~10 minutes, later differences in the ITT curve likely reflect counterregulatory responses driven by hepatic gluconeogenesis.

      Response: We would like to clarify that in our original manuscript we had already calculated the area of the curve (AOC) rather than the area under the curve (AUC), following the recommended approach (DOI: 10.1038/s42255-021-00414-7). Specifically, the AOC was derived by subtracting the baseline glucose value from each subsequent time point, ensuring that the analysis reflects incremental changes rather than absolute glucose levels. We have now made this description more explicit in the revised version to avoid any ambiguity.

      __Reviewer #1 comment: __Figure 4F: How was mitochondrial fragmentation quantified? Please ensure that the ROI boxes shown in zoomed panels match the same region in size and shape - this applies throughout the manuscript.

      __ _Response: _We thank the reviewer for this valuable comment. To improve the quality and interpretation of the data, we have now included a quantitative analysis of mitochondrial morphology parameters associated with Figure 4F (Figure S4B)__. Specifically, we analyzed:

      • Mitochondrial volume (µm³): reflecting overall mitochondrial size.
      • Surface area (µm²): reflecting membrane expansion.
      • Sphericity index: indicating morphological rounding, which increases with fragmentation.
      • Number of branches and branch junctions per mitochondrion: reflecting mitochondrial networking and fusion. Myriocin treatment preserved mitochondrial volume and surface area, reduced sphericity, and increased both the number of branches and branch junctions, reflecting maintenance of a more interconnected mitochondrial network.

      Additionally, we verified that the ROI boxes shown in the zoomed panels are consistent in both size and shape across groups, as requested. We have now introduced this observation in the Methods section of the revised manuscript.

      __ ____Reviewer #1 comment: __Figure 3A: The claim that one group contains smaller mitochondria is not convincing. Both small and elongated mitochondria appear in each group. Moreover, it is unclear whether these minor differences are of any physiological relevance or whether they drive phenotypes.

      Response: We respectfully disagree with this observation and would like to clarify a few points.

      1. We have already demonstrated a statistically significant difference in mitochondrial length between E4bp4-OE and control groups (Figures 3B and 3C). This was based on a random, unbiased analysis, which consistently confirmed longer mitochondria in E4bp4-OE compared with control.

      Some degree of variability in mitochondrial length is expected in electron microscopy analyses, particularly because mitochondria from multiple cell types within iBAT are captured. It is important to note that the protective action of E4bp4 against mitochondrial fragmentation occurs specifically in brown adipocytes, where the transgene is expressed under the control of the adiponectin promoter.

      To address the potential confounding heterogeneity of iBAT mitochondria, we performed complementary cell-autonomous analyses in vitro, allowing us to directly compare mitochondrial dynamics in E4bp4-OE versus control brown adipocytes. This analysis further confirmed that E4bp4-OE prevents lipid overload - induced mitochondrial fragmentation in brown adipocytes.

      Finally, we emphasize that several studies have demonstrated that changes in mitochondrial dynamics, particularly under high-fat diet conditions, disrupt systemic energy homeostasis (DOI: 10.1016/j.cmet.2017.05.010; DOI: 10.1016/j.cell.2019.05.008; DOI: 10.1038/s42255-024-00978-0). Therefore, the differences we report are biologically meaningful in the broader context of mitochondrial dynamics and metabolic disease.


      __Reviewer #1 comment: __Figure 3E: The claim that confocal microscopy reveals palmitate-induced mitochondrial fragmentation is difficult to discern. The images lack clear morphological differences.

      __ _Response: _We thank the reviewer for this observation. To improve the interpretation of these results, we have now included a quantitative analysis of mitochondrial morphology parameters associated with Figure 3E.__ Specifically, we measured:

      • __Mitochondrial volume (µm³): __reflecting overall mitochondrial size.
      • __Surface area (µm²): __reflecting membrane expansion.
      • __Sphericity index: __indicating morphological rounding, which increases with fragmentation.
      • Number of branches and branch junctions per mitochondrion: __reflecting mitochondrial networking and fusion. __ __As shown in the new analysis (Figure S4A)__, palmitate treatment reduced mitochondrial volume, surface area, branches, and branch junctions, while increasing sphericity, consistent with a more fragmented phenotype in control cells. In contrast, these effects were significantly attenuated in E4bp4-OE cells, supporting our conclusion that E4BP4 overexpression protects against lipid overload-induced mitochondrial fragmentation. This text was added in the Results section of the revised manuscript.

      We believe this additional analysis strengthens the robustness of our findings and provides clear quantitative evidence for the morphological changes that were less apparent from qualitative image inspection alone.

      __Reviewer #1 comment: __Figure 3G: Dendra2-labeled mitochondria appear unaffected by palmitate, raising concern about the robustness of the effect across readouts.

      __ _Response: _We respectfully disagree with this observation. As shown in Figure 3G__ (bar graphs), palmitate-treated brown adipocytes exhibited a clear reduction in mitochondrial co-localization, which reflects lower levels of fused mitochondria, in the control group compared with E4bp4-OE. Importantly, no difference in mitochondrial co-localization was observed between the two groups under vehicle-treated conditions. This indicates that E4bp4 overexpression does not promote mitochondrial fusion per se, but rather prevents lipid overload - induced mitochondrial fragmentation. We also note that the representative images presented in Figure 3G are single snapshots taken from a time-lapse assay of mitochondrial dynamics. To further illustrate this effect, we direct the reviewer to the supplementary video accompanying this experiment, which clearly demonstrates the differences in mitochondrial behavior over time.

      __ ____Reviewer #1 comment: __Figure 5H: Were E4BP4 expression levels equivalent between WT and mutant cells? Quantification should be shown. Figure 5H: Were E4BP4 expression levels equivalent between WT and mutant cells? Quantification should be shown.

      Response: __We thank the reviewer for this important point. We have now added the quantification of E4bp4 mRNA levels in cells transduced with either the non-mutated vector (control) and the vector carrying a mutation in the E4bp4 DNA-binding domain (Figure S5)__. The data show no significant difference in E4bp4 expression between the two groups.

      __Reviewer #2 comment: __The evidence of mitochondrial fragmentation is not convincing. In the reviewer's opinion, Figures 3E, 3G, 4F, and 5M demonstrated a decrease in mitochondrial quantity, but not fragmentation.

      Response: __We thank the reviewer for this observation. We have already addressed the comments from reviewer #1 (above) regarding Figures 3E, 3G and 4F related to measurements of mitochondria fragmentation. To strengthen the interpretation of these results, we have also performed a quantitative analysis of mitochondrial morphology parameters associated with __Figure 5M. Specifically, we measured:

      • __Mitochondrial volume (µm³): __reflecting overall mitochondrial size.
      • __Surface area (µm²): __reflecting membrane expansion.
      • __Sphericity index: __indicating morphological rounding, which increases with fragmentation.
      • Number of branches and branch junctions per mitochondrion: __reflecting mitochondrial networking and fusion. As shown in the new analysis (Figure S4C), palmitate treatment significantly reduced mitochondrial volume, surface area, and branching, while increasing sphericity, consistent with enhanced mitochondrial fragmentation in control cells. Notably, these changes were significantly blunted in the Cers6 enhancer edited cells (EΔ), supporting our conclusion that disruption of Cers6 protects against lipid overload-induced mitochondrial fragmentation. __This text was added in the Results section of the revised manuscript.

      Regarding the reviewer's understanding of a "decrease in mitochondrial quantity, but not fragmentation," we respectfully disagree. The analyses performed for Figures 3E, 3G, 4F, and 5M clearly demonstrate that E4bp4 overexpression (E4bp4-OE) prevents lipid overload -induced mitochondrial fragmentation.

      In relation to mitochondrial quantity, our data do not support differences in mitochondrial biogenesis between groups. Specifically, the expression of thermogenic and mitochondrial biogenesis genes (Figure S2G) as well as the mitochondrial-to-nuclear DNA ratio (Figure S3D) showed no significant changes, indicating that mitochondrial biogenesis is not altered.

      Alternatively, it is possible that E4bp4 prevents mitophagy, as our results (Figure 3H) show that E4bp4-OE protects against lipid overload-induced mitochondrial depolarization. In this regard, previous studies have demonstrated that fragmented and depolarized mitochondria are targeted for degradation through mitophagy (DOI: 10.2337/db07-1781; DOI: 10.1074/jbc.M111.242412). While this explanation is consistent with our findings, we acknowledge that it remains speculative at this stage and, although interesting, is beyond the scope of the current study.

      __Reviewer #2 comment: __It is confusing whether the association shown in Figure 1C is a positive or an inverse association.

      Response: __We thank the reviewer for pointing out this source of confusion. __Figure 1C represents common variant associations for E4BP4, where the y-axis indicates the strength of association (-log10 p-value) rather than the direction (positive or inverse) of the effect. We have clarified this in the revised manuscript to avoid misinterpretation. The associations indicate that genetic variants in E4bp4 are positively linked with anthropometric traits such as weight, BMI, and waist-hip ratio.

      5. Description of analyses that authors prefer not to carry out

      *Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. *

      This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *Reviewer #2 comment:

      It would be worthwhile to investigate whether in vivo knockdown of E4BP4 blunts the Cers6-suppressing effects of PRDM16-OE.

      Response: We agree that assessing in vivo loss-of-function of E4bp4 in the context of Prdm16 overexpression would be highly informative. At present, this experiment is technically not feasible, as it would require the generation and characterization of complex in vivo models beyond the scope of the current study. Nevertheless, we are actively considering this as a future direction. In the meantime, we believe that the in vitro experiments in brown adipocytes provided here are sufficient to establish the mechanistic relationship between E4BP4 and PRDM16 in the regulation of Cers6 expression.

      __Reviewer #2 comment: __Whether E4BP4-OE affects cold tolerance in mice is now shown.

      __Response: __We thank the reviewer for this thoughtful comment. In our study, we performed an iBAT-specific E4bp4 gain-of-function assay because we observed a downregulation of E4bp4 expression in the context of obesity. The rationale for this approach was to rescue E4bp4 expression in iBAT and thereby evaluate its systemic and mechanistic effects under obesogenic conditions. We recognize that a gain-of-function assay during cold challenge would further enhance E4bp4 expression and, while interesting, this would more directly address the role of E4bp4 in thermogenic regulation rather than in obesity-related metabolic dysfunction. For this reason, we believe that a detailed investigation of E4bp4 in cold-induced thermogenesis is an important but separate question that lies beyond the scope of the current study.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The manuscript by Valdivieso-Rivera et al. investigated the role of a transcription factor, E4BP4, in brown fat functions. Using in vivo AAV gain-of-function studies, in vitro primary cultured brown adipocytes, and transcription regulation studies, authors identified that E4BP4 works together with PRDM16 to suppress Cers6 transcriptions and its derived ceramide C16:0 production. The resulted decreasing C16:0 prevents diet-induced mitochondrial fragmentation within brown adipocytes, thereby promoting brown fat functions. Overall, this study employed state-of-the-art methodologies and the collected evidence generally supported the conclusion. However, there are issues remaining to be addressed.

      Major Comments:

      1. The evidence of mitochondrial fragmentation is not convincing. In the reviewer's opinion, Figures 3E, 3G, 4F, and 5M demonstrated a decrease in mitochondrial quantity, but not fragmentation.
      2. Whether E4BP4-OE affects cold tolerance in mice is now shown.
      3. A key experiment is missing: does adding C16:0 block the mitochondrial benefits of E4BP4-OE?
      4. Whether PRDM16-OE mimics the effects of E4BP4 to induce p-Drp1 is not shown.

      Minor points:

      1. It is confusing whether the association shown in Figure 1C is a positive or an inverse association.
      2. Results from the PRDM16-OE model were mostly obtained in cultured brown adipocytes. It would be worthwhile to investigate whether in vivo knockdown of E4BP4 blunts the Cers6-suppressing effects of PRDM16-OE.

      Cross-commenting

      Reviewer #1's comments are all solid, and I agree with all of them.

      Significance

      Key strengths include state-of-the-art methodologies and detailed mechanistic studies. Key limitations include some unconvincing staining data, lack of key "rescue" experiments, and less novelty in molecular mechanisms (the ceramide-Drp1 pathway).

      Overall, this study uncovers a critical role of E4BP4 in maintaining brown adipocyte mitochondrial integrity and function, advancing our understanding of TFs in brown fat biology. This study well fits readers' interests in the adipose biology and metabolism field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary of the key results:

      Valdivieso-Rivera and colleagues present a novel regulatory mechanism by which E4BP4 modulates C16:0 ceramide production in brown adipocytes. Several points warrant clarification or additional data.

      Suggested improvements:

      1) Figure 1F: There is an unexpected dip in gene expression at cold exposure days 3 and 7, followed by a rebound at day 14. Is this fluctuation biologically meaningful or technical?

      2) Figures B: Sample size of EE experiments is too low to draw any meaningful conclusions or to know for certain if the data are reproducible. Small sample sizes, likely coming from one litter and one batch of AAV are prone to type I error.

      3) Figures 2H and 2I (GTT): How was the AUC calculated? The GTT and ITT curves appear largely parallel aside from fasting differences. If total AUC was used instead of incremental AUC, it may overstate group differences. The recommended method is outlined in [DOI: 10.1038/s42255-021-00414-7]. Also, since insulin's half-life is ~10 minutes, later differences in the ITT curve likely reflect counterregulatory responses driven by hepatic gluconeogenesis.

      4) Figure 3I: Why do cells (none of the groups) show no response to NE stimulation? Please clarify or provide potential mechanistic insight. Perhaps the cells were not differentiated well.

      5) Figure 4F: How was mitochondrial fragmentation quantified? Please ensure that the ROI boxes shown in zoomed panels match the same region in size and shape - this applies throughout the manuscript.

      5) Figures 3I vs 5N: There is a striking discrepancy between these panels. In both, cells were treated with palmitate for 6 h, yet the NE and CCCP responses differ significantly. Are these the same cell types and conditions? Please reconcile the differences.

      6) Figure 3A: The claim that one group contains smaller mitochondria is not convincing. Both small and elongated mitochondria appear in each group. Moreover, it is unclear whether these minor differences are of any physiological relevance or whether they drive phenotypes.

      7) Figure 3E: The claim that confocal microscopy reveals palmitate-induced mitochondrial fragmentation is difficult to discern. The images lack clear morphological differences.

      8) Figure 3G: Dendra2-labeled mitochondria appear unaffected by palmitate, raising concern about the robustness of the effect across readouts.

      9) Figure 5H: Were E4BP4 expression levels equivalent between WT and mutant cells? Quantification should be shown. Figure 5H: Were E4BP4 expression levels equivalent between WT and mutant cells? Quantification should be shown.

      Cross-commenting

      I agree with R2's points

      Significance

      This advance is incremental for the basic science community.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03098

      Corresponding author: Pedro Escoll

      1. General Statements

      Our study investigates the interplay between the metabolism of host cells and the intracellular replication of Salmonella enterica serovar Typhimurium (ST). Type III Secretion Systems (T3SSs) are considered essential for ST to replicate within macrophages. However, we found that restricting macrophages to different bioenergetic contexts, such as supplementing them with glycerol, modulates bacterial replication and remarkably, enables a T3SS-deficient ST mutant (ΔprgHssaV) to replicate intracellularly. This T3SS-independent replication occurs within the Salmonella-containing vacuole (SCV) and is driven by the capacity of the host cell to provide these preferred nutrients, rather than by the host glycolytic activity itself.

      2. Description of the planned revisions

      __Reviewer #1 (Evidence, reproducibility and clarity): __

      Summary:

      In this manuscript, the authors investigate how host cell metabolic heterogeneity influences the intracellular replication of Salmonella enterica serovar Typhimurium. They use live-cell imaging of infected human primary macrophages to reveal that bacterial replication does not occur uniformly across infected cells. They demonstrate that supplementation with specific carbon sources-used by Salmonella during infection-promotes bacterial replication and increases the proportion of macrophages supporting intracellular growth. These effects are seen even in the absence of functional Type III Secretion Systems (T3SS), using a ΔprgHssaV double mutant. The authors further suggest that this replication enhancement is not strictly dependent on host glycolytic activity but rather on the host cell's ability to import nutrients. Their findings imply that intracellular Salmonella can exploit host cell metabolism to grow, even without its canonical virulence secretion systems, under nutrient-favorable conditions.

      Major Concern:

      While the topic is potentially interesting, the novelty is not fully clear. The concept that nutrient availability impacts intracellular Salmonella replication, largely via T3SS2 function, has been addressed previously (e.g., Liss et al., 2017). The finding that added exogenous carbon sources can enhance bacterial growth is thus not unexpected. The key claim-that Salmonella can replicate intracellularly even in the absence of T3SS function-would be significantly strengthened by demonstrating whether this is specific to Salmonella, or whether similar effects are seen with non-intracellular organisms such as E. coli K-12. If the phenomenon is unique to Salmonella, this would suggest a pathogen-specific mechanism beyond general metabolic support.

      As acknowledged by the Reviewer, the novelty and key claim of our work is that Salmonella can replicate intracellularly even in the absence of T3SS. To experimentally sustain that claim, we showed evidence that providing macrophages with the preferred carbon sources used by Salmonella during infection, such as glycerol, bypass the requirement of both T3SS by Salmonella to grow, intravacuolarly, inside macrophages.

      With respect to the article mentioned by the Reviewer (Liss et al. 2017, ref 36 in the manuscript), there are three important novel insights provided by our work: i) we show that Salmonella can replicate intracellularly in the SCV even in the absence of T3SS if certain carbon sources are provided; ii) we show the preference of Salmonella for certain carbon sources intracellularly such as glycerol and galactose (but not preferentially glucose); and iii) we have extended our observations to primary human macrophages in addition to RAW cells.

      We are not convinced that the experiment suggested by the Reviewer to use E. coli K12 (ECK12) is necessary to support our findings for Salmonella, but we propose to add the requested experiment. Briefly, we will infect hMDMs and RAW macrophages with ST-WT-GFP, ST-ΔprgHΔssaV or ECK12-WT-GFP, while culturing macrophages on different carbon sources (glucose, glycerol, galactose, fructose). Then we will monitor intracellular bacterial growth. By comparing bacterial growth of ST double mutant with ECK12-WT-GFP under favorable carbon sources such as glycerol, the results will be definitive to answer whether this phenomenon is unique to Salmonella or not.

      Specific Comments:

      1. Figure 1H: The effect shown here is not compelling due to inconsistent y-axis scaling. Panels 1B, 1C, and 1D should use a unified axis range with 1H to allow direct visual comparison of growth dynamics.

      Thank you, we will change it as suggested.

      Figures 1B, 1C, 1G, 1H: The current presentation of individual growth traces makes it difficult to appreciate the population-level trend. A smoothed average line overlaid on these plots could better represent the average dynamics of replicative vs. non-replicative infections. Or alternatively the total fraction of cells that proliferate summarized as a segmented bar plot (possibly binned per time point).

      We will plot the results as suggested, the total fraction of infected cells harboring bacteria that proliferate as a segmented bar plot, binned per time point.

      Figure 2G: This panel would benefit from including a comparable condition with the SPI-1/SPI-2 double mutant to aid interpretation. Additionally, the authors should explore whether this nutrient-supported replication is seen in non-phagocytic cells such as HeLa or Caco-2, which would help delineate whether the observed phenomenon is macrophage-specific.

      The graph asked by Reviewer is Figure S1D. As we are representing ST growth in macrophages supporting Salmonella replication, some of the conditions, such as lactate, cannot be shown in the infection conditions using the double mutant because there are no cells supporting the replication of the double mutant, so there are no cells to plot.

      As suggested, we are also going to perform the same experiments in HeLa cells to investigate whether the observed phenomenon is macrophage specific.

      Line 117: The sentence stating that the double mutant can undergo "exponential intracellular growth even in the absence of T3SS-dependent secretion" is an overstatement. The data suggest only a modest improvement in growth, restricted to a minority of infected cells. This claim should be revised accordingly, as should similar overstatements in the discussion (e.g., lines 203-204).

      We will remove the term 'exponential' and revise the sentence at line 117 and those in the discussion. Line 203-204 will be: 'we demonstrated that providing macrophages with preferred nutrients allows a subpopulation of ST to replicate intracellularly without the need for a functional T3SS'.

      Line 162: The authors should clarify that glycerol had the strongest effect in primary macrophages, while multiple alternative carbon sources had notable effects primarily in RAW cells.

      We will add this clarification in the text.

      Lines 198-201: This relates to the major concern. The authors should assess whether the observed growth enhancement is unique to Salmonella by testing other bacteria not known for intracellular replication. This would clarify whether the effect is due to general nutrient-driven host cell permissivity or a pathogen-specific adaptation.

      As outlined above, we will perform the suggested experiment with E. coli K12 to answer whether this phenomenon is unique to Salmonella or not.

      RAW 264.7 Observations: The modest intracellular growth of SPI-1/SPI-2 double mutants in RAW cells is consistent with prior observations in the field. The idea that nutrient availability explains this is noteworthy. The authors might consider whether differences in standard culture media (e.g., glucose concentration) influence these outcomes. This could have broader implications for reproducibility in infection models.

      Thank you for the suggestion, we will include a paragraph discussing whether differences in standard culture media might influence bacterial replication. Indeed, to answer also a question from Reviewer #2, we will include a new supplementary Figure where we have already compared "no Glucose" (0 mM), "low Glucose" (2 mM) and standard culture media Glucose levels (10 mM). Our results show that differences in Glucose levels in the culture media influence Salmonella intracellular growth in hMDMs and RAW macrophages (see Figure below).

      Reviewer #1 (Significance):

      This manuscript highlights how host cell metabolism and nutrient availability can influence intracellular Salmonella replication. While the findings are intriguing, the current framing overstates their novelty and impact. Key revisions-such as comparative experiments with non-pathogenic bacteria and non-phagocytic cells, consistent figure scaling, and more measured language-would improve the clarity and significance of the work. If the authors can show Salmonella-specific mechanisms at play, the study could offer important insights into host-pathogen metabolic interactions.

      We believe that performing all experiments suggested by the Reviewers, as well as the requested changes in the text to avoid overstatements, will improve the manuscript and will offer readers new insights and details to better understand the metabolic interactions happening between host and pathogens and how they can shape bacterial virulence.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary: In their study titled "Provision of Preferred Nutrients to Macrophages Enables Salmonella to Replicate Intracellularly Without Relying on Type III Secretion Systems", Dr. Garcia-Rodriguez et al. describe the influence of the host cell metabolism on the intracellular proliferation potential of Salmonella during infection. The authors investigate whether the supplementation of the media with different carbon sources has an impact on the intracellular lifestyle of Salmonella. By using single cell tracking in live-cell microscopy, including the use of different reporter strains, they describe that glycerol benefits Salmonella's ability to grow within its vacuolar niche, in part, interestingly, in a Type-3-Secretion System independent manner.

      They furthermore highlight the dependence on host background for this observation by showing that effects differ between cells of varying metabolic activity. Throughout their study, they use cutting-edge methodologies, as well as Salmonella strains that could be of versatile use in other investigations. This work, while limited to in vitro models for now, has implications for the better understanding of how pathogens and their host are intertwined. This, in turn, has significance for the development of new anti-infective strategies further down the line. I therefore believe that it should be disseminated to the research community. The following comments summarize ideas how the quality of the study could be improved:

      Major comments:

      1. Salmonella, especially when cultured to activate the SPI-1 T3SS, introduce rapid cell death in their host - most commonly through activation of the NLRC4 inflammasome and downstream pyroptotic signaling. The authors don't describe the effect of the infection in differently supplemented media on host cell death, yet it would be important to elucidate whether this cellular response is also altered.

      We have performed these experiments and tracked host cell death by measuring Annexin-V levels in single cells, during infection in the conditions using the different supplements. We will include these results in the revised version of the manuscript and main text. Please see the Figure below showing that the different carbon sources did not affect macrophages cell death significantly (future Figure S1E and S1F)

      The aspect of partially T3SS-independent growth enhancement by glycerol (and depending on the host background glucose) is most curious. The authors quantify this by determining the percentage of cells containing proliferating Salmonella and by tracking individual cells over the time course of the infection. I am missing a general statement on whether the initial infection rate (i.e. timepoint 0) is comparable across conditions and mutants, and whether possible discrepancies in the infection rate could have downstream effects on the statements and claims made in the manuscript. This is, to my mind, also important for the quantification of cytosolic and vacuolar bacteria. There, the authors always speak in "percent of infected cells", so it is relevant whether the number of infected cells varies among conditions (see e.g. Figure 3).

      We thank the reviewer for this comment. The initial infection rate at t=0 significantly differs between WT and mutants in RAW 264.7 macrophages, and carbon source supplementation has no effect. However, as we only analyze infected cells, this does not affect the final results. In any case, we are going to add the graphs of % of infected cells at t=0 as supplementary Figures S1G-K.

      The authors use a concentration of 10mM for all supplemented alternative carbon sources. It would be useful to discuss the rationale behind this approach, including whether all chemicals have the same ability to be taken up by the cell. A concentration series (at least for some of the tested compounds) may be beneficial to bolster the conclusions that the authors make.

      We use 10 mM as this is the concentration of Glucose in standard culture media. By using 10 mM for all the different carbon sources, we can thus compare them keeping concentration constant (10 mM). Indeed, to answer also Reviewer #1, we will include in the manuscript a paragraph discussing whether differences in standard culture media might influence bacterial replication. As this Reviewer suggested, we will include a new supplementary Figure comparing no Glucose (0 mM), low Glucose (2 mM) and standard culture media Glucose levels (10 mM), showing that the concentration of glucose has a gradual effect in supporting the replication of the T3SS-deficient strain in RAW macrophages (see Figure below).

      I think it would strengthen the study, if the authors used host cell mutants in certain metabolite transporters, or alternatively Salmonella mutants that are deficient in uptake or metabolism of some of the compounds used in this study. This point is alluded to in the discussion, and I believe if the authors could show that in certain host mutant backgrounds the impact of supplementation with alternative carbon sources can be reversed, it would immensely bolster the strength of the claims.

      Following Reviewer's suggestion, we generated ST metabolic mutants unable to metabolize glycerol, galactose or fructose. As seen in the Figures below, during infection, the supplementations with glycerol/galactose does not boost Salmonella replication in metabolic mutants as in WT conditions, demonstrating that supplemented carbon sources indeed arrive to bacteria within the SCV and are used by intracellular Salmonella to grow. This Figures will be now Future Figure 4J-N.

      I think it would be useful to include the meaning of this work for other intracellular pathogens in the discussion section: Do the authors believe that this phenotype is Salmonella-specific? If the pathogens are at hand, it might be interesting to infect with other intracellular bacteria, such as Shigella or Francisella to investigate if the boosting of growth by glycerol also holds true for these.

      We have performed experiments with Legionella pneumophila and galactose (see figure below), showing that this carbon source is specific of Salmonella (as shown in Figure 4F in the manuscript). We could perform experiments also with L. pneumophila and glycerol to answer the Reviewers question. However, we think that the results with Legionella might be out of the focus of this article and would constitute themselves a new article, as both pathogens have a very different, non-comparable intracellular metabolism. Thus, the experiment suggested by Reviewer #1 using E. coli K12 (ECK12) while culturing macrophages on different carbon sources (glucose, glycerol, galactose, fructose) is in our opinion a better fit. We will monitor intracellular bacterial growth and, by comparing bacterial growth of the ST-ΔprgHssaV double mutant with ECK12-WT-GFP under favorable carbon sources such as glycerol, the results will be definitive to answer whether this phenomenon is unique to Salmonella or not.

      Minor comments:

      • Line 41: The authors write "are required for", but given their findings, it might be more accurate to phrase this as "have previously been described to be required for" or "have previously been described essential for".

      We will change it.

      • Line 86: Is the referencing of Figure S1C correct or should it be S1A?

      Yes, thank you, it is S1A, we will change it.

      • Lines 119,120: Related to what is displayed in Figure 2G: Are these differences significant?

      Glucose, galactose and lactate curves are significantly different compared to control (p

      • Lines 126,127: What is the change for glycerol, and is the intracellular growth significantly higher compared to the control?

      6,2 {plus minus} 1.9% in glycerol vs. 2 {plus minus} 1% in control, p

      • Figure 1E&F: Related to one of the major comments: Would it be possible to quantify this at timepoint 0 to ensure that the initial infection rates are the same across conditions?

      As outlined above, we will add the graphs of % of infected cells at t=0 as supplementary Figures S1G-K (Major Comment number 2 from this Reviewer)

      • Figure 3E,F: Why does the sum of the curves not add up to 100% (especially in the beginning)? And related to that, why do both the percentage of cytosolic and vacuolar cells grow over time? Since this infection is performed with gentamycin present, re-infection should not be possible.

      The localization module of the SINA plasmid relies on transcriptional reporters, whose expression requires time for induction and detection. Therefore, at early time points, infected cells are not classified as vacuolar or cytoplasmic because the reporters have not yet been expressed (as described in PLoS Pathog. 2021;17(4):e1009550, PMID: 33930101).

      At later time points, a subset of cells harbors bacteria that do not express any of the reporters. These bacteria are considered dormant, representing about 10% of the population, as detailed in the same article. In addition, a small percentage of infected cells simultaneously contain both STvac and STcyt. Such cells are subclassified as harboring STcyt but also STvac. Consequently, the total proportion of infected cells carrying STvac and STcyt may also exceed 100%.

      • Figure S1A: While significance testing is described in the legend, there are no indications of significance in the figure panels.

      The Reviewer is right, there is no significant changes between conditions, we will change the significance testing to ns=non-significant.

      • Figure S1B: Due to the stark discrepancies between hMDMs and RAW264.7, it might make sense to plot them on two different y-axes. Furthermore, I would clarify the y-axis: In the legend, it seems as CFU counts are shown, while CFU/ml/t2 rather describes a change over time.

      We agree. However, we will maintain the scale of the Y-axis as it was required by Reviewer #1 to be consistent with Y-axis. We will change the legend to indicate that we plot CFU/ml/t2.

      • Figure S1C: The prgH-mutant seems to outperform the wildtype in intracellular proliferation, while the double mutant underperforms compared to the ssaV-mutant. Could you please discuss/explain how the prgH-deletion has seemingly opposite effects on intracellular proliferation, depending on whether it is introduced in a wildtype or ssaV-KO background?

      As T3SS-1 plays a role in inducing macrophage cell death via activation of the NLRC4 inflammasome, macrophages infected with bacteria carrying a functional T3SS-1 (such as WT), are more prone to undergo cell death at late time-points, which disrupts bacterial proliferation and reduces the proportion of infected cells. Thus, these dead cells were not considered in the analysis. Even if cell death of ST-WT-infected RAW macrophages remains below 5%, more ΔprgH-infected cells are considered in the analyses at late time-points, and ST-ΔprgH continue replicating (and growing in ST area).

      • Figure S2A: As for the comments related to Figure 3, I am unsure how the sum of STvac and STcyt can deviate from 100. This is especially puzzling for the red curve (glycerol) at e.g. 3hpi, when the sum of the two clearly seems to be larger than 100.

      At early time points, no infected cells are classified as vacuolar or cytoplasmic because the reporters have not yet been expressed. At later time points, a subset of cells harbor bacteria that do not express any of the reporters, which are considered dormant (10% of the population). Finally, a small percentage of infected cells simultaneously contain both STvac and STcyt, therefore the total proportion of infected cells carrying STvac and STcyt may also exceed 100%.

      **Cross-commenting** I agree in principle with the comments raised by Reviewer #1 - especially when it comes to the enhancement in significance if the authors assess the species specificity. Elucidating whether the growth enhancement is Salmonella-specific, occurs for other intracellular pathogens (e.g. Shigella, Francisella) or also for extracellular bacteria (e.g. E. coli, Yersinia), would definitely strengthen the study.

      As said before, for the revision we are going to perform the experiments suggested by Reviewer #1 of using E. coli K12 (ECK12) while culturing macrophages on different carbon sources (glucose, glycerol, galactose, fructose). And to satisfy this Reviewer's curiosity, we are going to perform experiments also with L. pneumophila and glycerol.

      Reviewer #2 (Significance):

      General assessment:

      As the authors write in their discussion, the strength of this study is also it's limitation: Using single cell tracking in microscopy is a very elegant and powerful approach, yet conversely, it limits the scope of the study to in vitro approaches. While it enables assessment of bacterial pathogenicity and host-dependence on a single-cell level, it remains to be investigated whether the conclusion that the authors draw from their work will hold in more complex or physiologically relevant models.

      During the preparation of this Revision Plan, we discovered the article published in PLoS Pathogens by Andrew Grant and Pietro Mastroni "Attenuated Salmonella Typhimurium Lacking the Pathogenicity Island-2 Type 3 Secretion System Grow to High Bacterial Numbers inside Phagocytes in Mice" (PLoS Pathog 2012 8(12): e1003070, PMID: 23236281). In this article, authors showed that our main conclusion is also relevant in vivo (Salmonella Typhimurium can replicate within macrophages in the absence of T3SS). This will be addressed in the Discussion of the revised manuscript. Our study provides a metabolic explanation, at the single cell level for those observations.

      A further small shortcoming of the study is the heavy focus on the bacterial aspect in this host-pathogen interaction. While the authors do link the proliferative potential of the intracellular bacteria to the metabolic status of the individual host cell, more could be done with respect to host responses in the varying media compositions, including investigating alterations to the cell cycle, induction of cell death, or the ability to activate inflammatory signaling.

      We agree, and we are actively investigating how restricting macrophages to specific carbon sources impact other host responses, such as cytokine production. For the revised manuscript, we will add the results on the induction of cell death.

      Nonetheless, this study is of large interest to the field and the systematic approach to addressing their hypotheses speaks to the scientific excellence of the investigators.

      Thank you.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      N/A

      • *

      4. Description of analyses that authors prefer not to carry out

      N/A

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In their study titled "Provision of Preferred Nutrients to Macrophages Enables Salmonella to Replicate Intracellularly Without Relying on Type III Secretion Systems", Dr. Garcia-Rodriguez et al. describe the influence of the host cell metabolism on the intracellular proliferation potential of Salmonella during infection. The authors investigate whether the supplementation of the media with different carbon sources has an impact on the intracellular lifestyle of Salmonella. By using single cell tracking in live-cell microscopy, including the use of different reporter strains, they describe that glycerol benefits Salmonella's ability to grow within its vacuolar niche, in part, interestingly, in a Type-3-Secretion System independent manner.

      They furthermore highlight the dependence on host background for this observation by showing that effects differ between cells of varying metabolic activity. Throughout their study, they use cutting-edge methodologies, as well as Salmonella strains that could be of versatile use in other investigations. This work, while limited to in vitro models for now, has implications for the better understanding of how pathogens and their host are intertwined. This, in turn, has significance for the development of new anti-infective strategies further down the line. I therefore believe that it should be disseminated to the research community. The following comments summarize ideas how the quality of the study could be improved:

      Major comments:

      1. Salmonella, especially when cultured to activate the SPI-1 T3SS, introduce rapid cell death in their host - most commonly through activation of the NLRC4 inflammasome and downstream pyroptotic signaling. The authors don't describe the effect of the infection in differently supplemented media on host cell death, yet it would be important to elucidate whether this cellular response is also altered.
      2. The aspect of partially T3SS-independent growth enhancement by glycerol (and depending on the host background glucose) is most curious. The authors quantify this by determining the percentage of cells containing proliferating Salmonella and by tracking individual cells over the time course of the infection. I am missing a general statement on whether the initial infection rate (i.e. timepoint 0) is comparable across conditions and mutants, and whether possible discrepancies in the infection rate could have downstream effects on the statements and claims made in the manuscript. This is, to my mind, also important for the quantification of cytosolic and vacuolar bacteria. There, the authors always speak in "percent of infected cells", so it is relevant whether the number of infected cells varies among conditions (see e.g. Figure 3).
      3. The authors use a concentration of 10mM for all supplemented alternative carbon sources. It would be useful to discuss the rationale behind this approach, including whether all chemicals have the same ability to be taken up by the cell. A concentration series (at least for some of the tested compounds) may be beneficial to bolster the conclusions that the authors make.
      4. I think it would strengthen the study, if the authors used host cell mutants in certain metabolite transporters, or alternatively Salmonella mutants that are deficient in uptake or metabolism of some of the compounds used in this study. This point is alluded to in the discussion, and I believe if the authors could show that in certain host mutant backgrounds the impact of supplementation with alternative carbon sources can be reversed, it would immensely bolster the strength of the claims.
      5. I think it would be useful to include the meaning of this work for other intracellular pathogens in the discussion section: Do the authors believe that this phenotype is Salmonella-specific? If the pathogens are at hand, it might be interesting to infect with other intracellular bacteria, such as Shigella or Francisella to investigate if the boosting of growth by glycerol also holds true for these.

      Minor comments:

      • Line 41: The authors write „are required for", but given their findings, it might be more accurate to phrase this as „have previously been described to be required for" or „have previously been described essential for".
      • Line 86: Is the referencing of Figure S1C correct or should it be S1A?
      • Lines 119,120: Related to what is displayed in Figure 2G: Are these differences significant?
      • Lines 126,127: What is the change for glycerol, and is the intracellular growth significantly higher compared to the control?
      • Figure 1E&F: Related to one of the major comments: Would it be possible to quantify this at timepoint 0 to ensure that the initial infection rates are the same across conditions?
      • Figure 3E,F: Why does the sum of the curves not add up to 100% (especially in the beginning)? And related to that, why do both the percentage of cytosolic and vacuolar cells grow over time? Since this infection is performed with gentamycin present, re-infection should not be possible.
      • Figure S1A: While significance testing is described in the legend, there are no indications of significance in the figure panels.
      • Figure S1B: Due to the stark discrepancies between hMDMs and RAW264.7, it might make sense to plot them on two different y-axes. Furthermore, I would clarify the y-axis: In the legend, it seems as CFU counts are shown, while CFU/ml/t2 rather describes a change over time.
      • Figure S1C: The prgH-mutant seems to outperform the wildtype in intracellular proliferation, while the double mutant underperforms compared to the ssaV-mutant. Could you please discuss / explain how the prgH-deletion has seemingly opposite effects on intracellular proliferation, depending on whether it is introduced in a wildtype or ssaV-KO background?
      • Figure S2A: As for the comments related to Figure 3, I am unsure how the sum of STvac and STcyt can deviate from 100. This is especially puzzling for the red curve (glycerol) at e.g. 3hpi, when the sum of the two clearly seems to be larger than 100.

      Cross-commenting

      I agree in principle with the comments raised by Reviewer #1 - especially when it comes to the enhancement in significance if the authors assess the species specificity. Elucidating whether the growth enhancement is Salmonella-specific, occurs for other intracellular pathogens (e.g. Shigella, Francisella) or also for extracellular bacteria (e.g. E. coli, Yersinia), would definitely strengthen the study.

      Significance

      General assessment:

      As the authors write in their discussion, the strength of this study is also it's limitation: Using single cell tracking in microscopy is a very elegant and powerful approach, yet conversely, it limits the scope of the study to in vitro approaches. While it enables assessment of bacterial pathogenicity and host-dependence on a single-cell level, it remains to be investigated whether the conclusion that the authors draw from their work will hold in more complex or physiologically relevant models.

      A further small shortcoming of the study is the heavy focus on the bacterial aspect in this host-pathogen interaction. While the authors do link the proliferative potential of the intracellular bacteria to the metabolic status of the individual host cell, more could be done with respect to host responses in the varying media compositions, including investigating alterations to the cell cycle, induction of cell death, or the ability to activate inflammatory signaling.

      Nonetheless, this study is of large interest to the field and the systematic approach to addressing their hypotheses speaks to the scientific excellence of the investigators.

      Advance:

      The advance this study makes is rather on the foundational than the applied side - which does not mean that conclusions drawn in this work are not of interest to a wider field. By investigating the intracellular lifestyle on a single-cell level, the authors were able to observe a striking and curious phenotype: that certain alternative carbon sources can enhance intracellular proliferation in a T3SS-independent manner. By further dissecting the reason for this observation, they create a stronger base for their conclusion in what can be described as an overall comprehensive study.

      Audience:

      As outlined in the description of the main advances, this study will be of largest interest to members of the basic research community in host-pathogen interactions. While the study so far focuses on Salmonella, a well-described and genetically accessible intracellular model pathogen, it could also be of interest to a broader community of researchers investigating bacterial pathogenicity, as well as those that are interested in the host metabolism.

      Describe your expertise:

      I have a background in bacterial pathogenicity in Salmonella infection, and have since expanded to other pathogens, as well as co-infections with viruses. In addition to investigating the pathogens, I have expertise in dissecting the host response, with a focus on innate immunity, inflammasome activation and host cell death. Overall, I am accustomed to unbiased screening approaches, which are followed by the formulation and assessment of hypotheses to unravel the molecular mechanisms underlying the host-pathogen interface.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors investigate how host cell metabolic heterogeneity influences the intracellular replication of Salmonella enterica serovar Typhimurium. They use live-cell imaging of infected human primary macrophages to reveal that bacterial replication does not occur uniformly across infected cells. They demonstrate that supplementation with specific carbon sources-used by Salmonella during infection-promotes bacterial replication and increases the proportion of macrophages supporting intracellular growth. These effects are seen even in the absence of functional Type III Secretion Systems (T3SS), using a ΔprgH/ΔssaV double mutant. The authors further suggest that this replication enhancement is not strictly dependent on host glycolytic activity but rather on the host cell's ability to import nutrients. Their findings imply that intracellular Salmonella can exploit host cell metabolism to grow, even without its canonical virulence secretion systems, under nutrient-favorable conditions.

      Major Concern:

      While the topic is potentially interesting, the novelty is not fully clear. The concept that nutrient availability impacts intracellular Salmonella replication, largely via T3SS2 function, has been addressed previously (e.g., Liss et al., 2017). The finding that added exogenous carbon sources can enhance bacterial growth is thus not unexpected. The key claim-that Salmonella can replicate intracellularly even in the absence of T3SS function-would be significantly strengthened by demonstrating whether this is specific to Salmonella, or whether similar effects are seen with non-intracellular organisms such as E. coli K-12. If the phenomenon is unique to Salmonella, this would suggest a pathogen-specific mechanism beyond general metabolic support.

      Specific Comments:

      1. Figure 1H: The effect shown here is not compelling due to inconsistent y-axis scaling. Panels 1B, 1C, and 1D should use a unified axis range with 1H to allow direct visual comparison of growth dynamics.
      2. Figures 1B, 1C, 1G, 1H: The current presentation of individual growth traces makes it difficult to appreciate the population-level trend. A smoothed average line overlaid on these plots could better represent the average dynamics of replicative vs. non-replicative infections. Or alternatively the total fraction of cells that proliferate summarized as a segmented barplot (possibly binned per time point).
      3. Figure 2G: This panel would benefit from including a comparable condition with the SPI-1/SPI-2 double mutant to aid interpretation. Additionally, the authors should explore whether this nutrient-supported replication is seen in non-phagocytic cells such as HeLa or Caco-2, which would help delineate whether the observed phenomenon is macrophage-specific.
      4. Line 117: The sentence stating that the double mutant can undergo "exponential intracellular growth even in the absence of T3SS-dependent secretion" is an overstatement. The data suggest only a modest improvement in growth, restricted to a minority of infected cells. This claim should be revised accordingly, as should similar overstatements in the discussion (e.g., lines 203-204).
      5. Line 162: The authors should clarify that glycerol had the strongest effect in primary macrophages, while multiple alternative carbon sources had notable effects primarily in RAW cells.
      6. Lines 198-201: This relates to the major concern. The authors should assess whether the observed growth enhancement is unique to Salmonella by testing other bacteria not known for intracellular replication. This would clarify whether the effect is due to general nutrient-driven host cell permissivity or a pathogen-specific adaptation.
      7. RAW 264.7 Observations: The modest intracellular growth of SPI-1/SPI-2 double mutants in RAW cells is consistent with prior observations in the field. The idea that nutrient availability explains this is noteworthy. The authors might consider whether differences in standard culture media (e.g., glucose concentration) influence these outcomes. This could have broader implications for reproducibility in infection models.

      Significance

      This manuscript highlights how host cell metabolism and nutrient availability can influence intracellular Salmonella replication. While the findings are intriguing, the current framing overstates their novelty and impact. Key revisions-such as comparative experiments with non-pathogenic bacteria and non-phagocytic cells, consistent figure scaling, and more measured language-would improve the clarity and significance of the work. If the authors can show Salmonella-specific mechanisms at play, the study could offer important insights into host-pathogen metabolic interactions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the Reviewers for their kind and constructive comments. We are happy to read that the reviewers found our study methodologically robust and comprehensive in addressing the metabolic heterogeneity of endothelial cells.

      Reviewer 1, comment 1: Image quality in sprouting assays - The images presented for the sprouting assays (e.g., Figure 4) are of suboptimal resolution and quality, making it difficult to evaluate the effects of the various compounds on EC behavior. Even under control conditions, clear sprout-like structures are not readily discernible. Improved image resolution-preferably through high-quality bright-field microscopy-and the inclusion of immunofluorescence images of labeled endothelial spheroids are recommended to enhance interpretability.

      Response: We appreciate the reviewer’s concern and have revisited the sprouting assay images. Our approach is consistent with established methods in the field (Heiss et al., FASEB J, 2015), where brightfield imaging is routinely used for quantification without additional immunostaining. Hence, we believe that the brightfield images are of sufficient resolution to allow reproducible quantification of normalized total sprout length. All experiments were performed under identical imaging and analysis protocols, and thus we are confident that the quantification reflects true biological differences. We cite the reference in the revised manuscript and clarify it as well in the Methods section.

      Reviewer 1, comment 2: Validation of the quiescence model - The current approach to induce quiescence should be further substantiated. Beyond proliferation markers, additional hallmarks of quiescent cells-such as epigenetic signatures, protein quality control mechanisms, and translational activity-should be assessed to confirm that the EC subtypes achieve a bona fide resting state.

      Response: We acknowledge the value of proper phenotyping of quiescent cells. However, most studies involving quiescent (endothelial) cells rely on EdU incorporation or similar proliferation markers to confirm entry into a non-proliferative state (Kalucka et al., Cell Metabolism, 2018; Coloff et al., Cell Metabolism, 2016). In our study, we have used EdU staining and FACS analysis to establish cell cycle arrest. Moreover, we find clear proteomic patterns that support the case of a quiescent state. We have also demonstrated the reversibility of quiescence (see Suppl. Fig. 1c) via reseeding and proliferation recovery of all EC types, which is a defining functional hallmark of true quiescence. Together, the EdU, proteomic and reseeding/proliferation data provide strong evidence that our EC subtypes reach a physiologically quiescent, non-senescent state.

      Reviewer 1, comment 3: Reversibility of quiescence - It is important to demonstrate that the EC subtypes investigated can re-enter the cell cycle following release from contact inhibition. Without such evidence, the possibility remains that some of the observed metabolic features reflect a transition to senescence rather than reversible quiescence.

      Response: This is an excellent suggestion. We have included new data that shows that ECs regain proliferative capacity upon reseeding of quiescent ECs at lower confluency (Suppl. Fig. 1c). The results support the interpretation that the observed metabolic features reflect reversible quiescence rather than senescence.

      Reviewer 1, comment 4: Assessment of cell viability - While EC proliferation, migration, and sprouting were examined to infer functional roles of metabolic adaptations, analyses of cell viability and death are also necessary to evaluate potential homeostatic or survival-related functions of the observed metabolic changes.

      Response: We appreciate the Reviewer’s concern about cell viability in our experimental setup, and we agree that viability assessment is important. Using trypan blue staining and automated cell counting, we observed that >85% of ECs remained viable from day 1 through day 10 of the quiescence model and included these results in the manuscript (Suppl. Fig. 1b).

      Reviewer 1, comment 5: Validation of pharmacological findings - The pharmacological inhibition experiments are informative and constitute a central part of the study. However, given the possibility of off-target effects, key conclusions should be corroborated using alternative loss-of-function approaches, such as RNA interference (e.g., shRNA or siRNA).

      Response: We recognize the possibility of side effects for pharmacological inhibitors, but the inhibitors, including the ones that show the strongest different effects in HUVECs and iLECs (succinyl acetone and R162) in our study are well-established, selective inhibitors of glutamate dehydrogenase (Wang et al., Pharmacological Research, 2022) and δ-aminolevulinic acid dehydratase (Nauli et al., J Clin. Biochem. Nutr., 2023), respectively, and have not been reported to exhibit significant off-target activity in endothelial cells. Furthermore, the aim of our study was not to define specific mechanistic pathways, but to highlight phenotype-specific metabolic vulnerabilities in distinct endothelial states. Performing knockdown experiments would go beyond the scope and focus of this manuscript and introduce their own limitations, including off-target effects and, most importantly, timing mismatches relative to our long-term assays (e.g., sprouting assays assessed at day 3 versus transient RNAi effects lasting for only 1-2 days). We hope the Reviewer agrees that our current approach sufficiently supports the study’s conclusions.

      __Reviewer 2, comment 1: __it was not clear whether the authors worked with single donor endothelial cells or with mixed donors. This should be clarified as it is important for the statistical analyses (single donor based EC research typically uses n=4, while for the mixed donor, an n=3 is sufficient).

      Response: We thank Rreviewer 2 for highlighting that we did not include this information in the Methods section and we did so in the revised manuscript. HDBECs, HDLECs and iLECs are from single donors, HUVECs are from mixed donors. We acknowledge the reviewer’s concern about the power of statistical analyses, but we think that n=3 is sufficient with proper correction for statistical tests. Furthermore, previous in vitro studies with ECs are done with single donor cells and in biological triplicates (Wong et al., 2017; Kalucka et al., 2018; Simões-Faria et al., 2025 and more). Moreover, for sprouting assays, we have n > 3 for most conditions.

      Reviewer 2, comment 2: I would like to see a sentence on the importance of shear stress in EC behavior (metabolism) in the introduction. It was recently shown that the in vivo situation of ECs encountering wall shear stress (Faria et al, PMID: 39832080) affects the metabolic behavior switching to glutamine metabolism. This aligns with the research of the authors as well.

      Response: We thank Reviewer 2 for drawing our attention to this relevant and interesting study. We mention the study in the introduction and the discussion.

      Reviewer 2, comment 3: suggestion for the authors: it could be useful if a figure is introduced to show the "physiological" location of the 4 EC used and that a rationale is provided for this.

      Response: We have included this in Supplementary Figure 1 and in the text.

      Reviewer 2, comment 4: figures are of low quality, I found it very difficult to see the spheroid/sprouting images. This should be addressed in the final version prior publication.

      Response: The new version has higher quality sprouting images in figure 4 and 5. The images can also be found in high quality on BioStudies (Accession: S-BSST1716).

      Reviewer 2, comment 5: Fig 2 c: I'm not sure if this panel is very relevant, when looking into detail, opposite pathways are present (glycolysis - gluconeogenesis). As well, I'm not sure if galactose metabolism is truly relevant, unless the author managed to measure distinct hexose and hexose-phosphates? Given the flow injection analysis setup, I doubt this. Would suggest to move this to supplement or to simply leave it out.

      Response: The Reviewer is correct; the employed analytics cannot distinguish different hexoses and hexose-phosphates. We have moved figure 2c to supplementary figure 4c.

      Reviewer 2, comment 6: Fig 3 b: was there any statistics performed on these data to compare the different setups?

      Response: We performed statistical analyses on this data and included it in the figures and figure legends.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      By employing a proteomics and metabolomics approach the authors clarified the molecular landscape of 4 major EC types in quiescent and proliferating conditions. The study is extensive and adds novelty to the EC research

      Major comments:

      • it was not clear whether the authors worked with single donor endothelial cells or with mixed donors. This should be clarified as it is important for the statistical analyses (single donor based EC research typically uses n=4, while for the mixed donor, an n=3 is sufficient).
      • I would like to see a sentence on the importance of shear stress in EC behavior (metabolism) in the introduction. It was recently shown that the in vivo situation of ECs encountering wall shear stress (Faria et al, PMID: 39832080) affects the metabolic behavior switching to glutamine metabolism. This aligns with the research of the authors as well.
      • suggestion for the authors: it could be useful if a figure is introduced to show the "physiological" location of the 4 EC used and that a rationale is provided for this.
      • figures are of low quality, I found it very difficult to see the spheroid/sprouting images. This should be addressed in the final version prior publication.
      • Fig 2 c: I'm not sure if this panel is very relevant, when looking into detail, opposite pathways are present (glycolysis - gluconeogenesis). As well, I'm not sure if galactose metabolism is truly relevant, unless the author managed to measure distinct hexose and hexose-phosphates? Given the flow injection analysis setup, I doubt this. Would suggest to move this to supplement or to simply leave it out.
      • Fig 3 b: was there any statistics performed on these data to compare the different setups?

      Significance

      the study adds insights to the ongoing research on EC molecular behavior.

      using different types of ECs in both quiescent and proliferating mode, as well as the validation of pathways by introducing inhibitors combined with the sprouting assays is an asset.

      I would like to see stated the biological complexity of EC, it was recently shown that shear stress plays an important role in EC metabolism.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: 
 The study by Durot and colleagues explores the metabolic heterogeneity of endothelial cells (ECs) across distinct subtypes (blood vs. lymphatic) and growth states (proliferating vs. quiescent). Through integrated proteomic and metabolomic analyses, the authors demonstrate that quiescent ECs are not metabolically inactive but instead undergo subtype-specific metabolic reprogramming. Functional perturbation of key metabolic pathways using chemical inhibitors results in differential phenotypic responses in blood versus lymphatic ECs. Collectively, the findings underscore a critical, context-dependent role of metabolism in maintaining EC function and highlight metabolic specialization as a fundamental feature of endothelial diversity.

      General Comments: 
 This manuscript presents a comprehensive and methodologically robust investigation into the metabolic diversity of cultured ECs. By combining proteomic and metabolomic approaches, the authors provide novel insights into the distinct metabolic profiles of blood and lymphatic ECs, and how these profiles shift as ECs transition from a proliferative to a quiescent state. The observation that quiescent ECs exhibit active metabolic reprogramming, rather than simply entering a dormant state, is particularly compelling and challenges existing models of cellular quiescence.

      The work is timely, well-written and addresses a significant gap in our understanding of endothelial metabolism. The integration of large-scale omics data with functional perturbation experiments strengthens the overall conclusions and enhances the impact of the study.

      Nevertheless, while the data are largely convincing, certain experimental aspects-particularly those related to the in vitro sprouting assays-require further validation to solidify the mechanistic interpretations. Additionally, some findings would benefit from further validation using alternative approaches (e.g., chemical perturbation studies).

      Specific Comments:

      1. Image quality in sprouting assays - The images presented for the sprouting assays (e.g., Figure 4) are of suboptimal resolution and quality, making it difficult to evaluate the effects of the various compounds on EC behavior. Even under control conditions, clear sprout-like structures are not readily discernible. Improved image resolution-preferably through high-quality bright-field microscopy-and the inclusion of immunofluorescence images of labeled endothelial spheroids are recommended to enhance interpretability.
      2. Validation of the quiescence model - The current approach to induce quiescence should be further substantiated. Beyond proliferation markers, additional hallmarks of quiescent cells-such as epigenetic signatures, protein quality control mechanisms, and translational activity-should be assessed to confirm that the EC subtypes achieve a bona fide resting state.
      3. Reversibility of quiescence - It is important to demonstrate that the EC subtypes investigated can re-enter the cell cycle following release from contact inhibition. Without such evidence, the possibility remains that some of the observed metabolic features reflect a transition to senescence rather than reversible quiescence.
      4. Assessment of cell viability - While EC proliferation, migration, and sprouting were examined to infer functional roles of metabolic adaptations, analyses of cell viability and death are also necessary to evaluate potential homeostatic or survival-related functions of the observed metabolic changes.
      5. Validation of pharmacological findings - The pharmacological inhibition experiments are informative and constitute a central part of the study. However, given the possibility of off-target effects, key conclusions should be corroborated using alternative loss-of-function approaches, such as RNA interference (e.g., shRNA or siRNA).

      Significance

      In summary, this manuscript makes a substantial contribution to the field and is likely to stimulate further research into endothelial metabolic regulation. With additional experimental validation, the study has the potential to serve as a reference in both vascular and metabolic research.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this paper, the GFP-GBP system for mistargeting protein localization was used in fission yeast cells to discover new protein interactions involved in vesicular trafficking during cytokinesis. This approach uncovered a new association between the F-BAR protein Rga7 and its binding partner Rng10 with the Munc13 protein Ync13 at the cell division site. Additional associations were observed between Rga7-Rng10, Ync13 and the glucan synthases Ags1 and Bgs4, and the vesicle fusion protein Sec1. These interactions identified by the GFP-GBP system were further supported by co-immunoprecipitation experiments and by defining localization dependencies with live cell imaging in a variety of mutant strains. The imaging data are all of high quality and for the most part support the conclusions. However, in my opinion some of the interpretations are overstated, and the manuscript would benefit from providing additional mechanistic information. Major and minor recommendations are outlined below.

      Major suggestions 1. The co-IP data are interpreted to suggest that all the above-mentioned proteins form a single "big complex." However, as noted in the manuscript and reflected in the model, the multipass integral membrane proteins Bgs4 and Ags1 are embedded in the vesicle membrane and likely only indirectly associate with the scaffold Rga7-Rng10 via Ync13, without forming a 'complex'. One would expect the entirety of these vesicle contents to co-IP if the model is correct. The first paragraph of page 11 should be revised to more clearly reflect this scenario and to align with the proposed model.

      Response: We thank the reviewer for this thoughtful clarification. In the original manuscript, we stated “…indicating these proteins do interact or form big protein complexes… These results suggest that Rga7, Rng10, and Ync13 form a protein complex.” We agree that our initial wording may have unintentionally implied that all proteins detected in co-IP experiments assemble into a single, large physical complex. As the reviewer correctly noticed, the multipass integral membrane proteins Bgs4 and Ags1 are embedded within vesicle membranes and are more likely to associate indirectly with the Rga7-Rng10-Ync13 complex, rather than being part of one unified protein complex. To avoid overinterpretation, we have modified the last sentence of the first paragraph on the original page 11 as below: “These results suggest that Rga7, Rng10, and Ync13 do form a protein complex, although maybe dynamic and not super stable (see Discussion). Our data indicate that Rga7 interacts with both Ync13 and Rng10 to form a module on the plasma membrane for targeting of the vesicles containing cargos such as glucan synthases Bgs4 and Ags1. However, these glucan synthases are multipass integral membrane embedded proteins and likely only indirectly associate with the module Rng10-Rga7-Ync13, without forming a big protein complex.”

      Can Ync13 be artificially directed or tethered to the division site independently of Rga7-Rng10 (e.g., via Imp2)? If so, can this rescue the phenotypes of rga7Δ cells? This experiment could clarify whether Ync13 is the key functional effector of the Rga7-Rng10 complex.

      Response: We thank the reviewer for suggesting this interesting experiment. We agree that testing whether correctly localized Ync13 is sufficient to execute the division-site function of the Rga7–Rng10 complex would clarify its role. To test this, we artificially targeted Ync13 to the division site independently of Rga7 by tethering it to the scaffold protein Pmo25. Pmo25, an MO25 family protein, localizes to both the plasma membrane at the division site and the spindle pole body (mainly one of the SPBs) during mitosis and cytokinesis, enabling us to mislocalize Ync13 to these structures through GFP–GBP system. We did not use Imp2 because its localization pattern (mainly to the contractile ring [1, 2]) is different from Ync13. Microscopy revealed robust localization of Ync13 at the division site and the SPB in rga7Δ cells, and this tethered Ync13 persisted along the cleavage furrow throughout ring constriction. Importantly, enforced division-site localization of Ync13 significantly rescued the cytokinesis defects and cell lysis of rga7Δ. Consistently, growth assays on Phloxin B (PB) plate showed the elevated lysis/death in rga7Δ cells was rescued by Ync13 tethering to Pmo25-GBP. Together, these findings support that Ync13 is a key functional effector acting downstream of the Rga7–Rng10 scaffold at the division site. We have added these results in the new Figure 6 and associate text in the revised manuscript. We have also updated the model in Figure 8 to reflect this new result.

      1. Demeter J, Sazer S. imp2, a new component of the actin ring in the fission yeast Schizosaccharomyces pombe. J Cell Biol. 1998;143(2):415-27. PubMed PMID: 9786952.
      2. Martin-Garcia R, Coll PM, Perez P. F-BAR domain protein Rga7 collaborates with Cdc15 and Imp2 to ensure proper cytokinesis in fission yeast. J Cell Sci. 2014;127(Pt 19):4146-58. Epub 2014/07/24. doi: 10.1242/jcs.146233. PubMed PMID: 25052092.
      3. The authors should consider structural or computational modeling of the proposed Rga7-Rng10-Ync13 complex. Such analysis could offer insight into how these components interact and strengthen the proposed model. Response: We thank the reviewer for this valuable suggestion. Following the recommendation, we performed structural modeling of the Rga7–Rng10–Ync13 complex using AlphaFold3. Our previous work demonstrated that the F-BAR protein Rga7 forms a stable dimer and its F-BAR domain binds the C-terminal (aa751–1038) region of Rng10 [3]. Based on these findings, we constructed an input model consisting of two full-length Rga7 subunits, two Rng10(751–1038) subunits, and one full-length Ync13. The predicted structure revealed a modular organization in which Rng10(751–1038) associated strongly with the F-BAR domain of the Rga7 dimer, consistent with our prior biochemical data [3]. In addition, the model suggested that Ync13 interacted with the GAP domain of Rga7, positioning Ync13 in close proximity to the Rga7–Rng10 interface (Fig. S5, A, B, D and F). Further domain specific predictions confirmed the interactions between Rga7-GAP and Ync13 N-terminus (pTM: 0.63, ipTM: 0.64), two Rga7 F-BARs (pTM: 0.74, ipTM: 0.71), as well as Rga7 F-BAR and Rng10(751–1038) (pTM: 0.56, ipTM: 0.78) (Fig. S5, C-F). Overlay analyses revealed that the interacting domains align well with the structure of whole complex as the root mean square differences (RMSDs) are Liu Y, McDonald NA, Naegele SM, Gould KL, Wu J-Q. The F-BAR domain of Rga7 relies on a cooperative mechanism of membrane binding with a partner protein during fission yeast cytokinesis. Cell Rep. 2019;26(10):2540-8.e4. doi: 10.1016/j.celrep.2019.01.112. PubMed PMID: 30840879; PubMed Central PMCID: PMCPMC6425953.

      Minor text edits 1. Define "SIN" in the discussion section for clarity.

      Response: We defined the SIN pathway in the Discussion section as suggested: “At low restrictive temperatures, the lethality of mutant sid2, the most downstream kinase in the Septation Initiation Network, is partially rescued by upregulating Rho1. Thus, it has been suggested that the Septation Initiation Network activates Rho1, which in turn activates the glucan synthases [4].”

      Alcaide-Gavilán M, Lahoz A, Daga RR, Jimenez J. Feedback regulation of SIN by Etd1 and Rho1 in fission yeast. Genetics. 2014;196(2):455-70. Epub 2013/12/18. doi: 10.1534/genetics.113.155218. PubMed PMID: 24336750; PubMed Central PMCID: PMCPMC3914619.

      Figure S3, the protein schematics should start at residue "1" and not "0".

      Response: We apologize for the mistake. The schematics in revised figure (now Figure S4A) have been corrected to start at residue 1.

      Mass spectrometry data referenced in the text are not provided in the manuscript.

      __Response: __We apologize for the omission. The mass spectrometry data are now shown in Table S1. __

      __

      In Figure 4A. The Ags1 rim localization does not appear decreased as the authors claim.

      __Response: __After examining the data again, we agree with the reviewer’s assessment. So, we reworded the sentence as the following: “We also found that in ync13Δ cells, the Bgs4 intensity at the rim of the septum was much lower than in WT after ring constriction (Fig. 4B).”


      On page 13: "both Rga7 and Rng10 can mistarget Trs120 to mitochondria."

      Response: Thank you. The typo “mistargeting” has been corrected to “mistarget”.

      Minor figure edits 1. Consider inverting single-channel images to display fluorescence on a white background, which would improve visual clarity.

      Response: We appreciate the reviewer’s suggestion. However, we have chosen to retain the original display format with fluorescence shown in a black background, to be consistent with our (and some others’) previous publications. We believe this format preserves clarity while allowing easier comparison with the previously published works.

      The Figure 1 legend should describe the experimental setup rather than restating conclusions.

      Response: We thank the reviewer for this helpful suggestion. The Figure 1 legend has been revised to describe the experimental setup and imaging conditions rather than summarizing conclusions as the following:

      Fig. 1. Physical interactions among the key cytokinetic proteins in plasma membrane deposition and septum formation revealed by ectopic mistargeting to mitochondria by Tom20-GBP. __Arrowheads mark examples of colocalization at mitochondria. (A) Ync13 colocalizes with Rga7 and Rng10 at cell tips and the division site. (B-F) Tom20-GBP can ectopically mistarget Rga7/Rng10-mEGFP and their interacting partners tagged with tdTomato/RFP/mCherry to mitochondria. Tom20–GBP was used to recruit mEGFP-tagged Rga7 or Rng10 to mitochondria, and colocalization was assessed with tdTomato/RFP/mCherry-tagged candidate binding partners. Cells were grown at 25ºC in YE5S + 1.2 M sorbitol medium for ~36 to 48 h and then were washed with YE5S without sorbitol and grown in YE5S for 4 h before imaging. (B) Rga7/Rng10-Ync13. (C) Rga7/Rng10-Trs120. (D) Rga7/Rng10-Bgs4. (E) Rga7/Rng10-Ags1. (F)__ Rga7-Smi1. Bars, 5 μm.

      Reduce the number of arrows indicating co-localization in microscopy images; highlighting 1-2 representative examples is sufficient and less visually cluttered.

      Response: We appreciate the reviewer’s suggestion. We have revised the micrographs to reduce the number of arrowheads, highlighting several representative examples of co-localization per image. This improves clarity and reduces visual clutter while still guiding the reader to the key observations.

      Figure 3F, the scale bar is listed as 5 μm in the legend but it appears to my eye to be 2 μm.

      Response: We thank the reviewer for noticing this error. After rechecking the original imaging data, we have added a new 5 μm scale bar.

      The orientation of Bgs4/Smi1 should be inverted in the schematic within vesicles so that Smi1 is always on the cytoplasmic side.

      Response: We thank the reviewer for pointing out this error. The schematic has been corrected so that Bgs4 and Smi1 are oriented appropriately, with Smi1 consistently placed on the cytoplasmic side of vesicles because it does not have a transmembrane domain. The revised schematic is included in the updated Figure 8.

      6. Also in the schematic, Mid1 is not at the constricting CR and therefore needs to be removed.

      __Response: __Thank you for the suggestion. Mid1 has been removed from the model figure.

      Reviewer #1 (Significance (Required): From the data presented in the manuscript, it is proposed that Rga7 and Rng10 form a scaffold at the division site for delivery of exocytic vesicles marked by the TRAPPII complex but not the exocyst complex. Further, it is proposed that these vesicles deliver specifically the glucan synthases necessary for septation. Overall, this study builds on previous work from the Wu lab to clarify how the TRAPPII-decorated vesicles are specifically delivered to the cell division site, adding some new information about vesicle trafficking regulation during cytokinesis. It also provides new insight into the role of a F-BAR scaffold protein.

      This paper will be of interest to those studying cytokinesis and also those studying mechanisms of intracellular trafficking.

      Reviewer expertise: Cell division, signaling, membrane biology

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      This paper provides a comprehensive analysis of the roles of Rng10, Rga7, and Ync13 in cytokinesis using fission yeast as a model system. The authors demonstrate that Ync13/Rna7/Rng10 not only interact with each other but also associate with components of glucan synthases, which are essential for secondary septum formation but not for the primary septum. They further show that Ync13 is involved in exocytosis through its interaction with Sec1 and plays a role in membrane trafficking via interaction with the TRAPP-II complex. Collectively, their findings reveal a coordinated mechanism that ensures the timely formation of the secondary septum during cytokinesis, as deletion of these proteins disrupts septum formation and leads to cell lysis.

      The conclusions drawn in this paper are well-supported by the data, with a clear methodology and robust statistical analyses that enhance reproducibility. However, I have the following major and minor comments:

      Major Comments - 1) The authors propose that Ync13, Rng10, and Rga7 interact to form a protein complex, supported by their mislocalization studies. While these findings are suggestive, additional co-immunoprecipitation (co-IP) data specifically demonstrating a direct interaction between Ync13 and Rng10 would strengthen the claim.

      Response: We thank the reviewer for this suggestion. The direct interaction between Rga7 with Rng10 has been already established and published by our group [3, 5]. Here we found that Rga7 and Ync13 directly interact by in vitro binding assay (Figure 2, D and E). While our current data do not suggest a direct physical interaction between Ync13 and Rng10, our mislocalization results and other data do provide strong support for their functional association. In particular, ectopic tethering of Ync13 to mitochondria recruits Rng10 to the same sites and vice versa (Figures. 1B and S2A). Additionally, division-site tethering of Ync13 by Pmo25-GBP rescues both the growth and cell-lysis phenotype of rga7Δ (Figure 6), consistent with the idea that Ync13 functions downstream of Rga7-Rng10 because Rga7 localization depends on Rng10 (Figure 8). Furthermore, our AlphaFold3 modeling predicts that Rng10 binds the BAR domain of Rga7, whereas Ync13 binds the GAP domain of Rga7, suggesting that Rng10 and Ync13 are positioned within the same complex through Rga7 without direct interaction (Figure S5).

              The predicted lack of direct interaction between Ync13 and Rng10(751–1038) is supported by the experiment mentioned below to answer the minor question from the Reviewer 3. We tested the mistargeting of mECitrine-Rng10(751–1038) in *rga7Δ tom20-GBP* cells and found that Ync13-tdTomato could not be recruited to mitochondria (Figure S4H). This indicates that Ync13 cannot interact with Rng10(751–1038) independently of Rga7, supporting our proposed model that Rga7 interacts with Rng10 through the BAR domain while with Ync13 through the GAP domain. We have added these clarifications to the revised manuscript (Results and Discussion) to better contextualize the evidence for the Rga7–Rng10–Ync13 assembly.
      

      Liu Y, McDonald NA, Naegele SM, Gould KL, Wu J-Q. The F-BAR Domain of Rga7 Relies on a Cooperative Mechanism of Membrane Binding with a Partner Protein during Fission Yeast Cytokinesis. Cell Rep. 2019;26(10):2540-8.e4. doi: 10.1016/j.celrep.2019.01.112. PubMed PMID: 30840879; PubMed Central PMCID: PMCPMC6425953. Liu Y, Lee I-J, Sun M, Lower CA, Runge KW, Ma J, et al. Roles of the novel coiled-coil protein Rng10 in septum formation during fission yeast cytokinesis. Mol Biol Cell. 2016;27(16):2528-41. Epub 2016/07/08. doi: 10.1091/mbc.E16-03-0156. PubMed PMID: 27385337; PubMed Central PMCID: PMCPMC4985255.

      2) It remains unclear whether Ync13 directly interacts with components of the glucan synthase complex (Bgs4/Ags1), or if this association is mediated through other factors (Rng10, Rga7). Clarifying the nature of this interaction would significantly enhance the mechanistic insight.

      Response: We thank the reviewer for this thoughtful clarification. As pointed out by Reviewer 1 in major comment 1, the multipass integral membrane proteins Bgs4 and Ags1 are embedded within vesicle membranes and are more likely to associate indirectly with the Rga7–Rng10-Ync13 complex rather than being part of one unified protein complex, although Rga7 Co-IPs with Bgs4 and its binding partner Smi1 (Figure 1, A-C). We would like to make it clear that our model or manuscript does not claim direct interactions between the Ync13-Rga7-Rng10 module and the glucan synthase complexes but suggest that the module aids in selection of vesicle targeting sites on the plasma membrane. To clarify, we have revised the text to more clearly state that our co-IP and in vitro binding results demonstrate that Rga7 physically associates with Ync13 and Rng10, and that vesicle-associated proteins such as Bgs4 and Ags1 are likely recruited through indirect interactions.

      __Minor comments: __1) The manuscript refers to mass spectrometry-based interaction data, but the corresponding dataset is not included. Providing this would enhance transparency and reproducibility.

      __Response: __We apologize for the omission. The mass spectrometry data are now shown in Table S1.

      2) In Figure 2D, the MBP-6x pull-down lane shows a faint band around 76 kDa. The authors should clarify what this band represents and whether it has any relevance to the study.

      Response: We thank the reviewer for noticing this faint band. The weak ~76 kDa band in the MBP-6x pull-down lane is non-specific background binding of MBP and Rga7. We added a note in the figure legend to clarify this point.


      3) A quantification graph corresponding to the data in Figure 3G would aid in better interpreting the results and assessing their significance.

      Response: We thank the reviewer for this suggestion. We have now added two quantification graphs corresponding to Figure 3G, showing the measured Rng10 signal intensities across the division site. Statistical analysis shows the full width at half maximum (FWHM) is significantly different between WT and ync13D cells, and the figure legend and text have been updated accordingly in the revised manuscript.

      4) Figure 4D appears to be missing time legends, which are essential for interpreting the dynamics of the experiment.

      Response: We thank the reviewer for noticing this. We apology for making this confusing statement in figure legend. We would like to clarify that the full width at half maximum (FWHM) was calculated from line scans using single time point images from cells at the end of contractile-ring constriction. Those line scans were fitted with the Gaussian distribution to calculate the mean and standard deviation of FWHM. We have updated the figure legend to make it clearer in the revised manuscript.

      Reviewer #2 (Significance (Required)):

      Nature and Significance of the Advance This study provides a conceptual and mechanistic advance in understanding the spatial and temporal regulation of membrane trafficking during cytokinesis. It identifies a conserved module-Ync13-Rga7-Rng10-that directs the selective tethering and fusion of secretory vesicles at the division site, functioning independently of the exocyst complex. This finding challenges the prevailing model that the exocyst is universally required for vesicle tethering during cytokinesis. While previous work has underscored the roles of TRAPP-II and vesicle trafficking in septum formation (Wang et al., 2016; Arellano et al., 1997; Gerien and Wu, 2018), the precise mechanism targeting vesicles to the division site remained unclear. This study fills that gap by elucidating how Ync13 and Rga7 coordinate vesicle delivery and glucan synthase localization (Liu et al., 2016; Zhu et al., 2018), thereby extending our understanding of septum biogenesis and membrane remodeling beyond actomyosin ring dynamics.

      Relevant Audience: This work is relevant to: • Cell biologists investigating cytokinesis, membrane trafficking, or vesicle fusion. • Yeast geneticists interested in conserved cell division pathways. • Researchers focused on SNARE-mediated membrane dynamics and trafficking regulation. • Biomedical scientists exploring analogous processes in mammalian systems, particularly those studying cell division defects linked to disease. The findings have implications across both basic and translational research in cell biology and membrane dynamics.

      My Expertise: My research focuses on membrane fusion, specifically the SNARE-mediated fusion process. I study the spatio-temporal regulation of fusion events and the coordinated action of regulatory proteins in determining the structural and functional outcomes of membrane fusion. This background provides me with the framework to critically evaluate studies investigating cytokinesis and trafficking mechanisms at the molecular level.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Zhang et al. elucidate key roles of a conserved module the Ync13-Rga7-Rng10 complex in coordinating selective tethering, docking, and fusion of glucan synthases containing vesicles with the plasma membrane, a process crucial for cell wall synthesis and survival of fission yeast at division. Using methods including mistargeting proteins to mitochondria, co-immunoprecipitation, in vitro binding assays, genetic and cellular methods, electron microscopy, and live-cell confocal microscopy, the authors demonstrate that this module controls a vesicle targeting pathway mediated by the TRAPP-II complex and SM protein Sec1, which ensures glucan synthases Bgs4 and Ags1 are deposited at the division site in a spatiotemporal manner.

      Major comments: The authors report aberrant accumulation of Bgs4 and Ags1 in the center of the septum after actomyosin ring constriction in ync13del cells and detect no overall defects in Bgs1 distribution there (Figure 4). When similar experiments were analyzed in this paper ( https://pmc.ncbi.nlm.nih.gov/articles/PMC6249806/), Bgs1 distribution and level did change in cells lacking Ync13, although these phenotypes of Bgs1 appeared later that that of Bgs4. I wonder whether there could exist a second wave of Bgs1 arrival in ync13del cells at later time points after ring fully constricts. Could this late recruitment of Bgs1 depends on Rng7 and Rng10, since these protein complexes are enriched in the middle of septum of ync13del cells? Or as the authors mentioned in the Discussion, could Rho GTPase regulated by Rga7 GAP also play a role in Bgs1 accumulation or fusion with the septum in the above scenario, if no obvious accumulation of vesicles is observed in ync13del cells with electron microscopy? How does Bgs1 localize in ync13-19 rng10del double?

      Response: We thank the reviewer for this insightful observation. We repeated the experiment to observe the localization of Bgs1 in WT and ync13Δ cells. We confirmed our earlier observation reported in this manuscript that the localization of Bgs1 at rim of the division site and its distribution along the division plane in ync13Δ is not very different from WT, although its intensity is higher and has more variation in ync13Δ cells (Figure above) . As suggested by the reviewer, we did microscopy to test Bgs1 localization in ync13-19 temperature sensitive mutant, rng10Δ, ync13-19 rng10Δ, and WT (Fig. S7). While line scan curves for Bgs1 localization at the division site steep for ync13-19 rng10Δ double mutant, it has no statistically significant difference in FWHM as compared to control WT (Fig. S7). Please note that we used different confocal systems, cameras, and laser powers for Fig. 4, C and E (PerkinElmer UltraVIEW Vox CSUX1) and Fig. S7 (Nikon W1+SoRa), so the FWHMs are not comparable between the two figures.

      To test if there is any second wave of Bgs1 localization at the division site, we tracked the fluorescence intensity of Bgs1 throughout 2 h long movies and plotted the Bgs1 intensity profile at the division site over time. The data clearly show only one peak of Bgs1 and no later accumulation at the division site, although Bgs1 intensity has more variation in ync13-19 and ync13-19 rng10Δ cells and the intensity is higher in ync13-19 rng10Δ cells. All these experiments conclude that Ync13-Rga7-Rng10 module impacts the localization of glucan synthases essential for the secondary septum (Bgs4 and Ags1) but not the primary (Bgs1).

      Assessments of protein abundance by Western blotting (Figure 3C and 3D) can benefit from some quantifications.

      Response: We thank the reviewer for this suggestion. We have now quantified the Western blot bands in Figures 3C and 3D, which have been added as supplementary figures along with the Western blot for Rng10 (Fig. S6, A-C) in the revised figures.

      Minor comments: Based on a series of experiments in which mistargeting Rga7 and Rng10 truncations drive Ync13-tdTomato to mitochondria, the authors suggest that Rga7, Rng10, and Ync13 have multivalent interactions with each other. Previous study (https://pmc.ncbi.nlm.nih.gov/articles/PMC6425953/) demonstrated that in cells co-expressing Tom20-GBP mECitrine-Rng10(751-950), Rga7 was efficiently mistargeted to mitochondria. This raises a possibility that Ync13 mistargeted by mECitrine-Rng10(751-1038) could come from Rga7 that strongly associated with Rng10(751-1038) on mitochondria. I wonder whether the authors could compare some of their truncation mistargeting experiments in the original manuscript and the ones in which either Rga7 or Rng10 is deleted, e.g. Tom20-GBP mECitrine-Rng10(751-1038) experiments in rga7del cells, if cells are still viable in this genetic background.

      Response: We thank the reviewer for this insightful suggestion. We tested the mistargeting of mECitrine-Rng10(751–1038) in rga7Δ tom20-GBP cells and found that Ync13-tdTomato could not be recruited to mitochondria. This indicates that Ync13 cannot interact with Rng10 C-terminus independently of Rga7, supporting the Alphafold3 modeling and our proposed model that Rga7 interacts with Rng10 through the BAR domain while with Ync13 through the GAP domain. We have added the new data to the revised manuscript (Fig. S4H and associate text) and included a brief discussion highlighting that Rga7 is required for the Rng10–Ync13 interaction. We removed the mentioning of multivalent interactions in the manuscript to minimize confusion.

      It is interesting that rga7del rng10del double mutants can survive better in EMM or YES with sorbitol. I wonder this would allow the authors to test whether the interaction between Ync13 and Sec1 is modulated by the presence of Rga7 and Rng10 or even the entire vesicle? Does mistargeted Ync13 overexpressed using the 3nmt1 promoter is still capable of driving Sec1 to mitochondria in rga7del rng10del cells.

      Response: We thank the reviewer for this suggestion. While we did not succeed in constructing the pentamutant deleting both rga7 and rng10 and mislocalizing Ync13 to mitochondria, we were able to make a quadruple mutant deleting rng10 and mislocalizing Ync13 to mitochondria. We tested whether mistargeted Ync13 overexpressed using the 3nmt1 promoter can recruit Sec1 to mitochondria in rng10Δ cells. Our results show that overexpressed Ync13 is still able to drive Sec1 localization to mitochondria without Rng10 (Fig. S2G). This suggests that Rng10 (together with Rga7) primarily functions to recruit and position Ync13 at the division site rather than being strictly required for the interaction between Ync13 and Sec1. This is also consistent with our Pmo25-GBP mislocalization experiments where we found that rga7Δ 3nmt1-mECitrine-ync13 cells even under the repressed condition for the 3nmt1 promoter can partially rescue the lysis phenotype of rga7Δ cells (Figure 6).

      The endogenous level of Ync13 is not particular high. Is this low level of Ync13 crucial for its function? Does mildly elevated level of Ync1 promote vesicle fusion at the closing septum?

      Response: We thank the reviewer for this insightful question. To test if there is a correlation between Ync13 levels and vesicle fusion at the division site, we mildly overexpressed Ync13 from the 3nmt1 promoter in YE5S rich medium without additionally added thiamine to obtain cells with different Ync13 levels (the rich medium has some residual amount of thiamine, which partially represses the nmt1 promoter) and then tracked the Rab11 GTPase Ypt3 labeled vesicles. This resulted in increased levels of Ync13 as well as Ypt3 at the division site (Fig. S8B). We measured the Ync13 intensity at division site and counted the number of Ypt3 vesicles reaching the division site in 2-minute continuous movie at the middle focal plane. We observed that increasing Ync13 level promoted the tethering and accumulation of Ypt3 vesicles at the division site until it reached a plateau (Fig. S8B). Thus, the Ync13 level is important for vesicle fusion at the division site. Collectively, Ync13, working with Rga7 and Rng10, plays an important role in vesicle targeting and fusion on the plasma membrane at the division site during cytokinesis. This is consistent with our results that overexpressed Ync13 can mislocalize Sec1 to mitochondria in rng10Δ (Fig. S2G) and can rescue the rga7Δ (Fig. 6).

      Reviewer #3 (Significance (Required)):

      Most of conclusions are well supported by a combination of methods. Out of curiosity, I wonder how much of Bgs4 or Smi1 detected in Co-IP experiments exist in the vesicle-bound form. The authors propose a very interesting working model that addresses several key challenges in achieving vesicle targeting specificity when timely delivery of various enzymes to their respective spatial locations along the primary and secondary septum must be orchestrated. I think this manuscript will be of interest to a broad audience.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Zhang et al. elucidate key roles of a conserved module the Ync13-Rga7-Rng10 complex in coordinating selective tethering, docking, and fusion of glucan synthases containing vesicles with the plasma membrane, a process crucial for cell wall synthesis and survival of fission yeast at division. Using methods including mistargeting proteins to mitochondria, co-immunoprecipitation, in vitro binding assays, genetic and cellular methods, electron microscopy, and live-cell confocal microscopy, the authors demonstrate that this module controls a vesicle targeting pathway mediated by the TRAPP-II complex and SM protein Sec1, which ensures glucan synthases Bgs4 and Ags1 are deposited at the division site in a spatiotemporal manner.

      Major comments:

      The authors report aberrant accumulation of Bgs4 and Ags1 in the center of the septum after actomyosin ring constriction in ync13del cells and detect no overall defects in Bgs1 distribution there (Figure 4). When similar experiments were analyzed in this paper ( https://pmc.ncbi.nlm.nih.gov/articles/PMC6249806/), Bgs1 distribution and level did change in cells lacking Ync13, although these phenotypes of Bgs1 appeared later that that of Bgs4. I wonder whether there could exist a second wave of Bgs1 arrival in ync13del cells at later time points after ring fully constricts. Could this late recruitment of Bgs1 depends on Rng7 and Rng10, since these protein complexes are enriched in the middle of septum of ync13del cells? Or as the authors mentioned in the Discussion, could Rho GTPase regulated by Rga7 GAP also play a role in Bgs1 accumulation or fusion with the septum in the above scenario, if no obvious accumulation of vesicles is observed in ync13del cells with electron microscopy? How does Bgs1 localize in ync13-19 rng10del double?

      Assessments of protein abundance by Western blotting (Figure 3C and 3D) can benefit from some quantifications.

      Minor comments:

      Based on a series of experiments in which mistargeting Rga7 and Rng10 truncations drive Ync13-tdTomato to mitochondria, the authors suggest that Rga7, Rng10, and Ync13 have multivalent interactions with each other. Previous study (https://pmc.ncbi.nlm.nih.gov/articles/PMC6425953/) demonstrated that in cells co-expressing Tom20-GBP mECitrine-Rng10(751-950), Rga7 was efficiently mistargeted to mitochondria. This raises a possibility that Ync13 mistargeted by mECitrine-Rng10(751-1038) could come from Rga7 that strongly associated with Rng10(751-1038) on mitochondria. I wonder whether the authors could compare some of their truncation mistargeting experiments in the original manuscript and the ones in which either Rga7 or Rng10 is deleted, e.g. Tom20-GBP mECitrine-Rng10(751-1038) experiments in rga7del cells, if cells are still viable in this genetic background.

      It is interesting that rga7del rng10del double mutants can survive better in EMM or YES with sorbitol. I wonder this would allow the authors to test whether the interaction between Ync13 and Sec1 is modulated by the presence of Rga7 and Rng10 or even the entire vesicle? Does mistargeted Ync13 overexpressed using the 3nmt1 promoter is still capable of driving Sec1 to mitochondria in rga7del rng10del cells.

      The endogenous level of Ync13 is not particular high. Is this low level of Ync13 crucial for its function? Does mildly elevated level of Ync1 promote vesicle fusion at the closing septum?

      Significance

      Most of conclusions are well supported by a combination of methods. Out of curiosity, I wonder how much of Bgs4 or Smi1 detected in Co-IP experiments exist in the vesicle-bound form. The authors propose a very interesting working model that addresses several key challenges in achieving vesicle targeting specificity when timely delivery of various enzymes to their respective spatial locations along the primary and secondary septum must be orchestrated. I think this manuscript will be of interest to a broad audience.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This paper provides a comprehensive analysis of the roles of Rng10, Rga7, and Ync13 in cytokinesis using fission yeast as a model system. The authors demonstrate that Ync13/Rna7/Rng10 not only interact with each other but also associate with components of glucan synthases, which are essential for secondary septum formation but not for the primary septum. They further show that Ync13 is involved in exocytosis through its interaction with Sec1 and plays a role in membrane trafficking via interaction with the TRAPP-II complex. Collectively, their findings reveal a coordinated mechanism that ensures the timely formation of the secondary septum during cytokinesis, as deletion of these proteins disrupts septum formation and leads to cell lysis.

      The conclusions drawn in this paper are well-supported by the data, with a clear methodology and robust statistical analyses that enhance reproducibility. However, I have the following major and minor comments:

      Major Comments

      1. The authors propose that Ync13, Rng10, and Rga7 interact to form a protein complex, supported by their mislocalization studies. While these findings are suggestive, additional co-immunoprecipitation (co-IP) data specifically demonstrating a direct interaction between Ync13 and Rng10 would strengthen the claim.
      2. It remains unclear whether Ync13 directly interacts with components of the glucan synthase complex (Bgs4/Ags1), or if this association is mediated through other factors (Rng10, Rga7). Clarifying the nature of this interaction would significantly enhance the mechanistic insight.

      Minor comments:

      1. The manuscript refers to mass spectrometry-based interaction data, but the corresponding dataset is not included. Providing this would enhance transparency and reproducibility.
      2. In Figure 2D, the MBP-6x pull-down lane shows a faint band around 76 kDa. The authors should clarify what this band represents and whether it has any relevance to the study.
      3. A quantification graph corresponding to the data in Figure 3G would aid in better interpreting the results and assessing their significance.
      4. Figure 4D appears to be missing time legends, which are essential for interpreting the dynamics of the experiment.

      Significance

      Nature and Significance of the Advance

      This study provides a conceptual and mechanistic advance in understanding the spatial and temporal regulation of membrane trafficking during cytokinesis. It identifies a conserved module-Ync13-Rga7-Rng10-that directs the selective tethering and fusion of secretory vesicles at the division site, functioning independently of the exocyst complex. This finding challenges the prevailing model that the exocyst is universally required for vesicle tethering during cytokinesis. While previous work has underscored the roles of TRAPP-II and vesicle trafficking in septum formation (Wang et al., 2016; Arellano et al., 1997; Gerien and Wu, 2018), the precise mechanism targeting vesicles to the division site remained unclear. This study fills that gap by elucidating how Ync13 and Rga7 coordinate vesicle delivery and glucan synthase localization (Liu et al., 2016; Zhu et al., 2018), thereby extending our understanding of septum biogenesis and membrane remodeling beyond actomyosin ring dynamics.

      Relevant Audience:

      This work is relevant to:

      • Cell biologists investigating cytokinesis, membrane trafficking, or vesicle fusion.
      • Yeast geneticists interested in conserved cell division pathways.
      • Researchers focused on SNARE-mediated membrane dynamics and trafficking regulation.
      • Biomedical scientists exploring analogous processes in mammalian systems, particularly those studying cell division defects linked to disease. The findings have implications across both basic and translational research in cell biology and membrane dynamics.

      My Expertise:

      My research focuses on membrane fusion, specifically the SNARE-mediated fusion process. I study the spatio-temporal regulation of fusion events and the coordinated action of regulatory proteins in determining the structural and functional outcomes of membrane fusion. This background provides me with the framework to critically evaluate studies investigating cytokinesis and trafficking mechanisms at the molecular level.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper, the GFP-GBP system for mistargeting protein localization was used in fission yeast cells to discover new protein interactions involved in vesicular trafficking during cytokinesis. This approach uncovered a new association between the F-BAR protein Rga7 and its binding partner Rng10 with the Munc13 protein Ync13 at the cell division site. Additional associations were observed between Rga7-Rng10, Ync13 and the glucan synthases Ags1 and Bgs4, and the vesicle fusion protein Sec1. These interactions identified by the GFP-GBP system were further supported by co-immunoprecipitation experiments and by defining localization dependencies with live cell imaging in a variety of mutant strains. The imaging data are all of high quality and for the most part support the conclusions. However, in my opinion some of the interpretations are overstated, and the manuscript would benefit from providing additional mechanistic information. Major and minor recommendations are outlined below.

      Major suggestions

      1. The co-IP data are interpreted to suggest that all the above-mentioned proteins form a single "big complex." However, as noted in the manuscript and reflected in the model, the multipass integral membrane proteins Bgs4 and Ags1 are embedded in the vesicle membrane and likely only indirectly associate with the scaffold Rga7-Rng10 via Ync13, without forming a 'complex'. One would expect the entirety of these vesicle contents to co-IP if the model is correct. The first paragraph of page 11 should be revised to more clearly reflect this scenario and to align with the proposed model.
      2. Can Ync13 be artificially directed or tethered to the division site independently of Rga7-Rng10 (e.g., via Imp2)? If so, can this rescue the phenotypes of rga7Δ cells? This experiment could clarify whether Ync13 is the key functional effector of the Rga7-Rng10 complex.
      3. The authors should consider structural or computational modeling of the proposed Rga7-Rng10-Ync13 complex. Such analysis could offer insight into how these components interact and strengthen the proposed model.

      Minor text edits

      1. Define "SIN" in the discussion section for clarity.
      2. Figure S3, the protein schematics should start at residue "1" and not "0".
      3. Mass spectrometry data referenced in the text are not provided in the manuscript.
      4. In Figure 4A. The Ags1 rim localization does not appear decreased as the authors claim.
      5. On page 13: "both Rga7 and Rng10 can mistarget Trs120 to mitochondria."

      Minor figure edits

      1. Consider inverting single-channel images to display fluorescence on a white background, which would improve visual clarity.
      2. The Figure 1 legend should describe the experimental setup rather than restating conclusions.
      3. Reduce the number of arrows indicating co-localization in microscopy images; highlighting 1-2 representative examples is sufficient and less visually cluttered.
      4. Figure 3F, the scale bar is listed as 5 μm in the legend but it appears to my eye to be 2 μm.
      5. The orientation of Bgs4/Smi1 should be inverted in the schematic within vesicles so that Smi1 is always on the cytoplasmic side.
      6. Also in the schematic, Mid1 is not at the constricting CR and therefore needs to be removed.

      Significance

      From the data presented in the manuscript, it is proposed that Rga7 and Rng10 form a scaffold at the division site for delivery of exocytic vesicles marked by the TRAPPII complex but not the exocyst complex. Further, it is proposed that these vesicles deliver specifically the glucan synthases necessary for septation. Overall, this study builds on previous work from the Wu lab to clarify how the TRAPPII-decorated vesicles are specifically delivered to the cell division site, adding some new information about vesicle trafficking regulation during cytokinesis. It also provides new insight into the role of a F-BAR scaffold protein.

      This paper will be of interest to those studying cytokinesis and also those studying mechanisms of intracellular trafficking.

      Reviewer expertise: Cell division, signaling, membrane biology

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03094

      Corresponding author(s): Saurabh S. Kulkarni

      1. General Statements

      We thank the reviewers for their strong praise of the manuscript, highlighting its rigor, depth, and conceptual importance. They consistently described the study as a beautiful, fascinating, and conceptually strong piece of work that addresses a timely question in multiciliated cells. They also noted the high quality of the data, careful quantification, and the use of multiple genetic and pharmacological approaches, all of which improve the reproducibility and credibility of the findings. Importantly, they emphasized the novelty of discovering a direct mechanistic link between Piezo1-mediated mechanotransduction and Foxj1-driven transcriptional control of multiciliation, representing a significant breakthrough for both the cilia field and mechanobiology more broadly. Collectively, these strengths highlight the manuscript’s wide impact and make it highly suitable for publication in a high-impact journal.

      2. Description of the planned revisions

      Reviewer #1:


      There are two experiments that would significantly strengthen these claims.

      • First if their model is correct then even short term treatment with Yoda1 should induce the pathway and effect centriole numbers. While I appreciate the challenge of long term Yoda1 treatment its not clear to me why it would be needed if short term treatment is setting off the transcriptional cascade. Yoda is used throughout the paper to induce all the pathways but we don't know if it actually induces the phenotype. I think this should be addressed with either short term treatments or a dose response to find a dose that does not lead to skin pealing. It is hard to ignore this obvious deficiency.
      • Second, the model predicts that all of this is to regulate Foxj1 levels to regulate the subtle balance between cell size and centriole number. If this is correct, then the overexpression of Foxj1 should have a profound effect on centriole number in multiciliated cells. This is such an easy experiment that would validate many of the claims. RESPONSE:

      We recognize that the reviewer is asking us to test the sufficiency of the pathway with these comments: “If their model is correct, then they should be able to activate the pathway in one way or another to stimulate centriole number. This is a significant limitation to their overall model.” And “If this is correct, then the overexpression of Foxj1 should have a profound effect on centriole number in multiciliated cells.”

      To address reviewers’ suggestions, we will perform the following experiments.

      1. A brief exposure (15 and 30 mins) to Yoda1 and wait for 3 hours to examine changes in centriole amplification. This will avoid skin peeling from long-term exposure.
      2. A brief exposure to Yoda1 (15 mins) followed by a 30-minute wait period, and the cycle repeats a total of 4 times for a total of 3 hours to examine centriole amplification.
      3. The above two experiments will also be done in a constitutively active-Yap background to increase the probability that synergistic activation can lead to centriole amplification.
      4. Although Foxj1 is essential for multiciliogenesis, it is not sufficient to induce multiciliogenesis, as shown by multiple previous studies. Therefore, we do not expect overexpression of Foxj1 to have a profound effect on centriole number. While we will conduct the experiments because we truly want to address the suggestions and gain insight into the answers ourselves, we respectfully ask the Reviewer to consider the following responses to their concern.

      Yoda1 sufficiency: We agree that testing whether acute Yoda1 treatment can induce centriole amplification is an important question. We will conduct experiments with short-pulse and cyclic Yoda1 exposure, including in a constitutively active-YAP background (listed above), to address this possibility. However, several challenges complicate interpretation: (i) PIEZO1 adapts and desensitizes upon activation, (ii) transient signaling may be sufficient to cause secondary signaling but insufficient to drive stable transcriptional programs required for amplification, and (iii) centriole number is inherently variable, making modest effects difficult to resolve. However, we must recognize that failure to observe sufficiency under these conditions would not invalidate the model for two reasons: 1) absence of evidence is not evidence of absence, and thus, we may not have found the right experimental design. 2) PIEZO1–YAP is a necessary input but not sufficient on its own, as elaborated below. For both reasons, we are very careful about the interpretation of results in the manuscript, which shows that this pathway is necessary for centriole amplification using loss-of-function approaches.

      Foxj1 overexpression: Foxj1 is a well-established regulator essential for motile and multiciliogenesis across species (Xenopus, zebrafish, mouse). Loss of Foxj1 reduces cilia number in MCCs, but its activation alone does not have a profound effect on ciliogenesis/cilia number in MCCs. This is because Foxj1 is a part of a larger network essential for multiciliogenesis. This parallels the behavior of other transcriptional regulators, such as Myb, where loss of function impairs centriole amplification, but overexpression does not drive the formation of supernumerary centrioles. Both studies are seminal discoveries in the field of ciliogenesis, but they did not demonstrate the sufficiency of these molecules/pathways. Thus, our results, demonstrating that Foxj1 is necessary to induce tension-dependent centriole amplification, are significant, as the reviewer mentioned. The lack of Foxj1 sufficiency to induce centriole amplification is not a deficiency of the study, but rather evidence that Foxj1 is a part of a larger network essential for tension-dependent centriole amplification.

      Necessity versus sufficiency: We respectfully emphasize that sufficiency is not a prerequisite for demonstrating the significance of a pathway. Mechanochemical signaling is inherently complex, involving many mechanosensitive proteins and pathways. In our case, mechanical stretch increases centriole amplification, with PIEZO1–YAP signaling identified as a key mediator. However, we do not claim that PIEZO1–YAP alone is sufficient. Other pathways, including cadherin-mediated junctions, F-actin–myosin contractility, integrin–focal adhesion signaling, and nuclear mechanotransduction, likely contribute and may regulate unique downstream effectors that collectively promote centriole amplification. Therefore, PIEZO1–YAP should be regarded as one essential component within a larger network.


      __TIMELINE: __We will perform these additional proposed experiments. Since the first author, a postdoctoral researcher on this manuscript, has started a new job and will be coming in on weekends to complete the experiments, we estimate it will take approximately 2-3 months to finish them.


      Reviewer #2:

      1. Considering the Yap-piezo mechanism of action, the authors' logic for the selection of myb, foxj, plk4 and ccno as transcriptional targets is clear, but the HCR-derived signal and the differences seen in the yap morphants are not very strong, notwithstanding the statistical significance. There appear to be distinct subgroups within the treated populations (in Figure S6B, although these data seem quite different in Fig. 7H, so a comment on the technical differences might be helpful), so that the extent to which Yap1 regulates (Myb-)Foxj1 expression in MCCs is not clearly demonstrated by this experiment. Related to this point, it is unclear why 20-25% of the yap1/ piezo1 MO-treated embryos do not show a decline in FOXj1 in Fig. 6, given the qualitative nature of the scoring. Assuming the KD penetrance would vary on a cell-to-cell basis, rather than an embryo-to-embryo basis, this may suggest that there are additional relevant targets (some of which are discussed by the authors). Single-cell analysis might be a way to address this; however, this is not a trivial experiment, it might be sufficient to include a caveat in the text. Furthermore, the conclusion that Foxj1 regulates centriole amplification in a tension-dependent manner is well-supported by the data.

      RESPONSE: We appreciate the reviewer’s thoughtful observation. Differences in the expression of Foxj1 from experiment to experiment are possible due to a combination of factors, including heterogeneity in MCC development across embryos, slightly different embryonic stages, differences in embryo quality between fertilizations, and variability in morpholino delivery and knockdown penetrance, which can occur both across embryos and on a cell-to-cell basis within an embryo. We also note that technical aspects of HCR RNA-FISH, such as proteinase K treatment and washing steps, can affect signal intensity, potentially contributing to the appearance of distinct subgroups within treated populations.

      We agree that single-cell analysis would be a powerful way to dissect these differences, but as the reviewer notes, this is not a trivial experiment and is beyond the scope of the present study. We have therefore added clarifications in the text and discussion to acknowledge these sources of variability and to highlight the possibility of parallel pathways regulating foxj1 expression.

      ********************************************

      Controls for the knockdowns by the various MOs should be provided.

      RESPONSE: We appreciate the reviewer’s comment. The piezo1 MO has been previously established in Kulkarni et al. (2021). Additionally, the current manuscript includes MO control experiments for both erk2 and yap1, through KD at the 1-cell stage using the MO oligonucleotide, followed by mosaic-rescue with the respective WT RNA constructs (mCherry-ERK2 and yap1-GFP) and a nuclear tracer molecule such as H2B-RFP (Fig. 5, E-H, Fig. S5, C&D, Fig. 3, D-F). The mosaic-rescue is a robust experiment that provides an internal control within the same embryo, thereby avoiding differences that may arise due to embryo-to-embryo variability, embryo quality, or differences in fertilization batches. This approach also serves as a valuable tool for detecting cell-autonomous effects, providing a clear readout against uninjected neighboring cells, as the injected cells are labeled with a tracer. We will perform a similar mosaic-rescue experiment for the foxj1 MO.

      TIMELINE: We will conduct mosaic-rescue experiments for the foxj1 MO. We will need 1 month to complete the experiment.

      ********************************************

      __Minor comments:

      __

      Autocorrection of ERK1/2 or MEK1/2 pathways to 1/2 should be avoided. – We are unclear on this comment. Can reviewer please clarify what they mean.


      Reviewer # 3

      Major concerns

      1- The presented data do not yet establish a specific, direct pathway linking mechanotransduction to centriole number, because the molecular players tested (PIEZO1, Ca²⁺, PKC, ERK, YAP, Foxj1) are highly pleiotropic. As such, the observed centriole number phenotypes, and some of the major conclusions, could be indirect. It is therefore critical to test the specificity and causality of the proposed pathway. This could be done with the authors' own strategies and/or with the following potential approaches:

      • Genetic dependency and sufficiency tests: It could be shown that Yoda1 has no effect in PIEZO1 loss-of-function MCCs, and that wild-type PIEZO1, but not conductance-ad PIEZO1 pore mutants restores Yoda1 responsiveness across centriole number, pERK, and YAP readouts. For example, PIEZO1 C terminus was shown to govern Ca²⁺ influx and ERK1/2 activation. Comparing full length PIEZO1 with a C terminal deletion in MCC restricted rescue; loss of rescue of centriole amplification and ERK/YAP activation with the C terminal deletion can provide a genetics anchored specificity test beyond broad inhibitors.

      RESPONSE:

      • To address the reviewer’s concern, we will test whether Yoda1 affects ERK and Yap activation when Piezo1 is depleted. We appreciate the reviewer’s thoughtful suggestion to employ genetic rescue experiments with Piezo1 mutants. Unfortunately, these are not technically feasible in Xenopus, as the Piezo1 coding sequence is exceptionally large (~7.5 kb)____, and repeated attempts by our group to generate and express stable, translatable transcripts have been unsuccessful. To address genetic dependency and specificity despite these technical barriers, we have employed a combination of orthogonal strategies that together provide strong genetic and mechanistic evidence:

      • Mosaic loss-of-function experiments (Fig. 1) demonstrate that Piezo1 regulates centriole number in a cell-autonomous manner, ruling out global epithelial or indirect tissue-wide effects.

      • Pharmacological activation/inhibition with Piezo1-specific agonist (Yoda1) and inhibitors (GSMTx4, gadolinium) produced consistent phenotypes, including activation of downstream ERK and YAP readouts. Notably, Yoda1 is a Piezo-specific agonist, not a broad pharmacological agent.
      • Downstream pathway dissection (calcium chelation, PKC inhibition, ERK2 depletion, and YAP1 knockdown/rescue) consistently converges on the same phenotypes, reduced centriole amplification and altered Foxj1 expression, providing multiple independent lines of evidence that the Piezo1–Ca²⁺–PKC–ERK–YAP axis specifically controls centriole number.
      • Positive feedback regulation of Piezo1 expression by YAP/Foxj1 (Fig. 7) further strengthens the argument for a pathway-specific role rather than pleiotropic, indirect effects. Taken together, while full-length Piezo1 rescue experiments are technically not possible in Xenopus due to gene size constraints, our data employ state-of-the-art genetic, pharmacological, and orthogonal functional assays to rigorously test pathway specificity. These complementary approaches provide compelling evidence for the causal role of Piezo1-mediated mechanotransduction in centriole number control in MCCs.

      • Downstream bypass/rescue experiments: In PIEZO1 loss-of-function or BAPTA conditions, can enforcing MEK/ERK activation or YAP rescue centriole number defect? Conversely, can MEK inhibitors block Yoda1-induced effects.

      RESPONSE: We appreciate the reviewer’s insightful questions.

      • We will express CA Yap in the Piezo1 KD background to assess if we can rescue centriole number. We also note that the converse experiment has already been performed in our study: 1) PKC inhibition abolishes Yoda1-induced ERK phosphorylation and nuclear localization (Fig. 2), 2) both MEK inhibition and ERK2 depletion block Yoda1-induced Yap activation and nuclear entry (Figs. 4, S2). Thus, we have directly demonstrated that MEK inhibition prevents Yoda1-induced effects, satisfying this aspect of the reviewer’s concern.

      ********************************************

      2- Image quantification and analysis must be described in greater detail in the Methods section, as they are central to the major conclusions of the manuscript. For example, the authors should explain how nuclear, cytoplasmic, and centriole segmentation were performed, and how relative protein levels in the nucleus versus the cytoplasm (e.g., YAP, volume- or area-based) were quantified. Specifically, the thresholds and segmentation criteria applied to different cellular structures under various conditions, as well as the use of Imaris and other software, should be clearly detailed.

      RESPONSE: We will describe the methods in greater detail.

      ********************************************

      3- PIEZO1 mRNA was shown to incrase in a Foxj1 linked feedback loop. Does this increase translate into an increase in total protein levels?

      RESPONSE: If the reviewer is referring to Figure 7B, that is the Piezo1 antibody, so yes, the Piezo1 protein levels have increased.

      If the reviewer is referring to Figure 7C and D, we show that loss of Foxj1 leads to a reduction in Piezo1 mRNA expression.

      ********************************************

      4- Is the proposed signaling cascade active in mammalian multiciliated cells (e.g., airway epithelium). If possible, testing this by using one of the major players of the pathway as a readout such as as ERK phosphorylation, YAP nuclear localization in mammalian MCCs will reveal whether regulation of centriole number through this pathway is conserved and would strengthen the generality.


      RESPONSE: We agree with the reviewer that testing conservation of this pathway in mammalian MCCs is of great interest. Indeed, another group is currently investigating the role of Yap in the mammalian airway epithelium; in their temporally controlled Yap knockout model (the global Yap KO being embryonic lethal), they observed that Yap loss led to a reduction in centriole number. To avoid overlap and direct competition with this ongoing work, we chose to focus our efforts on Xenopus.

      Importantly, Xenopus has become a widely recognized and powerful system for MCC biology, enabling mechanistic dissection of centriole amplification and ciliogenesis. Several key discoveries in the field, including the identification of MCIDAS as a master regulator of MCC fate, were first made in Xenopus before being validated in mammals. Similarly, our study provides a mechanistic framework in Xenopus that can inform and guide ongoing studies in the mammalian airway.

      ********************************************

      5- Throughout the results section, there are multiple times where authors raised specific hypothesis about their data (e.g. foxj1 regulation of number control, apical actin/YAP). However, they have not tested them. These hypothesis are very exciting and if possible, testing experimentally, would strengthen the conclusions associated with them.

      RESPONSE: We are not sure what the reviewer means here by “authors raised specific hypothesis about their data (e.g., foxj1 regulation of number control, apical actin/YAP). However, they have not tested them”,

      BECAUSE:

      • Foxj1 regulation of centriole number: We demonstrate a clear reduction in centriole number upon Foxj1 depletion, and importantly, we extend this finding by showing that the reduction is tension-dependent (Fig. 6). We will perform a rescue assay to demonstrate the specificity.
      • Foxj1 and YAP: We never claimed that Foxj1 regulates YAP expression, and this is not part of our proposed model. Instead, our data show that Piezo1–ERK–YAP signaling regulates Foxj1
      • Foxj1 and apical actin: Foxj1 regulation of apical F-actin has already been established in prior work, and in our study, we clearly observe reduced apical actin intensity in Foxj1-depleted MCCs (Fig. 6). To further strengthen this conclusion, we will provide a quantitative analysis of apical actin intensity in Foxj1 morphants. ********************************************

      __TIMELINE: __We will perform these additional proposed experiments. Since the first author, a postdoc on this manuscript, has started a new job and will be coming in on weekends to finish the experiments, we estimate it will take approximately 2-3 months to complete them.

      Minor comments

      MCC vs non MCC identification (Fig. 1): Clarify how non MCCs were distinguished from MCCs (e.g. markers/criteria). – Can the reviewer please clarify which panel or panels? Or provide more specific text that needs to be changed.

      Add the Kintner group reference linking motile cilia number and centriole number in Xenopus MCCs.– Can the reviewer clarify where and which reference? Thank you.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer 2

      Major comments:

      1. It should be clarified whether the immunoblots and the related quantitations in Figs. 2 and S2 are all from separate blots/ exposures. If so, they are not useful as controls, and these blots should be repeated with the relevant samples analyzed in parallel. Size markers and labels should be included (2B, 2G; S2B and S2G). An increase in total ERK would alter the interpretation of the increase in nuclear pERK in the IF experiments. RESPONSE: We thank the reviewer for raising this important point regarding clarification of the immunoblots. All experimental groups were analyzed in parallel with their corresponding controls. Because the primary antibodies for pERK and ERK were both raised in rabbit, we optimized our workflow to prevent protein loss during stripping and to ensure accurate visualization. Specifically, lysates from each experimental group were loaded in duplicate on the same gel, separated by a molecular weight ladder that served as a reference point. After transfer, the blot was cut along the ladder, and the two halves were processed in parallel: one probed with anti-pERK and the other with anti-ERK. This strategy ensured that all samples from a single experiment (e.g., Control and Yoda1-treated groups) were analyzed under identical conditions, with staining and imaging performed together at the same exposure. To enhance clarity, we have provided this data as __uncut, full-length __as Supplemental Figure 7 (Figure S7) in the revised revision.

      ********************************************

      Minor comments:

      1. Reference list should be checked for completeness; some citations lack journal/ volume/ page/ year details. – We have corrected the references.
      2. An 'overexposed' version of the image selected for centrioles in Figure 5F might be included with the Chibby-BFP at the same level as in the other figures. At present, the Yap KD cell in the image appears to have normal centrioles; this is potentially confusing, even though the authors clearly explain the matter in the text. – __We have added a new panel to Fig. 5F to avoid confusion.

      __ 3. It might be clearer to present injected/ uninjected in the same orientation in Fig. 6A and B. – __Unfortunately, that is not possible because the injected and uninjected sides are left and right, and they cannot be in the same orientation.

      __ 4. Figure 7B lacks the schematic described in the figure legend. – We have removed the Schematic sentence from the figure legend. That was an error on our side. Thank you for catching it.


      Reviewer 3


      1. Abstract: "how MCCs regulate centriole/cilia numbers remains a major knowledge gap" overstates the field; please soften to reflect recent advances (mechanics/apical area scaling; PIEZO1 implication). – We changed the text to “incompletely understood”.
      2. GsMTx4 rationale: State that GsMTx4 is a spider venom peptide that inhibits cationic mechanosensitive channels (including PIEZO1) and justify its use alongside Yoda1.– GsMTx4 was used in the previous manuscript, and its use was justified there. Here, we are only comparing the results. However, we have added a sentence describing what GSMTx4 is. We have also included a sentence explaining the use of Yoda1. “GsMTx4, a spider venom peptide used in our previous study, inhibits cationic mechanosensitive channels, including Piezo1.”

      “For this experiment, we used the Piezo1 channel-specific chemical agonist, Yoda1, to increase the sensitivity of Piezo1 and upregulate calcium entry into cells”

      Timeline statement: "Centriole amplification to migration and apical docking takes ~4-5 h (personal observation)" is not appropriate; either cite time lapse literature or include your own time lapse data.– We have added a reference that showed imaging for 2 hours, but it was not enough to capture the entire process from intercalation to maturation, so we also kept “personal observation” still in the manuscript. We are unaware of any study that has done time-lapse imaging for 4 hours to capture the entire process of centriole amplification.

      Redundancy: The description of Yoda1 as a channel specific agonist is repeated; keep only once.- Removed

      "WT yap1 GFP construct previously used by Dr. Lance Davidson ..." should move construct description to Methods and keep only the citation in Results.– We moved it to Methods.

      "(Unpublished data; Dr. Mahjoub)" should be removed unless data are shown.- Removed

      Replace "as shown previously in our eLife paper" with "as we previously showed or shown previously (Kulkarni et al., 2021)".– We have made the change.

      The two hypotheses for how Foxj1 could regulate number under tension (actin remodeling vs. transcriptional control of amplification genes) belong in the Discussion unless tested. Moreover, the part on the discussion on yap sequestration by apical actin and the two possibilities presented also should go do discussion. – We have moved both to the discussion section.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reviewer 3

      1- The hypothesis about the centriole pool of Piezo as the mechnosensor for centriole number regulation is very exciting and novel. Can localization controlled variants be used to test whether a centriole associated pool directly senses tension for number control (for example, centrosome targeted PIEZO1 via a PACT tag). Alternatively, broad cellular Ca sensors (GcaMP) or centrosome proximal Ca sensors (e.g., PACT GCaMP) can be used detect local calcium microdomains during tethering or Yoda1 treatment.

      RESPONSE: We appreciate the reviewer's curiosity and excitement; however, these experiments will not alter the conclusion of this paper and will be part of the next study, which aims to delve deeper into how different pools of Piezo1 at centrioles versus cell junctions function in MCCs. To that point, we had thought about these experiments. As mentioned earlier, the Piezo1 coding sequence is exceptionally large (~7.5 kb)____, and repeated attempts by our group to generate and express stable, translatable transcripts have been unsuccessful. Thus, the idea of centrosome-targeted PIEZO1 via a PACT is very exciting; however, it is not technically feasible. Beyond size, PIEZO1 is a trimeric, large plasma-membrane mechanosensitive channel that requires proper ER processing and bilayer incorporation. PACT localizes cargo to the centriole/pericentriolar material, not a membrane compartment; thus, a PACT-anchored PIEZO1 would be membrane-mismatched and almost certainly nonfunctional even if expressed/

      Second, Centrosome-proximal GCaMP (PACT-GCaMP) would show correlation, not causation. This experiment does not address the question “centriole pool of Piezo as the mechanosensor for centriole number regulation”. It will only show if the Ca2+ influx is happening at the basal bodies, but not whether and how that Ca2+ is essential for centriole amplification. For this purpose, we will need to find a way to block Ca2+ influx specifically at basal bodies, rather than junctions, which will require extensive controls.

      We do not claim that any specific Piezo1 or Ca2+ pool is critical for controlling centriole number and thus the suggested experiment would not alter the manuscript's conclusions. We therefore view the above as exciting future directions rather than prerequisites.

      ********************************************

      2- Because the proposed pathway is tension-sensing and YAP pathway is tightly linked to the actin cytoskeleton, the role of actin cysoskeleton in the proposed pathway should be tested directly. The authors mention different hypothesis around actin but has not tested them in the manuscript. For example, actin-depedent sequestration of Yap at the apical surface is intriguing. Does actin polymerization induced by drugs release Yap from the apical surface?

      RESPONSE: We would like to thank the reviewer for their suggestion. As per the reviewers' suggestion, we have moved this section to discussion, stating that “In the future, we plan to address this question by examining how Yap is sequestered by apical actin.”.

      However, we appreciate the reviewer’s enthusiasm and would like to share some experiments we are thinking/planning of to test the hypothesis.

      We plan to examine if the actin polymerization or contractility is responsible for Yap sequestration/release from the apical surface with the following experiments: 1) if the Yap is displaced by Jasplakinolide treatment, which stabilizes filamentous actin, 2) use of ROCK inhibitor to decrease contractility in the absence or presence of Yoda1, 3) Use genetic constructs such as Shroom3 to increase ROCK-mediated contractility to observe changes in Yap localization and dynamics.

      Although these experiments are interesting, they do not alter the conclusion of the current manuscript, and they represent future directions for our research.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript investigates how mechanical tension is transduced into centriole amplification in Xenopus multiciliated cells (MCCs). Building on prior work that centriole number scales with MCC apical area and that this scaling depends on PIEZO1, the study proposes that MCCs repurpose a canonical mechanochemical axis-PIEZO1 → Ca²⁺/PKC → ERK1/2 → YAP → Foxj1-to regulate centriole number rather than mitosis. The authors use tethered vs. untetheredanimal cap explants to modulate tissue tension, combine pharmacologic perturbations with genetic loss of function and rescue, quantititative image analysis and present a model in which tension gated PIEZO1 activates ERK/YAP, influences Foxj1, and tunes centriole number in MCCs.

      The manuscript tackles an important and timely problem with clear disease relevance. It major advance is their presented model that posits that post mitotic MCCs repurpose a canonical mechanotransduction module to regulate organelle number rather than proliferation. It is a conceptually strong study addressing an important problem with a clean mechanical paradigm. However, to support the central claim that centriole number control is a specific, direct consequence of the PIEZO1-Ca²⁺-ERK/YAP pathway within MCCs, the revision should establish specificity and causality and provide experimental data for some of the major conclusions as detailed below. Addressing these points are critical to support the mechanistic conclusions and impact.

      Major concerns:

      1) The presented data do not yet establish a specific, direct pathway linking mechanotransduction to centriole number, because the molecular players tested (PIEZO1, Ca²⁺, PKC, ERK, YAP, Foxj1) are highly pleiotropic. As such, the observed centriole number phenotypes, and some of the major conclusions, could be indirect. It is therefore critical to test the specificity and causality of the proposed pathway. This could be done with the authors' own strategies and/or with the following potential approaches:

      • Genetic dependency and sufficiency tests: It could be shown that Yoda1 has no effect in PIEZO1 loss-of-function MCCs, and that wild-type PIEZO1, but not conductance-dead PIEZO1 pore mutants restores Yoda1 responsiveness across centriole number, pERK, and YAP readouts. For example, PIEZO1 C terminus was shown to govern Ca²⁺ influx and ERK1/2 activation. Comparing full length PIEZO1 with a C terminal deletion in MCC restricted rescue; loss of rescue of centriole amplification and ERK/YAP activation with the C terminal deletion can provide a genetics anchored specificity test beyond broad inhibitors.

      • Downstream bypass/rescue experiments: In PIEZO1 loss-of-function or BAPTA conditions, can enforcing MEK/ERK activation or YAP rescue centriole number defect? Conversely, can MEK inhibitors block Yoda1-induced effects.

      2) The hypothesis about the centriole pool of Piezo as the mechnosensor for centriole number regulation is very exciting and novel. Can localization controlled variants be used to test whether a centriole associated pool directly senses tension for number control (for example, centrosome targeted PIEZO1 via a PACT tag). Alternatively, broad cellular Ca sensors (GcaMP) or centrosome proximal Ca sensors (e.g., PACT GCaMP) can be used detect local calcium microdomains during tethering or Yoda1 treatment.

      3) Because the proposed pathway is tension-sensing and YAP pathway is tightly linked to the actin cytoskeleton, the role of actin cysoskeleton in the proposed pathway should be tested directly. The authors mention different hypothesis around actin but has not tested them in the manuscript. For example, actin-depedent sequestration of Yap at the apical surface is intriguing. Does actin polymerization induced by drugs release Yap from the apical surface?

      4) Image quantification and analysis must be described in greater detail in the Methods section, as they are central to the major conclusions of the manuscript. For example, the authors should explain how nuclear, cytoplasmic, and centriole segmentation were performed, and how relative protein levels in the nucleus versus the cytoplasm (e.g., YAP, volume- or area-based) were quantified. Specifically, the thresholds and segmentation criteria applied to different cellular structures under various conditions, as well as the use of Imaris and other software, should be clearly detailed.

      5) PIEZO1 mRNA was shown to incrase in a Foxj1 linked feedback loop. Does this increase translate into an increase in total protein levels?

      6) Is the proposed signaling cascade active in mammalian multiciliated cells (e.g., airway epithelium). If possible, testing this by using one of the major players of the pathway as a readout such as as ERK phosphorylation, YAP nuclear localization in mammalian MCCs will reveal whether regulation of centriole number through this pathway is conserved and would strengthen the generality.

      7) Throughout the results section, there are multiple times where authors raised specific hypothesis about their data (e.g. foxj1 regulation of number control, apical actin/YAP). However, they have not tested them. These hypothesis are very exciting and if possible, testing experimentally, would strengthen the conclusions associated with them.

      Minor concerns:

      1) Abstract: "how MCCs regulate centriole/cilia numbers remains a major knowledge gap" overstates the field; please soften to reflect recent advances (mechanics/apical area scaling; PIEZO1 implication).

      2) MCC vs non MCC identification (Fig. 1): Clarify how non MCCs were distinguished from MCCs (e.g. markers/criteria).

      3) GsMTx4 rationale: State that GsMTx4 is a spider venom peptide that inhibits cationic mechanosensitive channels (including PIEZO1) and justify its use alongside Yoda1.

      4) Timeline statement: "Centriole amplification to migration and apical docking takes ~4-5 h (personal observation)" is not appropriate; either cite time lapse literature or include your own time lapse data.

      5) Redundancy: The description of Yoda1 as a channel specific agonist is repeated; keep only once.

      6) "WT yap1 GFP construct previously used by Dr. Lance Davidson ..." should move construct description to Methods and keep only the citation in Results.

      7) "(Unpublished data; Dr. Mahjoub)" should be removed unless data are shown.

      8) Add the Kintner group reference linking motile cilia number and centriole number in Xenopus MCCs.

      9) Replace "as shown previously in our eLife paper" with "as we previously showed or shown previously (Kulkarni et al., 2021)".

      10) The two hypotheses for how Foxj1 could regulate number under tension (actin remodeling vs. transcriptional control of amplification genes) belong in the Discussion unless tested. Moreover, the part on the discussion on yap sequestration by apical actin and the two possibilities presented also should go do discussion.

      Significance

      This manuscirpt dissects Piezo1-mediated mechanotransduction to regulation of centriole number in Xenopus multiciliated cells (MCCs) via Ca²⁺, ERK/YAP, and Foxj1. While Piezo1 and its downstream effectors have been implicated broadly in mechanosensation, cellular tension responses, and transcriptional regulation, their specific role in centriole nubmer control in MCCs has been unknown By integrating pharmacological manipulation, genetic perturbation, and functional readouts, the authors demonstrate that this pathway directly influences centriole number.

      The findings extend published knowledge in two main ways:

      (1) they connect a mechanosensitive ion channel to the transcriptional program governing Foxj1 expression and multiciliation, a mechanistic link not previously defined, and

      (2) they highlight the pleiotropic yet coordinated nature of Piezo1 signaling in organelle biogenesis. This work will be of broad interest to cell and developmental biologists studying ciliogenesis, epithelial differentiation, and mechanotransduction, as well as to biomedical researchers interested in multicilaited cells and ciliopathies. By situating a well-studied mechanosensor within the context of MCC biology, the study opens new directions for understanding how tissue-level forces shape organelle number control and function.

      At the same time, the impact of the study is weakened by concerns regarding the causability and specificity of the pathway, since the signaling components examined are highly pleiotropic and it remains challenging to separate direct effects on centriole number from broader cellular consequences. The causal relationships among Piezo1 activity, downstream signaling, and Foxj1 expression require stronger substantiation, and the extent to which this pathway operates in mammalian multiciliated cells remains an open question. Addressing these limitations would strengthen the robustness, generality, and translational relevance of the conclusions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Narayanan, Kulkami and colleagues here examine how the Piezo-Erk-Yap pathway is involved in centriole numerical control in multiciliated cells (MCCs). Using reverse genetic and pharmacological methods in Xenopus embryos, they show that Piezo-mediated ERK signalling through to Yap regulates tension-sensitive centriole number, through a mechanism that involves Foxj1, very likely acting as a transcription factor. The data are carefully controlled, robustly analysed and well presented. Statistical analyses are notably thorough.

      Main points:

      1. It should be clarified whether the immunoblots and the related quantitations in Figs. 2 and S2 are all from separate blots/ exposures. If so, they are not useful as controls, and these blots should be repeated with the relevant samples analysed in parallel. Size markers and labels should be included (2B, 2G; S2B and S2G). An increase in total ERK would alter the interpretation of the increase in nuclear pERK in the IF experiments.

      2. Considering the Yap-piezo mechanism of action, the authors' logic for the selection of myb, foxj, plk4 and ccno as transcriptional targets is clear, but the HCR-derived signal and the differences seen in the yap morphants are not very strong, notwithstanding the statistical significance. There appear to be distinct subgroups within the treated populations (in Figure S6B, although these data seem quite different in Fig. 7H, so a comment on the technical differences might be helpful), so that the extent to which Yap1 regulates (Myb-)Foxj1 expression in MCCs is not clearly demonstrated by this experiment. Related to this point, it is unclear why 20-25% of the yap1/ piezo1 MO -treated embryos do not show a decline in FOXj1 in Fig. 6, given the qualitative nature of the scoring. Assuming the KD penetrance would vary on a cell-to-cell basis, rather than an embryo-to-embryo basis, this may suggest that there are additional relevant targets (some of which are discussed by the authors). Single-cell analysis might be a way to address this; however, this is not a trivial experiment, it might be sufficient to include a caveat in the text. Furthermore, the conclusion that Foxj1 regulates centriole amplification in a tension-dependent manner is well-supported by the data.

      3. Controls for the knockdowns by the various MOs should be provided.

      Minor points:

      1. Autocorrection of ERK1/2 or MEK1/2 pathways to 1/2 should be avoided.

      2. Reference list should be checked for completeness; some citations lack journal/ volume/ page/ year details.

      3. An 'overexposed' version of the image selected for centrioles in Figure 5F might be included with the Chibby-BFP at the same level as in the other figures. At present, the Yap KD cell in the image appears to have the normal centrioles; this is potentially confusing, even though the authors clearly explain matters in the text.

      4. It might be clearer to present injected/ uninjected in the same orientation in Fig. 6A and B.

      5. Figure 7B lacks the schematic described in the figure legend.

      Significance

      This study presents novel insight into the developmentally important process of ciliogenesis in multiciliated cells that will be of specific interest to the fields of cilium biology and mechanobiology, with additional general interest in calcium signalling and cell biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript from Narayanan addresses the fascinating question of how Multiciliated cells regulate centriole number to scale with cell size. They have generated a tremendous amount of high quality data that supports a model in which mechanosensitive signaling via piezo1 leads to an increase in intracellular Ca++ that leads to an activation of the Erk pathway which in turn activates the Yap pathway that in turn regulates FoxJ1 levels which they propose regulates centriole number. This is complicated but they have strong quantifiable data that supports most of the claims. I think this is a beautiful study that adds significantly to the field. There is a lot of evidence that disrupting these pathways has a negative consequence on centriole number. What is lacking is a positive connection showing a role of these processes in fine tuning the centriole number as the title suggests. Several key experiments would significantly strengthen their claims.

      • The data is presented in a way that proposes that the ultimate role of these pathways is to regulate Foxj1 levels to fine tune centriole number based on the level of tension. There are two experiments that would significantly strengthen these claims. First if their model is correct then even short term treatment with Yoda1 should induce the pathway and effect centriole numbers. While I appreciate the challenge of long term Yoda1 treatment its not clear to me why it would be needed if short term treatment is setting off the transcriptional cascade. Yoda is used throughout the paper to induce all the pathways but we don't know if it actually induces the phenotype. I think this should be addressed with either short term treatments or a dose response to find a dose that does not lead to skin pealing. It is hard to ignore this obvious deficiency. Second, the model predicts that all of this is to regulate Foxj1 levels to regulate the subtle balance between cell size and centriole number. If this is correct, then the overexpression of Foxj1 should have a profound effect on centriole number in multiciliated cells. This is such an easy experiment that would validate many of the claims.

      Minor issues:

      • The authors attempt to measure an effect of plk4 and ccno in the Yap MO experiment. However, the fact that they could not be scored means the experiment wasn't really performed. I think it is more appropriate to leave out rather than risk giving the impression that these genes were unaffected.

      • The authors indicate that the foxj1 result suggests two alternatives, one that foxj1 regulates actin (pan 2007) and the other that it is a transcription factor. I think the evidence for foxj1 being a transcription factor is extremely well established and while it is possible for it to have an additional unrelated role my interpretation of the Pan paper is that the failed apical docking leads to disrupted actin which is also well established. I don't think there is a lot of evidence for foxj1 being anything other than a TF.

      Significance

      • This is a really beautiful paper that will be well appreciated by the cilia community but also should be appreciated by the broader cell biology community.

      • The strengths of this paper are a high level of rigor in which they perform detailed quantification of a wide range of processes. For many experiments they have multiple methods for disrupting function which again adds to the rigor. They have successfully linked Piezo1, Erk, Yap and FoxJ1 function to proper centriole biogenesis, which is a significant advance.

      • The limitation is that all their perturbations negatively effect centriole number which could be indirect. If their model is correct then they should be able to activate the pathway in one way or another to stimulate centriole number. This is a significant limitation to their overall model.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Summary: The authors have previously published Mass-spectrometry data that demonstrates a physical interaction between Sall4 and the BAF chromatin complex in iPSC derived neurectodermal cells that are a precursor cell state to neural crest cells. The authors sought to understand the basis of this interaction and investigate the role of Sall4 and the BAF chromatin remodelling complex during neural crest cell specification. The authors first validate this interaction with a co-IP between ARID1B subunit and Sall4 confirming the mass spec data. The authors then utilise in silico modelling to identify the specific interaction between the BAF complex and Sall4, suggesting that this contact is mediated through the BAF complex member DPF2. To functionally validate the role of Sall4 during neural crest specification, the authors utilsie CRISPR-Cas9 to introduce a premature stop codon on one allele of Sall4 to generate iPSCs that are haploinsufficient for Sall4. Due to the reports of Sall4's role in pluripotency, the authors confirm that this model doesn't disrupt pluripotent stem cells and is viable to model the role of Sall4 during neural crest induction. The authors expand this assessment of Sall4 function further during their differentiation model to cranial neural crest cells, assessing Sall4 binding with Cut+Run sequencing, revealing that Sall4 binds to motifs that correspond to key genes in neural crest differentiation. Moreover, reduction in Sall4 expression also reduces the binding of the BAF complex, through Cut and Run for BRG1. Overall, the authors then propose a model by which Sall4 and BRG1 bind to and open enhancer regions in neurectodermal cells that enable complete differentiation to cranial neural crest cells.

      Overall, the data is clear and reproducible and offers a unique insight into the role of chromatin remodellers during cell fate specification.

      We thank the Reviewer for the nice words of appreciation of our manuscript.

      However, I have some minor comments.

      1- Using AlphaFold in silico modelling, he authors propose the interaction between the BAF complex with Sall4 is mediated by DPF2, but don't test it. Does a knockout, or knockdown of DPF2 prevent the interaction?

      We agree with the Reviewer that we are not functionally validating our computational prediction that DPF2 is the specific BAF subunit directly linking SALL4 with BAF. We chose not to perform the validation experiment for two main reasons:

      1) This would be outside of the scope of the paper. In fact, from a mechanistic point of view, we have confirmed via both Mass-spectrometry and co-IP with ARID1B that SALL4 and BAF interact in our system. Moreover, mechanistically we also extensively demonstrate that the interaction with SALL4 is required to recruit BAF at the neural crest induction enhancers and we further demonstrate that depletion of SALL4 impairs this. In our view, this was the focus of the manuscript. On the other hand, detecting with certainty which BAF subunit mediates the interaction with SALL4 would be outside the scope of the paper.

      2) Moreover, after careful consideration, we don’t think that even a knock-out of DPF2 would provide a definite answer to which exact BAF subunit mediates the interaction with SALL4. In fact, knock out of DPF2 could potentially disrupt BAF assembly or stability, and this could result in a disruption of the interaction with SALL4 even if DPF2 is not the very subunit mediating it (in other words the experiment could provide a false positive result). In our opinion, the only effective experiment would be mutating the DPF2 residues that we computationally predicted as responsible for the interaction with SALL4, but again this would be very laborious and out of the scope.

      That being said, we agree with the Reviewer that while the SALL4-BAF interaction was experimentally validated with robust approaches, the role of DPF2 in the interaction was only computationally predicted, which comes as a limitation of the study. We have now added a dedicated paragraph in the discussion to acknowledge such limitation.

      2- OPTIONAL: Does knockout of DPF2 phenocopy the Sall4 ko? This would be very interesting to include in the manuscript, but it would perhaps be a larger body of work.

      See point-1.

      3- Figure 1, the day of IP is not clearly described until later in the test. please outline during in the figure.

      We thank the Reviewer for pointing this out. This has been fixed.

      3- What is the expression of Sall1 (and other Sall paralogs) during differentiation. The same with the protein levels of Sall4, does this remain at the below 50%, or is this just during pluripotency?

      As Recommend by the Reviewer, we have performed time-course WB of SALL1 and SALL4. These experiments revealed that SALL1 remains very lowly expressed in wild-type conditions across time points and all the way through differentiation until CNCC (See updated supplementary Fig. S9). This is consistent with previous studies that demonstrated that SALL4, but not SALL1, is required for early mammalian development (see for example Miller et al. 2016, Development, and Koulle et al. 2025, Biorxiv). We performed the same time-course WB for SALL4 which revealed that SALL4 expression progressively decreases after day-5 (as expected) and it’s very low at CNCC stage (day-14), therefore we would expect the KO to remain at even lower level at this stage.

      4- The authors hypothesise that Sall4 binds to enhancers- with the criteria for an enhancer being that these peaks > 1KB from the TSS are enhancers. Can this be reinforced by overlaying with other ChIP tracks that would give more confidence in this? There are several datasets from Joanna Wysocka's lab that also utilise this protocol which can give you more evidence to reinforce the claim and provide further detail as to the role of Sall4.

      We thank the Reviewer for this great suggestion. As recommended, we have used publicly available ChIP-seq data generated by the Wysocka lab (H3K4me1, H3K4m3) and also generated new H3K27ac CHIP-seq data as well. These experiments and analyses confirmed that these regions are putative CNCC enhancers (and a minority of them putative promoters), decorated with H3K4me1 and with progressive increase in H3K27ac after CNCC induction (day-5). See new Supplementary Figure S6.

      5- The authors state that cells fail to become cranial neural crest cells, however they do not propose what the cells do instead. do they become neural? Or they stay at pluriopotent, which is one option given the higher expression of Nanog, OCT4 and OTX2 that are all expressed in pluripotent stem cells.

      We think that it is likely a mix of both. There is a mixed bag of expression of pluripotency markers, but also high expression of neuroectodermal markers. This suggests that most cells safely reach the neuroectodermal stage but fail to go beyond that, while some of the cells simply do not differentiate or regress back to pluripotency. We would rather refrain on overinterpreting what the KO-cells become, as it is likely an aberrant cell type, but following the Reviewer’s indication we have added a paragraph in the discussion to speculate on this.

      6- In general, I would like to see the gating strategy and controls for the flow cytometry in a supplemental figure.

      As Recommended by the Reviewer, we have added the gating strategy in the Supplementary Fig. S4.

      7- For supplementary figure 1- please include the gene names in the main image panels rather than just the germ layer.

      Done. The figure is now Supplementary Figure S3 since two supplementary figures were added before.


      Reviewer #2

      Summary In this manuscript, the authors build on their previous work (Pagliaroli et al., 2021) where they identified an interaction between the transcription factor SALL4 and the BAF chromatin remodeling complex at Day-5 of an iPSC to CNCC differentiation protocol. In their current work, the authors begin by exploring this interaction further, leveraging AlphaFold to predict interaction surfaces between SALL4 and BAF complex members, considering both SALL4 splice isoforms: a longer SALL4A (associated with developmental processes) and a shorter SALL4B (associated with pluripotency). They propose that SALL4A may interact with DPF2, a BAF complex member, in an isoform-dependent manner. The authors next explore the role of SALL4 in craniofacial development, motivated by patient heterozygous loss of function mutations, leveraging iPSC cells with an engineered SALL4 frameshift mutation (SALL4-het-KO). Using this model, the authors first demonstrate that a reduced expression of SALL4 does not impact the iPSC identity, perhaps due to compensation via upregulation of SALL1. Upon differentiation to neuroectoderm, SALL4 haploinsufficiency causes a reduction in newly accessible sites which are associated with a reduction in SALL4 binding and therefore a loss of BAF complex recruitment. Interestingly, however, there were few transcriptional changes at this stage. Later in the CNCC differentiation at Day-14 when the wildtype cells have switched expression of CNCC markers, the SALL4-het-KO cells fail to switch cadherin expression associated with a transition from epithelial to mesenchymal state, and fail to induce CNCC specification and post-migratory markers. Together the authors propose that SALL4 recruits BAF to CNCC enhancers as early as the neuroectodermal stage, and failure of BAF recruitment in SALL4-het-KO lines results in a loss of open chromatin at regulatory regions required later for induction of the CNCC programme. The failure of the later differentiation is compelling in the light of the early stages of the differentiation progressing normally, and the authors outline an interesting proposed mechanism whereby SALL4 recruits BAF to remodel chromatin ahead of CNCC enhancer activation, a model that can be tested further in future work. The link between SALL4 DNA binding and BAF recruitment is nicely argued, and very interesting as altered chromatin accessibility at Day 5 in the neuroectodermal stage is associated with only few changes in gene expression, while gene expression is greatly impacted later in the CNCC stage at Day 14. The in silico predictions of SALL4-BAF interaction interfaces are perhaps less convincing, requiring experimental follow-up outside the scope of this paper. Some of the associated figures could perhaps be moved to the supplement to enhance the focus on the later functional genomics experiments.

      We thank the Reviewer for the nice words of appreciation of our manuscript.

      Major comments

      1. A lot of emphasis is placed on the AlphaFold predictions in Figure 1, however the predictions in Figure 1B appear to be mostly low or very low confidence scores (coloured yellow and orange). It is unclear how much weight can be placed on these predictions without functional follow-up, e.g. mutating certain residues and showing impact on the interaction by co-IP. The latter parts of the manuscript are much better supported experimentally, and therefore perhaps some of the Figure 1 could move to a Supplemental Figure (e.g. the right-hand part of 1B, and the lower part of Figure 1C showing SALL4B predicted interactions). The limitations of AlphaFold predictions should be acknowledged and the authors should discuss how these predicted interactions could be experimentally explored further in the future.

      As recommended by the Reviewer, we have moved part of the AlphaFold predictions to Supplementary Figure S1, and we added a paragraph in the discussion to acknowledge the limitations of AlphaFold.

      The authors only show data for one heterozygous knockout clone for SALL4. It is usual to have more than one clone to mitigate potential clonal effects. The authors should comment why they only have one clone and include any data for a second clone for key experiments if they already have this. Alternatively, the authors could provide any quality control information generated during production of this line, for example if any additional genotyping was performed.

      We apologize for the confusion and for our lack of clarify on this. We have used two clones (one generated with a 11 bp deletion, one with a 19 bp deletion, both in exon-1, see also the point 6 of your minor points). The two clones were used as biological replicates, so for example the two ATAC-seq replicates performed in each time point were performed with the two different clones, and the three RNA-seq replicates were performed with two technical replicates of the clone with the 11bp deletion and one replicate with the clone with 19 bp deletion. We have clarified this in the methods section of the manuscript and added a Supplementary Figure (S2) showing the editing strategy for the two clones. Thank you for catching it.

      The authors show all genomics data (ATAC-seq, CUT&RUN and ChIP-seq) as heatmaps and average profiles. It would be valuable to see some representative loci for the ATAC seq (perhaps along with SALL4 and BRG1 recruitment) at some representative and interesting loci.

      As recommended by the Reviewer, we have added Genome Browser screenshots of representative loci in Fig. 6.

      Figure 4A. The schematic could be improved by including brightfield or immunofluorescent images at the three stages of the differentiation. Are the iPS cells seeded as single cells, or passaged as colonies before starting the differentiation. Further details are required in the methods to clarify how the differentiation is performed, for example at what Day are the differentiating cells passaged, this is not shown on the schematic in Figure 4A.

      As recommended, we added IF images in the Fig. 4A schematic, and added more details in the methods.

      There is likely some heterogeneity of cell types in the differentiation at Day 5 and Day 14. Can the authors comment on this from previous publications or perhaps conduct some IF for markers to demonstrate what proportions of cells are neuroectoderm at Day 5 and CNCCs at Day 14.

      The differentiation starts with single cells that aggregate to form neuroectodermal clusters, as per original protocol. The CNCCs that we obtain with this protocol homogeneously express CNCC markers, as shown by IF of SOX9 in Fig. 4A. For the day-5, as recommended we have added IF for PAX6 also showing homogeneous expression (Fig. 4A).

      For the motif analysis for Day 5-specific SALL4 binding sites (Figure 4E), was de novo motif calling performed? Were any binding sites reminiscent of a SALL4 binding site observed (e.g. an AT-rich motif)? Could the authors comment on this in the text - if there is no SALL4 binding motif, does this suggest SALL4 is recruited indirectly to these sites via interaction with another transcription factor for example?

      Similar to SALL4, SALL1 also recognizes AT-rich motifs. However, while we found AT-rich motifs as enriched in our day-5 motif analysis (in the regions that gain SALL4 binding upon differentiation), the enrichment is not particularly strong, and several other motifs are significantly more enriched, suggesting that, like the Reviewer mentioned, SALL4 might be recruited indirectly at these sites by other factors. We have added a paragraph on this in the discussion.

      Does SALL1 remain upregulated at Day-5 and Day-14 of the differentiation for the SALL4-het-KO line? Are binding sites known for this TF and were they detected in the motif analysis performed? Further discussion of the impact of the overexpression of SALL1 on the phenotypes observed is warranted - e.g. for Figure 5F, could the sites associated with a gain of BRG1 peaks upon loss of SALL4 be associated with SALL1 being upregulated and 'hijacking' BAF recruitment to distinct sites associated with nervous system development? Is SALL1 still upregulated at Day 5?

      As mentioned above, SALL1 also recognizes AT-rich motifs but similar to SALL4 also binds unspecifically, likely in cooperation with other TFs. Like the Reviewer suggested, it is certainly possible that some of the sites associated with a gain of BRG1 peaks upon loss of SALL4 could be associated with SALL1 being upregulated and 'hijacking' BAF recruitment to distinct sites. While this is speculative, we have added a paragraph on this in the discussion.

      Related to the point above, SALL4A is proposed to have an isoform-specific interaction with the BAF complex. It would be valuable to plot SALL4A and SALL4B expression from the available RNA-seq data at Day 0, 5 and 14 to explore whether stage-specific isoform expression matches with the proposed role of SALL4A to interact with BAF at Day 5. It could be valuable to also look at expression of SALL1, 2 and 3 across the time course to see whether additional compensation mechanisms are at play during the differentiation.

      Thanks for suggesting this. We performed a time course analysis of isoform specific gene expression, which showed that SALL4B expression remains low throughout differentiation, while SALLA4A expression increases upon differentiation cues and it remains at high levels until the end. We have added this to supplementary Fig. S9. Moreover, we have performed an additional experiment, using pomalidomide, which is a thalidomide derivative that selectively degrades SALL4A but not SALL4B. Notably, SALL4A degradation recapitulated the main findings obtained with the CRISPR-KO of SALL4, further supporting that SALL4A is the isoform involved in CNCC induction (see new Fig. 8).

      At line 264, The authors state "SALL4 recruits the BAF complex at CNCC developmental enhancers to increase chromatin accessibility". Given that this analysis is performed at Day 5 of the differentiation, which is labelled as neuroectoderm what evidence do the authors have that these are specifically CNCC enhancers? Statements relating to enhancers should generally be re-phrased to putative enhancers (as no functional evidence is provided for enhancer activity), and further evidence could be provided to support that these are CNCC-specific regulatory elements, e.g. showing representative gene loci from CNCC-specific genes. Discussion of the RNA-seq presented in Supplementary Figure 2B may also be appropriate to introduce here given that large numbers of accessible chromatin sites are detected while the expression of very few genes is impacted, suggesting these sites may become active enhancers at a later developmental stage.

      As also recommended by the other Reviewer, to further characterize these sites, we have used publicly available histone modification CHIP-seq data (H3K4me1, H3K4me3) generated by the Wysocka lab (H3K4me1, H3K4m3) and also generated new H3K27ac CHIP-seq data as well. These experiments and analyses confirmed that these regions are putative CNCC enhancers (and a minority of them putative promoters), all decorated with H3K4me1, and all showing progressive increase in H3K27ac after CNCC induction (day-5). See new Supplementary Figure S6.

      1. Do any of the putative CNCC enhancers detected at Day 5 as being sensitive to SALL4 downregulation and loss of BAF recruitment overlap with previously tested VISTA enhancers (https://enhancer.lbl.gov/vista/)?

      Yes, we have found examples of overlap and have included two of them in the updated Figure 6 as Genome Browser screenshots.

      Minor comments

      1. The authors are missing references in the introduction "a subpopulation of neural crest cells that migrate dorsolaterally to give rise to the cartilage and bones of the face and anterior skull, as well as cranial neurons and glia".

      Fixed, thank you.

      The discussion of congenital malformations associated with SALL4 haploinsufficiency is brief in the introduction. From OMIM, SALL4 heterozygous mutations are implicated with the condition Duane-radial ray syndrome (DRRS) with "upper limb anomalies, ocular anomalies, and, in some cases, renal anomalies... The ocular anomalies usually include Duane anomaly". That Duane anomaly is one phenotype among a number for patients with SALL4 haploinsufficiency could be clarified in the introduction. Of note, this is stated more clearly in the discussion but needs re-wording in the introduction.

      Done, thank you.

      The statements "show that the SALL4A isoform directly interacts with the BAF complex subunit DPF2 through its zinc-finger-3 domain" and "this interaction occurs between the zinc-finger-cluster-3 (ZFC3) domain of SALL4A and the plant homeodomains (PHDs) of DPF2" in the introduction appear overstated and should be toned down. To show this the authors would need to mutate or delete the proposed important zinc-finger domains from SALL4A, which is outside the scope of this work. Notably, this is less strongly-stated elsewhere in the manuscript, e.g "predict that this interaction is mediated by the BAF subunit DPF2", Line 162.

      Done, thank you.

      Could the authors clarify why 3 Alphafold output models are shown for SALL4B in Figure 1C, and only one output model for SALL4A?

      AlphaFold3 produces five separate predicted models per protein combination (e.g., Model_1 … Model_4), each derived from slightly different network parameters or initializations. The final output prioritizes the model with the highest confidence score. This multi-model strategy enables the identification of the most robust conformation while providing a measure of structural uncertainty (as per GitHub documentation for AlphaFold3). wE have conducted the same analysis for SALL4A as we did for SALL4B. Specifically, SALL4A interacts with the AT-rich DNA in models 0, 1, and 2, therefore models 3 and 4 were excluded. When analysing models 1 and 2, we found a higher number of residues involved in the interaction (>800 instead of 396). Similarly to model 0, only the interactions between residues belonging to an annotated functional domain (ZFs and PHDs) were considered.

      In Model 1: SALL4A and DPF2 interact mainly through ZF6 and 7, and not 5 as Model 0.

      In Model 2: SALL4A and DPF2 interact mainly through ZF5 and 6, and not 7 as Models 0. In contrast, this model shows an interaction with ZF1 not shown in the other two models, but with a higher PAE (31 average compared to 25 to 27 average of the other two ZFs.

      Therefore, we considered Model 0 as it is the model with higher confidence and representative of all significant models (includes ZF5, 6, and 7).

      Line 121. The authors state "DPF2, a broadly expressed BAF subunit,", but don't show expression during their CNCC differentiation. It would be good to include expression of DPF2 in Figure 1E.

      Done, thank you.

      The text states "a 11 bp deletion within the 3'-terminus of exon 1 of SALL4", while the figure legend states, "Sanger sequencing confirming the 19 bp deletion in one allele of SALL4 is displayed". The authors should clarify this disparity and experimentally confirm the deletion, e.g. by TA-cloning the two alleles and sequencing these separately to show that one allele is wildtype and the other has a frameshift deletion.

      We apologize for the confusion. As stated above (point-2 of the major comments), we have used two clones (one generated with a 11 bp deletion, one with a 19 bp deletion, both in exon-1, see also the point 6 of your minor points). The two clones were used as biological replicates (see response above for details). The deletion for both clones was experimentally confirmed by Sanger sequencing by the company that generated the lines for us (Synthego). The strategy for the two clones is now shown also in Supplementary Fig. S2.

      The authors generate an 11-bp (or 19-bp?) deletion in exon-1 - it would be valuable to include a discussion whether patients have been identified with deletions and frame-shift mutations in this region of SALL4 exon-1. And also clarify, if not clearly stated in the text, that both SALL4A and SALL4B will be impacted by this mutation. Are there examples of patient mutations which only impact SALL4A?

      As requested, we have added a discussion paragraph to discuss this. And, yes, both SALL4A and SALL4B are impacted by both deletions in both clones (11 bp and 19 bp deletion).

      Regarding patient variants on exon-1 and patient variants that only impact SALL4A. We could only find one published pathogenic 170bp deletion in exon 1 (VCV000642045.7). The majority of the pathogenic or likely pathogenic variances are located on exon2. In particular, of the 63 reported pathogenic (or likely pathogenic) clinical variants, 42 were located on exon 2. Among these, 28 are located in the portion shared by both SALL4A and SALL4B, while the remaining 14 were SALL4A specific.

      For the SALL4 blots in Figure 2B, is the antibody expected to detect both isoforms (SALL4A and SALL4B), and which isoform is shown? If two isoforms are detected, they should both be presented in the figure.

      Yes, the antibody detects both isoforms, and we now present both in the figure 2, as recommended.

      SALL4 expression should be shown for Figure 2C to see whether the >50% down-regulation of SALL4 at the protein level may be partially driven by transcriptional changes.

      Done, thank you. As expected, we observed the SALL4 mRNA expression in the KO line is comparable to wild-type conditions, but still this results in a significant decrease of the SALL4 protein level likely because of autoregulatory mechanisms coupled with non-sense mediated decay of the mutated allele. Also, we note that SALL4 usually makes homodimers, therefore lack of sufficient amount of protein could also lead to degradation of the monomers.

      The number of experimental replicates should be indicated in all figure legends where relevant. Raw data points should be plotted visibly over the violin plots (e.g. Figure 2C).

      Done, thank you.

      For Figure 3A, the images of the DAPI and NANOG/OCT4 staining should be shown separately in addition to the overlay.

      Done, thank you.

      The metric 'Corrected Total Cell Fluorescence (CTCF)' should be described in the methods. The number of images used for the quantification in Figure 3A should be

      Done, thank you.

      Figure 3C - what are the 114 differentially expressed genes? Some interesting genes could be labelled on the plot and the data used to generate this plot should be included as a Supplementary Table. Supplementary Tables should similarly be provided for Figure 6C, Day 14 and Supplementary Figure 2B, Day 5.

      As recommended, we have highlighted some interesting genes in the volcano plot and also included all the expression data for all genes in Supplementary Table S3.

      Figure 4B. The shared peaks are not shown. For completeness, it would be ideal to show these sites also.

      Done, thank you.

      Figure 4C is difficult to interpret. Why is the plot asymmetric to the left versus right? What does the axis represent - % of binding sites?

      The asymmetry is due to the fact that there is a larger number of peaks that are downstream of the TSS than peaks that are upstream of TSS. This is consistent with the fact that many SALL4 peaks are in introns, likely representing intronic enhancers.

      Line 224-225. What do n= 3,729 and n= 6,860 refer to? There appear to be many more binding sites indicated in Figure 4B, therefore these numbers cannot represent 86% and 97% of sites?

      Thank you for pointing this out, we should have specified in the text. Those numbers refer to the genes whose TSS is closest to each SALL4 peak. Notably, multiple peaks can share the same closest TSS, hence the discrepancy between # of peaks and # of nearest genes.

      Raw numbers:

      • Day-0 RAW = 6,104 (peaks = 6,114);
      • Day-5 RAW = 17,131 (peaks = 17,137). Now raw data reported in Supplementary Table 4.

      Figure 4E. Several TFs mentioned in the text (Line 243) are not shown in the figure, it would be good to show all TFs motifs mentioned in the text in this figure. Again, there is no mention of whether a sequence-specific motif is detected for SALL4 (e.g. an AT-rich sequence) from this motif analysis.

      Done, thank you. An AT-rich sequence, resembling the SALL4 motif, was detected in a small minority of sites (this is now shown in Supplementary Figure S5), suggesting that SALL4 engages chromatin in a broad manner, going beyond its preferred motif, possibly in cooperation with other TFs. This is consistent with many studies that in mESCs have shown that SALL4 binds at OCT4/NANOG/SOX2 target motifs. This is now discussed in a dedicated paragraph in the discussion.

      Figure 4G. How was the ATAC-seq data normalized for the WT and SALL4-het-KO lines for this comparison? The background levels of accessibility seem quite different in Replicate 1.

      The bigwigs used to make the heatmaps are normalized by sequencing depth using the Deeptools Suite (normalization by RPKM).

      Figures 5B-C could be exchanged to flow better with the text. A Venn diagram could be included to show the overlap between the sites losing BRG1 in SALL4-het-KO (13,505 sites) and the Day5-specific SALL4 sites (17,137 sites).

      Done, thank you.

      At Day 5, the authors suggest a shift towards neural differentiation. It could be interesting for the authors to perform qRT-PCR at Day 5 for some neural markers or look in the Day 14 data for markers of neural differentiation at the expense of CNCC markers.

      See updated Supplementary Fig. S8, where we show timecourse expression of several genes, including neural markers.

      Is the data used to plot Figure 5D the same as Figure 4G. If so, why is only one replicate shown in Figure 5D?

      Only one replicate was shown in the main figure purely for lack of space, but the experiment was replicated twice (with the two different clones), and the results were exactly the same. See plots below for your convenience:

      Figure 6A. How many replicates are shown? If n=2, boxplots are not an appropriate to represent the distribution of the data. Please include n= X in the figure legend and plot the raw data points also.

      Done, thank you, and as suggested we are no longer using boxplots for this panel.

      Figure 6B. What is the significance of CD99 for CNCC differentiation?

      Figure 6F. No error bars are shown, how many replicates were performed for this time couse? The linear regression line does not appear to add much value and could be removed.

      As suggested, we have removed these plots and replaced them with individual genes plots, which include error bars. See updated Supplementary Figure S8.

      At line 304, the authors state "while SALL4-het-KO showed a significant downregulation of these genes". Perhaps 'failed to induce these genes' may be more accurate unless they were expressed at Day 5 and downregulated at Day 14.

      Done, thank you.

      Lines 332-335. The genes selected for pluripotency, neural plate border, CNCC specification could be plotted separately in the Supplement to show individual gene expression dynamics.

      Done, thank you, see point 24.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, the authors build on their previous work (Pagliaroli et al., 2021) where they identified an interaction between the transcription factor SALL4 and the BAF chromatin remodeling complex at Day-5 of an iPSC to CNCC differentiation protocol. In their current work, the authors begin by exploring this interaction further, leveraging AlphaFold to predict interaction surfaces between SALL4 and BAF complex members, considering both SALL4 splice isoforms: a longer SALL4A (associated with developmental processes) and a shorter SALL4B (associated with pluripotency). They propose that SALL4A may interact with DPF2, a BAF complex member, in an isoform-dependent manner. The authors next explore the role of SALL4 in craniofacial development, motivated by patient heterozygous loss of function mutations, leveraging iPSC cells with an engineered SALL4 frameshift mutation (SALL4-het-KO). Using this model, the authors first demonstrate that a reduced expression of SALL4 does not impact the iPSC identity, perhaps due to compensation via upregulation of SALL1. Upon differentiation to neuroectoderm, SALL4 haploinsufficiency causes a reduction in newly accessible sites which are associated with a reduction in SALL4 binding and therefore a loss of BAF complex recruitment. Interestingly, however, there were few transcriptional changes at this stage. Later in the CNCC differentiation at Day-14 when the wildtype cells have switched expression of CNCC markers, the SALL4-het-KO cells fail to switch cadherin expression associated with a transition from epithelial to mesenchymal state, and fail to induce CNCC specification and post-migratory markers. Together the authors propose that SALL4 recruits BAF to CNCC enhancers as early as the neuroectodermal stage, and failure of BAF recruitment in SALL4-het-KO lines results in a loss of open chromatin at regulatory regions required later for induction of the CNCC programme. The failure of the later differentiation is compelling in the light of the early stages of the differentiation progressing normally, and the authors outline an interesting proposed mechanism whereby SALL4 recruits BAF to remodel chromatin ahead of CNCC enhancer activation, a model that can be tested further in future work.

      Major comments

      The link between SALL4 DNA binding and BAF recruitment is nicely argued, and very interesting as altered chromatin accessibility at Day 5 in the neuroectodermal stage is associated with only few changes in gene expression, while gene expression is greatly impacted later in the CNCC stage at Day 14. The in silico predictions of SALL4-BAF interaction interfaces are perhaps less convincing, requiring experimental follow-up outside the scope of this paper. Some of the associated figures could perhaps be moved to the supplement to enhance the focus on the later functional genomics experiments.

      1. A lot of emphasis is placed on the AlphaFold predictions in Figure 1, however the predictions in Figure 1B appear to be mostly low or very low confidence scores (coloured yellow and orange). It is unclear how much weight can be placed on these predictions without functional follow-up, e.g. mutating certain residues and showing impact on the interaction by co-IP. The latter parts of the manuscript are much better supported experimentally, and therefore perhaps some of the Figure 1 could move to a Supplemental Figure (e.g. the right-hand part of 1B, and the lower part of Figure 1C showing SALL4B predicted interactions). The limitations of AlphaFold predictions should be acknowledged and the authors should discuss how these predicted interactions could be experimentally explored further in the future.
      2. The authors only show data for one heterozygous knockout clone for SALL4. It is usual to have more than one clone to mitigate potential clonal effects. The authors should comment why they only have one clone and include any data for a second clone for key experiments if they already have this. Alternatively, the authors could provide any quality control information generated during production of this line, for example if any additional genotyping was performed.
      3. The authors show all genomics data (ATAC-seq, CUT&RUN and ChIP-seq) as heatmaps and average profiles. It would be valuable to see some representative loci for the ATAC seq (perhaps along with SALL4 and BRG1 recruitment) at some representative and interesting loci.
      4. Figure 4A. The schematic could be improved by including brightfield or immunofluorescent images at the three stages of the differentiation. Are the iPS cells seeded as single cells, or passaged as colonies before starting the differentiation. Further details are required in the methods to clarify how the differentiation is performed, for example at what Day are the differentiating cells passaged, this is not shown on the schematic in Figure 4A.
      5. There is likely some heterogeneity of cell types in the differentiation at Day 5 and Day 14. Can the authors comment on this from previous publications or perhaps conduct some IF for markers to demonstrate what proportions of cells are neuroectoderm at Day 5 and CNCCs at Day 14.
      6. For the motif analysis for Day 5-specific SALL4 binding sites (Figure 4E), was de novo motif calling performed? Were any binding sites reminiscent of a SALL4 binding site observed (e.g. an AT-rich motif)? Could the authors comment on this in the text - if there is no SALL4 binding motif, does this suggest SALL4 is recruited indirectly to these sites via interaction with another transcription factor for example?
      7. Does SALL1 remain upregulated at Day-5 and Day-14 of the differentiation for the SALL4-het-KO line? Are binding sites known for this TF and were they detected in the motif analysis performed? Further discussion of the impact of the overexpression of SALL1 on the phenotypes observed is warranted - e.g. for Figure 5F, could the sites associated with a gain of BRG1 peaks upon loss of SALL4 be associated with SALL1 being upregulated and 'hijacking' BAF recruitment to distinct sites associated with nervous system development? Is SALL1 still upregulated at Day 5?
      8. Related to the point above, SALL4A is proposed to have an isoform-specific interaction with the BAF complex. It would be valuable to plot SALL4A and SALL4B expression from the available RNA-seq data at Day 0, 5 and 14 to explore whether stage-specific isoform expression matches with the proposed role of SALL4A to interact with BAF at Day 5. It could be valuable to also look at expression of SALL1, 2 and 3 across the time course to see whether additional compensation mechanisms are at play during the differentiation.
      9. At line 264, The authors state "SALL4 recruits the BAF complex at CNCC developmental enhancers to increase chromatin accessibility". Given that this analysis is performed at Day 5 of the differentiation, which is labelled as neuroectoderm what evidence do the authors have that these are specifically CNCC enhancers? Statements relating to enhancers should generally be re-phrased to putative enhancers (as no functional evidence is provided for enhancer activity), and further evidence could be provided to support that these are CNCC-specific regulatory elements, e.g. showing representative gene loci from CNCC-specific genes. Discussion of the RNA-seq presented in Supplementary Figure 2B may also be appropriate to introduce here given that large numbers of accessible chromatin sites are detected while the expression of very few genes is impacted, suggesting these sites may become active enhancers at a later developmental stage.
      10. Do any of the putative CNCC enhancers detected at Day 5 as being sensitive to SALL4 downregulation and loss of BAF recruitment overlap with previously tested VISTA enhancers (https://enhancer.lbl.gov/vista/)?

      Minor comments

      1. The authors are missing references in the introduction "a subpopulation of neural crest cells that migrate dorsolaterally to give rise to the cartilage and bones of the face and anterior skull, as well as cranial neurons and glia".
      2. The discussion of congenital malformations associated with SALL4 haploinsufficiency is brief in the introduction. From OMIM, SALL4 heterozygous mutations are implicated with the condition Duane-radial ray syndrome (DRRS) with "upper limb anomalies, ocular anomalies, and, in some cases, renal anomalies... The ocular anomalies usually include Duane anomaly". That Duane anomaly is one phenotype among a number for patients with SALL4 haploinsufficiency could be clarified in the introduction. Of note, this is stated more clearly in the discussion but needs re-wording in the introduction.
      3. The statements "show that the SALL4A isoform directly interacts with the BAF complex subunit DPF2 through its zinc-finger-3 domain" and "this interaction occurs between the zinc-finger-cluster-3 (ZFC3) domain of SALL4A and the plant homeodomains (PHDs) of DPF2" in the introduction appear overstated and should be toned down. To show this the authors would need to mutate or delete the proposed important zinc-finger domains from SALL4A, which is outside the scope of this work. Notably, this is less strongly-stated elsewhere in the manuscript, e.g "predict that this interaction is mediated by the BAF subunit DPF2", Line 162.
      4. Could the authors clarify why 3 Alphafold output models are shown for SALL4B in Figure 1C, and only one output model for SALL4A?
      5. Line 121. The authors state "DPF2, a broadly expressed BAF subunit,", but don't show expression during their CNCC differentiation. It would be good to include expression of DPF2 in Figure 1E.
      6. The text states "a 11 bp deletion within the 3'-terminus of exon 1 of SALL4", while the figure legend states, "Sanger sequencing confirming the 19 bp deletion in one allele of SALL4 is displayed". The authors should clarify this disparity and experimentally confirm the deletion, e.g. by TA-cloning the two alleles and sequencing these separately to show that one allele is wildtype and the other has a frameshift deletion.
      7. The authors generate an 11-bp (or 19-bp?) deletion in exon-1 - it would be valuable to include a discussion whether patients have been identified with deletions and frame-shift mutations in this region of SALL4 exon-1. And also clarify, if not clearly stated in the text, that both SALL4A and SALL4B will be impacted by this mutation. Are there examples of patient mutations which only impact SALL4A?
      8. For the SALL4 blots in Figure 2B, is the antibody expected to detect both isoforms (SALL4A and SALL4B), and which isoform is shown? If two isoforms are detected, they should both be presented in the figure.
      9. SALL4 expression should be shown for Figure 2C to see whether the >50% down-regulation of SALL4 at the protein level may be partially driven by transcriptional changes.
      10. The number of experimental replicates should be indicated in all figure legends where relevant. Raw data points should be plotted visibly over the violin plots (e.g. Figure 2C).
      11. For Figure 3A, the images of the DAPI and NANOG/OCT4 staining should be shown separately in addition to the overlay.
      12. The metric 'Corrected Total Cell Fluorescence (CTCF)' should be described in the methods. The number of images used for the quantification in Figure 3A should be indicated in the legend, and error bars included if multiple images were quantified.
      13. Figure 3C - what are the 114 differentially expressed genes? Some interesting genes could be labelled on the plot and the data used to generate this plot should be included as a Supplementary Table. Supplementary Tables should similarly be provided for Figure 6C, Day 14 and Supplementary Figure 2B, Day 5.
      14. Figure 4B. The shared peaks are not shown. For completeness, it would be ideal to show these sites also.
      15. Figure 4C is difficult to interpret. Why is the plot asymmetric to the left versus right? What does the axis represent - % of binding sites?
      16. Line 224-225. What do n= 3,729 and n= 6,860 refer to? There appear to be many more binding sites indicated in Figure 4B, therefore these numbers cannot represent 86% and 97% of sites?
      17. Figure 4E. Several TFs mentioned in the text (Line 243) are not shown in the figure, it would be good to show all TFs motifs mentioned in the text in this figure. Again, there is no mention of whether a sequence-specific motif is detected for SALL4 (e.g. an AT-rich sequence) from this motif analysis.
      18. Figure 4G. How was the ATAC-seq data normalized for the WT and SALL4-het-KO lines for this comparison? The background levels of accessibility seem quite different in Replicate 1.
      19. Figures 5B-C could be exchanged to flow better with the text. A Venn diagram could be included to show the overlap between the sites losing BRG1 in SALL4-het-KO (13,505 sites) and the Day5-specific SALL4 sites (17,137 sites).
      20. At Day 5, the authors suggest a shift towards neural differentiation. It could be interesting for the authors to perform qRT-PCR at Day 5 for some neural markers or look in the Day 14 data for markers of neural differentiation at the expense of CNCC markers.
      21. Is the data used to plot Figure 5D the same as Figure 4G. If so, why is only one replicate shown in Figure 5D?
      22. Figure 6A. How many replicates are shown? If n=2, boxplots are not an appropriate to represent the distribution of the data. Please include n= X in the figure legend and plot the raw data points also.
      23. Figure 6B. What is the significance of CD99 for CNCC differentiation?
      24. Figure 6F. No error bars are shown, how many replicates were performed for this time couse? The linear regression line does not appear to add much value and could be removed.
      25. At line 304, the authors state "while SALL4-het-KO showed a significant downregulation of these genes". Perhaps 'failed to induce these genes' may be more accurate unless they were expressed at Day 5 and downregulated at Day 14.
      26. Lines 332-335. The genes selected for pluripotency, neural plate border, CNCC specification could be plotted separately in the Supplement to show individual gene expression dynamics.

      Significance

      This work provides a conceptual advance in understanding the aetiology of human SALL4-mediated craniofacial malformations in a cell-type specific manner. Leveraging an in vitro differentiation system, the authors define development timepoints and cell types impacted by altered SALL4 dosage. Additionally, the authors provide interesting mechanistic insights how the teratogen thalidomide may impact craniofacial development through proteasomal targeting and degradation of SALL4, and subsequent impact on neural crest differentiation progression.

      Several audiences will be interested in this work: stem cell and developmental biologists (especially those interested in neural crest and facial development), and researchers interested in enhancer regulation, chromatin biology or gene regulatory mechanisms. Clinician scientists and geneticists will be interested in the proposed implications for mechanisms of disease.

      Field of expertise: We have expertise in mechanisms of gene regulation and in vitro models of early development. We are not experts in modeling protein interactions in silico.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The authors have previously published Mass-spectrometry data that demonstrates a physical interaction between Sall4 and the BAF chromatin complex in iPSC derived neurectodermal cells that are a precursor cell state to neural crest cells. The authors sought to understand the basis of this interaction and investigate the role of Sall4 and the BAF chromatin remodelling complex during neural crest cell specification. The authors first validate this interaction with a co-IP between ARID1B subunit and Sall4 confirming the mass spec data. The authors then utilise in silico modelling to identify the specific interaction between the BAF complex and Sall4, suggesting that this contact is mediated through the BAF complex member DPF2. To functionally validate the role of Sall4 during neural crest specification, the authors utilsie CRISPR-Cas9 to introduce a premature stop codon on one allele of Sall4 to generate iPSCs that are haploinsufficient for Sall4. Due to the reports of Sall4's role in pluripotency, the authors confirm that this model doesn't disrupt pluripotent stem cells and is viable to model the role of Sall4 during neural crest induction. The authors expand this assessment of Sall4 function further during their differentiation model to cranial neural crest cells, assessing Sall4 binding with Cut+Run sequencing, revealing that Sall4 binds to motifs that correspond to key genes in neural crest differentiation. Moreover, reduction in Sall4 expression also reduces the binding of the BAF complex, through Cut and Run for BRG1. Overall, the authors then propose a model by which Sall4 and BRG1 bind to and open enhancer regions in neurectodermal cells that enable complete differentiation to cranial neural crest cells.

      Overall, the data is clear and reproducible and offers a unique insight into the role of chromatin remodellers during cell fate specification.

      However, I have some minor comments.

      1. Using AlphaFold in silico modelling, he authors propose the interaction between the BAF complex with Sall4 is mediated by DPF2, but don't test it. Does a knockout, or knockdown of DPF2 prevent the interaction?
      2. OPTIONAL: Does knockout of DPF2 phenocopy the Sall4 ko? This would be very interesting to include in the manuscript, but it would perhaps be a larger body of work.
      3. Figure 1, the day of IP is not clearly described until later in the test. please outline during in the figure

      3- What is the expression of Sal1 (and other Sall paralogs) during differentiation. The same with the protein levels of Sall4, does this remain at the below 50%, or is this just during pluripotency? 4. The authors hypothesise that Sall4 binds to enhancers- with the criteria for an enhancer being that these peaks > 1KB from the TSS are enhancers. Can this be reinforced by overlaying with other ChIP tracks that would give more confidence in this? There are several datasets from Joanna Wysocka's lab that also utilise this protocol which can give you more evidence to reinforce the claim and provide further detail as to the role of Sall4 5. The authors state that cells fail to become cranial neural crest cells, however they do not propose what the cells do instead. do they become neural? Or they stay at pluriopotent, which is one option given the higher expression of Nanog, OCT4 and OTX2 that are all expressed in pluripotent stem cells. 6. In general, I would like to see the gating strategy and controls for the flow cytometry in a supplemental figure. 7. For supplementary figure 1- please include the gene names in the main image panels rather than just the germ layer.

      Significance

      The strength of this study lies in its well-designed and clearly presented experiments and datasets. In particular, identifying the specific SALL4 isoform that interacts with the BAF complex-and further exploring the implications of this interaction-is a major highlight. The authors also make effective use of in silico modelling with AlphaFold, offering valuable mechanistic insight into how this interaction is mediated.

      The topic should have appeal to researchers in developmental biology and epigenetics. This study represents a significant step forward in validating the interaction between SALL4 and the BAF complex, and it highlights the requirement of SALL4 for BAF-mediated chromatin remodelling during neural crest specification. These findings are likely to be of interest to those studying the gene regulatory mechanisms underlying craniofacial development.

      However, while the authors outline the roles of SALL4 and the BAF complex in chromatin remodeling during neural crest development, the downstream effects on cell fate specification could be more thoroughly examined. Currently, Gene Ontology analysis is the primary method used to interpret these consequences, and additional functional validation would strengthen the conclusions.

      Intended audience: Basic research, epigenetics in pluripotency and neural crest development.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers


      We thank the reviewers for their constructive feedback, which has greatly improved the clarity and rigor of our manuscript. We have carefully addressed each comment below, indicating changes made to the text, figures, or supplementary material where appropriate. References to line numbers correspond to the revised version of the manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      * In this paper, the authors focus on the role of Reticulon-1C in concert with Spastin in response to axonal injury. In data mining, they find axonal mRNAs encoding for ER-associated proteins including Rtn-1. They establish a knockdown targeting both Rtn-1 isoforms Rtn-1A and Rtn-1C. They observe decreased beta-3-Tubulin levels in the soma while axonal protein levels are unchanged. In microfluidic devices, they characterise the effect of a compartment-specific Rtn-1 KD on axonal outgrowth in the axonal compartment. The authors quantify axonal outgrowth, seeing increased outgrowth in an axonal compartment-specific Rtn-1 KD, while the effect seems to be reversed when applying the KD construct in the somatic compartment. When focussing on the axonal growth cone, they find the Rtn-1 KD shows differences in several morphological features of the growth cone. They find an increase in Tubulin levels in an axonal compartment-specific, but a decrease in a somatic compartment-specific Rtn-1 KD. Colocalisation of Rtn-1C and Spastin is shown to be monolaterally increased following axotomy. Combining axotomy with the Rtn-1 KD shows increases in dynamic microtubule growth rates and track lengths. In another model system, neuron balls, they show Rtn1-C, but not Rtn1-A to be present in the axon. In a puro-PLA assay they also show it can be synthesised in the axonal compartment. To investigate the mechanism enabling the cooperation between Spastin and Rtn-1C, they move to a cell line model in which they see a correlating distribution between Spastin and Rtn-1C but not Rtn-1A. Finally, they use in silico modelling to speculate on binding between Spastin domains and Rtn-1 isoforms.*

      Major comment:

      The rationale behind the work is convincing, however some interpretations are presented as more robust than some data allow. Most notably, while the interaction between Rtn-1 and Spastin has been shown prior to this study, it is only presented here through in silico analysis. In figure 5, an increase in the growth rate of dynamic microtubules is observed in either a Rtn-1C KD or by using a Spastin-inhibitor. Due to a described increase in colocalisation between Rtn-1C and Spastin (5A), the increase in growth rate is displayed as caused by Rtn-1 promoting Spastin's severing ability. This result might however be correlative. Further in the injured samples, Spastin-levels seemingly increase (in the representative images) and it is thus not surprising that the level of Rtn-1C colocalising with Spastin increases as well. This might not be indicative of a cooperation and further experimental evidence are required.

      R: We thank the reviewer for this thoughtful comment. We agree that our interpretation should be more cautious, and we have revised the Title, Results and Discussion sections accordingly. In particular:

      1. Following yours and other reviewer comments, we have analyzed a new set of experiments regarding the STED images of non-injured and injured axons. To eliminate the risk of artifactual descriptions, we have avoided deconvolution and worked directly with raw STED images (Figure 5A). Under these conditions, the distribution of Spastin and its intensity in distal axons are not modified by injury, nor those of Rtn-1C and Spastin (Supplementary figure 4). We emphasize in the revised text that the in silico modeling we present is supportive, but not definitive, of a direct interaction. To address this concern, we clarify that our study builds on prior evidence of biochemical interaction between Rtn-1C and Spastin (Mannan et al., 2006), and that our own data demonstrate: i) compatible subcellular distribution in axons by super-resolution (STED microscopy, Figure 5A);ii) a potential functional interplay in axons (rescue of β3-tubulin levels by Spastin inhibition, Figure 5B), and iii) isoform-specific co-distribution with Spastin in heterologous cells that is associated with changes on microtubule integrity (see improved Figure 7). Together, these results go beyond correlative localization, but we acknowledge that they do not directly demonstrate a molecular complex in axons. Thus, we now indicate that "Although we did not directly test their molecular association, these results are consistent with Rtn-1C and Spastin sharing a similar subcellular localization, potentially enabling their functional interaction in distal axons" (lines 285-287)

      We would like to clarify a possible misunderstanding: in our experiments, the increase in microtubule growth rate was observed after axonal Rtn-1 KD. Spastazoline (SPTZ) only prevented the reduction in β3-tubulin levels induced by Rtn-1 KD, while leaving the KD-driven increase in growth rate and track length unaffected (Figures 5B-E). Thus, our interpretation is that axonal Rtn-1 KD correlates with increased Spastin function. (lines 307-309)


      Other comments:

      • Generally, graphs would benefit from individual values plotted as well as the summary. Font sizes and types (but rarely) are sometimes inconsistent. Proteins should be consistently written (capitalised or not).

      __R: __ We agree with the reviewer and thank for taking the time for noticing these inconsistencies as it significantly affects the quality of the work. We have improved several figures and added graphs plotting individual values (Figures: 2 C, 2E; 4 (A-E); 5E; 6D). We have reviewed the Font size and types more carefully and capitalized the proteins accordingly.

      • *Table 1 and figure 1 present data collected from a vast amount of resources. It should be highlighted that datasets from which data was obtained includes many different models, different DIVs and neuronal cell types. Figure 1B may benefit from a different colour scheme. "Ex-vivo" should be "Ex vivo". For "ER mRNAs are a relevant category" it is not described what "relevant" would mean in this context. The title might remove this small part or describe it in the text. It should be described how it is decided that mRNAs are "common". *

      • *

      __R: __We have now highlighted in the result section the diverse origins of the analyzed samples; We removed the indicated part from the text and explained that common mRNAs were chosen based on the Benjamini-Hochberg (Ben) analysis. (Page 33, lines 1299-1304).

      * - Figure 2: add description to y-axis to describe what fold change is displayed, applies to multiple figures. Will improve readability of the figures. In 2C, the ROI showing neuronal somata should be increased to show part of the axon and not cut off the soma.*

      • *

      __R: __We thank the reviewer for taking the time to highlight this. We have included this modification in figure 2 and throughout the article. We have also enlarged the indicated ROIs in figure 2C as requested. (Page 34)

      • *Figure 3: Three out of four axonal compartments seem to be comprised of dying or damaged axons. Especially the axonal KD scrambled image. It should be ensured that neuronal cultures are healthy. *

      • *

      __R: __We completely agree with the reviewer that the selected images were not describing the general good health of axons which has been accredited by the lack of fragmentation and functional responsiveness shown in (Figure 4 and 5 B, C, E). Thus, we have now replaced the previous axonal fields by more representative ones (Figure 3). (page 36)

      • *

      Typo in "intersections". The schematic of 3B is a great addition to explain the graphs above. Perhaps it could be a bit refined as it is currently hard to see whether this is a neuron or a growth cone without context. Maybe show where the axon connects to the depicted growth cones and change the third icon which looks like it was crossed out. Small formatting issues: remove additional space bar before "Figure 3." And add after "Bar"

      __R: __Many thanks for these great suggestions. We have now improved the figures as suggested and changed the indicated formatting issues. (page 36)

      - Figure 4: If not misunderstanding what is depicted, in 4A and B, different lookup tables are used to depict the same signal. Only one of each images is necessary. Do the axons have more tiny branches in the Rtn-1 KD condition in 4A? Unclear why Rtn-1 levels are increased in the Rtn-1 KD (4C), please clarify.

      • *

      __R: __We thank the reviewer for these observations. The reviewer is correct that different lookup tables were initially applied to the same image. Our intention was to highlight the fine distribution of axonal Rtn-1, but since this aspect is already clearly shown in previous figures, we now retain only a single lookup table. The appearance of tiny branches in the Rtn-1 KD condition represents an isolated observation and does not reflect a consistent or robust phenotype associated with Rtn-1 KD.

      As the reviewer points out, the increase of Rtn-1 in the cell bodies of injured neurons following axonal KD was initially surprising to us. However, this was a consistent phenomenon, as shown in the improved Figure 4. Of note, previous studies have reported that total Rtn-1C (but not Rtn-1A) levels increase in response to injury in cortical neurons(Fan et al., 2018). In our case, we interpret this as a compensatory somatic response triggered by the local reduction of Rtn-1 in injured axons. This interpretation is also consistent with the apparent lack of effect of siRNA on distal axonal Rtn-1 levels when applied locally after injury (while somatic application of the same siRNA does reduce axonal Rtn-1). Thus, after 24 hours of KD, the somatic upregulation of Rtn-1 may partially compensate for its expected local synthesis decrease. We have clarified this assumption in the revised text. (lines 247-251)

      - Figure 5: It may be easier to understand what "axotomy" samples are if just referred to as "injured" as later in the same figure. The procedure could also very briefly be explained in the results. 5C should depict AUC in µm2 not µm. 5D Spastin is barely visible, brightness and contrast should be adjusted to enhance visibility.

      • *

      __R: __We thank the reviewer for these helpful suggestions and have implemented the requested changes in Figure 5. Specifically:

      We now consistently refer to "axotomy" samples as "injured" throughout the figure and article. In addition, a brief explanation of the axotomy procedure has been added before Figure 2 and before figure 5, also the description has been clarified in Materials and methods. (lines 191-192) and (lines 289-290) and (lines 779-787)

      To improve the reproducibility of our outgrowth measurements, we revised this analysis approach. Based on previous work from a co-autor (McCurdy et al., 2019), instead of reporting the "relative number of intersections," we now present the total counts obtained from Sholl analysis of binarized axons (see Methods). To this end, we took advantage of the NeuroAnatomy plugin of FIJI, which more precisely tracks axon trajectories and makes the measurement more independent of axon width. Also, this new approach avoids the conflict we had with what we considered the "first line" after the groove ends, which was a bit of arbitrary. Accordingly, the correct term is now "summation of intersections (sum.)" at different distance bins, as reflected in Figure 5D. (page 40)

      For the former Figure 5D (now Figure 5B), we have improved the acquisition of representative images and applied a different set of lookup tables to enhance visibility. (page 40)

      - Figure 6: It should be made clear why it is necessary to switch to another model system just for 6A, please indicate this in the text. PCR bands seem very pixelated, check the quality. It is unclear why soma genes/proteins were only tested with either PCR or WB others with both. Rtn-1C and Rtn1-A should be presented in the same order in the PCR and WB panel. Correct "Rtn1-1A" typo. In 6D, 1.5 dots per soma seems like a low number. When normalized to the area the soma vs the axon occupies, the compartmentalization does not work? Maybe it makes sense to refine analysis or apply puromycin in the somatic compartment and analyze the axonal compartment as comparison?

      __R: __Many thanks for these observations. We have now included the following clarification in the text: "We sought to characterize the isoform expression of Rtn-1 mRNA and protein in both axons and cell bodies. Because microfluidic chambers yield only limited cellular material, we adopted an alternative culture approach using 'neuronballs.' This method enables the segregation of an axon-enriched fraction by mechanically separating axons from somato-dendritic structures" (lines 375-376).

      The resolution of PCR bands has been improved in the revised figure. Note that because the amount of cellular material is relatively scarce, we did not obtain too strong bands.

      The difference in the genes/proteins used for characterizing RNA and protein samples reflects our intention to treat both approaches as complementary. The PCR markers were primarily included to confirm sample purity, which also applies to the WB samples since they derive from the same preparation. In both assays, we used MAP2 as a dendritic marker to demonstrate axonal purity. While we acknowledge that the same genes could have been tested by both methods, we believe the results as presented adequately demonstrate the effective isolation of axons.

      We have switched the order of Rtn-1C/1A for consistency across PCR and WB panels and corrected the indicated typo in Figure 6A.

      We agree with the reviewer that an average of 1.5 puncta per soma initially appeared low. We have identified at least three reasons for this:

      First, the signal derives from only a 15-minute puromycin pulse, which is a very short labeling window. Second, our puro-PLA assay is particularly stringent, as ligation relies directly on puromycin- and Rtn-1C-labeled primary antibodies, without the additional spacing normally introduced by secondary antibodies. In standard PLA, the critical distance for amplification is ~30-40 nm, whereas in our assay this distance is even more restrictive. Third, in our initial analysis we applied an overly cautious threshold to define "true" amplification. We have now refined this threshold using a baseline defined by the absence of puromycin stimulation. With this improved criterion, we now quantify an average of ~5 puncta per soma and ~10 puncta per 1000 µm² of axonal area (Figure 6D and Supplementary Figure 3D). Assuming a neuronal soma diameter of 15 µm (area ≈ 176.71 µm²), this yields ~0.028 puncta per µm² in soma. In comparison, axons display ~0.01 puncta per µm², approximately one-third of the soma value, which is compatible with the idea thar cell bodies dominate neuronal protein synthesis.

      Following the reviewer's valuable suggestion, we performed additional quantifications in which puromycin was applied exclusively to the somatic compartment. Under these conditions, we still observed amplification in axons (~5 puncta per 1000 µm²), although this value was significantly lower than when puromycin was applied directly to axons. This analysis provided a novel appreciation of the puro-PLA technique in neurons: at least half of the signal originates in the axonal compartment, while a portion may reflect proteins synthesized in soma and transported anterogradely to the axon through yet-unknown mechanisms (potentially involving rapid anterograde transport) (Figure 6D). (page 42)

      • Figure 7: 7A shows two images depicting the same information that may not be needed. Can probably be removed. In 7B there is no negative (or any) correlation between Spastin levels and Tubulin, however later it is mentioned that Rtn-1C transports Spastin thus causing a decrease in Tubulin at certain locations? It is nclear if Spastin levels vary intensely between different samples. Mean intensity of the somatic area may be beneficial to rule this out. 7B Tubulin on the right top panel seems to have a decrease in Tubulin levels which is not visible due to the Y axis of Tubulin being set to a different range than the middle and lower panel. The average of line scans from multiple cells may be helpful to determine whether there is indeed no colocalization between Rtn-1A and Spastin. The provided representative images seem to show similar degrees of colocalization between Spastin and Rtn-1A/C.

      • *

      __R: __We thank the reviewer for these valuable observations and acknowledge that Figure 7 may have caused confusion. We have eliminated the fluorescence line-scan traces, as they can be biased depending on the region of the cell analyzed. Although this may not have been sufficiently emphasized in the text, we had already performed a quantitative colocalization analysis across multiple cells and independent experiments, using Mander's coefficients (Figure 7B). These analyses showed higher colocalization between Rtn-1C and Spastin compared to Rtn-1A. Regarding the concerns about variability in Spastin levels or possible bias from Y-axis scaling, we have eliminated those traces by the risk of bias. Also, we had already quantified the total tubulin fluorescence intensity across all the z-stacks and from multiple cells from independent experiments as shown in Figure 7C. To further rule out artifacts caused by variable transfection efficiency, we quantified total fluorescence intensity in both RFP and GFP channels across conditions. As shown in Supplementary Figure 6, no significant differences were observed, suggesting that the changes in tubulin reflect specific effects of Spastin/Rtn-1C co-expression rather than variability in expression levels.

      Results: - It would be helpful to reiterate the hypothesis at the start to ease the reading flow.

      __ R: __Many thanks, we have introduced a line reiterating the hypothesis as suggested (lines 117-118)

      - There seems to be minor redundancy in lines 132-138.

      • *

      __R: __Indeed, we have now removed the indicated phrase.

      • There are several spellings, proof-reading is recommended. For example, in line 136 should be "promotes". 160 "localla", 192 should be "the actin cytoskeleton".,194 should be "we first examined", 195 should be "Different", 223 "using", 259 "axons".

      __R: __We apologize for the spellings; we have now performed a careful proof-reading and introduced these corrections.

      - 154-155: Unclear, why the lower MW Rtn-1C was seen as more important.

      __R: __We apologize for not being clear enough. It is not necessarily more important, but we just took the Rtn-1C molecular weight as reference for the analysis considering that this isoform is the predominant in axons. In any case we have found a significant effect for both isoforms at least on siRNA 2 (data not shown), which is now expressed in the text (line 165-169) : "We also examined the 180 kDa band and found that siRNA 1 reduced expression to a mean of 0.41 relative to Scr, showing a strong trend that did not reach statistical significance (p = 0.05; N = 3; Wilcoxon test compared to 1, data not shown). In contrast, siRNA 2 further reduced expression to a mean of 0.29, which was statistically significant (p = 0.04; N = 3; Wilcoxon test compared to 1, data not shown)."

      - 167 results of 2E not stated before interpreting them.

      • *

      __R: __We have corrected this mistake.

      - 181 would suggest "outline" instead of "perimeter".

      • *

      __R: __We have considered this suggestion and included "outline", nevertheless the morphometric parameter is defined as perimeter, so we retained the term, but with the suggested clarification.

      • *

      - 183-184 "longest shortest path" is a confusing term.

      __R: __We agree that it is a confusing term, thus have now introduced multiple clarifications for the term in the leyend of figure 3 (page 36), and with more detail in a new section of Materials and methods (lines 697-699).

      • figure 4B should be referenced earlier in the sentence.

      __R: __We have corrected the sentence in the text.

      - 243-244 may be correlation. Rtn-1 and Spastin do not necessarily interact so that this result is achieved.

      • *

      __R: __Thanks for the clarification, we are aware that so far in the manuscript the conclusion is not correct, thus now we have stated at the end of the paragraph: "Together, these observations suggest that axonal Rtn-1 KD correlates with higher Spastin microtubule severing" (lines 307-309)

      - 246: In figure 1 the KD seemed to influence both Rtn-1 isoforms, why not here anymore? 259 "axons". 284 "counteract" instead of "suppress"?

      • *

      __R: __We acknowledge the confusion at this point of the article because of measuring a specific isoform. We now indicate that we will focus on Rtn-1C because of previous evidence of the literature pointing to an interaction of Rtn-1C with Spastin (line 264-267). Later we show that Rtn-1C is the predominant isoform in axons (Figure 6). We have corrected all the suggestions in the manuscripts.

      - 485: rephrase as the interaction between Rtn-1C with Spastin has not been shown directly in these experiments.

      __R: __Many thanks for the relevant clarification. Now, we have corrected:" Here, we have described an emerging mechanism relating Rtn-1C with the activity of Spastin, which is the most frequently mutated isoform in HSP (Hazan et al., 1999; Mannan et al., 2006)." (line 632-634). * Methods: 535 "in PBS". 543 citation error. 689-699 is it necessary to add a gaussian blur?*

      • *

      __R: __We have corrected the words and removed the wrong reference. Regarding the use of Gaussian blur, it is a very important point. We used this approach because, in our experimental conditions, it was critical to highlight moving particles that otherwise would go unnoticed by the noise. This was particularly manifest for the seemingly more "unorganized" movements of axonal microtubules after injury.

      References: Mannan, A U et al. appears twice in the citation list (36 and 44).

      * *R: Many thanks for the observation. Now we have corrected it.

      Reviewer #1 (Significance (Required)):

      Overall, this manuscript describes novel fundings which will be interesting to the neuronal cell biology community and scientists working on the field of neuronal injury and regeneration. It is well structured, and the data are mostly well presented but sometimes conclusions are over-interpreted. However, several points need to be addressed in a more convincing way.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Axonal mRNA localization and localized translation support many neuronal functions and is an important determinant of the regenerative potential of axons after injury. How this works mechanistically remains unclear. The authors present a well performed and technically challenging study in which they identify RTN-1 as a regulator of axonal outgrowth after injury. They provide evidence using experiments in microfluidic chambers that RTN1 is locally synthesized in axons. Interestingly, they identify a (local) interplay between RTN1 and Spastin which affects microtubules and thereby regulates the outgrowth of cortical axons after injury. This study provides an interesting new link between a locally synthesized protein (RTN1) and a microtubule-regulating protein Spastin that is changed upon axon injury. This provides an advance in our understanding in axon regeneration after injury and provides the basis for new studies that can further investigate this interplay. Although interesting, I have several concerns that should be clarified and are needed to substantiate the findings and model presented in this study.

      Major concerns:

      1. In figure 1, the authors provide an analysis of overlapping axonal mRNAs. There are more axonal transcriptome studies and a recent study by von Kugelgen and Chekulaeva (2020; doi: 10.1002/wrna.1590) already performed such an analysis, which included more studies. It would be good to mention this. It can be perceived that studies were now chosen to get the outcome that Rtn-1 is present in all studies. For example, von Kugelgen finds mRNA coding for RTN3, another ER structural protein, as present in 16 out of 20 studies analyzed. That said, the authors present more reasons to look at Rtn-1, so the selection to continue with this protein remains valid but can be written up differently so not to present it as the 'sole' ER-shaping protein consistently present in axonal transcriptomes. __R: __We appreciate this important observation to enrich the article; we are aware that the transcriptome data can be even further expanded to more recent studies. Thus, we have now included this reference in the main text and highlighted the relevant finding of RTN3. However, Kugelgen and Chekulaeva used data from dendrites/axons (neurites). Thus, we indicate that "...On a similar approach, but combining data from dendrites and axons, it was found that Reticulon-3 *mRNA is present in 16 out of 20 studies, further suggesting a wider presence of other mRNAs coding for ER structural proteins in axons " (line 128-131)

      2. The description of methods is currently insufficient and incomplete and does not allow for reproducibility of this study. For example, different Rtn-1 antibodies seem to be used in this study. Is the same antibody used for staining and WB? There is no listing of any of the antibodies used in the study and which one is used for which technique/experiment. This should be clarified and should be easy to do so in the methods section (antibody name, origin/company, dilution used) to enhance reproducibility of this study. This is not limited to primary antibodies and any information on secondary antibodies, including what was used for STED is completely missing.*

      3. *

      __R: __Thanks for these critical comments. First, we apologize for the former method version which was mistakenly not as accurate as it should. We have now revisited it and improved several points throughout this section. Regarding the use of primary and secondary antibodies, plasmids, siRNAs, and general reagents, they are all indicated in the Supplementary material, including company and dilution ("Reagent tables").

      • The timeline of KD experiments in Figures 2 and 3 are unclear. For the Western blot KD is performed at DIV7 and collected 48 hours later. However, this is not specified for the stainings done in Figure 2C-E. Is this also at DIV7 and then for 48 hours? In figure 3 the siRNA is added at DIV8 (together with axotomy) and outgrowth is measured 24 hours later. Is 24 hours sufficient to achieve knockdown? Is this also what was done for stainings? Later on in Figure 5B, 48 hours of KD is again used. It is unclear what the rationale of these differing timepoints is. Why was this chosen? Is the timeline also the reason for the difference in segment lengths chosen? In Figure 3, there is a significant effect on outgrowth in the KD in the 'mid-range' which is not present in Figure 5.*

      __R: __We regret the confusion, now all this information is explicitly clarified in the main text (lines 297-299) and the corresponding figure legends. We have strong reasons to have used these different time points. Figure 2 A-B is aimed at validating the siRNA against Rtn-1 thus we treated 7 DIV cultures for 48 hours to be sure of revealing a global effect by WB. In figure 2 C-D, we used the same 7 DIV cultures, but only for 24 hours. The reason for this is that, once the RNAi was validated, we explored its control on local synthesis in a shorter period based in previous literature supporting that axonal KD for 24 hours is sufficient for regulating axonal transcripts (Batista et al., 2017; Gracias et al., 2014; Lucci et al., 2020). We are also confident of using this time point based in the new supplementary figure 3D that shows a significant decrease on puro-PLA signal (indicative of Rtn-1C synthesis) 24 hours after axonal KD.

      In figure 3, we performed axotomy thus we had to wait a longer period for axons to grow (8 DIV) before fully cut them out, in this case we performed axonal KD from 8 to 9 DIVs. This is the same period used for the staining and quantifications shown in figure 4. All this is properly clarified in the main text and figures.

      In Figure 5 we performed a more challenging experiment that required to transfect cells with an EB3-GFP plasmid, then perform axotomy along with axonal KD as well as pharmacological treatment selectively in axonal compartment. First, we tried to measure microtubule dynamics under the same temporal frame of figure 3. Nevertheless, expression levels of EB3-GFP were not adequate for axonal measurements by live-cell imaging. Therefore, compared to figure 3, we increased the time frame after axotomy 24 hours (from 9 to 10 DIV) by this technical reason, but also to explore whether the changes on tubulin intensity might be revealed more clearly (which was the case, figure 5B). These considerations are now included in the main text

      Regarding the significant effect on outgrowth in the KD in the 'mid-range' which is not present in Figure 5. Given that in figure 5D axons are left growing for two days instead of one, the number of intersections and the differences between conditions is modified compared to figure 3, while retaining the overall trends. Note that to improve the reproducibility of our outgrowth measurements, we revised this analysis approach. Based on previous work of a co-autor (McCurdy et al., 2019), instead of reporting the "relative number of intersections," we now present the total counts obtained from the Sholl analysis of binarized axons (see Materials and methods). To this end, we took advantage of the NeuroAnatomy plugin of FIJI, which precisely tracks axon trajectories and makes the measurements more independent of axon width segmentation. Also, this new approach avoids the conflict we had with what we considered the "first line" after the groove ends, which was a bit of arbitrary. Accordingly, the correct term is now "summation of intersections (sum.)" at different distance bins, as reflected in the new Figure 5D.

      Could the authors provide a rescue condition for their siRNA (using a siRNA-resistant construct) to show that their siRNA is specific for RTN1. They nicely show the efficiency of the siRNA but not its specificity. This is crucial because if not specific, this will affect a large part of their study. They already have RTN1A and RTN1C constructs available. Such a rescue experiment should ideally also be performed for one or more of their phenotypic experiments, such as the one presented in Figure 3A or 5 to show that the phenotype is really RTN1 dependent. If done by re-expressing either RTN1A or RTN1C, this could provide insightful information on the relevant isoforms.

      __R: __We agree with the reviewer that this is a critical point. A major challenge in demonstrating the functional role of axonally synthesized proteins using a KD approach is that the rescue may also need to occur locally. Since axonal Rtn-1 appears to play a distinct role compared to its somato-dendritic counterpart (Figure 3), a siRNA-resistant construct would ideally require an axon-targeting sequence to restore local synthesis. As this is technically demanding, we have not yet been able to perform such an experiment, but we are actively working on identifying the optimal sequence to direct Rtn-1C to axons. Importantly, studies performing axonal KD typically rely on at least two independent siRNA sequences, thereby minimizing the likelihood that a phenotype arises from off-target effects. Thus, we have now validated a third siRNA (siRNA 3), which selectively downregulates Rtn-1C. Then, following the same experimental frame of figure 3, we performed axonal Rtn-1 KD after injury and observed that siRNA 3 also significantly increases the outgrowth of injured axons (Supplementary figure 2). This suggests that, at least this phenotype, is not product of an off-target effect. Complementarily, pharmacological rescue with the Spastin inhibitor SPTZ mitigated both the reduction in distal axonal β3-tubulin and the increase on axon outgrowth, supporting that the observed phenotypes are unlikely to arise from off-target effects. If these effects were due to random interference with unrelated mRNA targets, inhibition of an ostensibly independent target such as Spastin would not be expected to yield such a consistent rescue. Accordingly, SPTZ treatment alone did not increase β3-tubulin, indicating that its action is specifically contingent upon Rtn-1 KD. Taken together, the pharmacological rescue in axons (Figure 5B) and the Rtn-1C/Spastin co-distribution in heterologous cells, which correlates with preserved microtubules (improved Figure 7), provide converging evidence to suggest that Rtn-1C-Spastin interplay may underly the observed phenotypes in axons.

      • I find the data presented in Figure 4A/B confusing. Axonal RTN-1 KD does not reduce axonal RTN1 levels, but somatic KD does. I understand that this implies most protein comes from the soma, and the authors indeed present an explanation that increased somatic RTN1 occurs after axonal KD as a compensation mechanism. However, this can also be interpreted that there is no axonal synthesis of RTN1 after injury and axonal KD has indirect or even aspecific effects. Their model depends on this difference. Their data in Figure 6 could provide supporting evidence if it shows RTN1 puro-PLA after injury. Along these same lines, in Figure 6, they nicely include a compartment control for puro-PLA. It therefore seems doable to include a somatic puromycin control for their axonal puro-PLA, to exclude and diffusion/transport of the newly synthesized peptides. This is especially considering two recent papers reporting on this possible phenomenon, although these studies were not performed in neurons.*

      __R: __We consider the possibility that after injury there is no axonal Rtn-1 synthesis as a plausible and relevant appreciation. Unfortunately, we could not perform a puro-PLA experiment after injury, which would have provided a more definite answer. However, now we are more confident of regulating Rtn-1 synthesis before injury as supported by a new supplementary figure 3D that shows a significant decrease on puro-PLA signal (indicative of Rtn-1C synthesis) 24 hours after axonal KD. Thus, based on the similar phenotypes observed before and after injury, we consider our results are still compatible with Rtn-1 axonal synthesis being downregulated, but not absent after injury. First, axonal Rtn-1 KD decreased β3-tubulin levels before and after injury according to figure 5B and the improved statistical analysis performed on figure 2E. Similarly, axonal Rtn-1KD significantly increases microtubule growth rate before and after injury according to the current statistical comparisons (Figure 5E). Second, if β3-tubulin decrease was a merely unspecific siRNA targeting, it is unlikely that SPTZ treatment should increase and restore β3-tubulin levels only in the context of axonal Rtn-1 KD (Figure 5B). We have now included these considerations in the discussion (lines 537-543). Although on a different track, the mechanistic relationship between Rtn-1C and Spastin suggested in Figure 7 could make more plausible that a similar phenomenon regarding the control of tubulin levels may occur locally in axons.

      Following the reviewer's valuable suggestion, we performed additional quantifications in which puromycin was applied exclusively to the somatic compartment. Under these conditions, we still observed amplification in axons (~4 puncta per 1000 µm²), although this value was significantly lower than when puromycin was applied directly to axons (~10 puncta per 1000 µm²). This analysis provided a novel appreciation of the puro-PLA technique in neurons: at least half of the signal originates in the axonal compartment, while a portion may reflect proteins synthesized in soma and transported anterogradely to the axon through yet-unknown mechanisms (potentially involving rapid anterograde transport). Note that we revised the criteria for detecting true amplification spots based in staining without puromycin, which increased true amplification numbers. Still, these seemingly low values are compatible with reflecting a limited amount of time (only 15´ of puromycin pulse) and the stringent conditions of this experiment in which secondary antibodies were avoided by directly labeling primary ones. This approach makes the classical 30-40nm distance for PLA even narrower, thus reducing signal. In any case, assuming a neuronal soma diameter of 15 µm (area ≈ 176.71 µm²), this yields ~0.028 puncta per µm² in somata. In comparison, axons display ~0.01 puncta per µm², approximately one-third of the soma value, which makes sense for the expected difference in ribosome density.

      • In Figure 5A the authors find an increased co-localization (RTN1/Spastin) after axotomy. From their images, it seems that the amount of Spastin is hugely increased, which would by default increase the chance of (random) colocalization of RTN1 on Spastin. Could the authors comment on this?*

      __R: __Thanks for this relevant and constructive critique. We formerly based our colocalization analysis on deconvolved images. However, after performing several quantifications through different deconvolution parameters, we were not convinced about the robustness of this finding and the performed staining. Thus, we performed a new set of experiments and found that non-deconvolved images from the STED microscope were more informative about the expected tubular morphology of the axonal ER. Thus, we improved figure 5A, and now the main conclusion is just that both proteins are closely distributed in distal axons before and after injury.

      • In figure 5E and 5F, the condition of scr + SPTZ is omitted. What is the reason for this? The explanation of results in these figures is confusing. The authors report a 'clear trend' in increase in comet track length and lifetime upon addition of SPTZ to axonal RTN-1 KD. This is however not significant. The comparisons that are made afterwards are confusing (e.g. increase in comet lifetime of SPTZ in non-injured axons with RTN1 KD compared to Scr+DMSO and KD + DMSO in injured axons). Their conclusion is axonal RTN-1 synthesis in injured axons (see my concern in the points above on this) governs microtubules growth rate beyond Spastin activity yet blocking Spastin activity still completely blocks the effect of KD on outgrowth.*

      * *__R: __We thank this observation and fully agree that the general description provided in figure 5 E wasn't satisfactory. We have re-organized the descriptions of these results and performed more relevant statistical comparisons (lines 338-359). Based on the reviewer observation, we now conclude: "Together, these results suggest that axonal Rtn-1 synthesis controls microtubule dynamics in both non-injured and injured axons, mostly independently of Spastin-mediated microtubule severing." (lines 357-359).

      Other/minor concerns:

      - The gene ontology analysis in Figure 1A contains the category 'Endoplasmic reticulum'. In this category are mainly ribosomal proteins. Although in a gene ontology analysis these proteins will be included in this category, it is misleading in this respect since they are just as likely to be coming from cytoplasmic ribosomes. Although it cannot be excluded that these are ER-bound ribosomes, not in the last place because a recent study (Koppers et al., 2024, doi: 10.1016/j.devcel.2024.05.005) found ribosomes attached to the ER in axons, I believe the category should be adapted or at the least clarified in the text.

      • *

      __R: __Many thanks for the suggestion, which is now included in the text. "Note that several of the identified transcripts in the category 'endoplasmic reticulum' code for cytoplasmic ribosomal components, which indeed can be attached to the axonal ER (Koppers et al., 2024) and be locally synthesized in axons (Shigeoka et al., 2019)." (lines 125-128)

      - Is RTN-1C isoform still an ER-shaping protein or rather an ER protein with alternative functions? The final sentence in the abstract makes a statement that a locally synthesized ER-shaping protein lessens microtubule dynamics. Could the authors provide a clearer description and discussion of the evidence in literature for this? RTN1C has been suggested to perform alternative functions in which case the statement that the local synthesis of an ER-shaping protein is important for axonal outgrowth should be adapted.

      R: We agree with the reviewer and are aware that some non-canonical roles of Rtn-1C may partially explain the observed phenotypes. Thus, we have rephrased the last statement of the abstract: "These findings uncover a mechanism by which axonal protein synthesis provides fine control over the microtubule cytoskeleton in response to injury.". Also, we have modified the discussion section introducing new references accordingly..." Some studies have pointed to a non-canonical role for Rtn-1C in the nucleus, including DNA binding and histone deacetylase inhibition (Nepravishta et al., 2010, 2012). It is tempting to speculate that these still emerging roles may also contribute to the observed phenotypes. Of note, different axonally synthesized proteins exert transcriptional control in response to injury or local cues (Twiss et al., 2016)." (lines 576-580).

      • Is there a difference in RTN1 distribution or levels pre- and post-axotomy?

      R: Thanks for the suggestion, with the new analysis we have only found slight reorganization of Rtn-1C and Spastin in distal axons (Figure 5A). We have also included now quantification of their levels and found no significant differences for both proteins (Supplementary figure 4)

      - Line 100/101 states 'the interactome of the axonal ER provides...'. To my knowledge there has been no study looking at the interactome of the axonal ER specifically. Surely axonal ER proteins are known but there is a difference.

      • *

      __R: __We agree with the reviewer that the phrase was misleading, so we rephrased it in the introduction "...Different lines of evidence support that the protein components of the axonal ER may interact with proteins that regulate microtubule dynamics"

      * - Typo line 160 'localla'*

      • *

      __R: __Thanks for taking the time, we have now corrected it.

      - In Figure S1 B, please add the DIVs to make it clearer what each graph corresponds to. The legend of S1B states different distances from the cell body but the graph shows distances from the tip.

      • *

      __R: __We have now corrected the legend accordingly.

      - Figure 2C, why does B3 tubulin decrease in soma, aspecific effect of siRNA?

      • *

      __R: __This was indeed an unexpected finding. However, we do not observe unspecific or global changes in β3-tubulin levels (see Figure 2A and Supplementary Figure 2). Considering our other results linking Rtn-1 to the regulation of the microtubule cytoskeleton, we interpret this decrease as an indirect effect of Rtn-1 depletion rather than an off-target action of the siRNA. Moreover, if the effect were unspecific, both proteins would likely be reduced in the cell body, given that the siRNA was specifically designed to target Rtn-1 as its primary sequence-specific target.

      - What is the rationale on the opposite effect found in outgrowth in Figure 3?

      • *

      __R: __The apparent opposite outcomes observed in Figure 3 - where axonal versus somatic Rtn-1 knockdown leads to divergent effects on axonal outgrowth - can be explained by compartment-specific environments and isoform distribution. The siRNA targets the conserved RHD region, reducing both Rtn-1A and Rtn-1C. Axons are enriched in Rtn-1C. Thus, axonal KD preferentially reduces Rtn-1C. In contrast, somatic KD reduces both isoforms. Rtn-1A, predominant in cell bodies, may probably engage other signaling pathways (Kaya et al., 2013). Interestingly, it was reported by Nozumi et al. (2009b) that global Rtn-1 depletion reduces axonal outgrowth in developing cortical neurons. This aligns with the notion that somatic KD mimics a more global loss of function, whereas axonal KD reveals a compartmentalized, pro-regenerative effect due to local Rtn-1C regulation. (All the references indicated here are in the main manuscript). These considerations are now included in the discussion ( lines 581-593).

      * - Missing word 'we' on line 194*

      • *

      __R: __ We have corrected it.

      - Typo line 629 'witmn h', please proofread the entire manuscript carefully.

      • *

      __R: __ We apologize for the spellings, now we have carefully revised the manuscript.

      - Could the authors comment on why, in Figure 7B/C, GFP only is colocalizing with Spastin-RFP? In general, GFP should be diffusive and not display punctate colocalization with Spastin.

      • *

      We appreciate the reviewer's comment. Under normal conditions, GFP displays a diffuse cytoplasmic distribution. However, in our experimental setup, we observed punctate GFP signals only in the context of co-expression with Spastin-RFP. This is consistent with prior reports showing that soluble GFP can occasionally be sequestered into late endosomal structures (Sahu et al., 2011), which are also known to harbor the M87 Spastin isoform (Allison et al., 2013; Allison et al., 2019). To rigorously exclude the possibility of unspecific fluorescence crosstalk, we independently acquired each fluorophore channel and confirmed that GFP puncta were genuine and not due to bleed-through (Supplementary Figure 5). Further, cells expressing only GFP or only Spastin-RFP did not show overlapping puncta, and co-expression of GFP with Rtn-1A-RFP did not produce any apparent overlap, indicating that the punctate GFP pattern is specifically associated with Spastin co-expression. Thus, the observed GFP colocalization with Spastin reflects a biological phenomenon potentially linked to the endosomal localization of M87 Spastin, and not an artifact of imaging or fluorophore bleed-through.

      Reviewer #2 (Significance (Required)):

      * Axonal mRNA localization and localized translation support many neuronal functions and is an important determinant of the regenerative potential of axons after injury. How this works mechanistically remains unclear. The authors present a well performed and technically challenging study in which they identify RTN-1 as a regulator of axonal outgrowth after injury. They provide evidence using experiments in microfluidic chambers that RTN1 is locally synthesized in axons. Interestingly, they identify a (local) interplay between RTN1 and Spastin which affects microtubules and thereby regulates the outgrowth of cortical axons after injury. This study provides an interesting new link between a locally synthesized protein (RTN1) and a microtubule-regulating protein Spastin that is changed upon axon injury. This provides an advance in our understanding in axon regeneration after injury and provides the basis for new studies that can further investigate this interplay. Although interesting, I have several concerns that should be clarified and are needed to substantiate the findings and model presented in this study.*

      *

      The audience for this study will be mainly basic research in the fields of both axonal protein synthesis and axon regeneration. My expertise is in the field of mRNA localization and local protein synthesis.*

      Batista, A. F. R., Martínez, J. C., & Hengst, U. (2017). Intra-axonal synthesis of SNAP25 is required for the formation of presynaptic terminals. Cell Reports, 20(13), 3085. https://doi.org/10.1016/J.CELREP.2017.08.097

      Fan, X. xuan, Hao, Y. ying, Guo, S. wen, Zhao, X. ping, Xiang, Y., Feng, F. xue, Liang, G. ting, & Dong, Y. wei. (2018). Knockdown of RTN1-C attenuates traumatic neuronal injury through regulating intracellular Ca2+ homeostasis. Neurochemistry International, 121, 19-25. https://doi.org/10.1016/J.NEUINT.2018.10.018

      Gracias, N. G., Shirkey-Son, N. J., & Hengst, U. (2014). Local translation of TC10 is required for membrane expansion during axon outgrowth. Nature Communications 2014 5:1, 5(1), 1-13. https://doi.org/10.1038/ncomms4506

      Lucci, C., Mesquita-Ribeiro, R., Rathbone, A., & Dajas-Bailador, F. (2020). Spatiotemporal regulation of GSK3β levels by miRNA-26a controls axon development in cortical neurons. Development (Cambridge), 147(3). https://doi.org/10.1242/DEV.180232,

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript investigates the relationship between the endoplasmic reticulum morphogen reticulon-1 (Rtn-1) and the microtubule severing protein spastin in axons after injury. The main message and conclusion of the paper is that local axonal synthesis of Rtn-1 plays a role in regulating the microtubule severing activity of spastin by interacting with spastin and inhibiting its activity. This mechanism would be important after injury by regulating axonal growth.

      * The conclusions of the paper are based on the following claims:*

      * 1) Rtn-1 is synthesized locally in axons.*

      * 2) Specific downregulation in Rtn-1 in axons using microfluidic chambers affects microtubules abundance (measured by beta-3 tubulin) and promotes axon growth after injury.*

      * 3) Inhibition of spastin MT-severing activity with a specific drug rescues the growth effect induced by axonal downregulation of Rtn-1.*

      * 4) Rtn-1c interacts with spastin-M87 to limit its MT-severing activity in a cellular system upon overexpression.*

      *

      *

      Major comments:

      1) Evidence that Rtn-1 is synthesized in axons comes from two experiments. Initially, the authors show that Rtn-1 siRNA transfection in the axonal compartment of microfluidic chambers reduces Rtn-1 levels in axons, suggesting that there is some local synthesis. Although this method is very attractive, I am concerned about the statistical analysis. The graphs show bars rather than individual data points from the average of many neurons (about 300). The plots also show the SEM instead of the SD, thus covering all the variability that is inherent in this type of experiment. The statistics are probably not performed on the 3 biological replicates, but consider the individual neurons as N. This is obviously not correct, since neurons in an experiment may all be affected by the same technical problem and are not independent replicates. For this reason, I am a bit skeptical about this quantification. Another problem is that the quantification of the fluorescence intensity of the sample does not take the nuclei into account. Are the nuclei removed for analysis? Are the images single planes? Addressing the quantification issues is crucial also for data in Figure 4, where the authors show a different effect of Rtn-1 axonal KD after injury.

      * The second experiment is the Puro-PLA in Figure 6D. This experiment shows an average of 1.5 dots of signal per soma, which is a very low level of translation for this compartment where most of the synthesis should be taking place. In the axons, it is not clear how they calculate the axonal area. Again, the number of dots detected is very low and the physiological significance is questionable. A control with a known mRNA translated in axons would be important.*

      * Finally, as an important control, the authors should show the presence of Rtn-1 mRNA by FISH in their experimental system.*

      __R: __We appreciate the critical points addressed here as they moved us to improve the quality of the findings. We analyzed cells/axons as statistical units to increase statistical power given the subtle nature of these local changes. We agree with the reviewer that this approach may increase the risk of finding false positives. To address this point, i) we plotted the individual data points and colored them according with the different experimental dates (all the dates showed a similar trend) ii) We indicated SD instead of SEM iii) We analyzed our data using linear mixed-effects models, with experimental date included as a random effect. This approach allows to preserve the granularity and statistical power, while avoiding pseudoreplication. To exclude artifactual changes, we now analyzed the intensity fold change of total fluorescence normalized to Scr. Our former quantifications were based on the corrected fluorescence intensity used to construct the plot profiles, which could be adding some distortion to the measurements. These changes were applied throughout figures 2 and 4 (pages 34 and 38, respectively). After these new analyses the formerly presented results remain valid.

      We thank the reviewer for raising concerns about the quantification of fluorescence intensity in cell bodies. We now specify in Materials and methods that fluorescence intensity analysis of distal axons (always isolated by the microfluidic chambers) and of cell bodies was performed using the wide-field configuration of the microscope. In all the cases, a single (epifluorescent) plane was analyzed to reflect the total fluorescence of a cell or axon. We did not exclude the nuclear region from the quantifications, as this would also remove cytoplasmic signal located above or below the nucleus.

      We also understand the concerns about puro-PLA experiments. We agree with the reviewer that an average of 1.5 puncta per soma initially appeared low. We have identified at least three reasons for this. First, the signal derives from only a 15-minute puromycin pulse, which is a short labeling window. Second, our puro-PLA assay is particularly stringent, as ligation relied directly on puromycin- and Rtn-1C-labeled primary antibodies, without the additional spacing normally introduced by secondary antibodies. In standard PLA, the critical distance for amplification is ~30-40 nm, whereas in our assay this distance is even more restrictive. Third, in our initial analysis we applied an overly cautious threshold to define "true" amplification. We have now refined this threshold using a baseline defined by the absence of puromycin stimulation. With this improved criterion, we now quantify an average of ~5 puncta per soma and ~10 puncta per 1000 µm² of axonal area (Supplementary Figure 3D). As it is now included in methods, we calculated the axonal area by binarizing β3-tubulin staining and only counted the true amplification spots inside this region. Assuming a neuronal soma diameter of 15 µm (area ≈ 176.71 µm²), this yields ~0.028 puncta per µm² in somata. In comparison, axons display ~0.01 puncta per µm², approximately one-third of the soma value which seems more reasonable. This is also compatible with most of Rtn-1C synthesis comes from the cell body.

      Unfortunately, we could not be able to perform puro-PLA of other axonally synthesized proteins. Nevertheless, to further validate our puro-PLA signal, we tested the specificity of the Rtn-1C antibody we used for this assay by WB, IF, and Rtn-1 KD (Supplementary figure 3 A-C). In addition, we performed axonal Rtn-1 KD in microfluidic chambers for twenty-four hours, which elicited a significant decrease in puro PLA signal compared to Scr (Supplementary figure 3D). Together, these results strongly indicate that the quantified signal reflects Rtn-1C synthesis. To prove that Rtn-1 mRNA is present in these conditions, we now included a RT-PCR performed on RNA isolated from the somato-dendritic and pure axonal fractions of 8 DIV microfluidic chambers (Supplementary figure 3D). Note that the presence of this mRNA in axons has been supported by several studies, one of them using cortical neurons of similar DIV and cultured in microfluidic chambers (Table I and figure 1).

      2) The effects on tubulin following Rtn-1 downregulation in axons is potentially very interesting, but the authors should be careful because it could also mean that the axons are suffering. Can they also stain for other cytoskeletal markers?

      R: Regarding this concern, we are aware that in the former Figure 3 we mistakenly selected axonal fields that did not display healthy axons, which was not the dominant trend. This is accredited by the lack of fragmentation and by the functional responsiveness (microtubule dynamics) shown in Figures 4 and 5B, C, E. We have now replaced the previous axonal fields in Figure 3 with more representative axons (healthy), devoid of varicosities and fragmentation (page 37)

      3) The results using SPTZ are very interesting and implicate spastin microtubule severing activity in the observed phenotype. In my opinion these experiments however do not prove that "axonal Rtn-1 is indeed promoting the severing of microtubules by spastin", but simply that the blocking spastin activity prevents the appearance of the microtubular phenotype (which appears still with a mysterious mechanism). What happens if they try to stabilize the cytoskeleton by another mean (with taxol for example?). The authors should rephrase this conclusion.

      __R: __We completely agree with the reviewer's appreciation. We now explicitly indicate in the main text that this is (so far in the manuscript) a still correlative phenomenon that suggests an interplay with Spastin activity "..Together, these results suggest that locally synthesized Rtn-1 normally acts to suppress the outgrowth of injured axons, a process that could involve the microtubule-severing activity of Spastin." (lines 321-323). Later in the article, with the improved Figure 7, we further propose that these findings may reflect a causal relationship, although this mechanism has not yet been directly demonstrated in axons.

      4) The last experiment (Figure 7) that aims to connect Rtn-1 and spastin function is very artificial, since it is based on overexpression. Why should spastin M87 interact with an ER morphogen? Endogenously it is conceivable that spastin M1 which localizes to the ER would interact with Rtn-1. Moreover, this experiment needs further controls and quantifications. First, it is quite obvious from panel 7C that there is crossover of signal in the two fluorescence channels (see GFP and spastin). Controls need to be shown, where only one of the two fluorescent proteins is expressed, and the specificity of the laser is tested. This experiment is based on only 1 cell shown where co-localisation is detected based on a line that is placed in a specific area of the cell. The effects on the microtubular network needs quantification.

      __R: __We have now improved Figure 7 and added the requested controls to rule out crosstalk as indicated in Supplementary Figure 5 and in the main text. We agree that under normal conditions GFP should display a diffuse cytoplasmic distribution. However, in our experimental setup, we observed punctate GFP signals only in the context of co-expression with Spastin-RFP. This is consistent with prior reports showing that soluble GFP can occasionally be sequestered into late endosomal structures (Sahu et al., 2011), which are also known to harbor the M87 Spastin isoform (Allison et al., 2013; Allison et al., 2019). To exclude the possibility of unspecific fluorescence crosstalk, we independently acquired each fluorophore channel and confirmed that GFP puncta were genuine and not due to bleed-through (Supplementary Figure 5). Further, cells expressing only GFP or only Spastin-RFP did not show overlapping puncta (arrowheads), and the co-expression of GFP with Rtn-1A-RFP did not produce any apparent overlap, indicating that the punctate pattern of GFP is specifically associated with Spastin co-expression. Thus, we consider that the observed GFP colocalization with Spastin potentially reflects a true phenomenon and not an artifact of imaging or fluorophore bleed-through.

      We thank for these observations and apologize for the confusion in the outline of the former figure 7 and the lack of a better description. As the reviewer indicates, one interesting aspect of the M87 isoform is that lacks the ER morphogen domain (so is soluble or cytoplasmic in principle). However, it also harbors endosome and microtubule binding domains which according to previous literature (now included in the main text) may render it a punctate rather than a homogeneous pattern. Also, M87 is the most abundant isoform in the nervous system, particularly at early development. This is the reason why we selected this isoform to test our model. To clarify this point, we based our colocalization analysis in different cells and experimental dates and analyzed all the z-stacks for each cell (see new figure 7B and methods), the intensity plots (now removed) were only for graphical purposes. Similarly, we had already quantified the total tubulin intensity in COS cells based on many cells from different dates and included the sum projections of all the z-stacks from these cells (see new figure 7C). Thus, we removed the intensity profiles as they were clearly misleading (see new figure 7).

      We agree that over-expressing constructs may force interactions or co-distribution of proteins. However, in this case, if the observed results were mainly due to over-expression, we should see a similar trend with isoform A as both constructs are under the control of the same strong promoter (CMV) and harbor the same ER morphogen domain (RHD). Nevertheless, the distribution of M87 tightly mirrors Rtn-1C, which is not the case for Rtn-1A. Only as a theoretical prediction, our molecular modeling suggests that Rtn-1C may be associated with Spastin through its microtubule binding domain (Figure 7E). This would suppose that Spastin "decorates" ER-tubules rather than being in the same ER membranous structure. This discrete pattern of Spastin is more coherent with the distribution of both proteins that is now more clearly observed in distal axons by STED super-resolution (new figure 5A). So, despite a bit unexpected, these results suggest a novel interaction mechanism between these two proteins that deserves further validation.

      5) What is exactly the model proposed? The title implies that axonal synthesis of Rtn-1 is important during injury, but the data in the paper rather suggest that upon injury the majority of Rtn-1 is not locally synthesized. If the levels of Rtn-1 do not change, why the effect on the microtubules should be specific? Why would a siRNA against Rtn-1 in axons not affect the levels of Rtn-1, but those of tubulin? The authors should be careful, and test other control siRNAs, and Rtn-1 siRNAs, since it is well known even in more simple cellular systems that the toxicity of individual siRNAs can vary greatly.

      We consider the possibility that after injury there is no axonal Rtn-1 synthesis as a plausible and relevant appreciation. Unfortunately, we could not perform a puro-PLA experiment after injury, which would have provided a more definite answer. However, now we are more confident of regulating Rtn-1 synthesis before injury as supported by a Supplementary figure 3D that shows a significant decrease on puro-PLA signal (indicative of Rtn-1C synthesis) 24 hours after axonal KD. Thus, based on some similar phenotypes before and after injury, we consider our results are still compatible with Rtn-1 axonal synthesis being downregulated, but not fully absent (the mRNA is still detected, as described by Taylor 2009). As such, axonal Rtn-1 KD decreased β3-tubulin levels before and after injury according to figure 5B and the improved statistical analysis performed on figure 2E. Similarly, axonal Rtn-1KD significantly increases microtubule growth rate before and after injury according to the current statistical comparisons (Figure 5E). in complement, if β3-tubulin decrease was merely due to unspecific siRNA targeting, it is unlikely that SPTZ treatment should restore β3-tubulin only in the context of axonal Rtn-1 KD (Figure 5B). Although on a different track, the mechanistic relationship between Rtn-1C and Spastin suggested in Figure 7 could make more plausible that a similar phenomenon regarding the control of tubulin levels could be occurring locally in axons. We have now included these considerations in the discussion (lines 535-543).

      To discard off-targets effects, we have now validated a third siRNA sequence (siRNA 3) specifically designed against Rtn-1 and showed that it selectively downregulates Rtn-1C but not β3-tubulin in cultured cortical neurons. Then, following the same experimental frame of figure 3, we performed axonal Rtn-1 KD after injury and observed that siRNA 3 also significantly increases the outgrowth of injured axons (Supplementary figure 2). This suggests that, at least this phenotype, is not product of an off-target effect. Thus, the pharmacological rescue of β3-tubulin levels by SPTZ (Figure 5B) and the Rtn-1C/Spastin co-distribution in heterologous cells, which correlates with preserved microtubules (improved Figure 7), provide converging evidence to suggest that Rtn-1C-Spastin interplay may underly the observed phenotypes in axons.

      Minor comments:

      In Figure 5A, it would be helpful to indicate the border of the axon. The figure is not really convincing.

      Following yours and other reviewer comments, we have analyzed a new set of experiments regarding the STED images of non-injured and injured axons. To eliminate the risk of artifactual descriptions, we have avoided deconvolution and worked directly with raw STED images (Figure 5A). Under these conditions, distribution of Spastin and its intensity in distal axons are not modified by injury, nor those of Rtn-1C and Spastin (Supplementary figure 4). Despite these results, data still supports that both proteins are restricted to similar domains subcellular domains before and after injury.

      Reviewer #3 (Significance (Required)):

      The manuscript uses complex methods to address an interesting cell biological question of relevance to understand axonal growth regulation upon injury. A limitation of the study is the statistical analysis, which triggers some doubts about the reproducibility of the data. Further experiments and the addition of controls would be important to support the claims of the authors.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript investigates the relationship between the endoplasmic reticulum morphogen reticulon-1 (Rtn-1) and the microtubule severing protein spastin in axons after injury. The main message and conclusion of the paper is that local axonal synthesis of Rtn-1 plays a role in regulating the microtubule severing activity of spastin by interacting with spastin and inhibiting its activity. This mechanism would be important after injury by regulating axonal growth.

      The conclusions of the paper are based on the following claims:

      1. Rtn-1 is synthesized locally in axons.
      2. Specific downregulation in Rtn-1 in axons using microfluidic chambers affects microtubules abundance (measured by beta-3 tubulin) and promotes axon growth after injury.
      3. Inhibition of spastin MT-severing activity with a specific drug rescues the growth effect induced by axonal downregulation of Rtn-1.
      4. Rtn-1c interacts with spastin-M87 to limit its MT-severing activity in a cellular system upon overexpression.

      Major comments:

      1. Evidence that Rtn-1 is synthesized in axons comes from two experiments. Initially, the authors show that Rtn-1 siRNA transfection in the axonal compartment of microfluidic chambers reduces Rtn-1 levels in axons, suggesting that there is some local synthesis. Although this method is very attractive, I am concerned about the statistical analysis. The graphs show bars rather than individual data points from the average of a large number of neurons (about 300). The plots also show the SEM instead of the SD, thus covering all the variability that is inherent in this type of experiment. The statistics are probably not performed on the 3 biological replicates, but consider the individual neurons as N. This is obviously not correct, since neurons in an experiment may all be affected by the same technical problem and are not independent replicates. For this reason, I am a bit skeptical about this quantification. Another problem is that the quantification of the fluorescence intensity of the sample does not take the nuclei into account. Are the nuclei removed for analysis? Are the images single planes? Addressing the quantification issues is crucial also for data in Figure 4, where the authors show a different effect of Rtn-1 axonal KD after injury. The second experiment is the Puro-PLA in Figure 6D. This experiment shows an average of 1.5 dots of signal per soma, which is a very low level of translation for this compartment where most of the synthesis should be taking place. In the axons, it is not clear how they calculate the axonal area. Again, the number of dots detected is very low and the physiological significance is questionable. A control with a known mRNA translated in axons would be important. Finally, as an important control, the authors should show the presence of Rtn-1 mRNA by FISH in their experimental system.
      2. The effects on tubulin following Rtn-1 downregulation in axons is potentially very interesting, but the authors should be careful because it could also mean that the axons are suffering. Can they also stain for other cytoskeletal markers?
      3. The results using SPTZ are very interesting and implicate spastin microtubule severing activity in the observed phenotype. In my opinion these experiments however do not prove that "axonal Rtn-1 is indeed promoting the severing of microtubules by spastin", but simply that the blocking spastin activity prevents the appearance of the microtubular phenotype (which appears still with a mysterious mechanism). What happens if they try to stabilize the cytoskeleton by another mean (with taxol for example?). The authors should rephrase this conclusion.
      4. The last experiment (Figure 7) that aims to connect Rtn-1 and spastin function is very artificial, since it is based on overexpression. Why should spastin M87 interact with an ER morphogen? Endogenously it is conceivable that spastin M1 which localizes to the ER would interact with Rtn-1. Moreover, this experiment needs further controls and quantifications. First, it is quite obvious from panel 7C that there is crossover of signal in the two fluorescence channels (see GFP and spastin). Controls need to be shown, where only one of the two fluorescent proteins is expressed and the specificity of the laser is tested. This experiment is based on only 1 cell shown where co-localisation is detected based on a line that is placed in a specific area of the cell. The effects on the microtubular network needs quantification.
      5. What is exactly the model proposed? The title implies that axonal synthesis of Rtn-1 is important during injury, but the data in the paper rather suggest that upon injury the majority of Rtn-1 is not locally synthesized. If the levels of Rtn-1 do not change, why the effect on the microtubules should be specific? Why would a siRNA against Rtn-1 in axons not affect the levels of Rtn-1, but those of tubulin? The authors should be careful, and test other control siRNAs, and Rtn-1 siRNAs, since it is well known even in more simple cellular systems that the toxicity of individual siRNAs can vary greatly.

      Minor comments:

      In Figure 5A, it would be helpful to indicate the border of the axon. The figure is not really convincing.

      Significance

      The manuscript uses complex methods to address an interesting cell biological question of relevance to understand axonal growth regulation upon injury. A limitation of the study is the statistical analysis, which triggers some doubts about the reproducibility of the data. Further experiments and the addition of controls would be important to support the claims of the authors.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Axonal mRNA localization and localized translation supports many neuronal functions and is an important determinant of the regenerative potential of axons after injury. How this works mechanistically remains unclear. The authors present a well performed and technically challenging study in which they identify RTN-1 as a regulator of axonal outgrowth after injury. They provide evidence using experiments in microfluidic chambers that RTN1 is locally synthesized in axons. Interestingly, they identify a (local) interplay between RTN1 and Spastin which affects microtubules and thereby regulates the outgrowth of cortical axons after injury. This study provides an interesting new link between a locally synthesized protein (RTN1) and a microtubule-regulating protein Spastin that is changed upon axon injury. This provides an advance in our understanding in axon regeneration after injury and provides the basis for new studies that can further investigate this interplay. Although interesting, I have several concerns that should be clarified and are needed to substantiate the findings and model presented in this study.

      Major concerns:

      1. In figure 1, the authors provide an analysis of overlapping axonal mRNAs. There are more axonal transcriptome studies and a recent study by von Kugelgen and Chekulaeva (2020; doi: 10.1002/wrna.1590) already performed such an analysis, which included more studies. It would be good to mention this. It can be perceived that studies were now chosen to get the outcome that Rtn-1 is present in all studies. For example, von Kugelgen finds mRNA coding for RTN3, another ER structural protein, as present in 16 out of 20 studies analyzed. That said, the authors present more reasons to look at Rtn-1, so the selection to continue with this protein remains valid but can be written up differently so not to present it as the 'sole' ER-shaping protein consistently present in axonal transcriptomes.
      2. The description of methods is currently insufficient and incomplete and does not allow for reproducibility of this study. For example, different Rtn-1 antibodies seem to be used in this study. Is the same antibody used for staining and WB? There is no listing of any of the antibodies used in the study and which one is used for which technique/experiment. This should be clarified and should be easy to do so in the methods section (antibody name, origin/company, dilution used) to enhance reproducibility of this study. This is not limited to primary antibodies and any information on secondary antibodies, including what was used for STED is completely missing.
      3. The timeline of KD experiments in Figure 2 and 3 are unclear. For the Western blot KD is performed at DIV7 and collected 48 hours later. However, this is not specified for the stainings done in Figure 2C-E. Is this also at DIV7 and then for 48 hours? In figure 3 the siRNA is added at DIV8 (together with axotomy) and outgrowth is measured 24 hours later. Is 24 hours sufficient to achieve knockdown? Is this also what was done for stainings? Later on in Figure 5B, 48 hours of KD is again used. It is unclear what the rationale of these differing timepoints is. Why was this chosen? Is the timeline also the reason for the difference in segment lengths chosen? In Figure 3, there is a significant effect on outgrowth in the KD in the 'mid-range' which is not present in Figure 5.
      4. Could the authors provide a rescue condition for their siRNA (using a siRNA-resistant construct) to show that their siRNA is specific for RTN1. They nicely show the efficiency of the siRNA but not its specificity. This is crucial because if not specific, this will affect a large part of their study. They already have RTN1A and RTN1C constructs available. Such a rescue experiment should ideally also be performed for one or more of their phenotypic experiments, such as the one presented in Figure 3A or 5 to show that the phenotype is really RTN1 dependent. If done by re-expressing either RTN1A or RTN1C, this could provide insightful information on the relevant isoforms.
      5. I find the data presented in Figure 4A/B confusing. Axonal RTN-1 KD does not reduce axonal RTN1 levels but somatic KD does. I understand that this implies most protein comes from the soma and the authors indeed present an explanation that increased somatic RTN1 occurs after axonal KD as a compensation mechanism. However, this can also be interpreted that there is no axonal synthesis of RTN1 after injury and axonal KD has indirect or even aspecific effects. Their model depends on this difference. Their data in Figure 6 could provide supporting evidence if it shows RTN1 puro-PLA after injury. Along these same lines, in Figure 6, they nicely include a compartment control for puro-PLA. It therefore seems doable to include a somatic puromycin control for their axonal puro-PLA, to exclude and diffusion/transport of the newly synthesized peptides. This is especially in light of two recent papers reporting on this possible phenomenon, although these studies were not performed in neurons.
      6. In Figure 5A the authors find an increased co-localization (RTN1/Spastin) after axotomy. From their images, it seems that the amount of Spastin is hugely increased, which would by default increase the chance of (random) colocalization of RTN1 on Spastin. Could the authors comment on this?
      7. In figure 5E and 5F, the condition of scr + SPTZ is omitted. What is the reason for this? The explanation of results in these figures is confusing. The authors report a 'clear trend' in increase in comet track length and lifetime upon addition of SPTZ to axonal RTN-1 KD. This is however not significant. The comparisons that are made afterwards are confusing (e.g. increase in comet lifetime of SPTZ in non-injured axons with RTN1 KD compared to Scr+DMSO and KD + DMSO in injured axons). Their conclusion is axonal RTN-1 synthesis in injured axons (see my concern in the points above on this) governs microtubules growth rate beyond Spastin activity yet blocking Spastin activity still completely blocks the effect of KD on outgrowth.

      Other/minor concerns:

      • The gene ontology analysis in Figure 1A contains the category 'Endoplasmic reticulum'. In this category are mainly ribosomal proteins. Although in a gene ontology analysis these proteins will be included in this category, it is misleading in this respect since they are just as likely to be coming from cytoplasmic ribosomes. Although it cannot be excluded that these are ER-bound ribosomes, not in the last place because a recent study (Koppers et al., 2024, doi: 10.1016/j.devcel.2024.05.005) found ribosomes attached to the ER in axons, I believe the category should be adapted or at the least clarified in the text.
      • Is RTN-1C isoform still an ER-shaping protein or rather an ER protein with alternative functions? The final sentence in the abstract makes a statement that a locally synthesized ER-shaping protein lessens microtubule dynamics. Could the authors provide a clearer description and discussion of the evidence in literature for this? RTN1C has been suggested to perform alternative functions in which case the statement that the local synthesis of an ER-shaping protein is important for axonal outgrowth should be adapted.
      • Is there a difference in RTN1 distribution or levels pre- and post-axotomy?
      • Line 100/101 states 'the interactome of the axonal ER provides...'. To my knowledge there has been no study looking at the interactome of the axonal ER specifically. Surely axonal ER proteins are known but there is a difference.
      • Typo line 160 'localla'
      • In Figure S1 B, please add the DIVs to make it more clear what each graph corresponds to. The legend of S1B states different distances from the cell body but the graph shows distances from the tip.
      • Figure 2C, why does B3 tubulin decrease in soma, aspecific effect of siRNA?
      • What is the rationale on the opposite effect found in outgrowth in Figure 3?
      • Missing word 'we' on line 194
      • Typo line 629 'witmn h', please proofread the entire manuscript carefully.
      • Could the authors comment on why, in Figure 7B/C, GFP only is colocalizing with Spastin-RFP? In general, GFP should be diffusive and not display punctate colocalization with Spastin.

      Significance

      Axonal mRNA localization and localized translation supports many neuronal functions and is an important determinant of the regenerative potential of axons after injury. How this works mechanistically remains unclear. The authors present a well performed and technically challenging study in which they identify RTN-1 as a regulator of axonal outgrowth after injury. They provide evidence using experiments in microfluidic chambers that RTN1 is locally synthesized in axons. Interestingly, they identify a (local) interplay between RTN1 and Spastin which affects microtubules and thereby regulates the outgrowth of cortical axons after injury. This study provides an interesting new link between a locally synthesized protein (RTN1) and a microtubule-regulating protein Spastin that is changed upon axon injury. This provides an advance in our understanding in axon regeneration after injury and provides the basis for new studies that can further investigate this interplay. Although interesting, I have several concerns that should be clarified and are needed to substantiate the findings and model presented in this study.

      The audience for this study will be mainly basic research in the fields of both axonal protein synthesis and axon regeneration. My expertise is in the field of mRNA localization and local protein synthesis.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper, the authors focus on the role of Reticulon-1C in concert with Spastin in response to axonal injury. In data mining, they find axonal mRNAs encoding for ER-associated proteins including Rtn-1. They establish a knockdown targeting both Rtn-1 isoforms Rtn-1A and Rtn-1C. They observe decreased beta-3-Tubulin levels in the soma while axonal protein levels are unchanged. In microfluidic devices, they characterise the effect of a compartment-specific Rtn-1 KD on axonal outgrowth in the axonal compartment. The authors quantify axonal outgrowth, seeing increased outgrowth in an axonal compartment-specific Rtn-1 KD, while the effect seems to be reversed when applying the KD construct in the somatic compartment. When focussing on the axonal growth cone, they find the Rtn-1 KD shows differences in several morphological features of the growth cone. They find an increase in Tubulin levels in an axonal compartment-specific, but a decrease in a somatic compartment-specific Rtn-1 KD. Colocalisation of Rtn-1C and Spastin is shown to be monolaterally increased following axotomy. Combining axotomy with the Rtn-1 KD shows increases in dynamic microtubule growth rates and track lengths. In another model system, neuron balls, they show Rtn1-C, but not Rtn1-A to be present in the axon. In a puro-PL assay they also show it can be synthesised in the axonal compartment. To investigate the mechanism enabling the cooperation between Spastin and Rtn-1C, they move to a cell line model in which they see a correlating distribution between Spastin and Rtn-1C but not Rtn-1A. Finally, they use in silico modelling to speculate on binding between Spastin domains and Rtn-1 isoforms.

      Major comment:

      The rationale behind the work is convincing, however some interpretations are presented as more robust than some data allow. Most notably, while the interaction between Rtn-1 and Spastin has been shown prior to this study, it is only presented here through in silico analysis. In figure 5, an increase in the growth rate of dynamic microtubules is observed in either a Rtn-1C KD or by using a Spastin-inhibitor. Due to a described increase in colocalisation between Rtn-1C and Spastin (5A), the increase in growth rate is displayed as caused by Rtn-1 promoting Spastin's severing ability. This result might however be correlative. Further in the injured samples, Spastin-levels seemingly increase (in the representative images) and it is thus not surprising that the level of Rtn-1C colocalising with Spastin increases as well. This might not be indicative of a cooperation and further experimental evidence are required.

      Other comments:

      • Generally, graphs would benefit from individual values plotted as well as the summary. Font sizes and types (but rarely) are sometimes inconsistent. Proteins should be consistently written (capitalised or not).
      • Table 1 and figure 1 present data collected from a vast amount of resources. It should be highlighted that datasets from which data was obtained includes many different models, different DIVs and neuronal cell types. Figure 1B may benefit from a different colour scheme. "Ex-vivo" should be "Ex vivo". For "ER mRNAs are a relevant category" it is not described what "relevant" would mean in this context. The title might remove this small part or describe it in the text. It should be described how it is decided that mRNAs are "common".
      • Figure 2: add description to y-axis to describe what fold change is displayed, applies to multiple figures. Will improve readability of the figures. In 2C, the ROI showing neuronal somata should be increased to show part of the axon and not cut off the soma.
      • Figure 3: Three out of four axonal compartments seem to be comprised of dying or damaged axons. Especially the axonal KD scrambled image. It should be ensured that neuronal cultures are healthy. Typo in "intersections". The schematic of 3B is a great addition to explain the graphs above. Perhaps it could be a bit refined as it is currently hard to see whether this is a neuron or a growth cone without context. Maybe show where the axon connects to the depicted growth cones and change the third icon which looks like it was crossed out. Small formatting issues: remove additional space bar before "Figure 3." And add after "Bar"
      • Figure 4: If not misunderstanding what is depicted, in 4A and B, different lookup tables are used to depict the same signal. Only one of each images is necessary. Do the axons have more tiny branches in the Rtn-1 KD condition in 4A? Unclear why Rtn-1 levels are increased in the Rtn-1 KD (4C), please clarify.
      • Figure 5: It may be easier to understand what "axotomy" samples are if just referred to as "injured" as later in the same figure. The procedure could also very briefly be explained in the results. 5C should depict AUC in µm2 not µm. 5D Spastin is barely visible, brightness and contrast should be adjusted to enhance visibility.
      • Figure 6: It should be made clear why it is necessary to switch to another model system just for 6A, please indicate this in the text. PCR bands seem very pixelated, check the quality. It is unclear why soma genes/proteins were only tested with either PCR or WB others with both. Rtn-1C and Rtn1-A should be presented in the same order in the PCR and WB panel. Correct "Rtn1-1A" typo. In 6D, 1.5 dots per soma seems like a low number. When normalised to the area the soma vs the axon occupies, the compartmentalisation does not work? May be it make sense to refine analysis or apply puromycin in the somatic compartment and analyse the axonal compartment as comparison?
      • Figure 7: 7A shows two images depicting the same information that may not be needed. Can probably be removed. In 7B there is no negative (or any) correlation between Spastin levels and Tubulin, however later it is mentioned that Rtn-1C transports Spastin thus causing a decrease in Tubulin at certain locations? It is nclear if Spastin levels vary intensely between different samples. Mean intensity of the somatic area may be beneficial to rule this out. 7B Tubulin on the right top panel seems to have a decrease in Tubulin levels which is not visible due to the Y axis of Tubulin being set to a different range than the middle and lower panel. The average of line scans from multiple cells may be helpful to determine whether there is indeed no colocalization between Rtn-1A and Spastin. The provided representative images seem to show similar degrees of colocalization between Spastin and Rtn-1A/C.

      Results:

      • It would be helpful to reiterate the hypothesis at the start to ease the reading flow.
      • There seems to be minor redundancy in lines 132-138.
      • There are several spellings, proof-reading is recommended. For example, in line 136 should be "promotes". 160 "localla", 192 should be "the actin cytoskeleton".,194 should be "we first examined", 195 should be "Different", 223 "using", 259 "axons". ...
      • 154-155: Unclear, why the lower MW Rtn-1C was seen as more important.
      • 167 results of 2E not stated before interpreting them.
      • 181 would suggest "outline" instead of "perimeter".
      • 183-184 "longest shortest path" is a confusing term.
      • figure 4B should be referenced earlier in the sentence.
      • 243-244 may be correlation. Rtn-1 and Spastin do not necessarily interact so that this result is achieved.
      • 246: In figure 1 the KD seemed to have an effect on both Rtn-1 isoforms, why not here anymore? 259 "axons". 284 "counteract" instead of "suppress"?
      • 485: rephrase as the interaction between Rtn-1C with Spastin has not been shown directly in these experiments.

      Methods: 535 "in PBS". 543 citation error. 689-699 is it necessary to add a gaussian blur?

      References: Mannan, A U et al. appears twice in the citation list (36 and 44).

      Significance

      Overall, this manuscript describes novel fundings which will be interesting to the neuronal cell biology community and scientists working on the field of neuronal injury and regeneration. It is well structured, and the data are mostly well presented but sometimes conclusions are over-interpreted. However, several points need to be addressed in a more convincing way.

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      This manuscript described the translational responses to single and combined BCAA shortages in mouse cell lines. Using Ribo-seq and RNA-seq analysis, the authors found selective ribosome pausing at codons that encode the depleted amino acids, where the pausing at valine codons was prominent at both a single and triple starvations whereas isoleucine codons showed pausing only under a single depletion. They analyzed the mechanisms of the unexpected selective pausing and proposed that the positional codon usage bias could shape the ribosome stalling and tRNA charging patterns across different amino acids. They also examined the stress responses and the changes in the protein expression levels under BCAA starvation.

      The manuscript was well-written, and the findings are interesting, especially their model that positional codon usage bias could be a regulator of ribosome pausing and tRNA charging levels. Although different translational responses to distinct amino acid starvation have been widely documented, the positional codon usage bias is an interesting aspect. The manuscript's central message could have been made clearer. The authors may consider emphasizing this point more explicitly in the abstract. The rich multi-omics dataset in this work provides valuable resources for the translation field.

      We thank the reviewer for the thoughtful and positive evaluation of our work.

      Major comments

      1. The abstract may need to be revised since it is hard to immediately catch the authors' main point. If the authors regard this work as a resource paper, the current version is fine. But it could be better to point out the positional codon usages the authors found, which is a strong point of the current manuscript.

      Response: We thank the reviewer for highlighting the importance of positional codon usage, which indeed represents a key finding of our study. We revised the abstract, and we now emphasize this aspect more clearly. However, in response to review #2, we have framed the observed positional effects and the idea of an elongation bottleneck as one possible contributing mechanism among others and relate it specifically to the attenuation of isoleucine-specific stalling under triple starvation.

      1. Page 18 "Beyond these tRNA dynamics, our data also highlight the importance of the codon positional context within mRNAs, indicating that where a codon is located within the CDS can influence both the extent of ribosomal stalling and overall translation efficiency during nutrient stress." This idea is interesting. To what extent the authors think this could be generalized? The authors may discuss whether they think their proposed model is specific to the different ribosome stalling patterns between valine and isoleucine codons or generalized to other codon combinations. For example, the positional codon usage bias will be different among different organisms, and are there any previous reports on ribosome behaviors that align with their model?

      Response: We thank the reviewer for raising these important points. While our study primarily focuses on the differential stalling patterns of valine and isoleucine codons, we believe the underlying principle, that the position of codons within the CDS can modulate the extent of ribosome stalling, may under very specific circumstances extend beyond this amino acid pair. We expect this positional effect to be potentially relevant for combinations in which one amino acid has considerable enrichment near the 5′ end of coding sequences, coupled with starvation-sensitive tRNA isoacceptors, while the other does not. In our case, valine meets these criteria (see Fig. S11A and Fig. 6). In contrast, isoleucine and leucine codons, although also relatively frequent, show more variable positional distributions and are both decoded by isoacceptors that appear more resistant to starvation, as illustrated in Fig. 6 and reported for mammals and bacteria in Saikia et al. 2016; Darnell, Subramaniam, and O’Shea 2018; Elf et al. 2003; Dittmar et al. 2005. To explore the generalizability of this model, we have now included a transcriptome-wide analysis of codon position biases in mouse for all codons in the revised manuscript (Supplementary Figures 10 and 11). This analysis may serve as a basis to identify additional candidate codons for future studies. Furthermore, we now mention in the Discussion that amino acids with similar properties to valine regarding their positional distribution and tRNA isoacceptors, such as phenylalanine, and glutamine, whose tRNA isoacceptors are predicted to be fully deacylated under their respective starvation in bacteria (Elf et al. 2003), could be promising candidates for testing this model, in combination with amino acids, whose tRNAs are expected to remain partially charged under starvation or to be depleted at the start of the CDS such as i.e. His (Supplementary Fig.11C).

      Even if the authors think this model can be applied to BCAA starvation, would it be possible to explain the different isoleucine codon responses between single and double starvation? The authors may discuss why the ribosome stalling at isoleucine AUU and AUC codons was slightly attenuated under double starvation. And how about the different leucine codon responses among single, double, and triple starvations, although the pausing is not as strong as isoleucine and valine codons?

      Response: Regarding the attenuated isoleucine stalling under double starvation, we believe this is primarily due to stronger inhibition of the mTORC1 pathway when leucine is co-depleted (i.e., in the double starvation condition; Fig. 2D–F). This results in a more substantial suppression of global translation, reducing overall tRNA demand and thereby mitigating stalling (Darnell, 2018). A similar effect may explain the only mild leucine codon stalling observed under single leucine starvation, which also triggers strong mTORC1 inhibition and reduced initiation. In contrast, triple starvation does not suppress mTORC1 to the same extent, and thus reduced initiation alone cannot explain the absence of leucine codon stalling. Instead, we propose that additional features, such as the relative sensitivity of tRNA isoacceptors to starvation and their aminoacylation dynamics, must be considered. Valine tRNAs, for example, are known to be highly sensitive and become strongly deacylated under starvation in bacteria (Elf et al. 2003), a pattern that we also find in our own data (Fig. 6). Leucine tRNAs, by contrast, appear more resistant, possibly due to better amino acid recycling or isoacceptor-specific differences in charging kinetics, though further validation would be needed. However, combined with the strong stalling at 5′-enriched valine codons, this could reduce downstream ribosome traffic and limit exposure of leucine codons, thus preventing stalling. However, our new analysis of the positional relationship between valine and leucine codons within individual transcripts (now shown in Supplementary Figure 11B) did not reveal as strong a pattern as we observed for valine and isoleucine codons. We now discuss these points and their implications in the revised Discussion.

      Experimental validation using artificial reporters carrying biased sequences may also be considered.

      Response: We appreciate the reviewer’s suggestion. In fact, we explored this experimentally using a dual-fluorescent reporter system (GFP–RFP) (Juszkiewicz and Hegde 2017) containing consecutive Val or Ile codons. However, the constructs yielded variable and non-reproducible results under starvation conditions. In addition, testing the role of codon position would require placing the same codons at multiple defined positions within a single transcript and performing ribosome profiling directly on the reporter. This type of targeted experimental validation is technically challenging and falls beyond the scope of the current study. We now mention this explicitly in the revised Discussion as an interesting direction for future work.

      1. Page 13 "Moreover, we noticed that DT changes extend beyond the ribosomal A-site, including the P-site, E-site, and even further positions (Supplementary Fig. 2A), consistent with other studies on single amino acid starvation 39 (Supplementary Fig. 2B-C)." Could the widespread DT changes be due to Ribo-DT pipeline they used or difficulties in offset determination? Indeed the authors showed that this feature was found in other datasets, but it seems that the datasets were processed and analyzed in the same way as their data. The original Ribo-DT paper (Gobet and Naef, 2022, Methods) also showed some widespread DT changes even from RNA-seq. Another analysis method like the codon subsequence abundant shift as a part of diricore analysis (Loayza-Puch et al., 2016, Nature) did not show that broad changed regions. The authors are encouraged to re-analyze the data sets using different methods.

      Response: We agree with the reviewer that the fact that DT changes beyond the ribosomal A-site is puzzling, but this has already been seen in other papers using other approaches (Darnell, Subramaniam, and O’Shea 2018). To validate that this shift is not due to our A-site assignment, enrichment analysis, or DT method, we applied the Diricore pipeline to our Ribo-Seq data. The output of the pipeline provides either 5’-end ribosome density or “subsequence” analysis using an A-site offset for each read size based on the metagene profile at the start codon. Both analyses show the same enriched codons across the different conditions as in our analyses, and the broad shift is similar, with the maximum signal at E, -1 position (Fig. R1).

      1. Page 13 "Intriguingly, only two of the three isoleucine codons (AUU and AUC) showed increased DTs upon Ile starvation (p < 0.01), while just one leucine codon (CUU) exhibited a modest but significant DT increase (p < 0.01) under Leu starvation (Figure 1A-B, Supplementary Figure 2A)." How can the authors explain the different strengths of ribosome pausing at Ile codons under Ile and double starvation? The AUA codon did not show any pausing under either of the starvation conditions. Throughout the manuscript, the authors mainly describe the difference between amino acids but it is desirable to discuss the codon-level difference as well.

      Response: Thank you for raising this point. The observed differences in stalling between the isoleucine codons can likely be explained by differences in tRNA isoacceptor charging and positional bias within transcripts. The AUA codon is decoded by a distinct tRNAIle isoacceptor (tRNAIleUAU), which, according to our tRNA charging data (Fig. 6), remains largely charged during Ile starvation. This observation aligns with previous reports suggesting that this isoacceptor is more resistant to starvation-induced deacylation in mammalian cells and bacteria (Saikia et al. 2016; Elf et al. 2003). In contrast, the AUU and AUC codons are primarily decoded by the tRNAIleAAU isoacceptor, which we find to be strongly deacylated under Ile starvation, likely contributing to the observed codon-specific ribosome pausing. Additionally, we found that the AUA codons are relatively rare in general and particularly underrepresented near the 5′ ends of coding sequences. Our new spatial analysis (now included in Supplementary Figure 11B) confirms that AUA codons tend to occur downstream of AUU and AUC codons within transcripts. This potentially further reduces stalling on these codons and further diminishes their apparent DT increase under starvation. In order to better explain these important points, we have now expanded the codon-level discussion of these differences in the revised manuscript.

      1. Page 13 "We examined the effects of single amino acid starvations (-Leu, -Ile and -Val), as well as combinations, including a double starvation of leucine and isoleucine (hereafter referred to as "double") and a starvation of leucine, isoleucine, and valine ("triple"), allowing us to identify potential non-additive effects." The different double starvations, isoleucine and valine, and leucine and valine, will further support their hypothesis on the effects of the positional codon usage bias on ribosome pausing and tRNA charging patterns. Although this could be beyond the scope of the current manuscript, the authors are encouraged to provide a rationale for the chosen combination.

      Response: Our experimental design evolved stepwise: we initially focused on leucine and isoleucine depletion as we found that despite their structure similarity these had respectively short and long dwell times in our previous work in the mouse liver (Gobet et al. 2020). Valine was included at a later stage to cover all the BCAAs. At the time, we did not anticipate valine to yield particularly striking effects in cells, and therefore we did not include systematic pairwise depletions involving valine. However, the strong and unexpected stalling observed at valine codons, especially under triple starvation, became a central aspect of the study. Thus, we agree that additional combinations, such as Leu/Val or Val/Ile, could be informative and now mention this in the Discussion as a potential direction for future studies.

      Minor comments

      Page 16 "these results imply that BCAA deprivation lowers protein output through multiple pathways: a combination of reduced initiation, direct elongation blocks (stalling), and possibly an increased proteolysis" This conclusion is totally right but may be too general. Could the authors summarize BCAA-specific features of the events including reduced initiation, stalling, and proteolysis that all contribute to protein outputs? This is not well discussed in the latter sections including Discussion.

      Response: We thank the reviewer for this helpful suggestion. We agree that the original statement was too general and have revised the relevant section to more clearly delineate the distinct responses observed under each BCAA starvation condition. Specifically, we now summarize that valine starvation is characterized by strong, positionally biased ribosome stalling; leucine starvation primarily impacts translation initiation, likely via mTORC1 repression; and isoleucine starvation shows a mixed phenotype, with features of both impaired initiation and codon-specific elongation delays. We also clarify that while protein stability or degradation may contribute to the observed changes in protein output, our current data do not allow for quantitative assessment of proteolytic effects (e.g., changes in protein half-life). Therefore, we refrain from making direct quantitative conclusions about the differential modulations of proteolysis and instead focus our discussion on the translational mechanisms supported by our data.

      Reviewer #1 (Significance):

      The manuscript was well-written, and the findings are interesting, especially their model that positional codon usage bias could be a regulator of ribosome pausing and tRNA charging levels. Although different translational responses to distinct amino acid starvation have been widely documented, the positional codon usage bias is an interesting aspect. The manuscript's central message could have been made clearer. The authors may consider emphasizing this point more explicitly in the abstract. The rich multi-omics dataset in this work provides valuable resources for the translation field.

      We thank the reviewer for the encouraging comments and share the view that positional codon-usage bias is an important result; accordingly, we now underscore this point explicitly in the revised Abstract. We also emphasise that our other observations are, to our knowledge, novel: only a handful of multi-omics studies have combined ribosome-pausing profiles with direct tRNA-aminoacylation measurements, and none has systematically examined multiple amino-acid-deprivation conditions as presented here.

      Reviewer #2 (Evidence, reproducibility and clarity):

      This study examines the consequences of starvation for the BRCAAs, either singly, for Leu & Ile, or for all three simultaneously in HeLa cells on overall translation rates, decoding rates at each codon, and on ribosome density, protein expression, and distribution of ribosome stalling events across the CDS for each expressed gene. The single amino acid starvation regimes specifically reduce the cognate intracellular amino acid pool and lead to deacylation of at least a subset of the cognate tRNAs in a manner dependent on continuing protein synthesis. They also induce the ISR equally and decrease bulk protein synthesis equally in a manner that appears to occur largely at the initiation level for -Leu and -Val, judging by the decreased polysome:monsome ratio, but at both the initiation and elongation levels for -Ile-a distinction that remains unexplained. Only -Leu appears to down-regulate mTORC1 and TOP mRNA translation.There is a significant down-regulation of protein levels for 50-200 genes, which tend to be unstable in nutrient-replete cells, only a fraction of which are associated with reduced ribosome occupancies (RPFs measured by Ribo-Seq) on the corresponding mRNAs in the manner expected for reduced initiation, suggesting that delayed elongation is responsible for reduced protein levels for the remaining fraction of genes. All three single starvations lead to increased decoding times for a subset of the cognate "hungry" codons: CUU for -Leu, AUU and AUC for -Ile, and all of the Val codons, in a manner that is said to correspond largely to the particular tRNA isoacceptors that become deacylated, although this correspondence was not explained explicitly and might not be as simple as claimed. All three single starvations also evoke skewing of RPFs towards the 5' ends of many CDSs in a manner correlated with an enrichment within the early regions of the CDSs for one or more of the cognate codons that showed increased decoding times for -Ile (AUC codon) and -Val (GUU, GUC, and GUG), but not for -Leu-of which the latter was not accounted for. These last findings suggest that, at least for -Val and -Ile, delays in decoding N-terminal cognate codons cause elongating ribosomes to build-up early in the CDS. They go on to employ a peak calling algorithm to identify stalling sites in an unbiased way within the CDS, which are greatest in number for -Val, and find that Val codons are enriched in the A-sites (slightly) and adjacent 5' nucleotides (to a greater extent) for -Val starvation; and similarly for Ile codons in -Ile conditions, but not for -Leu starvation-again for unknown reasons. It's unclear why their called stalling sites have various other non-hungry codons present in the A sites with the cognate hungry codons being enriched further upstream, given that stalling should occur with the "hungry" cognate codon in the A site. The proteins showing down-regulation are enriched for stalling sites only in the case of the -Val starvation in the manner expected if stalling is contributing to reduced translation of the corresponding mRNA. It's unclear why this enrichment apparently does not extend to -Ile starvation which shows comparable skewing of RPFs towards the 5'ends, and this fact diminishes the claim that pausing generally contributes to reduced translation for genes with abundant hungry codons. All of the same analyses were carried out for the Double -Ile/-Leu and Triple starvations and yield unexpected results, particularly for the triple starvation wherein decoding times are increased only at Val codons, skewing of RPFs towards the 5' ends of CDSs is correlated only with an enrichment for Val codons within the early regions of the CDSs, and stall sites are enriched only for Val codons at nearly upstream sites, all consistent with the finding that only Val tRNAs become deacylated in the Triple regime. To explain why only Val tRNA charging is reduced despite the observed effective starvation for all three amino acids, they note first that stalling at Val codons is skewed towards the 5'ends of CDS for both -Val and triple starvations more so than observed for Ile or -Leu starvation, which they attribute to a greater frequency of Val codons vs Ile codons in the 5' ends of CDSs. As such, charged Val tRNAs are said to be consumed in translating the 5'ends of CDSs and the resulting stalling prevents ribosomes from reaching downstream Ile and Leu codons at the same frequencies and thus prevents deacylation of the cognate Ile and Leu tRNAs. It's unclear whether this explanation is adequate to explain the complete lack of Ile or Leu tRNA deacylation observed even when amino acid recycling by the proteasome is inhibited-a treatment shown to exacerbate deacylation of cognate tRNAs in the single amino acid starvations and of Val tRNA in the triple starvation. As such, the statement in the Abstract "Notably, we could show that isoleucine starvation-specific stalling largely diminished under triple starvation, likely due to early elongation bottlenecks at valine codons" might be too strong and the word "possibly" would be preferred over "likely". It's also unclear why the proteins that are down-regulated in the triple starvation are not significantly enriched for stalling sites (Fig. 5B) given that the degree of skewing is comparable or greater than for -Val. This last point seems to undermine their conclusion in the Abstract that "that many proteins downregulated under BCAA deprivation harbor stalling sites, suggesting that compromised elongation contributes to decreased protein output." In the case of the double -Ile/-Leu starvation, a related phenomenon occurs wherein decoding rates are decreased for only the AUU Ile codon and only the AAU Ile tRNA becomes deacylated; although in this case increased RPFs in the 5' ends are not correlated with enrichment for Ile or Leu codons and, although not presented, apparently stall sites are not associated with the Ile codon in the double starvation. In addition, stalling sites are not enriched in the proteins down-regulated by the double starvation. Moreover, because Ile codons are not enriched in the 5'ends of CDS, it doesn't seem possible to explain the selective deacylation of the single Ile tRNA observed in the double starvation by the same "bottleneck" mechanism proposed to explain selective deacylation of only Val tRNAs during the triple starvation. This is another reason for questioning their "bottleneck" mechanism.

      We thank the reviewer for their deep assessment, exhaustive reading, and constructive feedback, which have greatly contributed to improving the clarity and contextualization of our manuscript. We would first like to clarify that all experiments in this study were conducted in NIH3T3 mouse fibroblasts, not HeLa cells; we assume this was a misunderstanding and have verified that the correct cell line is consistently indicated throughout the manuscript. We also clarify that our data show that -Leu, double starvation, and to a lesser extent -Ile, downregulate mTORC1 signaling and TOP mRNA translation, whereas valine -Val and triple starvation had minimal effects on these pathways. We agree that some of our conclusions and observed phenomena were not explained in sufficient detail in the original version. To address this, we have significantly reworked the discussion, added complementary figures and clarified key points throughout the text, to better convey the underlying rationale and biological interpretation of our findings. We address each of the reviewer’s points in detail in the point-by-point responses below.

      Specific comments (some of which were mentioned above):

      -The authors have treated cells with CHX in the Ribo-Seq experiments, which has been shown to cause artifacts in determining the locations of ribosome stalling in vivo owing to continued elongation in the presence of CHX (https://doi.org/10.1371/journal.pgen.1005732 ). The authors should comment on whether this artifact could be influencing some of their findings, particular the results in Fig. 5C where the hungry codons are often present upstream of the A sites of called stalling sites in the manner expected if elongation continued slowly following stalling in the presence of CHX.

      Response: We thank the reviewer for raising this important concern. We would like to clarify that our ribosome profiling protocol did not include CHX pretreatment of live cells. CHX was added only during the brief PBS washes immediately before lysis and in the lysis buffer itself. This approach aligns with best practices aimed at minimizing post-lysis ribosome run-off, and is intended to prevent the downstream ribosome displacement artifacts described by Hussmann et al. 2015, which result from pre-incubation of live cells with CHX for several minutes before harvesting. Furthermore, recent studies have demonstrated that CHX-induced biases are species-specific. For instance, Sharma et al. 2021 found that human (and mice) ribosomes are not susceptible to conformational restrictions by CHX, nor does CHX distort gene-level measurements of ribosome occupancy. This suggests that the use of CHX in the lysis buffer, as performed in our protocol, is unlikely to introduce significant artifacts in our ribosome profiling data. To further support this, we reanalyzed data from Darnell, Subramaniam, and O’Shea 2018, where the ribosome profiling samples were prepared without any CHX pretreatment or CHX in the wash buffer, and still observed similar upstream enrichments in their stalling profiles (see Supplementary Figure 2B-C in our manuscript). Additionally, in our previous work (Gobet et al. 2020), we compared ribosome dwell times with and without CHX in the lysis buffer and found no significant differences, reinforcing the notion that CHX use during lysis does not substantially affect the measurement of ribosome stalling. Given these considerations, we believe that CHX-related artifacts, such as downstream ribosome movement, are unlikely to explain the enrichment of hungry codons upstream of identified stalling sites in our data. We have now adjusted the Methods section to clarify this point.

      -p. 12: "These starvation-specific DT and ribosome density modulations were also evident at the individual transcript level, as exemplified by Col1a1, Col1a2, Aars, and Mki67 which showed persistent Val-codon-specific ribosome density increases but lost Ile-codon-specific increases under triple starvation (Supplementary Figure 3A-D). " This conclusion is hard to visualize for any but Val codons. It would help to annotate the relevant peaks of interest for -Ile starvation with arrows.

      Response: We agree and thank the reviewer for this observation. We have now annotated exemplary peaks in Supplementary Figure 3A–D to highlight ribosome pileups over Ile codons. However, we agree that it is still hard to visualize in the given Figure. Therefore, we added scatter plots for each of the transcripts that show the RPM of each position in the Ctrl vs starvation to allow for a better illustration of the milder effects upon Ile starvation (Supplementary Figure 4).

      -To better make the point that codon-specific stalling under BCAA starvation appears to be not driven by codon usage, rather than the analysis in Fig. 1H, wouldn't it be better to examine the correlation between increases in DT under the single amino acid starvation conditions and the codon frequencies across all codons?

      Response: We appreciate the suggestion. We have now added an additional analysis correlating the change in DT with codon usage frequency for each starvation condition. This is included in Supplementary Figure 5A-D and supports our interpretation that codon frequency alone does not explain the observed stalling behavior.

      -p. 13, entire paragraph beginning with "Our RNA-seq and Ribo-seq revealed a general activation of stress response pathways across all starvations..." It is difficult to glean any important conclusions from this lengthy analysis, and the results do not appear to be connected to the overall topic of the study. If there are important conclusions here that relate to the major findings then these connections should be made or noted later in the Discussion. If not, perhaps the analysis should be largely relegated to the Supplemental material.

      Response: We thank the reviewer for this comment. The paragraph in question is intended to provide a global overview of transcriptional and translational responses across the starvation conditions. It serves both as a quality control (e.g., PCA clustering and global shifts in RPF/RNA-seq profiles), and to confirm that expected starvation-induced responses are among the strongest detectable signals separating the starved samples from the control. Indeed, these observations establish that the perturbations are effective and that hallmark nutrient stress responses are globally engaged across conditions. Importantly, very few studies to date have examined transcriptional and translational responses under single or combined branched-chain amino acid (BCAA) starvation conditions. It therefore remains unclear to what extent BCAA depletion broadly remodels gene expression and translation. Our analysis contributes to addressing this gap, revealing that while certain stress pathways are commonly induced, others show condition-specific patterns such as we observed for -Ile starvation. To maintain focus, we have kept the detailed pathway analyses and transcript-level enrichments in the Supplement and rewritten the corresponding text in a more compact manner, reducing it by more than one third.

      -p. 15: "Together, these findings highlight that BCAA starvation triggers a combination of effects on initiation and elongation, with varying dynamics by amino acid starvation." I take issue with this statement as it appears that translation is reduced primarily at the initiation step for all conditions except -Ile. As noted above, these data are never menitioned in the DISCUSSION as to why only -Ile would show a marked elongation component to the inhibition whereas -Val gives the greatest amount of ribosome stalling.

      Response: We acknowledge the reviewer’s point. While the polysome profiles (Figure 3F-H) directly indicate that most conditions repress initiation, codon- and condition-specific elongation defects can still contribute to reduced protein output, even if they are not always detectable as global polysome shifts. Polysome profiles reflect the combined outcome of reduced initiation (which decreases polysome numbers) and ribosome stalling (which can, but does not always have to, increase ribosome density on individual transcripts, potentially counteracting the effects of reduced initiation). For valine starvation strong stalling occurs very early in the CDS (Figure 5F). This bottleneck restricts overall ribosome movement to downstream regions. Thus, while elongation is profoundly impaired, the total number of ribosomes per transcript (which polysome signals largely reflect) may appear low due to reduced overall ribosome traffic. In contrast, isoleucine codon stalling tends to occur also further downstream on the transcript (Figure 5F), allowing ribosomes to accumulate in larger numbers on the mRNA, leading to a clearer "elongation signature" in polysome profiles (Figure 3F, H). Additionally, we observed slightly higher inter-replicate variance for isoleucine starvation (Supplementary Figure 6B), which may have reduced the number of statistically significant stalling sites extracted compared to valine. We have revised the main text and discussion to clarify these points.

      -I cannot decipher Fig. 4D and more detail is required to indicate the identity of each column of data.

      Response: We thank the reviewer for pointing this out. Figure 4D (now Figure 4E) presents an UpSet plot, which is a scalable alternative to Venn diagrams commonly used to visualize intersections across multiple sets. Briefly, each bar in the upper plot represents the number of transcripts with increased 5′ ribosome coverage (Δpi < -0.15; p < 0.05) shared across the conditions indicated in the dot matrix below. Each column in the dot matrix highlights the specific combination of conditions contributing to a given intersection (e.g., dots under “Val” and “Triple” show the overlap between these two). To improve clarity, we have expanded the figure legend accordingly and now refer to the UpSetR methodology in the main text.

      -In Fig. 4E, one cannot determine what the P values actually are, which should be provided in the legend to confirm statistical significance.

      Response: Thank you for pointing that out. The legend in Figure 4E (now Figure 4F) for the p-values was accidentally removed during figure editing. We have added the legend back, so that the statistical significance is clear.

      -It's difficult to understand how the -Leu condition and the Double starvation can produce polarized RPFs (Fig. 4A) without evidence of stalling at the cognate hungry codons (Fig. 4E), despite showing later in Fig. 5A that the numbers of stall sites are comparable in those cases to that found for -Ile.

      Response: We appreciate this comment, which points to an important property of RPF profiles under nutrient stress. As shown in Figure 4A, all starvation conditions induce a degree of 5′ ribosome footprint polarization, a pattern that can be observed under various stress conditions and perturbations (Allen et al. 2021; Hwang and Buskirk 2017; Li et al. 2023). This general 5′ bias likely reflects a combination of slowed elongation and altered ribosome dynamics and is not necessarily linked to codon-specific stalling. However, Val and Triple starvation show a much stronger and more asymmetric polarization, characterized by pronounced 5′ accumulation and 3′ depletion of ribosome density. To better illustrate this, we have updated the visualization of polarity scores and added a new bar chart summarizing the number of transcripts showing strong 5′ polarization under each condition. This quantification highlights that the effect is markedly more prevalent under Val and Triple conditions than under Leu or Double starvation. In addition, Figure 4F demonstrates that this polarity is codon-specific under Val and Triple starvation. We clarify that this analysis tests for enrichment of specific codons near the start codon among the polarized transcripts and does not directly assess stalling. The observed enrichment of Val codons in the 5′ regions of polarized transcripts supports the interpretation that early elongation delays contribute to the RPF shift. In contrast, no such enrichment is observed for Leu starvation, reinforcing that Leu-induced polarity is not driven by stalling at Leu codons. While Figure 5 shows a similar number of peak-called stalling sites in -Leu, -Ile, and Double starvation, we note that Ribo-seq signal variability under Ile starvation was higher, which may have limited statistical power for detecting stalling sites, even though clear dwell time increases were observed at specific codons. Additionally, we have improved the metagene plots depicting total ribosome footprint density in Figure 4A. The previous version incorrectly showed sharp drops at CDS boundaries due to binning artifacts. The updated version more accurately reflects the density distribution and further highlights the stronger polarization in Val and Triple conditions. Together, these clarifications and improvements within the main text now more clearly distinguish between general polarity effects and codon-specific stalling.

      -Fig. 5B: the P values should be given for all five columns, and it should be explained here or in the Discussion why the authors conclude that stalling is an important determinant for reduced translation when a significant correlation seems to exist only for the -Val condition and not even for the Triple condition.

      Response: We thank the reviewer for this important observation. In response, we have revised both the text and the figures to provide a clearer and biologically more meaningful representation of the relationship between ribosome stalling and reduced protein output. Specifically, we have replaced the previous Figure 5B with a new analysis that stratifies transcripts based on the number of identified stalling sites. This updated analysis, now shown in Figure 5B, reveals that under Val and Triple starvation conditions, proteins that are downregulated tend to originate from transcripts with multiple stalling sites. Importantly, the corresponding p-values for all five conditions are now explicitly shown in the figure (as red lines). As the reviewer correctly notes, only the Val condition shows a statistically significant enrichment when considering overall overlap. Triple starvation shows a similarly high proportion of overlap (72.3%) but does not reach statistical significance, likely due to the more complex background composition under combined starvation, which increases the expected overlap and reduces statistical power. By stratifying transcripts by the number of stalling sites, we uncover that transcripts with ≥2 stalling sites are enriched among downregulated proteins specifically under Val and Triple conditions, providing a more robust indication of the link between stalling and translation repression under Valine deprivations. We believe this refined approach, prompted by the reviewer’s comment, offers a clearer and biologically more relevant perspective on the role of ribosome stalling. The original analysis previously shown in Figure 5B is now provided as Supplemental Figure 10C for transparency and comparison. We have clarified this in the revised text and now interpret the relationship more cautiously.

      -p. 17: "Of note, in cases where valine or isoleucine codons were present just upstream (rather than at) the stalling position, we noted a strong bias for GAG (E), GAA (E), GAU (D), GAC (D), AAG (K), CAG (Q), GUG (V) and GGA (G) (Val starvation) and AAC (N), GAC (D), CUG (L), GAG (E), GCC (A), CAG (Q), GAA (E) and AAG (K) (Ile starvation) at the stalling site (Supplementary Figure 7B)." The authors fail to explain why these codons would be present in the A sites at stalling sites rather than the hungry codons themselves, especially since it is the decoding times of the hungry codons that are increased according to Fig. 1A-E. As suggested above, is this a CHX artifact?

      Response: We agree that the observation that the listed codons are enriched at identified stalling positions (now Supplementary Figure 10C), while the depleted amino acid codon is located upstream, is a finding that needs more detailed explanation. Importantly, this phenomenon is not attributable to CHX artifacts, as our Ribo-seq protocol employs CHX solely during brief washes and lysis to prevent post-lysis ribosome run-off, rather than live-cell pre-treatment. Instead, we propose two hypotheses to explain this pattern: Firstly, many of these enriched codons are already inherently slow-decoded with longer DTs even under control conditions (Supplementary Figure 5H, newly added). Together with the upstream hungry codons they might form a challenging consecutive decoding environment, which results in an attenuated ribosome slowdown downstream after the hungry codon. Second, ribosome queuing may further explain this pattern. When a ribosome encounters a critically hungry codon and stalls, subsequent ribosomes can form a queue. The codon within the A-site of the queued ribosome would be (more or less) independent of the identity of the hungry codon itself that caused the initial stall. Since the listed codons have a high frequency within the transcriptome (Supp. Fig 5B), they therefore have an increased likelihood of appearing at this “stalling site”. Importantly, both of these phenomena are not necessarily represented by a general increase of DT on all of the listed codons and would therefore only be captured by the direct extraction of stalling sites but might be averaged out in the global dwell time analysis. We mention this phenomenon now in the Discussion.

      -Fig. 5D: P values for the significance, or lack thereof, of the different overlaps should be provided.

      Response: Thanks for pointing out this omission. We have now computed hypergeometric p-values for comparisons shown in Figure 5D and Figure 5E, and report them directly in the main text. As described, the overlap in stalling sites between Val and triple starvation is highly significant (2522 positions, p < 2.2×10⁻¹⁶), while overlaps involving Ile-specific stalling positions are smaller but still statistically robust (e.g., 149 positions for Ile – Triple, p = 1.77×10⁻⁵²). Notably, we also calculated p-values at the transcript level and found that a large fraction of transcripts with Ile-specific stalling under single starvation also stall under triple starvation, though often at different positions (1806 transcripts, p = 1.78×10⁻⁵⁸). These values are now included in the revised results section to support the interpretation of these overlaps.

      -p. 17: "Nonetheless, when we examined entire transcripts rather than single positions, many transcripts that exhibited isoleucine-related stalling under Ile starvation also stalled under triple starvation, but at different sites along the CDS (Figure 5E). This finding is particularly intriguing, as it suggests that while Ile-starvation-specific stalling sites may shift under triple starvation, the overall tendency of these transcripts to stall remains." The authors never come back to account for this unexpected result.

      Response: Thank you for highlighting this point. We've incorporated this finding as part of the proposed "bottleneck" scenario. While the isoleucine-specific stalling sites identified under Ile starvation do shift or disappear under triple starvation, we've observed that the same transcripts still tend to exhibit stalling. However, this now primarily occurs at upstream valine codons. We interpret this as a consequence of early elongation stalling caused by strong pausing at Val codons. This restriction on ribosome progression effectively prevents ribosomes from reaching the original Ile stalling sites. Therefore, the stalling sites identified under triple starvation are largely explained by the Val codons, reflecting a redistribution of stalling rather than its loss. To further clarify this crucial point, we've now explicitly mentioned Figure 5D-E again in the subsequent paragraph, which introduces the bottleneck theory.

      -It seems very difficult to reconcile the results in Fig. 5F with those in Fig. 4A, where similar polarities in RPFs are observed for -Ile and -Val in Fig, 4A but dramatically different distributions of stalling sites in Fig. 5F. More discussion of these discrepancies is required.

      Response: Thank you for pointing this out. The apparent discrepancy between the RPF profiles shown in Figure 4A and the stalling site distributions in Figure 5F likely reflects the fact that RPF polarization includes both general (unspecific) and codon-specific components. Figure 4A displays total ribosome footprint density, capturing both broad stress-induced effects and codon-specific contributions, whereas Figure 5F focuses specifically on peak-called stalling sites, representing localized and statistically significant pauses. Importantly, we would like to emphasise that Fig 4 shows that -Val and -Ile starvation exhibit different responses and not the same patterns. To make these differences even clearer, we have now updated the visualizations in Figure 4, including improved polarity plots and a new bar chart summarizing the number of transcripts with strong 5′ polarization. These additions highlight that the RPF profiles under -Val starvation are more pronounced and asymmetric, particularly due to 3′ depletion, while the polarity under -Ile is milder and a distinct, much smaller subset of transcripts appears to show polarity score shifts. We believe the updated figures and accompanying explanations now make these distinctions clearer.

      • p. 18: " These isoacceptor-specific patterns correlate largely with the particular subsets of leucine and isoleucine codons that stalled (Figure 1A)." This correlation needs to be addressed for each codon-anticodon pair for all of the codons showing stalling in Fig. 1A.

      Response: We thank the reviewer for this important comment. In the revised manuscript, we have expanded the relevant sections to address codon–anticodon relationships more thoroughly. We now explicitly match codons that exhibited increased dwell times under starvation to the corresponding tRNA isoacceptors whose charging was affected, and we provide a clearer discussion of the caveats involved. As noted by the reviewer, this correlation is not straightforward, as it is complicated by wobble base pairing, anticodon modifications, and the fact that multiple codons can be decoded by more than one isoacceptor, and vice versa. Moreover, in our qPCR-based tRNA charging assay, certain isoacceptors cannot be distinguished due to highly similar sequences (e.g., LeuAAG and LeuUAG, and LeuCAA and LeuCAG), which limits resolution for exact pairing. In addition, we did not assess absolute tRNA abundance, which may further influence decoding capacity. Nevertheless, where resolution is possible, the patterns align well: All tRNAVal isoacceptors became uncharged under Val and triple starvation, matching the consistent dwell time increases across all Val codons. Only tRNAIleAAU (decoding AUU and AUC) was deacylated, matching to these codons showing increased dwell times, while AUA (decoded by still-charged tRNAIleUAU) did not. Only CUU (decoded by uncharged tRNALeuGAA) showed increased dwell time. A mild deacylation of the other Leu isoacceptors was observed, but isoacceptor-level resolution is limited by assay constraints. However, these rather minimal tRNA and DT changes were consistent with more dominant initiation repression rather than elongation stalls. To support this analysis, we included an illustrative figure (now in Supplementary Figure 12F) summarizing the codon–anticodon matches.

      -p. 19: "For instance, in our double starvation condition, unchanged tRNA charging levels (Figure 6E) may result from a pronounced downregulation of global translation initiation, likely driven by the activation of stress responses (Figure 2), subsequently lowering the demand for charged tRNAs as it has been observed previously for Leu starvation 39.” This seems at odds with the comparable down-regulation of protein synthesis for the Double starvation and -Leu and -Ile single starvations shown in Fig. 3C. Also, in the current study, Leu starvation does lower charging of certain Leu tRNAs.

      Response: We thank the reviewer for raising this important point. In the revised manuscript, we have clarified this section and now offer a more refined interpretation of the tRNA charging patterns observed under double starvation. While Figure 3C shows a comparable reduction in global protein synthesis across the -Leu, -Ile, and double starvation conditions, it needs to be considered that the OPP assay has limited sensitivity. It operates in a relatively low fluorescence intensity range and is subject to background signal, which may obscure subtle differences between conditions. Moreover, other factors such as changes in protein stability or turnover could also contribute to the observed differences. Therefore, inter-condition differences in translation repression should be interpreted with caution. However, based on our stress response analysis (Figure 2), mTORC1 inactivation appears strongest under double starvation, likely leading to more profound suppression of translation initiation. This would reduce the overall demand for charged tRNAs and could explain why no detectable tRNA deacylation was observed under double starvation, even though mild uncharging of Leu isoacceptors occurred under -Leu, which exhibited a milder stress response. This distinction is consistent with the observed mild dwell time increases for one Leu codon under -Leu, but not in the double condition. Similarly, the absence of Ile codon stalling and tRNA deacylation under double starvation may be attributed to stress-driven reductions in elongation demand, preventing the tRNA depletion and codon-specific delays observed under single Ile starvation. A more direct clarification is now included in the revised manuscript.

      Reviewer #2 (Significance):

      The results here are significant in showing that starvation for a single amino acid does not lead to deacylation of all isoacceptors for that amino acid and in revealing that starvation for one amino acid can prevent deacylation of tRNAs for other amino acids, as shown most dramatically for the selective deacylation of only Val tRNAs in the triple BRCAA starvation condition. For the various reasons indicated above, however, I'm not convinced that their "bottleneck" mechanism is adequate to explain this phenomenon, especially in the case of the selective deacylation of Ile vs Leu tRNA in the Double starvation regime. It's also significant that deacylation leads to ribosome build-up near the 5'ends of CDS, which seems to be associated with an enrichment for the hungry codons in the case of Val and Ile starvation, but inexplicably, not for Leu or the Double starvations. This last discrepancy makes it hard to understand how the -Leu and Double starvations produce RPF buildups near the 5 ends of CDSs. In addition, the claim in the Discussion that "our data also highlight the importance of the codon positional context within mRNAs, indicating that where a codon is located within the CDS can influence both the extent of ribosomal stalling and overall translation efficiency during nutrient stress" overstates the strength of evidence that the stalling events lead to substantial decreases in translational efficiencies for the affected mRNAs, as the stalling frequency and decreased protein output are significantly correlated only for the -Val starvation, and the data in Fig. 3 D-H suggest that the reductions in protein synthesis generally occur at the level of initiation, even for -Val starvation, with a contribution from slow elongation only for -Ile-which is in itself difficult to understand considering that stalling frequencies are highest in -Val. Thus, while many of the results are very intriguing and will be of considerable interest to the translation field, it is my opinion that a number of results have been overinterpreted and that important inconsistencies and complexities have been overlooked in concluding that a significant component of the translational inhibition arises from the increased decoding times at hungry codons during elongation and that the selective deacylation of Val tRNAs in the Triple starvation can be explained by the "bottleneck" mechanism. The complexities and limitations of the data and their intepretations should be discussed much more thoroughly in the Discussion, which currently is devoted mostly to other phenomena often of tangential importance to the current findings. A suitably revised manuscript would clearly state the limitations and caveats of the proposed mechanisms and consider other possible explanations as well.

      Again, we thank the reviewer for the valuable insights and constructive critiques. We believe that the concerns regarding potential overinterpretation and inconsistencies have now been addressed through clearer explanations and more cautious interpretation throughout the revised manuscript. We also agree that the original Discussion included aspects that, while interesting, were of secondary importance. In light of the reviewer’s suggestions, we have restructured and rebalanced the Discussion to focus more directly on the key findings and their implications. Importantly, we wish to clarify that we do not propose the elongation bottleneck model as a general mechanism across all conditions. In particular, for double (Leu/Ile) starvation, we attribute the observed effects primarily to stress response–mediated translational repression, and not to codon-specific stalling or tRNA depletion. We believe that this distinction is now more clearly conveyed in the revised manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary

      Worpenberg and colleagues investigated the translational consequences of branched-chain amino acid (BCAA) starvation in mouse cells. Limitation of individual BCAAs has been reported to cause codon-specific and global translational repression. In this paper, the authors use RNA-seq, ribosome profiling (Ribo-seq), proteomics, and tRNA charging assays to characterize the impacts of individual and combined depletion of leucine, isoleucine, and valine on translation. They find that BCAA starvation increases codon-specific ribosome dwell times, activates global translational stress responses and reduces global protein synthesis. They infer that this effect is due to decreased translation initiation and codon-specific translational stalling. They find that the effects of simultaneous depletion are non-additive. In valine and triple (valine, leucine, and isoleucine) depletion, they show that affected transcripts have a high density of valine codons early in their coding sequences, creating an "elongation bottleneck" that obscures the impact of starvation of other amino acids. Finally, they identify isoacceptor-specific differences in tRNA charging that help explain the codon-specific effects that they observe.

      We find the major findings convincing and clear. We find that some results are incompletely explained. We suggest an additional experiment and also have some minor comments that we hope will improve clarity and rigor.

      We thank the reviewer for the thorough and constructive feedback. We appreciate the recognition of our main findings and the helpful suggestions for improving the manuscript. Below we address each point in detail.

      Major comments

      Figure 3O: In this figure and the associated text, the authors try to determine whether differences in protein degradation can explain why some proteins have higher ribosome density but lower proteomic expression. However, since this analysis relies on published protein half-lives from non-starvation conditions and on the assumption that protein synthesis has entirely stopped, we are not convinced it is informative for this experimental context. It does not distinguish between a model in which protein synthesis has been reduced by stalling and a model in which both protein synthesis and degradation rate have increased, which are both consistent with their Ribo-seq and proteomic data. To address this issue, the authors should either perform protein half-life measurements under their starvation conditions, or more clearly explain these two models in the text and acknowledge that they cannot distinguish between them.

      Response: We agree with the reviewer that our current analysis, which is based on protein half-lives obtained under non-starvation conditions, can not definitively separate the effects of reduced translation from those of increased protein degradation. We have revised the relevant section in the manuscript to more clearly state that this analysis is correlative in nature and serves only to explore one possible explanation for the observed disconnect between ribosome density and protein levels. We now also explicitly acknowledge that our dataset does not allow us to distinguish between a model in which protein output is reduced due to stalling and one in which both translation and degradation rates are altered. However, the observed log2FC in the proteomics data are often milder than expected based on complete-medium condition half-life alone, which would be difficult to reconcile with a dominant contribution from global protein destabilization. That said, we also acknowledge that protein degradation is highly context- and protein-specific, and that proteolytic regulation might still play a role. Performing a direct protein half-life measurement under our starvation conditions would indeed be required to rigorously test this, but such an experiment is outside the scope of this study. We now highlight this as a limitation and a valuable direction for future work, and we have softened any interpretations in the main text to reflect the uncertainty regarding the contribution of protein stability changes.

      Minor comments

      Figure 1G: Why does intracellular valine seem to be less depleted under starvation conditions than intracellular leucine or isoleucine? Are the limits of detections different for different amino acids? The authors should acknowledge this discrepancy and comment on whether it has any implications for interpretation of their results.

      Response: We thank the reviewer for this important point. While valine appears slightly less depleted than leucine or isoleucine in Figure 1G, the fold changes and absolute reductions are strong for all three BCAAs, including valine. To further illustrate this, we have added a supplementary bar chart showing the measured intracellular concentrations in µmol/L, including mean and variance across five biological replicates (Supplementary Figure 5A). We believe that the variation may reflect technical factors, such as differences in detection sensitivity or ionization efficiency between amino acids in the targeted metabolomics assay and, therefore, that the observed difference does not have a meaningful impact on the interpretation of our results. We now directly acknowledge these differences in the main text.

      Figure 1H: These data do not appear to meet the assumptions for linear regression. We suggest either reporting a Spearman R correlation (as the data appears linear in rank but not absolute value), or remove it entirely - we think the plot without statistics is sufficient.

      Response: We thank the reviewer for the suggestion. In the revised manuscript, we removed the statistical annotation and retained only the trend line to illustrate the general pattern. We agree that this visualization alone is sufficient to support the qualitative point we aimed to convey.

      Figure 2B: The in-text description of this figure states that "most" ISR genes show a "robust induction," but only three genes are shown in the figure, two of which are upregulated. The authors should instead specify that 2 out of the 3 genes profiled were robustly induced.

      Response: We have rephrased the sentence to say “two of the three genes profiled…” for precision and consistency with the data shown.

      Figure 2D: Please include the full, uncropped blots in the supplementary materials.

      Response: We have now added the full, uncropped western blots to the supplementary material (Supplementary Figure 8).

      Figure 2E: Swap the positions of the RPS6 and 4E-BP1 plots so they line up with their respective blots to make these figures easier to interpret. Authors should consider doing a one-way ANOVA and post-hoc analysis, if we correctly understand that they are making a conclusion about the difference between multiple groups in aggregate.

      Response: We thank the reviewer for the suggestion. The alignment of the RPS6 and 4E-BP1 plots with their respective blots has been corrected. As this panel focuses on comparisons to the control condition only, we have retained the original presentation.

      Figure 4B: Panel A in this figure is very convincing, and these plots don't add additional information. The authors could consider removing them. If this panel stays in, we suggest removing the "mid index" plot, since it is never referenced in the text and doesn't seem relevant to the message of the figure.

      Response: We appreciate the feedback. While we considered removing panel B as suggested, we decided to retain it because it provides a useful summary of panel A. To improve clarity and visual interpretation, we replaced the original boxplot with a bar plot displaying mean values and SEM error bars. We believe the bar plot now nicely illustrates that Val and Triple starvation lead to stronger effects, especially in the reduction of the 3′ index. The “mid index” plot, which was not referenced in the text and did not contribute to the central message, has been removed as suggested.

      Figure 4E: Why is there a reduction in frequency of a Leu and a Val codon under Ile starvation?

      Response: Thank you for highlighting this observation. The reduction in the frequency of a specific Leu and Val codon under Ile starvation in Figure 4F (former Figure 4E) is indeed intriguing. This figure reflects codon usage in the first 20% of the CDSs among the subset of transcripts that exhibit a footprint polarization under each starvation condition. As such, the observed depletion likely arises from the specific transcript composition of the polarized subset under -Ile, which differs from that under -Val or other conditions. Importantly, this pattern is not consistently observed when analyzing the full transcripts (another Leu codon is affected), indicating that it is not a systematic depletion of these codons. One possibility is that an increased frequency of Ile codons (AUC) within the constrained region may lead to a relative underrepresentation of other codons, such as Leu and Val. Alternatively, this may reflect non-random codon co-occurrence patterns within specific transcripts. While our current data do not allow us to investigate this further, we acknowledge these as speculative explanations and now mention this point in the Discussion as a potential avenue for future study.

      Figure 5G: There appears to be one Val codon early in the Hint1 transcript without much stalling under triple or valine starvation conditions. The authors should acknowledge this and comment on why this may be.

      Response: We thank the reviewer for pointing this out. While the Hint1 transcript indeed contains a valine codon early in its CDS, no clear stalling peak was observed at that position under valine or triple starvation. Several factors may contribute to this: local sequence context can influence ribosome pausing, and not all cognate codons necessarily lead to detectable stalling even under amino acid starvation. Additionally, coverage at the 5′ end of Hint1 is relatively sparse in our dataset, and potential mappability limitations, such as regions with low complexity or repetitive elements, may further reduce resolution at specific sites. We now briefly mention this in the manuscript to clarify the possible causes.

      Figure 5B: In the text referencing this figure, the authors state that "a high number of downregulated proteins with associated ribosome stalling sites did not show an overall decreased mean RPF count...as it would be expected from translation initiation defects, linking these stalling sites directly to proteomic changes." However, RPF is affected both by stalling (increases RPF) and initiation defects (decreases RPF). A gene with both stalling and decreased initiation may appear to have no RPF change. The data does suggest a contribution from stalling, but the authors should also acknowledge that reduced initiation may also be playing a role.

      Response: We agree with the reviewer comment. Our cited statement should indeed be more nuanced. The reviewer correctly points out that RPFs are influenced by both increased ribosome density due to stalling and decreased ribosome density due to reduced initiation. Therefore, a gene experiencing both stalling and reduced initiation might appear to have no net change in RPF, or even a slight increase if stalling is dominant. Thus, while the presence of stalling sites strongly suggests a contribution from compromised elongation to reduced protein output, we cannot definitively rule out a concurrent role for reduced initiation, even in cases where RPF counts are not globally decreased. We revised this section in the manuscript to acknowledge this interplay.

      Figure 5E: the black text on dark brown in the center of the Venn diagram is difficult to read. The diagram should either have a different color scheme, or the text in the center should be white instead of black for higher contrast.

      Response: We have adjusted the text color for better contrast and improved readability.

      Supplementary Figure 1C: The ribosome dwell time data in this study is described as "highly correlated" with another published dwell time dataset, but the P and E site data do not seem strongly correlated. The authors should remove the word "highly."

      Response: We have removed the word “highly” to have a more cautious interpretation in the text.

      Supplementary Figure 3E: Not all of the highlighted codons in this figure are ones with prolonged dwell times. To clarify the point that dwell time change is not related to codon frequency, this figure should only highlight codons that have a significantly prolonged dwell time in at least one starvation condition.

      Response: We thank the reviewer for pointing this out. To improve clarity, we have revised the figure and now specifically highlight codons with significantly prolonged dwell times with stars.

      Supplementary Figure 5C: The gene Chop is mentioned in the main text when referencing this figure, but is absent from the heatmap.

      Response: We thank the reviewer for noting this. The gene Chop is annotated under its alternative name Ddit3 in the current version of the heatmap and is indeed present. To avoid confusion, we have now updated the label in the figure to display Chop (Ddit3) directly.

      Supplementary Figure 7A: The authors could clarify this figure by adding additional language to either the figure panel or the figure legend specifying that the RPM metric being used comes from Ribo-seq.

      Response: We have updated the legend to explicitly state that the RPM values shown are derived from Ribo-seq data.

      Supplementary Figure 7D: The metric used to describe the spatial relationship between the first valine and isoleucine codons in transcripts in this figure seems to be describing something conceptually similar to the stalling sites in Figure 5G, but uses a different metric. These figures would be easier to interpret if these spatial relationships were presented in a consistent way throughout the manuscript.

      Response: We thank the reviewer for this helpful observation. Supplementary Figure 7D (now Supplementary Figure 11B) originally used a gene-length-normalized metric to describe codon spacing, whereas Figure 5G depicted absolute nucleotide distances to stalling sites. To ensure consistency across the manuscript, we have now updated Supplementary Figure 11B to also use absolute distances. We believe this adjustment improves clarity and allows for a more direct comparison between spatial codon patterns and stalling events.

      Discussion:

      Reader understanding would be improved if the relevance of paragraphs were established in the first sentence. For instance, in the paragraphs about adaptive misacylation and posttranscriptional modifications, it is unclear until the end of the paragraph how these topics are relevant. Introducing the relevant aspects of the study (the fact that some starvation conditions have less severe effects and the observation about m6A-related mRNAs) at the beginning of these paragraphs would improve clarity.

      Response: We thank the reviewer for this helpful comment. We agree that the flow and clarity of the Discussion can be improved by making the relevance of each paragraph clearer from the outset. In the revised manuscript, we have restructured these sections to better highlight the connection between each topic and our main findings. These changes also align with suggestions from Reviewer 2, and we believe they help to focus the Discussion more tightly around the core insights of our study.

      The authors should provide more information and speculation about possible physiological relevance of their findings, particularly about the way that the effects of triple starvation are highly valine-dependent. Are there physiological conditions under which starvation of all three BCAAs is more likely than starvation of one or two of them? If so, are there any reasons why a valine-based bottleneck might be advantageous?

      Response: We appreciate the reviewer's insightful question regarding the physiological relevance of our findings, particularly the valine-dependent bottleneck observed under triple BCAA starvation. This prompts a crucial discussion on the broader biological context of our work.

      While complete starvation of all three BCAAs might be less frequent than individual deficiencies, such conditions are physiologically relevant in several contexts. In prolonged fasting, starvation, or severe cachectic states associated with chronic diseases (e.g., advanced cancer, critical illness), systemic amino acid pools, including BCAAs, can become significantly depleted due to increased catabolism and insufficient intake (Yu et al. 2021). Moreover, certain specialized diets or therapeutic strategies aim to modulate BCAA levels. For instance, in some Maple Syrup Urine Disease (MSUD) management protocols, BCAA intake is severely restricted to prevent the accumulation of toxic BCAA metabolites (Mann et al. 2021). Similarly, emerging cancer therapies sometimes explore nutrient deprivation strategies to selectively target tumor cells, which could involve broad BCAA reduction (e.g. Sheen et al. 2011; Xiao et al. 2016).

      In these contexts, a valine-based bottleneck, as we describe, could indeed represent an adaptive strategy. If valine-tRNAs are particularly susceptible to deacylation and valine codons are strategically enriched at the 5' end of transcripts, stalling at these early positions could serve as a rapid "gatekeeper" for global translation. This early-stage inhibition would conserve cellular energy and available amino acids by quickly reducing the overall demand for charged tRNAs. Such a mechanism could potentially prioritize the translation of a subset of proteins that might have different codon usage biases or are translated via alternative, less valine-dependent mechanisms. This aligns with the concept of a multi-layered translational control where global initiation repression (as reflected in mTORC1 inhibition and polysome profiles) is complemented by specific elongation checkpoints, allowing for a more nuanced and adaptive response to severe nutrient stress.

      Reviewer #3 (Significance):

      Nature and significance of the advance

      The main contribution of this work is to demonstrate that depletion of multiple amino acids simultaneously impacts translation elongation in ways that are not necessarily additive. These impacts can depend on the distribution of codons in a transcript. It adds to a growing body of work showing that essential amino acid starvation can cause codon-specific ribosome stalling. The authors suggest that the position-dependent stalling they observe could be a novel regulatory mechanism to alleviate the effects of multi-amino acid starvation. However, it is not fully clear from the paper what the significance of a valine-based regulatory adaptation to BCAA starvation is, or whether simultaneous starvation of all three BCAAs is of particular physiological relevance. The paper's primary contribution is mainly focused on the similarity between valine and triple BCAA starvation, and it provides limited insight into the effects of combined depletion of two BCAAs.

      Context of existing literature

      Although ribosome profiling does not distinguish between actively-elongating and stalled ribosomes, sites with higher read coverage, and thereby higher inferred dwell time, can be used to infer ribosome stalling (Ingolia 2011). Various downstream effects of essential amino acid depletion have been documented, such as leucine deficiency being sensed by mTORC1 via leucyl-tRNA synthetase (Dittmar 2005, Han 2012), and shared transcriptional responses among many amino acid depletion conditions (Tang 2015). These authors have previously measured the translational effects of nutrient stress using ribosome profiling (e.g., Gobet 2020), as have others (Darnell 2018, Kochavi et al. 2024). The present work represents the first study (to our knowledge) combining BCAA depletions, representing an incremental and useful contribution to our understanding of translational responses to stress conditions.

      Audience

      This work is of interest to investigators studying the response of human cells in stress conditions, such as in human disease, as well as investigators studying the basic biology of eukaryotic translational control.

      Reviewer expertise: mRNA decay and translation regulation in bacteria.

      We hope the authors have found our comments thoughtful and useful. We welcome further discussion or clarification via email: Juliana Stanley (julianst@mit.edu) and Hannah LeBlanc (leblanch@mit.edu).

      We sincerely thank the reviewers for their thoughtful and constructive feedback, as well as for their careful and thorough reading of our manuscript. We also gratefully acknowledge the invitation for further discussion and would be happy to engage in future correspondence.

      References

      Allen, George E., Olesya O. Panasenko, Zoltan Villanyi, Marina Zagatti, Benjamin Weiss, Lucile Pagliazzo, Susanne Huch, et al. 2021. “Not4 and Not5 Modulate Translation Elongation by Rps7A Ubiquitination, Rli1 Moonlighting, and Condensates That Exclude eIF5A.” Cell Reports 36 (9): 109633. https://doi.org/10.1016/j.celrep.2021.109633.

      Darnell, Alicia M., Arvind R. Subramaniam, and Erin K. O’Shea. 2018. “Translational Control through Differential Ribosome Pausing during Amino Acid Limitation in Mammalian Cells.” Molecular Cell 71 (2): 229-243.e11. https://doi.org/10.1016/j.molcel.2018.06.041.

      Dittmar, Kimberly A., Michael A. Sørensen, Johan Elf, Måns Ehrenberg, and Tao Pan. 2005. “Selective Charging of tRNA Isoacceptors Induced by Amino-Acid Starvation.” EMBO Reports 6 (2): 151–57. https://doi.org/10.1038/sj.embor.7400341.

      Elf, Johan, Daniel Nilsson, Tanel Tenson, and Mans Ehrenberg. 2003. “Selective Charging of tRNA Isoacceptors Explains Patterns of Codon Usage.” Science (New York, N.Y.) 300 (5626): 1718–22. https://doi.org/10.1126/science.1083811.

      Gobet, Cédric, Benjamin Dieter Weger, Julien Marquis, Eva Martin, Nagammal Neelagandan, Frédéric Gachon, and Felix Naef. 2020. “Robust Landscapes of Ribosome Dwell Times and Aminoacyl-tRNAs in Response to Nutrient Stress in Liver.” Proceedings of the National Academy of Sciences of the United States of America 117 (17): 9630–41. https://doi.org/10.1073/pnas.1918145117.

      Hussmann, Jeffrey A., Stephanie Patchett, Arlen Johnson, Sara Sawyer, and William H. Press. 2015. “Understanding Biases in Ribosome Profiling Experiments Reveals Signatures of Translation Dynamics in Yeast.” Edited by Michael Snyder. PLOS Genetics 11 (12): e1005732. https://doi.org/10.1371/journal.pgen.1005732.

      Hwang, Jae-Yeon, and Allen R. Buskirk. 2017. “A Ribosome Profiling Study of mRNA Cleavage by the Endonuclease RelE.” Nucleic Acids Research 45 (1): 327–36. https://doi.org/10.1093/nar/gkw944.

      Juszkiewicz, Szymon, and Ramanujan S. Hegde. 2017. “Initiation of Quality Control during Poly(A) Translation Requires Site-Specific Ribosome Ubiquitination.” Molecular Cell 65 (4): 743-750.e4. https://doi.org/10.1016/j.molcel.2016.11.039.

      Li, Fajin, Jianhuo Fang, Yifan Yu, Sijia Hao, Qin Zou, Qinglin Zeng, and Xuerui Yang. 2023. “Reanalysis of Ribosome Profiling Datasets Reveals a Function of Rocaglamide A in Perturbing the Dynamics of Translation Elongation via eIF4A.” Nature Communications 14 (1): 553. https://doi.org/10.1038/s41467-023-36290-w.

      Mann, Gagandeep, Stephen Mora, Glory Madu, and Olasunkanmi A. J. Adegoke. 2021. “Branched-Chain Amino Acids: Catabolism in Skeletal Muscle and Implications for Muscle and Whole-Body Metabolism.” Frontiers in Physiology 12 (July):702826. https://doi.org/10.3389/fphys.2021.702826.

      Saikia, Mridusmita, Xiaoyun Wang, Yuanhui Mao, Ji Wan, Tao Pan, and Shu-Bing Qian. 2016. “Codon Optimality Controls Differential mRNA Translation during Amino Acid Starvation.” RNA (New York, N.Y.) 22 (11): 1719–27. https://doi.org/10.1261/rna.058180.116.

      Sharma, Puneet, Jie Wu, Benedikt S. Nilges, and Sebastian A. Leidel. 2021. “Humans and Other Commonly Used Model Organisms Are Resistant to Cycloheximide-Mediated Biases in Ribosome Profiling Experiments.” Nature Communications 12 (1): 5094. https://doi.org/10.1038/s41467-021-25411-y.

      Sheen, Joon-Ho, Roberto Zoncu, Dohoon Kim, and David M. Sabatini. 2011. “Defective Regulation of Autophagy upon Leucine Deprivation Reveals a Targetable Liability of Human Melanoma Cells In Vitro and In Vivo.” Cancer Cell 19 (5): 613–28. https://doi.org/10.1016/j.ccr.2011.03.012.

      Xiao, Fei, Chunxia Wang, Hongkun Yin, Junjie Yu, Shanghai Chen, Jing Fang, and Feifan Guo. 2016. “Leucine Deprivation Inhibits Proliferation and Induces Apoptosis of Human Breast Cancer Cells via Fatty Acid Synthase.” Oncotarget 7 (39): 63679–89. https://doi.org/10.18632/oncotarget.11626.

      Yu, Deyang, Nicole E. Richardson, Cara L. Green, Alexandra B. Spicer, Michaela E. Murphy, Victoria Flores, Cholsoon Jang, et al. 2021. “The Adverse Metabolic Effects of Branched-Chain Amino Acids Are Mediated by Isoleucine and Valine.” Cell Metabolism 33 (5): 905-922.e6. https://doi.org/10.1016/j.cmet.2021.03.025.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Worpenberg and colleagues investigated the translational consequences of branched-chain amino acid (BCAA) starvation in mouse cells. Limitation of individual BCAAs has been reported to cause codon-specific and global translational repression. In this paper, the authors use RNA-seq, ribosome profiling (Ribo-seq), proteomics, and tRNA charging assays to characterize the impacts of individual and combined depletion of leucine, isoleucine, and valine on translation. They find that BCAA starvation increases codon-specific ribosome dwell times, activates global translational stress responses and reduces global protein synthesis. They infer that this effect is due to decreased translation initiation and codon-specific translational stalling. They find that the effects of simultaneous depletion are non-additive. In valine and triple (valine, leucine, and isoleucine) depletion, they show that affected transcripts have a high density of valine codons early in their coding sequences, creating an "elongation bottleneck" that obscures the impact of starvation of other amino acids. Finally, they identify isoacceptor-specific differences in tRNA charging that help explain the codon-specific effects that they observe.

      We find the major findings convincing and clear. We find that some results are incompletely explained. We suggest an additional experiment and also have some minor comments that we hope will improve clarity and rigor.

      Major comments

      Figure 3O: In this figure and the associated text, the authors try to determine whether differences in protein degradation can explain why some proteins have higher ribosome density but lower proteomic expression. However, since this analysis relies on published protein half-lives from non-starvation conditions and on the assumption that protein synthesis has entirely stopped, we are not convinced it is informative for this experimental context. It does not distinguish between a model in which protein synthesis has been reduced by stalling and a model in which both protein synthesis and degradation rate have increased, which are both consistent with their Ribo-seq and proteomic data. To address this issue, the authors should either perform protein half-life measurements under their starvation conditions, or more clearly explain these two models in the text and acknowledge that they cannot distinguish between them.

      Minor comments

      Figure 1G: Why does intracellular valine seem to be less depleted under starvation conditions than intracellular leucine or isoleucine? Are the limits of detections different for different amino acids? The authors should acknowledge this discrepancy and comment on whether it has any implications for interpretation of their results.

      Figure 1H: These data do not appear to meet the assumptions for linear regression. We suggest either reporting a Spearman R correlation (as the data appears linear in rank but not absolute value), or remove it entirely - we think the plot without statistics is sufficient.

      Figure 2B: The in-text description of this figure states that "most" ISR genes show a "robust induction," but only three genes are shown in the figure, two of which are upregulated. The authors should instead specify that 2 out of the 3 genes profiled were robustly induced.

      Figure 2D: Please include the full, uncropped blots in the supplementary materials.

      Figure 2E: Swap the positions of the RPS6 and 4E-BP1 plots so they line up with their respective blots to make these figures easier to interpret. Authors should consider doing a one-way ANOVA and post-hoc analysis, if we correctly understand that they are making a conclusion about the difference between multiple groups in aggregate.

      Figure 4B: Panel A in this figure is very convincing, and these plots don't add additional information. The authors could consider removing them. If this panel stays in, we suggest removing the "mid index" plot, since it is never referenced in the text and doesn't seem relevant to the message of the figure.

      Figure 4E: Why is there a reduction in frequency of a Leu and a Val codon under Ile starvation?

      Figure 5G: There appears to be one Val codon early in the Hint1 transcript without much stalling under triple or valine starvation conditions. The authors should acknowledge this and comment on why this may be.

      Figure 5B: In the text referencing this figure, the authors state that "a high number of downregulated proteins with associated ribosome stalling sites did not show an overall decreased mean RPF count...as it would be expected from translation initiation defects, linking these stalling sites directly to proteomic changes." However, RPF is affected both by stalling (increases RPF) and initiation defects (decreases RPF). A gene with both stalling and decreased initiation may appear to have no RPF change. The data does suggest a contribution from stalling, but the authors should also acknowledge that reduced initiation may also be playing a role.

      Figure 5E: the black text on dark brown in the center of the Venn diagram is difficult to read. The diagram should either have a different color scheme, or the text in the center should be white instead of black for higher contrast.

      Supplementary Figure 1C: The ribosome dwell time data in this study is described as "highly correlated" with another published dwell time dataset, but the P and E site data do not seem strongly correlated. The authors should remove the word "highly."

      Supplementary Figure 3E: Not all of the highlighted codons in this figure are ones with prolonged dwell times. To clarify the point that dwell time change is not related to codon frequency, this figure should only highlight codons that have a significantly prolonged dwell time in at least one starvation condition.

      Supplementary Figure 5C: The gene Chop is mentioned in the main text when referencing this figure, but is absent from the heatmap.

      Supplementary Figure 7A: The authors could clarify this figure by adding additional language to either the figure panel or the figure legend specifying that the RPM metric being used comes from Ribo-seq.

      Supplementary Figure 7D: The metric used to describe the spatial relationship between the first valine and isoleucine codons in transcripts in this figure seems to be describing something conceptually similar to the stalling sites in Figure 5G, but uses a different metric. These figures would be easier to interpret if these spatial relationships were presented in a consistent way throughout the manuscript.

      Discussion:

      Reader understanding would be improved if the relevance of paragraphs were established in the first sentence. For instance, in the paragraphs about adaptive misacylation and posttranscriptional modifications, it is unclear until the end of the paragraph how these topics are relevant. Introducing the relevant aspects of the study (the fact that some starvation conditions have less severe effects and the observation about m6A-related mRNAs) at the beginning of these paragraphs would improve clarity.<br /> The authors should provide more information and speculation about possible physiological relevance of their findings, particularly about the way that the effects of triple starvation are highly valine-dependent. Are there physiological conditions under which starvation of all three BCAAs is more likely than starvation of one or two of them? If so, are there any reasons why a valine-based bottleneck might be advantageous?

      We hope the authors have found our comments thoughtful and useful. We welcome further discussion or clarification via email: Juliana Stanley (julianst@mit.edu) and Hannah LeBlanc (leblanch@mit.edu).

      Significance

      Nature and significance of the advance

      The main contribution of this work is to demonstrate that depletion of multiple amino acids simultaneously impacts translation elongation in ways that are not necessarily additive. These impacts can depend on the distribution of codons in a transcript. It adds to a growing body of work showing that essential amino acid starvation can cause codon-specific ribosome stalling. The authors suggest that the position-dependent stalling they observe could be a novel regulatory mechanism to alleviate the effects of multi-amino acid starvation. However, it is not fully clear from the paper what the significance of a valine-based regulatory adaptation to BCAA starvation is, or whether simultaneous starvation of all three BCAAs is of particular physiological relevance. The paper's primary contribution is mainly focused on the similarity between valine and triple BCAA starvation, and it provides limited insight into the effects of combined depletion of two BCAAs.

      Context of existing literature

      Although ribosome profiling does not distinguish between actively-elongating and stalled ribosomes, sites with higher read coverage, and thereby higher inferred dwell time, can be used to infer ribosome stalling (Ingolia 2011). Various downstream effects of essential amino acid depletion have been documented, such as leucine deficiency being sensed by mTORC1 via leucyl-tRNA synthetase (Dittmar 2005, Han 2012), and shared transcriptional responses among many amino acid depletion conditions (Tang 2015). These authors have previously measured the translational effects of nutrient stress using ribosome profiling (e.g., Gobet 2020), as have others (Darnell 2018, Kochavi et al. 2024). The present work represents the first study (to our knowledge) combining BCAA depletions, representing an incremental and useful contribution to our understanding of translational responses to stress conditions.

      Audience

      This work is of interest to investigators studying the response of human cells in stress conditions, such as in human disease, as well as investigators studying the basic biology of eukaryotic translational control.

      Reviewer expertise: mRNA decay and translation regulation in bacteria.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary and General Critique:

      This study examines the consequences of starvation for the BRCAAs, either singly, for Leu & Ile, or for all three simultaneously in HeLa cells on overall translation rates, decoding rates at each codon, and on ribosome density, protein expression, and distribution of ribosome stalling events across the CDS for each expressed gene. The single amino acid starvation regimes specifically reduce the cognate intracellular amino acid pool and lead to deacylation of at least a subset of the cognate tRNAs in a manner dependent on continuing protein synthesis. They also induce the ISR equally and decrease bulk protein synthesis equally in a manner that appears to occur largely at the initiation level for -Leu and -Val, judging by the decreased polysome:monsome ratio, but at both the initiation and elongation levels for -Ile-a distinction that remains unexplained. Only -Leu appears to down-regulate mTORC1 and TOP mRNA translation. There is a significant down-regulation of protein levels for 50-200 genes, which tend to be unstable in nutrient-replete cells, only a fraction of which are associated with reduced ribosome occupancies (RPFs measured by Ribo-Seq) on the corresponding mRNAs in the manner expected for reduced initiation, suggesting that delayed elongation is responsible for reduced protein levels for the remaining fraction of genes. All three single starvations lead to increased decoding times for a subset of the cognate "hungry" codons: CUU for -Leu, AUU and AUC for -Ile, and all of the Val codons, in a manner that is said to correspond largely to the particular tRNA isoacceptors that become deacylated, although this correspondence was not explained explicitly and might not be as simple as claimed. All three single starvations also evoke skewing of RPFs towards the 5' ends of many CDSs in a manner correlated with an enrichment within the early regions of the CDSs for one or more of the cognate codons that showed increased decoding times for -Ile (AUC codon) and -Val (GUU, GUC, and GUG), but not for -Leu-of which the latter was not accounted for. These last findings suggest that, at least for -Val and -Ile, delays in decoding N-terminal cognate codons cause elongating ribosomes to build-up early in the CDS. They go on to employ a peak calling algorithm to identify stalling sites in an unbiased way within the CDS, which are greatest in number for -Val, and find that Val codons are enriched in the A-sites (slightly) and adjacent 5' nucleotides (to a greater extent) for -Val starvation; and similarly for Ile codons in -Ile conditions, but not for -Leu starvation-again for unknown reasons. It's unclear why their called stalling sites have various other non-hungry codons present in the A sites with the cognate hungry codons being enriched further upstream, given that stalling should occur with the "hungry" cognate codon in the A site. The proteins showing down-regulation are enriched for stalling sites only in the case of the -Val starvation in the manner expected if stalling is contributing to reduced translation of the corresponding mRNA. It's unclear why this enrichment apparently does not extend to -Ile starvation which shows comparable skewing of RPFs towards the 5'ends, and this fact diminishes the claim that pausing generally contributes to reduced translation for genes with abundant hungry codons.<br /> All of the same analyses were carried out for the Double -Ile/-Leu and Triple starvations and yield unexpected results, particularly for the triple starvation wherein decoding times are increased only at Val codons, skewing of RPFs towards the 5' ends of CDSs is correlated only with an enrichment for Val codons within the early regions of the CDSs, and stall sites are enriched only for Val codons at nearly upstream sites, all consistent with the finding that only Val tRNAs become deacylated in the Triple regime. To explain why only Val tRNA charging is reduced despite the observed effective starvation for all three amino acids, they note first that stalling at Val codons is skewed towards the 5'ends of CDS for both -Val and triple starvations more so than observed for Ile or -Leu starvation, which they attribute to a greater frequency of Val codons vs Ile codons in the 5' ends of CDSs. As such, charged Val tRNAs are said to be consumed in translating the 5'ends of CDSs and the resulting stalling prevents ribosomes from reaching downstream Ile and Leu codons at the same frequencies and thus prevents deacylation of the cognate Ile and Leu tRNAs. It's unclear whether this explanation is adequate to explain the complete lack of Ile or Leu tRNA deacylation observed even when amino acid recycling by the proteasome is inhibited-a treatment shown to exacerbate deacylation of cognate tRNAs in the single amino acid starvations and of Val tRNA in the triple starvation. As such, the statement in the Abstract "Notably, we could show that isoleucine starvation-specific stalling largely diminished under triple starvation, likely due to early elongation bottlenecks at valine codons" might be too strong and the word "possibly" would be preferred over "likely". It's also unclear why the proteins that are down-regulated in the triple starvation are not significantly enriched for stalling sites (Fig. 5B) given that the degree of skewing is comparable or greater than for -Val. This last point seems to undermine their conclusion in the Abstract that "that many proteins downregulated under BCAA deprivation harbor stalling sites, suggesting that compromised elongation contributes to decreased protein output."<br /> In the case of the double -Ile/-Leu starvation, a related phenomenon occurs wherein decoding rates are decreased for only the AUU Ile codon and only the AAU Ile tRNA becomes deacylated; although in this case increased RPFs in the 5' ends are not correlated with enrichment for Ile or Leu codons and, although not presented, apparently stall sites are not associated with the Ile codon in the double starvation. In addition, stalling sites are not enriched in the proteins down-regulated by the double starvation. Moreover, because Ile codons are not enriched in the 5'ends of CDS, it doesn't seem possible to explain the selective deacylation of the single Ile tRNA observed in the double starvation by the same "bottleneck" mechanism proposed to explain selective deacylation of only Val tRNAs during the triple starvation. This is another reason for questioning their "bottleneck" mechanism.

      Specific comments (some of which were mentioned above):

      • The authors have treated cells with CHX in the Ribo-Seq experiments, which has been shown to cause artifacts in determining the locations of ribosome stalling in vivo owing to continued elongation in the presence of CHX (https://doi.org/10.1371/journal.pgen.1005732 ). The authors should comment on whether this artifact could be influencing some of their findings, particular the results in Fig. 5C where the hungry codons are often present upstream of the A sites of called stalling sites in the manner expected if elongation continued slowly following stalling in the presence of CHX.
      • p. 12: "These starvation-specific DT and ribosome density modulations were also evident at the individual transcript level, as exemplified by Col1a1, Col1a2, Aars, and Mki67 which showed persistent Val-codon-specific ribosome density increases but lost Ile-codon-specific increases under triple starvation (Supplementary Figure 3A-D). " This conclusion is hard to visualize for any but Val codons. It would help to annotate the relevant peaks of interest for -Ile starvation with arrows.
      • To better make the point that codon-specific stalling under BCAA starvation appears to be not driven by codon usage, rather than the analysis in Fig. 1H, wouldn't it be better to examine the correlation between increases in DT under the single amino acid starvation conditions and the codon frequencies across all codons?
      • p. 13, entire paragraph beginning with "Our RNA-seq and Ribo-seq revealed a general activation of stress response pathways across all starvations..." It is difficult to glean any important conclusions from this lengthy analysis, and the results do not appear to be connected to the overall topic of the study. If there are important conclusions here that relate to the major findings then these connections should be made or noted later in the Discussion. If not, perhaps the analysis should be largely relegated to the Supplemental material.
      • p. 15: "Together, these findings highlight that BCAA starvation triggers a combination of effects on initiation and elongation, with varying dynamics by amino acid starvation." I take issue with this statement as it appears that translation is reduced primarily at the initiation step for all conditions except -Ile. As noted above, these data are never menitioned in the DISCUSSION as to why only -Ile would show a marked elongation component to the inhibition whereas -Val gives the greatest amount of ribosome stalling.
      • I cannot decipher Fig. 4D and more detail is required to indicate the identify of each column of data.
      • In Fig. 4E, one cannot determine what the P values actually are, which should be provided in the legend to confirm statistical significance.
      • It's difficult to understand how the -Leu condition and the Double starvation can produce polarized RPFs (Fig. 4A) without evidence of stalling at the cognate hungry codons (Fig. 4E), despite showing later in Fig. 5A that the numbers of stall sites are comparable in those cases to that found for -Ile.
      • Fig. 5B: the P values should be given for all five columns, and it should be explained here or in the Discussion why the authors conclude that stalling is an important determinant for reduced translation when a significant correlation seems to exist only for the -Val condition and not even for the Triple condition.
      • p. 17: "Of note, in cases where valine or isoleucine codons were present just upstream (rather than at) the stalling position, we noted a strong bias for GAG (E), GAA (E), GAU (D), GAC (D), AAG (K), CAG (Q), GUG (V) and GGA (G) (Val starvation) and AAC (N), GAC (D), CUG (L), GAG (E), GCC (A), CAG (Q), GAA (E) and AAG (K) (Ile starvation) at the stalling site (Supplementary Figure 7B)." The authors fail to explain why these codons would be present in the A sites at stalling sites rather than the hungry codons themselves, especially since it is the decoding times of the hungry codons that are increased according to Fig. 1A-E. As suggested above, is this a CHX artifact?
      • Fig. 5D: P values for the significance, or lack thereof, of the different overlaps should be provided.
      • p. 17: "Nonetheless, when we examined entire transcripts rather than single positions, many transcripts that exhibited isoleucine-related stalling under Ile starvation also stalled under triple starvation, but at different sites along the CDS (Figure 5E). This finding is particularly intriguing, as it suggests that while Ile-starvation-specific stalling sites may shift under triple starvation, the overall tendency of these transcripts to stall remains." The authors never come back to account for this unexpected result.
      • It seems very difficult to reconcile the results in Fig. 5F with those in Fig. 4A, where similar polarities in RPFs are observed for -Ile and -Val in Fig, 4A but dramatically different distributions of stalling sites in Fig. 5F. More discussion of these discrepancies is required.
      • p. 18: " These isoacceptor-specific patterns correlate largely with the particular subsets of leucine and isoleucine codons that stalled (Figure 1A)." This correlation needs to be addressed for each codon-anticodon pair for all of the codons showing stalling in Fig. 1A.
      • p. 19: "For instance, in our double starvation condition, unchanged tRNA charging levels (Figure 6E) may result from a pronounced downregulation of global translation initiation, likely driven by the activation of stress responses (Figure 2), subsequently lowering the demand for charged tRNAs as it has been observed previously for Leu starvation 39. This seems at odds with the comparable down-regulation of protein synthesis for the Double starvation and -Leu and -Ile single starvations shown in Fig. 3C. Also, in the current study, Leu starvation does lower charging of certain Leu tRNAs.

      Significance

      The results here are significant in showing that starvation for a single amino acid does not lead to deacylation of all isoacceptors for that amino acid and in revealing that starvation for one amino acid can prevent deacylation of tRNAs for other amino acids, as shown most dramatically for the selective deacylation of only Val tRNAs in the triple BRCAA starvation condition. For the various reasons indicated above, however, I'm not convinced that their "bottleneck" mechanism is adequate to explain this phenomenon, especially in the case of the selective deacylation of Ile vs Leu tRNA in the Double starvation regime. It's also significant that deacylation leads to ribosome build-up near the 5'ends of CDS, which seems to be associated with an enrichment for the hungry codons in the case of Val and Ile starvation, but inexplicably, not for Leu or the Double starvations. This last discrepancy makes it hard to understand how the -Leu and Double starvations produce RPF buildups near the 5 ends of CDSs. In addition, the claim in the Discussion that "our data also highlight the importance of the codon positional context within mRNAs, indicating that where a codon is located within the CDS can influence both the extent of ribosomal stalling and overall translation efficiency during nutrient stress" overstates the strength of evidence that the stalling events lead to substantial decreases in translational efficiencies for the affected mRNAs, as the stalling frequency and decreased protein output are significantly correlated only for the -Val starvation, and the data in Fig. 3 D-H suggest that the reductions in protein synthesis generally occur at the level of initiation, even for -Val starvation, with a contribution from slow elongation only for -Ile-which is in itself difficult to understand considering that stalling frequencies are highest in -Val. Thus, while many of the results are very intriguing and will be of considerable interest to the translation field, it is my opinion that a number of results have been overinterpreted and that important inconsistencies and complexities have been overlooked in concluding that a significant component of the translational inhibition arises from the increased decoding times at hungry codons during elongation and that the selective deacylation of Val tRNAs in the Triple starvation can be explained by the "bottleneck" mechanism. The complexities and limitations of the data and their intepretations should be discussed much more thoroughly in the Discussion, which currently is devoted mostly to other phenomena often of tangential importance to the current findings. A suitably revised manuscript would clearly state the limitations and caveats of the proposed mechanisms and consider other possible explanations as well.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript described the translational responses to single and combined BCAA shortages in mouse cell lines. Using Ribo-seq and RNA-seq analysis, the authors found selective ribosome pausing at codons that encode the depleted amino acids, where the pausing at valine codons was prominent at both a single and triple starvations whereas isoleucine codons showed pausing only under a single depletion. They analyzed the mechanisms of the unexpected selective pausing and proposed that the positional codon usage bias could shape the ribosome stalling and tRNA charging patterns across different amino acids. They also examined the stress responses and the changes in the protein expression levels under BCAA starvation.

      The manuscript was well-written, and the findings are interesting, especially their model that positional codon usage bias could be a regulator of ribosome pausing and tRNA charging levels. Although different translational responses to distinct amino acid starvation have been widely documented, the positional codon usage bias is an interesting aspect. The manuscript's central message could have been made clearer. The authors may consider emphasizing this point more explicitly in the abstract. The rich multi-omics dataset in this work provides valuable resources for the translation field.

      Major comments

      1. The abstract may need to be revised since it is hard to immediately catch the authors' main point. If the authors regard this work as a resource paper, the current version is fine. But it could be better to point out the positional codon usages the authors found, which is a strong point of the current manuscript.
      2. Page 18 "Beyond these tRNA dynamics, our data also highlight the importance of the codon positional context within mRNAs, indicating that where a codon is located within the CDS can influence both the extent of ribosomal stalling and overall translation efficiency during nutrient stress."<br /> This idea is interesting. To what extent the authors think this could be generalized? The authors may discuss whether they think their proposed model is specific to the different ribosome stalling patterns between valine and isoleucine codons or generalized to other codon combinations. For example, the positional codon usage bias will be different among different organisms, and are there any previous reports on ribosome behaviors that align with their model? Even if the authors think this model can be applied to BCAA starvation, would it be possible to explain the different isoleucine codon responses between single and double starvation? The authors may discuss why the ribosome stalling at isoleucine AUU and AUC codons was slightly attenuated under double starvation. And how about the different leucine codon responses among single, double, and triple starvations, although the pausing is not as strong as isoleucine and valine codons? Experimental validation using artificial reporters carrying biased sequences may also be considered.
      3. Page 13 "Moreover, we noticed that DT changes extend beyond the ribosomal A-site, including the P-site, E-site, and even further positions (Supplementary Fig. 2A), consistent with other studies on single amino acid starvation 39 (Supplementary Fig. 2B-C)." Could the widespread DT changes be due to Ribo-DT pipeline they used or difficulties in offset determination? Indeed the authors showed that this feature was found in other datasets, but it seems that the datasets were processed and analyzed in the same way as their data. The original Ribo-DT paper (Gobet and Naef, 2022, Methods) also showed some widespread DT changes even from RNA-seq. Another analysis method like the codon subsequence abundant shift as a part of diricore analysis (Loayza-Puch et al., 2016, Nature) did not show that broad changed regions. The authors are encouraged to re-analyze the data sets using different methods.
      4. Page 13 "Intriguingly, only two of the three isoleucine codons (AUU and AUC) showed increased DTs upon Ile starvation (p < 0.01), while just one leucine codon (CUU) exhibited a modest but significant DT increase (p < 0.01) under Leu starvation (Figure 1A-B, Supplementary Figure 2A)." How can the authors explain the different strengths of ribosome pausing at Ile codons under Ile and double starvation? The AUA codon did not show any pausing under either of the starvation conditions. Throughout the manuscript, the authors mainly describe the difference between amino acids but it is desirable to discuss the codon-level difference as well.
      5. Page 13 "We examined the effects of single amino acid starvations (-Leu, -Ile and -Val), as well as combinations, including a double starvation of leucine and isoleucine (hereafter referred to as "double") and a starvation of leucine, isoleucine, and valine ("triple"), allowing us to identify potential non-additive effects." The different double starvations, isoleucine and valine, and leucine and valiene, will further support their hypothesis on the effects of the positional codon usage bias on ribosome pausing and tRNA charging patterns. Although this could be beyond the scope of the current manuscript, the authors are encouraged to provide a rationale for the chosen combination.

      Minor comments

      Page 16 "these results imply that BCAA deprivation lowers protein output through multiple pathways: a combination of reduced initiation, direct elongation blocks (stalling), and possibly an increased proteolysis" This conclusion is totally right but may be too general. Could the authors summarize BCAA-specific features of the events including reduced initiation, stalling, and proteolysis that all contribute to protein outputs? This is not well discussed in the latter sections including Discussion.

      Significance

      The manuscript was well-written, and the findings are interesting, especially their model that positional codon usage bias could be a regulator of ribosome pausing and tRNA charging levels. Although different translational responses to distinct amino acid starvation have been widely documented, the positional codon usage bias is an interesting aspect. The manuscript's central message could have been made clearer. The authors may consider emphasizing this point more explicitly in the abstract. The rich multi-omics dataset in this work provides valuable resources for the translation field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Prior to the point-by-point response to the reviewer, we would like to sincerely thank all the peer reviewers for their overwhelmingly positive comments and helpful suggestions. The recommendations have undoubtedly improved our initial submission, and we have done our best to incorporate as many of the suggestions as possible.

      Reviewer #1* (Evidence, reproducibility and clarity (Required)): *

      *Jones et al. have submitted a manuscript detailing the role of Coenzyme A in the regulation of macrophage polarization. Overall, the manuscript is well designed, and the conclusions are well supported by the data. I find no major or minor deficiencies that need to be corrected. *

      * Reviewer #1 (Significance (Required)): *

      For decades the immunology community has boldly stated that mitochondrial metabolism not only provides the bioenergetics for cell expansion but also dictates cell fate. This has been especially true for fatty acid beta oxidation. Macrophage, T-cell and B-cell polarization have all been shown to require FAO for their polarization, but all based on one inhibitor. NONE of these observations hold up with more rigorous experimentation. The Divakaruni group has previously suggested that intracellular CoA homeostasis was the driver of macrophage differentiation as they could reverse the inhibitory effects by providing heroic levels of CoA extracellularly. Here, they have clarified the role of CoA. Intracellular CoA does not affect macrophage polarization/differentiation. This was done with cleaver manipulation of the CoA pools. Rather, extracellular CoA can act as a weak TLR4 ligand. This work nicely clarifies their previous work and further demonstrates a role for this metabolite as an endogenous activator of type 1 macrophages.

      We are thrilled by the positive comments about our work, and we are grateful the reviewer found our submission to be clarifying for the field and significant in the larger context of immunometabolism research.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *This is a fairly straightforward manuscript that indicated CoA acts as a "weak" TLR4 agonist and primes macrophages for alternative activation. Overall, the experiments are well done and clear enough. There are two major issues that need to be addressed: *

      We thank the reviewer for their positive comments regarding the quality and clarity of our work.

      1. *Previous work has shown the following pathway: LPS>IL10>STAT3>IL4Ra>>>increased responsiveness to IL4/IL13 and increased expression of M2 associated markers (please note, this pathway does not apply to Arg1, often erroneously associated with M2 macrophages - LPS induces Arg1 far more than IL4 and this is independent of the STAT6 pathway - Dichtl et al., Science Advances and El Kasmi et al. Nature Immunology, and others). This pathway was first described in Lang et al. 2002 J. Immunol. Subsequently, other groups showed IL6 (Jens Brüning) and OSM (Carl Richards) do the same thing, which is not surprising given that they are STAT3 activators. Thus, Il4ra is a STAT3 target gene; this also makes sense in the kinetic evolution of macrophages from inflammatory to tissue reparative (if they survive). In my view, the authors have most likely found the same pathway. In Jones, expression of the IL4Ra was not quantified. Thus, the pathway described above needs to be accounted for. It may not apply here but seems the easiest explanation of the data. *

      This is an excellent and important experiment suggested by the reviewer, and we address this in our revised Supplemental Figure 5. To determine whether the effect of CoA can be explained simply by a STAT3-mediated effect on the IL-4 receptor, we treated cells with the well-characterized STAT3 inhibitor Napabucasin and measured whether CoA could enhance the macrophage IL-4 response. Two results are clear from the data:

      • Treatment with Napabucasin reduced the expression of IL-4-linked cell surface markers and the IL-4 target gene Ccl8. This serves as an important control consistent with the Il4ra gene being a STAT3 target that increases IL-4 responsiveness.
      • Despite STAT3 inhibition and a reduced IL-4 response, CoA provision still augmented the IL-4-induced expression of Ccl8 and the percentage of CD206+/CD301+ cells, indicating a STAT3-independent mechanism. The result aligns with our ATAC-Seq data in Figure 6 that shows broad changes in chromatin accessibility that cannot be completely explained by expression-level changes in the IL-4 receptor.

      *Can the authors come up with a meaningful in vivo experiment to corroborate their data. Pantothenate-deficient mice have many phenotypes (not fully explored at all - PMID 31918006, for example) and pantothenate metabolism can be manipulated in different ways. Obviously, a complex in vivo experiment is not feasible here. But this should be discussed. What happens in human macrophages, where "polarization" is a completely different beast? *

      We thank the reviewer for these thoughtful comments, and address the questions regarding in vivo proof-of-concept and polarization of human macrophages separately:

      • Regarding the question of whether CoA can enhance the phenotype of IL-4-activated human macrophages, this is an excellent suggestion and we have added the data as Figure 1h. Indeed, Coenzyme A dramatically amplifies expression of the human IL-4 responsive genes CCL17, TGM2, and PDCD1LG2 (similarly to mouse macrophages). The result substantially expands the significance of our work by showing the phenotype is reproducible in both mouse and human macrophages – unlike many immunometabolic phenotypes – and we thank the reviewer again for suggesting this experiment.
      • With respect to an in vivo experiment to corroborate our data, we entirely agree with the reviewer regarding both the importance, but also the difficulty in interpretation, of an experiment genetically manipulating CoA synthesis in vivo. As they have suggested, we raise these issues in the discussion on Lines 370-377 of the revised manuscript. Here, we note the following points:
      • Wherever possible/appropriate (e.g. Figures 1g, 3f&g, 5g&h), we have sought to corroborate our in vitro findings with in vivo/ex vivo proofs-of-concept.
      • Studying immune phenotypes in pantothenate-deficient mice would be an exciting experiment in principle, but difficult to interpret if conducted. As noted by the reviewer in the work from Drs. Rock and Jackowski, knockout of one of four isoforms of pantothenate kinase (PANK) shows mild phenotypes consistent with compensation across isoforms for CoA provision. Global double knockout of PANK1 and PANK2, however, is postnatally lethal. Regardless, a tissue-specific double knockout in myeloid cells is unlikely to show a phenotype given our results showing that manipulating intracellular CoA levels in BMDMs does not alter the IL-4 response (Figs. 2h-j).
      • Given the established role of CoA in postnatal development, it would be difficult to attribute any immunologic phenotypes in genetically modified mice to direct effects of CoA as a metabolic DAMP as opposed to indirect effects from a chronically altered immune system.

      Reviewer #2 (Significance (Required)): *This is a fairly straightforward manuscript that indicated CoA acts as a "weak" TLR4 agonist and primes macrophages for alternative activation. Overall, the experiments are well done and clear enough.

      *

      We reiterate our gratitude for the comments on the quality and clarity of our work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: In this manuscript on enhancement of mIL-4 polarization by exogenous CoA, the authors follow up on their previous studies that had shown a correlation between Etomoxir-driven block in mIL-4 and a reduction of intracellular CoA levels. The results obtained (lack of enhancement of IL-4-induced changes in oxidative phosphorylation and glycolysis; lack of impact of pharmacological decrease/increase of intracellular CoA levels) led them to discard their initial hypothesis. Instead, the presence of a proinflammatory gene signature in macrophages treated with IL-4+CoA triggered experiments testing the involvement of TLR-Myd88 signaling and the identification of CoA as a weak agonist for TLR4 (which is consistent with a preprint manuscript posted in 2022 by others and showing induction of proinflammatory gene express in a TLR2/4-dependent manner).

      • Significance: Overall, these results are novel and interesting, although the use of yeast-derived CoA preparations raises a question about the contribution of contaminants that is only partially controlled by data obtained with a synthetic CoA. Regarding a biological role for CoA in macrophage biology in vivo, the authors propose that CoA may act as a DAMP upon release from dying/dead cells and thereby modify transcriptional polarization of m(IL-4). I have several comments related to specific experimental conditions and interpretation that should be addressed. Most importantly, the key findings of the manuscript should be demonstrated using synthetic CoA as described in comment #5. *

      We are heartened that the reviewer found our initial submission to be novel and interesting, and are grateful for their suggestions to reinforce our existing data with more studies comparing yeast-derived and synthetically-derived Coenzyme A. We have done our best to address each of the individual questions below:

      Major comments:

      1. *Increasing/decreasing intracellular CoA levels does not alter IL-4-induced CD206 expression (Fig. 2i/j. However, the impact of CoA addition to mIL-4 is stronger for Ccl8 and Mgl2 mRNA (Fig. 1a) than for the CD206+ cell fraction (Fig. 1d). Therefore, it would be better (higher sensitivity) to include expression of these genes as readout after CPCA/PZ-2891 treatment. *

      This is a helpful suggestion, and we have now conducted gene expression studies to complement our flow cytometry and mass spectrometry studies while manipulating the intracellular CoA pool. In line with our previous work, neither CPCA (which decreases intracellular free CoA) or PZ-2891 (which increases intracellular free CoA) meaningfully alter expression of IL-4-linked genes including Ccl8 or Mgl2. In fact, the only (statistically insignificant) trend refutes the hypothesis, as gene expression with CPCA leads to marginally increased gene expression. These results are now included in Supplemental Figure S2f. We thank the reviewer for this helpful suggestion, as it has strengthened our conclusion that intracellular CoA levels do not adjust the macrophage IL-4 response.

      • The CoA-induced proinflammatory gene expression in Fig. 3c is relatively weak (e.g. compared to LPS). The authors use CoA throughout the manuscript at a concentration of 1 mM, and we do not know how much of it is required to cause an effect at all. Therefore, dose-response curves for the stimulation of macrophages with titrated amounts of CoA should be provided. In addition, *

      We thank the reviewer for bringing up this point so we could clarify and add to our existing data. We should note that Supplemental Figures 1b&c of our previous submission (and resubmitted manuscript) detail a concentration-response curve showing that at little as 62.5 mM CoA – the lowest concentration tested – was sufficient to enhance IL-4 cell surface marker expression.

      However, it is an excellent suggestion as the reviewer notes, to conduct a similar concentration-response to determine if this lines up with CoA inducing a pro-inflammatory response. The full data set is presented in the answer to reviewer question 4 (comparing CoA purchased from Sigma vs. Avanti Polar Lipids), though we now show in Supplemental Figure S3 that 62.5 mM CoA is sufficient to elicit a pro-inflammatory response. Though it is indeed a weak effect as noted by the reviewer, our data suggest that the relatively mild stimulus is crucial for the effect. Given the results with the TLR3 agonist Poly I:C (Figure 5), which engages a Type 1 interferon response, strong TLR4 agonists that engage the TRIF/Type I interferon arm of the TLR4 response are likely to blunt or block the IL-4 response.

      • Related question: we are informed that the concentration of CoA in the mitochondrial matrix is 5mM, whereas cytosol contains 100µM. For CoA to act as DAMP, I would like to know the concentration of it in supernatants of cell cultures (live vs. dying/dead cells) and from tissues. *

      This is an important point brought up by the reviewer, and we agree that the implicit issue raised (i.e. “do the concentrations of CoA required to see an effect reconcile with a physiological role as a DAMP?”) should be more thoroughly addressed in the manuscript. Tissue concentrations of free CoA (in ng/mg tissue) are well established for mice and range from >100 nmol/g tissue (liver, heart, brown adipose tissue) to Nonetheless, the reviewer’s larger point is very well reasoned, and we address it in the following ways in the discussion on __Lines 378-391. __

      • In light of the reviewer’s comment, we now mention specific instances in the discussion where CoA acting as a DAMP may reasonably play a physiological role (e.g. acetaminophen-induced acute liver injury or other forms of sterile liver injury given that DAMPs are known to be important factors and liver tissue contains relatively high concentrations of CoA).
      • Although cytoplasmic concentrations of CoA may only be 50-100 mM, our work establishes a framework for how ubiquitous metabolic co-factors can activate pattern recognition receptors. Put another way, although CoA itself may not be a physiologically relevant DAMP, discovering this pathway could inform how other nucleotide or nucleoside analogs (e.g. adenine- or adenosine-containing molecules present at millimolar concentrations) exert their effects on innate immunity.
      • Our newly obtained data with HMDMs (Figure 1h) shows that the CoA response in human macrophages – boosting IL-4-linked gene expression by 10-100X – may be much stronger than the 1.5-5X effect observed in mouse BMDMs. As such, it is exciting to speculate that CoA may have a more potent effect on the IL-4 response in humans relative to mice. We trust the reviewer understands the limitations of obtaining human macrophages that preclude conducting a thorough concentration-response analysis given the restrictions of a manuscript revision.
      • It is very good that the authors validate the findings obtained using the yeast-derived CoA with the synthetic molecule. It is very conceivable that the 15% contaminating substances in the yeast CoA could be causing the observed changes in m(IL-4). The fact that synthetic CoA has higher activity in proinflammatory gene expression by BMM (Suppl. Fig. S3) is reassuring, however, it raises the question why this is the case. One possibility is that the concentrations of the different CoA preps cannot directly be compared. Therefore, dose response curves should also be provided for synthetic CoA. *

      This is an astute observation by the reviewer and we thank them for reading our manuscript with such detailed attention to pick this up. We are reassured that the reviewer shares our interpretation that the effect of CoA is not due to a contaminating TLR4 agonist in the yeast-derived preparation (from Sigma-Aldrich; ~85% pure) given a negative Limulus Test (Supplemental Figure S4b). Moreover, the synthetically-derived preparation (from Avanti Polar Lipids; ~99% pure) yields a stronger TLR4 response.

      An exploration of the follow-on question regarding why the effect is greater than 15% is presented below. These experiments have been added to Supplemental Figure S4c&d. The summary of our data suggests the individual concentrations indeed cannot be compared – matched concentrations of synthetic Avanti CoA have greater than a 15% effect than yeast-derived Sigma CoA. There are likely multiple factors that could explain this, some of which are listed below.

      • The physiological effect of a TLR agonist need not be linear with its concentration, as demonstrated by the sigmoidal calibration curves for the TLR-expressing HEK-blue cells (Figures 4b, S4a). This likely does not explain the dramatic difference between the two CoA preparations but is worth noting.
      • While we have determined that the 15% contaminating substances in the yeast-derived CoA are not causing the observed changes in the IL-4 response, it is formally possible that there are contaminating substances blunting the pro-inflammatory response and therefore limiting the effect of CoA purchased from Sigma-Aldrich relative to that from Avanti Polar Lipids. Importantly, however, our data in response to Reviewer Question #5 show there is no difference in amplifying the IL-4 response between the yeast- and synthetically-derived CoA.
      • The difference in activity of yeast and synth. CoA could also be caused by the additional biologically active molecules in the yeast CoA. Therefore, it is important to show that the key findings in the paper (enhancement of m(IL-4) associated gene expression and CD206+ upregulation in vitro and in vivo) are also induced by synth. CoA. This is even more important in the context of the Myd88-independence of CD206+ upregulation in BMM treated with CoA (Suppl. Fig. S4). The experiment should be repeated with synth. CoA. If the enhancement of CD206+ cells induced by CoA is indeed unchanged in Myd88 KO BMM, then the title of the manuscript "CoA enhances alternative macrophage activation via Myd88" would not be supported by the data and needed to be changed. Activation of the TLR4 reporter cell line should also be tested using the synth. CoA molecule.*

      We are grateful for this suggestion by the reviewer to further cement the idea that our observation of CoA enhancing the macrophage IL-4 response was not due to a contaminant in the Sigma-Aldrich CoA preparation. The reviewer makes a few points in this question which we address individually here.

      • The suggestion to confirm that the CoA-induced enhancement of M(IL-4) is not due to a contaminating substance in the Sigma-Aldrich CoA is excellent and necessary. Here we show that synthetically derived CoA (99% pure, purchased from Avanti Polar Lipids) quantitatively reproduces the effect from yeast-derived CoA from Sigma-Aldrich in Supplemental Figure S4e. The response is noteworthy because synthetic CoA has profoundly stronger pro-inflammatory response than yeast-derived CoA, yet both have a similar effect on augmenting M(IL-4). This suggests that any appropriate pro-inflammatory response – irrespective of the relative strength or weakness – is sufficient to maximize the effect. This can also be observed with the range of MyD88-linked TLR agonists used in Figures 5 and S6a.
      • Similarly, we also conducted experiments to show that the effect of synthetic CoA on M(IL-4) is independent of MyD88 similarly to yeast-derived CoA. These data are present in Supplemental Figure S6b&c. Here again, we should note that the effect of synthetic CoA is quantitatively similar to the effect of yeast CoA and Imiquimod (Supplemental Figure S6a).
      • Activation of the TLR4 reporter cell line is available in Supplemental Figure S4c.
      • Regarding the title of the manuscript, we acknowledge that we struggled a bit with how to frame our findings. Importantly, our findings support a model where (i) CoA provision enhances the IL-4 response not via metabolic changes but rather by acting as a mild pro-inflammatory stimulus, and (ii) MyD88 signaling augments the IL-4 response. We should also note that our findings simply show that CoA does not exclusively enhance the IL-4 response via MyD88 signaling, and there may be other redundant pathways (similarly to MyD88 agonist imiquimod but unlike the MyD88 agonists Pam3-CSK4 and low concentrations of LPS). We are open to working journal editors to strike the right balance of scientific accuracy and representation of the work when deciding on a final title.
      • The results from the tumor model in Fig. 5 are presented to show a stronger tumor-promoting effect of m(IL-4) stimulated with Pam3. However, the variability of the data is high and 2 out of 6 mice in the +Pam3 group appear to actually have a lower tumor weight than the control mice. Therefore, these data are quite superficial and preliminary, and would benefit from a replicate experiment. Furthermore, for the evaluation of CoA as a biologically relevant DAMP, it would be important to know whether CoA-treated m(IL-4) show the same tumor-promoting effect in vivo as Pam3. *

      We thank the reviewer for their comment, and agree that our in vivo work is indeed preliminary. Our goal with this report was to focus on the initial discovery of this molecular pathway and its first, broad characterization using a range of techniques (e.g. in vivo outcomes, ATAC-Seq, etc.), many of which can spur more detailed follow-up studies for future papers. As detailed in the manuscript discussion (Lines 415-419), future work beyond our initial discovery is warranted to thoroughly explore the physiological outcomes of CoA as a metabolic DAMP in relevant model systems such as acute liver injury. As an initial proof-of-concept to show that MyD88 signaling can enhance alternative activation, however, we believe our two discrete experiments (sterile inflammation and tumor formation) are sufficient to indicate the phenotype is likely relevant in animal models. In vivo syngeneic tumor models display natural variability in tumor size due to differences in implantation efficiency, host immune responses, and tumor-intrinsic growth kinetics. Nonetheless, our statistical analysis demonstrates that, with high confidence, that the observed differences are reproducible and not attributable to random variation.

      Minor comments:

        • Fig. 1b: where the gates for CD206/CD301 set based on isotype control stainings? *

      We thank the reviewer for pointing out this oversight in our methods. The gates were indeed set on isotype control stains, and this is now mentioned in Lines 519-521 of the revised manuscript.

      The formatting not cohesive m(IL-4) vs. M(IL-4)

      Again, this is an embarrassing oversight on our part and we have done our very best to copy edit the piece and remove any inconsistencies and errors.

      *Methods: primer sequences are not shown. They should be provided. *

      We thank the reviewer for pointing this out, and now include all primer sequences used in Supplemental Table 1 of the revised manuscript.

      Description of flowcytometry (L/D staining after surface? No washing steps after addition of L/D staining)

      We thank the reviewer for pointing out another oversight in our methods, and have provided a more detailed description of the flow cytometric analysis in Lines 509-521 of the revised manuscript.

      Statistics: the methods section states that variability is indicated by SD, but the Figure legends always mention SEM. Please correct.

      We are grateful for the reviewer’s helpful attention to detail, and have corrected the methods to line up with the figure legends.

      *A multitude of typos and editorial inconsistencies (e.g. spelling of m(IL-4), punctation and capitalization) should be corrected/streamlined. *

      We are grateful for the reviewer’s helpful attention to detail, and have done our best to copy edit the manuscript prior to resubmission.

      Reviewer #3 (Significance (Required)):

      strengths: I like that the authors follow up their previous work on Etomoxir and CoA, now finding again an unexpected twist in how the effect on m(IL-4) is brought about. This makes the story more complicated, but is important to get to a more precise and realistic understanding of metabolic and transcriptomic regulation and how they are interconnected (or not). In addition, the use of a relatively broad set of methods including ATACseq and mass spectrometry is a strength.

      weakness: the use of the not very pure yeast derived CoA prep, which is controlled for induction of proinflammatory cytokines by one experiment with synth. CoA. This validation needs to be expanded (see comments above) to substantiate the main message of the manuscript.

      The scope of the manuscript is quite focussed on the mechanism of CoA enhanced m(IL-4). The finding that CoA appears not to act by changing intracellular macrophage metabolism but instead after its release by activation TLR4 widens the scope and suggests a new function for CoA as DAMP. This aspect would need to be further substantiated to be convincing.

      Audience: scientists working at the intersection between metabolism and innate immunity will be interested in the results.

      We thank the reviewer for their kind comments regarding the precision, credibility, and breadth of our manuscript. We hope they find our revised manuscript an improvement over our previous submission regarding both the new experiments and modified text. The comments have undoubtedly improved our manuscript and we are grateful to the reviewer for the considerable effort they put into reading our submission.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript on enhancement of mIL-4 polarization by exogenous CoA, the authors follow up on their previous studies that had shown a correlation between Etomoxir-driven block in mIL-4 and a reduction of intracellular CoA levels. The results obtained (lack of enhancement of IL-4-induced changes in oxidative phosphorylation and glycolysis; lack of impact of pharmacological decrease/increase of intracellular CoA levels) led them to discard their initial hypothesis. Instead, the presence of a proinflammatory gene signature in macrophages treated with IL-4+CoA triggered experiments testing the involvement of TLR-Myd88 signaling and the identification of CoA as a weak agonist for TLR4 (which is consistent with a preprint manuscript posted in 2022 by others and showing induction of proinflammatory gene express in a TLR2/4-dependent manner).

      Significance:

      Overall, these results are novel and interesting, although the use of yeast-derived CoA preparations raises a question about the contribution of contaminants that is only partially controlled by data obtained with a synthetic CoA. Regarding a biological role for CoA in macrophage biology in vivo, the authors propose that CoA may act as a DAMP upon release from dying/dead cells and thereby modify transcriptional polarization of m(IL-4). I have several comments related to specific experimental conditions and interpretation that should be addressed. Most importantly, the key findings of the manuscript should be demonstrated using synthetic CoA as described in comment #5.

      Major comments:

      1. Increasing/decreasing intracellular CoA levels does not alter IL-4-induced CD206 expression (Fig. 2i/j. However, the impact of CoA addition to mIL-4 is stronger for Ccl8 and Mgl2 mRNA (Fig. 1a) than for the CD206+ cell fraction (Fig. 1d). Therefore, it would be better (higher sensitivity) to include expression of these genes as readout after CPCA/PZ-2891 treatment.
      2. The CoA-induced proinflammatory gene expression in Fig. 3c is relatively weak (e.g. compared to LPS). The authors use CoA throughout the manuscript at a concentration of 1 mM, and we do not know how much of it is required to cause an effect at all. Therefore, dose-response curves for the stimulation of macrophages with titrated amounts of CoA should be provided. In addition,
      3. Related question: we are informed that the concentration of CoA in the mitochondrial matrix is 5mM, whereas cytosol contains 100µM. For CoA to act as DAMP, I would like to know the concentration of it in supernatants of cell cultures (live vs. dying/dead cells) and from tissues.
      4. It is very good that the authors validate the findings obtained using the yeast-derived CoA with the synthetic molecule. It is very conceivable that the 15% contaminating substances in the yeast CoA could be causing the observed changes in m(IL-4). The fact that synthetic CoA has higher activity in proinflammatory gene expression by BMM (Suppl. Fig. S3) is reassuring, however, it raises the question why this is the case. One possibility is that the concentrations of the different CoA preps cannot directly be compared. Therefore, dose response curves should also be provided for synthetic CoA.
      5. The difference in activity of yeast and synth. CoA could also be caused by the additional biologically active molecules in the yeast CoA. Therefore, it is important to show that the key findings in the paper (enhancement of m(IL-4) associated gene expression and CD206+ upregulation in vitro and in vivo) are also induced by synth. CoA. This is even more important in the context of the Myd88-independence of CD206+ upregulation in BMM treated with CoA (Suppl. Fig. S4). The experiment should be repeated with synth. CoA. If the enhancement of CD206+ cells induced by CoA is indeed unchanged in Myd88 KO BMM, then the title of the manuscript "CoA enhances alternative macrophage activation via Myd88" would not be supported by the data and needed to be changed. Activation of the TLR4 reporter cell line should also be tested using the synth. CoA molecule.
      6. The results from the tumor model in Fig. 5 are presented to show a stronger tumor-promoting effect of m(IL-4) stimulated with Pam3. However, the variability of the data is high and 2 out of 6 mice in the +Pam3 group appear to actually have a lower tumor weight than the control mice. Therefore, these data are quite superficial and preliminary, and would benefit from a replicate experiment. Furthermore, for the evaluation of CoA as a biologically relevant DAMP, it would be important to know whether CoA-treated m(IL-4) show the same tumor-promoting effect in vivo as Pam3.

      Minor comments:

      1. Fig. 1b: where the gates for CD206/CD301 set based on isotype control stainings?
      2. The formatting not cohesive m(IL-4) vs. M(IL-4)
      3. Methods: primer sequences are not shown. They should be provided.
      4. Description of flowcytometry (L/D staining after surface? No washing steps after addition of L/D staining)
      5. Statistics: the methods section states that variability is indicated by SD, but the Figure legends always mention SEM. Please correct.
      6. A multitude of typos and editorial inconsistencies (e.g. spelling of m(IL-4), punctation and capitalization) should be corrected/streamlined.

      Significance

      Strengths: I like that the authors follow up their previous work on Etomoxir and CoA, now finding again an unexpected twist in how the effect on m(IL-4) is brought about. This makes the story more complicated, but is important to get to a more precise and realistic understanding of metabolic and transcriptomic regulation and how they are interconnected (or not). In addition, the use of a relatively broad set of methods including ATACseq and mass spectrometry is a strength.

      Weakness: the use of the not very pure yeast derived CoA prep, which is controlled for induction of proinflammatory cytokines by one experiment with synth. CoA. This validation needs to be expanded (see comments above) to substantiate the main message of the manuscript.

      The scope of the manuscript is quite focussed on the mechanism of CoA enhanced m(IL-4). The finding that CoA appears not to act by changing intracellular macrophage metabolism but instead after its release by activation TLR4 widens the scope and suggests a new function for CoA as DAMP. This aspect would need to be further substantiated to be convincing.

      Audience: scientists working at the intersection between metabolism and innate immunity will be interested in the results.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a fairly straightforward manuscript that indicated CoA acts as a "weak" TLR4 agonist and primes macrophages for alternative activation. Overall, the experiments are well done and clear enough. There are two major issues that need to be addressed:

      1. Previous work has shown the following pathway: LPS>IL10>STAT3>IL4Ra>>>increased responsiveness to IL4/IL13 and increased expression of M2 associated markers (please note, this pathway does not apply to Arg1, often erroneously associated with M2 macrophages - LPS induces Arg1 far more than IL4 and this is independent of the STAT6 pathway - Dichtl et al., Science Advances and El Kasmi et al. Nature Immunology, and others). This pathway was first described in Lang et al. 2002 J. Immunol. Subsequently, other groups showed IL6 (Jens Brüning) and OSM (Carl Richards) do the same thing, which is not surprising given that they are STAT3 activators. Thus, Il4ra is a STAT3 target gene; this also makes sense in the kinetic evolution of macrophages from inflammatory to tissue reparative (if they survive). In my view, the authors have most likely found the same pathway. In Jones, expression of the IL4Ra was not quantified. Thus, the pathway described above needs to be accounted for. It may not apply here but seems the easiest explanation of the data.
      2. Can the authors come up with a meaningful in vivo experiment to corroborate their data. Pantothenate-deficient mice have many phenotypes (not fully explored at all - PMID 31918006, for example) and pantothenate metabolism can be manipulated in different ways. Obviously, a complex in vivo experiment is not feasible here. But this should be discussed. What happens in human macrophages, where "polarization" is a completely different beast?

      Significance

      This is a fairly straightforward manuscript that indicated CoA acts as a "weak" TLR4 agonist and primes macrophages for alternative activation. Overall, the experiments are well done and clear enough.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Jones et al. have submitted a manuscript detailing the role of Coenzyme A in the regulation of macrophage polarization. Overall, the manuscript is well designed, and the conclusions are well supported by the data. I find no major or minor deficiencies that need to be corrected.

      Significance

      For decades the immunology community has boldly stated that mitochondrial metabolism not only provides the bioenergetics for cell expansion but also dictates cell fate. This has been especially true for fatty acid beta oxidation. Macrophage, T-cell and B-cell polarization have all been shown to require FAO for their polarization, but all based on one inhibitor. NONE of these observations hold up with more rigorous experimentation. The Divakaruni group has previously suggested that intracellular CoA homeostasis was the driver of macrophage differentiation as they could reverse the inhibitory effects by providing heroic levels of CoA extracellularly. Here, they have clarified the role of CoA. Intracellular CoA does not affect macrophage polarization/differentiation. This was done with cleaver manipulation of the CoA pools. Rather, extracellular CoA can act as a weak TLR4 ligand. This work nicely clarifies their previous work and further demonstrates a role for this metabolite as an endogenous activator of type 1 macrophages.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02922

      Corresponding author(s): Christian Specht

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      • *

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      • *

      We thank the reviewers for their thorough and constructive evaluation of our work. We have revised the manuscript carefully and addressed all the criticisms raised, in particular the issues mentioned by several of the reviewers (see point-by-point response below). We have also added a number of explanations in the text for the sake of clarity, while trying to keep the manuscript as concise as possible.

      • *

      In our view, the novelty of our research is two-fold. From a neurobiological point of view, we provide conclusive evidence for the existence of glycine receptors (GlyRs) at inhibitory synapses in various brain regions including the hippocampus, dentate gyrus and sub-regions of the striatum. This solves several open questions and has fundamental implications for our understanding of the organisation and function of inhibitory synapses in the telencephalon. Secondly, our study makes use of the unique sensitivity of single molecule localisation microscopy (SMLM) to identify low protein copy numbers. This is a new way to think about SMLM as it goes beyond a mere structural characterisation and towards a quantitative assessment of synaptic protein assemblies.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      In this manuscript, the authors investigate the nanoscopic distribution of glycine receptor subunits in the hippocampus, dorsal striatum, and ventral striatum of the mouse brain using single-molecule localization microscopy (SMLM). They demonstrate that only a small number of glycine receptors are localized at hippocampal inhibitory synapses. Using dual-color SMLM, they further show that clusters of glycine receptors are predominantly localized within gephyrin-positive synapses. A comparison between the dorsal and ventral striatum reveals that the ventral striatum contains approximately eight times more glycine receptors and this finding is consistent with electrophysiological data on postsynaptic inhibitory currents. Finally, using cultured hippocampal neurons, they examine the differential synaptic localization of glycine receptor subunits (α1, α2, and β). This study is significant as it provides insights into the nanoscopic localization patterns of glycine receptors in brain regions where this protein is expressed at low levels. Additionally, the study demonstrates the different localization patterns of GlyR in distinct striatal regions and its physiological relevance using SMLM and electrophysiological experiments. However, several concerns should be addressed.

      The following are specific comments:

      1. Colocalization analysis in Figure 1A. The colocalization between Sylite and mEos-GlyRβ appears to be quite low. It is essential to assess whether the observed colocalization is not due to random overlap. The authors should consider quantifying colocalization using statistical methods, such as a pixel shift analysis, to determine whether colocalization frequencies remain similar after artificially displacing one of the channels. *Following the suggestion of reviewer 1, we re-analysed CA3 images of Glrbeos/eos hippocampal slices by applying a pixel-shift type of control, in which the Sylite channel (in far red) was horizontally flipped relative to the mEos4b-GlyRβ channel (in green, see Methods). As expected, the number of mEos4b-GlyRβ detections per gephyrin cluster was markedly reduced compared to the original analysis (revised__ Fig. 1B__), confirming that the synaptic mEos4b detections exceed chance levels (see page 5). *

      Inconsistency between Figure 3A and 3B. While Figure 3B indicates an ~8-fold difference in the number of mEos4b-GlyRβ detections per synapse between the dorsal and ventral striatum, Figure 3A does not appear to show a pronounced difference in the localization of mEos4b-GlyRβ on Sylite puncta between these two regions. If the images presented in Figure 3A are not representative, the authors should consider replacing them with more representative examples or providing an expanded images with multiple representative examples. Alternatively, if this inconsistency can be explained by differences in spot density within clusters, the authors should explain that.

      *The pointillist images in Fig. 3A are essentially binary (red-black). Therefore, the density of detections at synapses cannot be easily judged by eye. For clarity, the original images in Fig. 3A have been replaced with two other examples that better reflect the different detection numbers in the dorsal and ventral striatum. *

      • *

      Quantification in Figure 5. It is recommended that the authors provide quantitative data on cluster formation and colocalization with Sylite puncta in Figure 5 to support their qualitative observations.

      *This is an important point that was also raised by the other reviewers. We have performed additional experiments to increase the data volume for analysis. For quantification, we used two approaches. First, we counted the percentage of infected cells in which synaptic localisation of the recombinant receptor subunit was observed (Fig. 5C). We found that mEos4b-GlyRa1 consistently localises at synapses, indicating that all cells express endogenous GlyRb. When neurons were infected with mEos4b-GlyRb, fewer cells had synaptic clusters, meaning that indeed, GlyR alpha subunits are the limiting factor for synaptic targeting. In cultures infected with mEos4b-GlyRa2, only very few neurons displayed synaptic localisation (as judged by epifluorescence imaging). We think this shows that GlyRa2 is less capable of forming heteromeric complexes than GlyRa1, in line with our previous interpretation (see pp. 9-10, 13). *

      • *

      Secondly, we quantified the total intensity of each subunit at gephyrin-positive domains, both in infected neurons as well as non-infected control cultures (Fig. 5D). We observed that mEos4b-GlyRa1 intensity at gephyrin puncta was higher than that of the other subunits, again pointing to efficient synaptic targeting of GlyRa1. Gephyrin cluster intensities (Sylite labelling) were not significantly different in GlyRb and GlyRa2 expressing neurons compared to the uninfected control, indicating that the lentiviral expression of recombinant subunits does not fundamentally alter the size of mixed inhibitory synapses in hippocampal neurons. Interestingly, gephyrin levels were slightly higher in hippocampal neurons expressing mEos4b-GlyRa1. In our view, this comes from an enhanced expression and synaptic targeting of mEos4b-GlyRa1 heteromers with endogenous GlyRb, pointing to a structural role of GlyRa1/b in hippocampal synapses (pp. 10, 13).

      • *

      The new data and analyses have been described and illustrated in the relevant sections of the manuscript.

      Potential for pseudo replication. It's not clear whether they're performing stats tests across biological replica, images, or even synapses. They often quote mean +/- SEM with n = 1000s, and so does that mean they're doing tests on those 1000s? Need to clarify.

      All experiments were repeated at least twice to ensure reproducibility (N independent experiments). Statistical tests were performed on pooled data across the biological replicates; n denotes the number of data points used for testing (e.g., number of synaptic clusters, detections, cells, as specified in each case). We have systematically given these numbers in the revised manuscript (n, N, and other experimental parameters such as the number of animals used, coverslips, images or cells). Data are generally given as mean +/- SEM or as mean +/- SD as indicated.

      • *

      Does mEoS effect expression levels or function of the protein? Can't see any experiments done to confirm this. Could suggest WB on homogenate, or mass spec?

      The Glrbeos/eos knock-in mouse line has been characterised previously and does not to display any ultrastructural or functional deficits at inhibitory synapses (Maynard et al. 2021 eLife). GlyRβ expression and glycine-evoked responses were not significantly different to those of the wild-type. The synaptic localisation of mEos4b-GlyRb in KI animals demonstrates correct assembly of heteromeric GlyRs and synaptic targeting. Accordingly, the animals do not display any obvious phenotype. We have clarified this in the manuscript (p. 4). In the case of cultured neurons, long-term expression of fluorescent receptor subunits with lentivirus has proven ideal to achieve efficient synaptic targeting. The low and continuous supply of recombinant receptors ensures assembly with endogenous subunits to form heteropentameric receptor complexes (e.g. [Patrizio et al. 2017 Sci Rep]). In the present study, lentivirus infection did not induce any obvious differences in the number or size of inhibitory synapses compared to control neurons, as judged by Sylite labelling of synaptic gephyrin puncta (new__ Fig. 5D__).

      Quantification of protein numbers is challenging with SMLM. Issues include i) some of FP not correctly folded/mature, and ii) dependence of localisation rate on instrument, excitation/illumination intensities, and also the thresholds used in analysis. Can the authors compare with another protein that has known expression levels- e.g. PSD95? This is quite an ask, but if they could show copy number of something known to compare with, it would be useful.

      We agree that absolute quantification with SMLM is challenging, since the number of detections depends on fluorophore maturation, photophysics, imaging conditions, and analysis thresholds (discussed in Patrizio & Specht 2016, Neurophotonics). For this reason, only very few datasets provide reliable copy numbers, even for well-studied proteins such as PSD-95. One notable exception is the study by Maynard et al. (eLife 2021) that quantified endogenous GlyRb-containing receptors in spinal cord synapses using SMLM combined with correlative electron microscopy. The strength of this work was the use of a KI mouse strain, which ensures that mEos4b-GlyRb expression follows intrinsic regional and temporal profiles. The authors reported a stereotypic density of ~2,000 GlyRs/µm² at synapses, corresponding to ~120 receptors per synapse in the dorsal horn and ~240 in the ventral horn, taking into account various parameters including receptor stoichiometry and the functionality of the fluorophore. These values are very close to our own calculations of GlyR numbers at spinal cord synapses that were obtained slightly differently in terms of sample preparation, microscope setup, imaging conditions, and data analysis, lending support to our experimental approach. Nevertheless, the obtained GlyR copy numbers at hippocampal synapses clearly have to be taken as estimates rather than precise figures, because the number of detections from a single mEos4b fluorophore can vary substantially, meaning that the fluorophores are not represented equally in pointillist images. This can affect the copy number calculation for a specific synapse, in particular when the numbers are low (e.g. in hippocampus), however, it should not alter the average number of detections (Fig. 1B) or the (median) molecule numbers of the entire population of synapses (Fig. 1C). We have discussed the limitations of our approach (p. 11).

      Rationale for doing nanobody dSTORM not clear at all. They don't explain the reason for doing the dSTORM experiments. Why not just rely on PALM for coincidence measurements, rather than tagging mEoS with a nanobody, and then doing dSTORM with that? Can they explain? Is it to get extra localisations- i.e. multiple per nanobody? If so, localising same FP multiple times wouldn't improve resolution. Also, no controls for nanobody dSTORM experiments- what about non-spec nb, or use on WT sections?

      *As discussed above (point 6), the detection of fluorophores with SMLM is influenced by many parameters, not least the noise produced by emitting molecules other than the fluorophore used for labelling. Our study is exceptional in that it attempts to identify extremely low molecule numbers (down to 1). To verify that the detections obtained with PALM correspond to mEos4b, we conducted robust control experiments (including pixel-shift as suggested by the reviewer, see point 1, revised__ Fig. 1B__). The rationale for the nanobody-based dSTORM experiments was twofold: (1) to have an independent readout of the presence of low-copy GlyRs at inhibitory synapses and (2) to analyse the nanoscale organisation of GlyRs relative to the synaptic gephyrin scaffold using dual-colour dSTORM with spectral demixing (see p. 6). The organic fluorophores used in dSTORM (AF647, CF680) ensure high photon counts, essential for reliable co-localisation and distance analysis. PALM and dSTORM cannot be combined in dual-colour mode, as they require different buffers and imaging conditions. *

      The specificity of the anti-Eos nanobody was demonstrated by immunohistochemistry in spinal cord cultures expressing mEos4b-GlyRb and wildtype control tissue (Fig. S3). In response to the reviewer's remarks, we also performed a negative control experiment in Glrbeos/eos slices (dSTORM), in which the nanobody was omitted (new__ Fig. S4F,G__). Under these conditions, spectral demixing produced a single peak corresponding to CF680 (gephyrin) without any AF647 contribution (Fig. S4F). The background detection of "false" AF647 detections at synapses was significantly lower than in the slices labelled with the nanobody. We conclude that the fluorescence signal observed in our dual-colour dSTORM experiments arises from the specific detection of mEos4b-GlyRb by the nanobody, rather than from background, cross-reactivity or wrong attribution of colour during spectral demixing. We have added these data and explanations in the results (p. 7) and in the figure legend of Fig. S4F,G.

      What resolutions/precisions were obtained in SMLM experiments? Should perform Fourier Ring Correlation (FRC) on SR images to state resolutions obtained (particularly useful for when they're presenting distance histograms, as this will be dependent on resolution). Likewise for precision, what was mean precision? Can they show histograms of localisation precision.

      This is an interesting question in the context of our experiments with low-copy GlyRs, since the spatial resolution of SMLM is limited also by the density of molecules, i.e. the sampling of the structure in question (Nyquist-Shannon criterion). Accordingly, the priority of the PALM experiments was to improve the sensibility of SMLM for the identification of mEos4b-GlyRb subunits, rather than to maximize the spatial resolution. The mean localisation precision in PALM was 33 +/- 12 nm, as calculated from the fitting parameters of each detection (Zeiss, ZEN software), which ultimately result from their signal-to-noise ratio. This is a relatively low precision for SMLM, which can be explained by the low brightness of mEos4b compared to organic fluorophores together with the elevated fluorescence background in tissue slices.

      • *

      In the case of dSTORM, the aim was to study the relative distribution of GlyRs within the synaptic scaffold, for which a higher localisation precision was required (p. 6). Therefore, detections with a precision ≥ 25 nm were filtered during analysis with NEO software (Abbelight). The retained detections had a mean localisation precision of 12 +/- 5 for CF680 (Sylite) and 11 +/- 4 for AF647 (nanobody). These values are given in the revised manuscript (pp. 18, 22).

      Why were DBSCAN parameters selected? How can they rule out multiple localisations per fluor? If low copy numbers (

      Multiple detections of the same fluorophore are intrinsic to dSTORM imaging and have not been eliminated from the analysis. Small clusters of detections likely represent individual molecules (e.g. single receptors in the extrasynaptic regions, Fig. 2A). DBSCAN is a robust clustering method that is quite insensitive to minor changes in the choice of parameters. For dSTORM of synaptic gephyrin clusters (CF680), a relatively low length (80 nm radius) together with a high number of detections (≥ 50 neighbours) were chosen to reconstruct the postsynaptic domain with high spatial resolution (see point 8). In the case of the GlyR (nanobody-AF647), the clustering was done mostly for practical reasons, as it provided the coordinates of the centre of mass of the detections. The low stringency of this clustering (200 nm radius, ≥ 5 neighbours) effectively filters single detections that can result from background noise or incorrect demixing. An additional reference explaining the use of DBSCAN including the choice of parameters is given on p. 22 (see also R2 point 4).

      For microscopy experiment methods, state power densities, not % or "nominal power".

      *Done. We now report the irradiance (laser power density) instead of nominal power (pp. 18, 21). *

      In general, not much data presented. Any SI file with extra images etc.?

      *The original submission included four supplementary figures with additional data and representative images that should have been available to the reviewer (Figs. S1-S4). The SI file has been updated during revision (new Fig. S4E-G). *

      Clarification of the discussion on GlyR expression and synaptic localization: The discussion on GlyR expression, complex formation, and synaptic localization is sometimes unclear, and needs terminological distinctions between "expression level", "complex formation" and "synaptic localization". For example, the authors state:"What then is the reason for the low protein expression of GlyRβ? One possibility is that the assembly of mature heteropentameric GlyR complexes depends critically on the expression of endogenous GlyR α subunits." Does this mean that GlyRβ proteins that fail to form complexes with GlyRα subunits are unstable and subject to rapid degradation? If so, the authors should clarify this point. The statement "This raises the interesting possibility that synaptic GlyRs may depend specifically on the concomitant expression of both α1 and β transcripts." suggests a dependency on α1 and β transcripts. However, is the authors' focus on synaptic localization or overall protein expression levels? If this means synaptic localization, it would be beneficial to state this explicitly to avoid confusion. To improve clarity, the authors should carefully distinguish between these different aspects of GlyR biology throughout the discussion. Additionally, a schematic diagram illustrating these processes would be highly beneficial for readers.

      We thank the reviewer to point this out. We are dealing with several processes; protein expression that determines subunit availability and the assembly of pentameric GlyRs complexes, surface expression, membrane diffusion and accumulation of GlyRb-containing receptor complexes at inhibitory synapses. We have edited the manuscript, particularly the discussion and tried to be as clear as possible in our wording.

      • *

      We chose not to add a schematic illustration for the time being, because any graphical representation is necessarily a simplification. Instead, we preferred to summarise the main numbers in tabular form (Table 1). We are of course open to any other suggestions.

      Interpretation of GlyR localization in the context of nanodomains. The distribution of GlyR molecules on inhibitory synapses appears to be non-homogeneous, instead forming nanoclusters or nanodomains, similar to many other synaptic proteins. It is important to interpret GlyR localization in the context of nanodomain organization.

      The dSTORM images in Fig. 2 are pointillist representations that show individual detections rather than molecules. Small clusters of detections are likely to originate from a single AF647 fluorophore (in the case of nanobody labelling) and therefore represent single GlyRb subunits. Since GlyR copy numbers are so low at hippocampal synapses (≤ 5), the notion of nanodomain is not directly applicable. Our analysis therefore focused on the integration of GlyRs within the postsynaptic scaffold, rather than attempting to define nanodomain structures (see also response to point 8 of R1). A clarification has been added in the revised manuscript (p. 6).

      __Reviewer #1 (Significance (Required)): __

      The paper presents biological and technical advances. The biological insights revolve mostly on the documentation of Glycine receptors in particular synapses in forebrain, where they are typically expressed at very low levels. The authors provide compelling data indicating that the expression is of physiological significance. The authors have done a nice job of combining genetically-tagged mice with advanced microscopy methods to tackle the question of distributions of synaptic proteins. Overall these advances are more incremental than groundbreaking.

      We thank the reviewer for acknowledging both the technical and biological advances of our study. While we recognize that our work builds upon established models, we consider that it also addresses important unresolved questions, namely that GlyRs are present and specifically anchored at inhibitory synapses in telencephalic regions, such as the hippocampus and striatum. From a methodological point of view, our study demonstrates that SMLM can be applied not only for structural analysis of highly abundant proteins, but also to reliably detect proteins present at very low copy numbers. This ability to identify and quantify sparse molecule populations adds a new dimension to SMLM applications, which we believe increases the overall impact of our study beyond the field of synaptic neuroscience.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      In their manuscript "Single molecule counting detects low-copy glycine receptors in hippocampal and striatal synapses" Camuso and colleagues apply single molecule localization microscopy (SMLM) methods to visualize low copy numbers of GlyRs at inhibitory synapses in the hippocampal formation and the striatum. SMLM analysis revealed higher copy numbers in striatum compared to hippocampal inhibitory synapses. They further provide evidence that these low copy numbers are tightly linked to post-synaptic scaffolding protein gephyrin at inhibitory synapses. Their approach profits from the high sensitivity and resolution of SMLM and challenges the controversial view on the presence of GlyRs in these formations although there are reports (electrophysiology) on the presence of GlyRs in these particular brain regions. These new datasets in the current manuscript may certainly assist in understanding the complexity of fundamental building blocks of inhibitory synapses.

      However I have some minor points that the authors may address for clarification:

      1) In Figure 1 the authors apply PALM imaging of mEos4b-GlyRß (knockin) and here the corresponding Sylite label seems to be recorded in widefield, it is not clearly stated in the figure legend if it is widefield or super-resolved. In Fig 1 A - is the scale bar 5 µm? Some Sylite spots appear to be sized around 1 µm, especially the brighter spots, but maybe this is due to the lower resolution of widefield imaging? Regarding the statistical comparison: what method was chosen to test for normality distribution, I think this point is missing in the methods section.

      *This is correct; the apparent size of the Sylite spots does not reflect the real size of the synaptic gephyrin domain due to the limited resolution of widefield imaging including the detection of out-of-focus light. We have clarified in the legend of Fig. 1A that Sylite labelling was with classic epifluorescence microscopy. The scale bar in Fig. 1A corresponds to 5 µm. Since the data were not normally distributed, nonparametric tests (Kruskal- Wallis one-way ANOVA with Dunn’s multiple comparison test or Mann-Whitney U-test for pairwise comparisons) were used (p. 23). *

      Moreover I would appreciate a clarification and/or citation that the knockin model results in no structural and physiological changes at inhibitory synapses, I believe this model has been applied in previous studies and corresponding clarification can be provided.

      The Glrbeos/eos mouse model has been described previously and does not exhibit any structural or physiological phenotypes (Maynard et al. 2021 eLife). The issue was also raised by reviewer R1 (point 5) and has been clarified in the revised manuscript (p. 4).

      2) In the next set of experiments the authors switch to demixing dSTORM experiments - an explanation why this is performed is missing in the text - I guess better resolution to perform more detailed distance measurements? For these experiments: which region of the hippocampus did the authors select, I cannot find this information in legend or main text.

      Yes, the dSTORM experiments enable dual-colour structural analysis at high spatial resolution (see response to R1 point 7). An explanation has been added (p. 6).

      3) Regarding parameters of demixing experiments: the number of frames (10.000) seems quite low and the exposure time higher than expected for Alexa 647. Can the authors explain the reason for chosing these particular parameters (low expression profile of the target - so better separation?, less fluorophores on label and shorter collection time?) or is there a reference that can be cited? The laser power is given in the methods in percentage of maximal output power, but for better comparison and reproducibility I recommend to provide the values of a power meter (kW/cm2) as lasers may change their maximum output power during their lifetime.

      Acquisition parameters (laser power, exposure time) for dSTORM were chosen to obtain a good localisation precision (~12 nm; see R1 point 8). The number of frames is adequate to obtain well sampled gephyrin scaffolds in the CF680 channel. In the case of the GlyR (nanobody-AF647), the concept of spatial resolution does not really apply due to the low number of targets (see R1, point 13). Power density (irradiance) values have now been given (pp. 18, 21).

      4) For analysis of subsynaptic distribution: how did the authors decide to choose the parameters in the NEO software for DBSCAN clustering - was a series of parameters tested to find optimal conditions and did the analysis start with an initial test if data is indeed clustered (K-ripley) or is there a reference in literature that can be provided?

      DBSCAN parameters were optimised manually, by testing different values. Identification of dense and well-delimited gephyrin clusters (CF680) was achieved with a small radius and a high number of detections (80 nm, ≥ 50 neighbours), whereas filtering of low-density background in the AF647 channel (GlyRs) required less stringent parameters (200 nm, ≥ 5) due to the low number of target molecules. Similar parameters were used in a previous publication (Khayenko et al. 2022, Angewandte Chemie). The reference has been provided on p. 22 (see also R1 point 9).

      5) A conclusion/discussion of the results presented in Figure 5 is missing in the text/discussion.

      *This part of the manuscript has been completely overhauled. It includes new experimental data, quantification of the data (new Fig.5), as well as the discussion and interpretation of our findings (see also R1, point 3). In agreement with our earlier interpretation, the data confirm that low availability of GlyRa1 subunits limits the expression and synaptic targeting of GlyRa1/b heteropentamers. The observation that GlyRa1 overexpression with lentivirus increases the size of the postsynaptic gephyrin domain further points to a structural role, whereby GlyRs can enhance the stability (and size) of inhibitory synapses in hippocampal neurons, even at low copy numbers (pp. 13-14). *

      6) in line 552 "suspension" is misleading, better use "solution"

      Done.

      __Reviewer #2 (Significance (Required)): __

      Significance: The manuscript provides new insights to presence of low-copy numbers by visualizing them via SMLM. This is the first report that visualizes GlyR optically in the brain applying the knock-in model of mEOS4b tagged GlyRß and quantifies their copy number comparing distribution and amount of GlyRs from hippocampus and striatum. Imaging data correspond well to electrophysiological measurements in the manuscript.

      Field of expertise: Super-Resolution Imaging and corresponding analysis

      __Reviewer #4 (Evidence, reproducibility and clarity (Required)): __

      In this study, Camuso et al., make use of a knock-in mouse model expressing endogenously mEos4b-tagged GlyRβ to detect endogenous glycine receptors using single-molecule localization microscopy. The main conclusion from this study is that in the hippocampus GlyRβ molecules are barely detected, while inhibitory synapses in the ventral striatum seem to express functionally relevant GlyR numbers.

      I have a few points that I hope help to improve the strength of this study.

      • In the hippocampus, this study finds that the numbers of detections are very low. The authors perform adequate controls to indicate that these localizations are above noise level. Nevertheless, it remains questionable that these reflect proper GlyRs. The suggestion that in hippocampal synapses the low numbers of GlyRβ molecules "are important in assembly or maintenance of inhibitory synaptic structures in the brain" is on itself interesting, but is not at all supported. It is also difficult to envision how such low numbers could support the structure of a synapse. A functional experiment showing that knockdown of GlyRs affects inhibitory synapse structure in hippocampal neurons would be a minimal test of this.

      *It is not clear what the reviewer means by “it remains questionable that these reflect proper GlyRs”. The PALM experiments include a series of stringent controls (see R1, point 1) demonstrating the existence of low-copy GlyRs at inhibitory synapses in the hippocampus (Fig. 1) and in the striatum (Fig. 3), and are backed up by dSTORM experiments (Fig. 2). We have no reason to doubt that these receptors are fully functional (as demonstrated for the ventral striatum (Fig. 4). However, due to their low number, a role in inhibitory synaptic transmission is clearly limited, at least in the hippocampus and dorsal striatum. *

      • *

      We therefore propose a structural role, where the GlyRs could be required to stabilise the postsynaptic gephyrin domain in hippocampal neurons. This is based on the idea that the GlyR-gephyrin affinity is much higher than that of the GABAAR-gephyrin interaction (reviewed in Kasaragod & Schindelin 2018 Front Mol Neurosci). Accordingly, there is a close relationship between GlyRs and gephyrin numbers, sub-synaptic distribution, and dynamics in spinal cord synapses that are mostly glycinergic (Specht et al. 2013 Neuron; Maynard et al. 2021 eLife; Chapdelaine et al. 2021 Biophys J). It is reasonable to assume that low-copy GlyRs could play a similar structural role at hippocampal synapses. A knockdown experiment targeting these few receptors is technically very challenging and beyond the scope of this study. However, in response to the reviewer's question we have conducted new experiments in cultured hippocampal neurons (new__ Fig. 5__). They demonstrate that overexpression of GlyRa1/b heteropentamers increases the size of the postsynaptic domain in these neurons, supporting our interpretation of a structural role of low-copy GlyRs (p. 14).

      • The endogenous tagging strategy is a very strong aspect of this study and provides confidence in the labeling of GlyRβ molecules. One caveat however, is that this labeling strategy does not discriminate whether GlyRβ molecules are on the cell membrane or in internal compartments. Can the authors provide an estimate of the ratio of surface to internal GlyRβ molecules?

      Gephyrin is known to form a two-dimensional scaffold below the synaptic membrane to which inhibitory GlyRs and GABAARs attach (reviewed in Alvarez 2017 Brain Res). The majority of the synaptic receptors are therefore thought to be located in the synaptic membrane, which is supported by the close relationship between the sub-synaptic distribution of GlyRs and gephyrin in spinal cord neurons (e.g. Maynard et al. 2021 eLife). To demonstrate the surface expression of GlyRs at hippocampal synapses we labelled cultured hippocampal neurons expressing mEos4b-GlyRa1 with anti-Eos nanobody in non-permeabilised neurons (see Figure below for the reviewer only). The close correspondence between the nanobody (AF647) and the mEos4b signal confirms that the majority of the GlyRs are indeed located in the synaptic membrane.

      • *

      Figure (for the reviewer only).* Left: Lentivirus expression of mEos4b-GlyRa1 in fixed and non-permeabilised hippocampal neurons (mEos4b signal). Right: Surface labelling of the recombinant subunit with anti-Eos nanoboby (AF647). *

      • 'We also estimated the absolute number of GlyRs per synapse in the hippocampus. The number of mEos4b detections was converted into copy numbers by dividing the detections at synapses by the average number of detections of individual mEos4b-GlyRβ containing receptor complexes'. In essence this is a correct method to estimate copy numbers, and the authors discuss some of the pitfalls associated with this approach (i.e., maturation of fluorophore and detection limit). Nevertheless, the authors did not subtract the number of background localizations determined in the two negative control groups. This is critical, particularly at these low-number estimations.

      We fully agree that background subtraction can be useful with low detection numbers. In the revised manuscript, copy numbers are now reported as background-corrected values. Specifically, the mean number of detections measured in wildtype slices was used to calculate an equivalent receptor number, which was then subtracted from the copy number estimates across hippocampus, spinal cord and striatum. This procedure is described in the methods (p. 20) and results (p. 5, 8), and mentioned in the figure legends of Fig. 1C, 3C. The background corrected values are given in the text and Table 1.

      Furthermore, the authors state that "The advantage of this estimation is that it is independent of the stoichiometry of heteropentameric GlyRs". However, if the stoichometry is unknown, the number of counted GlyRβ subunits cannot simply be reported as the number of GlyRs. This should be discussed in more detail, and more carefully reported throughout the manuscript.

      *The reviewer is right to point this out. There is still some debate about the stoichiometry of heteropentameric GlyRs. Configurations with 2a:3b, 3a:2b and 4a:1b subunits have been advanced (e.g. Grudzinska et al. 2005 Neuron; Durisic et al. 2012 J Neurosci; Patrizio et al. 2017 Sci Rep; Zhu & Gouaux 2021 Nature). We have therefore chosen a quantification that is independent of the underlying stoichiometry. Since our quantification is based on very sparse clusters of mEos4b detections that likely originate from a single receptor complex (irrespective of its stoichiometry), the reported values actually reflect the number of GlyRs (and not GlyRb subunits). We have clarified this in the results (p. 5) and throughout the manuscript (Table 1). *

      • The dual-color imaging provides insights in the subsynaptic distribution of GlyRβ molecules in hippocampal synapses. Why are similar studies not performed on synapses in the ventral striatum where functionally relevant numbers of GlyRβ molecules are found? Here insights in the subsynaptic receptor distribution would be of much more interest as it can be tight to the function.

      This is an interesting suggestion. However, the primary aim of our study was to identify the existence of GlyRs in hippocampal regions. At low copy numbers, the concept of sub-synaptic domains (SSDs, e.g. Yang et al. 2021 EMBO Rep) becomes irrelevant (see R1 point 13). It should be pointed out that the dSTORM pointillist images (Fig. 2A) represent individual GlyR detections rather than clusters of molecules. In the striatum, our specific purpose was to solve an open question about the presence of GlyRs in different subregions (putamen, nucleus accumbens).

      • It is unclear how the experiments in Figure 5 add to this study. These results are valid, but do not seem to directly test the hypothesis that "the expression of α subunits may be limiting factor controlling the number of synaptic GlyRs". These experiments simply test if overexpressed α subunits can be detected. If the α subunits are limiting, measuring the effect of α subunit overexpression on GlyRβ surface expression would be a more direct test.

      Both R1 and R2 have also commented on the data in Fig. 5 and their interpretation. We have substantially revised this section as described before (see R1 point 3) including additional experiments and quantification of the data (new Fig. 5). The findings lend support to our earlier hypothesis that GlyR alpha subunits (in particular GlyRa1) are the limiting factor for the expression of heteropentameric GlyRa/b in hippocampal neurons (pp. 13-14). Since the GlyRa1 subunit itself does not bind to gephyrin (Patrizio et al. 2017 Sci Rep), the synaptic localisation of the recombinant mEos4b-GlyRa1 subunits is proof that they have formed heteropentamers with endogenous GlyRb subunits and driven their membrane trafficking, which the GlyRb subunits are incapable of doing on their own.

      __Reviewer #4 (Significance (Required)): __

      These results are based on carefully performed single-molecule localization experiments, and are well-presented and described. The knockin mouse with endogenously tagged GlyRβ molecules is a very strong aspect of this study and provides confidence in the labeling, the combination with single-molecule localization microscopy is very strong as it provides high sensitivity and spatial resolution.

      The conceptual innovation however seems relatively modest, these results confirm previous studies but do not seem to add novel insights. This study is entirely descriptive and does not bring new mechanistic insights.

      This study could be of interest to a specialized audience interested in glycine receptor biology, inhibitory synapse biology and super-resolution microscopy.

      my expertise is in super-resolution microscopy, synaptic transmission and plasticity

      As we have stated before, the novelty of our study lies in the use of SMLM for the identification of very small numbers of molecules, which requires careful control experiments. This is something that has not been done before and that can be of interest to a wider readership, as it opens up SMLM for ultrasensitive detection of rare molecular events. Using this approach, we solve two open scientific questions: (1) the demonstration that low-copy GlyRs are present at inhibitory synapses in the hippocampus, (2) the sub-region specific expression and functional role of GlyRs in the ventral versus dorsal striatum.

      • *

      • *

      The following review was provided later under the name “Reviewer #4”. To avoid confusion with the last reviewer from above we will refer to this review as R4-2.


      __Reviewer #4-2 (Evidence, reproducibility and clarity (Required)): __


      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The authors investigate the presence of synaptic glycine receptors in the telencephalon, whose presence and function is poorly understood.

      Using a transgenically labeled glycine receptor beta subunit (Glrb-mEos4b) mouse model together with super-resolution microscopy (SLMM, dSTORM), they demonstrate the presence of a low but detectable amount of synaptically localized GLRB in the hippocampus. While they do not perform a functional analysis of these receptors, they do demonstrate that these subunits are integrated into the inhibitory postsynaptic density (iPSD) as labeled by the scaffold protein gephyrin. These findings demonstrate that a low level of synaptically localized glycerine receptor subunits exist in the hippocampal formation, although whether or not they have a functional relevance remains unknown.

      They then proceed to quantify synaptic glycine receptors in the striatum, demonstrating that the ventral striatum has a significantly higher amount of GLRB co-localized with gephyrin than the dorsal striatum or the hippocampus. They then recorded pharmacologically isolated glycinergic miniature inhibitory postsynaptic currents (mIPSCs) from striatal neurons. In line with their structural observations, these recordings confirmed the presence of synaptic glycinergic signaling in the ventral striatum, and an almost complete absence in the dorsal striatum. Together, these findings demonstrate that synaptic glycine receptors in the ventral striatum are present and functional, while an important contribution to dorsal striatal activity is less likely.

      Lastly, the authors use existing mRNA and protein datasets to show that the expression level of GLRA1 across the brain positively correlates with the presence of synaptic GLRB.

      The authors use lentiviral expression of mEos4b-tagged glycine receptor alpha1, alpha2, and beta subunits (GLRA1, GLRA1, GLRB) in cultured hippocampal neurons to investigate the ability of these subunits to cause the synaptic localization of glycine receptors. They suggest that the alpha1 subunit has a higher propensity to localize at the inhibitory postsynapse (labeled via gephyrin) than the alpha2 or beta subunits, and may therefore contribute to the distribution of functional synaptic glycine receptors across the brain.

      Major comments:

      • Are the key conclusions convincing?

      The authors are generally precise in the formulation of their conclusions.

      • They demonstrate a very low, but detectable, amount of a synaptically localized glycine receptor subunit in a transgenic (GlrB-mEos4b) mouse model. They demonstrate that the GLRB-mEos4b fusion protein is integrated into the iPSD as determined by gephyrin labelling. The authors do not perform functional tests of these receptors and do not state any such conclusions.
      • The authors show that GLRB-mEos4b is clearly detectable in the striatum and integrated into gephyrin clusters at a significantly higher rate in the ventral striatum compared to the dorsal striatum, which is in line with previous studies.
      • Adding to their quantification of GLRB-mEos4b in the striatum, the authors demonstrate the presence of glycinergic miniature IPSCs in the ventral striatum, and an almost complete absence of mIPSCs in the dorsal striatum. These currents support the observation that GLRB-mEos4b is more synaptically integrated in the ventral striatum compared to the dorsal striatum.
      • The authors show that lentiviral expression of GLRA1-mEos4b leads to a visually higher number of GLR clusters in cultured hippocampal neurons, and a co-localization of some clusters with gephyrin. The authors claim that this supports the idea that GLRA1 may be an important driver of synaptic glycine receptor localization. However, no quantification or statistical analysis of the number of puncta or their colocalization with gephyrin is provided for any of the expressed subunits. Such a claim should be supported by quantification and statistics A thorough analysis and quantification of the data in Fig.5 has been carried out as requested by all the other reviewers (e.g. R1, point 3). The new data and results have been described in the revised manuscript (pp. 9-10, 13-14).

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      One unaddressed caveat is the fact that a GLRB-mEos4b fusion protein may behave differently in terms of localization and synaptic integration than wild-type GLRB. While unlikely, it is possible that mEos4b interacts either with itself or synaptic proteins in a way that changes the fused GLRB subunit’s localization. Such an effect would be unlikely to affect synaptic function in a measurable way, but might be detected at a structural level by highly sensitive methods such as SMLM and STORM in regions with very low molecule numbers (such as the hippocampus). Since reliable antibodies against GLRB in brain tissue sections are not available, this would be difficult to test. Considering that no functional measures of the hippocampal detections exist, we would suggest that this possible caveat be mentioned for this particular experiment.

      *This question has also been raised before (R1, point 5). According to an earlier study the mEos4b-GlyRb knock-in does not cause any obvious phenotypes, with the possible exception of minor loss of glycine potency (Maynard et al. 2021 eLife). The fact that the synaptic levels in the spinal cord in heterozygous animals are precisely half of those of homozygous animals argues against differences in receptor expression, heteropentameric assembly, forward trafficking to the plasma membrane and integration into the synaptic membrane as confirmed using quantitative super-resolution CLEM (Maynard et al. 2021 eLife). Accordingly, we did not observe any behavioural deficits in these animals, making it a powerful experimental model. We have added this information in the revised manuscript (p. 4). *

      In addition, without any quantification or statistical analysis, the author’s claims regarding the necessity of GLRA1 expression for the synaptic localization of glycine receptors in cultured hippocampal neurons should probably be described as preliminary (Fig. 5).

      As mentioned before, we have substantially revised this part (R1, point 3). The quantification and analysis in the new Fig. 5 support our earlier interpretation.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The authors show that there is colocalization of gephyrin with the mEos4b-GlyRβ subunit using the Dual-colour SMLM. This is a powerful approach that allows for a claim to be made on the synaptic location of the glycine receptors. The images presented in Figure 1, together with the distance analysis in Figure 2, display the co-localization of the fluorophores. The co-localization images in all the selected regions, hippocampus and striatum, also show detections outside of the gephyrin clusters, which the authors refer to as extrasynaptic. These punctated small clusters seem to have the same size as the ones detected and assigned as part of the synapse. It would be informative if the authors analysed the distribution, density and size of these non-synaptic clusters and presented the data in the manuscript and also compared it against the synaptic ones. Validating this extrasynaptic signal by staining for a dendritic marker, such as MAP-2 or maybe a somatic marker and assessing the co-localization with the non-synaptic clusters would also add even more credibility to them being extrasynaptic.

      The existence of extrasynaptic GlyRs is well attested in spinal cord neurons (e.g. Specht et al. 2013 Neuron; this study see Fig. S2). The fact that these appear as small clusters of detections in SMLM recordings results from the fact that a single fluorophore can be detected several times in consecutive image frames and because of blinking. Therefore, small clusters of detections likely represent single GlyRs (that can be counted), and not assemblies of several receptor complexes. Due to their diffusion in the neuronal membrane, they are seen as diffuse signals throughout the somatodendritic compartment in epifluorescence images (e.g. Fig. 5A). SMLM recordings of the same cells resolves this diffuse signal into discrete nanoclusters representing individual receptors (Fig. 5B). It is not clear what information co-localisation experiments with specific markers could provide, especially in hippocampal neurons, in which the copy numbers (and density) of GlyRs is next to zero.

      In addition we would encourage the authors to quantify the clustering and co-localization of virally expressed GLRA1, GLRA2, and GLRB with gephyrin in order to support the associated claims (Fig. 5). Preferably, the density of GLR and gephyrin clusters (at least on the somatic surface, the proximal dendrites, or both) as well as their co-localization probability should be quantified if a causal claim about subunit-specific requirements for synaptic localization is to be made.

      Quantification of the data have been carried out (new Fig.5C,D). The results have been described before (R1, point 3) and support our earlier interpretation of the data (pp. 13-14).

      Lastly, even though it may be outside of the scope of such a study analysing other parts of the hippocampal area could provide additional important information. If one looks at the Allen Institute’s ISH of the beta subunit the strongest signal comes from the stratum oriens in the CA1 for example, suggesting that interneurons residing there would more likely have a higher expression of the glycine receptors. This could also be assessed by looking more carefully at the single cell transcriptomics, to see which cell types in the hippocampus show the highest mRNA levels. If the authors think that this is too much additional work, then perhaps a mention of this in the discussion would be good.

      We have added the requested information from the ISH database of the Allen Institute in the discussion as suggested by the reviewer (p. 12). However, in combination with the transcriptomic data (Fig. S1) our finding strongly suggest that the expression of synaptic GlyRs depends on the availability of alpha subunits rather than on the presence of the GlyRb transcript. This is obvious when one compares the mRNA levels in the hippocampus with those in the basal ganglia (striatum) and medulla. While the transcript concentrations of GlyRb are elevated in all three regions and essentially the same, our data show that the GlyRb copy numbers *at synapses differ over more than 2 orders of magnitude (Fig. 1B, Table 1). *

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Since the labeling and some imaging has been performed already, the requested experiment would be a matter of deploying a method of quantification. In principle, it should not require any additional wet-lab experiments, although it may require additional imaging of existing samples.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes, for the most part.

      • Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      • Specific experimental issues that are easily addressable.

      N/A

      • Are prior studies referenced appropriately?

      Yes

      • Are the text and figures clear and accurate?

      Yes, although quantification in figure 5 is currently not present.

      A quantification has been added (see R1, point 3).

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      This paper presents a method that could be used to localize receptors and perhaps other proteins that are in low abundance or for which a detailed quantification is necessary. I would therefore suggest that Figure S4 is included into Figure 2 as the first panel, showcasing the demixing, followed by the results.

      We agree in principle with this suggestion. However, the revised Fig. S4 is more complex and we think that it would distract from the data shown in Fig. 2. Given that Fig. S4 is mostly methodological and not essential to understand the text, we have kept it in the supplement for the time being. We leave the final decision on this point to the editor.

      __Reviewer #4-2 (Significance (Required)): __

      [This review was supplied later]

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Using a novel and high resolution method, the authors have provided strong evidence for the presence of glycine receptors in the murine hippocampus and in the dorsal striatum. The number of receptors calculated is small compared to the numbers found in the ventral striatum. This is the first study to quantify receptor numbers in these region. In addition it also lays a roadmap for future studies addressing similar questions.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      This is done well by the authors in the curation of the literature. As stated above, the authors have filled a gap in the presence of glycine receptors in different brain regions, a subject of importance in understanding the role they play in brain activity and function.

      • State what audience might be interested in and influenced by the reported findings.

      Neuroscientists working at the synaptic level, on inhibitory neurotransmission and on fundamental mechanisms of expression of genes at low levels and their relationship to the presence of the protein would be interested. Furthermore, researchers in neuroscience and cell biology may benefit from and be inspired by the approach used in this manuscript, to potentially apply it to address their own aims.

      *We thank the reviewer for the positive assessment of the technical and biological implications of our work, as well as the interest of our findings to a wide readership of neuroscientists and cell biologists. *

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Synaptic transmission, inhibitory cells and GABAergic synapses functionally and structurally, cortex and cortical circuits. No strong expertise in super-resolution imaging methods.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary: Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The authors investigate the presence of synaptic glycine receptors in the telencephalon, whose presence and function is poorly understood.

      Using a transgenically labeled glycine receptor beta subunit (Glrb-mEos4b) mouse model together with super-resolution microscopy (SLMM, dSTORM), they demonstrate the presence of a low but detectable amount of synaptically localized GLRB in the hippocampus. While they do not perform a functional analysis of these receptors, they do demonstrate that these subunits are integrated into the inhibitory postsynaptic density (iPSD) as labeled by the scaffold protein gephyrin. These findings demonstrate that a low level of synaptically localized glycerine receptor subunits exist in the hippocampal formation, although whether or not they have a functional relevance remains unknown.

      They then proceed to quantify synaptic glycine receptors in the striatum, demonstrating that the ventral striatum has a significantly higher amount of GLRB co-localized with gephyrin than the dorsal striatum or the hippocampus. They then recorded pharmacologically isolated glycinergic miniature inhibitory postsynaptic currents (mIPSCs) from striatal neurons. In line with their structural observations, these recordings confirmed the presence of synaptic glycinergic signaling in the ventral striatum, and an almost complete absence in the dorsal striatum. Together, these findings demonstrate that synaptic glycine receptors in the ventral striatum are present and functional, while an important contribution to dorsal striatal activity is less likely.

      Lastly, the authors use existing mRNA and protein datasets to show that the expression level of GLRA1 across the brain positively correlates with the presence of synaptic GLRB. The authors use lentiviral expression of mEos4b-tagged glycine receptor alpha1, alpha2, and beta subunits (GLRA1, GLRA1, GLRB) in cultured hippocampal neurons to investigate the ability of these subunits to cause the synaptic localization of glycine receptors. They suggest that the alpha1 subunit has a higher propensity to localize at the inhibitory postsynapse (labeled via gephyrin) than the alpha2 or beta subunits, and may therefore contribute to the distribution of functional synaptic glycine receptors across the brain.

      Major comments: - Are the key conclusions convincing?

      The authors are generally precise in the formulation of their conclusions.

      1) They demonstrate a very low, but detectable, amount of a synaptically localized glycine receptor subunit in a transgenic (GlrB-mEos4b) mouse model. They demonstrate that the GLRB-mEos4b fusion protein is integrated into the iPSD as determined by gephyrin labelling. The authors do not perform functional tests of these receptors and do not state any such conclusions. 2) The authors show that GLRB-mEos4b is clearly detectable in the striatum and integrated into gephyrin clusters at a significantly higher rate in the ventral striatum compared to the dorsal striatum, which is in line with previous studies. 3) Adding to their quantification of GLRB-mEos4b in the striatum, the authors demonstrate the presence of glycinergic miniature IPSCs in the ventral striatum, and an almost complete absence of mIPSCs in the dorsal striatum. These currents support the observation that GLRB-mEos4b is more synaptically integrated in the ventral striatum compared to the dorsal striatum. 4) The authors show that lentiviral expression of GLRA1-mEos4b leads to a visually higher number of GLR clusters in cultured hippocampal neurons, and a co-localization of some clusters with gephyrin. The authors claim that this supports the idea that GLRA1 may be an important driver of synaptic glycine receptor localization. However, no quantification or statistical analysis of the number of puncta or their colocalization with gephyrin is provided for any of the expressed subunits. Such a claim should be supported by quantification and statistics

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      One unaddressed caveat is the fact that a GLRB-mEos4b fusion protein may behave differently in terms of localization and synaptic integration than wild-type GLRB. While unlikely, it is possible that mEos4b interacts either with itself or synaptic proteins in a way that changes the fused GLRB subunit's localization. Such an effect would be unlikely to affect synaptic function in a measurable way, but might be detected at a structural level by highly sensitive methods such as SMLM and STORM in regions with very low molecule numbers (such as the hippocampus). Since reliable antibodies against GLRB in brain tissue sections are not available, this would be difficult to test. Considering that no functional measures of the hippocampal detections exist, we would suggest that this possible caveat be mentioned for this particular experiment.

      In addition, without any quantification or statistical analysis, the author's claims regarding the necessity of GLRA1 expression for the synaptic localization of glycine receptors in cultured hippocampal neurons should probably be described as preliminary (Fig. 5).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The authors show that there is colocalization of gephyrin with the mEos4b-GlyRβ subunit using the Dual-colour SMLM. This is a powerful approach that allows for a claim to be made on the synaptic location of the glycine receptors. The images presented in Figure 1, together with the distance analysis in Figure 2, display the co-localization of the fluorophores. The co-localization images in all the selected regions, hippocampus and striatum, also show detections outside of the gephyrin clusters, which the authors refer to as extrasynaptic. These punctated small clusters seem to have the same size as the ones detected and assigned as part of the synapse. It would be informative if the authors analysed the distribution, density and size of these non-synaptic clusters and presented the data in the manuscript and also compared it against the synaptic ones. Validating this extrasynaptic signal by staining for a dendritic marker, such as MAP-2 or maybe a somatic marker and assessing the co-localization with the non-synaptic clusters would also add even more credibility to them being extrasynaptic.

      In addition we would encourage the authors to quantify the clustering and co-localization of virally expressed GLRA1, GLRA2, and GLRB with gephyrin in order to support the associated claims (Fig. 5). Preferably, the density of GLR and gephyrin clusters (at least on the somatic surface, the proximal dendrites, or both) as well as their co-localization probability should be quantified if a causal claim about subunit-specific requirements for synaptic localization is to be made.

      Lastly, even though it may be outside of the scope of such a study analysing other parts of the hippocampal area could provide additional important information. If one looks at the Allen Institute's ISH of the beta subunit the strongest signal comes from the stratum oriens in the CA1 for example, suggesting that interneurons residing there would more likely have a higher expression of the glycine receptors. This could also be assessed by looking more carefully at the single cell transcriptomics, to see which cell types in the hippocampus show the highest mRNA levels. If the authors think that this is too much additional work, then perhaps a mention of this in the discussion would be good.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Since the labeling and some imaging has been performed already, the requested experiment would be a matter of deploying a method of quantification. In principle, it should not require any additional wet-lab experiments, although it may require additional imaging of existing samples.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes, for the most part.

      • Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments: - Specific experimental issues that are easily addressable.

      N/A

      • Are prior studies referenced appropriately?

      Yes

      • Are the text and figures clear and accurate?

      Yes, although quantification in figure 5 is currently not present.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      This paper presents a method that could be used to localize receptors and perhaps other proteins that are in low abundance or for which a detailed quantification is necessary. I would therefore suggest that Figure S4 is included into Figure 2 as the first panel, showcasing the demixing, followed by the results.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Using a novel and high resolution method, the authors have provided strong evidence for the presence of glycine receptors in the murine hippocampus and in the dorsal striatum. The number of receptors calculated is small compared to the numbers found in the ventral striatum. This is the first study to quantify receptor numbers in these region. In addition it also lays a roadmap for future studies addressing similar questions.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      This is done well by the authors in the curation of the literature. As stated above, the authors have filled a gap in the presence of glycine receptors in different brain regions, a subject of importance in understanding the role they play in brain activity and function.

      • State what audience might be interested in and influenced by the reported findings.

      Neuroscientists working at the synaptic level, on inhibitory neurotransmission and on fundamental mechanisms of expression of genes at low levels and their relationship to the presence of the protein would be interested. Furthermore, researchers in neuroscience and cell biology may benefit from and be inspired by the approach used in this manuscript, to potentially apply it to address their own aims.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Synaptic transmission, inhibitory cells and GABAergic synapses functionally and structurally, cortex and cortical circuits. No strong expertise in super-resolution imaging methods.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Camuso et al., make use of a knock-in mouse model expressing endogenously mEos4b-tagged GlyRβ to detect endogenous glycine receptors using single-molecule localization microscopy. The main conclusion from this study is that in the hippocampus GlyRβ molecules are barely detected, while inhibitory synapses in the ventral striatum seem to express functionally relevant GlyR numbers.

      I have a few points that I hope help to improve the strength of this study.

      • In the hippocampus, this study finds that the numbers of detections are very low. The authors perform adequate controls to indicate that these localizations are above noise level. Nevertheless, it remains questionable that these reflect proper GlyRs. The suggestion that in hippocampal synapses the low numbers of GlyRβ molecules "are important in assembly or maintenance of inhibitory synaptic structures in the brain" is on itself interesting, but is not at all supported. It is also difficult to envision how such low numbers could support the structure of a synapse. A functional experiment showing that knockdown of GlyRs affects inhibitory synapse structure in hippocampal neurons would be a minimal test of this.
      • The endogenous tagging strategy is a very strong aspect of this study and provides confidence in the labeling of GlyRβ molecules. One caveat however, is that this labeling strategy does not discriminate whether GlyRβ molecules are on the cell membrane or in internal compartments. Can the authors provide an estimate of the ratio of surface to internal GlyRβ molecules?
      • 'We also estimated the absolute number of GlyRs per synapse in the hippocampus. The number of mEos4b detections was converted into copy numbers by dividing the detections at synapses by the average number of detections of individual mEos4b-GlyRβ containing receptor complexes'. In essence this is a correct method to estimate copy numbers, and the authors discuss some of the pitfalls associated with this approach (i.e., maturation of fluorophore and detection limit). Nevertheless, the authors did not subtract the number of background localizations determined in the two negative control groups. This is critical, particularly at these low-number estimations. Furthermore, the authors state that "The advantage of this estimation is that it is independent of the stoichiometry of heteropentameric GlyRs". However, if the stoichometry is unknown, the number of counted GlyRβ subunits cannot simply be reported as the number of GlyRs. This should be discussed in more detail, and more carefully reported throughout the manuscript.
      • The dual-color imaging provides insights in the subsynaptic distribution of GlyRβ molecules in hippocampal synapses. Why are similar studies not performed on synapses in the ventral striatum where functionally relevant numbers of GlyRβ molecules are found? Here insights in the subsynaptic receptor distribution would be of much more interest as it can be tight to the function.
      • It is unclear how the experiments in Figure 5 add to this study. These results are valid, but do not seem to directly test the hypothesis that "the expression of α subunits may be limiting factor controlling the number of synaptic GlyRs". These experiments simply test if overexpressed α subunits can be detected. If the α subunits are limiting, measuring the effect of α subunit overexpression on GlyRβ surface expression would be a more direct test.

      Significance

      These results are based on carefully performed single-molecule localization experiments, and are well-presented and described. The knockin mouse with endogenously tagged GlyRβ molecules is a very strong aspect of this study and provides confidence in the labeling, the combination with single-molecule localization microscopy is very strong as it provides high sensitivity and spatial resolution.

      The conceptual innovation however seems relatively modest, these results confirm previous studies but do not seem to add novel insights. This study is entirely descriptive and does not bring new mechanistic insights.

      This study could be of interest to a specialized audience interested in glycine receptor biology, inhibitory synapse biology and super-resolution microscopy.

      my expertise is in super-resolution microscopy, synaptic transmission and plasticity

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their manuscript "Single molecule counting detects low-copy glycine receptors in hippocampal and striatal synapses" Camuso and colleagues apply single molecule localization microscopy (SMLM) methods to visualize low copy numbers of GlyRs at inhibitory synapses in the hippocampal formation and the striatum. SMLM analysis revealed higher copy numbers in striatum compared to hippocampal inhibitory synapses. They further provide evidence that these low copy numbers are tightly linked to post-synaptic scaffolding protein gephyrin at inhibitory synapses. Their approach profits from the high sensitivity and resolution of SMLM and challenges the controversial view on the presence of GlyRs in these formations although there are reports (electrophysiology) on the presence of GlyRs in these particular brain regions. These new datasets in the current manuscript may certainly assist in understanding the complexity of fundamental building blocks of inhibitory synapses.

      However I have some minor points that the authors may address for clarification:

      1. In Figure 1 the authors apply PALM imaging of mEos4b-GlyRß (knockin) and here the corresponding Sylite label seems to be recorded in widefield, it is not clearly stated in the figure legend if it is widefield or super-resolved. In Fig 1 A - is the scale bar 5 µm? Some Sylite spots appear to be sized around 1 µm, especially the brighter spots, but maybe this is due to the lower resolution of widefield imaging? Regarding the statistical comparison: what method was chosen to test for normality distribution, I think this point is missing in the methods section. Moreover I would appreciate a clarification and/or citation that the knockin model results in no structural and physiological changes at inhibitory synapses, I believe this model has been applied in previous studies and corresponding clarification can be provided.
      2. In the next set of experiments the authors switch to demixing dSTORM experiments - an explanation why this is performed is missing in the text - I guess better resolution to perform more detailed distance measurements? For these experiments: which region of the hippocampus did the authors select, I cannot find this information in legend or main text.
      3. Regarding parameters of demixing experiments: the number of frames (10.000) seems quite low and the exposure time higher than expected for Alexa 647. Can the authors explain the reason for chosing these particular parameters (low expression profile of the target - so better separation?, less fluorophores on label and shorter collection time?) or is there a reference that can be cited? The laser power is given in the methods in percentage of maximal output power, but for better comparison and reproducibility I recommend to provide the values of a power meter (kW/cm2) as lasers may change their maximum output power during their lifetime.
      4. For analysis of subsynaptic distribution: how did the authors decide to choose the parameters in the NEO software for DBSCAN clustering - was a series of parameters tested to find optimal conditions and did the analysis start with an initial test if data is indeed clustered (K-ripley) or is there a reference in literature that can be provided?
      5. A conclusion/discussion of the results presented in Figure 5 is missing in the text/discussion.
      6. in line 552 "suspension" is misleading, better use "solution"

      Significance

      Significance: The manuscript provides new insights to presence of low-copy numbers by visualizing them via SMLM. This is the first report that visualizes GlyR optically in the brain applying the knock-in model of mEOS4b tagged GlyRß and quantifies their copy number comparing distribution and amount of GlyRs from hippocampus and striatum. Imaging data correspond well to electrophysiological measurements in the manuscript.

      Field of expertise: Super-Resolution Imaging and corresponding analysis

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors investigate the nanoscopic distribution of glycine receptor subunits in the hippocampus, dorsal striatum, and ventral striatum of the mouse brain using single-molecule localization microscopy (SMLM). They demonstrate that only a small number of glycine receptors are localized at hippocampal inhibitory synapses. Using dual-color SMLM, they further show that clusters of glycine receptors are predominantly localized within gephyrin-positive synapses. A comparison between the dorsal and ventral striatum reveals that the ventral striatum contains approximately eight times more glycine receptors and this finding is consistent with electrophysiological data on postsynaptic inhibitory currents. Finally, using cultured hippocampal neurons, they examine the differential synaptic localization of glycine receptor subunits (α1, α2, and β). This study is significant as it provides insights into the nanoscopic localization patterns of glycine receptors in brain regions where this protein is expressed at low levels. Additionally, the study demonstrates the different localization patterns of GlyR in distinct striatal regions and its physiological relevance using SMLM and electrophysiological experiments. However, several concerns should be addressed.

      The following are specific comments:

      1. Colocalization analysis in Figure 1A. The colocalization between Sylite and mEos-GlyRβ appears to be quite low. It is essential to assess whether the observed colocalization is not due to random overlap. The authors should consider quantifying colocalization using statistical methods, such as a pixel shift analysis, to determine whether colocalization frequencies remain similar after artificially displacing one of the channels.
      2. Inconsistency between Figure 3A and 3B. While Figure 3B indicates an ~8-fold difference in the number of mEos4b-GlyRβ detections per synapse between the dorsal and ventral striatum, Figure 3A does not appear to show a pronounced difference in the localization of mEos4b-GlyRβ on Sylite puncta between these two regions. If the images presented in Figure 3A are not representative, the authors should consider replacing them with more representative examples or providing an expanded images with multiple representative examples. Alternatively, if this inconsistency can be explained by differences in spot density within clusters, the authors should explain that.
      3. Quantification in Figure 5. It is recommended that the authors provide quantitative data on cluster formation and colocalization with Sylite puncta in Figure 5 to support their qualitative observations.
      4. Potential for pseudo replication. It's not clear whether they're performing stats tests across biological replica, images, or even synapses. They often quote mean +/- SEM with n = 1000s, and so does that mean they're doing tests on those 1000s? Need to clarify.
      5. Does mEoS effect expression levels or function of the protein? Can't see any experiments done to confirm this. Could suggest WB on homogenate, or mass spec?
      6. Quantification of protein numbers is challenging with SMLM. Issues include i) some of FP not correctly folded/mature, and ii) dependence of localisation rate on instrument, excitation/illumination intensities, and also the thresholds used in analysis. Can the authors compare with another protein that has known expression levels- e.g. PSD95? This is quite an ask, but if they could show copy number of something known to compare with, it would be useful.
      7. Rationale for doing nanobody dSTORM not clear at all. They don't explain the reason for doing the dSTORM experiments. Why not just rely on PALM for coincidence measurements, rather than tagging mEoS with a nanobody, and then doing dSTORM with that? Can they explain? Is it to get extra localisations- i.e. multiple per nanobody? If so, localising same FP multiple times wouldn't improve resolution. Also, no controls for nanobody dSTORM experiments- what about non-spec nb, or use on WT sections?
      8. What resolutions/precisions were obtained in SMLM experiments? Should perform Fourier Ring Correlation (FRC) on SR images to state resolutions obtained (particularly useful for when they're presenting distance histograms, as this will be dependent on resolution). Likewise for precision, what was mean precision? Can they show histograms of localisation precision.
      9. Why were DBSCAN parameters selected? How can they rule out multiple localisations per fluor? If low copy numbers (<10), then why bother with DBSCAN? Could just measure distance to each one.
      10. For microscopy experiment methods, state power densities, not % or "nominal power".
      11. In general, not much data presented. Any SI file with extra images etc.?
      12. Clarification of the discussion on GlyR expression and synaptic localization: The discussion on GlyR expression, complex formation, and synaptic localization is sometimes unclear, and needs terminological distinctions between "expression level", "complex formation" and "synaptic localization". For example, the authors state:"What then is the reason for the low protein expression of GlyRβ? One possibility is that the assembly of mature heteropentameric GlyR complexes depends critically on the expression of endogenous GlyR α subunits." Does this mean that GlyRβ proteins that fail to form complexes with GlyRα subunits are unstable and subject to rapid degradation? If so, the authors should clarify this point. The statement "This raises the interesting possibility that synaptic GlyRs may depend specifically on the concomitant expression of both α1 and β transcripts." suggests a dependency on α1 and β transcripts. However, is the authors' focus on synaptic localization or overall protein expression levels? If this means synaptic localization, it would be beneficial to state this explicitly to avoid confusion. To improve clarity, the authors should carefully distinguish between these different aspects of GlyR biology throughout the discussion. Additionally, a schematic diagram illustrating these processes would be highly beneficial for readers.
      13. Interpretation of GlyR localization in the context of nanodomains. The distribution of GlyR molecules on inhibitory synapses appears to be non-homogeneous, instead forming nanoclusters or nanodomains, similar to many other synaptic proteins. It is important to interpret GlyR localization in the context of nanodomain organization.

      Significance

      The paper presents biological and technical advances. The biological insights revolve mostly on the documentation of Glycine receptors in particular synapses in forebrain, where they are typically expressed at very low levels. The authors provide compelling data indicating that the expression is of physiological significance. The authors have done a nice job of combining genetically-tagged mice with advanced microscopy methods to tackle the question of distributions of synaptic proteins. Overall these advances are more incremental than groundbreaking.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03111

      Corresponding author(s): Qingyin Qian and Ryusuke Niwa

      1. General Statements [optional]

      We would like to thank reviewers for their feedback on our initial submission. Changes in figures were noted in the point-to-point reply. For submission of our current revised manuscript, we provide two Word files, which are the “clean” and “Track-and-Change” files. Page and line numbers described below correspond to those of the “clean” file. The “Track-and-Change” file might be helpful for Reviewers to find what we have changed for the current revision.

      In the revised manuscript, major changes in the text were tracked, while minor edits in figure numbers and legends were not tracked. In the Discussion, the section “Xrp1-mediated EE plasticity…” was moved before “Xrp1, a transcription factor …”, to follow the order of the Results, and was split into two: “EE plasticity …” and “Xrp1-mediated EE plasticity …”.

      2. Description of the planned revisions

      - The authors should investigate the regenerative growth of the adult midgut after irradiation. Is there an impact on ISCs proliferation or cell turn over. Is Xrp1 in EEs required in this adaptive response. It would be elegant to use the recently generated tracing method by Tobias Reiff lab to observe overall impact on tissue renewal (rapport-tracing esglexReDDM esg-lexA, 13xLexAop2-CD8::GFP, 13xLexAop2-H2B::mCherry::HA, tub-Gal80ts on the second chromosome. It can be combined with any EEs Gal4-driver (see Nat Commun 2025, https://doi.org/10.1038/s41467-024-55664-2, the stock is already existing, see table1). This reviewer thinks that it is a key experiment to support the proposed model.

      2.1. Author response:

      We will conduct the following experiments to answer these criticisms.

      (1) We will investigate the ISC behavior, proliferation and differentiation, after 100 Gy of radiation by examining changes in the number of progenitor cells and their progenies, using esgtsF/O (esg-Gal4, UAS-GFP, tub-Gal80ts; Act>Cd2>Gal4, UAS-Flp) generated in the study (Jiang et al. Cell 2009 DOI: 10.1016/j.cell.2009.05.014) or esgReDDM (esg-Gal4, UAS-CD8::GFP; UAS-H2B::RFP, tubGal80ts) generated in the study (Antonello et al. EMBO J. 2015 DOI: 10.15252/embj.201591517). Flies will have progenitor cell lineages traced for 7 days, irradiated on day 6, and examined at different time points after radiation, following the design shown in Fig. 2A. Based on the previous findings (Sharma et al. Sci. Rep. 2020 DOI: 10.1038/s41598-020-75867-z; Pyo et al. Radiat. Res. 2014 DOI: 10.1667/RR13545.1), we anticipate that radiation compromises ISCs’ proliferation and differentiation. Should this be the case, our results can be interpreted in relation to those earlier studies.

      (2) In parallel, we will examine whether Xrp1 expression in EEs affects radiation-induced ISC behaviors. As suggested, we will use “EE Rapport” (esg-lexA, 13xLexAop2-CD8::GFP, 13xLexAop2-H2B::mCherry::HA, tub-Gal80ts; Rab3-Gal4) generated in the study (Zipper et al. Nat. Commun. 2025 DOI: 10.1038/s41467-024-55664-2) and compare control flies to flies with Xrp1 knocked down in EEs to assess the impact on ISC behaviors.

      - Is p53 required for Xrp1 induction in the gut after irradiation?

      2.2. Author response:

      To answer this point, we will perform immunostaining of anti-Xrp1 antibody to examine whether p53 is required for Xrp1 induction in irradiated flies with p53 knocked down in EEs.

      - Xrp1 over expression has been shown to induce upd3 ligand and nutrient-driven dedifferentiation of enteroendocrine cells is occuring by activation of the JAK-STAT pathway (DOI: 10.1016/j.devcel.2023.08.022). Could the authors test the function of this signaling pathway during irradiation (upd3-lacZ and Stat-GFP can be used in parallel of upd3 RNAi and UAS Dome-DN.

      2.3. Author response:

      We will conduct the following experiments to answer these points.

      (1) We will examine the cell type in which upd3 ligand induction occurs after radiation by using the upd3.1-LacZ reporter generated in the study (Jiang et al. Cell Stem Cell 2011 DOI: doi.org/10.1016/j.stem.2010.11.026).

      (2) One possibility is that upd3.1-LacZ is detected in EEs. In this case, we will examine the requirement of upd3 in EEs for radiation-induced EE plasticity by knocking down upd3. Another possibility is that upd3.1-LacZ is detected in non-EE cells. If so, we will examine the requirement of the JAK-STAT pathway in EEs by overexpressing dome[△cyt] generated in the study (Brown et al. Curr. Biol. 2001 DOI: 10.1016/s0960-9822(01)00524-3) or knocking down Stat92E in EEs. Because these conditions are not mutually exclusive, both approaches may be pursued, with the latter relating our results to nutrient-driven EE dedifferentiation.

      - Xrp1 is known for its role in cell competition and elimination of looser cells by induction of apoptosis. It would be interesting to check for induction of cell death and/or caspase activation in the fly gut after irradiation and verify a non apoptotic role of DRONC activation in this context using a Dronc RNAi (as proposed by Bergmann lab (https://doi.org/10.1038/s41598-021-81261-0) or Baena-Lopez lab (DOI: 10.15252/embr.201948892)). Overexpression of Xrp1 could be combined with UAS-p35.

      2.4. Author response:

      To address these points, we will investigate apoptosis induction following radiation with anti-cleaved Dcp-1 immunostaining. Based on the previous finding (Sharma et al. Sci. Rep. 2020 DOI: 10.1038/s41598-020-75867-z), we anticipate seeing increased cleaved Dcp-1 signals in all cell types after radiation. We intend to clarify whether radiation increases the ratio of apoptotic EEs among EEs; however, we cannot yet be certain whether it will be feasible.

      Regarding Dronc activation, we previously requested the antibody used in the study (Wilson et al. Nat. Cell Biol. 2002 DOI: 10.1038/ncb799; Lindblad et al. Sci. Rep. 2021 DOI: 10.1038/s41598-021-81261-0) and tested it in our context, after radiation and by Xrp1-S O/E in EEs. We present our data below. In the anterior midgut, anti-Dronc signals were not observed under both control conditions. After radiation and by Xrp1-S O/E in EEs, anti-Dronc signals were seen in part of past EEs (#2 past) and progenitor cells (#3 prgn), implying their EB identity. However, anti-Dronc signals were never observed in current EEs (#1 current), suggesting Dronc does not act directly downstream to Xrp1.

      We will address UAS-p35 in 3.3. Author response and Dronc-RNAi in 4.2. Author response.

      - The authors do not justify or explain why they used 100 Gy of radiation. This is higher than doses used in comparable regeneration studies in adult Drosophila (e.g., PMID25959206, PMID: 28925355). The authors should clarify why this dose was chosen.

      2.5. Author response:

      Our initial rationale was based on the paper (Sharma et al. Sci. Rep. 2020 DOI: 10.1038/s41598-020-75867-z), where the authors claimed that ISC proliferation was inhibited and the ISC number was decreased by 100 Gy of radiation.

      Nevertheless, we understand the reviewer’s concern and will examine 50 Gy of radiation as used in the papers the reviewer listed. We will examine radiation-induced changes in EE lineages and ISC behaviors. Depending on the results, we will evaluate whether and how they should be incorporated into the manuscript.

      - Fig. 2C, the number of past EE’s increased transiently so that baseline number is restored at 18 hr after IR. The authors conclude that fate plasticity is a transient event. Can they rule out loss due to cell death?

      2.6. Author response:

      In our system, past EEs were detected transiently but did not persist. We agree that we cannot distinguish whether the transient appearance of past EEs reflects transient adoption of another identity that ends in cell death or reversible plasticity.

      To partially address this criticism, as noted in 2.4. Author response, we will examine the apoptosis marker cleaved Dcp-1, which also tests whether cleaved Dcp-1-positive cells can be past EEs. However, regardless of detecting apoptosis markers in past EEs, we have changed “transient” into “temporary” to describe a short-lived cell state (see Page 8, Line 178; Page 15, Line 338).

      - They authors interpret fate-conversion as beneficial for tissue repair but never test whether blocking this process impairs recovery or organismal survival or whether promoting it improves outcomes.

      2.7. Author response:

      We have removed this potentially misleading interpretation (see Page 4, removed the last part of the previous introduction, “and propose the possibility that such plasticity contributes to tissue repair”). We present below the data showing a severe reduction of the ISC number in 7-day post-radiation guts, suggesting the inability of tissue repair. We will add this to the manuscript together with results from the following experiments.

      (1) We will examine if the blockage of radiation-induced EE plasticity, via knocking down Xrp1 in EEs, alters the epithelial cell number and cell junction protein localization.

      (2) To complement the result of plasticity inhibition, we attempt to promote plasticity by overexpressing Xrp1 in EEs, to test whether this rescues ISC loss or restores junctions.

      Should knockdown worsen ISC loss and junction integrity, or overexpression rescue them, we will describe EE plasticity as beneficial; otherwise, we will present it as a radiation-induced response without inferring benefits, while noting our limitations.

      We will address organismal survival in 4.3. Author response.

      - Related to the above, it would be helpful to know if fate-converted cells function as true ISCs or ECs (e.g., through proliferation or absorption assays).

      2.8. Author response:

      To partially answer this criticism, we will examine whether EE-derived ISCs are proliferative by examining whether they can be positive for the mitotic marker phospho-histone 3.

      We will address absorption assays in 4.4. Author response.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      - It is surprising to observe EEs dedifferentiation at a steady state during homeostasis, a condition in which Xrp1 is not detected in the gut. Can the authors comment this point in the discussion?

      3.1. Author response:

      We have added our thoughts in terms of Xrp1 being not detectable in homeostatic EE lineages (see Page 15, Line 350 - 356). We have also added our thoughts regarding observation of EE plasticity in homeostatic guts (see Page 14, Line 322 - 332).

      - Xrp1 is existing as a short of long isoforms. The short form has been recently proposed to be required for cell competition (https://doi.org/10.1101/2025.06.15.659587) whereas Xrp1 long isoform may be responsible for reduced cell growth. Could the authors test which isoform is induced in the gut after irradiation? Is the overexpression of Xrp1 long isoform having the same effect that the short isoform used by the authors.

      3.2. Author response:

      We have added data on the effect of Xrp1 long isoform overexpression on EE plasticity (see Fig. 5A - 5B, Page 12, Line 276 - 278), showing that overexpression of the Xrp1 long isoform caused a similar increase in past EEs. In addition, we have changed Xrp1 O/E to Xrp1-S O/E in the contents related to Figs 4, 5, S4, and S5.

      We will address radiation-induced Xrp1 isoforms in 4.1. Author response.

      - Xrp1 is known for its role in cell competition and elimination of looser cells by induction of apoptosis. It would be interesting to check for induction of cell death and/or caspase activation in the fly gut after irradiation and verify a non apoptotic role of DRONC activation in this context using a Dronc RNAi (as proposed by Bergmann lab (https://doi.org/10.1038/s41598-021-81261-0) or Baena-Lopez lab (DOI: 10.15252/embr.201948892)). Overexpression of Xrp1 could be combined with UAS-p35.

      3.3. Author response:

      We have added data regarding p35 O/E combined with Xrp1 O/E, showing that p35 O/E did not further increase the number of past EEs, thereby suggesting that Xrp1-driven EE plasticity has a non-apoptotic nature (see Fig. 5C - 5D, Page 13, Line 293 - 297).

      - Line 221: fig S3E should be S3F

      - Line 230: fig S3F-G should be S3G-H

        • Line 230, Fig S3F-G should be Fig S3G-H.*

      3.4. Author response:

      We have fixed this error.

      - The posterior gut region R4 is more proliferative than the anterior part and is usually used for testing regenerative growth. What is happening there after irradiation?

      3.5. Author response:

      We present below radiation-induced changes in EE lineages and ISC number in the R4bc gut region. Radiation did not alter the proportion of past EEs among EE lineages but reduced the ISC number. We acknowledge differences between anterior and posterior gut regions, but we do not plan to further analyze regional differences or underlying mechanisms.

      - The authors’ explanation for cells with weak GFP in Figure 1 is not convincing. Induction of GFP is an all or nothing event as it results from Pros-driven FLPase and a recombination that removes the transcription stop signals to express GFP from a Ubi promotor. Once that happens, it should not matter how strong or weak Pros is, GFP should be the same. So, another explanation is needed. Nuclear staining of cell #2 in Fig 1B resembles a metaphase chromosome arrangement. Nuclear GFP may appear ‘weak’ in mitosis as the nuclear envelope breaks down. It is positive for the purple Pros/Dl stain, which makes it hard to tell if it is Pros+ or Pros- even though the authors state that cells with weak GFP are Pros- in line 104 (see the point above regarding confusing same-color stain for ISC and EE markers). Could cell #2 be a pre-EE that is undergoing mitosis since the lineage tracer marks both EE and pre-EE cells (line 119)? Or do the authors mean recombination on one or both homologs? This should not be possible since the cells are heterozygotes for the Ubi-GFP locus.

      3.6. Author response:

      For cell #5, RFP- GFPweak may result from the leakiness of the G-TRACE system. We have added our observations of the G-TRACE strains and changed our previous explanation (see Fig. S1B - S1C, Page 5, Line 94 - 97, 103 - 106).

      For cell #2, we agree that RFP+ GFPweak cells may either be a cell turning on pros expression just before sample preparation or a pre-EE undergoing mitosis. Nevertheless, it is not a past EE that has lost the EE marker Pros, so it is considered a current EE. We have removed our previous interpretation of cell #2 (see Page 5, removed “which likely had not yet fully activated recombination”), and changed the image to avoid confusion (see Fig. 1C).

      - Fig. 2C, if past-EE’s increased in number while current EE’s stayed the same, where are new past-EE’s coming from? There cannot be compensatory proliferations since EE’s are post-mitotic. For fate conversion, one would expect the generation of each past-EE to accompany loss of one current EE.

      3.7. Author response:

      We agree that the generation of one past EE should be accompanied by the loss of one current EE. We do not have a clear answer to this question. Our data showed cell numbers per ROI rather than the total cell number across the whole gut. To address this, we have changed the number to the proportion, calculated from [past EE] / ([past EE] + [current EE]), in experiments examining damage-induced EE plasticity, which provides a more informative measure for EE fate conversion (see Fig. 2C, also Fig. S2B and 3E).

      - Fig. 2E. Dl+ past-EE cell number declined at 14 and 18 h after IR and because cell sized increased, the authors conclude that EE cells that de-differentiated into ISCs subsequently re-differentiated into EC’s. To reach this conclusion, the authors should count past-EEs that are positive for EC markers. Cell size alone is insufficient evidence.

      3.8. Author response:

      We have added data quantifying the proportion of past EEs that are positive for the EC marker Pdm1, showing that past EEs were more likely to be ECs in guts examined 14 h after radiation (see Fig. 2F - 2G, Page 9, Line 189).

      - Fig. 6. Where are the % numbers for ISC, EB and EE’s coming from? And wouldn’t these change with time after IR, etc?

      3.9. Author response:

      The numbers came from the calculation of the percentage of the absolute values of control and 14 h post-IR conditions from Fig. 2E. These numbers changed with time after radiation. We realized that the precise numbers were misleading. We therefore have removed such illustration and instead added phrases “more current EEs → past EEs, more past EEs being ISCs → past EEs being ECs” to describe the increase in past EE cell number and the shift in the composition of past EEs (see Fig. 6).

      - Improve Figure 1B: Pros and Dl are shown in the same color, creating confusion. If both are stained together, different colors or clearer labeling should be used. Clarify how cells are identified as Pros+ vs Dl+.

      3.10. Author response:

      Anti-Pros and anti-Dl antibodies were produced from the same host species and were detected with the same secondary antibody, so they were in the same color. We have stated that solid nuclear staining indicates Pros, whereas punctate cytoplasmic staining indicates Dl (see Page 5, Line 100, 102, and 103). Such staining has been reported in previous studies (for example, Fig. 2A - 2B, Veneti et al. Nat. Commun. 2024 DOI: 10.1038/s41467-024-46119-9).

      - Why is Dl (supposed to be cytoplasmic) overlapping with nuclear GFP in cells #3 and 4 in Fig. 1B?

      3.11. Author response:

      Because Dl signals were located apically to DAPI/GFP signals, the overlap was likely due to Z-projection from stacked slices. We present below orthogonal slices along the z-axis, from top to bottom by row, and composite and individual color channels, from left to right by columns, for cell #3 (left) and cell #4 (right).

      For cell #3, Dl signals were present in slices 1/8 and 2/8 and disappeared in slice 3/8, whereas DAPI signals appeared from slice 2/8. For cell #4, Dl signals surrounded DAPI signals when viewed separately. In addition, we realized that nuclear GFP signals slightly outgrew DAPI signals, despite our confirmation that the GFP channel was not saturated.

      We have included separate color channels for DAPI signals and Pros, Dl and DAPI merged channels, showing that Dl signals were absent from the nucleus. For cell #3, in which the nuclear DAPI and cytoplasmic Dl cannot be distinguished in the stacked view, we show the images from a single orthogonal slice in the main panel, and the image from stacked slices as insets (see Fig. 1C).

      - Fig. S1E and F. Very hard to see what the authors describe about Arm and Cora. One problem is that cell boundaries are not visible, just the nuclei, so it is hard to know whether cell-cell interactions the authors describe as normal are really normal. Another problem is the overlap of Arm (supposed to be cytoplasmic) with the nuclear GFP signal. What is that?

      3.12. Author response:

      Regarding the invisibility of cell boundaries, we have improved the image of anti-Cora staining and added anti-Mesh staining and a separate color channel for DAPI signals to reinforce junction integrity (see Fig. S1H - S1I).

      Regarding the overlap of Arm signals with nuclear GFP signals, we realized similar problems as those noted in 3.11. Author response. We present below orthogonal slices along the z-axis and combined and individual color channels, for cell #2 (left) and cell #3 (right). For both cells, Arm signals did not overlap with DAPI signals. We have adjusted the maximum intensity projection to include slices 1-4 instead of 1-8 and added a separate color channel for DAPI signals to avoid the signals appearing to overlap (see Fig. S1G).

      - Include a simple schematic of ISC to EE/EC lineages for readers unfamiliar with Drosophila gut biology.

      3.13. Author response:

      We have included a schematic (see Fig. 1A). Although not requested, we have also improved Fig. 1B to enhance clarity.

      - Discuss the regional difference in Xrp1 efficacy (R2a vs R2b). Is there something known about gene expression differences in different gut regions that can explain the results?

      3.14. Author response:

      At present, we do not have an explanation for these results. We have refined our discussion regarding such regional differences (see Page 16 - 17, Line 381 - 390).

      - Consider moving scRNAseq (Fig. S1G) into main paper: this is a central part of the conclusion.

      3.15. Author response:

      We have moved Fig. S1G, as well as Fig. S1H and S1I, into the main figure (see Fig. 1G - 1I).

      4. Description of analyses that authors prefer not to carry out

      - Xrp1 is existing as a short of long isoforms. The short form has been recently proposed to be required for cell competition (https://doi.org/10.1101/2025.06.15.659587) whereas Xrp1 long isoform may be responsible for reduced cell growth. Could the authors test which isoform is induced in the gut after irradiation? Is the overexpression of Xrp1 long isoform having the same effect that the short isoform used by the authors.

      4.1. Author response:

      We prefer not to distinguish whether the long or short Xrp1 isoform is induced in the gut after radiation. This presents technical challenges and falls outside the scope of the present study. As noted in 3.2. Author response, we instead report in the revised manuscript that both isoforms similarly promote EE plasticity.

      - Xrp1 is known for its role in cell competition and elimination of looser cells by induction of apoptosis. It would be interesting to check for induction of cell death and/or caspase activation in the fly gut after irradiation and verify a non apoptotic role of DRONC activation in this context using a Dronc RNAi (as proposed by Bergmann lab (https://doi.org/10.1038/s41598-021-81261-0) or Baena-Lopez lab (DOI: 10.15252/embr.201948892)). Overexpression of Xrp1 could be combined with UAS-p35.

      4.2. Author response:

      We prefer not to perform Dronc-RNAi, because we did not observe Dronc activation downstream to Xrp1, as shown in 2.4. Author response.

      - They authors interpret fate-conversion as beneficial for tissue repair but never test whether blocking this process impairs recovery or organismal survival or whether promoting it improves outcomes.

      4.3. Author response:

      We prefer not to examine organismal survival. We agree that organismal survival would be informative, but our study focuses on epithelial cell number, which will be tested as noted in 2.7. Author response. We will not mention broad claims at the organismal level.

      - Related to the above, it would be helpful to know if fate-converted cells function as true ISCs or ECs (e.g., through proliferation or absorption assays).

      4.4. Author response:

      We prefer not to perform absorptive assays due to technical challenges. We will instead test proliferation, as noted in 2.8. Author response, and note our limitations.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Qian and colleagues report a study on radiation induced cell fate plasticity in the intestine of Drosophila. Using lineage tracing to mark pre-EE and EE cells, the authors how that these cells can lose EE/pre-EE marker Pros and express ISC or EC markers, indicating fate conversion. Single cell RNAseq analysis showed that even under basal conditions, ISC/EB cell population includes those with EE/pre-EE lineage tracer, confirming fate conversion. The same analysis showed that fate converted ISC/EB cells express transcription factor Ets21C, which is associated with regeneration but not normal development. Exposure to ionizing radiation (IR) increases the frequency of fate conversion and accompanies the induction of Xrp1 (which is not expressed normally in the EE's). Xrp1 knock down reduced IR-induced fate conversion, demonstrating necessity. Xrp1 is also sufficient because overexpression of it resulted in increased fate conversion without IR. scRNAseq analysis showed that overexpression of Xrp1 in pre-EE/EE cells (without IR) resulted in the induction of ISC/progenitor state genes such as esg and Sox homologs. Functional testing of the latter group of genes demonstrated their essential role in cell fate plasticity induced by Xrp1.

      Major comments

      • The authors do not justify or explain why they used 100 Gy of radiation. This is higher than doses used in comparable regeneration studies in adult Drosophila (e.g., PMID25959206, PMID: 28925355). The authors should clarify why this dose was chosen.
      • The authors' explanation for cells with weak GFP in Figure 1 is not convincing. Induction of GFP is an all or nothing event as it results from Pros-driven FLPase and a recombination that removes the transcription stop signals to express GFP from a Ubi promotor. Once that happens, it should not matter how strong or weak Pros is, GFP should be the same. So, another explanation is needed. Nuclear staining of cell #2 in Fig 1B resembles a metaphase chromosome arrangement. Nuclear GFP may appear 'weak' in mitosis as the nuclear envelope breaks down. It is positive for the purple Pros/Dl stain, which makes it hard to tell if it is Pros+ or Pros- even though the authors state that cells with weak GFP are Pros- in line 104 (see the point above regarding confusing same-color stain for ISC and EE markers). Could cell #2 be a pre-EE that is undergoing mitosis since the lineage tracer marks both EE and pre-EE cells (line 119)? Or do the authors mean recombination on one or both homologs? This should not be possible since the cells are heterozygotes for the Ubi-GFP locus.
      • Fig. 2C, if past-EE's increased in number while current EE's stayed the same, where are new past-EE's coming from? There cannot be compensatory proliferations since EE's are post-mitotic. For fate conversion, one would expect the generation of each past-EE to accompany loss of one current EE.
      • Fig. 2C, the number of past EE's increased transiently so that baseline number is restored at 18 hr after IR. The authors conclude that fate plasticity is a transient event. Can they rule out loss due to cell death?
      • Fig. 2E. Dl+ past-EE cell number declined at 14 and 18 h after IR and because cell sized increased, the authors conclude that EE cells that de-differentiated into ISCs subsequently re-differentiated into EC's. To reach this conclusion, the authors should count past-EEs that are positive for EC markers. Cell size alone is insufficient evidence.
      • Fig. 6. Where are the % numbers for ISC, EB and EE's coming from? And wouldn't these change with time after IR, etc?
      • They authors interpret fate-conversion as beneficial for tissue repair but never test whether blocking this process impairs recovery or organismal survival or whether promoting it improves outcomes.
      • Related to the above, it would be helpful to know if fate-converted cells function as true ISCs or ECs (e.g., through proliferation or absorption assays).

      Minor comments

      • Improve Figure 1B: Pros and Dl are shown in the same color, creating confusion. If both are stained together, different colors or clearer labeling should be used. Clarify how cells are identified as Pros+ vs Dl+.
      • Why is Dl (supposed to be cytoplasmic) overlapping with nuclear GFP in cells #3 and 4 in Fig. 1B?
      • Fig. S1E and F. Very hard to see what the authors describe about Arm and Cora. One problem is that cell boundaries are not visible, just the nuclei, so it is hard to know whether cell-cell interactions the authors describe as normal are really normal. Another problem is the overlap of Arm (supposed to be cytoplasmic) with the nuclear GFP signal. What is that?
      • Include a simple schematic of ISC to EE/EC lineages for readers unfamiliar with Drosophila gut biology.
      • Discuss the regional difference in Xrp1 efficacy (R2a vs R2b). Is there something known about gene expression differences in different gut regions that can explain the results?
      • Consider moving scRNAseq (Fig. S1G) into main paper: this is a central part of the conclusion.
      • Line 230, Fig S3F-G should be Fig S3G-H.

      Significance

      Xrp1 is known to have a role in DNA Damage Responses and in cell competition and to function in the context of the p53 network, but this is the first time its role in fate conversion has been demonstrated. For the most part, the data are convincing and include strong genetic evidence from loss- and gain-of-function approaches that demonstrate a role for Xrp1 in activating progenitor gene expression and fate conversion. However, there are several experimental and presentation issues that need to be addressed first as outlined in the previous sections.

      The work highlights how mature cells may revert to stem-like states in response to injury, a theme with broad relevance in regenerative medicine.

      My field of expertise lies in DNA damage responses in Drosophila and human cancer models.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Adult tissue homeostasis refers to the process by which tissues maintain a stable and functional state over time. This usually depends on stem cell activity and the balance between cell proliferation and differentiation to ensure that tissues can repair damage, replace old or dead cells, and maintain their structure and function.

      Damage-induced plasticity plays an important role in restoring tissue homeostasis. Cellular plasticity is the ability of differentiated cells to acquire alternative phenotypic identities. It is typically constrained under homeostatic conditions but can be activated in response to tissue damage to support regeneration. In this study entitled "Xrp1 drives damage-induced cellular plasticity of enteroendocrine cells in the adult Drosophila midgut", Qian Q. et al., describe damage-induced plasticity of secretory enteroendocrine cells (EEs) in the adult Drosophila midgut. They found that ionizing radiation enhances EE plasticity, enabling EEs to dedifferentiate into intestinal stem cells (ISCs), which subsequently re-differentiate into absorptive enterocytes (ECs). Mechanistically, radiation triggers the expression of Xrp1, a stress-responsive transcription factor, within EE lineages. Xrp1 upregulation is necessary for initiating EE plasticity by expressing progenitor specific genes (like escargot for example), as verified by single-cell RNA sequencing of midguts with EE-specific Xrp1 overexpression. This is suggesting that Xrp1 reprograms EEs by promoting progenitor-like transcriptional states.

      The authors nicely describe the dedifferentiation of EEs using the G-TRACE system in response to irradiation and the role of Xrp1 in this process. Yet, the authors need to show the requirement of the EEs dedifferenciation during regenerative growth.

      Major comments:

      • The authors should investigate the regenerative growth of the adult midgut after irradiation. Is there an impact on ISCs proliferation or cell turn over. Is Xrp1 in EEs required in this adaptive response. It would be elegant to use the recently generated tracing method by Tobias Reiff lab to observe overall impact on tissue renewal (rapport-tracing esglexReDDM esg-lexA, 13xLexAop2-CD8::GFP, 13xLexAop2-H2B::mCherry::HA, tub-Gal80ts on the second chromosome. It can be combined with any EEs Gal4-driver (see Nat Commun 2025, https://doi.org/10.1038/s41467-024-55664-2, the stock is already existing, see table1). This reviewer thinks that it is a key experiment to support the proposed model.
      • It is surprising to observe EEs dedifferentiation at a steady state during homeostasis, a condition in which Xrp1 is not detected in the gut. Can the authors comment this point in the discussion?

      Minor comments:

      • Is p53 required for Xrp1 induction in the gut after irradiation?
      • Xrp1 is existing as a short of long isoforms. The short form has been recently proposed to be required for cell competition (https://doi.org/10.1101/2025.06.15.659587) whereas Xrp1 long isoform may be responsible for reduced cell growth. Could the authors test which isoform is induced in the gut after irradiation? Is the overexpression of Xrp1 long isoform having the same effect that the short isoform used by the authors.
      • Xrp1 over expression has been shown to induce upd3 ligand and nutrient-driven dedifferentiation of enteroendocrine cells is occuring by activation of the JAK-STAT pathway (DOI: 10.1016/j.devcel.2023.08.022). Could the authors test the function of this signaling pathway during irradiation (upd3-lacZ and Stat-GFP can be used in parallel of upd3 RNAi and UAS Dome-DN.
      • Xrp1 is known for its role in cell competition and elimination of looser cells by induction of apoptosis. It would be interesting to check for induction of cell death and/or caspase activation in the fly gut after irradiation and verify a non apoptotic role of DRONC activation in this context using a Dronc RNAi (as proposed by Bergmann lab (https://doi.org/10.1038/s41598-021-81261-0) or Baena-Lopez lab (DOI: 10.15252/embr.201948892)). Overexpression of Xrp1 could be combined with UAS-p35.
      • Line 221: fig S3E should be S3F
      • Line 230: fig S3F-G should be S3G-H
      • The posterior gut region R4 is more proliferative than the anterior part and is usually used for testing regenerative growth. What is happening there after irradiation?

      Significance

      Altogether, the paper present compiling lines of evidence supporting the proposed model. The experiments are well designed and are convincing. The papers is interesting and relevant for a broad audience.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Based on the below reviews, we propose the following revision plan. Briefly:

      • We will re-focus the manuscript on the developmental data providing a molecular and cellular blueprint __of lining macrophage development. The __novelty and relevance of our developmental data have been highlighted by all three reviewers, and they have also praised the rigor of these experiments and their interpretation. We thus believe that this re-focus will improve the manuscript's message.
      • We will include our data on CSF1 as a key signal. Whilst previously appreciated as a factor required for tissue-resident macrophages, including those in the joint, our study is the first to show the requirement of lining macrophages over a complete developmental time course, using modern readouts, and in a model that circumvents the limitations of previously used approaches (see point-by-point response for details).
      • However, we will remove the functional data on TGFβ signaling and mechanical loading/mechanosensing. We agree with the reviewers that we would need to generate additional histological and molecular data from conditional knockout mice, antibody and (ant)agonist treatments and the optogenetic model to determine their exact involvement in lining macrophage maturation. These experiments require significant time and other resources. We would therefore like to uncouple this question for a follow-on manuscript, and to re-focus the current study as a developmental atlas. Removal of (some) of these data has been suggested in the reviewers' comments as well.
      • To further elevate our developmental atlas, we are proposing to include additional data and new analyses delineating the developmental dynamics of synovial fibroblasts on single cell (transcriptomic) level. This change to the original manuscript had not been requested by the reviewers, but we are proposing this pro-actively because we believe this would be an impactful addition to a revised version of our study, providing data also on the maturation of the synovial (lining) macrophage niche. Again, this will re-focus the manuscript on the developmental data and provide a novel, valuable resource for those interested in joint biology.
      • We will otherwise respond to all individual reviewer comments and implement the requested changes, unless technically not possible. We are convinced that this revision plan will result in a manuscript that fits very well with the remit of Genes & Development.

      Please find below detailed point-by-point answers.

      Reviewer #1

      Evidence, reproducibility and clarity

      In their manuscript entitled "The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ-dependent but monocyte-independent process," the authors explore the developmental trajectory of synovial lining macrophages. They demonstrate that the formation of this specialized macrophage layer is age-dependent and governed by a distinct developmental program that proceeds independently of circulating monocytes. Through scRNA-Seq, the authors show that synovial lining macrophages originate locally from Aqp1⁺ macrophages and are marked by the expression of Csf1r, Tgfbr, and Piezo1. Notably, genetic ablation of each of these factors impaired the development of lining macrophages to varying degrees, suggesting differential contributions of CSF1, TGFβ, and PIEZO1 signaling pathways to their maturation and maintenance.

      The manuscript is well written, and the data quality and representation is of a high standard. The authors have employed a sophisticated array of state-of-the-art mouse models and cutting-edge technologies to elucidate the developmental origin of synovial lining macrophages. Notably, the supporting scRNA-Seq datasets are of excellence and provide valuable insights that will likely be of significant interest to researchers in the field of immunology and joint biology. Accordingly, the experimental approach and interpretations regarding macrophage origin are well-founded and compelling. However, in the eye of the reviewer, the section addressing the underlying molecular mechanisms is a bit less convincing. This part of the study appears slightly underdeveloped, and some of the mechanistic claims lack sufficient experimental clarity. A more rigorous experimental investigation would be essential to reinforce the manuscript's conclusions, particularly concerning the data related to Tgfbr and Piezo1, where the current evidence appears insufficiently substantiated.

      We thank the reviewer for their positive and constructive evaluation of our manuscript. We agree with them (and the other reviewers) that our functional data on the involvement of TGFβ signaling and mechanical loading/mechanosensing are comparably less convincing and substantiated than our developmental data. We are very grateful for their (and the other reviewers') suggestions to provide more support for the involvement of these factors in lining macrophage development. However, we think that carrying this out to the same high standard will require substantial time and other resources. We have therefore decided to uncouple this from the developmental data and pursue this in follow-up work. We will re-focus the current manuscript on the developmental data. We have proposed to the editors to instead include additional data on synovial fibroblast development, to complement our macrophage data and also delineate the maturation of their niche, thereby providing a conclusive developmental atlas.

      Major point:

      1. The numbers of VSIG4⁺ macrophages appear either unaffected or only minimally altered in both Csf1rMerCreMer Tgfbr2floxed and Fcgr1Cre Piezo1floxed mouse models, respectively. This raises an important question: was the gene deletion efficiency sufficient in each model? Accordingly, the authors are encouraged to include quantitative data on gene deletion efficiency for both mouse models, as this information is critical for interpreting the observed phenotypic outcomes and validating the conclusions regarding gene function. Furthermore, to better assess the impact of Tgfbr2 and Piezo1 disruption, the authors should provide more comprehensive flow cytometry analyses and histological data for these mouse models. Given the apparent homogeneity of VSIG4⁺ macrophages (as shown by the authors themselves), bulk RNA-Seq of sorted Tgfbr2- and Piezo1-deficient VSIG4⁺ macrophages (or from TGFβ-treated animals) would offer valuable insights into both the effectiveness of gene deletion and the molecular pathways governed by TGFβ and PIEZO1 in lining macrophages.

      As outlined above, we have decided to uncouple our functional data on TGFβ, Piezo1 and mechanical loading. The points raised here are all very valid, and we will implement your suggestions in our follow-up functional work focusing on signaling events regulating lining macrophage development. On the suggestion to perform bulk RNA sequencing for VSIG4+ macrophages: This is a good one in principle - although we will not be able to use this strategy where we want to assess the consequences of experimental treatments or genetic models on lining macrophage maturation, because acquisition of VSIG4 is a key maturation event that might be impaired in these conditions.

      Minor points:

      Consistent usage of Cx3cr1-GFP+ nomenclature (for instance: Fig. S1 legend "adult mouse synovial tissue, showing PDGFRα⁺ fibroblasts (yellow) and CX3CR1-GFP⁺ cells (cyan)." versus Fig. 1 legend "Automated spot detection highlights Cx3cr1-GFP⁺ macrophages)".

      We will implement these changes.

      Unclear Fig. 3 legend: "Representative immunofluorescence images of synovial tissue from Clec9aCre:Rosa26lsl-tdT mice at 3 weeks and in adulthood, showing and tdTomato (yellow) and stained for DAPI (blue), VSIG4 (cyan)" Check 'showing and tdTomato.'

      We will implement these changes.

      For greater clarity, it would have been helpful if the transcript names had been directly included within Figures 3C, S3A, and S3C.

      We will implement these changes.

      Page 24: "(Mki67CreERT2:Rosa26lsl-tdT)" Last bracket not superscript.

      We will implement these changes.

      Page 25: "we again leveraged our scRNAsequencing dataset" Missing punctuation.

      We will implement these changes.

      Page 27: Fig. 5C legend: " of synovial tissue of 1 week-old, 3 weeks-old and adult mice." Please specify and change to 'adult Csf1rΔFIRE/ΔFIRE mice'.

      We will implement these changes.

      Page 30: The outcome observed in the Acta1-rtTA:tetO-Cre:ChR2-V5fl mouse model appears to be inconclusive: "This approach resulted in an increased density of VSIG4+ and total (F4/80+) macrophages in the exposed leg of some 5 days-old pups, but others showed the opposite trend (Figure S5D)." This variability may reflect low efficiency of the model or other technical limitations (e.g. muscle contractions frequency or time point of analysis). Given this ambiguity, it is worth reconsidering whether the data are sufficiently robust to warrant inclusion. Should the authors choose to include these findings, further experimentation of appropriate depth and precision is required to allow a conclusive interpretation (either it increases the density of VSIG4+ macrophages or not). The same applies to the Yoda1-treated mice, for which additional data are needed to determine whether VSIG4⁺ macrophage density is truly affected.

      We have decided to remove the data on the optogenetic mouse model and Yoda1 treatment and follow-on separately, implementing these suggestions, including proof of concept data for optogenetically induced muscle contractions.

      Significance

      General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed? This is a well-designed study that uses cutting-edge methodologies to investigate the developmental trajectory of synovial lining macrophages under homeostatic conditions. The authors present robust experimental evidence and compelling interpretations concerning synovial macrophage origin, which are both well-substantiated and impactful. Nonetheless, from the reviewer's perspective, the section exploring the molecular mechanisms underlying macrophage differentiation is comparatively less convincing. This section appears somewhat underdeveloped, as some of the mechanistic claims lack sufficient depth and experimental rigor to fully substantiate the conclusions.

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field: In contrast to earlier studies (PMID: 31391580, 32601335), the inclusion of fate-mapping experiments adds an important dimension, offering novel insight into the ontogeny of synovial macrophages. This expanded perspective may prove particularly valuable in advancing our understanding of joint immunology, especially regarding the local origins and lineage relationships of macrophage populations.

      Furthermore, the authors present novel insights into the molecular pathways underlying the differentiation and development of synovial lining macrophages. By demonstrating previously unrecognized regulatory mechanisms, this work significantly deepens our understanding of the cellular and transcriptional programs that drive macrophage specialization within the joint microenvironment.

      Place the work in the context of the existing literature (provide references, where appropriate): This study builds upon previous work characterizing the macrophage compartment in the joint (PMID: 31391580, 32601335), yet provides a substantially more comprehensive dataset that spans multiple developmental time points and data on the origin of this specialized macrophage subset.

      State what audience might be interested in and influenced by the reported findings: Immunologist, clinicians

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. This study falls well within the scope of the reviewer's expertise in innate immunity.

      Reviewer #2

      Evidence, reproducibility and clarity

      In the manuscript „The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ- dependent but monocyte-independent process", Magalhaes Pinto and colleagues carefully employ a wide range of technologies including single cell profiling, imaging and an exceptional combination of fate mapping models to characterize the ontogeny and development of lining macrophages in the joint, thus dissecting their maturation during postnatal development. Over the last decade, several landmark studies highlighted the imprinting of tissue-resident macrophages by a combination of ontogenetic and tissue-specific niche factors during development. So far, the ontogeny and the tissue niche factors governing the development and maturation of lining macrophages have not been described. Therefore, the results of this study offers insights on a small highly adapted macrophage population with relevance in many disease settings in the joint. Furthermore, the findings are nicely showcasing how macrophages are specializing to even very small tissue niches across development within one bigger anatomical compartment to serve dedicated functions within this niche.

      This manuscript is beautifully written and highlights many novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages.

      In particular, the combination of scRNA-seq and fate mapping is providing a unique the link of transcriptional programs to ontogeny within the tissue niche. Furthermore, the integrative use of distinct fate mapping strategies, transgenic mouse lines, and treatment paradigms to elucidate key niche factors guiding the development and maturation of lining macrophages provides many interesting findings and data that are highly relevant to the field. I really enjoyed reading this manuscript.

      Thank you for your complimentary and constructive assessment of our manuscript, and the detailed comments below, which are very helpful. Please find point-by-point responses below.

      Major points:

      The authors show dynamic regulation of VSIG4 in lining macrophages during development, therefore VSIG4 is maybe not an ideal choice for gating strategies to define lining macrophages or to show as a single markers in immunofluorescence (IF) stainings to demonstrate their abundance across development (even though it is clear that this is the reason why the F4/80 staining is shown next to it). To demonstrate the increase of lining macrophages during development in IF, it would be more helpful if the authors would show quantifications of all F4/80+ cells and additionally VSIG4+ as a proportion of F4/80+ cells (or VSIG4+ F4/80+ and all F4/80+ in a stacked bar plot). We agree with the assessment of VSIG4 not being ideal since this is a key marker of mature lining macrophages only.

      We agree with the assessment of VSIG4 not being ideal since this is a key marker of mature lining macrophages only. We will provide additional data and analyses.

      In Figure 1C, the authors nicely demonstrate that the lining macrophages get closer in their distance across development to build the epithelial-like macrophage structure along the adult lining. Is the close proximity between lining macrophages already fully "matured" at 3 weeks of age and comparable to adults? Please quantify the distance in adult linings.

      We will provide additional data for adult joints.

      Can the authors explain how the grouping was performed between the analyzed human fetal joints? It is not clear why the cut was chosen between the groups at 16/17 weeks of age. Maybe it would be also beneficial if the authors would consider not grouping these samples but rather show the specific quantifications for each samples individually and estimate via linear regression the expansion over time across human development. Furthermore, can the authors give additional information about the distancing of lining macrophages in the human fetal samples, it would be great to see if they follow the same dynamics as in mouse. Maybe comparison to human juvenile/adult joints would also add on to substantiate the findings in human samples (if possible).

      We will show samples ungrouped and perform new linear regression analysis as suggested.

      The scRNA-seq analysis leaves several questions open and some conclusions and workflows cannot be easily followed.

      We appreciate this comment and the complexity of the data, and will implement the below recommendations, and clarify the issues raised. Detailed:

      a. It is not clear how and especially why the signature genes to define macrophages vs. monocytes were chosen. Especially as the signature genes for monocytes would not include patrolling monocytes and the macrophage signature genes seem to be highly regulated during development, see also Apoe expression in NB vs. adult in Figure S2e. Why did the authors not take classical markers such as Itgam, Fcgr1a, Csf1r?

      We will include new analyses using these markers.

      b. Can dendritic cell signatures be excluded? Cluster 11 and 12 show indeed some DC markers, are these really macrophages?

      We will include new analyses to account for DC markers.

      c. The authors provide several figure panels showing TOP marker genes or key marker genes for the identified clusters, however it is not clear if these are TOP DE genes or if the genes were hand chosen. Somehow, the authors give the impression that the clusters were chosen and labeled not based on DE genes, but more on existing literature that previously reported these macrophage populations. DE gene lists for all annotated cell types and macrophage clusters need to be provided within the manuscript.

      We will provide the full DEG analysis results.

      d. The authors claim that Clusters 1 and 4 are "developing" macrophages. How is this defined? Why are these developing cells compared to other clusters? And why are these clusters later on not considered as progenitors of Aqp1 macrophages and Vsig4 macrophages? Why are Aqp1+ macrophages not labeled as developing when they are later on in the manuscript shown as potential intermediate progenitors of lining macrophages?

      As per below comment, we will expand on this and clarify nomenclature and (potential) relationships between these and other macrophages.

      e. Furthermore, it is again confusing that markers are used throughout Figure 2 which are labeled as "key marker genes" for a population and then later on they are claimed to be regulated during development within this population, see for example Figure 2D and 2H.

      We will clarify this as per above answer.

      f. It is appreciated that the authors distinguished cycling clusters such as 8, 9, and 10 based on their cycling gene signature. Here it would be very exciting to see a cell cycle analysis across all clusters and time points to see when exactly the cells are expanding during development; this would also substantiate the data later shown for the Mki67-CreERT2 mouse model.

      We will perform the proposed cell cycle analysis, and implement this and the other reviewer's suggestions for marker selection and cluster annotation (this is also covered in below comments from other reviewers).

      g. Can the authors identify certain gene modules during development of lining macrophages (and/or their progenitors) which are associated with certain functions (e.g. GO terms, GSEA enrichment)?

      This will be included in the revised manuscript.

      To determine the actual presence of the identified macrophage clusters from the scRNA-seq as macrophage populations in the joint, the authors should perform IF or FACS for key markers. Especially, Aqp1+ macrophages should be shown in the developing joint.

      We will provide additional data on Aqp1+ macrophages in the developing joint, and related these to a study by collaborators currently in revision at Immunity, which characterizes the Aqp1+ population in detail (we are hoping to have a doi available during our revision process).

      The authors used a wide range of fate mapping models, which is quite unique and highly appreciated. The obtained results and the conclusions made from the models raise a couple of questions: Whereas contribution of HSC-derived/monocyte-derived macrophages to the lining compartment seems to be minor, there is still labeling across different models. Various aspects would need to be clarified.

      We will clarify these data throughout as per below suggestions.

      a. For example, the authors employ Ms4a3-Cre as a tracing model for GMP-derived monocytes, however all quantifications of the labeling efficiency are not normalized to the labeling in monocytes or another highly recombined cell population. This should be shown, similar to the other fate mapping models (Figure 3 F-I).

      Labelling efficacy for Ms4a3-Cre is near complete for GMP-derived monocytes (and neutrophils) with the Rosa-lsl-tdT (aka Ai14) reporter we have used (see also PMID: 31491389 and doi: 10.1101/2024.12.03.626330); but we will include normalized data as requested.

      b. Please show Ms4a3 expression across clusters across time points, to exclude expression in fetal-derived clusters.

      We will include this in the revised supplementary information, but there is indeed very little at birth (in line with the original report for other tissues PMID: 31491389).

      c. In line with the question raised above, if the authors can exclude a development of the Egfr1+ and Clec4n+ developing macrophages into Aqp1+ macrophages and subsequently into Vsig4 lining macrophages, the obtained data from the Ms4a3-Cre model highly suggests a correlative labeling across these clusters what could implicate a relation. However, the authors do not discuss throughout the manuscript the role of these developing macrophages. It is highly encouraged to include this into the manuscript and it would be of high relevance to understand lining macrophage development.

      This is an interesting point and we agree it deserves consideration in the revised manuscript. Indeed, our trajectory analyses do not predict differentiation of the Egfr1+ and Clec4n+ developing macrophages into Aqp1+ macrophages, and hence, ultimately lining macrophages. Conversely, Aqp1+ cells might also convert into Egfr1+ and Clec4n+ developing macrophages. We will elaborate on this more in the revised manuscript.

      d. The authors conclude from the pseudo bulk transcriptomic profiling of the different macrophage clusters that TdT+ and TdT- macrophages do not differ in their gene expression profile and that this is due to niche imprinting rather than origin imprinting. Even though the data supports that conclusion, the authors should verify if inkling cells early during development also show this similar gene expression profile and gene expression should be compared at the different developmental time points. Tissue niche imprinting is happening within the niche during development, most likely in a stepwise progress, and therefore there should be differences in the beginning.

      This is another important point that we will address in the revised manuscript by performing additional differential gene expression analyses at the different developmental time points, including the earliest stages, as suggested.

      The trajectorial analysis using different pseudotime pipelines is very interesting and nicely points out the potential role of Aqp1 macrophages as intermediates of Vsig4 lining macrophages. From my point of view, all trajectories seem to suggest that Egfr1 developing macrophages and Clec4n developing macrophages might differentiate into Aqp1 macrophages, however the authors are not exploring this further and the role of both developing macrophage clusters is not further discussed (see also comments above).

      We will address and discuss this in the revised manuscript.

      How was the starting point of the trajectorial analyses defined and is it the same for each pipeline used?

      We will clarify this in the revised manuscript.

      Are there potentially two trajectories? It looks like there is one in the beginning of postnatal life and a second one appearing from the monocyte-compartment later in life. If this is true, that would rather speak for a dual ontogeny of Vsig4+ macrophages, wouldn't it?

      We will discuss this in the revised manuscript.

      A heatmap (transcriptional shift) of trajectories between more clusters should be shown at least for Cluster 0,1,2, and 3. It is not sufficient to demonstrate this only between two clusters.

      We will add these analyses during revision.

      To show the similarity between Aqp1 macrophages and proliferating macrophage clusters, the authors should remove the cycling signature and compare these clusters to show that the cycling cells might be Aqp1 macrophages or earlier developing macrophage progenitors aka Clec4n or Egfr1 macrophages.

      We will address this in the revised manuscript.

      The conclusions made from the Mki67-CreERT2 data are a bit difficult to understand, whereas all progenitors (monocyte progenitors and macrophage progenitors will proliferate at the neonatal time point and no conclusions can be made if the cells expand in the niche. The authors should employ Confetti mice or other models (Ubow mice) to analyze clonal expansion in the niche.

      We acknowledge that interpretation of the Mki67-CreERT2 data is complicated by labeling of other cells, and notably, labeling observed in BM-derived cells. To complement the Mki67-CreERT2 data, and specifically account for proliferation of BM-derived cells, we have tried using Ms4a3-Cre:Ubow mice to quantify expansion of the few monocyte-derived macrophages in the joint (lining). However, this yielded

      All predicted cell-cell interactions between macrophages and fibroblasts should be provided in a supplementary table. Are the interactions shown in Figure 5 chosen interactions or the TOP predicted ones? Whereas the authors show different numbers of interactions, it is most likely hand-picked and therefore biased.

      We will provide a full list of all predicted interactions in the revised supplementary material in addition to a list of the full differential gene expression analysis.

      The authors further aim to dissect the factors involved in the developmental niche imprinting of lining macrophages. Even though it is highly appreciated that the authors used so many experimental setups to show the reliance of lining macrophages on Csf1 and TGF-beta as well as mechanosensation, the wide range of models the different methods used and selected developmental time points make it very difficult to really interpret the data. The authors should carefully choose time points and methods (either FACS analysis across all models or IF across all, or both). Often deletion efficiencies for transgenic models and proof of concept that the inhibitors and agonists are working in the treatment paradigm are not provided. For example, Csf1rMer-iCre-Mer Tgfbr2fl/fl mice are used but no deletion efficiency is shown or different time points of analysis, maybe the macrophages are not properly targeted in the set up.

      We have decided to uncouple our experimental data on Tgfb, Piezo1 and mechanosensing/mechanical loading, but are taking this into consideration for revision. In many cases, we have in fact performed flow cytometry and imaging analyses, and agree, we should be showing this consistently.

      The authors have shown the role of Csf1 and Tgfbr2 only for lining macrophages, is this specific in the joint to this population of are subliming macrophages affected in a similar manner.

      We will include data on sublining macrophages in the revised figure (for CSF1; Tgfb data will be uncoupled from this current manuscript).

      Can the authors confirm their results in CSF1R-FIRE mice with anti-Csf1 injections or in Csf1op/op mice?

      We will expand our discussion of the Csf1 findings, and aim to include data for anti-CSF1 antibody treatment during revision. Csf1 has previously been reported as a key factor required for maintenance of tissue-resident macrophages, including those in the joint (lining). Indeed, Csf1op/op mice are deficient in synovial lining macrophages, from 2 days of age onwards (PMID: 8050349), and lining macrophages are also absent from 2-weeks-old and adult Csf1r-/- mice (PMID: 11756160). However, a full developmental analysis has not been performed. We are thus the first to show a full developmental time course, using state-of-the-art experimental readouts, and specifically focusing on the early postnatal window of lining maturation that we have identified here in this study. Moreover, we have used a more specific model, Csf1rFIRE ko, in which Csf1 deficiency is restricted to myeloid cells. This model circumvents issues with other models, which show many developmental defects, some of which unrelated to macrophages. These include growth retardation and skeletal defects, which may influence joint macrophage development. Therefore, although Csf1 dependence of synovial lining macrophage had indeed been previously reported in principle, our data substantially expand on and solidify these findings, thereby adding novelty.

      The setup in Figure S5G is very interesting to test the role of movement and mechanical load on the joint, however, there is basically no data on the model provided showing the efficiency of the induced optogenetic muscle contractions, and only one time point is shown.

      Data on mechanical loading will be uncoupled from the current manuscript and substantiated in a separate follow-up.

      The results regarding the role of Piezo1 and mechanosensation vary a lot. Could it be that analyses were done too early or that actually proper weight load on the joint must be applied for the maturation of the macrophages? The authors should test this to.

      We will uncouple these data from the current manuscript during revision in order to investigate the contribution of these (and other) factors in sufficient detail. However, this is a possibility that we have discussed. In fact, the most appropriate experimental approach to address the involvement of mechanical loading, onset of walking and specifically, weight bearing would be a loss-of-function approach (i.e. paralysis at the newborn stage), for which we unfortunately could not obtain ethics approval from the UK Home Office.

      The Rolipram experiment is shown in Figure S5G, but is not described in the result section. It only appears at some point in the discussion part. The authors should move it to results or remove it from the manuscript.

      We will incorporate these data with the revised section on developing synovial macrophage populations.

      Minor points:

      Please reference the Figure panels in numeric order throughout the text.

      We will change this where not the case already.

      Figure 2a and 2b are a bit out of the storyline, it is not obvious why this is shown here and maybe it would be good to move it to the supplements. Gating strategy is also not used for scRNA-seq. Therefore, it would better fit to the later analysis of joint macrophages across different transgenic mouse models and treatment paradigms. The gating strategies are changing across different experiments throughout the figures, it would be nice to have a similar gating strategy for all experiments, see also Figure 3 where the defining markers for joint macrophages are changing between models.

      We will revise Figures 2, 3 and the related supplementary figures.

      A lot of figure panels have very small labeling that is basically unreadable. Axes at FACS plots for example. Sometimes, it is even impossible to distinguish cluster labels especially when they have similar colors.

      We will revise this, thanks for pointing it out.

      In the text on page 14, many markers are named which are specifically regulated during development in lining macrophages, but these factors are not labeled anywhere in the volcano plot. It would be good to showcase at least some of these named genes in the figure panel, e.g. Trem2.

      We will do this for revision.

      Figure 2F and Figure S2F are really nicely showing the percentage of cells per cluster in each analyzed biological sample. Maybe the authors could additionally consider to show a stacked bar plot with the mean percentage of cells per cluster and how the clusters are distributed across time points?

      We will include this in the revised manuscript.

      Figure 3A: IF for adult lining macrophages and the quantification are missing.

      This will be included in the revised version.

      Reviewer #3 - Major

      Generally, the story could be more streamlined by introducing earlier reporter lines and lineage-origin logic. Clearly state which reporter/CreERT2 lines and acrosses are used. It was unclear in Figure 2 that cells of the cross of the Cx3cr1-GFP and Ms4a3Cre:Rosa26lsl-tdT reporter lines were used for the scRNA-seq. The principle that there are fetal-derived and bone marrow (GMP)-derived monocytes and macrophages doesn't need to be "hidden" until Figure 3. For example, also the imaging of Ms4a3Cre could be introduced before the scRNA-seq.

      We will revise the structure and order of the manuscript during revision. However, we will streamline this between reviewer comments, and would also like to point out that the 2 other reviewers were very complimentary about the writing and clarity, i.e. we may not follow every specific suggestion of reviewer 3, but are very much taking on board their overall comment on structure and clarity.

      Figure 1 could benefit from a cartoon visualizing the anatomy of the knee joint. The terms "sublining" and "synovium" are now a bit unclear, as it appears that sometimes the synovium is indicated as sublining and vice versa. Additionally, a schematic developmental timeline could be added to indicate the parallels between mouse and human development (fetal and postnatal development in mouse versus gestational age in human). Also, the various waves of hematopoiesis could be indicated in this timeline, which would be particularly helpful for Figure 3 for the lineage-tracing readouts. Lastly, the authors could end the manuscript (a new Figure 6) with a general cartoon summarizing all the results presented.

      We will include these illustrations as suggested.

      Figure 1 could be rearranged: first introduce the markers CX3CR1 and VSIG4 (Figure 1D) and then present the quantifications (Figure 1B/E). Where possible, co-visualization CX3CR1-GFP and VSIG4 on tissue sections to strengthen the claims on the relationship between these 2 markers. Tying the scRNA-seq insights (Figure 2) to the imaging would be elegant. Moreover, it would be informative to represent the CX3CR1+ and VSIG4+ macrophages as a percentage of F4/80+ macrophages (Figure 1B/E). Similarly, for the flow cytometry data in Figure 2, the relationship between the markers CX3CR1 and VSIG4 on macrophages could be more clearly displayed and discussed.

      Thanks for this remark. We will endeavour to show co-localization and analysis of both markers wherever possible. However, where we did not use Cx3cr1gfp mice, co-staining was limited by antibody choice and availability.

      The 3D imaging of the joint is a nice addition to the manuscript, as it provides more context to the anatomical structure; however, while the text suggests several newborn joints were imaged, Figure 1F visualizes (again) the knee joint. Could other joints also be represented by 3D imaging? If the knee joint is the only joint available for imaging, and previous confocal imaging focused specifically on the meniscus in the knee joint, could the meniscus also be highlighted in the lightsheet imaging?

      Apologies if this was not clear from the original manuscript text, but we have only imaged the knee joint in 3D. We will clarify this during revision. Whilst we want to maintain the focus on knee joints throughout this manuscript, but we will include additional 3D lightsheet imaging data from micro-dissected knee joints to further substantiate the original data.

      Clarification is requested regarding the imaging quantification representation. The M&M section under "Statistical analysis and reproducibility" states that individual data points are displayed, and bars represent the mean. However, some of the Figure legends (e.g., Figures 1B and S1C) specify that each dot corresponds to an individual mouse, with quantification based on 2-3 sections per mouse. While this appears to be a very reasonable representation of the data, does this mean that for each dot, the mean value from the 2-3 sections per mouse was calculated and plotted?

      We will clarify this.

      It is not clear how the differential expression analysis was performed on the Vsig4+ cells. Please specify if Cluster 0 was used for analysis, or all Vsig4-expressing cells? Not all cells in Cluster 0 have Vsig4+ expression. The authors described the expression dynamics of Aqp1 as intriguing, but lack a reasoning on why this is interesting.

      We will revise this section.

      Figure S3E: In line with the previous comment, can the authors justify that the tdTomato+/- comparisons are not biased by scRNA-seq dropout (scRNA-seq is zero-inflated, so some tdTomato- cells could be false negatives), and provide methodological details (thresholds, ambient RNA correction, etc.) to support this?

      We will clarify this and include additional representations of the tdTomato transcript data.

      Although the sex-related differences in macrophage composition and the absence of differential expression are interesting, they distract from the manuscript's main messages. Moreover, the Discussion does not elaborate on how these observations relate to joint (disease) biology. Consider removing this section or integrating it clearly into the relevant biological context.

      We will remove this section as suggested.

      CreERT2 transgenic lines are often not 100% efficient in recombination, also depending on whether tamoxifen or 4-OHT is used. Could the authors report the percentage of tdTomato+ cells in the joints and compare them to the recombination efficiencies in the monocytes/microglia under the same tamoxifen or 4-OHT conditions? This would help clarify how the interpret the macrophage labeling %'s.

      We will report labelling efficacies and/or show normalized data in the revised manuscript.

      Could the authors draw parallels between the observations in the mouse knee joint macrophage populations and literature on other joints in mouse and the knee joint in human (for example, as described in Alivernini et al., 2020 and in the very recent Raut et al., 2025)?

      We will include a section on this in the revised manuscript.

      Reviewer #3 - Minor comments:

      In general, the authors should clarify in the Results what each marker used for imaging, flow cytometry, or in the mouse reporter lines delineates. For example, mention that F4/80 is a marker for tissue-resident macrophages (correct?) in immunofluorescence, that IBA1 is a marker for macrophages on human tissue sections (Figure S1), and PDPN is GP38 (Figure S2 - align usage of marker reference across main text and figures).

      We will implement this request.

      Figure S1B: Is CX3CR1 also restricted to the lining macrophages in human? Could a co-staining with IBA1 be performed to strengthen the species similarities?

      To our knowledge, there is no antibody available that works for imaging of human CX3CR1. Moreover, CX3CR1 is only limited to the lining population in adult joints, in fetal and newborn (mouse) joints, all macrophages express this receptor, as do fetal progenitors to macrophages. However, Alivernini and colleagues have reported that TREM2high macrophages are the human counterpart of the mouse CX3CR1+ lining population (PMID: 32601335). We do not have access to postnatal human joint tissue samples, unfortunately, but we will attempt to stain for and quantify TREM2+ macrophages in human fetal joints for the revised manuscript.

      Adipocyte diameter quantification: Avoid plotting individual adipocytes from 2 mice without per-mouse visualization. Instead, report the mean adipocyte diameter per mouse and plot those means.

      We will implement this change.

      A little typo was spotted in the "Statistical analysis and reproducibility" section: it is Dunn's, not Bunn's multiple-comparison correction.

      Thanks for spotting this.

      Figure 2A: The gating strategy for the CX3CR1-GFP cells is missing.

      We will provide this in the revised manuscript or supplementary material.

      Improve the visualization of some plots. For example, Figure 2F is hard to read because of the big dot size. The dots seem to add no information to the graph and could be removed. Additionally, for comparing the clusters across the different time points, one could project the cells from the other time points in grey in the background.

      We will revise the presentation of these data.

      Figure S2: The dotplot is more informative than the heatmap, consider removing the heatmap.

      We will do that.

      Figure 3A: If technically feasible, image and visualize both the GFP and tdTomato expression. It would be informative to see the Cx3cr1+ and Ms4a3-derived cells in the same specimen.

      We will strive to show this in the revised manuscript.

      Figure 3C: Highlight that tdTomato expression is visualized here.

      We will do that.

      Figure 3G,F: The authors should place the schematics and graphs next to each other, so the data points can be more easily compared.

      We aim to do this in the revised manuscript.

      Figure 4B: Which co-staining was performed for the immunofluorescence to quantify the % of tdTomato+ cells?

      We co-stained for F4/80 and assessed localization in the lining or sublining. This will be clarified in the revised Figure legend.

      Figure 4C: The trajectory analysis appears to have an arrow pointing from the Ccr2+ macrophages to the Ly6c+ monocytes. Please verify this directionality, as its seems against the known biology.

      This will be addressed during revision.

      Figure 5 mentions that the Csfr1 levels were reduced in a tissue-specific manner, but it is unclear how this tissue specificity was achieved.

      We apologize for this misunderstanding. Csfr1FIRE mice are not tissue-specific knockouts, but they are more specific than global knockout mice, since only a (myeloid-specific) enhancer is affected. We will clarify this in the relevant section.

      For the TGFb perturbations (Tgfbr2 KO and systemic TGFb depletion): did the authors validate reduced TGFb pathway activity in the macrophages, for example, reduced pSMAD2/3 levels? This would validate the effectiveness of the perturbations.

      This is an important point, and assessing signaling events downstream of TGFb is a very good suggestion. As per above comment, we have decided to uncouple the functional data with exception of CSF1 from the revised version of the current manuscript, but we will be taking this into account for substantiating our functional data in follow-up work.

      Figure 5F could benefit from a timeline of the treatment.

      As for 15., we will be taking this into account for follow-up work on the uncoupled functional data.

      The Methods mention that Gene Ontology analysis was performed on the single-cell data, but the results are not plotted in a figure. It would be informative to include this GO/pathway analysis in the appropriate figure(s).

      We will include this in the revised (supplementary) information.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Magalhaes Pinto, Malengier-Devlies, and co-authors investigated the developmental origins and maturation of synovial (lining and sublining) macrophages across embryonic, newborn, and postnatal stages in mouse. The authors used multiple transgenic reporter lines, lineage tracing, scRNA-seq, 2D confocal and 3D lightsheet imaging, and perturbations to delineate the macrophage states and ontogeny. They propose a model in which the majority of the joint lining macrophages has a fetal (EMP-derived) origin and a small proportion has a definitive HSC-derived monocyte origin, which both seed and mature within the synovial space in the postnatal period in the first 3 weeks of life. Using cell-cell communication analysis on their scRNA-seq data, they identified Fgf2, Csf1, and Tgfb as candidate signaling pathways that support (lining) macrophage development and maturation. Functional experiments indicate that the process is CSF1 and TGFb-dependent and also partly dependent on mechanosensing through Piezo1. The key conclusions on the composition of the synovial macrophages are convincing based on the presented results, and are carefully phrased. The study is very comprehensive, yet the description and organization of the results of the different mouse models could be altered to improve the storyline. Several refinements in data presentation, formulation, and minor validation experiments would further improve the clarity of the story, as well as summary recaps of the major findings throughout the text.

      Major comments:

      1. Generally, the story could be more streamlined by introducing earlier reporter lines and lineage-origin logic. Clearly state which reporter/CreERT2 lines and acrosses are used. It was unclear in Figure 2 that cells of the cross of the Cx3cr1-GFP and Ms4a3Cre:Rosa26lsl-tdT reporter lines were used for the scRNA-seq. The principle that there are fetal-derived and bone marrow (GMP)-derived monocytes and macrophages doesn't need to be "hidden" until Figure 3. For example, also the imaging of Ms4a3Cre could be introduced before the scRNA-seq.
      2. Figure 1 could benefit from a cartoon visualizing the anatomy of the knee joint. The terms "sublining" and "synovium" are now a bit unclear, as it appears that sometimes the synovium is indicated as sublining and vice versa. Additionally, a schematic developmental timeline could be added to indicate the parallels between mouse and human development (fetal and postnatal development in mouse versus gestational age in human). Also, the various waves of hematopoiesis could be indicated in this timeline, which would be particularly helpful for Figure 3 for the lineage-tracing readouts. Lastly, the authors could end the manuscript (a new Figure 6) with a general cartoon summarizing all the results presented.
      3. Figure 1 could be rearranged: first introduce the markers CX3CR1 and VSIG4 (Figure 1D) and then present the quantifications (Figure 1B/E). Where possible, co-visualization CX3CR1-GFP and VSIG4 on tissue sections to strengthen the claims on the relationship between these 2 markers. Tying the scRNA-seq insights (Figure 2) to the imaging would be elegant. Moreover, it would be informative to represent the CX3CR1+ and VSIG4+ macrophages as a percentage of F4/80+ macrophages (Figure 1B/E). Similarly, for the flow cytometry data in Figure 2, the relationship between the markers CX3CR1 and VSIG4 on macrophages could be more clearly displayed and discussed.
      4. The 3D imaging of the joint is a nice addition to the manuscript, as it provides more context to the anatomical structure; however, while the text suggests several newborn joints were imaged, Figure 1F visualizes (again) the knee joint. Could other joints also be represented by 3D imaging? If the knee joint is the only joint available for imaging, and previous confocal imaging focused specifically on the meniscus in the knee joint, could the meniscus also be highlighted in the lightsheet imaging?
      5. Clarification is requested regarding the imaging quantification representation. The M&M section under "Statistical analysis and reproducibility" states that individual data points are displayed, and bars represent the mean. However, some of the Figure legends (e.g., Figures 1B and S1C) specify that each dot corresponds to an individual mouse, with quantification based on 2-3 sections per mouse. While this appears to be a very reasonable representation of the data, does this mean that for each dot, the mean value from the 2-3 sections per mouse was calculated and plotted?
      6. It is not clear how the differential expression analysis was performed on the Vsig4+ cells. Please specify if Cluster 0 was used for analysis, or all Vsig4-expressing cells? Not all cells in Cluster 0 have Vsig4+ expression. The authors described the expression dynamics of Aqp1 as intriguing, but lack a reasoning on why this is interesting.
      7. Figure S3E: In line with the previous comment, can the authors justify that the tdTomato+/- comparisons are not biased by scRNA-seq dropout (scRNA-seq is zero-inflated, so some tdTomato- cells could be false negatives), and provide methodological details (thresholds, ambient RNA correction, etc.) to support this?
      8. Although the sex-related differences in macrophage composition and the absence of differential expression are interesting, they distract from the manuscript's main messages. Moreover, the Discussion does not elaborate on how these observations relate to joint (disease) biology. Consider removing this section or integrating it clearly into the relevant biological context.
      9. CreERT2 transgenic lines are often not 100% efficient in recombination, also depending on whether tamoxifen or 4-OHT is used. Could the authors report the percentage of tdTomato+ cells in the joints and compare them to the recombination efficiencies in the monocytes/microglia under the same tamoxifen or 4-OHT conditions? This would help clarify how the interpret the macrophage labeling %'s.
      10. Could the authors draw parallels between the observations in the mouse knee joint macrophage populations and literature on other joints in mouse and the knee joint in human (for example, as described in Alivernini et al., 2020 and in the very recent Raut et al., 2025)?

      Minor comments:

      1. In general, the authors should clarify in the Results what each marker used for imaging, flow cytometry, or in the mouse reporter lines delineates. For example, mention that F4/80 is a marker for tissue-resident macrophages (correct?) in immunofluorescence, that IBA1 is a marker for macrophages on human tissue sections (Figure S1), and PDPN is GP38 (Figure S2 - align usage of marker reference across main text and figures).
      2. For clarity in the microscopy representation, the single channels should be represented in a grey scale.
      3. Figure S1B: Is CX3CR1 also restricted to the lining macrophages in human? Could a co-staining with IBA1 be performed to strengthen the species similarities?
      4. Adipocyte diameter quantification: Avoid plotting individual adipocytes from 2 mice without per-mouse visualization. Instead, report the mean adipocyte diameter per mouse and plot those means.
      5. A little typo was spotted in the "Statistical analysis and reproducibility" section: it is Dunn's, not Bunn's multiple-comparison correction.
      6. Figure 2A: The gating strategy for the CX3CR1-GFP cells is missing.
      7. Improve the visualization of some plots. For example, Figure 2F is hard to read because of the big dot size. The dots seem to add no information to the graph and could be removed. Additionally, for comparing the clusters across the different time points, one could project the cells from the other time points in grey in the background.
      8. Figure S2: The dotplot is more informative than the heatmap, consider removing the heatmap.
      9. Figure 3A: If technically feasible, image and visualize both the GFP and tdTomato expression. It would be informative to see the Cx3cr1+ and Ms4a3-derived cells in the same specimen.
      10. Figure 3C: Highlight that tdTomato expression is visualized here.
      11. Figure 3G,F: The authors should place the schematics and graphs next to each other, so the data points can be more easily compared.
      12. Figure 4B: Which co-staining was performed for the immunofluorescence to quantify the % of tdTomato+ cells?
      13. Figure 4C: The trajectory analysis appears to have an arrow pointing from the Ccr2+ macrophages to the Ly6c+ monocytes. Please verify this directionality, as its seems against the known biology.
      14. Figure 5 mentions that the Csfr1 levels were reduced in a tissue-specific manner, but it is unclear how this tissue specificity was achieved.
      15. For the TGFb perturbations (Tgfbr2 KO and systemic TGFb depletion): did the authors validate reduced TGFb pathway activity in the macrophages, for example, reduced pSMAD2/3 levels? This would validate the effectiveness of the perturbations.
      16. Figure 5F could benefit from a timeline of the treatment.
      17. The Methods mention that Gene Ontology analysis was performed on the single-cell data, but the results are not plotted in a figure. It would be informative to include this GO/pathway analysis in the appropriate figure(s).

      Significance

      This work provides a high temporal-resolution and "spatial" resolution reference map of the ontogeny and maturation of the synovial lining macrophages in the knee joint. It complements existing literature that demonstrated the presence of tissue-resident macrophages in the synovial space and lining (Culemann, et al., 2019 and others) by charting the embryonic-to-postnatal emergence of lining and sublining subsets. In particular, this mouse work identified some key signaling pathways in shaping this tissue compartment. This dataset serves as a robust, steady-state reference for joint pathology and can be implemented with human studies on disease biology of the knee joint (e.g., Alivernini et al., 2020; Raut et al., 2025). Insights into the exact developmental origins, mechanisms contributing to diverse or seemingly similar cell types, and distinct maturation processes are crucial to understanding disease biology, in which developmental processes can be hijacked/reactivated.

      These findings will interest researchers in joint disease biology (osteoarthritis and immune-mediated arthritides such as RA and psoriasis), macrophage development (tissue-resident vs monocyte-derived lineages), the bone/joint microenvironment, and joint mechanobiology.

      The reviewer's expertise is in developmental biology, mesoderm, bone biology, hematopoiesis, and monocyte/macrophage biology in disease

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript „The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ- dependent but monocyte-independent process", Magalhaes Pinto and colleagues carefully employ a wide range of technologies including single cell profiling, imaging and an exceptional combination of fate mapping models to characterize the ontogeny and development of lining macrophages in the joint, thus dissecting their maturation during postnatal development. Over the last decade, several landmark studies highlighted the imprinting of tissue-resident macrophages by a combination of ontogenetic and tissue-specific niche factors during development. So far, the ontogeny and the tissue niche factors governing the development and maturation of lining macrophages have not been described. Therefore, the results of this study offers insights on a small highly adapted macrophage population with relevance in many disease settings in the joint. Furthermore, the findings are nicely showcasing how macrophages are specializing to even very small tissue niches across development within one bigger anatomical compartment to serve dedicated functions within this niche.

      This manuscript is beautifully written and highlights many novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages.

      In particular, the combination of scRNA-seq and fate mapping is providing a unique the link of transcriptional programs to ontogeny within the tissue niche. Furthermore, the integrative use of distinct fate mapping strategies, transgenic mouse lines, and treatment paradigms to elucidate key niche factors guiding the development and maturation of lining macrophages provides many interesting findings and data that are highly relevant to the field. I really enjoyed reading this manuscript.

      Major points:

      1) The authors show dynamic regulation of VSIG4 in lining macrophages during development, therefore VSIG4 is maybe not an ideal choice for gating strategies to define lining macrophages or to show as a single markers in immunofluorescence (IF) stainings to demonstrate their abundance across development (even though it is clear that this is the reason why the F4/80 staining is shown next to it). To demonstrate the increase of lining macrophages during development in IF, it would be more helpful if the authors would show quantifications of all F4/80+ cells and additionally VSIG4+ as a proportion of F4/80+ cells (or VSIG4+ F4/80+ and all F4/80+ in a stacked bar plot).

      2) In Figure 1C, the authors nicely demonstrate that the lining macrophages get closer in their distance across development to build the epithelial-like macrophage structure along the adult lining. Is the close proximity between lining macrophages already fully "matured" at 3 weeks of age and comparable to adults? Please quantify the distance in adult linings.

      3) Can the authors explain how the grouping was performed between the analyzed human fetal joints? It is not clear why the cut was chosen between the groups at 16/17 weeks of age. Maybe it would be also beneficial if the authors would consider not grouping these samples but rather show the specific quantifications for each samples individually and estimate via linear regression the expansion over time across human development. Furthermore, can the authors give additional information about the distancing of lining macrophages in the human fetal samples, it would be great to see if they follow the same dynamics as in mouse. Maybe comparison to human juvenile/adult joints would also add on to substantiate the findings in human samples (if possible).

      4) The scRNA-seq analysis leaves several questions open and some conclusions and workflows cannot be easily followed.

      a. It is not clear how and especially why the signature genes to define macrophages vs. monocytes were chosen. Especially as the signature genes for monocytes would not include patrolling monocytes and the macrophage signature genes seem to be highly regulated during development, see also Apoe expression in NB vs. adult in Figure S2e. Why did the authors not take classical markers such as Itgam, Fcgr1a, Csf1r?

      b. Can dendritic cell signatures be excluded? Cluster 11 and 12 show indeed some DC markers, are these really macrophages?

      c. The authors provide several figure panels showing TOP marker genes or key marker genes for the identified clusters, however it is not clear if these are TOP DE genes or if the genes were hand chosen. Somehow, the authors give the impression that the clusters were chosen and labeled not based on DE genes, but more on existing literature that previously reported these macrophage populations. DE gene lists for all annotated cell types and macrophage clusters need to be provided within the manuscript.

      d. The authors claim that Clusters 1 and 4 are "developing" macrophages. How is this defined? Why are these developing cells compared to other clusters? And why are these clusters later on not considered as progenitors of Aqp1 macrophages and Vsig4 macrophages? Why are Aqp1+ macrophages not labeled as developing when they are later on in the manuscript shown as potential intermediate progenitors of lining macrophages?

      e. Furthermore, it is again confusing that markers are used throughout Figure 2 which are labeled as "key marker genes" for a population and then later on they are claimed to be regulated during development within this population, see for example Figure 2D and 2H.

      f. It is appreciated that the authors distinguished cycling clusters such as 8, 9, and 10 based on their cycling gene signature. Here it would be very exciting to see a cell cycle analysis across all clusters and time points to see when exactly the cells are expanding during development; this would also substantiate the data later shown for the Mki67-CreERT2 mouse model.

      g. Can the authors identify certain gene modules during development of lining macrophages (and/or their progenitors) which are associated with certain functions (e.g. GO terms, GSEA enrichment)?

      5) To determine the actual presence of the identified macrophage clusters from the scRNA-seq as macrophage populations in the joint, the authors should perform IF or FACS for key markers. Especially, Aqp1+ macrophages should be shown in the developing joint.

      6) The authors used a wide range of fate mapping models, which is quite unique and highly appreciated. The obtained results and the conclusions made from the models raise a couple of questions: Whereas contribution of HSC-derived/monocyte-derived macrophages to the lining compartment seems to be minor, there is still labeling across different models. Various aspects would need to be clarified.

      a. For example, the authors employ Ms4a3-Cre as a tracing model for GMP-derived monocytes, however all quantifications of the labeling efficiency are not normalized to the labeling in monocytes or another highly recombined cell population. This should be shown, similar to the other fate mapping models (Figure 3 F-I).

      b. Please show Ms4a3 expression across clusters across time points, to exclude expression in fetal-derived clusters.

      c. In line with the question raised above, if the authors can exclude a development of the Egfr1+ and Clec4n+ developing macrophages into Aqp1+ macrophages and subsequently into Vsig4 lining macrophages, the obtained data from the Ms4a3-Cre model highly suggests a correlative labeling across these clusters what could implicate a relation. However, the authors do not discuss throughout the manuscript the role of these developing macrophages. It is highly encouraged to include this into the manuscript and it would be of high relevance to understand lining macrophage development.

      d. The authors conclude from the pseudo bulk transcriptomic profiling of the different macrophage clusters that TdT+ and TdT- macrophages do not differ in their gene expression profile and that this is due to niche imprinting rather than origin imprinting. Even though the data supports that conclusion, the authors should verify if inkling cells early during development also show this similar gene expression profile and gene expression should be compared at the different developmental time points. Tissue niche imprinting is happening within the niche during development, most likely in a stepwise progress, and therefore there should be differences in the beginning.

      7) The trajectorial analysis using different pseudotime pipelines is very interesting and nicely points out the potential role of Aqp1 macrophages as intermediates of Vsig4 lining macrophages. From my point of view, all trajectories seem to suggest that Egfr1 developing macrophages and Clec4n developing macrophages might differentiate into Aqp1 macrophages, however the authors are not exploring this further and the role of both developing macrophage clusters is not further discussed (see also comments above).

      8) How was the starting point of the trajectorial analyses defined and is it the same for each pipeline used?

      9) Are there potentially two trajectories? It looks like there is one in the beginning of postnatal life and a second one appearing from the monocyte-compartment later in life. If this is true, that would rather speak for a dual ontogeny of Vsig4+ macrophages, wouldn't it?

      10) A heatmap (transcriptional shift) of trajectories between more clusters should be shown at least for Cluster 0,1,2, and 3. It is not sufficient to demonstrate this only between two clusters.

      11) To show the similarity between Aqp1 macrophages and proliferating macrophage clusters, the authors should remove the cycling signature and compare these clusters to show that the cycling cells might be Aqp1 macrophages or earlier developing macrophage progenitors aka Clec4n or Egfr1 macrophages.

      12) The conclusions made from the Mki67-CreERT2 data are a bit difficult to understand, whereas all progenitors (monocyte progenitors and macrophage progenitors will proliferate at the neonatal time point and no conclusions can be made if the cells expand in the niche. The authors should employ Confetti mice or other models (Ubow mice) to analyze clonal expansion in the niche.

      13) All predicted cell-cell interactions between macrophages and fibroblasts should be provided in a supplementary table. Are the interactions shown in Figure 5 chosen interactions or the TOP predicted ones? Whereas the authors show different numbers of interactions, it is most likely hand-picked and therefore biased.

      14) The authors further aim to dissect the factors involved in the developmental niche imprinting of lining macrophages. Even though it is highly appreciated that the authors used so many experimental setups to show the reliance of lining macrophages on Csf1 and TGF-beta as well as mechanosensation, the wide range of models the different methods used and selected developmental time points make it very difficult to really interpret the data. The authors should carefully choose time points and methods (either FACS analysis across all models or IF across all, or both). Often deletion efficiencies for transgenic models and proof of concept that the inhibitors and agonists are working in the treatment paradigm are not provided. For example, Csf1rMer-iCre-Mer Tgfbr2fl/fl mice are used but no deletion efficiency is shown or different time points of analysis, maybe the macrophages are not properly targeted in the set up.

      15) The authors have shown the role of Csf1 and Tgfbr2 only for lining macrophages, is this specific in the joint to this population of are subliming macrophages affected in a similar manner.

      16) Can the authors confirm their results in CSF1R-FIRE mice with anti-Csf1 injections or in Csf1op/op mice?

      17) The setup in Figure S5G is very interesting to test the role of movement and mechanical load on the joint, however, there is basically no data on the model provided showing the efficiency of the induced optogenetic muscle contractions, and only one time point is shown.

      18) The results regarding the role of Piezo1 and mechanosensation vary a lot. Could it be that analyses were done too early or that actually proper weight load on the joint must be applied for the maturation of the macrophages? The authors should test this to

      19) The Rolipram experiment is shown in Figure S5G, but is not described in the result section. It only appears at some point in the discussion part. The authors should move it to results or remove it from the manuscript.

      Minor points:

      1) Please reference the Figure panels in numeric order throughout the text.

      2) Figure 2a and 2b are a bit out of the storyline, it is not obvious why this is shown here and maybe it would be good to move it to the supplements. Gating strategy is also not used for scRNA-seq. Therefore, it would better fit to the later analysis of joint macrophages across different transgenic mouse models and treatment paradigms. The gating strategies are changing across different experiments throughout the figures, it would be nice to have a similar gating strategy for all experiments, see also Figure 3 where the defining markers for joint macrophages are changing between models.

      3) A lot of figure panels have very small labeling that is basically unreadable. Axes at FACS plots for example. Sometimes, it is even impossible to distinguish cluster labels especially when they have similar colors.

      4) In the text on page 14, many markers are named which are specifically regulated during development in lining macrophages, but these factors are not labeled anywhere in the volcano plot. It would be good to showcase at least some of these named genes in the figure panel, e.g. Trem2.

      5) Figure 2F and Figure S2F are really nicely showing the percentage of cells per cluster in each analyzed biological sample. Maybe the authors could additionally consider to show a stacked bar plot with the mean percentage of cells per cluster and how the clusters are distributed across time points?

      6) Figure 3A: IF for adult lining macrophages and the quantification are missing

      Significance

      This manuscript highlights novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages. Furthermore, this study showcases in a very elegant and detailed way the adaptation of macrophage progenitors to a highly specific anatomical tissue niche.

      The manuscript is of high interest to basic scientists focussing on macrophage biology and immune cell development and clinicians and clinician scientists focussing on joint diseases such as RA

      Therefore the manuscript is of interest to a wide community working in immunology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their manuscript entitled "The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ-dependent but monocyte-independent process," the authors explore the developmental trajectory of synovial lining macrophages. They demonstrate that the formation of this specialized macrophage layer is age-dependent and governed by a distinct developmental program that proceeds independently of circulating monocytes. Through scRNA-Seq, the authors show that synovial lining macrophages originate locally from Aqp1⁺ macrophages and are marked by the expression of Csf1r, Tgfbr, and Piezo1. Notably, genetic ablation of each of these factors impaired the development of lining macrophages to varying degrees, suggesting differential contributions of CSF1, TGFβ, and PIEZO1 signaling pathways to their maturation and maintenance.

      The manuscript is well written, and the data quality and representation is of a high standard. The authors have employed a sophisticated array of state-of-the-art mouse models and cutting-edge technologies to elucidate the developmental origin of synovial lining macrophages. Notably, the supporting scRNA-Seq datasets are of excellence and provide valuable insights that will likely be of significant interest to researchers in the field of immunology and joint biology. Accordingly, the experimental approach and interpretations regarding macrophage origin are well-founded and compelling. However, in the eye of the reviewer, the section addressing the underlying molecular mechanisms is a bit less convincing. This part of the study appears slightly underdeveloped, and some of the mechanistic claims lack sufficient experimental clarity. A more rigorous experimental investigation would be essential to reinforce the manuscript's conclusions, particularly concerning the data related to Tgfbr and Piezo1, where the current evidence appears insufficiently substantiated.

      Major point:

      • The numbers of VSIG4⁺ macrophages appear either unaffected or only minimally altered in both Csf1rMerCreMer Tgfbr2floxed and Fcgr1Cre Piezo1floxed mouse models, respectively. This raises an important question: was the gene deletion efficiency sufficient in each model? Accordingly, the authors are encouraged to include quantitative data on gene deletion efficiency for both mouse models, as this information is critical for interpreting the observed phenotypic outcomes and validating the conclusions regarding gene function. Furthermore, to better assess the impact of Tgfbr2 and Piezo1 disruption, the authors should provide more comprehensive flow cytometry analyses and histological data for these mouse models. Given the apparent homogeneity of VSIG4⁺ macrophages (as shown by the authors themselves), bulk RNA-Seq of sorted Tgfbr2- and Piezo1-deficient VSIG4⁺ macrophages (or from TGFβ-treated animals) would offer valuable insights into both the effectiveness of gene deletion and the molecular pathways governed by TGFβ and PIEZO1 in lining macrophages.

      Minor points:

      • Consistent usage of Cx3cr1-GFP+ nomenclature (for instance: Fig. S1 legend "adult mouse synovial tissue, showing PDGFRα⁺ fibroblasts (yellow) and CX3CR1-GFP⁺ cells (cyan)." versus Fig. 1 legend "Automated spot detection highlights Cx3cr1-GFP⁺ macrophages)"
      • Unclear Fig. 3 legend: "Representative immunofluorescence images of synovial tissue from Clec9aCre:Rosa26lsl-tdT mice at 3 weeks and in adulthood, showing and tdTomato (yellow) and stained for DAPI (blue), VSIG4 (cyan)" Check 'showing and tdTomato.'
      • For greater clarity, it would have been helpful if the transcript names had been directly included within Figures 3C, S3A, and S3C.
      • Page 24: "(Mki67CreERT2:Rosa26lsl-tdT)" Last bracket not superscript.
      • Page 25: "we again leveraged our scRNAsequencing dataset" Missing punctuation.
      • Page 27: Fig. 5C legend: " of synovial tissue of 1 week-old, 3 weeks-old and adult mice." Please specify and change to 'adult Csf1rΔFIRE/ΔFIRE mice'.
      • Page 30: The outcome observed in the Acta1-rtTA:tetO-Cre:ChR2-V5fl mouse model appears to be inconclusive: "This approach resulted in an increased density of VSIG4+ and total (F4/80+) macrophages in the exposed leg of some 5 days-old pups, but others showed the opposite trend (Figure S5D)." This variability may reflect low efficiency of the model or other technical limitations (e.g. muscle contractions frequency or time point of analysis). Given this ambiguity, it is worth reconsidering whether the data are sufficiently robust to warrant inclusion. Should the authors choose to include these findings, further experimentation of appropriate depth and precision is required to allow a conclusive interpretation (either it increases the density of VSIG4+ macrophages or not). The same applies to the Yoda1-treated mice, for which additional data are needed to determine whether VSIG4⁺ macrophage density is truly affected.

      Significance

      • General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed?

      This is a well-designed study that uses cutting-edge methodologies to investigate the developmental trajectory of synovial lining macrophages under homeostatic conditions. The authors present robust experimental evidence and compelling interpretations concerning synovial macrophage origin, which are both well-substantiated and impactful. Nonetheless, from the reviewer's perspective, the section exploring the molecular mechanisms underlying macrophage differentiation is comparatively less convincing. This section appears somewhat underdeveloped, as some of the mechanistic claims lack sufficient depth and experimental rigor to fully substantiate the conclusions. - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field:

      In contrast to earlier studies (PMID: 31391580, 32601335), the inclusion of fate-mapping experiments adds an important dimension, offering novel insight into the ontogeny of synovial macrophages. This expanded perspective may prove particularly valuable in advancing our understanding of joint immunology, especially regarding the local origins and lineage relationships of macrophage populations. Furthermore, the authors present novel insights into the molecular pathways underlying the differentiation and development of synovial lining macrophages. By demonstrating previously unrecognized regulatory mechanisms, this work significantly deepens our understanding of the cellular and transcriptional programs that drive macrophage specialization within the joint microenvironment. -Place the work in the context of the existing literature (provide references, where appropriate):

      This study builds upon previous work characterizing the macrophage compartment in the joint (PMID: 31391580, 32601335), yet provides a substantially more comprehensive dataset that spans multiple developmental time points and data on the origin of this specialized macrophage subset. - State what audience might be interested in and influenced by the reported findings:

      Immunologist, clinicians - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      This study falls well within the scope of the reviewer's expertise in innate immunity.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02879 Corresponding author(s): Matteo Allegretti; Alia dos Santos

      1. General Statements

      In this study, we investigated the effects of paclitaxel on both healthy and cancerous cells, focusing on alterations in nuclear architecture. Our novel findings show that:

      • Paclitaxel-induced microtubule reorganisation during interphase alters the perinuclear distribution of actin and vimentin. The formation of extensive microtubule bundles, in paclitaxel or following GFP-Tau overexpression, coincides with nuclear shape deformation, loss of regulation of nuclear envelope spacing, and alteration of the nuclear lamina.

      • Paclitaxel treatment reduces Lamin A/C protein levels via a SUN2-dependent mechanism. SUN2, which links the lamina to the cytoskeleton, undergoes ubiquitination and consequent degradation following paclitaxel exposure.

      • Lamin A/C expression, frequently dysregulated in cancer cells, is a key determinant of cellular sensitivity to, and recovery from, paclitaxel treatment.

      Collectively, our data support a model in which paclitaxel disrupts nuclear architecture through two mechanisms: (i) aberrant nuclear-cytoskeletal coupling during interphase, and (ii) multimicronucleation following defective mitotic exit. This represents an additional mode of action for paclitaxel beyond its well-established mechanism of mitotic arrest.

      We thank the reviewers for their time and constructive feedback. We have carefully considered all comments and have carried out a full revision. The updated manuscript now includes additional data showing:

      • Overexpression of microtubule-associated protein Tau causes similar nuclear aberration phenotypes to paclitaxel. This supports our hypothesis that increased microtubule bundling directly leads to nuclear disruption in paclitaxel during interphase.

      • Paclitaxel's effects on nuclear shape and Lamin A/C and SUN2 expression levels occur independently of cell division.

      • Reduced levels of Lamin A/C and SUN2 upon paclitaxel treatment occur at the protein level via ubiquitination of SUN2.

      • The effects of paclitaxel on the nucleus are conserved in breast cancer cells.

      Full Revision

      We have also edited our text and added further detail to clarify points raised by the reviewers. We believe that our revised manuscript is overall more complete, solid and compelling thanks to the reviewers' comments.

      1. Point-by-point description of the revisions

      Reviewer #1 Evidence, reproducibility and clarity

      This description of the down-regulation of the expression of lamin A/C upon treatment with paclitaxel and its sensitivity to SUN2 is quite interesting but still somehow preliminary. It is unclear whether this effect involves the regulation of gene expression, or of the stability of the proteins. How SUN2 mediates this effect is still unknown.

      We thank the reviewer for this valuable comment. To elucidate the mechanism behind the decrease in Lamin A/C and SUN2 levels, we have now performed several additional experiments. First, we performed RT-qPCR to quantify mRNA levels of these genes, relative to the housekeeping gene GAPDH (Supplementary Figure 3B and O). The levels of SUN2 and LMNA mRNA remained the same between control and paclitaxel-treated cells, indicating that this effect instead occurs at the protein level. We have also tested post-translational modifications as a potential regulatory mechanism for Lamin A/C and SUN2. In addition to the phosphorylation of Ser404 which we had already tested (Supplementary Figure 3C), we have now included additional Phos-tag gel and Western blotting data showing that the overall phosphorylation status of Lamin A/C is not affected by paclitaxel (Supplementary Figure 3E and F). We also pulled-down Lamin A/C from cell lysates and then Western blotted for polyubiquitin and acetyl-lysine, which showed that the ubiquitination and acetylation states of Lamin A/C are also not affected by paclitaxel (Supplementary Figure 3G-I). However, Western blots for polyubiquitin of SUN2 pulled down from cell lysates showed that paclitaxel treatment results in significant SUN2 ubiquitination (Figure 3M and N). Therefore, we propose that the downregulation of SUN2 following paclitaxel treatment occurs by ubiquitin-mediated proteolysis.

      The roles of free tubulins and polymerized microtubules, and thus the potential role of paclitaxel, need to be uncovered.

      We addressed this important point by using an alternative method to stabilise/bundle microtubules in interphase, namely by overexpressing GFP-Tau, as suggested by reviewer 2. Following GFP- Tau overexpression, large microtubule bundles were observed throughout the cytoplasm (Figure 4A), and this resulted in a significant decrease in nuclear solidity (Figure 4B). Furthermore, in cells where microtubule bundles extensively contacted the nucleus, the nuclear lamina became unevenly distributed and appeared patchy (Figure 4C). This supports our hypothesis that the aberrations to nuclear shape and Lamin A/C localisation in paclitaxel-treated cells are due to the presence of microtubules bundles surrounding the nucleus.

      The doses of paclitaxel at which occur the effects described in the paper are not fully consistent with all the conclusions. Most experiments have been done at 5 nM. However, at this dose the effect of lamin A/C over or down expression on the growth (differences in the slopes of the curves in Figure 4A) are not fully convincing and not fully consistent with the clear effect on viability as well (in addition, duration of treatments before assessing vialbility are not specified). At 1 nM, cell growth is reduced and the rescuing effect of lamin over-expression is much more clear (Fig 4A), and the nucleus deformation clear (Fig 2A) but this dose has no effect on lamin A/C expression (Fig 3C), which questions how lamins impact nucleus shape and cell survival. Cytoskeleton reorganisation in these conditions is not described although it could clarify the respective role of force production (suggested in figure 1) and nuclei resistance (shown in figure 2) in paclitaxel sensitivity.

      We thank the reviewer for raising this important point. We have addressed this by conducting additional repeats for the cell confluency measurements to increase the statistical power of our experiments (Figure 5A). Our data now show that GFP-lamin A/C had a statistically significant effect on rescuing cell growth at both 1 nM and 5 nM paclitaxel, while Lamin A/C knockdown exacerbated the inhibition of cell growth at 5 nM paclitaxel but not 1 nM paclitaxel (Figure 5A). In addition, we note that the duration of paclitaxel treatment before assessing viability was specified in the figure legend: "Bar graph comparing cell viability between wild-type (red), GFP-Lamin A/C overexpression (green), and Lamin A/C knockdown (blue) cells following 20 h incubation in 0, 1, 5, or 10 nM paclitaxel." We also repeated cell viability analysis after 48 h incubation in paclitaxel instead of 20 h to allow for a longer time for differences to take effect (Figure 5B).

      We also added figures showing the cytoskeletal reorganisation at both 1 and 10 nM in addition to 0 and 5 nM (Supplementary Figure 1A) showing that microtubule bundling and condensation of actin into puncta correlated with increased paclitaxel concentration. Vimentin colocalised well with microtubules at all concentrations.

      We have also included in our results section further clarification for the use of 5nM paclitaxel in this study. The new section reads as follows: "Experiments were performed at 5 nM paclitaxel (with additional experiments to determine dose relationships at 1 and 10 nM) because this aligns with previous studies7,14,24. Furthermore, previous analysis of patient plasma reveals that typical concentrations are within the low nanomolar range8, and concentrations of 5-10 nM are required in cell culture to reach the same intracellular concentrations observed in vivo in patient tumours9. This aligns with in vitro cytotoxic studies of paclitaxel in eight human tumour cell lines which show that paclitaxel's IC50 ranges between 2.5 and 7.5 nM41."

      Finally, although the absence of role of mitotic arrest is clear from the data, the defective reorganisation of the nucleus after mitosis still suggest that the effect of paclitaxel is not independent of mitosis.

      We thank the reviewer for pointing out the need for clarification in the wording of our manuscript. We have reworded the title and relevant sections of our abstract, introduction, and discussion to make it clearer that the effects of paclitaxel on the nucleus are due to a combination of aberrant nuclear cytoskeletal coupling during interphase and multimicronucleation following mitotic slippage. We have also added additional data in support of the effect of paclitaxel on nuclear architecture during interphase. For this, we used serum-starved cells (which divide only very slowly such that the majority of cells do not pass through mitosis during the 16 h incubation in paclitaxel [Supplementary Figure 2D]). Our new data confirmed that paclitaxel's effects on nuclear solidity, and Lamin A/C and SUN2 proteins levels can occur independently of cell division (Figure 2C; Figure 3H-J). Finally, when we overexpressed GFP-Tau (as discussed above) we observed similar aberrations to nuclear solidity and Lamin A/C localisation. This indicates that these effects occur due to microtubule bundling in interphase, especially as in our study GFP-Tau did not lead to multimicronucleation or appear to affect mitosis (Figure 4).

      Below are the main changes to the text regarding the interphase effect of paclitaxel:

      • Title: "Paclitaxel compromises nuclear integrity in interphase through SUN2-mediated cytoskeletal coupling"

      • Abstract: "Overall, our data supports nuclear architecture disruption, caused by both aberrant nuclear-cytoskeletal coupling during interphase and exit from defective mitosis, as an additional mechanism for paclitaxel beyond mitotic arrest."

      • Introduction: "Here we propose that cancer cells have increased vulnerability to paclitaxel both during interphase and following aberrant mitosis due to pre-existing defects in their NE and nuclear lamina."

      • Discussion: "Overall, our work builds on previous studies investigating loss of nuclear integrity as an anti-cancer mechanism of paclitaxel separate from mitotic arrest14,20,21. We propose that cancer cells show increased sensitivity to nuclear deformation induced by aberrant nuclear-cytoskeletal coupling and multimicronucleation following mitotic slippage. Therefore, we conclude that paclitaxel functions in interphase as well as mitosis, elucidating how slowly growing tumours are targeted."

      minor: a more thorough introduction of known data about dose response of cells in culture and in vivo would help understanding the range of concentrations used in this study.

      As mentioned above, we have now included additional information in our Results section to clarify our paclitaxel dose range: "Experiments were performed at 5 nM paclitaxel (with additional experiments to determine dose relationships at 1 and 10 nM) because this aligns with previous studies7,14,24. Furthermore, previous analysis of patient plasma reveals that typical concentrations are within the low nanomolar range8, and concentrations of 5-10 nM are required in cell culture to reach the same intracellular concentrations observed in vivo in patient tumours9. This aligns with in vitro cytotoxic studies of paclitaxel in eight human tumour cell lines which show that paclitaxel's IC50 ranges between 2.5 and 7.5 nM41."

      Significance

      In this manuscript, Hale and colleagues describe the effect of paclitaxel on nucleus deformation and cell survival. They showed that 5nM of paclitaxel induces nucleus fragmentation, cytoskeleton reorganisation, reduced expression of LaminA/C and SUN2, and reduced cell growth and viability. They also showed that these effects could be at least partly compensated by the over-expression of lamin A/C. As fairly acknowledged by the authors, the induction of nuclear deformation in paclitaxel-treated cells, and the increased sensitivity to paclitaxel of cells expressing low level of lamin A/C are not novel (reference #14). Here the authors provided more details on the cytoskeleton changes and nuclear membrane deformation upon paclitaxel treatment. The effect of lamin A/C over and down expression on cell growth and survival are not fully convincing, as further discussed below. The most novel part is the observation that paclitaxel can induce the down-regulation of the expression of lamin A/C and that this effect is mediated by SUN2.

      We appreciate the reviewer's summary and thank them for their time. We believe our comprehensive revisions have addressed all comments, strengthening the manuscript and making it more robust and compelling.

      Reviewer #2 Evidence, reproducibility and clarity This study investigates the effects of the chemotherapeutic drug paclitaxel on nuclear-cytoskeletal coupling during interphase, claiming a novel mechanism for its anti-cancer activity. The study uses hTERT-immortalized human fibroblasts. After paclitaxel exposure, a suite of state- of-the-art imaging modalities visualizes changes in the cytoskeleton and nuclear architecture. These include STORM imaging and a large number of FIB-SEM tomograms.

      We thank the reviewer for the summary and for highlighting our efforts in using the latest imaging technical advances.

      Major comments:

      The authors make a major claim that in addition to the somewhat well-described mechanism of paclitaxel on mitosis, they have discovered 'an alternative, poorly characterised mechanism in interphase'.

      However, none of the data proves that the effects shown are independent of mitosis. To the contrary, measurements are presented 48 hours after paclitaxel treatment starts, after which it can be assumed that 100% of cells have completed at least one mitotic event. The appearance of micronuclei evidences this, as discussed by the authors shortly. It looks like most of the results shown are based on botched mitosis or, more specifically, errors on nuclear assembly upon exit from mitosis rather than a specific effect of paclitaxel on interphase. The readouts the authors show just happen to be measurements while the cells are in interphase.

      Alternative hypotheses are missing throughout the manuscript, and so are critical controls and interpretations.

      We thank the reviewer for highlighting the lack of clarity in our wording. We have revised the title, abstract and relevant sections of the introduction and discussion to clarify our message that the effects of paclitaxel on the nucleus arise from a combination of aberrant nuclear-cytoskeletal coupling during interphase and multimicronucleation following exit from defective mitosis. We have also included additional data where we used slow-dividing, serum-starved cells (under these conditions, the majority of cells do not undergo mitosis during the 16 h incubation in paclitaxel [Supplementary Figure 2D]). Our new data show that even in these cells there is a clear effect of paclitaxel on nuclear solidity, and Lamin A/C and SUN2 protein levels, further supporting our hypothesis that these phenotypes can occur independently of cell division (Figure 2C; Figure 3H-J). Furthermore, we performed additional experiments where we used overexpression of GFP-Tau as an alternative method of stabilising microtubules in interphase and observed similar aberrations to nuclear solidity and Lamin A/C localisation. As GFP-Tau overexpression did not lead to micronucleation or appear to affect mitosis, these data support the hypothesis that nuclear aberrations occur due to microtubule bundling in interphase (Figure 4). We discuss these experiments in more detail below. Finally, we have reworded the introduction to better introduce alternative hypotheses and mechanisms for paclitaxel's activity.

      The authors claim that 'Previously, the anti-cancer activity of paclitaxel was thought to rely mostly on the activation of the mitotic checkpoint through disruption of microtubule dynamics, ultimately resulting in apoptosis.' The authors may have overlooked much of the existing literature on the topic, including many recent manuscripts from Xiang-Xi Xu's and another lab.

      We would like to note that the paper from Xiang-Xi Xu's lab (Smith et al, 2021) was cited in our original manuscript (reference 14 in both the original and revised manuscripts). We have now also included additional review articles from the Xiang-Xi Xu lab (PMID:36368286 20 and PMID: 35048083 21). Furthermore, we have clarified the wording in both the introduction and discussion to better reflect the current understanding of paclitaxel's mechanism and alternative hypotheses.

      The data, e.g. in Figure 1, does not hold up to the first alternative hypothesis, e.g. that paclitaxel stabilizes microtubules and that excessive mechanical bundling of microtubules induces major changes to cell shape and mechanical stress on the nucleus. Even the simplest controls for this effect (the application of an alternative MT stabilizing drug or the overexpression of an MT stabilizer, e.g., tau).

      We thank the reviewer for suggesting this control experiment using the microtubule stabiliser Tau. We have now included these experiments in the revised version of the manuscript (Figure 4). The overexpression of GFP-Tau supports our hypothesis that cytoskeletal reorganisation in paclitaxel exerts mechanical stress on the nucleus during interphase, resulting in nuclear deformation and aberrations to the nuclear lamina. In particular, GFP-Tau overexpression resulted in large microtubule bundles throughout the cytoplasm (Figure 4A). Notably, in cells where these bundles extensively contacted the nucleus, we observed a significant decrease in nuclear solidity (Figure 4B) accompanied by changes in nuclear lamina organisation, including a patchy lamina phenotype, similar to that induced by paclitaxel (Figure 4C).

      The focus on nuclear lamina seems somewhat arbitrary and adjacent to previously published work by other groups. What would happen if the authors stained for focal adhesion markers? There would probably be a major change in number and distribution. Would the authors conclude that paclitaxel exerts a specific effect on focal adhesions? Or would the conclusion be that microtubule stabilization and the following mechanical disruption induce pleiotropic effects in cells? Which effects are significant for paclitaxel function on cancer cells?

      We thank the reviewer for raising important points regarding the specificity of paclitaxel's effects. We agree that microtubule stabilisation can induce myriad cellular changes, including alterations to focal adhesions and other cytoskeletal components. Our focus on Lamin A/C and nuclear morphology is grounded both in the established clinical relevance of nuclear mechanics in cancer and builds on mechanistic work from other groups.

      Lamin A/C expression is commonly altered in cancer, and nuclear morphology is frequently used in cancer diagnosis35. Lamin A/C also plays a crucial role in regulating nuclear mechanics32 and, importantly, determines cell sensitivity to paclitaxel14. However, the mechanism by which Lamin A/C determines sensitivity of cancer cells to paclitaxel is unclear.

      Our data are consistent with Lamin A/C being a determinant of paclitaxel survival sensitivity. We also provide evidence that paclitaxel itself reduces Lamin A/C protein levels and disrupts its organisation at the nuclear envelope. We directly link these effects to microtubule bundling around the nucleus and degradation of force-sensing LINC component SUN2, highlighting the importance of nuclear architecture and mechanics to overall cellular function. Furthermore, we show that recovery from paclitaxel treatment depends on Lamin A/C expression levels. This has clinical relevance, as unlike cancer cells, healthy tissue with non-aberrant lamina would be able to selectively recover from paclitaxel treatment.

      Minor comments:

      While I understand the difficulty of the experiments and the effort the authors have put into producing FIB-SEM tomograms, I am not sure they are helping their study or adding anything beyond the light microscopy images. Some of the images may even be in the way, such as supplementary Figure 6, which lacks in quality, controls, and interpretation. Do I see a lot of mitochondria in that slice?

      We agree with the reviewer that Supplementary Figure 6 does not add significant value to the manuscript and thank the reviewer for pointing this out. We have removed it from the manuscript accordingly.

      I may have overlooked it, but has the number of cells from which lamellae have been produced been stated?

      We thank the reviewer for pointing out the missing information. For our cryo-ET experiments, we collected data from 9 lamellae from paclitaxel-treated cells and 6 lamellae from control cells, with each lamella derived from a single cell. This information has now been added to the figure legend (Figure 2F).

      Significance

      The significance of studying the effect of paclitaxel, the most successful chemotherapy drug, should be broad and of interest to basic researchers and clinicians.

      As outlined above, I believe that major concerns about the design and interpretation of the study hamper its significance and advancements.

      We appreciate the reviewer's concerns and have performed major revisions to strengthen the significance of our study. Specifically, we conducted two key sets of experiments to validate our original conclusions: serum starvation to control for the effects of cell division, and overexpression of the microtubule stabiliser Tau to demonstrate that paclitaxel can affect the nucleus via its microtubule bundling activity in interphase.

      By elucidating the mechanistic link between microtubule stabilisation and nuclear-cytoskeletal coupling, our findings contribute to our understanding of paclitaxel's multifaceted actions in cancer cells.

      My areas of expertise could be broadly defined as Cell Biology, Cytoskeleton, Microtubules, and Structural Biology.

      Reviewer #3 Evidence, reproducibility and clarity The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years.

      We thank the reviewer for the positive feedback.

      Although similar ideas are published, which may be suitable to be cited? • Paclitaxel resistance related to nuclear envelope structural sturdiness. Smith ER, Wang JQ, Yang DH, Xu XX. Drug Resist Updat. 2022 Dec;65:100881. doi: 10.1016/j.drup.2022.100881. Epub 2022 Oct 15. PMID: 36368286 Review. • Breaking malignant nuclei as a non-mitotic mechanism of taxol/paclitaxel. Smith ER, Xu XX. J Cancer Biol. 2021;2(4):86-93. doi: 10.46439/cancerbiology.2.031. PMID: 35048083 Free PMC article.

      We thank the reviewer for bringing to our attention these important review articles. In our initial manuscript, we only cited the original paper (14, also reference 14 in the original manuscript). We have now included citations to the suggested publications (20,21).

      We would also like to emphasise how our manuscript distinguishes itself from the work of Smith et al.14,20,21:

      • Cell-type focus: In their study 14, Smith et al. examined the effect of paclitaxel on malignant ovarian cancer cells and proposed that paclitaxel's effects on the nucleus are limited to cancer cells. However, our data extends these findings by demonstrating paclitaxel's effects in both cancerous and non-cancerous backgrounds.

      • Cytoskeletal reorganisation: Smith et al. show reorganisation of microtubules in paclitaxel-treated cells14. Our data show re-organisation of other cytoskeletal components, including F-actin and vimentin.

      • Multimicronucleation: Smith et al. propose that paclitaxel-induced multimicronucleation occurs independently of cell division14. Although we observe progressive nuclear abnormalities during interphase over the course of paclitaxel treatment, our data do not support this conclusion; we find that multimicronucleation occurs only following mitosis.

      • Direct link between microtubule bundling and nuclear aberrations: We show that nuclear aberrations caused by paclitaxel during interphase (distinct from multimicronucleation) are directly linked to microtubule bundling around the nucleus, suggesting they result from mechanical disruption and altered force propagation.

      • Lamin A/C regulation: Consistent with Smith et al.14, we show that Lamin A/C depletion leads to increased sensitivity to paclitaxel treatment. However, we further demonstrate that paclitaxel itself leads to reduced levels of Lamin A/C and that this effect occurs independently of mitosis and is mediated via force-sensing LINC component SUN2. Upon SUN2 knockdown, Lamin A/C levels are no longer affected by paclitaxel treatment.

      • Recovery: Finally, our work reveals that cells expressing low levels of Lamin A/C recover less efficiently after paclitaxel removal. This might help explain how cancer cells could be more susceptible to paclitaxel.

      Only one cell line was used in all the experiments? "Human telomerase reverse transcriptase (hTERT) immortalised human fibroblasts" ? The cells used are not very relevant to cancer cells (carcinomas) that are treated with paclitaxel. It is not clear if the observations and conclusions will be able to be generalized to cancer cells.

      We thank the reviewer for this comment. Our initial study aimed to understand the effects of paclitaxel on nuclear architecture in non-aberrant backgrounds. To show that the observed effects of paclitaxel are also applicable to cancer cells, we have now repeated our main experiments using MDA-MB-231 human breast cancer cells (Supplementary Figure 1B; Supplementary Figure 3P-T). Similar to our findings in human fibroblasts, paclitaxel treatment of MDA-MB-231 led to cytoskeletal reorganisation (Supplementary Figure 1B), a decrease in nuclear solidity (Supplementary Figure 3P), aberrant (patchy) localisation of Lamin A/C (Supplementary Figure 3Q), and a reduction in Lamin A/C and SUN2 levels (Supplementary Figure 3R-T).

      "Fig. 1. (B) STORM imaging of α-tubulin immunofluorescence in cells fixed after 16 h incubation in control media or 5 nM paclitaxel. Lower panels show α-tubulin clusters generated with HDBSCAN analysis. Scale bars = 10 μm." It needs explanation of what is meaning of the different color lines in the lower panels, just different filaments?

      We have added further detail to the figure legend for clarification: "Lower panels show α-tubulin clusters generated with HDBSCAN analysis. Different colours distinguish individual α-tubulin clusters, representing individual microtubule filaments or filament bundles."

      Generally, the figures need additional description to be clear.

      We have added further clarification and detail to our figure legends.

      "Figure 3 - Paclitaxel results in aberrations to the nuclear lamina." The sentence seems not to be well constructed. "Paclitaxel treatment causes ..."?

      We changed this sentence to: "Figure 3 - Paclitaxel treatment results in aberrant organisation of the nuclear lamina and decreased Lamin A/C levels via SUN2."

      Lamin A and C levels are different in different images (Fig. 3B, H): some Lamin A is higher, and sometime Lamin C is higher? This may possibly due to culture condition or subtle difference in sample handling?.

      We thank the reviewer for pointing this out and we agree that the ratio of Lamin A to Lamin C can vary with culture conditions. To confirm that paclitaxel treatment reduces total Lamin A/C levels regardless of this ratio, we repeated the Western blot analysis in three additional biological replicates using cells in which Lamin C levels exceeded Lamin A levels. These experiments confirmed a comparable decrease in total Lamin A/C levels. Figure 3B and 3C have been updated accordingly.

      Also, the effect on Lamin A/C and SUN2 levels are not significant of robust.

      Decreased Lamin A/C and SUN2 levels following paclitaxel treatment were consistently seen across three or more biological repeats (Figure 3B-C), and this could be replicated in a different cell type (MDA-MB-231) (Supplementary Figure 3R-T). Furthermore, Western blotting results are consistent with the patchy Lamin A/C distribution observed using confocal and STORM following paclitaxel treatment (Figure 3A; Supplementary Figure 3A), where Lamin A/C appears to be absent from discrete areas of the lamina.

      Any mechanisms are speculated for the reason for the reduction?

      We have now included additional data which aims to shed light on the mechanism behind the decrease in Lamin A/C and SUN2 levels following paclitaxel treatment. We found that SUN2 is selectively degraded during paclitaxel treatment. Immunoprecipitation of SUN2 followed by Western blotting against Polyubiquitin C showed increased SUN2 ubiquitination in paclitaxel (Figure 3M and N). Furthermore, in our original manuscript, we showed that Lamina A/C levels remained unaltered during paclitaxel treatment in cells where SUN2 had been knocked down. We propose that changes in microtubule organisation affect force propagation to Lamin A/C specifically via SUN2 and that this leads to Lamina A/C removal and depletion. Future work will be needed to fully understand this mechanism.

      In addition to the findings described above, we report no significant changes in mRNA levels for LMNA or SUN2 in paclitaxel (Supplementary Figure 3B and O). Phos-tag gels followed by Western blotting analysis for Lamin A/C also did not detect changes to the overall phosphorylation status of Lamin A/C due to paclitaxel treatment. This is in agreement with our initial data showing no changes to Lamin A/C Ser 404 phosphorylation levels (Supplementary Figure 3E and F). Finally, Lamin A/C immunoprecipitation experiments followed by Western blotting for Polyubiquitin C and acetyl-lysine showed no significant changes in the ubiquitination and acetylation state of Lamin A/C in paclitaxel-treated cells (Supplementary Figure 3G-I).

      Also, the about 50% reduction in protein level is difficult to be convincing as an explanation of nuclear disruption.

      The nuclear lamina and LINC complex proteins play a critical role in regulating nuclear integrity, stiffness and mechanical responsiveness to external forces28,31-33,54,75, as well as in maintaining the nuclear intermembrane distance69,74. In particular, SUN-domain proteins physically bridge the nuclear lamina to the cytoskeleton through interactions with Nesprins, thereby preserving the perinuclear space distance30,69,74. Mutations in Lamins have been shown to disrupt chromatin organization, alter gene expression, and compromise nuclear structural integrity, and experiments with LMNA knockout cells reveal that nuclear mechanical fragility is closely coupled to nuclear deformation47. Furthermore, nuclear-cytoskeletal coupling is essential during processes such as cell migration, where cells undergo stretching and compression of the nucleus; weakening or loss of the lamina in such cases compromises cell movement47,73. In our work, we show that alterations to nuclear Lamin A/C and SUN2 by paclitaxel treatment coincide with nuclear deformations (Figure 2A-D, F, G; Figure 3A-D, F, G; Supplementary Figure 3A, P-T) and that these deformations are reversible following paclitaxel removal (Supplementary Figure 4B-D). Our experiments also demonstrate that Lamin A/C expression levels significantly influence cell growth, cell viability, and cell recovery in paclitaxel (Figure 5). Therefore, drawing on current literature and our results, we propose that, during interphase, paclitaxel induces severe nuclear aberrations through the combined effects of: i) increased cytoskeletal forces on the NE caused by microtubule bundling; ii) loss of ~50% Lamin A/C and SUN2; iii) reorganisation of nucleo-cytoskeletal components.

      Significance

      The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years.

      The data may be improved to provide stronger support.

      Additional cell lines (of cancer or epithelial origin) may be repeated to confirm the generality of the observation and conclusions.?

      We thank the reviewer for the feedback and valuable suggestions. In response, we have included experiments using human breast cancer cell line MDA-MB-231 to further corroborate our findings and interpretations. We believe these additions have improved the clarity, robustness and impact of our manuscript, and we are grateful for the reviewer's contributions to its improvement.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years. Although similar ideas are published, which may be suitable to be cited?

      • Paclitaxel resistance related to nuclear envelope structural sturdiness. Smith ER, Wang JQ, Yang DH, Xu XX. Drug Resist Updat. 2022 Dec;65:100881. doi: 10.1016/j.drup.2022.100881. Epub 2022 Oct 15. PMID: 36368286 Review.
      • Breaking malignant nuclei as a non-mitotic mechanism of taxol/paclitaxel. Smith ER, Xu XX. J Cancer Biol. 2021;2(4):86-93. doi: 10.46439/cancerbiology.2.031. PMID: 35048083 Free PMC article.

      Only one cell line was used in all the experiments? "Human telomerase reverse transcriptase (hTERT) immortalised human fibroblasts" ? The cells used are not very relevant to cancer cells (carcinomas) that are treated with paclitaxel. It is not clear if the observations and conclusions will be able to be generalized to cancer cells.

      "Fig. 1. (B) STORM imaging of α-tubulin immunofluorescence in cells fixed after 16 h incubation in control media or 5 nM paclitaxel. Lower panels show α-tubulin clusters generated with HDBSCAN analysis. Scale bars = 10 μm." It needs explanation of what is meaning of the different color lines in the lower panels, just different filaments?

      Generally, the figures need additional description to be clear.

      "Figure 3 - Paclitaxel results in aberrations to the nuclear lamina." The sentence seems not to be well constructed. "Paclitaxel treatment causes ..."?

      Lamin A and C levels are different in different images (Fig. 3B, H): some Lamin A is higher, and sometime Lamin C is higher? This may possibly due to culture condition or subtle difference in sample handling?. Also, the effect on Lamin A/C and SUN2 levels are not significant of robust. Any mechanisms are speculated for the reason for the reduction? Also, the about 50% reduction in protein level is difficult to be convincing as an explanation of nuclear disruption.

      Significance

      The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years.

      The data may be improved to provide stronger support.

      Additional cell lines (of cancer or epithelial origin) may be repeated to confirm the generality of the observation and conclusions.?

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study investigates the effects of the chemotherapeutic drug paclitaxel on nuclear-cytoskeletal coupling during interphase, claiming a novel mechanism for its anti-cancer activity. The study uses hTERT-immortalized human fibroblasts. After paclitaxel exposure, a suite of state-of-the-art imaging modalities visualizes changes in the cytoskeleton and nuclear architecture. These include STORM imaging and a large number of FIB-SEM tomograms.

      Major comments:

      The authors make a major claim that in addition to the somewhat well-described mechanism of paclitaxel on mitosis, they have discovered 'an alternative, poorly characterised mechanism in interphase'.

      However, none of the data proves that the effects shown are independent of mitosis. To the contrary, measurements are presented 48 hours after paclitaxel treatment starts, after which it can be assumed that 100% of cells have completed at least one mitotic event. The appearance of micronuclei evidences this, as discussed by the authors shortly. It looks like most of the results shown are based on botched mitosis or, more specifically, errors on nuclear assembly upon exit from mitosis rather than a specific effect of paclitaxel on interphase. The readouts the authors show just happen to be measurements while the cells are in interphase.

      Alternative hypotheses are missing throughout the manuscript, and so are critical controls and interpretations.

      The authors claim that 'Previously, the anti-cancer activity of paclitaxel was thought to rely mostly on the activation of the mitotic checkpoint through disruption of microtubule dynamics, ultimately resulting in apoptosis.' The authors may have overlooked much of the existing literature on the topic, including many recent manuscripts from Xiang-Xi Xu's and another lab.

      The data, e.g. in Figure 1, does not hold up to the first alternative hypothesis, e.g. that paclitaxel stabilizes microtubules and that excessive mechanical bundling of microtubules induces major changes to cell shape and mechanical stress on the nucleus. Even the simplest controls for this effect (the application of an alternative MT stabilizing drug or the overexpression of an MT stabilizer, e.g., tau).

      The focus on nuclear lamina seems somewhat arbitrary and adjacent to previously published work by other groups. What would happen if the authors stained for focal adhesion markers? There would probably be a major change in number and distribution. Would the authors conclude that paclitaxel exerts a specific effect on focal adhesions? Or would the conclusion be that microtubule stabilization and the following mechanical disruption induce pleiotropic effects in cells? Which effects are significant for paclitaxel function on cancer cells?

      Minor comments:

      While I understand the difficulty of the experiments and the effort the authors have put into producing FIB-SEM tomograms, I am not sure they are helping their study or adding anything beyond the light microscopy images. Some of the images may even be in the way, such as supplementary Figure 6, which lacks in quality, controls, and interpretation. Do I see a lot of mitochondria in that slice?

      I may have overlooked it, but has the number of cells from which lamellae have been produced been stated?

      Significance

      The significance of studying the effect of paclitaxel, the most successful chemotherapy drug, should be broad and of interest to basic researchers and clinicians.

      As outlined above, I believe that major concerns about the design and interpretation of the study hamper its significance and advancements.

      My areas of expertise could be broadly defined as Cell Biology, Cytoskeleton, Microtubules, and Structural Biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This description of the down-regulation of the expression of lamin A/C upon treatment with paclitaxel and its sensitivity to SUN2 is quite interesting but still somehow preliminary. It is unclear whether this effect involves the regulation of gene expression, or of the stability of the proteins. How SUN2 mediates this effect is still unknown. The roles of free tubulins and polymerized microtubules, and thus the potential role of paclitaxel, need to be uncovered.

      The doses of paclitaxel at which occur the effects described in the paper are not fully consistent with all the conclusions. Most experiments have been done at 5 nM. However, at this dose the effect of lamin A/C over or down expression on the growth (differences in the slopes of the curves in Figure 4A) are not fully convincing and not fully consistent with the clear effect on viability as well (in addition, duration of treatments before assessing vialbility are not specified). At 1 nM, cell growth is reduced and the rescuing effect of lamin over-expression is much more clear (Fig 4A), and the nucleus deformation clear (Fig 2A) but this dose has no effect on lamin A/C expression (Fig 3C), which questions how lamins impact nucleus shape and cell survival. Cytoskeleton reorganisation in these conditions is not described although it could clarify the respective role of force production (suggested in figure 1) and nuclei resistance (shown in figure 2) in paclitaxel sensitivity.

      Finally, although the absence of role of mitotic arrest is clear from the data, the defective reorganisation of the nucleus after mitosis still suggest that the effect of paclitaxel is not independent of mitosis.

      minor: a more thorough introduction of known data about dose response of cells in culture and in vivo would help understanding the range of concentrations used in this study.

      Significance

      In this manuscript, Hale and colleagues describe the effect of paclitaxel on nucleus deformation and cell survival. They showed that 5nM of paclitaxel induces nucleus fragmentation, cytoskeleton reorganisation, reduced expression of LaminA/C and SUN2, and reduced cell growth and viability. They also showed that these effects could be at least partly compensated by the over-expression of lamin A/C. As fairly acknowledged by the authors, the induction of nuclear deformation in paclitaxel-treated cells, and the increased sensitivity to paclitaxel of cells expressing low level of lamin A/C are not novel (reference #14). Here the authors provided more details on the cytoskeleton changes and nuclear membrane deformation upon paclitaxel treatment. The effect of lamin A/C over and down expression on cell growth and survival are not fully convincing, as further discussed below. The most novel part is the observation that paclitaxel can induce the down-regulation of the expression of lamin A/C and that this effect is mediated by SUN2.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      I have already provided a document with a point-by-point response. I do not wish to re-format all of the text again in this HTML box. The document I provided can be published as it is.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This study demonstrates an improved integral gene drive (IGD) for use in Anopheles gambiae. Inserting the coding sequence for Cas9 in-frame with a germline-specific gene (nanos) improved the performance of this IGD relative to previously reported systems while reducing fitness costs. Integration of the gRNA cassette within a synthetic intron is an elegant solution to constraining the minimal elements of the IGD within a single insertion. The results of this study found that while the IGD can be used to propagate anti-malarial effectors (MM-CP) within a population, fitness costs and resistance alleles were higher than anticipated, potentially limiting the application of this particular IGD design without further optimisation.

      The results comprehensively demonstrate the effective transmission and stability of the IGD over several generations, while also characterising the limitations of the system. Although I don't think the authors make any claims which are not supported by their results. It might be good to provide more of an explanation for how the performance of this IGD compares to the zpg IGD reported in Ellis et al 2022 for readers less familiar with the IGD literature.

      The manuscript is overall very well written with clear results and methods. However, I found the descriptions referring to the effects of the maternal, paternal, and even grandmaternal inheritance hard to follow. The statistical analysis and replications are adequate as well.

      Referee cross-commenting

      I agree with the other reviewer's comments regarding the need to clarify a few points made in the overall well written manuscript.

      Significance

      Gene drives are the most promising genetic biocontrol method for controlling the spread of malaria. However, there are many technical challenges that have made the development of gene drives quite difficult. This study works to address one such challenge - constraining the expression of Cas9 to the germline by integrating it within an endogenous loci rather than using semi-synthetic promoters. While IGD have been demonstrated before, this study further improves on their performance while reducing off-target effects.

      The manuscript is written for a highly specialized audience that is very familiar with the genetic biocontrol, and especially the gene-drive field of research.

      My fields of research include genetic biocontrol and insect synthetic biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors develop a complete integral drive system in Anopheles gambiae malaria mosquitoes. This type of gene drive is interesting, with special advantages and disadvantages compared to more common designs. Here, the authors develop the Cas9 element and combine it with a previously developed antimalaria effector element. The new element performs very well in terms of drive efficiency, but it has unintended fitness costs, and a higher than desirable rate of functional resistance allele formation. Nevertheless, this study represents a very good step forward toward developing effective gene drives and is thus of high impact.

      The format of the manuscript is a bit suboptimal for review. Please add line numbers next time for easy reference. It would also help to have spaces between paragraphs and to have figures (with legends) added to the text where they first appear.

      It might be useful to add subsections to the results, just like in the methods section. It could even be expanded a bit with some specific parts from the discussion, through this is optional.

      Abstract: The text says: "As a minimal genetic modification, nanosd does not induce widespread transcriptomic perturbations." However, it does seem to change things based on Figure 3c.

      Page 2: "drive technologies for public health and pest control applications" needs a period afterward.

      Page 2: "The fitness costs, homing efficiency, and resistance rate of the gene drive is" should be "The fitness costs, homing efficiency, and resistance rate of the gene drive are".

      Page 2: "When they target important mosquito genes, gene drives are designed to ensure that the nuclease activity window (germline) does not overlap with that of the target gene (somatic)." is note quite correct. This is, of course, sensible for suppression drives, but it's not a necessary requirement for modification drives with rescue elements in many situations.

      Page 2: "recessive somatic fitness cost phenotypes" is unclear. I think that you are trying to avoid the recessive fitness cost of null alleles becoming a dominant fitness cost from a gene drive allele (in drive-wild-type heterozygotes).

      Page 2: "This optimization approach has had only limited success, and suboptimal performance is commonly attributed to not capturing all the regulatory elements specific to the germline gene's expression9,12". I don't think this is correct. There are several examples where a new promoter helped a lot. The zpg promoter in Anopheles gambiae allowed success at the dsx site in suppression cage studies (Kyrou et al 2018), and nanos gave big improvement to modification drives at the cardinal locus (Carballer et al 2020). In flies, several promoters were tested, and one allowed success in cage experiments (Du et al 2024). In Aedes, the shu promoter allowed for high drive performance (Anderson et al 2023), though this last one hasn't been tested in more difficult situations. I think you could certainly argue in the general case that not all promoters will work the way their transcriptome says, but there are many examples where they seem to be pretty good.

      Page 2: "make it more likely that mutations that disrupt the drive components are selected against though loss of function of the host gene." I think that this needs a bit more explanation. You are referring to mutations in regulatory elements or frameshift mutations. This will make it more resistant to mutation. Also, these mutations would tend to have a minor effect expect perhaps in the cargo gene of a modification drive. By using a cargo gene in an integral drive, you could still keep it somewhat safer, but whether this is 1.2x or 10x safer is unclear.

      Page 3: "they can incur severe unintended fitness costs". This is central to integral drives and this manuscript. It's worth elaborating on.

      Page 3: "Regulatory elements from germline genes that have worked sub-optimally in traditional gene drive designs for the reasons outlined above may work well in an IDG design20." This is setting up the integral drive with nanos, but nanos DOES work well in traditional Anopheles gambiae gene drive designs. It is possible that you might end up with less somatic expression than Hammond et al 2020 (though the comparison is unclear due to batch effects in that study), but there is no direct comparison in this manuscript to that.

      Page 3: "This suggests an impact of maternal deposition on drive efficiency only in female drive carriers." This is quite strange. Usually, I would expect to see an equal reduction in efficiency between male and female progeny. Could this be due to limited sample size? Random idea: It's also possible that almost all maternal deposition was mosaic and wouldn't be enough to direct affect drive conversion. However, it could cause enough of a fitness cost TOGETHER with new drive expression in females that perhaps only tissues with randomly low expression rates properly developed and led to progeny, reducing drive inheritance? Another possibility: Could the drive/resistance males have impaired fertility, so these ones are underrepresented in the batch cross? If nanos is needed in males and a single drive copy is not quite enough for good fertility or mating competitiveness, they may be underrepresented in your crosses. They might have worse fertility than drive homozygous males, which at least have two partially working copies of nanos rather than just one (in many cells, at least). Maybe check the testis for abnormal phenotypes?

      Overall, it would be favorable if the drive allele was somewhere more fit than a nonfunctional resistance allele. This could already be achieved in this drive, but it doesn't get much mention.

      Page 3: There should be a comma after, "Interestingly, while many of the observed mutations were predicted to abolish nanos expression" and "This could indicate that in these experiments".

      Page 3 last sentence: Please improve the clarity.

      Removing the EGFP is supposed to restore the fitness, and this was helpful in some previous integral drive constructs. This could get a bit more mention (it is possible that I missed this somewhere in the manuscript).

      Page 4: The MM-CP line and it's association with the integral drive strategy could get a little more introduction. Maybe even a supplemental figure showing the general idea.

      Page 5: "cassette is predicted to disrupt the CP function entirely (Fig. 5d)" also lacks a period.

      Page 5: "The subsequent stabilization of the nanosd frequency and the lack of rapid loss suggests that any associated fitness cost is primarily recessive." This is not quite correct because by this point, drive/wild-type heterozygotes are rare, and this is where you'd find a potential dominant fitness cost. It should be correct in the end stages where it is a mix of drive and functional/nonfunctional resistance alleles (though the nonfunctional resistance alleles may cause greater fitness costs when together with a drive - see above).

      Page 6: "Maternal deposition of Cas9, or Cas9;gRNA, into the zygote can lead to cutting at stages when homing is not favoured, and has been commonly observed for canonical Anopheles nanos drives9,10,35." Reference 35 (which is more suitable for referencing an example of nanos in other Anopheles) found some resistance alleles by deep sequencing, but the timing that they formed was unclear (it's not certain if it was maternal deposition). This study may be a more suitable reference: Carballar-Lejarazú R, Tushar T, Pham TB, James AA. Cas9-mediated maternal-effect and derived resistance alleles in a gene-drive strain of the African malaria vector mosquito, Anopheles gambiae. Genetics, 2022.

      Page 8: "could further reduce the likelihood of resistance allele formation by increasing the frequency of HDR events." Multiple gRNAs would mostly help by reducing functional resistance allele formation, especially since drive conversion is already very high in Anopheles.

      Page 8, last paragraph: This conclusion is perhaps a little optimistic considering the functional resistance alleles, which should get a little more attention in the summary or elsewhere in the discussion section.

      Figure 1d: The vertical text saying "Non-WT" is confusing. The circles themselves show + and -. Also, "-" isn't necessarily a knockout allele, so I'm not sure if - is the best symbol for resistance.

      Figure 2e: The vertical scale is not the most intuitive. Consider rearranging it to "Transition from larvae to pupae" starting at zero and going to 1 when all the larvae become pupae.

      Figure 2e-f: For both of these, there are clear differences between males and females. Thus, when comparing drive homozygotes to wild-type, it would probably be better to have separate statistical comparisons for males and females.

      Figure 3: Can any of these transcription results in individual genes potentially explain the observed fitness cost?

      Figure 3b: The scale here also doesn't quite make sense. It's the fraction of underdeveloped ovaries, but the graph is also perhaps trying to show whether just 1-2 ovaries are present, or maybe how many ovaries are undeveloped, but then it would say "zero"? This should be clarified. Number of ovaries and how well-developed they are is separate (it can be put on the same graph, but needs to be more clear).

      Figure 4f: The vertical axis should say "ONNV."

      Figure 5c-d: These should be labeled as the most common resistance allele. Also, I'm not sure how relevant it is, but we also found an alternate start codon here: Hou S, Chen J, Feng R, Xu X, Liang N, Champer J. A homing rescue gene drive with multiplexed gRNAs reaches high frequency in cage populations but generates functional resistance. J Genet Genomics, 2024. Maybe this is a more common problem than one would expect?

      Figure 5cd,S4,S5: They have a bit of a weird plot. Why not make four line graphs for each? Also, some alleles use the  symbol. + is wild-type, which is well understood, but - as resistance is not always clear, and seeing them together may confuse readers. Additionally, the fact that you have the most common resistance allele in Figure 5cd might mean that you know more about the genotype? If so, it would be best to separate wild-type and resistance alleles in whatever the final figure looks like.

      Some supplemental raw data files would be useful if they were available, but the figures are through enough that this isn't essential.

      Review by:

      Jackson Champer, with major assistance from Ruobing Feng (essentially section B) and Jie Du

      Referee cross-commenting

      We don't have any cross-comments, other than supporting the idea of slightly more comparisons to the authors' previous construct.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      A key innovation of the nanosd gene drive is its integral gene drive (IGD) design, which inserts the drive cassette directly into the A. gambiae nanos gene, incorporating only the minimal components necessary for drive function. The drive achieves high transmission rates, without causing widespread disruption of gene expression or increasing susceptibility to malaria parasites, and imposes an acceptable fitness cost-primarily a reduction in female fecundity when homozygous. The strong performance of nanosd can be attributed to its design: Cas9 is expressed in the correct cells and timing to induce efficient homing, effectively hijacking the nanos gene's natural expression profile. However, despite the careful design aimed at preserving nanos function, the rescue was incomplete: homozygous female drive carriers exhibited a clear reduction in ovarian function.

      In caged population trials, both the drive and a co-introduced anti-malaria effector gene reached high frequencies, even in the presence of emerging resistance alleles. Because the drive is inserted into an essential gene, nonfunctional resistance alleles are selected against and tend to be purged over time. Nonetheless, functional resistance remains a concern. The use of a single, though precisely positioned gRNA targeting the native nanos gene ATG site increases the likelihood of generating functional resistance alleles. Over the long term, if the drive imposes fitness costs, it may be outcompeted by such functional resistance alleles, potentially undermining the goal of sustained population modification.

      Overall, this study represent a notable advance in Anopheles mosquito gene drive development and can be considered as high impact. - Place the work in the context of the existing literature (provide references, where appropriate).

      Previous IGD efforts in Drosophila, mice and mosquitoes have demonstrated nearly super‐Mendelian inheritance but often at the expense of host fitness. For example, Nash et al. built an intronic‐gRNA Cas9 drive at the D. melanogaster rcd-1r locus that propagated efficiently through cage populations (Nash et al., 2022), and Gonzalez et al. reported that a Cas9 drive inserted at the germline zpg locus in Anopheles stephensi biased inheritance by ~99.8% (Gonzalez et al., 2025). However, these strong drives disrupted essential genes: in A. gambiae, inserting Cas9 into zpg produced efficient homing but rendered females largely sterile (Ellis et al., 2022). A similar germline Cas9 knock-in in Mus musculus enabled gene conversion in both sexes, albeit with only modest efficiency and potential fitness trade-offs (Weitzel et al., 2021). The current nanosd IGD is explicitly designed to overcome this limitation by selecting a more permissive gene target and using a minimal drive cassette, so as to preserve mosquito viability while maintaining robust drive efficiency, although still with reduced female drive homozygotes fertility.

      This nanosd gene drive like previous homing drives in Anopheles, is capable of achieving a high level of inheritance bias. Although it uses the endogenous nanos regulatory elements, which have less leaky somatic expression compared to using vasa (Gantz et al., 2015; Hammond et al., 2016; Hammond et al., 2017) or zpg promoters(Hammond et al., 2021; Kyrou et al., 2018), to drive Cas9 expression and thereby reduces somatic expression-induced female sterility, the incomplete rescue of nanos function still leads to reduced female fertility in drive homozygotes. - State what audience might be interested in and influenced by the reported findings.

      It's worth noting the broad audience that will find this work relevant. Gene drive developers and molecular geneticists will be impressed by the good drive performance and directly influenced by the design principles showcased here. The study's integral gene drive architecture that leverages the endogenous nanos regulatory elements, in-frame E2A peptide linkage for co-expression, and intronic insertion of gRNA and selectable markers addresses long-standing challenges in promoter leakage, somatic fitness costs, and resistance allele evolution. What's more, vector biologists and malaria researchers will be interested in the successful deployment of a gene drive in A. gambiae that actually carries a disease-blocking trait. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      We have worked on CRISPR gene drive development in both fruit flies and Anopheles mosquitoes and have experience with modeling their spread.

      References

      Ellis, D.A., Avraam, G., Hoermann, A., Wyer, C.A.S., Ong, Y.X., Christophides, G.K., and Windbichler, N. (2022). Testing non-autonomous antimalarial gene drive effectors using self-eliminating drivers in the African mosquito vector Anopheles gambiae. PLOS Genetics 18, e1010244-e1010244.

      Gantz, V.M., Jasinskiene, N., Tatarenkova, O., Fazekas, A., Macias, V.M., Bier, E., and James, A.A. (2015). Highly efficient Cas9-mediated gene drive for population modification of the malaria vector mosquito Anopheles stephensi. Proc Natl Acad Sci U S A 112, E6736-E6743.

      Gonzalez, E., Anderson, M.A.E., Ang, J.X.D., Nevard, K., Shackleford, L., Larrosa-Godall, M., Leftwich, P.T., and Alphey, L. (2025). Optimization of SgRNA expression with RNA pol III regulatory elements in Anopheles stephensi. Scientific Reports 15, 13408.

      Hammond, A., Galizi, R., Kyrou, K., Simoni, A., Siniscalchi, C., Katsanos, D., Gribble, M., Baker, D., Marois, E., Russell, S., et al. (2016). A CRISPR-Cas9 gene drive system targeting female reproduction in the malaria mosquito vector Anopheles gambiae. Nat Biotechnol 34, 78-83.

      Hammond, A., Karlsson, X., Morianou, I., Kyrou, K., Beaghton, A., Gribble, M., Kranjc, N., Galizi, R., Burt, A., Crisanti, A., et al. (2021). Regulating the expression of gene drives is key to increasing their invasive potential and the mitigation of resistance. PLOS Genetics 17, e1009321-e1009321.

      Hammond, A.M., Kyrou, K., Bruttini, M., North, A., Galizi, R., Karlsson, X., Kranjc, N., Carpi, F.M., D'Aurizio, R., Crisanti, A., et al. (2017). The creation and selection of mutations resistant to a gene drive over multiple generations in the malaria mosquito. PLOS Genetics 13, e1007039-e1007039.

      Kyrou, K., Hammond, A.M., Galizi, R., Kranjc, N., Burt, A., Beaghton, A.K., Nolan, T., and Crisanti, A. (2018). A CRISPR-Cas9 gene drive targeting doublesex causes complete population suppression in caged Anopheles gambiae mosquitoes. Nature Biotechnology 36, 1062-1066.

      Nash, A., Capriotti, P., Hoermann, A., Papathanos, P.A., and Windbichler, N. (2022). Intronic gRNAs for the construction of minimal gene drive systems. Frontiers in Bioengineering and Biotechnology 0, 570-570. Weitzel, A.J., Grunwald, H.A., Ceri, W., Levina, R., Gantz, V.M., Hedrick, S.M., Bier, E., and Cooper, K.L. (2021). Meiotic Cas9 expression mediates gene conversion in the male and female mouse germline. Plos Biol 19, e3001478-e3001478.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03064

      Corresponding author(s): Massimo, Hilliard; Sean, Coakley

      1. General Statements

      We are grateful to the reviewers for taking time to review our manuscript and for providing such clear, insightful and actionable suggestions. The consensus between 4 independent reviewers that this story is of general interest to cell biologists, neurobiologists and clinical researchers is remarkable. In addition to our mechanistic insights into the regulation of GTPase activity, we think that the experimental systems we have developed will be of great value to study how GTPases their associated GAPs and GEFs function to maintain the nervous system, especially due to the demonstrated conservation of these molecules. We believe that our data provides a powerful and tractable model to study such molecules in a physiological context.

      We agree with the reviewers' concerns and propose the following plan below to address them.

      2. Description of the planned revisions

      Reviewer #1(Evidence, reproducibility and clarity (Required)):


      __Summary Stability of the PLM axon in C. elegans is maintained through interactions with the epidermis. Previous studies by this group found that loss of the tbc-10 Rab GTPase Activating Protein strongly enhanced the PLM axon break phenotype of unc-70/beta-spectrin mutants. TBC-10 is a GAP for RAB-35 and thus loss of rab-35 suppresses the tbc-10 phenotype. Of the two RAB-35 GEFs, loss of RME-4 partially suppressed the tbc-10 phenotype and FLCN-1 was not involved suggesting that there may be an additional GEF involved. Here Bonacossa-Pereira et al identify a point mutation in agef-1a (vd92) as a suppressor of tbc-10 PLM axon break phenotype (all experiments also have a dominant allele of unc-70) and confirm that point mutation is causative by replicating the mutation via genome editing (vd123). Rescue experiments demonstrate that AGEF-1a is required in the epidermis and not PLM as previous demonstrated with tbc-10 and unc-70. Rescue is dependent on a functional SEC7/GEF activity. AGEF-1a is a functional ortholog to human BIG2/ArfGEF2 as its expression fully rescues tbc-10. AGEF-1a functions upstream of RAB-35 as expression of activated RAB-35 can suppress loss of agef-1. AGEF-1a functions in parallel to RME-4 as the double has stronger suppression of tbc-10. AGEF-1a is an ARF GEF, however it functions independently of ARF-1.2 as loss of arf-1.2 does not suppress tbc-10. They demonstrate that AGEF-1a interacts with RAB-35 through colocalization experiments suggesting that AGEF-1a could directly activate RAB-35. Finally, they demonstrate that AGEF-1a regulates the localization of the LET-805 epidermal attached complex component as it restores localization in a tbc-10 mutant.

      Major comments

      The manuscript is well written and easy to understand.

      The experiments are well done and controlled.

      I enjoyed reading this paper. However...

      Some of the claims are not supported by the data.__

      __1) The claim that AGEF-1a directly interacts with RAB-35 was not demonstrated. The evidence provided to support a direct interaction are colocalization experiments in Figure 3. AGEF-1a does partially colocalize with RAB-35 in the epidermis. However, colocalization does not indicate a physical interaction direct or indirect. A simple fix would be to change the claim to that they partially colocalize. Optional, a physical interaction could be done with the split-GFP since they already have the AGEF-1 strain or they could perform co-IP experiments, though neither of those are proof of direct interactions.

      __

      We agree that the biochemical co-IP experiment could provide some answers, however, using a full length AGEF-1a would not only represent a significant technical challenge but will also not prove a direct interaction in a physiological context. To overcome this limitation, and to directly test their interaction in vivo, we propose to use a split-GFP approach as suggested by the reviewer. In this experiment, we will generate an endogenously tagged GFP1-10::rab-35 allele and combine it with the previously generated and available tagged agef-1a::GFP11x7. If AGEF-1 and RAB-35 closely interact, we should observe the reconstitution of full length GFP. It is possible that the endogenously tagged versions only provide a very weak GFP signal that will be difficult to detect. As an alternative approach, we will generate the same tagged molecules as overexpressed transgenes under epidermal-specific promoters (such as Pdpy-7). If the results are still negative, we agree to temper our claim that these molecules physically interact and rephrase the manuscript to reflect the new data.

      • *

      2) The claim that AGEF-1a facilitates RAB-35 activation is not supported. While it is likely that AGEF-1a facilitates RAB-35 activation based on the epistasis experiments as well as studies in mammalian cells there were no experiments to demonstrate that modulating AGEF-1a activity resulted in a change in RAB-35 activity. I would suggest tempering this claim to something along the line that the data are consistent with AGEF-1a regulating RAB-35 activity as shown in mammalian cells. An optional experiment would be to look at the colocalization of RAB-35 with a known effector in wild type and agef-1(vd92) with the expectation that there would be a higher level of colocalization in agef-1 mutants. Effector pull-down experiments or perhaps a cell based GEF assay could be used (PMID: 35196081).


      We welcome this suggestion and acknowledge the limitations of these experiments. While we might be able to determine if AGEF-1 and RAB-35 physically interact in vivo with the experiments proposed above, screening for the relevant rab-35 effector in this context and/or doing effector pull-down/cell based GEF assays would be a significant technical challenge. We propose to temper our claim as suggested.

      3) The claim that AGEF-1a functions independently of ARF-1.2 is not well supported. The fact that the ARF-1.2 mutant does not suppress tbc-10 suggests that ARF-1.2 may not be involved but does not eliminate the possibility that ARF-1.2 functions redundantly with ARF-5 or WARF-1/ARF-1.1. This can be resolved by toning down the claim. Alternatively, this can be tested by RNAi of arf-5 and warf-1 in tbc-10 and arf-1.2; tbc-10 mutants.

      We agree that warf-1 and arf-5 could be functioning redundantly with arf-1.2. We have attempted to generate an AID::arf-5 allele to test the effect of cell-specific degradation, but homozygous AID::arf-5 animals were lethal. We have not yet examined warf-1. We believe the best way to test these two molecules is through RNAi knockdown, and we propose to do this experiment and adjust our interpretation and discussion according to the new data.

      Minor comments

      Figure 1C the CRISPR generated allele (vd123) is referred to as [S784L] and then in 1E vd92 is referred to as [S784L]. Perhaps it would be clearer if the allele name was used instead of the amino acid change.

      We will reformat the manuscript to include the allele names instead of amino acid change.

      Page 6 "We reasoned that if the S784L mutation we isolated causes a similar loss of the GTPase activation function, then SKIN::AGEF-1a[E608K] would not have the capacity to restore the rate of PLM axon breaks to background levels in agef-1[S784L]; tbc-10; vdSi2 animals." It was unclear to me whether you were testing if the S784L mutation could be disrupting a GEF independent function or might disrupt the nucleotide exchange activity as might be tested in a biochemical assay. There are many reasons this change could cause a loss of function phenotype (ie. Improper folding, mislocalization, etc.). The most clear explanation would be that you were testing if GEF function was required for rescue rather than testing if the S784L mutation disrupted GEF activity.

      Indeed, this experiment reveals that reducing the activation of the AGEF-1 target phenocopies the effect of S784L and does not further enhance the effect of S784L. However, it does not answer if, specifically, the GEF function is affected by S784L. We propose to rewrite the quoted sentence as follows: "We asked whether the GEF function is required for axonal damage. If that is the case, then SKIN::AGEF-1a[E608K] overexpression should phenocopy the effect of AGEF-1a[S784L]."

      • *

      Page 13. It was unclear how testing if AGEF-1, RME-4, ARF-5 and RAB-35 form complexes in vivo (I assume you are suggesting colocalize based on figure 3 interpretation) would resolve how AGEF-1 was regulating RAB-35.


      We apologize that our phrasing was not clear. We will rewrite this section to better reflect the following idea. Given literature data showing an allosteric interaction between RME-4/DENND1 and ARF-5/Arf5, and our own data showing that AGEF-1 regulates RAB-35, we believe these molecules could form a complex. Considering that we do not have data to support this notion, mostly due to the inability to test the effect of ARF-5, we will present this possibility in the discussion section.


      __**Cross-commenting**

      I agree with the comments made by the other reviewers and I stand by my own as well. I will echo that it is important to know the nature of their agef-1 allele.

      Reviewer #1 (Significance (Required)):

      Bonacossa-Pereira et al identify AGEF-1 as a regulator of axon integrity that functions in a pathway with RAB-35 in the epidermis is an exciting finding. As pointed out in the discussion, mutations in the human ortholog cause neurodevelopmental defects which leads to obvious characterization of BIG2/ArfGEF2 in neurons while this study indicates that this protein can have cell non-autonomous roles in regulating neurons. These findings could have important implications for understanding the etiology of these defects that would be of interest to neurobiologists and clinical researchers.

      The finding of this paper would also be of interest to cell biologists and particularly those studying the roles of Rab and Arf GTPases in membrane trafficking, such as myself. The idea that AGEF-1 might function as a Rab35 GEF is provocative and would generate a lot of interest and skepticism from the field. However, there is no data to support that AGEF-1 would be a direct regulator of Rab35 over the previously demonstrated cross regulation of Rab35 by Arf GTPases. Therefore, it would be fine to speculate in the discussion a direct interaction, but I would refrain from suggesting this as a model and elsewhere in the manuscript.

      __

      Although we agree that current evidence is not sufficient to support the model where AGEF-1 is a direct regulator of RAB-35, our data points to the direction where there is an important genetic relationship between these molecules in a physiological context in a living animal, with a defined phenotype relevant to the nervous system maintenance. We think that the proposed revision experiments will provide a better understanding of how AGEF-1 functions with RAB-35 and we agree with the suggestion to rephrase our manuscript to reflect the limitations of our results.


      __Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This interesting manuscript reports the outcome of a fruitful C. elegans genetic screen with a complex but clever design. Through it, the authors identify AGEF-1 as a GEF that likely regulates the active state of the GTPase RAB-35 in the skin to protect touch receptor axons from mechanical breakage.

      Major points: 1. Based on localization experiments, the authors claim "AGEF-1a interacts with RAB-35 in the epidermis" (Results heading) and state "these data demonstrate that AGEF-1a interacts with a subset of RAB-35 molecules in the epidermis." In general, localization studies cannot be used to conclude physical interaction (with some exceptions such as single-molecule kinetics). In this case, the data in my view do not even make a compelling argument for co-localization. There is a lot of AGEF-1 and RAB-35 signal everywhere and it may not be meaningful that the signals sometimes overlap. A more quantitative approach with controls would be needed to conclude meaningful co-localization. Importantly, this would still not demonstrate interaction.__

      We thank the reviewer for the comment. Indeed, co-localization does prove a physical interaction, and we appreciate the concern about our imaging data not making a compelling argument. To address this notion, we plan to perform an experiment using a more robust, quantitative and physiologically relevant strategy. We will generate an endogenously tagged mScarlet3::rab-35 allele for precise endogenous localization. In addition, as a positive control, we will generate an endogenous rme-4::GFP11x7 allele to cell-specifically demonstrate the level of colocalization of RME-4 with mScarlet3::RAB-35 within the epidermis. To address the possible interaction between AGEF-1a and RAB-35 we will leverage a split-GFP approach to assess their interaction in vivo, in the context relevant to the phenotypes we observed (see reply to reviewer #1 point 1).

      __2. The effect of the AGEF-1(S784L) mutation is not clear to me. Naively, as the S784L mutation lies in the auto-inhibitory domain, I would have expected AGEF-1 to become constitutively active, not inactive as the authors seem to suggest. Is the idea that it is constitutively auto-inhibited? The main evidence for a loss of function effect seems to be that a putative dominant negative mutation AGEF-1(E608K) does not further supress axon breakage when co-expressed in trans to AGEF(S784L), but in my view this only shows that, once the defect is suppressed, it cannot be suppressed any further. Defining the nature of the S784L allele is important. Some suggestions, although the authors may come up with different approaches: use of an inducible or cell-specific depletion system like AID/TIR1, Cre/lox, or FLP/FRT to circumvent the lethality of agef-1(0) and reveal what a true loss-of-function looks like; testing if deletion of the auto-inhibitory domain phenocopies S784L to test if this mutation impairs autoinhibition.

      __

      This is an very insightful comment. To address this point, we will follow the reviewer's suggestion and deplete AGEF-1 cell-specifically in the epidermis using the auxin-inducible degron system. Specifically, we will generate an agef-1::AID allele to degrade this molecule in a spatially and temporally controlled fashion, which will allow to circumvent the lethality of agef-1(0) and determine whether the S784L allele mimics the depletion of AGEF-1.

      Although it would be interesting to further dissect the effect of this mutation on AGEF-1 activity, we believe that this falls outside of the scope of this manuscript. As an alternative, we propose to elaborate more in the discussion the implications of the possible roles for the S784L mutation to clarify our model of its function. Our data supports a model in which this mutation reduces AGEF-1 function leading to a reduction in the activity of its downstream target GTPases. It is possible that this is due to AGEF-1 becoming constitutively autoinhibited, or that this mutation affects the structure of the molecule in a way that it reduces its affinity towards its downstream effectors.

      Minor points: 1. I am not able to see the "vesicle-like structures with a clear luminal space" or RAB-35 being "notably enriched at the membrane near the epidermal furrow" in Fig. 3. The "3D surface rendering" in Fig. 3e is grossly oversampled and should not be included.

      We will rectify this section and include new super-resolved images using Airyscan confocal microscopy. We hope these will yield a better-quality representation of these concepts. __ 2. As the agef-1a isoform is specifically referenced throughout, please describe the different agef-1 isoforms somewhere to save readers from having to look this up.__

      Yes, we will include a description of the isoforms. In C. elegans there are two: AGEF-1a which has been confirmed by cDNA and AGEF-1b which is predicted and partially confirmed by cDNA. The mutation we isolated exclusively affects AGEF-1a.

      3. The authors include an interesting speculation in the Discussion: "Future investigations of BIG2-associated neurological disorders should consider... hyper-activity of BIG2 as a driver of neuropathology." If the authors have the tools to test the effect of hyperactive BIG2 in this system, it could be an exciting addition.


      This is an exciting idea that we would like to keep in the Discussion. The biology of BIG2 activity regulation is a nascent field of research and we believe that to accurately generate and characterise a hyperactive BIG2 would be beyond the scope of this manuscript.

      __ On a personal note, since GEFs act oppositely to GTPase Activating Proteins (GAPs), I had to stop and re-read carefully whenever the authors referred to a GEF "activating" a GTPase. I understand their meaning (i.e., putting the GTPase in its active GTP-bound state, not activating its GTPase function) but I wanted to point out this potential confusion in case there is a way to better define terms in the Introduction or change word choice. I realize this may be a standard jargon in the field.__

      Indeed, this is confusing nomenclature and a difficult concept to deliver in an accurate and succinct manner. We propose to include a clearer, more didactic explanation of their function. In a simple explanation, GTPases perform cellular functions when bound to GTP. GAPs terminate GTPase activity by catalysing GTP hydrolysis, generating GDP. GEFs initiate GTPase activity by catalysing the release of GDP and allowing GTP binding.

      __ Please check the correct nomenclature for CRISPR/Cas9.__


      We will rectify where appropriate.

      __6. p.7 "these molecules act in synergy", consider replacing with "redundantly".

      __

      We will rectify where appropriate.

      __Reviewer #2 (Significance (Required)):

      The significance of this story is to show that GEF-GTPases pairing can be highly context-dependent. Previous studies have identified GEFs that pair with RAB-35 and GTPases that pair with AGEF-1, but the authors find that these factors have at best a modest role in the context of skin-axon interactions. Instead, the authors suggest a novel GTPase-GEF pairing of RAB-35 with AGEF-1 and provide evidence that this relationship is conserved in the human homolog of AGEF-1. These results suggest that GTPase-GEF pairings depend not only on chemical affinity but also cellular context.

      The main strength of the study is its clever genetics. For the screen, the authors looked for suppressors of a synthetic defect in axon integrity caused in part by elevated activity of RAB-35 due to loss of its GAP TBC-10. It is satisfying that this screen isolated a mutation in a GEF that in principle could counterbalance the loss of a GAP.

      The main weakness of the study is the lack of direct evidence for an AGEF-1/RAB-35 interaction. While not necessary for publication, the inclusion of biochemical data to support the role of AGEF-1 as a GEF for RAB-35 and the effect of the S784L mutation on this activity would strongly elevate the study. The genetic data for this interaction are consistent with the model but not conclusive, and in my view the colocalization data are not compelling. Nevertheless this is a solid genetic story with a clever screen.__

      __ __We appreciate the feedback and are grateful for the positive comments on the significance of our study. As explained in the significance section related to Reviewer 1, if we find evidence of a direct interaction between AGEF-1 and RAB-35 in the proposed new experiments, we will include it in the manuscript; alternatively, we will present it as a possibility in the discussion section, as suggested. We agree that a more nuanced understanding of the effect of the S784L is interesting and that our colocalization data can be improved, and we have proposed experiments to address these concerns.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This paper investigates the mechanism by which molecular pathways in the skin protect the processes of nerves that innervate them from damage. The authors previously showed that spectrin and the small GTPase RAB-35 act in the epidermis of C. elegans to protect mechanosensory axons from breaking. In this paper they used a suppression screen to identify another gene involved in this process, an ARF-GEF called AGEF-1. Partial loss-of-function mutations in agef-1 suppress the axon-breakage phenotype of spectrin mutations, and genetic experiments by the authors are consistent with the possibility that AGEF-1 could act directly as an exchange factor for RAB-35. Consistent with this model, they show that AGEF-1 and RAB-35 colocalise in the skin.

      Major comments: The experiments in this paper are well-designed and well-controlled, and the interpretations of the results are all reasonable. On the other hand, I don't think the authors' hypothesis that AGEF-1 acts directly as an exchange factor for RAB-35, or that these two proteins directly interact, is definitively proven. This is not an issue of the authors overinterpreting their data--the paper is very carefully and thoughtfully written. However, the most interesting and counterintuitive finding--that an ARF-GEF could also be a RAB-GEF--might be strengthened with more experiments (for example, could they more directly show protein-protein interaction through co-IP or mass spec?).__

      We thank the reviewer for the suggestion. We propose to further investigate the notion that AGEF-1a might be a direct interactor of RAB-35 using a split-GFP approach to assess whether these molecules closely interact, in vivo, in the physiological context that is relevant for the maintenance of the touch sensing neurons (please see reply to reviewer #1 major point 1 and reviewer #2 major point 1 for more details).

      Minor comments: There are also two places where the fact that null mutations are lethal (for agef-1 and arf-5) prevented the authors from addressing the effect of agef-1 loss of function in the skin, and addressing whether ARF-5 could be an AGEF-1 target, respectively. In principle, they could have tried to make a CRISPR line in which these genes could be cell-specifically deleted in the skin (using a dpy-7-driven recombinase). I don't think either of these experiments are essential, but if it is feasible to make these lines it would tie up a couple of loose ends.

      We agree to explore the roles of agef-1 and arf-5 loss-of-function. We propose to tissue-specifically degrade agef-1 using an auxin-inducible degradation strategy (please see reviewer #2 major point 2 reply for more details). For arf-5, we propose knocking-down its function using RNAi to overcome lethality (please see reviewer #1 major point 3 reply for more details).

      __Reviewer #3 (Significance (Required)):

      Overall I think this is an interesting paper on a topic of general interest. The most interesting finding is that an exchange factor for an ARF (a small GRPase involved in vesicle coating/uncoating) could also be an exchange factor for a RAB (a small GTPase involved in vesicle tethering). The evidence presented is suggestive and intriguing, though as noted above not completely definitive. In summary, I think it is an interesting paper in its current form, and anything it could do to more firmly establish a direct interaction between AGEF-1 and RAB-35 would increase its impact and importance.

      __

      We thank the reviewer for the positive evaluation of the significance of our study.

      __ Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary: In this study Bonacossa-Pereira et al. identify AGEF-1a, an Arf-GEF, as a factor that functions in the epidermis through RAB-35 to regulate axonal integrity of the PLM mechanosensory neurons in C. elegans. Specifically, epidermal attachment sites are regulated by these genes form the epidermis and compromising these attachment sites results in axonal degeneration. The study provides some evidence that that RAB-35 and AGEF-1 at least partially colocalize in the skin. Finally, the authors provide evidence that the human orthologue BIG2 is capable of functionally replacing AGEF-1a in C. elegans. Overall, the experiments are well designed and the paper is clear and succinct. The conclusions are supported by the findings and provide an important extension of the author's findings a few back, when they identified the role of rab-35 in mediating the epidermal-neuronal attachment sites.

      Major comments: 1. AGEF-1/BIG2 are known to regulate other GTPases such as ARF-5 or ARF-2. The authors exclude a non-redundant function for ARF-2, but are unable to establish a role for ARF-5 because of the lethality associated with the mutation. Alternative approaches, such as cell specific knock out or knock down experiment. In addition, studies to test potentially physical interaction such as pull-down assays, co-IP experiments and FRET could be used to test whether AGEF-can bind RAB-35 or ARF-5.__

      We thank the reviewer for this suggestion. We propose addressing these concerns using a tissue-specific degradation for AGEF-1a (please see reviewer #1 major point 2 for details). To establish a role for ARF-5 we propose to do an RNAi mediated knock-down to overcome lethality (please see reviewer #1 major point 3 for details). Finally, we plan to use a split-GFP approach to test the physical interaction between agef-1a and rab-35 in vivo (please see reviewer #1 major point 1 for details)

      __ Phenotypic readout has been limited to only axon breaks. It may be interesting to also test other aspects such as axonal deformities including swellings and vesiculation in other parts of the nervous system. Moreover, behavioral or functional experiments such as response to gentle touch or synaptic integrity could be informative.__

      We have not observed any obvious touch receptor neurons axonal phenotypes other than axonal breaks in these mutants, and we will include a statement that reflects this concept. In relation to the behavior, we have not tested it as the results will be difficult to interpret for two reasons: first, the breaks are not always bilateral and one neuron is sufficient to provide mechanical response; second, the mixed identity of the PLM neurite allows it to retain some function despite being severed. However, if deemed essential, we will perform these experiments.

      __ Overexpression constructs such as SKIN::RAB-35[Q69L], SKIN::BIG2, SKIN::AGEF-1a[E608K] in extrachromosomal transgenes could lead to non-physiological localization or effects. Single copy expression using MosSCI or CRISPR insertions are generally considered better approaches (other than endogenous reporters) to provide accurate insights at the physiological level. While the authors tacitly acknowledge this by conducting the experiments in a rab-35 mutant background and very low transgene concentration, at the very least this caveat regarding the localization should be discussed.__

      This is an important remark, and we appreciate the comment. We acknowledge that experiments using extrachromosomal arrays have inherent caveats, especially for localization studies. To address the RAB-35 localization concern we plan to repeat the localization studies using an endogenously tagged RAB-35 using CRISPR to overcome the possible artifacts caused by extrachromosomal array driven expression (please see reviewer #1 point 1 for more details). For the cell-specific rescues or dominant-negative constructs expression, we believe that using extrachromosomal arrays is sufficient, since this allows us to compare genetically identical transgenic vs non-transgenic siblings of independent lines. Moreover, given these constructs are already driven by a tissue-specific promoter that is inherently stronger than their respective endogenous promoters, even a single-copy insertion would have the same caveats.

      __4. The study does not address clearly whether AGEF-1a acts in parallel to spectrin or upstream/ downstream to it. Epistasis experiments could help to figure out the signaling pathway involved.

      __

      Indeed, this is a concept that we need to communicate more clearly. We have data showing that a mutation in agef-1 does not cause axonal damage on its own, and that it has no effect on the axonal damage caused by unc-70 dominant negative mutation alone. We only detect an effect of agef-1 when tbc-10 is mutated together with unc-70 (Fig. 1a of manuscript). Together, these data indicate that agef-1 functions upstream of rab-35, thus acting in parallel to unc-70 (see schematic below) to ensure the mechanical stability of neuron epidermal attachment. We plan to include this data and the following schematic as a supplement to better convey the idea and discuss the results appropriately.

      __ The finding that BIG2 rescues the mutant defect is an important finding and rightfully finds its place in the abstract. I wonder whether a reference to the human diseases caused by loss of BIG2 in the abstract and introduction would not increase interest/impact for the study, rather than burying this potentially interesting connection in the discussion.

      __

      We appreciate the reviewer's comment, and welcome the suggestion. We propose to include relevant background about BIG2-related human diseases in the abstract and introduction as suggested and expand the discussion regarding BIG2 mutations.

      __Minor comments:

      1. Some explanation about how mutating the autoinhibitory domain could impact the catalytic activity of a GEF might be helpful.__

      2. *

      We acknowledge that this notion was not well communicated. We propose to elaborate more about why we think a mutation in the autoinhibitory domain might be affecting the GEF activity and we plan to do further experiments to dissect how this might be happening. Please see reviewer #2 major point 2 for a more detailed explanation.

      __ The paper refers to rme-4(b1001) as a null allele while wormbase refers to the same as a missense allele. It would be more accurate to refer rme-4(b1001) as a strong loss of function or putative null.__

      We agree and will refer to b1001 as a strong loss-of-function.

      __ The paper does not clearly discuss limitations of the hypomorphic agef-1[S784L] and that the observed phenotypes in this hypomorph might underestimate the complete role of AGEF-1a.__

      • *

      We thank the reviewer for this suggestion. We propose to elaborate more on these limitations, especially considering the possible new results from the experiments suggested in reply to reviewer #2 major comment point 2.

      __ In figure 1, where there really only one extrachromosomal transgenic line for some of the construct tested? __

      • *

      For the Pdpy-7::AGEF-1a lines we have scored 3 transgenic lines (data not included) and only one yielded a full rescue. For all extrachromosomal lines presented, we tested 3 independent transgenic lines. For brevity, we only included the result for the positive rescues (1 for BIG2 and 1 for AGEF-1a), except for the Pmec-4 lines, of which none rescued the phenotype (data included in Table S2). We will update Table S2 to include all the lines tested.

      __ The concentrations of transgenes vary in different transgenes. Is there a rationale behind this? __

      Yes, we have attempted multiple concentrations of injections for each transgene and there was some variability for each construct injected, thus we only included the ones where we observed an effect. As mentioned in point 4 above, we will update Table S2 to include details of all lines tested.

      __ In Fig.1e: I may be useful to also show the "WT" phenotype, i.e. the strong defects to get a visual comparison for the degree of rescue. __

      • *

      We think this suggestion will help the readers. We will include this as a representative dashed line showing the WT phenotype.

      __Reviewer #4 (Significance (Required)):

      The study has identified AGEF-1a as a regulator of axonal maintenance, functioning to protect neurons against mechanical stress by acting through RAB-35. Additionally, this epidermal GEF, AGEF-1a is functionally conserved as its human orthologue BIG2 can replace AGEF-1a in C. elegans for axonal protection. Important points here are that the findings extend prior work by the authors of non-autonomous mechanism that regulates epidermal-neuronal attachment. In my humble opinion, the human disease connection, in particular with regard to the unexplained neuronal phenotypes in patients could be better developed in the manuscript. It may also increase impact/interest of a wonderful story that right now reads a bit 'wormy'.__


      This is an important remark and we are grateful for the positive comments. The fact that human BIG2 is also conserved in C. elegans points to a fundamental role of this molecule in multicellular life, and it provides a tractable model to investigate the function of this molecule in a physiological context. We welcome the suggestion to elaborate more the connection with the unexplained neuronal phenotypes in patients and use a more accessible language to convey our findings to a wider audience.


      3. Description of the revisions that have already been incorporated in the transferred manuscript

      N/A

      4. Description of analyses that authors prefer not to carry out

      __Reviewer #1 __


      "...studies to test potentially physical interaction such as pull-down assays, co-IP experiments and FRET could be used to test whether AGEF-can bind RAB-35 or ARF-5."


      While pull-down assays, co-IP and FRET would reveal whether AGEF-1a can form a complex with RAB-35, we believe that using a full length AGEF-1a would not only represent a significant technical challenge but will also not prove a direct interaction in a physiological context.


      "...An optional experiment would be to look at the colocalization of RAB-35 with a known effector in wild type and agef-1(vd92) with the expectation that there would be a higher level of colocalization in agef-1 mutants. Effector pull-down experiments or perhaps a cell based GEF assay could be used (PMID: 35196081)."


      We think that screening for the relevant rab-35 effector in this context and/or doing effector pull-down/cell based GEF assays would be a significant technical challenge. We propose to address this concern by tempering our claim as suggested by the reviewer.


      "...It may be interesting to also test other aspects such as axonal deformities including swellings and vesiculation in other parts of the nervous system. Moreover, behavioral or functional experiments such as response to gentle touch or synaptic integrity could be informative."

      As indicated above in major point 2 of reviewer 4, these are interesting ideas that might answer how the function of these neurons might be affected. However, in addition to the challenges indicated above, they will not provide further insights into how their integrity is maintained. We believe these will fall outside the scope of the manuscript, but if deemed essential we will perform behavioral analysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary:

      In this study Bonacossa-Pereira et al. identify AGEF-1a, an Arf-GEF, as a factor that functions in the epidermis through RAB-35 to regulate axonal integrity of the PLM mechanosensory neurons in C. elegans. Specifically, epidermal attachment sites are regulated by these genes form the epidermis and compromising these attachment sites results in axonal degeneration. The study provides some evidence that that RAB-35 and AGEF-1 at least partially colocalize in the skin. Finally, the authors provide evidence that the human orthologue BIG2 is capable of functionally replacing AGEF-1a in C. elegans. Overall, the experiments are well designed and the paper is clear and succinct. The conclusions are supported by the findings and provide an important extension of the author's findings a few back, when they identified the role of rab-35 in mediating the epidermal-neuronal attachment sites.

      Major comments:

      1. AGEF-1/BIG2 are known to regulate other GTPases such as ARF-5 or ARF-2. The authors exclude a non-redundant function for ARF-2, but are unable to establish a role for ARF-5 because of the lethality associated with the mutation. Alternative approaches, such as cell specific knock out or knock down experiment. In addition, studies to test potentially physical interaction such as pull-down assays, co-IP experiments and FRET could be used to test whether AGEF-can bind RAB-35 or ARF-5.
      2. Phenotypic readout has been limited to only axon breaks. It may be interesting to also test other aspects such as axonal deformities including swellings and vesiculation in other parts of the nervous system. Moreover, behavioral or functional experiments such as response to gentle touch orsynaptic integrity could be informative.
      3. Overexpression constructs such as SKIN::RAB-35[Q69L], SKIN::BIG2, SKIN::AGEF-1a[E608K] in extrachromosomal transgenes could lead to non-physiological localization or effects. Single copy expression using MosSCI or CRISPR insertions are generally considered better approaches (other than endogenous reporters) to provide accurate insights at the physiological level. While the authors tacitly acknowledge this by conducting the experiments in a rab-35 mutant background and very low transgene concentration, at the very least this caveat regarding the localization should be discussed.
      4. The study does not address clearly whether AGEF-1a acts in parallel to spectrin or upstream/ downstream to it. Epistasis experiments could help to figure out the signaling pathway involved.
      5. The finding that BIG2 rescues the mutant defect is an important finding and rightfully finds its place in the abstract. I wonder whether a reference to the human diseases caused by loss of BIG2 in the abstract and introduction would not increase interest/impact for the study, rather than burying this potentially interesting connection in the discussion.

      Minor comments:

      1. Some explanation about how mutating the autoinhibitory domain could impact the catalytic activity of a GEF might be helpful.
      2. The paper refers to rme-4(b1001) as a null allele while wormbase refers to the same as a missense allele. It would be more accurate to refer rme-4(b1001) as a strong loss of function or putative null.
      3. The paper does not clearly discuss limitations of the hypomorphic agef-1[S784L] and that the observed phenotypes in this hypomorph might underestimate the complete role of AGEF-1a.
      4. In figure 1, where there really only one extrachromosomal transgenic line for some of the construct tested?
      5. The concentrations of transgenes vary in different transgenes. Is there a rationale behind this?
      6. In Fig.1e: I may be useful to also show the "WT" phenotype, i.e. the strong defects to get a visual comparison for the degree of rescue.

      Significance

      The study has identified AGEF-1a as a regulator of axonal maintenance, functioning to protect neurons against mechanical stress by acting through RAB-35. Additionally, this epidermal GEF, AGEF-1a is functionally conserved as its human orthologue BIG2 can replace AGEF-1a in C. elegans for axonal protection. Important points here are that the findings extend prior work by the authors of non-autonomous mechanism that regulates epidermal-neuronal attachment. In my humble opinion, the human disease connection, in particular with regard to the unexplained neuronal phenotypes in patients could be better developed in the manuscript. It may also increase impact/interest of a wonderful story that right now reads a bit 'wormy'.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper investigates the mechanism by which molecular pathways in the skin protect the processes of nerves that innervate them from damage. The authors previously showed that spectrin and the small GTPase RAB-35 act in the epidermis of C. elegans to protect mechanosensory axons from breaking. In this paper they used a suppression screen to identify another gene involved in this process, an ARF-GEF called AGEF-1. Partial loss-of-function mutations in agef-1 suppress the axon-breakage phenotype of spectrin mutations, and genetic experiments by the authors are consistent with the possibility that AGEF-1 could act directly as an exchange factor for RAB-35. Consistent with this model, they show that AGEF-1 and RAB-35 colocalise in the skin.

      Major comments: The experiments in this paper are well-designed and well-controlled, and the interpretations of the results are all reasonable. On the other hand, I don't think the authors' hypothesis that AGEF-1 acts directly as an exchange factor for RAB-35, or that these two proteins directly interact, is definitively proven. This is not an issue of the authors overinterpreting their data--the paper is very carefully and thoughtfully written. However, the most interesting and counterintuitive finding--that an ARF-GEF could also be a RAB-GEF--might be strengthened with more experiments (for example, could they more directly show protein-protein interaction through co-IP or mass spec?).

      Minor comments: There are also two places where the fact that null mutations are lethal (for agef-1 and arf-5) prevented the authors from addressing the effect of agef-1 loss of function in the skin, and addressing whether ARF-5 could be an AGEF-1 target, respectively. In principle, they could have tried to make a CRISPR line in which these genes could be cell-specifically deleted in the skin (using a dpy-7-driven recombinase). I don't think either of these experiments are essential, but if it is feasible to make these lines it would tie up a couple of loose ends.

      Significance

      Overall I think this is an interesting paper on a topic of general interest. The most interesting finding is that an exchange factor for an ARF (a small GRPase involved in vesicle coating/uncoating) could also be an exchange factor for a RAB (a small GTPase involved in vesicle tethering). The evidence presented is suggestive and intriguing, though as noted above not completely definitive. In summary, I think it is an interesting paper in its current form, and anything it could do to more firmly establish a direct interaction between AGEF-1 and RAB-35 would increase its impact and importance.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This interesting manuscript reports the outcome of a fruitful C. elegans genetic screen with a complex but clever design. Through it, the authors identify AGEF-1 as a GEF that likely regulates the active state of the GTPase RAB-35 in the skin to protect touch receptor axons from mechanical breakage.

      Major points:

      1. Based on localization experiments, the authors claim "AGEF-1a interacts with RAB-35 in the epidermis" (Results heading) and state "these data demonstrate that AGEF-1a interacts with a subset of RAB-35 molecules in the epidermis." In general, localization studies cannot be used to conclude physical interaction (with some exceptions such as single-molecule kinetics). In this case, the data in my view do not even make a compelling argument for co-localization. There is a lot of AGEF-1 and RAB-35 signal everywhere and it may not be meaningful that the signals sometimes overlap. A more quantitative approach with controls would be needed to conclude meaningful co-localization. Importantly, this would still not demonstrate interaction.
      2. The effect of the AGEF-1(S784L) mutation is not clear to me. Naively, as the S784L mutation lies in the auto-inhibitory domain, I would have expected AGEF-1 to become constitutively active, not inactive as the authors seem to suggest. Is the idea that it is constitutively auto-inhibited? The main evidence for a loss of function effect seems to be that a putative dominant negative mutation AGEF-1(E608K) does not further supress axon breakage when co-expressed in trans to AGEF(S784L), but in my view this only shows that, once the defect is suppressed, it cannot be suppressed any further. Defining the nature of the S784L allele is important. Some suggestions, although the authors may come up with different approaches: use of an inducible or cell-specific depletion system like AID/TIR1, Cre/lox, or FLP/FRT to circumvent the lethality of agef-1(0) and reveal what a true loss-of-function looks like; testing if deletion of the auto-inhibitory domain phenocopies S784L to test if this mutation impairs autoinhibition.

      Minor points:

      1. I am not able to see the "vesicle-like structures with a clear luminal space" or RAB-35 being "notably enriched at the membrane near the epidermal furrow" in Fig. 3. The "3D surface rendering" in Fig. 3e is grossly oversampled and should not be included.
      2. As the agef-1a isoform is specifically referenced throughout, please describe the different agef-1 isoforms somewhere to save readers from having to look this up.
      3. The authors include an interesting speculation in the Discussion: "Future investigations of BIG2-associated neurological disorders should consider... hyper-activity of BIG2 as a driver of neuropathology." If the authors have the tools to test the effect of hyperactive BIG2 in this system, it could be an exciting addition.
      4. On a personal note, since GEFs act oppositely to GTPase Activating Proteins (GAPs), I had to stop and re-read carefully whenever the authors referred to a GEF "activating" a GTPase. I understand their meaning (i.e., putting the GTPase in its active GTP-bound state, not activating its GTPase function) but I wanted to point out this potential confusion in case there is a way to better define terms in the Introduction or change word choice. I realize this may be a standard jargon in the field.
      5. Please check the correct nomenclature for CRISPR/Cas9.
      6. p.7 "these molecules act in synergy", consider replacing with "redundantly".

      Significance

      The significance of this story is to show that GEF-GTPases pairing can be highly context-dependent. Previous studies have identified GEFs that pair with RAB-35 and GTPases that pair with AGEF-1, but the authors find that these factors have at best a modest role in the context of skin-axon interactions. Instead, the authors suggest a novel GTPase-GEF pairing of RAB-35 with AGEF-1 and provide evidence that this relationship is conserved in the human homolog of AGEF-1. These results suggest that GTPase-GEF pairings depend not only on chemical affinity but also cellular context.

      The main strength of the study is its clever genetics. For the screen, the authors looked for suppressors of a synthetic defect in axon integrity caused in part by elevated activity of RAB-35 due to loss of its GAP TBC-10. It is satisfying that this screen isolated a mutation in a GEF that in principle could counterbalance the loss of a GAP.

      The main weakness of the study is the lack of direct evidence for an AGEF-1/RAB-35 interaction. While not necessary for publication, the inclusion of biochemical data to support the role of AGEF-1 as a GEF for RAB-35 and the effect of the S784L mutation on this activity would strongly elevate the study. The genetic data for this interaction are consistent with the model but not conclusive, and in my view the colocalization data are not compelling. Nevertheless this is a solid genetic story with a clever screen.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Stability of the PLM axon in C. elegans is maintained through interactions with the epidermis. Previous studies by this group found that loss of the tbc-10 Rab GTPase Activating Protein strongly enhanced the PLM axon break phenotype of unc-70/beta-spectrin mutants. TBC-10 is a GAP for RAB-35 and thus loss of rab-35 suppresses the tbc-10 phenotype. Of the two RAB-35 GEFs, loss of RME-4 partially suppressed the tbc-10 phenotype and FLCN-1 was not involved suggesting that there may be an additional GEF involved. Here Bonacossa-Pereira et al identify a point mutation in agef-1a (vd92) as a suppressor of tbc-10 PLM axon break phenotype (all experiments also have a dominant allele of unc-70) and confirm that point mutation is causative by replicating the mutation via genome editing (vd123). Rescue experiments demonstrate that AGEF-1a is required in the epidermis and not PLM as previous demonstrated with tbc-10 and unc-70. Rescue is dependent on a functional SEC7/GEF activity. AGEF-1a is a functional ortholog to human BIG2/ArfGEF2 as its expression fully rescues tbc-10. AGEF-1a functions upstream of RAB-35 as expression of activated RAB-35 can suppress loss of agef-1. AGEF-1a functions in parallel to RME-4 as the double has stronger suppression of tbc-10. AGEF-1a is an ARF GEF, however it functions independently of ARF-1.2 as loss of arf-1.2 does not suppress tbc-10. They demonstrate that AGEF-1a interacts with RAB-35 through colocalization experiments suggesting that AGEF-1a could directly activate RAB-35. Finally, they demonstrate that AGEF-1a regulates the localization of the LET-805 epidermal attached complex component as it restores localization in a tbc-10 mutant.

      Major comments

      The manuscript is well written and easy to understand.

      The experiments are well done and controlled.

      I enjoyed reading this paper. However...

      Some of the claims are not supported by the data.

      1. The claim that AGEF-1a directly interacts with RAB-35 was not demonstrated. The evidence provided to support a direct interaction are colocalization experiments in Figure 3. AGEF-1a does partially colocalize with RAB-35 in the epidermis. However, colocalization does not indicate a physical interaction direct or indirect. A simple fix would be to change the claim to that they partially colocalize. Optional, a physical interaction could be done with the split-GFP since they already have the AGEF-1 strain or they could perform co-IP experiments, though neither of those are proof of direct interactions.
      2. The claim that AGEF-1a facilitates RAB-35 activation is not supported. While it is likely that AGEF-1a facilitates RAB-35 activation based on the epistasis experiments as well as studies in mammalian cells there were no experiments to demonstrate that modulating AGEF-1a activity resulted in a change in RAB-35 activity. I would suggest tempering this claim to something along the line that the data are consistent with AGEF-1a regulating RAB-35 activity as shown in mammalian cells. An optional experiment would be to look at the colocalization of RAB-35 with a known effector in wild type and agef-1(vd92) with the expectation that there would be a higher level of colocalization in agef-1 mutants. Effector pull-down experiments or perhaps a cell based GEF assay could be used (PMID: 35196081).
      3. The claim that AGEF-1a functions independently of ARF-1.2 is not well supported. The fact that the ARF-1.2 mutant does not suppress tbc-10 suggests that ARF-1.2 may not be involved but does not eliminate the possibility that ARF-1.2 functions redundantly with ARF-5 or WARF-1/ARF-1.1. This can be resolved by toning down the claim. Alternatively, this can be tested by RNAi of arf-5 and warf-1 in tbc-10 and arf-1.2; tbc-10 mutants.

      Minor comments

      Figure 1C the CRISPR generated allele (vd123) is referred to as [S784L] and then in 1E vd92 is referred to as [S784L]. Perhaps it would be clearer if the allele name was used instead of the amino acid change.

      Page 6 "We reasoned that if the S784L mutation we isolated causes a similar loss of the GTPase activation function, then SKIN::AGEF-1a[E608K] would not have the capacity to restore the rate of PLM axon breaks to background levels in agef-1[S784L]; tbc-10; vdSi2 animals." It was unclear to me whether you were testing if the S784L mutation could be disrupting a GEF independent function or might disrupt the nucleotide exchange activity as might be tested in a biochemical assay. There are many reasons this change could cause a loss of function phenotype (ie. Improper folding, mislocalization, etc.). The most clear explanation would be that you were testing if GEF function was required for rescue rather than testing if the S784L mutation disrupted GEF activity.

      Page 13. It was unclear how testing if AGEF-1, RME-4, ARF-5 and RAB-35 form complexes in vivo (I assume you are suggesting colocalize based on figure 3 interpretation) would resolve how AGEF-1 was regulating RAB-35.

      Cross-commenting

      I agree with the comments made by the other reviewers and I stand by my own as well. I will echo that it is important to know the nature of their agef-1 allele.

      Significance

      Bonacossa-Pereira et al identify AGEF-1 as a regulator of axon integrity that functions in a pathway with RAB-35 in the epidermis is an exciting finding. As pointed out in the discussion, mutations in the human ortholog cause neurodevelopmental defects which leads to obvious characterization of BIG2/ArfGEF2 in neurons while this study indicates that this protein can have cell non-autonomous roles in regulating neurons. These findings could have important implications for understanding the etiology of these defects that would be of interest to neurobiologists and clinical researchers.

      The finding of this paper would also be of interest to cell biologists and particularly those studying the roles of Rab and Arf GTPases in membrane trafficking, such as myself. The idea that AGEF-1 might function as a Rab35 GEF is provocative and would generate a lot of interest and skepticism from the field. However, there is no data to support that AGEF-1 would be a direct regulator of Rab35 over the previously demonstrated cross regulation of Rab35 by Arf GTPases. Therefore, it would be fine to speculate in the discussion a direct interaction, but I would refrain from suggesting this as a model and elsewhere in the manuscript.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their careful assessment and enthusiastic appreciation of our work.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __In this article, Thomas et al. use a super-resolution approach in living cells to track proteins involved in the fusion event of sexual reproduction. They study the spatial organization and dynamics of the actin fusion focus, a key structure in cell-cell fusion in Schizosaccharomyces pombe. The researchers have adapted a high-precision centroid mapping method using three-color live-cell epifluorescence imaging to map the dynamic architecture of the fusion focus during yeast mating. The approach relies on tracking the centroid of fluorescence signals for proteins of interest, spatially referenced to Myo52-mScarlet-I (as a robust marker) and temporally referenced using a weakly fluorescent cytosolic protein (mRaspberry), which redistributes strongly upon fusion. The trajectories of five key proteins, including markers of polarity, cytoskeleton, exocytosis and membrane fusion, were compared to Myo52 over a 75-minute window spanning fusion. Their observations indicate that secretory vesicles maintain a constant distance from the plasma membrane whereas the actin network compacts. Most importantly, they discovered a positive feedback mechanism in which myosin V (Myo52) transports Fus1 formin along pre-existing actin filaments, thereby enhancing aster compaction.

      This article is well written, the arguments are convincing and the assertions are balanced. The centroid tracking method has been clearly and solidly controlled. Overall, this is a solid addition to our understanding of cytoskeletal organization in cell fusion.

      Major comments: No major comment.

      Minor comments: _ Page 8 authors wrote "Upon depletion of Myo52, Ypt3 did not accumulate at the fusion focus (Figure 3C). A thin, wide localization at the fusion site was occasionally observed (Figure 3C, Movies S3)" : Is there a quantification of this accumulation in the mutant?

      We will provide the requested quantification. The localization is very faint, so we are not sure that quantification will capture this faithfully, but we will try.

      _ The framerate of movies could be improved for reader comfort: For example, movie S6 lasts 0.5 sec.

      We agree that movies S3 and S6 frame rates could be improved. We will provide them with slower frame rate.

      Reviewer #1 (Significance (Required)):

      This study represents a conceptual and technical breakthrough in our understanding of cytoskeletal organization during cell-cell fusion. The authors introduce a high-precision, three-color live-cell centroid mapping method capable of resolving the spatio-temporal dynamics of protein complexes at the nanometer scale in living yeast cells. This methodological innovation enables systematic and quantitative mapping of the dynamic architecture of proteins at the cell fusion site, making it a powerful live-cell imaging approach. However, it is important to keep in mind that the increased precision achieved through averaging comes at the expense of overlooking atypical or outlier behaviors. The authors discovered a myosin V-dependent mechanism for the recruitment of formin that leads to actin aster compaction. The identification of Myo52 (myosin V) as a transporter of Fus1 (formin) to the fusion focus adds a new layer to our understanding of how polarized actin structures are generated and maintained during developmentally regulated processes such as mating.

      Previous studies have shown the importance of formins and myosins during fusion, but this paper provides a quantitative and dynamic mapping that demonstrates how Myo52 modulates Fus1 positioning in living cells. This provides a better understanding of actin organization, beyond what has been demonstrated by fixed-cell imaging or genetic perturbation.

      Audience: Cell biologists working on actin dynamics, cell-cell fusion and intracellular transport. Scientists involved in live-cell imaging, single particle tracking and cytoskeleton modeling.

      I have expertise in live-cell microscopy, image analysis, fungal growth machinery and actin organization.

      We thank the reviewer for their appreciation of our work.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __ A three-color imaging approach to use centroid tracking is employed to determine the high resolution position over time of tagged actin fusion focus proteins during mating in fission yeast. In particular, the position of different protein components (tagged in a 3rd color) were determined in relation to the position (and axis) of the molecular motor Myo52, which is tagged with two different colors in the mating cells. Furthermore, time is normalized by the rapid diffusion of a weak fluorescent protein probe (mRaspberry) from one cell to the other upon fusion pore opening. From this approach multiple important mechanistic insights were determined for the compaction of fusion focus proteins during mating, including the general compaction of different components as fusion proceeds with different proteins having specific stereotypical behaviors that indicate underlying molecular insights. For example, secretory vesicles remain a constant distance from the plasma membrane, whereas the formin Fus1 rapidly accumulates at the fusion focus in a Myo52-dependent manner.

      I have minor suggestions/points: (1) Figure 1, for clarity it would be helpful if the cells shown in B were in the same orientation as the cartoon cells shown in A. Similarly, it would be helpful to have the orientation shown in D the same as the data that is subsequently presented in the rest of the manuscript (such as Figure 2) where time is on the X axis and distance (position) is on the Y axis.

      We have turned each image in panel B by 180° to match the cartoon in A. For panel D, we are not sure what the reviewer would like. This panel shows the coordinates of each Myo52 position, whereas Figure 2 shows oriented distance (on the Y axis) over time (on the X axis). Perhaps the reviewer suggests that we should display panel D with a rotation onto the Y axis rather than the X axis. We feel that this would not bring more clarity and prefer to keep it as is.

      (2) Figure 2, for clarity useful to introduce how the position of Myo52 changes over time with respect to the fusion site (plasma membrane) earlier, and then come back to the positions of different proteins with respect to Myo52 shown in 2E. Currently the authors discuss this point after introducing Figure 2E, but better for the reader to have this in mind beforehand.

      We have added a sentence at the start of the section describing Figure 2, pointing out that the static appearance of Myo52 is due to it being used as reference, but that in reality, it moves relative to the plasma membrane: “Because Myo52 is the reference, its trace is flat, even though in reality Myo52 also moves relative to other proteins and the plasma membrane (see Figure 2E)”. This change is already in the text.

      (3) First sentence of page 8 "..., peaked at fusion time and sharply dropped post-fusion (Figure S3)." Figure S3 should be cited so that the reader knows where this data is presented.

      Thanks, we have added the missing figure reference to the text.

      (4) Figure 3D-H, why is Exo70 used as a marker for vesicles instead of Ypt3 for these experiments? Exo70 seems to have a more confusing localization than Ypt3 (3C vs 3D), which seems to complicate interpretations.

      There are two main reasons for this choice. First, the GFP-Ypt3 fluorescence intensity is lower than that of Exo70-GFP, which makes analysis more difficult and less reliable. Second, in contrast to Exo70-GFP where the endogenous gene is tagged at the native genomic locus, GFP-Ypt3 is expressed as additional copy in addition to endogenous untagged Ypt3. Although GFP-Ypt3 was reported to be fully functional as it can complement the lethality of a ypt3 temperature sensitive mutant (Cheng et al, MBoC 2002), its expression levels are non-native and we do not have a strain in which ypt3 is tagged at the 5’ end at the native genomic locus. For these reasons, we preferred to examine in detail the localization of Exo70. We do not think it complicates interpretations. Exo70 faithfully decorates vesicles and exhibits the same localization as Ypt3 in WT cells (see Figure 2D) and in myo52-AID (see Figure 3C-D). We realize that our text was a bit confusing as we opposed the localization of Exo70 and Ypt3, when all we wanted to state was that the Exo70-GFP signal is stronger. We have corrected this in the text.

      (5) Page 10, end of first paragraph, "We conclude...and promotes separation of Myo52 from the vesicles." This is an interesting hypothesis/interpretation that is consistent with the spatial-temporal organization of vesicles and the compacting fusion focus, but the underlying molecular mechanism has not be concluded.

      This is an interpretation that is in line with our data. Firm conclusion that the organization of the actin fusion focus imposes a steric barrier to bulk vesicle entry will require in vitro reconstitution of an actin aster driven by formin-myosin V feedback and addition of myosin V vesicle-like cargo, which can be a target for future studies. To make clear that it is an interpretation and not a definitive statement, we have added “likely” to the sentence, as in: “We conclude that the distal position of vesicles in WT cells is a likely steric consequence of the architecture of the fusion focus, which restricts space at the center of the actin aster and promotes separation of Myo52 from the vesicles”.

      (6) Figure 5F and 5G, the results are confusing and should be discussed further. Depletion of Myo52 decreases Fus1 long-range movements, indicating that Fus1 is being transported by Myo52 (5F). Similarly, the Fus1 actin assembly mutant greatly decreases Fus1 long-range movements and prevents Myo52 binding (5G), perhaps indicating that Fus1-mediated actin assembly is important. It seems the author's interpretations are oversimplified.

      We show that Myo52 is critical for Fus1 long-range movements, as stated by the reviewer. We also show that Fus1-mediated actin assembly is important. The question is in what way.

      One possibility is that FH2-mediated actin assembly powers the movement, which in this case represents the displacement of the formin due to actin monomer addition on the polymerizing filament. A second possibility is that actin filaments assembled by Fus1 somehow help Myo52 move Fus1. This could be for instance because Fus1-assembled actin filaments are preferred tracks for Myo52-mediated movements, or because they allow Myo52 to accumulate in the vicinity of Fus1, enhancing their chance encounter and thus the number of long-range movements (on any actin track). Based on the analysis of the K1112A point mutant in Fus1 FH2 domain, our data cannot discriminate between these three different options, which is why we concluded that the mutant allele does not allow us to make a firm conclusion. However, the Myo52-dependence clearly shows that a large fraction of the movements requires the myosin V. We have clarified the end of the paragraph in the following way: “Therefore, analysis of the K1112A mutant phenotype does not allow us to clearly distinguish between Fus1-powered from Myo52-powered movements. Future work will be required to test whether, in addition to myosin V-dependent transport, Fus1-mediated actin polymerization also directly contributes to Fus1 long-range movements.”

      (7) Figure 6, why not measure the fluorescence intensity of Fus1 as a proxy for the number of Fus1 molecules (rather than the width of the Fus1 signal), which seems to be the more straight-forward analysis?

      The aim of the measurement was to test whether Myo52 and Fus1 activity help focalize the formin at the fusion site, not whether these are required for localization in this region. This is why we are measuring the lateral spread of the signal (its width) rather than the fluorescence intensity of the signal. We know from previous work that Fus1 localizes to the shmoo tip independently of myosin V (Dudin et al, JCB 2015), and we also show this in Figure 6. However, the precise distribution of Fus1 is wider in absence of the myosins.

      We can and will measure intensities to test whether there is also a quantitative difference in the number of molecules at the shmoo tip.

      (8) Figure 7, the authors should note (and perhaps discuss) any evidence as to whether activation of Fus1 to facilitate actin assembly depends upon Fus1 dissociating from Myo52 or whether Fus1 can be activated while still associated with Myo52, as both circumstances are included in the figure.

      This is an interesting point. We have no experimental evidence for or against Fus1 dissociating from Myo52 to assemble actin. However, it is known that formins rotate along the actin filament double helix as they assemble it, a movement that seems poorly compatible with processive transport by myosin V. In Figure 7, we do not particularly want to imply that Myo52 associates with Fus1 linked or not with an actin filament. The figure serves to illustrate the focusing mechanism of myosin V transporting a formin, which is more evident when we draw the formin attached to a filament end. We have now added a sentence in the figure legend to clarify this point: “Note that it is unknown whether Myo52 transports Fus1 associated or not with an actin filament.”

      (9) Figure 7, the color of secretory vesicles should be the same in A and B.

      This is now corrected.

      Reviewer #2 (Significance (Required)):

      This is an impactful and high quality manuscript that describes an elegant experimental strategy with important insights determined. The experimental imaging strategy (and analysis), as well as the insight into the pombe mating fusion focus and its comparison to other cytoskeletal compaction events will be of broad scientific interest.

      We thank the reviewer for their appreciation of our work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Fission yeast cell-cell fusion during mating is mediated by an actin-based structure called the 'fusion focus', which orchestrates actin polymerization by the mating-specific formin, Fus1, to direct polarized secretion towards the mating site. In the current study, Thomas and colleagues quantitatively map the spatial distribution of proteins mediating cell-cell fusion using a three-color fluorescence imaging methodology in the fission yeast Schizosaccharomyces pombe. Using Myo52 (Type V myosin) as a fluorescence reference point, the authors discover that proteins known to localize to the fusion focus have distinct spatial distributions and accumulation profiles at the mating site. Myo52 and Fus1 form a complex in vivo detected by co-immunoprecipitation and each contribute to directing secretory vesicles to the fusion focus. Previous work from this group has shown that the intrinsically disordered region (IDR) of Fus1 plays a critical role in forming the fusion focus. Here, the authors swap out the IDR of fission yeast Fus1 for the IDR of an unrelated mammalian protein, coincidentally called 'fused in sarcoma' (FUS). They express the Fus1∆IDR-FUSLC-27R chimera in mitotically dividing fission yeast cells, where Fus1 is not normally expressed, and discover that the Fus1∆IDR-FUSLC-27R chimera can travel with Myo52 on actively polymerizing actin cables. Additionally, they show that acute loss of Myo52 or Fus1 function, using Auxin-Inducible Degradation (AID) tags and point mutations, impair the normal compaction of the fusion focus, suggesting that direct interaction and coordination of Fus1 and Myo52 helps shape this structure.

      Major Comments:

      (1) In the Results section for Figure 2, the authors claim that actin filaments become shorter and more cross-linked they move away from the fusion site during mating, and suggest that this may be due to the presence of Myo51. However, the evidence to support this claim is not made clear. Is it supported by high-resolution electron microscopy of the actin filaments, or some other results? This needs to be clarified.

      Sorry if our text was unclear. The basis for the claim that actin filaments become shorter comes from our observation that the average position of tropomyosin and Myo51, both of which decorate actin filaments, is progressively closer to both Fus1 and the plasma membrane. Thus, the actin structure protrudes less into the cytosol as fusion progresses. The basis for claiming that Myo51 promotes actin filament crosslinking comes mainly from previously published papers, which had shown that 1) Myo51 forms complexes with the Rng8 and Rng9 proteins (Wang et al, JCB 2014), and 2) the Myo51-Rng8/9 not only binds actin through Myo51 head domain but also binds tropomyosin-decorated actin through the Rng8/9 moiety (Tang et al, JCB 2016; reference 27 in our manuscript). We had also previously shown that these proteins are necessary for compaction of the fusion focus (Dudin et al, PLoS Genetics 2017; reference 28 in our manuscript). Except for measuring the width of Fus1 distribution in myo51∆ mutants, which confirms previous findings, we did not re-investigate here the function of Myo51.

      We have now re-written this paragraph to present the previous data more clearly: “The distal localization of Myo51 was mirrored by that of tropomyosin Cdc8, which decorates linear actin filaments (Figure 2B) (Hatano et al, 2022). The distal position of the bulk of Myo51-decorated actin filaments was confirmed using Airyscan super-resolution microscopy (Figure 2B, right). Thus, the average position of actin filaments and decreasing distance to Myo52 indicates they initially extend a few hundred nanometers into the cytosol and become progressively shorter as fusion proceeds. Previous work had shown that Myo51 cross-links and slides Cdc8-decorated actin filaments relative to each other (Tang et al, 2016) and that both proteins contribute to compaction of the fusion focus in the lateral dimension along the cell-cell contact area (perpendicular to the fusion axis) (Dudin et al, 2017). We confirmed this function by measuring the lateral distribution of Fus1 along the cell-cell contact area (perpendicular to the fusion axis), which was indeed wider in myo51∆ than WT cells (see below Figure 6A-B).”

      (2) In Figure 4, the authors comment that disrupting Fus1 results in more disperse Myo52 spatial distribution at the fusion focus, raising the possibility that Myo52 normally becomes focused by moving on the actin filaments assembled by Fus1. This can be tested by asking whether latrunculin treatment phenocopies the 'more dispersed' Myo52 localization seen in fus1∆ cells? If Myo52 is focused instead by its direct interaction with Fus1, the latrunculin treatment should not cause the same phenotype.

      This is in principle a good idea, though it is technically challenging because pharmacological treatment of cell pairs in fusion is difficult to do without disturbing pheromone gradients which are critical throughout the fusion process (see Dudin et al, Genes and Dev 2016). We will try the experiment but are unsure about the likelihood of technical success.

      We note however that a similar experiment was done previously on Fus1 overexpressed in mitotic cells (Billault-Chaumartin et al, Curr Biol 2022; Fig 1D). Here, Fus1 also forms a focus and latrunculin A treatment leads to Myo52 dispersion while keeping the Fus1 focus, which is in line with our proposal that Myo52 becomes focused by moving on Fus1-assembled actin filaments. Similarly, we showed in Figure 5B that Latrunculin A treatment of mitotic cells expressing Fus1∆IDR-FUSLC-27R also results in Myo52, but not Fus1 dispersion.

      (3) The Fus1∆IDR-FUSLC-27R chimera used in Figure 5 is an interesting construct to examine actin-based transport of formins in cells. I was curious if the authors could provide the rates of movement for Myo52 and for Fus1∆IDR-FUSLC-27R, both before and after acute depletion of Myo52. It would be interesting to see if loss of Myo52 alters the rate of movement, or instead the movement stems from formin-mediated actin polymerization.

      We will measure these rates.

      (4) Also, Myo52 is known to interact with the mitotic formin For3. Does For3 colocalize with Myo52 and Fus1∆IDR-FUSLC-27R along actin cables?

      This is an interesting question for which we do not have an answer. For technical reasons, we do not have the tools to co-image For3 with Fus1∆IDR-FUSLC-27R because both are tagged with GFP. We feel that this question goes beyond the scope of this paper.

      (5) If Fus1∆IDR-FUSLC-27R is active, does having ectopic formin activity in mitotic cells affect actin cable architecture? This could be assessed by comparing phalloidin staining for wildtype and Fus1∆IDR-FUSLC-27R cells.

      We are not sure what the purpose of this experiment is, or how informative it would be. If it is to evaluate whether Fus1∆IDR-FUSLC-27R is active, our current data already demonstrates this. Indeed, Fus1∆IDR-FUSLC-27R recruits Myo52 in a F-actin and FH2 domain-dependent manner (shown in Figure 5B and 5G), which demonstrates that Fus1∆IDR-FUSLC-27R FH2 domain is active. Even though Fus1∆IDR-FUSLC-27R assembles actin, we predict that its effect on general actin organization will be weak. Indeed, it is expressed under endogenous fus1 promoter, leading to very low expression levels during mitotic growth, such that only a subset of cells exhibit a Fus1 focus. Furthermore, most of these Fus1 foci are at or close to cell poles, where linear actin cables are assembled by For3, such that they may not have a strong disturbing effect. Because analysis of actin cable organization by phalloidin staining is difficult (due to the more strongly staining actin patches), cells with clear change in organization predicted to be rare in the population, and the gain in knowledge not transformative, we are not keen to do this experiment.

      Minor Comments:

      Prior studies are referenced appropriately. Text and figures are clear and accurate. My only suggestion would be Figure 1E-H could be moved to the supplemental material, due to their extremely technical nature. I believe this would help the broad audience focus on the experimental design mapped out in Figure 1A-D.

      We are relatively neutral about this. If this suggestion is supported by the Editor, we can move these panels to supplement.

      Reviewer #3 (Significance (Required)):

      Significance: This study provides an improved imaging method for detecting the spatial distributions of proteins below 100 nm, providing new insights about how a relatively small cellular structure is organized. The use of three-color cell imaging to accurately measure accumulation rates of molecular components of the fusion focus provides new insight into the development of this structure and its roles in mating. This method could be applied to other multi-protein structures found in different cell types. This work uses rigorously genetic tools such as knockout, knockdown and point mutants to dissect the roles of the formin Fus1 and Type V myosin Myo52 in creating a proper fusion focus. The study could be improved by biochemical assays to test whether Myo52 and Fus1 directly interact, since the interaction is only shown by co-immunoprecipitation from extracts, which may reflect an indirect interaction.

      Indeed, future studies should dissect the Fus1-Myo52 interaction, to determine whether it is direct and identify mutants that impair it.

      I believe this work advances the cell-mating field by providing others with a spatial and temporal map of conserved factors arriving to the mating site. Additionally, they identified a way to study a mating specific protein in mitotically dividing cells, offering future questions to address.

      This study should appeal to a range of basic scientists interested in cell biology, the cytoskeleton, and model organisms. The three-colored quantitative imaging could be applied to defining the architecture of many other cellular structures in different systems. Myosin and actin scientists will be interested in how this work expands the interplay of these two fields.

      I am a cell biologist with expertise in live cell imaging, genetics and biochemistry.

      We thank the reviewer for their appreciation of our work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Fission yeast cell-cell fusion during mating is mediated by an actin-based structure called the 'fusion focus', which orchestrates actin polymerization by the mating-specific formin, Fus1, to direct polarized secretion towards the mating site. In the current study, Thomas and colleagues quantitatively map the spatial distribution of proteins mediating cell-cell fusion using a three-color fluorescence imaging methodology in the fission yeast Schizosaccharomyces pombe. Using Myo52 (Type V myosin) as a fluorescence reference point, the authors discover that proteins known to localize to the fusion focus have distinct spatial distributions and accumulation profiles at the mating site. Myo52 and Fus1 form a complex in vivo detected by co-immunoprecipitation and each contribute to directing secretory vesicles to the fusion focus. Previous work from this group has shown that the intrinsically disordered region (IDR) of Fus1 plays a critical role in forming the fusion focus. Here, the authors swap out the IDR of fission yeast Fus1 for the IDR of an unrelated mammalian protein, coincidentally called 'fused in sarcoma' (FUS). They express the Fus1∆IDR-FUSLC-27R chimera in mitotically dividing fission yeast cells, where Fus1 is not normally expressed, and discover that the Fus1∆IDR-FUSLC-27R chimera can travel with Myo52 on actively polymerizing actin cables. Additionally, they show that acute loss of Myo52 or Fus1 function, using Auxin-Inducible Degradation (AID) tags and point mutations, impair the normal compaction of the fusion focus, suggesting that direct interaction and coordination of Fus1 and Myo52 helps shape this structure.

      Major Comments:

      • In the Results section for Figure 2, the authors claim that actin filaments become shorter and more cross-linked they move away from the fusion site during mating, and suggest that this may be due to the presence of Myo51. However, the evidence to support this claim is not made clear. Is it supported by high-resolution electron microscopy of the actin filaments, or some other results? This needs to be clarified.

      • In Figure 4, the authors comment that disrupting Fus1 results in more disperse Myo52 spatial distribution at the fusion focus, raising the possibility that Myo52 normally becomes focused by moving on the actin filaments assembled by Fus1. This can be tested by asking whether latrunculin treatment phenocopies the 'more dispersed' Myo52 localization seen in fus1∆ cells? If Myo52 is focused instead by its direct interaction with Fus1, the latrunculin treatment should not cause the same phenotype.

      • The Fus1∆IDR-FUSLC-27R chimera used in Figure 5 is an interesting construct to examine actin-based transport of formins in cells. I was curious if the authors could provide the rates of movement for Myo52 and for Fus1∆IDR-FUSLC-27R, both before and after acute depletion of Myo52. It would be interesting to see if loss of Myo52 alters the rate of movement, or instead the movement stems from formin-mediated actin polymerization.

      • Also, Myo52 is known to interact with the mitotic formin For3. Does For3 colocalize with Myo52 and Fus1∆IDR-FUSLC-27R along actin cables?

      • If Fus1∆IDR-FUSLC-27R is active, does having ectopic formin activity in mitotic cells affect actin cable architecture? This could be assessed by comparing phalloidin staining for wildtype and Fus1∆IDR-FUSLC-27R cells.

      Minor Comments:

      • Prior studies are referenced appropriately.

      • Text and figures are clear and accurate. My only suggestion would be Figure 1E-H could be moved to the supplemental material, due to their extremely technical nature. I believe this would help the broad audience focus on the experimental design mapped out in Figure 1A-D.

      Significance

      Significance: This study provides an improved imaging method for detecting the spatial distributions of proteins below 100 nm, providing new insights about how a relatively small cellular structure is organized. The use of three-color cell imaging to accurately measure accumulation rates of molecular components of the fusion focus provides new insight into the development of this structure and its roles in mating. This method could be applied to other multi-protein structures found in different cell types. This work uses rigorously genetic tools such as knockout, knockdown and point mutants to dissect the roles of the formin Fus1 and Type V myosin Myo52 in creating a proper fusion focus. The study could be improved by biochemical assays to test whether Myo52 and Fus1 directly interact, since the interaction is only shown by co-immunoprecipitation from extracts, which may reflect an indirect interaction.

      I believe this work advances the cell-mating field by providing others with a spatial and temporal map of conserved factors arriving to the mating site. Additionally, they identified a way to study a mating specific protein in mitotically dividing cells, offering future questions to address.

      This study should appeal to a range of basic scientists interested in cell biology, the cytoskeleton, and model organisms. The three-colored quantitative imaging could be applied to defining the architecture of many other cellular structures in different systems. Myosin and actin scientists will be interested in how this work expands the interplay of these two fields.

      I am a cell biologist with expertise in live cell imaging, genetics and biochemistry.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      A three-color imaging approach to use centroid tracking is employed to determine the high resolution position over time of tagged actin fusion focus proteins during mating in fission yeast. In particular, the position of different protein components (tagged in a 3rd color) were determined in relation to the position (and axis) of the molecular motor Myo52, which is tagged with two different colors in the mating cells. Furthermore, time is normalized by the rapid diffusion of a weak fluorescent protein probe (mRaspberry) from one cell to the other upon fusion pore opening. From this approach multiple important mechanistic insights were determined for the compaction of fusion focus proteins during mating, including the general compaction of different components as fusion proceeds with different proteins having specific stereotypical behaviors that indicate underlying molecular insights. For example, secretory vesicles remain a constant distance from the plasma membrane, whereas the formin Fus1 rapidly accumulates at the fusion focus in a Myo52-dependent manner.

      I have minor suggestions/points:

      (1) Figure 1, for clarity it would be helpful if the cells shown in B were in the same orientation as the cartoon cells shown in A. Similarly, it would be helpful to have the orientation shown in D the same as the data that is subsequently presented in the rest of the manuscript (such as Figure 2) where time is on the X axis and distance (position) is on the Y axis.

      (2) Figure 2, for clarity useful to introduce how the position of Myo52 changes over time with respect to the fusion site (plasma membrane) earlier, and then come back to the positions of different proteins with respect to Myo52 shown in 2E. Currently the authors discuss this point after introducing Figure 2E, but better for the reader to have this in mind beforehand.

      (3) First sentence of page 8 "..., peaked at fusion time and sharply dropped post-fusion (Figure S3)." Figure S3 should be cited so that the reader knows where this data is presented.

      (4) Figure 3D-H, why is Exo70 used as a marker for vesicles instead of Ypt3 for these experiments? Exo70 seems to have a more confusing localization than Ypt3 (3C vs 3D), which seems to complicate interpretations.

      (5) Page 10, end of first paragraph, "We conclude...and promotes separation of Myo52 from the vesicles." This is an interesting hypothesis/interpretation that is consistent with the spatial-temporal organization of vesicles and the compacting fusion focus, but the underlying molecular mechanism has not be concluded.

      (6) Figure 5F and 5G, the results are confusing and should be discussed further. Depletion of Myo52 decreases Fus1 long-range movements, indicating that Fus1 is being transported by Myo52 (5F). Similarly, the Fus1 actin assembly mutant greatly decreases Fus1 long-range movements and prevents Myo52 binding (5G), perhaps indicating that Fus1-mediated actin assembly is important. It seems the author's interpretations are oversimplified.

      (7) Figure 6, why not measure the fluorescence intensity of Fus1 as a proxy for the number of Fus1 molecules (rather than the width of the Fus1 signal), which seems to be the more straight-forward analysis?

      (8) Figure 7, the authors should note (and perhaps discuss) any evidence as to whether activation of Fus1 to facilitate actin assembly depends upon Fus1 dissociating from Myo52 or whether Fus1 can be activated while still associated with Myo52, as both circumstances are included in the figure.

      (9) Figure 7, the color of secretory vesicles should be the same in A and B.

      Significance

      This is an impactful and high quality manuscript that describes an elegant experimental strategy with important insights determined. The experimental imaging strategy (and analysis), as well as the insight into the pombe mating fusion focus and its comparison to other cytoskeletal compaction events will be of broad scientific nterest.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      • In this article, Thomas et al. use a super-resolution approach in living cells to track proteins involved in the fusion event of sexual reproduction. They study the spatial organization and dynamics of the actin fusion focus, a key structure in cell-cell fusion in Schizosaccharomyces pombe. The researchers have adapted a high-precision centroid mapping method using three-color live-cell epifluorescence imaging to map the dynamic architecture of the fusion focus during yeast mating. The approach relies on tracking the centroid of fluorescence signals for proteins of interest, spatially referenced to Myo52-mScarlet-I (as a robust marker) and temporally referenced using a weakly fluorescent cytosolic protein (mRaspberry), which redistributes strongly upon fusion. The trajectories of five key proteins, including markers of polarity, cytoskeleton, exocytosis and membrane fusion, were compared to Myo52 over a 75-minute window spanning fusion. Their observations indicate that secretory vesicles maintain a constant distance from the plasma membrane whereas the actin network compacts. Most importantly, they discovered a positive feedback mechanism in which myosin V (Myo52) transports Fus1 formin along pre-existing actin filaments, thereby enhancing aster compaction.

      • This article is well written, the arguments are convincing and the assertions are balanced. The centroid tracking method has been clearly and solidly controlled. Overall, this is a solid addition to our understanding of cytoskeletal organization in cell fusion. Major comments: No major comment.

      Minor comments:

      • Page 8 authors wrote "Upon depletion of Myo52, Ypt3 did not accumulate at the fusion focus (Figure 3C). A thin, wide localization at the fusion site was occasionally observed (Figure 3C, Movies S3)" : Is there a quantification of this accumulation in the mutant?

      • The framerate of movies could be improved for reader comfort: For example, movie S6 lasts 0.5 sec.

      Significance

      This study represents a conceptual and technical breakthrough in our understanding of cytoskeletal organization during cell-cell fusion. The authors introduce a high-precision, three-color live-cell centroid mapping method capable of resolving the spatio-temporal dynamics of protein complexes at the nanometer scale in living yeast cells. This methodological innovation enables systematic and quantitative mapping of the dynamic architecture of proteins at the cell fusion site, making it a powerful live-cell imaging approach. However, it is important to keep in mind that the increased precision achieved through averaging comes at the expense of overlooking atypical or outlier behaviors. The authors discovered a myosin V-dependent mechanism for the recruitment of formin that leads to actin aster compaction. The identification of Myo52 (myosin V) as a transporter of Fus1 (formin) to the fusion focus adds a new layer to our understanding of how polarized actin structures are generated and maintained during developmentally regulated processes such as mating.

      Previous studies have shown the importance of formins and myosins during fusion, but this paper provides a quantitative and dynamic mapping that demonstrates how Myo52 modulates Fus1 positioning in living cells. This provides a better understanding of actin organization, beyond what has been demonstrated by fixed-cell imaging or genetic perturbation.

      Audience: Cell biologists working on actin dynamics, cell-cell fusion and intracellular transport. Scientists involved in live-cell imaging, single particle tracking and cytoskeleton modeling.

      I have expertise in live-cell microscopy, image analysis, fungal growth machinery and actin organization.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note to all Reviewers:

      We would like to thank all the reviewers for their time and insightful feedback. In response to the comments and points raised, we have performed major revisions to our manuscript. We have expanded our analysis on the role of TP53 loss of function in BM activation (Figure 3), investigating human LUAD datasets as well as murine LUAD models. We show that TP53 pathway is significantly negatively correlated with BM, and that loss of TP53 leads to the acquisition of the basal-like phenotype regardless of the type of driver oncogene present (KRAS/EGFR). Furthermore, we added a new figure (Figure 7), where we demonstrate that type I interferon can promote BM activation in LUAD harboring TP53 mutations but not in those with wild type TP53. With this, we propose a mechanism of action of how a subset of LUAD tumors (TP53-mut) upregulate BM, become more aggressive and resistant to therapies.

      Finally, we have made the manuscript clearer and transparent by improving the presentation of plots, as well as including source data files and Rmarkdown files for reproducibility.

      Reviewer 1:

      Major comments

      R1-Comment 1: The authors did not submit with the manuscript all the results that they have obtained from their analysis, on which they based their claims. I suggest that the authors submit a SourceData file in Excel format. This file should contain the values and the relevant information for each of the plots presented in the main and supplementary figures. For example, in case of box plots, the five-number summary should be provided. Further, the p-values and the test used for their calculations should be also mentioned. The file could be organized in a way that the data and relevant information for each figure panel are presented in separated data sheets in order that the reader can easily navigate through the file and find the information for each figure panel fast. Similarly, the authors should provide access to the scripts that they have developed or adapted from published scripts to perform the analysis of the datasets and obtain the results presented in the manuscript. The access to the scripts used in the manuscript is important to reproduce analysis. The scripts can be deposited at github, for example.

      Reply: We thank the reviewer for their advice in making the presentation of our results and methods more transparent and reproducible. We have now provided the source data file (supplementary file 2), which contains relevant data for each figure. We have also uploaded Rmarkdown files to github and R Markdown HTML reports are compiled in Supplementary File 3, this shows how the analyses were performed and how each figure generated. All datasets required to reproduce the analyses and figures have also been added to Zenodo (10.5281/zenodo.16964654) and will be published when the article is in press.

      R1-Comment 2: The results and their interpretations are mainly done based on in silico analysis from publicly available transcriptomic datasets. The confirmation of the results obtained by the in silico analysis is limited to the last figure, in which the authors show results obtained by multiplexed immunohistochemistry and histo-cytometry of tissue microarrays from a curated cohort of FFPE samples. The relevance of the results obtained by the in silico analysis may increase if the authors could present results either in a conditional (lung-specific) Kras mutant mouse model, or patient-derived xenograft (PDX) mouse model of lung cancer. The PDX mouse model will be more suitable in case that access to genetically modified mouse models is not given and/or the time for the experiments is limited. In both cases, the hyperactivation of the small GTPase KRAS should expand the BM gene expression signature in the mouse lung in a Sox9-dependent manner, thereby leading to lung tumors. Further, Sox9 loss-of-function experiments should reduce the BM gene expression signature and favor the ALV gene expression signature. These results would strongly support the interpretation of the in silico results by the authors in the present manuscript, and would significantly increase the impact of the manuscript in the scientific field of lung cancer.

      Reply: We thank the reviewer for their insightful feedback on how to improve the impact of our study through further functional validation of in silico findings. To address this comment, we have performed additional analyses, including data and experiments from both murine and human LUAD model systems to elucidate a novel mechanism of BM activation in LUAD. We appreciate the reviewer’s suggestion to pursue analysis of Sox9 involvement in regulating BM activation and agree that both KRAS and SOX9 activation are likely to be involved in at least some elements of the process of disease progression we described in this manuscript. Indeed, previous studies have completed the experiments suggested, demonstrating Sox9 knock-out reduced Kras driven tumour progression and morphological grade in vivo (PMID: 37258742 and 34021911); and was associated with loss of AT2 lineage identity (PMID: 37468622).

      Our analysis of human LUAD using scRNA-seq data has demonstrated that this differentiation spectrum in fact extends beyond loss of lineage fidelity and in a subset of cells leads to transdifferentiation to a basal-like cell state. In our revised manuscript, we have more clearly elucidated the role of KRAS and TP53 in these two events during LUAD progression, demonstrating that while oncogenic KRAS (and likely downstream SOX9 activation) can lead to the loss of lineage commitment in LUAD cells, mutations in TP53 are required for acquisition of the basal-like phenotype. We have also expanded on this mechanism identifying a novel role for type-1 interferon signaling in the presence of TP53 loss-of-function as a mechanism that can lead to BM activation and acquisition of a basal-like cell state in LUAD. The data related to these analyses are now presented in figures 3 and 7.

      In accordance with the 3Rs principles for ethical use of animals in research we have taken advantage of publicly available data from previous experiments analyzing conditional (lung-specific) Kras mutant mouse model to validate our in-silico findings. This confirmed our in silico analysis of human LUAD, demonstrating an important role for TP53 loss of function in regulating BM activation (presented in Figure 3E&F and Figure S3F&G)

      We also showed that the type I interferon signaling is capable of driving BM activation in LUAD but only in the context of TP53 loss-of-function. These experiments were performed using 3D organotypic cultures of H441 cells (human adenocarcinoma cell line with mutant TP53) and A549 cells (human adenocarcinoma cell line with wild-type TP53). These 3D cultures were treated with IFN-alpha, both BM and basal-like marker upregulation (MKI67, CDC20, TOP2A, S100A9, S100A2, SOX9 and KRT17) was observed only in LUAD cells carrying a mutation in TP53. These data are now presented in Figure S7D.

      R1-Comment 3: In general, the description of the results in the corresponding section of the manuscript can be improved to facilitate the understanding of the results presented.

      As an example, the figure 1B is described on page 13 as follows: "...we first used a publicly available microarray dataset [9] to identify genes differentially expressed between epithelial cells engaged in BM (embryonic day 14 [E14]) or ALV (embryonic day 19 [E19]) (Figure 1B and TABLE S1)." By looking at the plot in figure 1B, this description is not sufficient to understand what the authors present in this figure panel, not even after reading the corresponding figure legend.

      Reply: We thank the reviewer for their advice on making our manuscript clearer. Throughout the manuscript we have now edited the result descriptions, we have also provided further detail to the methods sections, figure legends and axes labels to enhance clarity and facilitate understanding of the analyses performed.

      In the example cited we have edited the sections referenced above as follows:

      “To test this hypothesis, we identified genes that were differentially expressed in epithelial cells engaged in active BM (corresponding to embryonic day 14) vs active ALV (corresponding to embryonic day 19), using a publicly available microarray dataset.”

      We have also changed the Y axis label of Figure 1B to: “log2(FC E19 [ALV] – E14[BM])”.

      The description in the figure legend has also been modified to provide more context: “Dot plot showing the identification of genes differentially expressed by epithelial cells during murine developmental-BM (embryonic day 14) and ALV (embryonic day 19) [1]. Genes with the highest Fold Change of expression between day 14 (BM) and 19 (ALV) of murine lung development are coloured green or red, respectively. These genes were used to generate ALV/BM signatures [9]

      R1-Comment 5: Another example is the description of the figure 3B on page 16: "This showed low levels of BM activation in tumour cells from residual disease (RD) that was significantly increased in samples with recurrent progressive disease (PD) (Figure 3B)." By looking figure 3B and the corresponding figure legend, one cannot find the group "residual disease (RD)".

      __Reply: __We thank the reviewer for their diligent reading and have now corrected the figures to provide clearer labelling of axes and maintain consistency throughout. In the example cited, we have corrected the axis label to Residual disease (RD) and partial response (PR).

      R1-Comment 6: Another example is the description of the figure 3C and 3D on page 16: "Single-cell analysis showed that both ALV-BM- and ALV-BM+ LUAD cells were increased in samples from recurrent progressive disease (Figure 3C,D)." By looking at figure 3D and the corresponding legend, I do not find the explanation of "TRUE" and "FALSE". The same is for figures 3J and 3M.

      Reply: For this example (Figure 3 in the original manuscript is now figure 4), TRUE/FALSE labels have been replaced by PR (partial response) and PD (progressive disease) in panel D; replaced by “Responder (R)” or “Non-responder (NR)” in panels J&M.

      R1-Comment 7: Other figure panels were also poorly described in the results section and in the corresponding legends. Further, the presentation of the results in the main and supplementary figures has to be improved. For example, labeling of the Y-axis in the figures 1H to 1J, 2C, 2D, 2G, 2H, 3B, 3C, 3J, 3L, etc. has to be improved. As a point of reference, I would suggest checking how other authors present similar results in life science journals. These deficiencies in the presentation and description of the results make it difficult for the readers to understand the manuscript.

      Reply: These axes labels have been changed throughout to provide more information. “BM” changed to “BM (ssGSEA score)” or “BM (module score)” and “ALV” changed to “ALV (ssGSEA score)” or “ALV (module score)” for figures 1H, 1I, 1J, S1H-L, 2C, 2D, 3B, 3F, S3E, S3F, S3G, 4B, 4C, 4J, 4L, S4C, S4D, S5A, 6A, S6A, S6B, ssGSEA score was applied to bulk RNAseq samples, and modules scores were calculated for single cells.

      Additionally:

      S2A, S2B – OS label changed to Survival probability/OS probability.

      S4H – y axis label changed to PDL1 (RPPA).

      S3B – y axis label changed to “Tumour mutational burden (mut/mB).

      S3C – y axis label changed to “Tobacco smoking (SBS mutational signature)”.

      4F – y axis label changed to “DFS (proportion)”.

      4H – y axis label changed to “PFS (proportion)”.

      R1-R1-Comment 8: The authors write on page 18 "Despite AT2 cells being well described as the cell of origin for LUAD, this population was significantly less abundant in LUAD samples compared to control, demonstrating a high degree of transcriptomic plasticity within LUAD epithelium (Figure 4D)." How can the authors show that these results are not produced by the process of integration of the four scRNA-seq NSCLC datasets, the implementation of a specific machine learning classifier for the cell type-classification, or the manually filtration to exclude doublets? For example, will the authors achieve the same (or similar) results using a different machine learning classifier? If yes, please include the results in the manuscript.

      Reply: The integration was performed using the method described by Stuart et al. (PMID: 31178118), implemented in the Seurat package. The term “machine learning classifier” has now been replaced by “label transfer” to clarify the method used and avoid confusion. Label transfer was only used to identify major cell types in the datasets used, i.e. the whole epithelial population. Doublet removal was performed as follows (and described in the methods section): epithelial cells were clustered using the shared nearest neighbor (SNN) modularity optimization algorithm implemented by the FindClusters function in the Seurat R package, based on 30 principal components and setting the resolution parameter to 0.1. This clustering solution identified multiple small clusters with divergent expression profiles to the majority of cells that were initially classified as epithelial (in the label transfer analysis). Manual examination of the marker genes for these small clusters showed they were characterized by expression of epithelial genes alongside canonical markers for either B cells (CD79A), macrophages (CD68, SPP1, APOE, CD14, MARCO) or Tcells/NK cells (CD3D, NKG7, CXCR4). These cells were therefore classed as heterotypic doublets and excluded from further analysis. All other cell types from the integrated datasets were analyzed in the same way, and no further epithelial clusters (that were not small clusters of doublets) were identified.

      Further clustering to identify epithelial subpopulations was performed on the integrated dataset and the results presented from this analysis represent the clustering solution that ensures all subpopulations were identified across datasets to mitigate any potential batch-effects not resolved by the integration process. Furthermore, our results showing that LUAD cells exhibit a high degree of transcriptomic plasticity were also confirmed by the lineage fidelity analysis (Figure 5G&I), which demonstrates this observation is not dependent on a single clustering, integration or machine learning algorithm. This observation is also supported by other studies that have described loss of lineage commitment during LUAD tumorigenesis, where tumour cells become transcriptionally and phenotypically distinct from healthy AT2 cells.

      Reviewer 1:

      Minor comments:

      __R1-Comment 9: __Please introduce the abbreviation for alveogenesis the first time that is used in the abstract, as it was done for branching morphogenesis.

      __Reply: __Abbreviation for alveogenesis has now been added to the abstract.

      R1-Comment 10: On page 18 the author write: "Consistent with the analyses presented above, pseudo bulk expression profiles for each sample showed that ALV and BM scores were significantly negatively correlated (r = -0.68, p = 4.1e-09)." Where are these results shown? I was not able to find these results. If they are not in the current version of the manuscript, please include the results

      Reply: Scatter plot showing the negative correlation has now been added as Figure S5A.

      __R1-Comment 11: __The authors should submit a supplementary table containing a list of the different data sets that were used for this manuscript. The table should include accession numbers and links to the different depositories, in which the data sets can be found. This will improve the overview of the datasets used in the study, as well as facilitate the finding of the datasets by the readers.

      Reply: The list of all datasets used in this study, together with accession numbers and links are now in Supplementary file1.

      R1-Comment 12: In figure 1G, change the color for FALSE in the legend.

      Reply: Color for FALSE changed in Figure 1G and Figure S1E.

      R1-Comment 13: Provide the complete list of mutated genes for Figure S2C.

      Reply: Figure S2C has been replaced by figures 3C (top mutated genes in LUAD-BM) and S3A (top mutated genes in LUAD-ALV).

      Reviewer 1 (Significance (required)):

      __R1-Comment 14: __Conceptually, Bienkowska KJ et al. propose that LUAD tumors undergo reversion from an alveogenic to branching morphogenic phenotype during disease progression, generating inflamed or basal-like cell states that are variably persistent following TKI or ICB treatments. This concept is in line with reports using murine models of Kras-driven LUAD. In addition, there are parallels with findings in idiopathic pulmonary fibrosis (IPF, another hyperproliferative lung disease), in which KRT5-/KRT17+ basaloid cells were transiently found, like the basal-like phenotype that Bienkowska KJ identified in human LUAD. In other words, the concept proposed by the authors is novel and in line with previous publication in LC and IPF.

      Response: We are glad the reviewer found our results novel and appreciated how they provide a linkage of previously defined mechanisms seen in murine developmental models to human cancer progression, and how they may be relevant for other diseases such as IPF.

      __R1-Comment 15: __The in silico analysis of publicly available transcriptomic datasets presented by Bienkowska KJ et al. is original and comprehensive. It is an interesting contribution to the cancer research field. However, the impact of their findings to this scientific field will significantly increase if the authors could confirm the interpretation of their results using other experimental systems in addition to the one used in the las figure. For example, the experiments that I suggested in point 2., using either conditional Kras transgenic mice or a PDX mouse model for lung cancer will not only confirm the concpet proposed by the authors, but it will also provide further mechanistic insides related to this model at cellular and molecular level.

      Response: We thank the reviewer for describing our analysis as original and comprehensive and their suggestion to develop the manuscript further with additional mechanistic analyses. We have comprehensively examined the mechanisms responsible for regulating BM activation using a combination of in vivo models and 3D organotypic cultures, elucidating a novel role for type-1 interferon signaling in the presence of TP53 loss-of-function as a mechanism that can lead to BM activation and acquisition of a basal-like cell state in LUAD. For further information regarding these additions to the revised manuscript, we direct the reviewer to the response provided to R1-comment 2 (above).

      __R1-Comment 16: __Overall, the manuscript by Bienkowska KJ et al. addresses topics that are relevant to the field of lung cancer, the leading cause of cancer-related deaths worldwide. The bioinformatic methods implemented are cutting-edge. However, the text of the manuscript and the presentation of the results in the figures have to be improved to better exploit the potential of their findings. In addition, further experiments should be performed to confirm (and perhaps complement) the interpretation of their findings. I hope that my comments support the authors to improve the manuscript to reach the standard of manuscripts recently published at renowned journals in Review COMMONS. I recommend a major revision of the manuscript before publication.

      __Reply: __We are pleased to read that the reviewer found the methods implemented by us to be cutting-edge, and that they recognized the relevance of this topic to the lung cancer field.

      We thank the reviewer for their comments, which have helped us to significantly improve our manuscript.

      We have made changes to how we present our data (as described in responses above) and performed further analyses to support our original findings. We have also now performed further in silico and functional analyses to expand and complement our original findings.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      R2-Comment 1: __The study is novel and interesting, but the mechanisms how the dysregulation of developmental program was driven by specific oncogene and how to link these signatures to therapy were also not clear. __

      __Reply: __We are pleased that the reviewer finds our study to be novel and interesting. We appreciate the reviewer pointing out the need to clarify the role of specific oncogenes to BM activation and response to therapies.

      We have now added further analyses and edited the text to examine and explain how the ALV and BM signatures are driven by different oncogenes (Figure 3 and results section “TP53 loss of function is required for BM activation”), which showed that common oncogenic drives (e.g. KRAS and EGFR) can drive reduced ALV signature expression but TP53 mutations (or deletion in murine models) was critical for driving BM activation. Implications for therapy response are shown in (Figure 4). We have shown that BM activation is a key determinant of tyrosine kinase inhibitor (TKI) resistance in LUAD, representing a frequently activated off-target mechanism of resistance that supersedes the presence of an actionable oncogenic driver in terms of response rates; and that the BM signature also identified patients that, although positive for immune checkpoint blockade (ICB) response biomarkers, will likely fail to respond to this treatment. In the manuscript we have now thoroughly revised these sections of the results to clarify the details associated with these conclusions (results sections: “BM activation is associated with targeted-therapy resistance in lung adenocarcinomas” and “BM activation predicts poor response to immune-checkpoint blockade”).

      We have also added further data to the manuscript elucidating the molecular mechanisms regulating BM activation (Figure 7), which has identified an important role for aberrant type-I interferon signaling in the context of mutant TP53.

      Reviewer #2 (Significance (Required)):

      __R2-Comment 2: __The authors in this manuscript aimed to examine the role of developmental programmes, alveogenesis and branching morphogenesis (BM), in regulating phenotypic diversity in NSCLC. They demonstrated that developmental programmes (ALV and BM) frequently become

      dysregulated in NSCLC, with BM activation identifying aggressive LUAD that were resistant

      to multiple therapies, including TKIs and ICB. They found that BM activation in LUAD was associated with TP53 pathway mutations and required AT2 cells to lose their alveolar identity, acquiring a basallike state. The study is very intriguing, and the findings may pave a link to the disease progression and therapy resistance in LUAD.

      __Response: __We are pleased the reviewer found the study intriguing and with the potential to better understand LUAD progression and resistance to therapies.

      __R2-Comment 3: __The current results presented, although comprehensively presented, is still many an association study, the mechanisms how these dysregulations of developmental programmes driven by the driver oncogenes or carcinogens are still unknown.

      Response: We thank the reviewer for challenging us to further examine the molecular mechanisms underpinning our initial observations. As described above (see response to Reviewer #1 comment 2), we have performed additional in silico and mechanistic experimental analyses, which identified a novel role for type-I IFN signaling and TP53 loss-of-function in the activation of the BM program in LUAD. We hope these additions have enhanced the significance of the manuscript presented.

      __R2-Comment 4: __The NSCLC is a heterogeneous disease, LUAD and LUSC are two different diseases in terms of oncogenesis, driver mutations and response to treatment. The manuscript may either just focused on LUAD or describe results carefully to include both LUAD and LUSC. For example, in the result of abstract, only LUAD was described, there was no mention of LUSC.

      __Response: __We agree with the reviewer that NSCLC is a heterogeneous and complex disease. Indeed, this was in part what motivated us to investigate the role played by developmental processes in these distinct oncogenic processes. Our analyses showed that LUSC tumors were generally high for the BM signature (Figure 1I), which likely contributed to why this signature did not stratify survival rates for LUSC (Figure S2B). As a result, we opted to focus on LUAD as we found that BM activation was predictive of disease progression and survival in this NSCLC subtype. However, we did not completely remove LUSC from our manuscript to examine the degree to which LUAD tumors upregulating BM become “LUSC-like” and evaluate whether histological transformations occurred in LUAD cases with BM activation (as described in Figure 5 and the “BM activation in LUAD is associated with a basal-like phenotype” results section).

      We have also now added a description of results from both LUAD and LUSC analyses to the abstract to clarify these points.

      __R2-Comment 5: __The most common driver mutation of LUAD was EGFR, the authors also try to link the BM activation link to TKI resistance. I assumed that the TKIs most of the patients used were EGFR TKI, but the study did not examine the role of EGFR in the dysregulation of developmental programmes.

      __Response: __We would like to thank the reviewer for highlighting an important aspect of how our work fits with current clinical practice in LUAD management. Our analyses were carried out over multiple cohorts that include different patient demographics, which have varied prevalence for specific oncogenic driver mutations (with EGFR mutations typically being more prevalent in Asian cohorts and KRAS mutations generally being the most common oncogenic driver in Western cohorts). To examine these two common oncogenic drivers impact on BM activation, we now include a direct analysis of BM level in cases harboring these mutations (Figure S3D-E). This showed that that irrespective of oncogenic driver mutations TP53 loss of function was associated with increased BM. Our new analysis of KRAS driven mouse models has also showed that KRAS activation is sufficient to induce reduced expression of the ALV signature but failed to elicit increased BM activation. Given our analysis of human tumours showed that EGFR mutant LUAD cases with wild-type TP53 had low levels of BM activation (Figure S3D), we have no reason to suspect that EGFR mutations alone would be sufficient to elicit BM activation.

      __R2-Comment 6: __The TKI resistance was very complicated, not just EGFR T790M, the results and discussion regarding the activation of BM and TKI resistance seems not adequate. The mouse model used by Dr. Chang was mainly KRAS driven mouse lung cancer model (mice carrying RosatdT, Sox2EGFP, ShhCre, Sox9CKO, Fgfr2CKO, RosamTmG, Sox9CreER, Nkx2.1CreER, and KrasLSL-G12D alleles). It is not clear whether the EGFR driven (the most common driver of LUAD) mouse model also has same genetic signature. At least, the authors should describe or discuss these discrepancies.

      __Response: __We thank the reviewer for their comments and advice on making our manuscript clearer. We have now revised the description of BM activation and TKI resistance in the results section (titled “BM activation is associated with targeted-therapy resistance in lung adenocarcinomas”).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study is novel and interesting, but the mechanisms how the dysregulation of developmental program was driven by specific oncogene and how to link these signatures to therapy were also not clear.

      Significance

      The authors in this manuscript aimed to examine the role of developmental programmes, alveogenesis and branching morphogenesis (BM), in regulating phenotypic diversity in NSCLC. They demonstrated that developmental programmes (ALV and BM) frequently become dysregulated in NSCLC, with BM activation identifying aggressive LUAD that were resistant to multiple therapies, including TKIs and ICB. They found that BM activation in LUAD was associated with TP53 pathway mutations and required AT2 cells to lose their alveolar identity, acquiring a basallike state.

      The study is very intriguing and the findings may pave a link to the disease progression and therapy resistance in LUAD. The current results presented, although comprehensively presented, is still many an association study, the mechanisms how these dysregulations of developmental programmes driven by the driver oncogenes or carcinogens are still unknown. The NSCLC is a heterogeneous disease, LUAD and LUSC are two different diseases in terms of oncogenesis, driver mutations and response to treatment. The manuscript may either just focused on LUAD or describe results carefully to include both LUAD and LUSC. For example, in the result of abstract, only LUAD was described, there was no mention of LUSC. The most common driver mutation of LUAD was EGFR, the authors also try to link the BM activation link to TKI resistance. I assumed that the TKIs most of the patients used were EGFR TKI, but the study did not examine the role of EGFR in the dysregulation of developmental programmes. The TKI resistance was very complicated, not just EGFR T790M, the results and discussion regarding the activation of BM and TKI resistance seems not adequate. The mouse model used by Dr. Chang was mainly KRAS driven mouse lung cancer model (mice carrying RosatdT, Sox2EGFP, ShhCre, Sox9CKO, Fgfr2CKO, RosamTmG, Sox9CreER, Nkx2.1CreER, and KrasLSL-G12D alleles). It is not clear whether the EGFR driven (the most common driver of LUAD) mouse model also has same genetic signature. At least, the authors should describe or discuss these discrepancies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Bienkowska KJ et al show in this manuscript a compilation of bioinformatic analysis of publicly available microarray datasets, bulk RNA sequencing (RNA-seq) datasets and single cell RNA sequencing (scRNA-seq) datasets. One transcriptomic data set from mouse (Chang DR et al., Proc Natl Acad Sci USA, 2013) was analyzed in this manuscript to determine the gene expression signatures specific for the developmental processes alveogenesis (ALV) and branching morphogenesis (BM). The rest of the transcriptomic data sets that were analyzed for this manuscript were selected based on different parameters including the involvement of non-small cell lung cancer (NSCLC), lug adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC) and various cohorts with different characteristics related to lung cancer (LC), such as mutations related to LC (in EGFR, ALK, BRAF, ROS1 and KRAS), and/or treatments/resistance of LC patients with/to tyrosine kinase inhibitors (TKI), anti-PD1 strategies, immune checkpoint blockade, (ICB) among others. In the last figure, the authors present results obtained by multiplexed immunohistochemistry and histo-cytometry of tissue microarrays from a curated cohort of archival formalin-fixed paraffin-embedded (FFPE) samples to confirm their interpretation of the results obtained by the transcriptomic analysis.

      The findings and/or claims of the authors could be summarized in the following bullet points:

      • After defining the ALV and BM gene expression signatures, the authors determined the expression of these signatures in bulk RNA-seq data sets from The Cancer Genome Atlas (TCGA). The ALV signature was significantly downregulated in LUAD and LUSC tumour subtypes compared to control samples, whereas BM was upregulated. These findings were validated using a Laser Capture Micro-Dissected (LCMD) microarray dataset (Lin J et al., AM J Pathol, 2014) comparing the epithelial compartment from non-tumour alveoli, non-tumour bronchi, LUAD and LUSC. Interestingly, ALV was suppressed and BM was activated in all LUSC tumours analyzed; whereas in LUAD tumours were heterogeneous for BM activation with some cases exhibiting ALV expression comparable to control tissue. In summary, the mutually antagonistic regulation of ALV and BM was found to account for a significant proportion of transcriptomic variance in human NSCLC bulk tissue datasets.
      • BM activation was associated with poor overall survival rates in five independent LUAD cohorts (p=2.04e-13); and was significantly prognostic for resistance to TKIs (p=0.003) and ICBs (p=0.014), in pre-treatment biopsies.
      • ScRNA-seq analysis revealed that malignant LUAD cells with loss of alveolar lineage fidelity predominantly acquired inflamed or basal-like cellular states, which were variably persistent in samples from TKI and ICB recurrence.
      • The authors conclude from their analysis that LUAD tumours undergo reversion from an alveogenic to branching morphogenic phenotype during disease progression, generating inflamed or basal-like cell states that are variably persistent following TKI or ICB treatments.

      Major comments:

      1. The authors did not submit with the manuscript all the results that they have obtained from their analysis, on which they based their claims. I suggest that the authors submit a SourceData file in Excel format. This file should contain the values and the relevant information for each of the plots presented in the main and supplementary figures. For example, in case of box plots, the five-number summary should be provided. Further, the p-values and the test used for their calculations should be also mentioned.

      The file could be organized in a way that the data and relevant information for each figure panel are presented in separated data sheets in order that the reader can easily navigate through the file and find the information for each figure panel fast.

      Similarly, the authors should provide access to the scripts that they have developed or adapted from published scripts to perform the analysis of the datasets and obtain the results presented in the manuscript. The access to the scripts used in the manuscript is important to reproduce analysis. The scripts can be deposited at github, for example. 2. The results and their interpretations are mainly done based on in silico analysis from publicly available transcriptomic datasets. The confirmation of the results obtained by the in silico analysis is limited to the last figure, in which the authors show results obtained by multiplexed immunohistochemistry and histo-cytometry of tissue microarrays from a curated cohort of FFPE samples.

      The relevance of the results obtained by the in silico analysis may increase if the authors could present results either in a conditional (lung-specific) Kras mutant mouse model, or patient-derived xenograft (PDX) mouse model of lung cancer. The PDX mouse model will be more suitable in case that access to genetically modified mouse models is not given and/or the time for the experiments is limited. In both cases, the hyperactivation of the small GTPase KRAS should expand the BM gene expression signature in the mouse lung in a Sox9-dependent manner, thereby leading to lung tumors. Further, Sox9 loss-of-function experiments should reduce the BM gene expression signature and favor the ALV gene expression signature. These results would strongly support the interpretation of the in silico results by the authors in the present manuscript, and would significantly increase the impact of the manuscript in the scientific field of lung cancer. 3. In general, the description of the results in the corresponding section of the manuscript can be improved to facilitate the understanding of the results presented. As an example, the figure 1B is described on page 13 as follows:

      "...we first used a publicly available microarray dataset [9] to identify genes differentially expressed between epithelial cells engaged in BM (embryonic day 14 [E14]) or ALV (embryonic day 19 [E19]) (Figure 1B and TABLE S1)."

      By looking at the plot in figure 1B, this description is not sufficient to understand what the authors present in this figure panel, not even after reading the corresponding figure legend.

      Another example is the description of the figure 3B on page 16:

      "This showed low levels of BM activation in tumour cells from residual disease (RD) that was significantly increased in samples with recurrent progressive disease (PD) (Figure 3B)."

      By looking figure 3B and the corresponding figure legend, one cannot find the group "residual disease (RD)".

      Another example is the description of the figure 3C and 3D on page 16:

      "Single-cell analysis showed that both ALV-BM- and ALV-BM+ LUAD cells were increased in samples from recurrent progressive disease (Figure 3C,D)."

      By looking at figure 3D and the corresponding legend, I do not find the explanation of "TRUE" and "FALSE". The same is for figures 3J and 3M.

      Other figure panels were also poorly described in the results section and in the corresponding legends.

      Further, the presentation of the results in the main and supplementary figures has to be improved. For example, labeling of the Y-axis in the figures 1H to 1J, 2C, 2D, 2G, 2H, 3B, 3C, 3J, 3L, etc. has to be improved. As a point of reference, I would suggest checking how other authors present similar results in life science journals.

      These deficiencies in the presentation and description of the results make it difficult for the readers to understand the manuscript. 4. The authors write on page 18

      "Despite AT2 cells being well described as the cell of origin for LUAD, this population was significantly less abundant in LUAD samples compared to control, demonstrating a high degree of transcriptomic plasticity within LUAD epithelium (Figure 4D)."

      How can the authors show that these results are not produced by the process of integration of the four scRNA-seq NSCLC datasets, the implementation of a specific machine learning classifier for the cell type-classification, or the manually filtration to exclude doublets? For example, will the authors achieve the same (or similar) results using a different machine learning classifier? If yes, please include the results in the manuscript.

      Minor comments:

      1. Please introduce the abbreviation for alveogenesis the first time that is used in the abstract, as it was done for branching morphogenesis.
      2. On page 18 the author write:

      "Consistent with the analyses presented above, pseudo bulk expression profiles for each sample showed that ALV and BM scores were significantly negatively correlated (r = -0.68, p = 4.1e-09)."

      Where are these results shown? I was not able to find these results. If they are not in the current version of the manuscript, please include the results 7. The authors should submit a supplementary table containing a list of the different data sets that were used for this manuscript. The table should include accession numbers and links to the different depositories, in which the data sets can be found. Thiy will improve the overview of the datasets used in the study, as well as facilitate the finding of the datasets by the readers. 8. In figure 1G, change the color for FALSE in the legend. 9. Provide the complete list of mutated genes for Figure S2C

      Significance

      Conceptually, Bienkowska KJ et al. propose that LUAD tumors undergo reversion from an alveogenic to branching morphogenic phenotype during disease progression, generating inflamed or basal-like cell states that are variably persistent following TKI or ICB treatments. This concept is in line with reports using murine models of Kras-driven LUAD. In addition, there are parallels with findings in idiopathic pulmonary fibrosis (IPF, another hyperproliferative lung disease), in which KRT5-/KRT17+ basaloid cells were transiently found, like the basal-like phenotype that Bienkowska KJ identified in human LUAD. In other words, the concept proposed by the authors is novel and in line with previous publication in LC and IPF.

      The in silico analysis of publicly available transcriptomic datasets presented by Bienkowska KJ et al. is original and comprehensive. It is an interesting contribution to the cancer research field. However, the impact of their findings to this scientific field will significantly increase if the authors could confirm the interpretation of their results using other experimental systems in addition to the one used in the las figure. For example, the experiments that I suggested in point 2., using either conditional Kras transgenic mice or a PDX mouse model for lung cancer will not only confirm the concpet proposed by the authors, but it will also provide further mechanistic insides related to this model at cellular and molecular level.

      Overall, the manuscript by Bienkowska KJ et al. addresses topics that are relevant to the field of lung cancer, the leading cause of cancer-related deaths worldwide. The bioinformatic methods implemented are cutting-edge. However, the text of the manuscript and the presentation of the results in the figures have to be improved to better exploit the potential of their findings. In addition, further experiments should be performed to confirm (and perhaps complement) the interpretation of their findings. I hope that my comments support the authors to improve the manuscript to reach the standard of manuscripts recently published at renowned journals in Review COMMONS. I recommend a major revision of the manuscript before publication.

  4. Aug 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      GENERAL COMMENTS

      We thank the three reviewers for their comments on the paper.

      We are pleased to see that they consider it be a comprehensive and well-executed study, which clearly establishes a previously overlooked connection between MRTF-SRF signalling and proliferation, and that its conclusions require no further experimentation.

      As review 3 points out, this work has implications for cancer biology, and suggests new research routes to understand the relation between cell adhesion, proliferation, and transformation.

      However, two referees raise significant concerns about its impact

      Review 1 suggests that the paper lacks impact without exploration the wider biological significance of our observations, although it considers it to be a good basic cell biology study. It suggests further work extending the findings to tissue- or tumor-based systems. While we consider such studies worthwhile – indeed we are currently pursuing these directions – we consider them beyond the scope of the present paper.

      Review 2 questions the novelty of our findings. We strongly disagree. This is is the first study to show that MRTF-SRF signalling is required for the proliferation of both primary and immortalised fibroblasts, and epithelial cells. We show that MRTF inactivation leads cells to enter a quiescence-like state under conditions that would permit efficient cell cycle progression in wildtype cells. The study will alter the field's perspective on the role of MRTF-SRF signalling, previously viewed as concerned with cell adhesion, morphology, and motility.

      Responses to individual reviews (italic) follow in regular text.

      RESPONSE TO INDIVIDUAL REVIEWS (comments in italic, response in regular, changes made)

      __Reviewer #1 __

      *(Evidence, reproducibility and clarity (Required)): *

      *The manuscript by Neilsen et al. presents a thorough and well-structured study showing that Myocardin-related transcription factors (MRTF-A/B), via MRTF-SRF, are essential for the proliferation of both primary and immortalized fibroblasts and epithelial cells. Using a combination of knockouts/rescue experiments, cytoskeletal analysis, and transcriptomics, the authors demonstrate that MRTF-SRF signalling controls actin dynamics and contractility-key drivers of cell cycle progression. Notably, they show that the proliferative arrest caused by MRTF loss is reversible, distinguishing it from classical senescence. **

      Major points*

      • The link between MRTF-SRF activity, cytoskeletal organisation, and cell proliferation is clearly established. The fact that disrupting contractility phenocopies MRTF loss strengthens the case that the pathway acts through mechanical control.*
      • The authors support their conclusions using multiple cell types (MEFs, primary fibroblasts, epithelial cells), a range of complementary assays (RNA-seq, traction force microscopy, adhesion/spreading), and genetic tools (CRISPR, inducible rescue).*
      • The ability to restore proliferation by re-expressing MRTF-A argues against true senescence and instead suggests a quiescence-like state driven by cytoskeletal disruption.*
      • This work particularly highlights how mechanical inputs feed into transcriptional programs to regulate proliferation, with implications for understanding anchorage-dependent growth.**

      Suggestions While the authors argue convincingly against classical senescence, elevated SA-βGal and SASP expression suggest a more nuanced arrest state. It not really clear what this state is or is not, therefore a deeper discussion of possible hybrid or intermediate states would be helpful - maybe potential additional experiments to include or exclude potential explanations - e.g. how does it differ from G0 exit?* Our findings show that MRTF inactivation inhibits cell proliferation under conditions that would permit efficient cell cycle progression in wildtype cells, inducing a state with some features associated with classical senescence, and others conventionally associated with reversible cell cycle arrest/quiescence. The reviewer correctly points out that this raises problems with accurately defining the nature of the MRTF-null proliferation defect.

      To our knowledge there are no rigorously defined unambiguous markers for senescence, quiescence, or G0. Indeed, recent studies have shown that senescence and quiescence / G0 states are not as distinct as previously assumed (Anwar et al, 2018; Ashraf et al 2023) as we reviewed in detail in Discussion p27, §2; p28 §3. We therefore do not consider it a productive endeavour to define markers for the MRTF-null state as opposed to defining its mechanistic basis. However, we agree that we should have been clearer about how the phenotypes we observe relate to classical cell arrest states.

      We have therefore revised the presentation of the Results to make it clear which features of the non-proliferative state associated with MRTF inactivation are seen in classical senescence, and which are found in reversible cell cycle exit or quiescence.

      Things done:

      • __Results pp16-17 and Fig 1. Figure panels and presentation are reordered to present “senescence” features together before marker expression (panel G is now panel I). Text now explicitly points out that the spectrum of cell cycle markers, specifically p27 upregulation, is not that associated with classical senescence (p16, p21,etc) but previously linked to reversible arrest or quiescence. Lines 371-380 have been moved up from the succeeding paragraph; statement added re p27 and reversible cell cycle exit on lines 387-389; summary sentence added in lines 398-401). __
      • Statement added that reversibility distinguishes the MRTF defect from classical senescence p20§1 line 454-455.
      • Note that p27 is associated with reversible arrest included on p20§2 line 460. We also explicitly summarised the features of the phenotype at the start of the Discussion.

      • Sentences added p27§1 lines 626-631.

      • Emphasis that p27 protein upregulation is associated with reversible cell cycle inhibition and quiescence is added on p28 line 668-669.

      • The transcriptomic data are strong, but the paper would benefit from zooming in on specific MRTF-SRF targets (e.g., actin isoforms, adhesion molecules) that directly link cytoskeletal regulation to cell cycle control.*

      We have now clarified presentation of the RNAseq data in Figure 5 and the data summary tables. Figure 5B now identifies which of those genes showing deficits in MRTF-null MEFs were previously identified as direct genomic targets for MRTF-SRF, and that the majority are cytoskeletal.

      • __Additional columns added in Table 1 to indicate whether genes are candidate genomic MRTF-SRF targets; Table 2 now show gene symbol lists as well as ENSMBL IDs for GO categories and NCBI Entrez IDs for GSEA categories, respectively. __
      • __Figure 5B revised to point out cytoskeletal genes that are genomic MRTF-SRF targets in bold, legend clarified p40 lines 920-922. __
      • Now noted____ p23 lines 527-529 that cytoskeletal genes affected include many direct MRTF-SRF targets. Our data confirms that in MEFs, MRTF inactivation affects fibroblast cell morphology, adhesion, spreading, motility and contractility (Figures 5, 6), as seen in many other settings.

      A critical question remains as to whether these effects a reflect limitation in one MRTF target gene or several, and how this defect relates to proliferation.

      Concerning specific MRTF-SRF gene targets:

      Cells lacking cytoplasmic actins are reported to exhibit defective proliferation, (__now noted in Results p23 lines 529-532). __We are currently evaluating whether this defect has similarities with the MRTF-null proliferation phenotype (see Discussion p31, §2).

      Previous findings suggest that defective cytoplasmic actin expression may underlie most MRTF knockout phenotypes (Salvany et al, 2014; Maurice et al., 2024) previously noted in the Discussion (see p31, §2).

      The myoferlin gene promotes growth of liver cancer cells by inhibiting ERK activation and oncogene induced senescence. We showed that myoferlin expression does not promote proliferation of MRTF-null MEFs in the original submission (see Figure S5E). Additionally, we now point out that the RNAseq data show that myoferlin expression is not significantly affected in MRTF-null MEFs __(new text p23, lines 532-534). __

      • It depends on where what target journal would be, but this is is a very well executes mechanistic study that doesn't really have an impact. Extending the discussion to human systems-or tissues where contractility is critical-could broaden the impact and applicability of the findings.*

      We interpret this comment as indicating that our paper does not address the wider biological implications of our findings by extension to studies in tissue or tumour systems.

      As outlined in our response to review 3, our study provides strong evidence that MRTF-SRF will be required for cell proliferation in settings where physical progression through cell cycle transitions requires high contractility, either owing to intrinsic factors or external physical constraints such as tissue stiffness, fibrosis, or tumour microenvironment.

      Discussion now explicitly addresses potential roles for tissue stiffness (pp30§2 lines 717-718, and p32§1 725-727). However, we feel that resolution of this question is beyond the scope of the present paper.

      • As above, the paper briefly mentions transformation, but it would be valuable to elaborate on whether MRTF-SRF acts as a barrier or enabler in tumorigenesis under different conditions. This I feel is the main weakness remaining - e.g. it would be fine with enabling different effects driven by other transcription events in emerging tumour cells (oncogenic in context of RAS, suppressive in context of p53) but I think the manuscript fails to be definitive on this points. Addressing this would make a much stronger and impactful study. I believe they have an impact peice of science that outlines how mechanical events impact cell fate decisions, but this is unlikely to be the driver - ie it facilitates cell fate decisions in context of tissue stiffness.*

      We find it difficult to understand the precise points being made here.

      However, transformation has long been known to bypass physical constraints on proliferation such as the requirement for adhesion. Moreover, MRTF-SRF activity is not necessarily required for proliferation of all transformed cells (Hampl et al, 2013; Medjkane et al, 2009; our unpublished data). The relation of our findings to transformation is thus an open question, which we are actively pursuing. Now noted in revised Discussion p32, lines 752-755.

      MRTF-independent proliferation of tumor cells could reflect oncogenic signals substituting for MRTF-dependent ones (eg from focal adhesions), or from relief of cytoskeletal contraints on proliferation (adhesion independent proliferation). In contrast, in proliferation of DLC1-deleted cancer cells is dependent on suppression of oncogene-induced senescence by MRTF-SRF signalling (Hampl et al, 2013). These points were already made in Discussion p28, pp30-31.

      Although our current work is focussed on cell transformation, we would respectfully suggest the in-depth resolution of this complex question is beyond the scope of the present paper.

      See also response to (3) above.

      *Reviewer #1 (Significance (Required)): *

      *Overall *

      This is a well-executed and insightful study that deepens our understanding of how cytoskeletal signals drive proliferation through MRTF-SRF. It broadens the role of this pathway beyond motility and offers new perspectives on mechanotransduction and cellular plasticity. If is weak in its demonstration of biological significance, but if the aim to to present a pure basic cell biology story it is good.

      The vast majority of work with the SRF system has led to the common perception that its role is exclusively with cell motility and adhesive processes, not proliferation. The results presented in the paper, even if limited to cell culture models, are therefore novel.

      Reviewer #2

      (Evidence, reproducibility and clarity (Required)):

      *In this manuscript, Nielsen and colleagues examine the impact of MRTF-A/B and SRF gene inactivation on cell proliferation. They performed an extensive body of work (using multiple cell types and multiple clones) to show that MRTF inactivation causes cell cycle arrest and senescence (mimicking the phenotype of SRF knockout cells) although the changes in the expression of various CDK inhibitors were cell-type specific. *

      *Very interestingly, simultaneous inactivation of all three major CDK inhibitors failed to rescue MRTF knockout cells from their proliferation defect. Expectedly, MRTF knockout cells exhibited defects in actin cytoskeleton, adhesion, and contractility. Interestingly, hyperactivating Rho also failed to rescue MRTF knockout cells from proliferation defect. The main conclusion of the paper was derived from experiments which showed that inhibition of either ROCK or myosin caused wild-type cells to behave like MRTF knockout cells rather than demonstration of any molecular perturbation that could reverse the proliferation defect of MRTF knockout cells. *

      While the experimental studies are thorough and rigorous, a vast majority of the core findings related to the loss-of-function of MRTF that are reported herein (i.e. defects in cell proliferation, elevation of CDK inhibitors, migration, actin cytoskeleton, contractility) are not conceptually new and have been previously reported in other cell systems by several investigators including this research group.

      This is the first study showing that MRTF-SRF signalling is required for the proliferation of both primary and immortalised fibroblasts, and epithelial cells. We show that the MRTF-SRF non-proliferative state combines features of both classical senescence and reversible cell cycle exit / quiescence.

      The vast majority of previous work with the SRF system has led to the common perception that its role is exclusively related to cell motility and adhesive processes and not proliferation (see Olson and Nordheim 2010). Where proliferation has been examined directly, both others and our own previous studies of the MRTFs in immune cells and cancer cells lines have revealed no direct role in proliferation (Schratt et al, 2001;Medjkane et al 2009; Maurice et al, 2024).

      The results presented here are therefore novel.

      In the reviewer's opinion, since the authors have not been able to identify a molecular strategy to reverse the proliferation phenotype of MRTF knockout cells, the underlying mechanisms of MRTF-dependent regulation of cell proliferation remain largely unanswered.

      Indeed, our attempts to rescue the phenotype (knockouts of the CKIs, and overexpression of different downregulated factors) did not restore proliferation. We therefore now aim to attack the problem (i) through overexpression screens, and (ii) by identifying differences between MRTF-SRF dependent and -independent (eg transformed) cells. However, these are new projects that are beyond the scope of a revised paper.

      • *

      Other comments: Majority of the immunoblot data have not been quantified.

      P16 data in Fig 1G vs Fig S1A are not similar (although the authors mention that the findings are similar)

      We have addressed these issues by reorganisation and quantification the immunoblotting data as follows:

      • Figure S1A has been moved to new Figure 1I, replacing the limited analysis shown in old Figure 1G. This more comprehensive, and displays data from all three WT and Mrtfab-/-
      • Figure 1I data is quantified. Marker expression in each Mrtfab-/- pool is evaluated relative its mean expression in the three WT pools treated in parallel.
      • A new Figure S1A shows mean marker expression across the three Mrtfab-/- pools, drawn from 5 independent analyses (not all markers included in each analysis). Different analyses of marker expression may exhibit variation, resulting from differences in handling, culture medium, plating density, relative confluence, etc. However, Mrtfab-/- cells exhibit markedly increased p27 and TLR2 expression, while expression of the other markers tested, including p16, consistently decreases.
      • Spearman comparisons among the WT and Mrtfab-/- pools show that relative marker expression is indeed well correlated between the pools of each genotype. Note on quantitation added in Methods p10 lines 209-213.

      Figure 1I moved from former Figure S1A, to replace former Figure 1G. New legend now includes quantitation, and reference to Spearman correlations, p44 lines 834-841.

      New Figure S1A displays data from multiple independent experiments with all 3 Mrtfab-/- pools. New legend, p44 lines 997-1002.

      Figure S1B legend notes correlation between relative marker expression in untreated WT and Mrtfab-/- cells, p44, lines 1005-1008.

      Results text rewritten p17 lines 383-391; no reference to “similar”.

      *Reviewer #2 (Significance (Required)): *

      *This study aims to investigate a fundamental biological question of how an actin-regulated transcription machinery regulates cell proliferation and is therefore of broad significance. Strengths and limitations of this study are described above. *

      Reviewer #3

      *(Evidence, reproducibility and clarity (Required)): *

      Summary

      *The manuscript by Nielsen et al. (Treisman lab) entitled "MRTF-dependent cytoskeletal dynamics drive efficient cell cycle progression" investigates the effects on cell proliferation elicited upon cellular depletion of the transcription factors MRTF-A and MRTF-B. The MRTFs are actin-dependent co-factors of SRF, which direct the transcription of SRF target genes. The MRTF-SRF regulatory circuit defines both the functioning and the control of actin-driven cytoskeletal dynamics. *

      *The work presented identifies essential molecular links that interconnect cytoskeleton-dependent cellular activities (cell-cell adhesion, cell-substrate contact, cell spreading) and cell proliferation. *

      *General assessment on used methodology. *

      *The presented comprehensive body of work is performed competently; it includes all relevant and necessary state-of-the-art technologies. *

      • *

      Reviewer #3 (Significance (Required)):

      Advance

      Previously published evidence by others (including the Treisman group) had indicated that SRF does not seem essential for the proliferation of some cell types (i. e., embryonic (stem) cells, activation-dependent immune cells, etc.). In regard to this, the authors discuss in the current manuscript: "Although further work is needed to elucidate the basis for these context-dependent dfferences, our data show that MRTF-SRF signalling is likely to play a more general role in proliferation than previously thought." The current manuscript already delineates this "general role": MRTF-SRF signalling impinges on cell proliferation whenever proliferative activities are dependent upon cytoskeletal dynamics.

      We of course support the view that it is MRTF-SRF's role in cytoskeletal dynamics, especially contractility, that is a limiting factor for cell cycle progression in our cells; however, this may not be the cases or other cell types or settings, such adhesion-independent or transformed cells, and/or stiff tissue environments.

      We have stated this view more strongly, modifying the abstract and discussion, and rewording the sentence quoted above.

      The major point is that MRTF-SRF-dependent proliferation may be more common than previously thought, the field having focussed on its role in cytoskeletal dynamics rather than proliferation.

      Abstract lines 48-49; Discussion p28, line 668-669; pp30-31, lines 713-714, 725-727. See also last para pp31/32, __added lines 752-755. __

      *The work has implications for cancer biology. It offers new directions to investigate the regulation of proliferative activities of anchorage-independent tumor cells. **

      Audience *

      *The insights generated serve the wide interests of a large and diverse group of cell and tumor biologists. *

      *Reviewers field of expertise (keywords). *

      Cytoskeletal dynamics, transcriptional con*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Nielsen et al. (Treisman lab) entitled "MRTF-dependent cytoskeletal dynamics drive efficient cell cycle progression" investigates the effects on cell proliferation elicited upon cellular depletion of the transcription factors MRTF-A and MRTF-B. The MRTFs are actin-dependent co-factors of SRF, which direct the transcription of SRF target genes. The MRTF-SRF regulatory circuit defines both the functioning and the control of actin-driven cytoskeletal dynamics. The work presented identifies essential molecular links that interconnect cytoskeleton-dependent cellular activities (cell-cell adhesion, cell-substrate contact, cell spreading) and cell proliferation.

      General assessment on used methodology.

      The presented comprehensive body of work is performed competently; it includes all relevant and necessary state-of-the-art technologies.

      Significance

      Advance

      Previously published evidence by others (including the Treisman group) had indicated that SRF does not seem essential for the proliferation of some cell types (i. e., embryonic (stem) cells, activation-dependent immune cells, etc.). In regard to this, the authors discuss in the current manuscript: "Although further work is needed to elucidate the basis for these context-dependent dfferences, our data show that MRTFSRF signalling is likely to play a more general role in proliferation than previously thought." The current manuscript already delineates this "general role": MRTF-SRF signalling impinges on cell proliferation whenever proliferative activities are dependent upon cytoskeletal dynamics.

      The work has implications for cancer biology. It offers new directions to investigate the regulation of proliferative activities of anchorage-independent tumor cells.

      Audience

      The insights generated serve the wide interests of a large and diverse group of cell and tumor biologists.

      Reviewers field of expertise (keywords).

      Cytoskeletal dynamics, transcriptional control.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Nielsen and colleagues examine the impact of MRTF-A/B and SRF gene inactivation on cell proliferation. They performed an extensive body of work (using multiple cell types and multiple clones) to show that MRTF inactivation causes cell cycle arrest and senescence (mimicking the phenotype of SRF knockout cells) although the changes in the expression of various CDK inhibitors were cell-type specific. Very interestingly, simultaneous inactivation of all three major CDK inhibitors failed to rescue MRTF knockout cells from their proliferation defect. Expectedly, MRTF knockout cells exhibited defects in actin cytoskeleton, adhesion, and contractility. Interestingly, hyperactivating Rho also failed to rescue MRTF knockout cells from proliferation defect. The main conclusion of the paper was derived from experiments which showed that inhibition of either ROCK or myosin caused wild-type cells to behave like MRTF knockout cells rather than demonstration of any molecular perturbation that could reverse the proliferation defect of MRTF knockout cells. While the experimental studies are thorough and rigorous, a vast majority of the core findings related to the loss-of-function of MRTF that are reported herein (i.e. defects in cell proliferation, elevation of CDK inhibitors, migration, actin cytoskeleton, contractility) are not conceptually new and have been previously reported in other cell systems by several investigators including this research group. In the reviewer's opinion, since the authors have not been able to identify a molecular strategy to reverse the proliferation phenotype of MRTF knockout cells, the underlying mechanisms of MRTF-dependent regulation of cell proliferation remain largely unanswered.

      Other comments: Majority of the immunoblot data have not been quantified. P16 data in Fig 1G vs Fig S1A are not similar (although the authors mention that the findings are similar)

      Significance

      This study aims to investigate a fundamental biological question of how an actin-regulated transcription machinery regulates cell proliferation and is therefore of broad significance. Strengths and limitations of this study are described above.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Neilsen et al. presents a thorough and well-structured study showing that Myocardin-related transcription factors (MRTF-A/B), via MRTF-SRF, are essential for the proliferation of both primary and immortalized fibroblasts and epithelial cells. Using a combination of knockouts/rescue experiments, cytoskeletal analysis, and transcriptomics, the authors demonstrate that MRTF-SRF signalling controls actin dynamics and contractility-key drivers of cell cycle progression. Notably, they show that the proliferative arrest caused by MRTF loss is reversible, distinguishing it from classical senescence.

      Major points

      1. The link between MRTF-SRF activity, cytoskeletal organisation, and cell proliferation is clearly established. The fact that disrupting contractility phenocopies MRTF loss strengthens the case that the pathway acts through mechanical control.
      2. The authors support their conclusions using multiple cell types (MEFs, primary fibroblasts, epithelial cells), a range of complementary assays (RNA-seq, traction force microscopy, adhesion/spreading), and genetic tools (CRISPR, inducible rescue).
      3. The ability to restore proliferation by re-expressing MRTF-A argues against true senescence and instead suggests a quiescence-like state driven by cytoskeletal disruption.
      4. This work particularly highlights how mechanical inputs feed into transcriptional programs to regulate proliferation, with implications for understanding anchorage-dependent growth.

      Suggestions

      1. While the authors argue convincingly against classical senescence, elevated SA-βGal and SASP expression suggest a more nuanced arrest state. It not really clear what this state is or is not, therefore a deeper discussion of possible hybrid or intermediate states would be helpful - maybe potential additional experiments to include or exclude potential explanations - e.g. how does it differ from G0 exit?
      2. The transcriptomic data are strong, but the paper would benefit from zooming in on specific MRTF-SRF targets (e.g., actin isoforms, adhesion molecules) that directly link cytoskeletal regulation to cell cycle control.
      3. It depends on where what target journal would be, but this is is a very well executes mechanistic study that doesn't really have an impact. Extending the discussion to human systems-or tissues where contractility is critical-could broaden the impact and applicability of the findings.
      4. As above, the paper briefly mentions transformation, but it would be valuable to elaborate on whether MRTF-SRF acts as a barrier or enabler in tumorigenesis under different conditions. This I feel is the main weakness remaining - e.g. it would be fine with enabling different effects driven by other transcription events in emerging tumour cells (oncogenic in context of RAS, suppressive in context of p53) but I think the manuscript fails to be definitive on this points. Addressing this would make a much stronger and impactful study. I believe they have an impact peice of science that outlines how mechanical events impact cell fate decisions, but this is unlikely to be the driver - ie it facilitates cell fate decisions in context of tissue stiffness.

      Significance

      Overall

      This is a well-executed and insightful study that deepens our understanding of how cytoskeletal signals drive proliferation through MRTF-SRF. It broadens the role of this pathway beyond motility and offers new perspectives on mechanotransduction and cellular plasticity. If is weak in its demonstration of biological significance, but if the aim to to present a pure basic cell biology story it is good.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Proposed revision plan

      Based on the below reviews, we propose the following revision plan. Briefly:

      • We will remove the functional data on TGFβ signaling and mechanical loading/mechanosensing. We agree with the reviewers that we would need to generate additional histological and molecular data from conditional knockout mice, antibody and (ant)agonist treatments and the optogenetic model to determine their exact involvement in lining macrophage maturation. These experiments require significant time and other resources.
      • We would therefore like to uncouple this question for a follow-on manuscript.We will re-focus the manuscript on the developmental data providing a molecular and cellular blueprint of lining macrophage development. This will include our data on CSF1 as a key signal. The novelty and relevance of our developmental data have been highlighted by all three reviewers, and they have also praised the rigor of these experiments and their interpretation. We thus believe that this re-focus will improve the manuscript message.
      • To further enhance this, we are proposing to include additional data delineating the developmental dynamics of synovial fibroblasts. We have generated an in-depth single cell RNAsequencing dataset but did not include fibroblast-specific analyses in the original manuscript. This is not a change proposed by the reviewers, but we are proposing this because we believe this would be an impactful addition to a revised version of our study, providing data also on the maturation of the synovial (lining) macrophage niche.
      • We will otherwise respond to all individual reviewer comments and implement the requested changes, unless technically not possible. Please find below detailed point-by-point answers.

      Reviewer #1

      Evidence, reproducibility and clarity

      In their manuscript entitled "The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ-dependent but monocyte-independent process," the authors explore the developmental trajectory of synovial lining macrophages. They demonstrate that the formation of this specialized macrophage layer is age-dependent and governed by a distinct developmental program that proceeds independently of circulating monocytes. Through scRNA-Seq, the authors show that synovial lining macrophages originate locally from Aqp1⁺ macrophages and are marked by the expression of Csf1r, Tgfbr, and Piezo1. Notably, genetic ablation of each of these factors impaired the development of lining macrophages to varying degrees, suggesting differential contributions of CSF1, TGFβ, and PIEZO1 signaling pathways to their maturation and maintenance.

      The manuscript is well written, and the data quality and representation is of a high standard. The authors have employed a sophisticated array of state-of-the-art mouse models and cutting-edge technologies to elucidate the developmental origin of synovial lining macrophages. Notably, the supporting scRNA-Seq datasets are of excellence and provide valuable insights that will likely be of significant interest to researchers in the field of immunology and joint biology. Accordingly, the experimental approach and interpretations regarding macrophage origin are well-founded and compelling. However, in the eye of the reviewer, the section addressing the underlying molecular mechanisms is a bit less convincing. This part of the study appears slightly underdeveloped, and some of the mechanistic claims lack sufficient experimental clarity. A more rigorous experimental investigation would be essential to reinforce the manuscript's conclusions, particularly concerning the data related to Tgfbr and Piezo1, where the current evidence appears insufficiently substantiated.

      We thank the reviewer for their positive and constructive evaluation of our manuscript. We agree with them (and the other reviewers) that our functional data on the involvement of TGFβ signaling and mechanical loading/mechanosensing are comparably less convincing and substantiated than our developmental data. We are very grateful for their (and the other reviewers’) suggestions to provide more support for the involvement of these factors in lining macrophage development. However, we think that carrying this out to the same high standard will require substantial time and other resources. We have therefore decided to uncouple this from the developmental data and pursue this in follow-up work. We will re-focus the current manuscript on the developmental data. We have proposed to the editors to instead include additional data on synovial fibroblast development, to complement our macrophage data and also delineate the maturation of their niche, thereby providing a conclusive developmental atlas.

      Major point:

      1. The numbers of VSIG4⁺ macrophages appear either unaffected or only minimally altered in both Csf1rMerCreMer Tgfbr2floxed and Fcgr1Cre Piezo1floxed mouse models, respectively. This raises an important question: was the gene deletion efficiency sufficient in each model? Accordingly, the authors are encouraged to include quantitative data on gene deletion efficiency for both mouse models, as this information is critical for interpreting the observed phenotypic outcomes and validating the conclusions regarding gene function. Furthermore, to better assess the impact of Tgfbr2 and Piezo1 disruption, the authors should provide more comprehensive flow cytometry analyses and histological data for these mouse models. Given the apparent homogeneity of VSIG4⁺ macrophages (as shown by the authors themselves), bulk RNA-Seq of sorted Tgfbr2- and Piezo1-deficient VSIG4⁺ macrophages (or from TGFβ-treated animals) would offer valuable insights into both the effectiveness of gene deletion and the molecular pathways governed by TGFβ and PIEZO1 in lining macrophages.

      As outlined above, we have decided to uncouple our functional data on TGFβ, Piezo1 and mechanical loading. The points raised here are all very valid, and we will implement your suggestions in our follow-up functional work focusing on signaling events regulating lining macrophage development. On the suggestion to perform bulk RNA sequencing for VSIG4+ macrophages: This is a good one in principle – although we will not be able to use this strategy where we want to assess the consequences of experimental treatments or genetic models on lining macrophage maturation, because acquisition of VSIG4 is a key maturation event that might be impaired in these conditions.

      Minor points:

      Consistent usage of Cx3cr1-GFP+ nomenclature (for instance: Fig. S1 legend "adult mouse synovial tissue, showing PDGFRα⁺ fibroblasts (yellow) and CX3CR1-GFP⁺ cells (cyan)." versus Fig. 1 legend "Automated spot detection highlights Cx3cr1-GFP⁺ macrophages)".

      We will implement these changes.

      Unclear Fig. 3 legend: "Representative immunofluorescence images of synovial tissue from Clec9aCre:Rosa26lsl-tdT mice at 3 weeks and in adulthood, showing and tdTomato (yellow) and stained for DAPI (blue), VSIG4 (cyan)" Check 'showing and tdTomato.'

      We will implement these changes.

      For greater clarity, it would have been helpful if the transcript names had been directly included within Figures 3C, S3A, and S3C.

      We will implement these changes.

      Page 24: "(Mki67CreERT2:Rosa26lsl-tdT)" Last bracket not superscript.

      We will implement these changes.

      Page 25: "we again leveraged our scRNAsequencing dataset" Missing punctuation.

      We will implement these changes.

      Page 27: Fig. 5C legend: " of synovial tissue of 1 week-old, 3 weeks-old and adult mice." Please specify and change to 'adult Csf1rΔFIRE/ΔFIRE mice'.

      We will implement these changes.

      Page 30: The outcome observed in the Acta1-rtTA:tetO-Cre:ChR2-V5fl mouse model appears to be inconclusive: "This approach resulted in an increased density of VSIG4+ and total (F4/80+) macrophages in the exposed leg of some 5 days-old pups, but others showed the opposite trend (Figure S5D)." This variability may reflect low efficiency of the model or other technical limitations (e.g. muscle contractions frequency or time point of analysis). Given this ambiguity, it is worth reconsidering whether the data are sufficiently robust to warrant inclusion. Should the authors choose to include these findings, further experimentation of appropriate depth and precision is required to allow a conclusive interpretation (either it increases the density of VSIG4+ macrophages or not). The same applies to the Yoda1-treated mice, for which additional data are needed to determine whether VSIG4⁺ macrophage density is truly affected.

      We have decided to remove the data on the optogenetic mouse model and Yoda1 treatment and follow-on separately, implementing these suggestions, including proof of concept data for optogenetically induced muscle contractions.

      Significance

      General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed? This is a well-designed study that uses cutting-edge methodologies to investigate the developmental trajectory of synovial lining macrophages under homeostatic conditions. The authors present robust experimental evidence and compelling interpretations concerning synovial macrophage origin, which are both well-substantiated and impactful. Nonetheless, from the reviewer's perspective, the section exploring the molecular mechanisms underlying macrophage differentiation is comparatively less convincing. This section appears somewhat underdeveloped, as some of the mechanistic claims lack sufficient depth and experimental rigor to fully substantiate the conclusions.

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field: In contrast to earlier studies (PMID: 31391580, 32601335), the inclusion of fate-mapping experiments adds an important dimension, offering novel insight into the ontogeny of synovial macrophages. This expanded perspective may prove particularly valuable in advancing our understanding of joint immunology, especially regarding the local origins and lineage relationships of macrophage populations.

      Furthermore, the authors present novel insights into the molecular pathways underlying the differentiation and development of synovial lining macrophages. By demonstrating previously unrecognized regulatory mechanisms, this work significantly deepens our understanding of the cellular and transcriptional programs that drive macrophage specialization within the joint microenvironment.

      Place the work in the context of the existing literature (provide references, where appropriate): This study builds upon previous work characterizing the macrophage compartment in the joint (PMID: 31391580, 32601335), yet provides a substantially more comprehensive dataset that spans multiple developmental time points and data on the origin of this specialized macrophage subset.

      State what audience might be interested in and influenced by the reported findings: Immunologist, clinicians

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. This study falls well within the scope of the reviewer's expertise in innate immunity.

      Reviewer #2

      Evidence, reproducibility and clarity

      In the manuscript „The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ- dependent but monocyte-independent process", Magalhaes Pinto and colleagues carefully employ a wide range of technologies including single cell profiling, imaging and an exceptional combination of fate mapping models to characterize the ontogeny and development of lining macrophages in the joint, thus dissecting their maturation during postnatal development. Over the last decade, several landmark studies highlighted the imprinting of tissue-resident macrophages by a combination of ontogenetic and tissue-specific niche factors during development. So far, the ontogeny and the tissue niche factors governing the development and maturation of lining macrophages have not been described. Therefore, the results of this study offers insights on a small highly adapted macrophage population with relevance in many disease settings in the joint. Furthermore, the findings are nicely showcasing how macrophages are specializing to even very small tissue niches across development within one bigger anatomical compartment to serve dedicated functions within this niche.

      This manuscript is beautifully written and highlights many novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages.

      In particular, the combination of scRNA-seq and fate mapping is providing a unique the link of transcriptional programs to ontogeny within the tissue niche. Furthermore, the integrative use of distinct fate mapping strategies, transgenic mouse lines, and treatment paradigms to elucidate key niche factors guiding the development and maturation of lining macrophages provides many interesting findings and data that are highly relevant to the field. I really enjoyed reading this manuscript.

      Thank you for your complimentary and constructive assessment of our manuscript, and the detailed comments below, which are very helpful. Please find point-by-point responses below.

      Major points:

      The authors show dynamic regulation of VSIG4 in lining macrophages during development, therefore VSIG4 is maybe not an ideal choice for gating strategies to define lining macrophages or to show as a single markers in immunofluorescence (IF) stainings to demonstrate their abundance across development (even though it is clear that this is the reason why the F4/80 staining is shown next to it). To demonstrate the increase of lining macrophages during development in IF, it would be more helpful if the authors would show quantifications of all F4/80+ cells and additionally VSIG4+ as a proportion of F4/80+ cells (or VSIG4+ F4/80+ and all F4/80+ in a stacked bar plot). We agree with the assessment of VSIG4 not being ideal since this is a key marker of mature lining macrophages only.

      We will provide these additional analyses.

      In Figure 1C, the authors nicely demonstrate that the lining macrophages get closer in their distance across development to build the epithelial-like macrophage structure along the adult lining. Is the close proximity between lining macrophages already fully "matured" at 3 weeks of age and comparable to adults? Please quantify the distance in adult linings.

      We will provide data for adult joints.

      Can the authors explain how the grouping was performed between the analyzed human fetal joints? It is not clear why the cut was chosen between the groups at 16/17 weeks of age. Maybe it would be also beneficial if the authors would consider not grouping these samples but rather show the specific quantifications for each samples individually and estimate via linear regression the expansion over time across human development. Furthermore, can the authors give additional information about the distancing of lining macrophages in the human fetal samples, it would be great to see if they follow the same dynamics as in mouse. Maybe comparison to human juvenile/adult joints would also add on to substantiate the findings in human samples (if possible).

      We will show samples ungrouped and perform linear regression analysis as suggested.

      The scRNA-seq analysis leaves several questions open and some conclusions and workflows cannot be easily followed.

      We appreciate this comment and the complexity of the data, and will implement the below recommendations, and clarify the issues raised.

      It is not clear how and especially why the signature genes to define macrophages vs. monocytes were chosen. Especially as the signature genes for monocytes would not include patrolling monocytes and the macrophage signature genes seem to be highly regulated during development, see also Apoe expression in NB vs. adult in Figure S2e. Why did the authors not take classical markers such as Itgam, Fcgr1a, Csf1r?

      Can dendritic cell signatures be excluded? Cluster 11 and 12 show indeed some DC markers, are these really macrophages?

      The authors provide several figure panels showing TOP marker genes or key marker genes for the identified clusters, however it is not clear if these are TOP DE genes or if the genes were hand chosen. Somehow, the authors give the impression that the clusters were chosen and labeled not based on DE genes, but more on existing literature that previously reported these macrophage populations. DE gene lists for all annotated cell types and macrophage clusters need to be provided within the manuscript.

      The authors claim that Clusters 1 and 4 are "developing" macrophages. How is this defined? Why are these developing cells compared to other clusters? And why are these clusters later on not considered as progenitors of Aqp1 macrophages and Vsig4 macrophages? Why are Aqp1+ macrophages not labeled as developing when they are later on in the manuscript shown as potential intermediate progenitors of lining macrophages?

      Furthermore, it is again confusing that markers are used throughout Figure 2 which are labeled as "key marker genes" for a population and then later on they are claimed to be regulated during development within this population, see for example Figure 2D and 2H.

      It is appreciated that the authors distinguished cycling clusters such as 8, 9, and 10 based on their cycling gene signature. Here it would be very exciting to see a cell cycle analysis across all clusters and time points to see when exactly the cells are expanding during development; this would also substantiate the data later shown for the Mki67-CreERT2 mouse model.

      Can the authors identify certain gene modules during development of lining macrophages (and/or their progenitors) which are associated with certain functions (e.g. GO terms, GSEA enrichment)?

      To determine the actual presence of the identified macrophage clusters from the scRNA-seq as macrophage populations in the joint, the authors should perform IF or FACS for key markers. Especially, Aqp1+ macrophages should be shown in the developing joint.

      We will provide additional data, but would also like to reference a study by collaborators currently in revision at Immunity, which characterizes the Aqp1+ population in detail. We are hoping to have a doi available during our revision process.

      The authors used a wide range of fate mapping models, which is quite unique and highly appreciated. The obtained results and the conclusions made from the models raise a couple of questions: Whereas contribution of HSC-derived/monocyte-derived macrophages to the lining compartment seems to be minor, there is still labeling across different models. Various aspects would need to be clarified.

      We will clarify these data throughout as per below suggestions.

      For example, the authors employ Ms4a3-Cre as a tracing model for GMP-derived monocytes, however all quantifications of the labeling efficiency are not normalized to the labeling in monocytes or another highly recombined cell population. This should be shown, similar to the other fate mapping models (Figure 3 F-I).

      Labelling efficacy for Ms4a3-Cre is near complete for GMP-derived monocytes (and neutrophils) with the Rosa-lsl-tdT (aka Ai14) reporter we have used (see also PMID: 31491389 and doi: 10.1101/2024.12.03.626330); but we will include normalized data as requested.

      Please show Ms4a3 expression across clusters across time points, to exclude expression in fetal-derived clusters.

      We will include this in the revised supplementary information, but there is indeed very little at birth (in line with the original report for other tissues PMID: 31491389).

      In line with the question raised above, if the authors can exclude a development of the Egfr1+ and Clec4n+ developing macrophages into Aqp1+ macrophages and subsequently into Vsig4 lining macrophages, the obtained data from the Ms4a3-Cre model highly suggests a correlative labeling across these clusters what could implicate a relation. However, the authors do not discuss throughout the manuscript the role of these developing macrophages. It is highly encouraged to include this into the manuscript and it would be of high relevance to understand lining macrophage development.

      This is an interesting point and we agree it deserves consideration in the revised manuscript. Indeed, our trajectory analyses do not predict differentiation of the Egfr1+ and Clec4n+ developing macrophages into Aqp1+ macrophages, and hence, ultimately lining macrophages. Conversely, Aqp1+ cells might also convert into Egfr1+ and Clec4n+ developing macrophages. We will elaborate on this more in the revised manuscript.

      The authors conclude from the pseudo bulk transcriptomic profiling of the different macrophage clusters that TdT+ and TdT- macrophages do not differ in their gene expression profile and that this is due to niche imprinting rather than origin imprinting. Even though the data supports that conclusion, the authors should verify if inkling cells early during development also show this similar gene expression profile and gene expression should be compared at the different developmental time points. Tissue niche imprinting is happening within the niche during development, most likely in a stepwise progress, and therefore there should be differences in the beginning.

      This is another important point that we will address in the revised manuscript by performing additional differential gene expression analyses at the different developmental time points, including the earliest stages, as suggested.

      The trajectorial analysis using different pseudotime pipelines is very interesting and nicely points out the potential role of Aqp1 macrophages as intermediates of Vsig4 lining macrophages. From my point of view, all trajectories seem to suggest that Egfr1 developing macrophages and Clec4n developing macrophages might differentiate into Aqp1 macrophages, however the authors are not exploring this further and the role of both developing macrophage clusters is not further discussed (see also comments above).

      We will address and discuss this in the revised manuscript.

      How was the starting point of the trajectorial analyses defined and is it the same for each pipeline used?

      We will clarify this in the revised manuscript.

      Are there potentially two trajectories? It looks like there is one in the beginning of postnatal life and a second one appearing from the monocyte-compartment later in life. If this is true, that would rather speak for a dual ontogeny of Vsig4+ macrophages, wouldn't it?

      We will discuss this in the revised manuscript.

      A heatmap (transcriptional shift) of trajectories between more clusters should be shown at least for Cluster 0,1,2, and 3. It is not sufficient to demonstrate this only between two clusters.

      We will add these analyses during revision.

      To show the similarity between Aqp1 macrophages and proliferating macrophage clusters, the authors should remove the cycling signature and compare these clusters to show that the cycling cells might be Aqp1 macrophages or earlier developing macrophage progenitors aka Clec4n or Egfr1 macrophages.

      We will address this in the revised manuscript.

      The conclusions made from the Mki67-CreERT2 data are a bit difficult to understand, whereas all progenitors (monocyte progenitors and macrophage progenitors will proliferate at the neonatal time point and no conclusions can be made if the cells expand in the niche. The authors should employ Confetti mice or other models (Ubow mice) to analyze clonal expansion in the niche.

      We agree that interpretation of the Mki67-CreERT2 data is complicated by labeling of other cells, and notably, labeling observed in BM-derived cells. We will highlight this better in the revised manuscript. We have tried using Ubow mice to address this issue, but the recombination efficacy we yielded was too low to draw conclusions. We will address this during revision.

      All predicted cell-cell interactions between macrophages and fibroblasts should be provided in a supplementary table. Are the interactions shown in Figure 5 chosen interactions or the TOP predicted ones? Whereas the authors show different numbers of interactions, it is most likely hand-picked and therefore biased.

      We will provide a full list of all predicted interactions in the revised supplementary material in addition to a list of the full differential gene expression analysis.

      The authors further aim to dissect the factors involved in the developmental niche imprinting of lining macrophages. Even though it is highly appreciated that the authors used so many experimental setups to show the reliance of lining macrophages on Csf1 and TGF-beta as well as mechanosensation, the wide range of models the different methods used and selected developmental time points make it very difficult to really interpret the data. The authors should carefully choose time points and methods (either FACS analysis across all models or IF across all, or both). Often deletion efficiencies for transgenic models and proof of concept that the inhibitors and agonists are working in the treatment paradigm are not provided. For example, Csf1rMer-iCre-Mer Tgfbr2fl/fl mice are used but no deletion efficiency is shown or different time points of analysis, maybe the macrophages are not properly targeted in the set up.

      We have decided to uncouple our experimental data on Tgfb, Piezo1 and mechanosensing/mechanical loading, but are taking this into consideration for revision. In many cases, we have in fact performed flow cytometry and imaging analyses, and agree, we should be showing this consistently.

      The authors have shown the role of Csf1 and Tgfbr2 only for lining macrophages, is this specific in the joint to this population of are subliming macrophages affected in a similar manner.

      We will include data on sublining macrophages in the revised figure (for CSF1; Tgfb data will be uncoupled from this current manuscript).

      Can the authors confirm their results in CSF1R-FIRE mice with anti-Csf1 injections or in Csf1op/op mice?

      We will expand our discussion of the Csf1 findings, and will consider including anti-CSF1 data during revision. Phenotypes on other Csf1(r) deficient mice are published, if not with the same developmental resolution as our time course in Csf1rFIRE knockout mice and with simpler readouts. Csf1op/op mice are indeed deficient in synovial lining macrophages, from 2 days of age onwards (PMID: 8050349), and lining macrophages are also absent from 2-weeks-old and adult Csf1r-/- mice (PMID: 11756160).

      The setup in Figure S5G is very interesting to test the role of movement and mechanical load on the joint, however, there is basically no data on the model provided showing the efficiency of the induced optogenetic muscle contractions, and only one time point is shown.

      Data on mechanical loading will be uncoupled from the current manuscript and substantiated in a separate follow-up.

      The results regarding the role of Piezo1 and mechanosensation vary a lot. Could it be that analyses were done too early or that actually proper weight load on the joint must be applied for the maturation of the macrophages? The authors should test this to.

      We will uncouple these data from the current manuscript during revision. However, this is a possibility that we have discussed. In fact, the most appropriate experimental approach to address the involvement of mechanical loading, onset of walking and specifically, weight bearing would be a loss-of-function approach (i.e. paralysis at the newborn stage), for which we unfortunately could not obtain ethics approval from the UK Home Office.

      The Rolipram experiment is shown in Figure S5G, but is not described in the result section. It only appears at some point in the discussion part. The authors should move it to results or remove it from the manuscript.

      We will incorporate these data with the revised section on developing synovial macrophage populations.

      Minor points:

      Please reference the Figure panels in numeric order throughout the text.

      We will change this where not the case.

      Figure 2a and 2b are a bit out of the storyline, it is not obvious why this is shown here and maybe it would be good to move it to the supplements. Gating strategy is also not used for scRNA-seq. Therefore, it would better fit to the later analysis of joint macrophages across different transgenic mouse models and treatment paradigms. The gating strategies are changing across different experiments throughout the figures, it would be nice to have a similar gating strategy for all experiments, see also Figure 3 where the defining markers for joint macrophages are changing between models.

      We will revise Figures 2, 3 and the related supplementary figures.

      A lot of figure panels have very small labeling that is basically unreadable. Axes at FACS plots for example. Sometimes, it is even impossible to distinguish cluster labels especially when they have similar colors.

      We will revise this, thanks for pointing it out.

      In the text on page 14, many markers are named which are specifically regulated during development in lining macrophages, but these factors are not labeled anywhere in the volcano plot. It would be good to showcase at least some of these named genes in the figure panel, e.g. Trem2.

      We will do this for revision.

      Figure 2F and Figure S2F are really nicely showing the percentage of cells per cluster in each analyzed biological sample. Maybe the authors could additionally consider to show a stacked bar plot with the mean percentage of cells per cluster and how the clusters are distributed across time points?

      We will include this in the revised manuscript.

      Figure 3A: IF for adult lining macrophages and the quantification are missing.

      This will be included in the revised version.

      Significance

      This manuscript highlights novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages. Furthermore, this study showcases in a very elegant and detailed way the adaptation of macrophage progenitors to a highly specific anatomical tissue niche.

      The manuscript is of high interest to basic scientists focussing on macrophage biology and immune cell development and clinicians and clinician scientists focussing on joint diseases such as RA.

      Therefore the manuscript is of interest to a wide community working in immunology.

      Reviewer #3

      Summary:

      Magalhaes Pinto, Malengier-Devlies, and co-authors investigated the developmental origins and maturation of synovial (lining and sublining) macrophages across embryonic, newborn, and postnatal stages in mouse. The authors used multiple transgenic reporter lines, lineage tracing, scRNA-seq, 2D confocal and 3D lightsheet imaging, and perturbations to delineate the macrophage states and ontogeny. They propose a model in which the majority of the joint lining macrophages has a fetal (EMP-derived) origin and a small proportion has a definitive HSC-derived monocyte origin, which both seed and mature within the synovial space in the postnatal period in the first 3 weeks of life. Using cell-cell communication analysis on their scRNA-seq data, they identified Fgf2, Csf1, and Tgfb as candidate signaling pathways that support (lining) macrophage development and maturation. Functional experiments indicate that the process is CSF1 and TGFb-dependent and also partly dependent on mechanosensing through Piezo1.

      The key conclusions on the composition of the synovial macrophages are convincing based on the presented results, and are carefully phrased. The study is very comprehensive, yet the description and organization of the results of the different mouse models could be altered to improve the storyline. Several refinements in data presentation, formulation, and minor validation experiments would further improve the clarity of the story, as well as summary recaps of the major findings throughout the text.

      We thank this reviewer for their detailed review. We will be implementing the requested changes wherever technically feasible.

      Major comments:

      Generally, the story could be more streamlined by introducing earlier reporter lines and lineage-origin logic. Clearly state which reporter/CreERT2 lines and acrosses are used. It was unclear in Figure 2 that cells of the cross of the Cx3cr1-GFP and Ms4a3Cre:Rosa26lsl-tdT reporter lines were used for the scRNA-seq. The principle that there are fetal-derived and bone marrow (GMP)-derived monocytes and macrophages doesn't need to be "hidden" until Figure 3. For example, also the imaging of Ms4a3Cre could be introduced before the scRNA-seq.

      We will revise the structure and order of the manuscript during revision.

      Figure 1 could benefit from a cartoon visualizing the anatomy of the knee joint. The terms "sublining" and "synovium" are now a bit unclear, as it appears that sometimes the synovium is indicated as sublining and vice versa. Additionally, a schematic developmental timeline could be added to indicate the parallels between mouse and human development (fetal and postnatal development in mouse versus gestational age in human). Also, the various waves of hematopoiesis could be indicated in this timeline, which would be particularly helpful for Figure 3 for the lineage-tracing readouts. Lastly, the authors could end the manuscript (a new Figure 6) with a general cartoon summarizing all the results presented.

      We will include illustrations as suggested.

      Figure 1 could be rearranged: first introduce the markers CX3CR1 and VSIG4 (Figure 1D) and then present the quantifications (Figure 1B/E). Where possible, co-visualization CX3CR1-GFP and VSIG4 on tissue sections to strengthen the claims on the relationship between these 2 markers. Tying the scRNA-seq insights (Figure 2) to the imaging would be elegant. Moreover, it would be informative to represent the CX3CR1+ and VSIG4+ macrophages as a percentage of F4/80+ macrophages (Figure 1B/E). Similarly, for the flow cytometry data in Figure 2, the relationship between the markers CX3CR1 and VSIG4 on macrophages could be more clearly displayed and discussed.

      Thanks for this remark. We will endeavor to show co-localization and analysis of both markers wherever possible. However, where we did not use Cx3cr1gfp mice, co-staining was limited by antibody choice.

      The 3D imaging of the joint is a nice addition to the manuscript, as it provides more context to the anatomical structure; however, while the text suggests several newborn joints were imaged, Figure 1F visualizes (again) the knee joint. Could other joints also be represented by 3D imaging? If the knee joint is the only joint available for imaging, and previous confocal imaging focused specifically on the meniscus in the knee joint, could the meniscus also be highlighted in the lightsheet imaging?

      Apologies if this was not clear from the original manuscript text, but we have only imaged the knee joint in 3D. We will clarify this during revision and consider inclusion of additional imaging data.

      Clarification is requested regarding the imaging quantification representation. The M&M section under "Statistical analysis and reproducibility" states that individual data points are displayed, and bars represent the mean. However, some of the Figure legends (e.g., Figures 1B and S1C) specify that each dot corresponds to an individual mouse, with quantification based on 2-3 sections per mouse. While this appears to be a very reasonable representation of the data, does this mean that for each dot, the mean value from the 2-3 sections per mouse was calculated and plotted?

      We will clarify this.

      It is not clear how the differential expression analysis was performed on the Vsig4+ cells. Please specify if Cluster 0 was used for analysis, or all Vsig4-expressing cells? Not all cells in Cluster 0 have Vsig4+ expression. The authors described the expression dynamics of Aqp1 as intriguing, but lack a reasoning on why this is interesting.

      We will revise this section.

      Figure S3E: In line with the previous comment, can the authors justify that the tdTomato+/- comparisons are not biased by scRNA-seq dropout (scRNA-seq is zero-inflated, so some tdTomato- cells could be false negatives), and provide methodological details (thresholds, ambient RNA correction, etc.) to support this?

      We will clarify this and include additional representations of the tdTomato transcript data.

      Although the sex-related differences in macrophage composition and the absence of differential expression are interesting, they distract from the manuscript's main messages. Moreover, the Discussion does not elaborate on how these observations relate to joint (disease) biology. Consider removing this section or integrating it clearly into the relevant biological context.

      We will remove this section as suggested.

      CreERT2 transgenic lines are often not 100% efficient in recombination, also depending on whether tamoxifen or 4-OHT is used. Could the authors report the percentage of tdTomato+ cells in the joints and compare them to the recombination efficiencies in the monocytes/microglia under the same tamoxifen or 4-OHT conditions? This would help clarify how the interpret the macrophage labeling %'s.

      We will report labelling efficacies and/or show normalized data in the revised manuscript.

      Could the authors draw parallels between the observations in the mouse knee joint macrophage populations and literature on other joints in mouse and the knee joint in human (for example, as described in Alivernini et al., 2020 and in the very recent Raut et al., 2025)?

      We will include a section on this in the revised manuscript.

      Minor comments:

      In general, the authors should clarify in the Results what each marker used for imaging, flow cytometry, or in the mouse reporter lines delineates. For example, mention that F4/80 is a marker for tissue-resident macrophages (correct?) in immunofluorescence, that IBA1 is a marker for macrophages on human tissue sections (Figure S1), and PDPN is GP38 (Figure S2 - align usage of marker reference across main text and figures).

      We will implement this request.

      For clarity in the microscopy representation, the single channels should be represented in a grey scale.

      We will revise image presentation.

      Figure S1B: Is CX3CR1 also restricted to the lining macrophages in human? Could a co-staining with IBA1 be performed to strengthen the species similarities?

      To our knowledge, there is no antibody available that works for imaging of human CX3CR1. Moreover, CX3CR1 is only limited to the lining population in adult joints, in fetal and newborn (mouse) joints, all macrophages express this receptor, as do fetal progenitors to macrophages. However, Alivernini and colleagues have reported that TREM2high macrophages are the human counterpart of the mouse CX3CR1+ lining population (PMID: 32601335).

      Adipocyte diameter quantification: Avoid plotting individual adipocytes from 2 mice without per-mouse visualization. Instead, report the mean adipocyte diameter per mouse and plot those means.

      We will implement this change.

      A little typo was spotted in the "Statistical analysis and reproducibility" section: it is Dunn's, not Bunn's multiple-comparison correction.

      Thanks for spotting this.

      Figure 2A: The gating strategy for the CX3CR1-GFP cells is missing.

      We will provide this in the revised manuscript or supplementary material.

      Improve the visualization of some plots. For example, Figure 2F is hard to read because of the big dot size. The dots seem to add no information to the graph and could be removed. Additionally, for comparing the clusters across the different time points, one could project the cells from the other time points in grey in the background.

      We will revise the presentation of these data.

      Figure S2: The dotplot is more informative than the heatmap, consider removing the heatmap.

      We will do that.

      Figure 3A: If technically feasible, image and visualize both the GFP and tdTomato expression. It would be informative to see the Cx3cr1+ and Ms4a3-derived cells in the same specimen.

      We will thrive to show this in the revised manuscript.

      Figure 3C: Highlight that tdTomato expression is visualized here.

      We will do that.

      Figure 3G,F: The authors should place the schematics and graphs next to each other, so the data points can be more easily compared.

      We aim to do this in the revised manuscript.

      Figure 4B: Which co-staining was performed for the immunofluorescence to quantify the % of tdTomato+ cells?

      We co-stained for F4/80 and assessed localization in the lining or sublining. This will be clarified in the revised Figure legend.

      Figure 4C: The trajectory analysis appears to have an arrow pointing from the Ccr2+ macrophages to the Ly6c+ monocytes. Please verify this directionality, as its seems against the known biology.

      This will be addressed during revision.

      Figure 5 mentions that the Csfr1 levels were reduced in a tissue-specific manner, but it is unclear how this tissue specificity was achieved.

      We apologize for this misunderstanding. Csfr1FIRE mice are not tissue-specific knockouts, but they are more specific than global knockout mice, since only a (myeloid-specific) enhancer is affected. We will clarify this in the relevant section.

      For the TGFb perturbations (Tgfbr2 KO and systemic TGFb depletion): did the authors validate reduced TGFb pathway activity in the macrophages, for example, reduced pSMAD2/3 levels? This would validate the effectiveness of the perturbations. This is an important point, and assessing signaling events downstream of TGFb is a very good suggestion.

      As per above comment, we have decided to uncouple the functional data with exception of CSF1 from the revised version of the current manuscript, but we will be taking this into account for substantiating our functional data in follow-up work.

      Figure 5F could benefit from a timeline of the treatment.

      As for the previous point raised, we will be taking this into account for follow-up work on the uncoupled functional data.

      The Methods mention that Gene Ontology analysis was performed on the single-cell data, but the results are not plotted in a figure. It would be informative to include this GO/pathway analysis in the appropriate figure(s).

      We will include this in the revised (supplementary) information.

      Significance:

      This work provides a high temporal-resolution and "spatial" resolution reference map of the ontogeny and maturation of the synovial lining macrophages in the knee joint. It complements existing literature that demonstrated the presence of tissue-resident macrophages in the synovial space and lining (Culemann, et al., 2019 and others) by charting the embryonic-to-postnatal emergence of lining and sublining subsets. In particular, this mouse work identified some key signaling pathways in shaping this tissue compartment. This dataset serves as a robust, steady-state reference for joint pathology and can be implemented with human studies on disease biology of the knee joint (e.g., Alivernini et al., 2020; Raut et al., 2025). Insights into the exact developmental origins, mechanisms contributing to diverse or seemingly similar cell types, and distinct maturation processes are crucial to understanding disease biology, in which developmental processes can be hijacked/reactivated.

      These findings will interest researchers in joint disease biology (osteoarthritis and immune-mediated arthritides such as RA and psoriasis), macrophage development (tissue-resident vs monocyte-derived lineages), the bone/joint microenvironment, and joint mechanobiology.

      The reviewer's expertise is in developmental biology, mesoderm, bone biology, hematopoiesis, and monocyte/macrophage biology in disease

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Magalhaes Pinto, Malengier-Devlies, and co-authors investigated the developmental origins and maturation of synovial (lining and sublining) macrophages across embryonic, newborn, and postnatal stages in mouse. The authors used multiple transgenic reporter lines, lineage tracing, scRNA-seq, 2D confocal and 3D lightsheet imaging, and perturbations to delineate the macrophage states and ontogeny. They propose a model in which the majority of the joint lining macrophages has a fetal (EMP-derived) origin and a small proportion has a definitive HSC-derived monocyte origin, which both seed and mature within the synovial space in the postnatal period in the first 3 weeks of life. Using cell-cell communication analysis on their scRNA-seq data, they identified Fgf2, Csf1, and Tgfb as candidate signaling pathways that support (lining) macrophage development and maturation. Functional experiments indicate that the process is CSF1 and TGFb-dependent and also partly dependent on mechanosensing through Piezo1. The key conclusions on the composition of the synovial macrophages are convincing based on the presented results, and are carefully phrased. The study is very comprehensive, yet the description and organization of the results of the different mouse models could be altered to improve the storyline. Several refinements in data presentation, formulation, and minor validation experiments would further improve the clarity of the story, as well as summary recaps of the major findings throughout the text.

      Major comments:

      1. Generally, the story could be more streamlined by introducing earlier reporter lines and lineage-origin logic. Clearly state which reporter/CreERT2 lines and acrosses are used. It was unclear in Figure 2 that cells of the cross of the Cx3cr1-GFP and Ms4a3Cre:Rosa26lsl-tdT reporter lines were used for the scRNA-seq. The principle that there are fetal-derived and bone marrow (GMP)-derived monocytes and macrophages doesn't need to be "hidden" until Figure 3. For example, also the imaging of Ms4a3Cre could be introduced before the scRNA-seq.
      2. Figure 1 could benefit from a cartoon visualizing the anatomy of the knee joint. The terms "sublining" and "synovium" are now a bit unclear, as it appears that sometimes the synovium is indicated as sublining and vice versa. Additionally, a schematic developmental timeline could be added to indicate the parallels between mouse and human development (fetal and postnatal development in mouse versus gestational age in human). Also, the various waves of hematopoiesis could be indicated in this timeline, which would be particularly helpful for Figure 3 for the lineage-tracing readouts. Lastly, the authors could end the manuscript (a new Figure 6) with a general cartoon summarizing all the results presented.
      3. Figure 1 could be rearranged: first introduce the markers CX3CR1 and VSIG4 (Figure 1D) and then present the quantifications (Figure 1B/E). Where possible, co-visualization CX3CR1-GFP and VSIG4 on tissue sections to strengthen the claims on the relationship between these 2 markers. Tying the scRNA-seq insights (Figure 2) to the imaging would be elegant. Moreover, it would be informative to represent the CX3CR1+ and VSIG4+ macrophages as a percentage of F4/80+ macrophages (Figure 1B/E). Similarly, for the flow cytometry data in Figure 2, the relationship between the markers CX3CR1 and VSIG4 on macrophages could be more clearly displayed and discussed.
      4. The 3D imaging of the joint is a nice addition to the manuscript, as it provides more context to the anatomical structure; however, while the text suggests several newborn joints were imaged, Figure 1F visualizes (again) the knee joint. Could other joints also be represented by 3D imaging? If the knee joint is the only joint available for imaging, and previous confocal imaging focused specifically on the meniscus in the knee joint, could the meniscus also be highlighted in the lightsheet imaging?
      5. Clarification is requested regarding the imaging quantification representation. The M&M section under "Statistical analysis and reproducibility" states that individual data points are displayed, and bars represent the mean. However, some of the Figure legends (e.g., Figures 1B and S1C) specify that each dot corresponds to an individual mouse, with quantification based on 2-3 sections per mouse. While this appears to be a very reasonable representation of the data, does this mean that for each dot, the mean value from the 2-3 sections per mouse was calculated and plotted?
      6. It is not clear how the differential expression analysis was performed on the Vsig4+ cells. Please specify if Cluster 0 was used for analysis, or all Vsig4-expressing cells? Not all cells in Cluster 0 have Vsig4+ expression. The authors described the expression dynamics of Aqp1 as intriguing, but lack a reasoning on why this is interesting.
      7. Figure S3E: In line with the previous comment, can the authors justify that the tdTomato+/- comparisons are not biased by scRNA-seq dropout (scRNA-seq is zero-inflated, so some tdTomato- cells could be false negatives), and provide methodological details (thresholds, ambient RNA correction, etc.) to support this?
      8. Although the sex-related differences in macrophage composition and the absence of differential expression are interesting, they distract from the manuscript's main messages. Moreover, the Discussion does not elaborate on how these observations relate to joint (disease) biology. Consider removing this section or integrating it clearly into the relevant biological context.
      9. CreERT2 transgenic lines are often not 100% efficient in recombination, also depending on whether tamoxifen or 4-OHT is used. Could the authors report the percentage of tdTomato+ cells in the joints and compare them to the recombination efficiencies in the monocytes/microglia under the same tamoxifen or 4-OHT conditions? This would help clarify how the interpret the macrophage labeling %'s.
      10. Could the authors draw parallels between the observations in the mouse knee joint macrophage populations and literature on other joints in mouse and the knee joint in human (for example, as described in Alivernini et al., 2020 and in the very recent Raut et al., 2025)?

      Minor comments:

      1. In general, the authors should clarify in the Results what each marker used for imaging, flow cytometry, or in the mouse reporter lines delineates. For example, mention that F4/80 is a marker for tissue-resident macrophages (correct?) in immunofluorescence, that IBA1 is a marker for macrophages on human tissue sections (Figure S1), and PDPN is GP38 (Figure S2 - align usage of marker reference across main text and figures).
      2. For clarity in the microscopy representation, the single channels should be represented in a grey scale.
      3. Figure S1B: Is CX3CR1 also restricted to the lining macrophages in human? Could a co-staining with IBA1 be performed to strengthen the species similarities?
      4. Adipocyte diameter quantification: Avoid plotting individual adipocytes from 2 mice without per-mouse visualization. Instead, report the mean adipocyte diameter per mouse and plot those means.
      5. A little typo was spotted in the "Statistical analysis and reproducibility" section: it is Dunn's, not Bunn's multiple-comparison correction.
      6. Figure 2A: The gating strategy for the CX3CR1-GFP cells is missing.
      7. Improve the visualization of some plots. For example, Figure 2F is hard to read because of the big dot size. The dots seem to add no information to the graph and could be removed. Additionally, for comparing the clusters across the different time points, one could project the cells from the other time points in grey in the background.
      8. Figure S2: The dotplot is more informative than the heatmap, consider removing the heatmap.
      9. Figure 3A: If technically feasible, image and visualize both the GFP and tdTomato expression. It would be informative to see the Cx3cr1+ and Ms4a3-derived cells in the same specimen.
      10. Figure 3C: Highlight that tdTomato expression is visualized here.
      11. Figure 3G,F: The authors should place the schematics and graphs next to each other, so the data points can be more easily compared.
      12. Figure 4B: Which co-staining was performed for the immunofluorescence to quantify the % of tdTomato+ cells?
      13. Figure 4C: The trajectory analysis appears to have an arrow pointing from the Ccr2+ macrophages to the Ly6c+ monocytes. Please verify this directionality, as its seems against the known biology.
      14. Figure 5 mentions that the Csfr1 levels were reduced in a tissue-specific manner, but it is unclear how this tissue specificity was achieved.
      15. For the TGFb perturbations (Tgfbr2 KO and systemic TGFb depletion): did the authors validate reduced TGFb pathway activity in the macrophages, for example, reduced pSMAD2/3 levels? This would validate the effectiveness of the perturbations.
      16. Figure 5F could benefit from a timeline of the treatment.
      17. The Methods mention that Gene Ontology analysis was performed on the single-cell data, but the results are not plotted in a figure. It would be informative to include this GO/pathway analysis in the appropriate figure(s).

      Significance

      This work provides a high temporal-resolution and "spatial" resolution reference map of the ontogeny and maturation of the synovial lining macrophages in the knee joint. It complements existing literature that demonstrated the presence of tissue-resident macrophages in the synovial space and lining (Culemann, et al., 2019 and others) by charting the embryonic-to-postnatal emergence of lining and sublining subsets. In particular, this mouse work identified some key signaling pathways in shaping this tissue compartment. This dataset serves as a robust, steady-state reference for joint pathology and can be implemented with human studies on disease biology of the knee joint (e.g., Alivernini et al., 2020; Raut et al., 2025). Insights into the exact developmental origins, mechanisms contributing to diverse or seemingly similar cell types, and distinct maturation processes are crucial to understanding disease biology, in which developmental processes can be hijacked/reactivated.

      These findings will interest researchers in joint disease biology (osteoarthritis and immune-mediated arthritides such as RA and psoriasis), macrophage development (tissue-resident vs monocyte-derived lineages), the bone/joint microenvironment, and joint mechanobiology.

      The reviewer's expertise is in developmental biology, mesoderm, bone biology, hematopoiesis, and monocyte/macrophage biology in disease

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript „The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ- dependent but monocyte-independent process", Magalhaes Pinto and colleagues carefully employ a wide range of technologies including single cell profiling, imaging and an exceptional combination of fate mapping models to characterize the ontogeny and development of lining macrophages in the joint, thus dissecting their maturation during postnatal development. Over the last decade, several landmark studies highlighted the imprinting of tissue-resident macrophages by a combination of ontogenetic and tissue-specific niche factors during development. So far, the ontogeny and the tissue niche factors governing the development and maturation of lining macrophages have not been described. Therefore, the results of this study offers insights on a small highly adapted macrophage population with relevance in many disease settings in the joint. Furthermore, the findings are nicely showcasing how macrophages are specializing to even very small tissue niches across development within one bigger anatomical compartment to serve dedicated functions within this niche.

      This manuscript is beautifully written and highlights many novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages.

      In particular, the combination of scRNA-seq and fate mapping is providing a unique the link of transcriptional programs to ontogeny within the tissue niche. Furthermore, the integrative use of distinct fate mapping strategies, transgenic mouse lines, and treatment paradigms to elucidate key niche factors guiding the development and maturation of lining macrophages provides many interesting findings and data that are highly relevant to the field. I really enjoyed reading this manuscript.

      Major points:

      1) The authors show dynamic regulation of VSIG4 in lining macrophages during development, therefore VSIG4 is maybe not an ideal choice for gating strategies to define lining macrophages or to show as a single markers in immunofluorescence (IF) stainings to demonstrate their abundance across development (even though it is clear that this is the reason why the F4/80 staining is shown next to it). To demonstrate the increase of lining macrophages during development in IF, it would be more helpful if the authors would show quantifications of all F4/80+ cells and additionally VSIG4+ as a proportion of F4/80+ cells (or VSIG4+ F4/80+ and all F4/80+ in a stacked bar plot).

      2) In Figure 1C, the authors nicely demonstrate that the lining macrophages get closer in their distance across development to build the epithelial-like macrophage structure along the adult lining. Is the close proximity between lining macrophages already fully "matured" at 3 weeks of age and comparable to adults? Please quantify the distance in adult linings.

      3) Can the authors explain how the grouping was performed between the analyzed human fetal joints? It is not clear why the cut was chosen between the groups at 16/17 weeks of age. Maybe it would be also beneficial if the authors would consider not grouping these samples but rather show the specific quantifications for each samples individually and estimate via linear regression the expansion over time across human development. Furthermore, can the authors give additional information about the distancing of lining macrophages in the human fetal samples, it would be great to see if they follow the same dynamics as in mouse. Maybe comparison to human juvenile/adult joints would also add on to substantiate the findings in human samples (if possible).

      4) The scRNA-seq analysis leaves several questions open and some conclusions and workflows cannot be easily followed.

      a. It is not clear how and especially why the signature genes to define macrophages vs. monocytes were chosen. Especially as the signature genes for monocytes would not include patrolling monocytes and the macrophage signature genes seem to be highly regulated during development, see also Apoe expression in NB vs. adult in Figure S2e. Why did the authors not take classical markers such as Itgam, Fcgr1a, Csf1r?

      b. Can dendritic cell signatures be excluded? Cluster 11 and 12 show indeed some DC markers, are these really macrophages?

      c. The authors provide several figure panels showing TOP marker genes or key marker genes for the identified clusters, however it is not clear if these are TOP DE genes or if the genes were hand chosen. Somehow, the authors give the impression that the clusters were chosen and labeled not based on DE genes, but more on existing literature that previously reported these macrophage populations. DE gene lists for all annotated cell types and macrophage clusters need to be provided within the manuscript.

      d. The authors claim that Clusters 1 and 4 are "developing" macrophages. How is this defined? Why are these developing cells compared to other clusters? And why are these clusters later on not considered as progenitors of Aqp1 macrophages and Vsig4 macrophages? Why are Aqp1+ macrophages not labeled as developing when they are later on in the manuscript shown as potential intermediate progenitors of lining macrophages?

      e. Furthermore, it is again confusing that markers are used throughout Figure 2 which are labeled as "key marker genes" for a population and then later on they are claimed to be regulated during development within this population, see for example Figure 2D and 2H.

      f. It is appreciated that the authors distinguished cycling clusters such as 8, 9, and 10 based on their cycling gene signature. Here it would be very exciting to see a cell cycle analysis across all clusters and time points to see when exactly the cells are expanding during development; this would also substantiate the data later shown for the Mki67-CreERT2 mouse model.

      g. Can the authors identify certain gene modules during development of lining macrophages (and/or their progenitors) which are associated with certain functions (e.g. GO terms, GSEA enrichment)?

      5) To determine the actual presence of the identified macrophage clusters from the scRNA-seq as macrophage populations in the joint, the authors should perform IF or FACS for key markers. Especially, Aqp1+ macrophages should be shown in the developing joint.

      6) The authors used a wide range of fate mapping models, which is quite unique and highly appreciated. The obtained results and the conclusions made from the models raise a couple of questions: Whereas contribution of HSC-derived/monocyte-derived macrophages to the lining compartment seems to be minor, there is still labeling across different models. Various aspects would need to be clarified.

      a. For example, the authors employ Ms4a3-Cre as a tracing model for GMP-derived monocytes, however all quantifications of the labeling efficiency are not normalized to the labeling in monocytes or another highly recombined cell population. This should be shown, similar to the other fate mapping models (Figure 3 F-I).

      b. Please show Ms4a3 expression across clusters across time points, to exclude expression in fetal-derived clusters.

      c. In line with the question raised above, if the authors can exclude a development of the Egfr1+ and Clec4n+ developing macrophages into Aqp1+ macrophages and subsequently into Vsig4 lining macrophages, the obtained data from the Ms4a3-Cre model highly suggests a correlative labeling across these clusters what could implicate a relation. However, the authors do not discuss throughout the manuscript the role of these developing macrophages. It is highly encouraged to include this into the manuscript and it would be of high relevance to understand lining macrophage development.

      d. The authors conclude from the pseudo bulk transcriptomic profiling of the different macrophage clusters that TdT+ and TdT- macrophages do not differ in their gene expression profile and that this is due to niche imprinting rather than origin imprinting. Even though the data supports that conclusion, the authors should verify if inkling cells early during development also show this similar gene expression profile and gene expression should be compared at the different developmental time points. Tissue niche imprinting is happening within the niche during development, most likely in a stepwise progress, and therefore there should be differences in the beginning.

      7) The trajectorial analysis using different pseudotime pipelines is very interesting and nicely points out the potential role of Aqp1 macrophages as intermediates of Vsig4 lining macrophages. From my point of view, all trajectories seem to suggest that Egfr1 developing macrophages and Clec4n developing macrophages might differentiate into Aqp1 macrophages, however the authors are not exploring this further and the role of both developing macrophage clusters is not further discussed (see also comments above).

      8) How was the starting point of the trajectorial analyses defined and is it the same for each pipeline used?

      9) Are there potentially two trajectories? It looks like there is one in the beginning of postnatal life and a second one appearing from the monocyte-compartment later in life. If this is true, that would rather speak for a dual ontogeny of Vsig4+ macrophages, wouldn't it?

      10) A heatmap (transcriptional shift) of trajectories between more clusters should be shown at least for Cluster 0,1,2, and 3. It is not sufficient to demonstrate this only between two clusters.

      11) To show the similarity between Aqp1 macrophages and proliferating macrophage clusters, the authors should remove the cycling signature and compare these clusters to show that the cycling cells might be Aqp1 macrophages or earlier developing macrophage progenitors aka Clec4n or Egfr1 macrophages.

      12) The conclusions made from the Mki67-CreERT2 data are a bit difficult to understand, whereas all progenitors (monocyte progenitors and macrophage progenitors will proliferate at the neonatal time point and no conclusions can be made if the cells expand in the niche. The authors should employ Confetti mice or other models (Ubow mice) to analyze clonal expansion in the niche.

      13) All predicted cell-cell interactions between macrophages and fibroblasts should be provided in a supplementary table. Are the interactions shown in Figure 5 chosen interactions or the TOP predicted ones? Whereas the authors show different numbers of interactions, it is most likely hand-picked and therefore biased.

      14) The authors further aim to dissect the factors involved in the developmental niche imprinting of lining macrophages. Even though it is highly appreciated that the authors used so many experimental setups to show the reliance of lining macrophages on Csf1 and TGF-beta as well as mechanosensation, the wide range of models the different methods used and selected developmental time points make it very difficult to really interpret the data. The authors should carefully choose time points and methods (either FACS analysis across all models or IF across all, or both). Often deletion efficiencies for transgenic models and proof of concept that the inhibitors and agonists are working in the treatment paradigm are not provided. For example, Csf1rMer-iCre-Mer Tgfbr2fl/fl mice are used but no deletion efficiency is shown or different time points of analysis, maybe the macrophages are not properly targeted in the set up.

      15) The authors have shown the role of Csf1 and Tgfbr2 only for lining macrophages, is this specific in the joint to this population of are subliming macrophages affected in a similar manner.

      16) Can the authors confirm their results in CSF1R-FIRE mice with anti-Csf1 injections or in Csf1op/op mice?

      17) The setup in Figure S5G is very interesting to test the role of movement and mechanical load on the joint, however, there is basically no data on the model provided showing the efficiency of the induced optogenetic muscle contractions, and only one time point is shown.

      18) The results regarding the role of Piezo1 and mechanosensation vary a lot. Could it be that analyses were done too early or that actually proper weight load on the joint must be applied for the maturation of the macrophages? The authors should test this to

      19) The Rolipram experiment is shown in Figure S5G, but is not described in the result section. It only appears at some point in the discussion part. The authors should move it to results or remove it from the manuscript.

      Minor points:

      1) Please reference the Figure panels in numeric order throughout the text.

      2) Figure 2a and 2b are a bit out of the storyline, it is not obvious why this is shown here and maybe it would be good to move it to the supplements. Gating strategy is also not used for scRNA-seq. Therefore, it would better fit to the later analysis of joint macrophages across different transgenic mouse models and treatment paradigms. The gating strategies are changing across different experiments throughout the figures, it would be nice to have a similar gating strategy for all experiments, see also Figure 3 where the defining markers for joint macrophages are changing between models.

      3) A lot of figure panels have very small labeling that is basically unreadable. Axes at FACS plots for example. Sometimes, it is even impossible to distinguish cluster labels especially when they have similar colors.

      4) In the text on page 14, many markers are named which are specifically regulated during development in lining macrophages, but these factors are not labeled anywhere in the volcano plot. It would be good to showcase at least some of these named genes in the figure panel, e.g. Trem2.

      5) Figure 2F and Figure S2F are really nicely showing the percentage of cells per cluster in each analyzed biological sample. Maybe the authors could additionally consider to show a stacked bar plot with the mean percentage of cells per cluster and how the clusters are distributed across time points?

      6) Figure 3A: IF for adult lining macrophages and the quantification are missing

      Significance

      This manuscript highlights novel, highly relevant findings on lining macrophage biology and the authors employ a wide range of different technologies to carefully dissect the postnatal development of lining macrophages. Furthermore, this study showcases in a very elegant and detailed way the adaptation of macrophage progenitors to a highly specific anatomical tissue niche.

      The manuscript is of high interest to basic scientists focussing on macrophage biology and immune cell development and clinicians and clinician scientists focussing on joint diseases such as RA

      Therefore the manuscript is of interest to a wide community working in immunology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their manuscript entitled "The synovial lining macrophage layer develops in the first weeks of life in a CSF1- and TGFβ-dependent but monocyte-independent process," the authors explore the developmental trajectory of synovial lining macrophages. They demonstrate that the formation of this specialized macrophage layer is age-dependent and governed by a distinct developmental program that proceeds independently of circulating monocytes. Through scRNA-Seq, the authors show that synovial lining macrophages originate locally from Aqp1⁺ macrophages and are marked by the expression of Csf1r, Tgfbr, and Piezo1. Notably, genetic ablation of each of these factors impaired the development of lining macrophages to varying degrees, suggesting differential contributions of CSF1, TGFβ, and PIEZO1 signaling pathways to their maturation and maintenance.

      The manuscript is well written, and the data quality and representation is of a high standard. The authors have employed a sophisticated array of state-of-the-art mouse models and cutting-edge technologies to elucidate the developmental origin of synovial lining macrophages. Notably, the supporting scRNA-Seq datasets are of excellence and provide valuable insights that will likely be of significant interest to researchers in the field of immunology and joint biology. Accordingly, the experimental approach and interpretations regarding macrophage origin are well-founded and compelling. However, in the eye of the reviewer, the section addressing the underlying molecular mechanisms is a bit less convincing. This part of the study appears slightly underdeveloped, and some of the mechanistic claims lack sufficient experimental clarity. A more rigorous experimental investigation would be essential to reinforce the manuscript's conclusions, particularly concerning the data related to Tgfbr and Piezo1, where the current evidence appears insufficiently substantiated.

      Major point:

      • The numbers of VSIG4⁺ macrophages appear either unaffected or only minimally altered in both Csf1rMerCreMer Tgfbr2floxed and Fcgr1Cre Piezo1floxed mouse models, respectively. This raises an important question: was the gene deletion efficiency sufficient in each model? Accordingly, the authors are encouraged to include quantitative data on gene deletion efficiency for both mouse models, as this information is critical for interpreting the observed phenotypic outcomes and validating the conclusions regarding gene function. Furthermore, to better assess the impact of Tgfbr2 and Piezo1 disruption, the authors should provide more comprehensive flow cytometry analyses and histological data for these mouse models. Given the apparent homogeneity of VSIG4⁺ macrophages (as shown by the authors themselves), bulk RNA-Seq of sorted Tgfbr2- and Piezo1-deficient VSIG4⁺ macrophages (or from TGFβ-treated animals) would offer valuable insights into both the effectiveness of gene deletion and the molecular pathways governed by TGFβ and PIEZO1 in lining macrophages.

      Minor points:

      • Consistent usage of Cx3cr1-GFP+ nomenclature (for instance: Fig. S1 legend "adult mouse synovial tissue, showing PDGFRα⁺ fibroblasts (yellow) and CX3CR1-GFP⁺ cells (cyan)." versus Fig. 1 legend "Automated spot detection highlights Cx3cr1-GFP⁺ macrophages)"
      • Unclear Fig. 3 legend: "Representative immunofluorescence images of synovial tissue from Clec9aCre:Rosa26lsl-tdT mice at 3 weeks and in adulthood, showing and tdTomato (yellow) and stained for DAPI (blue), VSIG4 (cyan)" Check 'showing and tdTomato.'
      • For greater clarity, it would have been helpful if the transcript names had been directly included within Figures 3C, S3A, and S3C.
      • Page 24: "(Mki67CreERT2:Rosa26lsl-tdT)" Last bracket not superscript.
      • Page 25: "we again leveraged our scRNAsequencing dataset" Missing punctuation.
      • Page 27: Fig. 5C legend: " of synovial tissue of 1 week-old, 3 weeks-old and adult mice." Please specify and change to 'adult Csf1rΔFIRE/ΔFIRE mice'.
      • Page 30: The outcome observed in the Acta1-rtTA:tetO-Cre:ChR2-V5fl mouse model appears to be inconclusive: "This approach resulted in an increased density of VSIG4+ and total (F4/80+) macrophages in the exposed leg of some 5 days-old pups, but others showed the opposite trend (Figure S5D)." This variability may reflect low efficiency of the model or other technical limitations (e.g. muscle contractions frequency or time point of analysis). Given this ambiguity, it is worth reconsidering whether the data are sufficiently robust to warrant inclusion. Should the authors choose to include these findings, further experimentation of appropriate depth and precision is required to allow a conclusive interpretation (either it increases the density of VSIG4+ macrophages or not). The same applies to the Yoda1-treated mice, for which additional data are needed to determine whether VSIG4⁺ macrophage density is truly affected.

      Significance

      • General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed?

      This is a well-designed study that uses cutting-edge methodologies to investigate the developmental trajectory of synovial lining macrophages under homeostatic conditions. The authors present robust experimental evidence and compelling interpretations concerning synovial macrophage origin, which are both well-substantiated and impactful. Nonetheless, from the reviewer's perspective, the section exploring the molecular mechanisms underlying macrophage differentiation is comparatively less convincing. This section appears somewhat underdeveloped, as some of the mechanistic claims lack sufficient depth and experimental rigor to fully substantiate the conclusions. - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field:

      In contrast to earlier studies (PMID: 31391580, 32601335), the inclusion of fate-mapping experiments adds an important dimension, offering novel insight into the ontogeny of synovial macrophages. This expanded perspective may prove particularly valuable in advancing our understanding of joint immunology, especially regarding the local origins and lineage relationships of macrophage populations. Furthermore, the authors present novel insights into the molecular pathways underlying the differentiation and development of synovial lining macrophages. By demonstrating previously unrecognized regulatory mechanisms, this work significantly deepens our understanding of the cellular and transcriptional programs that drive macrophage specialization within the joint microenvironment. -Place the work in the context of the existing literature (provide references, where appropriate):

      This study builds upon previous work characterizing the macrophage compartment in the joint (PMID: 31391580, 32601335), yet provides a substantially more comprehensive dataset that spans multiple developmental time points and data on the origin of this specialized macrophage subset. - State what audience might be interested in and influenced by the reported findings:

      Immunologist, clinicians - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      This study falls well within the scope of the reviewer's expertise in innate immunity.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We would like to thank the reviewers for their overall positive evaluations of our manuscript and for their invaluable suggestions that will allow us to reinforce our conclusions. We acknowledge that there is some work to be done and are ready to address most of the reviewers' comments as detailed in our replies below.

      Reviewer #1

      1. The findings that mmDicer is proviral in bat cells relies exclusively on the observation that the depletion of Dicer in M. myotis cells leads to a reduced accumulation of SFV and SINV at the RNA and protein levels (figure 2). Heterologous expression of mmDicer in HEK 293T NoDice doesn't lead to an increase permissivity to viral infections (figure 1) and the accumulation of Dicer foci is only observed in M. myotis cells but not when mmDicer is expressed in HEK 293 NoDice cells (figure 6). Given that the key finding of this manuscript relies on these knockdown experiments, the authors should ensure that the impact on viral infections is due to the specific silencing of mmDicer and not caused by off-target effects of their siRNA-mediated approach. The authors designed a siRNA pool to efficiently knock-down mmDicer. They should validate their findings by using individual Dicer siRNA and verify whether the decrease SFV/SINV accumulation is observed with at least two individual siRNAs targeting Dicer. It would also strengthen their findings if they could show a complementation experiment in which a mmDicer (designed to not be affected by the siRNA-mediated silencing) is introduced exogenously in Dicer-depleted cells and show that it rescues the observed decrease in viral accumulation to demonstrate that the proviral role is strictly dependent on mmDicer. Alternatively, the authors could consider a CRISPR/Cas9 genome editing approach to knockout Dicer in bat cells to test whether this proviral effect is confirmed.

      Reply: We agree with this reviewer that it is important to provide evidence for the specificity of the knock-down and to rule out any off-target effect of the siRNAs. This is the reason for using the siTool technology, which relies on the use of a pool of 30 siRNAs that are transfected at a final concentration of 3 nM. This means that each individual siRNA in the pool is at a concentration of 0.1 nM, so the possibility of off-target effect is largely avoided and the efficiency of silencing is boosted by the cooperative activity of many siRNAs (see https://www.sitoolsbiotech.com/documents/sipools/siPOOLBrochure2019_Web.pdf for more details). This being said, we agree that it would be better to confirm that the observed effect can be recapitulated using a single siRNA and that a complementation experiment would definitely strengthen our findings. For this reason, we will test two individual siRNAs targeting the 3' UTR of mmDicer, which will allow us to complement the knock-down by transfecting a cDNA construct. Regarding the CRISPR/Cas9 genome editing approach, we will give it a try, but Dicer is notoriously difficult to knock-out, so we cannot be sure that this will be successful.

      Figure 2: the authors knock-downed Dicer in M. myotis nasal epithelial cells and carried out infections with SINV-GFP and SFV. The authors conclude that Dicer is proviral as its depletion causes a decrease in SINV-GFP and SFV accumulation. While this conclusion is supported by the decrease levels of viral RNA and protein levels upon Dicer depletion (figure 2D, 2E, 2G), the effect on the viral titers is non-significant for both viruses (Figure 2C and 2F) based on the statistical analysis. This reviewer appreciates that the titers are lower upon Dicer knockdown, which support the authors' findings at the viral RNA and protein levels. However, as these results are central to the core message of the manuscript, the authors should provide evidence that this proviral effect observed is statistically significant on viral titers by perhaps providing additional repeats and/or comment on this observation.

      Reply: Indeed, we agree that even if the effect of Dicer knockdown results in a lowering of the viral titer, it would be better to have a statistically significant effect. We will repeat the experiment to increase the number of replicates and the power of the statistical test.

      a) *In figure 4 and 5, the authors nicely show that mmDicer accumulate to cytoplasmic foci in M. myotis cells upon infection with SFV and SINV and these foci co-localise with double-stranded RNA. The authors used a commercial polyclonal antibody against Dicer (A301-937A, Bethyl according to the Material and Methods section) which is specific to human Dicer to carry out their immunostaining in bat cells. The authors should provide evidence that this antibody indeed recognises/crossreacts with mmDicer as well and that the staining shown is indeed specific to mmDicer localisation especially because the heterologous expression of HA-tagged version of mmDicer in HEK 293T NoDice cells did not show this accumulation of cytoplasmic foci. The authors should verify the specificity of their mmDicer immunostaining by performing the same labelling in bat cells in which Dicer is knock-downed (or knock out) by individual and validated siRNA against mmDicer. The decrease signal of bat Dicer staining using the anti-human Dicer antibody would indicate specificity. *

      Reply: the reviewer is correct in its assertion and it is important to provide evidence that the protein that is detected by the anti-human Dicer antibody in bat cells is indeed Dicer. We will perform the suggested experiment and do an immunostaining using the Dicer antibody in bat cells upon Dicer knockdown.

      b) Another complementary approach would be to test their Dicer staining between HEK NoDice cells (no Dicer present) versus NoDice complemented with either mmDicer or human Dicer constructs, which would then indicate how much the anti-human Dicer antibody recognises bat Dicer.

      Reply: this complementary approach should yield even cleaner result than the previous one as there will be no expression of Dicer at all in the HEK NoDice cells. Therefore, we should be able to measure the increase of signal in the IF upon expression of either human or bat Dicer. We will perform this experiment together with the other one suggested above. In addition, since the constructs are tagged, we might be able to do a double-staining and verify the colocalization of the two signals.

      c) In addition, the authors should overexpress HA-tagged mmDicer in M. myotis nasal epithelial cells and test whether HA-mmDicer accumulate into foci upon infection using an anti-HA immunostaining. This would confirm that these accumulation into foci indeed is specific to mmDicer but also would reinforce the authors' findings that host factors within bat cells are important for this formation into foci since mmDicer expression in HEK 293T No Dice cells didn't show this phenotype upon infection (figure 6). OPTIONAL: it would be interesting to overexpress HA-tagged human Dicer into M. myotis nasal epithelial cells as well to then test using anti-HA staining whether human Dicer in presence of host factors from the bat can accumulate into cytoplasmic foci or not upon viral infection.

      Reply: we could perform the suggested experiment, but we might face the issue that transfected cells might mount an immune response, which makes them resistant to the infection. We have observed indeed that we needed to use a higher MOI to infect cells after they have been transfected. Since we will have controls in place, this might not be too much of a problem, but we will have to keep it in mind. Alternatively, we will perform a lentiviral transduction of the cells.

      This reviewer appreciates that this might be judged as beyond the scope of this study since it is focused on the role of Dicer in M. myotis. However, the observation that mmDicer accumulates into foci containing as well viral dsRNA is very interesting and it would significantly improve the manuscript if the authors would provide further indications that this phenotype is related to the lack of antiviral activity of mmDicer compared to what has been previously shown in other bat species (P.alecto and T. brasiliensis). In other words, is this accumulation of mmDicer into foci responsible for its different impact on virus infection? It would therefore be insightful to compare Dicer localisation upon infection in M. myotis versus P.alecto and/or T. brasiliensis bat cells in which Dicer was shown to be antiviral and test whether this accumulation in foci is only observed in bat cells in which Dicer is proviral (M. myotis) but not in the other bat cells in which Dicer is antiviral (P.alecto and/or T. brasiliensis).

      Reply: this is something that we have been wondering about and we have therefore started to look for the cell lines that have been described in the two published studies. While it proved difficult to find the PaKi cells from P. alecto bats, which is not commercially available, we have obtained the Tblu cells from T. brasiliensis and will look at Dicer localization in this model. However, we have to pay attention to the fact that the published data reported a contribution of RNAi in this cell line upon SARS-CoV-2 infection and that we will be using SINV. In addition, we do not know yet whether the anti-Dicer antibody will cross react with the T. brasiliensis Dicer protein.

      OPTIONAL: Given the difference between the provial role of mmDicer compared to the antiviral activity of Dicer in cells from P.alecto and T. brasiliensis bat cells, it would strengthen the authors' findings. if additional experiments would be conducted in parallel using M. myotis, P.alecto and/or T. brasiliensis cells. Notably knocking down Dicer in both M. myotis, P.alecto and/or T. brasiliensis cells, compare the impact on viral infections with SINV, SFV, VSV and correlate any observed difference in phenotype with putative variations in the formation of foci.

      Reply: it would indeed be really nice to be able to do the Dicer knockdown experiment in several bat cell lines and to correlate the phenotype with the formation of foci. This experiment might take a long time and we are not sure to be able to realize it in a reasonable amount of time. It could however be the subject of another manuscript further down the line.

      *Minor comments *

        • Figure 2I: The authors performed a knockdown of Dicer in M. myotis nasal epithelial cells and monitor the impact on VSV-GFP infection. They found that knocking down Dicer leads to an increase in GFP protein and RNA levels suggesting an antiviral role of Dicer while, in contrast, no effect is observed on the production of infectious particles (figure 2H). On the western blot there is only a slight/weak increase of GFP protein level observed upon Dicer knockdown. Yet, the quantification of the band intensity shows a 4-fold increase relative to tubulin and compared to cells treated with siRNA control. This 4-fold increase seems exaggerated given the low increase in the intensity shown on the blot. This discrepancy is most likely due to the lower intensity of tubulin in the western blot analysis of siDicer-treated cells compared to siNeg-treated cells. The authors should reload their western blot with equal amount of protein extract loaded to ensure that the results shown on the western blot are in line with the quantification.*

      Reply: the signal quantification for this experiment was done across several replicates, but we agree that the observed effect seems exaggerated when compared to the signal seen on the blot. We observed important variations between replicates, but we will make sure that this was not due to a problem in the analysis and reload the western blot if needed.

        • Figure 3D: the authors mention that in both HEK293T cells and M. myotis nasal epithelial cells infected with SINV-GFP, there was an enrichment of 22-nucleotides (nt) paired positive and negative sense reads that overlapped with a 2-nt overhang, typical of Dicer cleavage. In Figure 3D, the data shows indeed that the duplexes are enriched for reads of 22-nt but it is unclear how this analysis reveals a 3' 2nt overhang within these duplexes. Can the authors clarify this point and if the data provided in that particular analysis indeed doesn't allow to detect these overhangs, please rephrase accordingly or provide additional analysis to support that point. *

      Reply: In Figure 3D, the graphs show the probability of pairing of all 22 nucleotides sequence mapping either to the plus or the minus strand of the viral RNA. Thus, for each sequence mapping to the plus strand, the number of sequences mapping to the minus strand with a full or partial overall is counted. A corresponding probability of pairing and Z score is calculated for each number of overlapping nucleotides (for more information on the calculation see Antoniewski (2014) Computing siRNA and piRNA Overlap Signatures. In Animal Endo-SiRNAs: Methods and Protocols, Werner A (ed) pp 135-146. New York, NY: Springer). The Z score peaks for an overlap of 20 nt in both HEK293T and M. myotis nasal epithelial cells infected with SINV. This means that there is a higher probability of two 22 nt sequence to pair along 20 nt, and thus that there are two unpaired nucleotides at the extremities of the duplexes. This higher Z score at 20 nt is not seen in VSV-infected cells. We will rephrase the text in the manuscript to make this point clearer.

        • Typo: page 5, line 152: the authors mention that Dicer knock down had an antiviral effect against VSV-GFP infection at the RNA and protein levels. However, the data in Figure 2I and 2J show an increase in both GFP RNA and proteins levels upon knockdown of Dicer. Although this data suggests that Dicer is antiviral against VSV, the knockdown of Dicer itself is not antiviral but rather proviral/increase virus accumulation. Please rephrase this sentence to avoid confusions. *

      Reply: thank you for spotting this typo. We have corrected it accordingly.

      Reviewer #2.

      1. Figure 1 relies on transduction of cells and antibiotic selection to obtain mmDicer-expressing cells. Although we would expect that every cell expresses the construct of interest, this is not always the case, depending on the cell type and toxicity of the construct. As the constructs are tagged, I suggest that the authors use flow cytometry to measure expression levels in a single cell manner. While doing so, they can infect with SINV-GFP and correlate GFP signal with construct expression in each cell, providing a more accurate measurement of mmDicer effect on viral infection. Alternatively, the authors could use live microscopy, as done in Fig 2, to obtain similar data.

      Reply: the reviewer is correct that we did not go for monoclonal selection of our mmDicer-expressing cells and therefore that there could be some cell-to-cell variation in expression. However, we have done immunostaining of Dicer in these cells and did not see drastic differences in expression, so we do not think this should impact SINV-GFP expression in a major way. We will provide these images and a quantification of the Dicer signal as a supplementary figure.

      For Fig 1C and 1F, it would be great to have growth curves with two different MOIs, instead of a single time point, to ensure that a putative antiviral effect is not missed. Same goes for Fig 2C, especially when the authors document quite a big defect on GFP expression (a proxy for SINV infection) when Dicer is knocked down (Fig 2B). There may be a bigger difference in titers at earlier time points. This matter runs throughout the manuscript. I do not suggest that the authors should provide growth curves every time viral titers are measured, but it is still worth doing it for the 2-3 key experiments of the paper.

      Reply: we will perform growth curves of virus infection for the key experiments in the manuscript as suggested. We already have done kinetic measurements of GFP accumulation at different MOIs, which we can provide as supplementary data, but we agree with the reviewer that GFP signal should not been used as the only proxy for the infection and that measuring viral titers by plaque assay is important as well.

      Figure 4, could the authors provide a proof that the Dicer antibody is specific in the bat context? This can be done by staining Dicer in bat cells knocked down for Dicer and infected with SINV. The apparition of foci upon anti-Dicer antibody staining should be abbrogated or severely impaired by the knock-down.

      Reply: see our reply to point 3 of Reviewer 1.

      Fig 5C, please provide a quantification of the images.

      Reply: these microscopy images have not been quantified because they have been obtained with an epifluorescence microscope. Indeed, the Pearson correlation coefficient can only be obtained using a confocal microscope. In fact, we have tried to use a confocal microscope to take pictures of these FISH images, but the SINV gRNA signal was too weak or the dots too small to be properly visualized. Furthermore, there is a very large difference in signal intensity between HEK293T and M. myotis cells, making it difficult to define a signal threshold compatible for both cell lines.

      l.263, when comparing this work with the recent publications on bat antiviral RNAi, the authors could also provide the percentage identity between Dicers from different species.

      Reply: this is a valid point, we have looked at the percentage identity between Dicer proteins from different bat species but we did not include this in our manuscript. We will provide this analysis in the revised version together with a comparison of Dicer from other mammals as a reference point.

      Reviewer 3.

        • Without direct comparison to the other bat species Dicers (especially where RNAi activity has been suggested as antiviral in previous publications) there is little in this paper that can be concluded about global aspects of bat dicer/RNAi.*

      Reply: see our reply to point 4 of Reviewer 1. We are planning to look at least in Tblu cells whether there is also a relocalization of Dicer upon SINV infection. So far, we could not obtain PaKi cells, but we are still looking and should we get those, we will test them as well.

      *Minor *

      What rules out that the mmDicer re-localization observed in the immortalized mm nasal epithelial is due simply to greater expression levels over the NoDice cells heterologously expressing mmDicer?

      Reply: we will provide an immunoblot to show the level of Dicer expression between HEK NoDice + mmDicer and M. myotis nasal epithelial cells as suggested below to address this point.

      • Although partially addressed in the text stating the generally long half-life of miRNAs, it seems the simplest explanation for this observation is due to some activity of a shorter-lived miRNA is required for optimal alphavirus replication is the mm nasal epithelial cells. *

      Reply: this is an interesting hypothesis that would prove difficult to test in a reasonable amount of time. We thank the reviewer and will mention this possibility in the discussion of the revised manuscript.

      *Suggestions that could enhance the magnitude of conclusions that can be drawn from this work. *

      *Major *

        • Making NoDice cells expressing other bat species Dicers, including those with claims that RNAi is antiviral, would address how universal these current observations are to bats/cell lines.*

      Reply: this could be an alternative to the use of P. alecto or T. brasiliensis cell lines that we have mentioned above. We will try to clone Dicer from the Tblu cells that we have in the laboratory. Since we do not have PaKi cells at the moment, it will be more complicated for the Pteropus Dicer, but one possibility could be to synthesize it. However, Dicer is a big gene so it could prove tricky.

        • Including an immunoblot showing that mm cells express mmDicer no more abundantly than the heterologous NoDice cells would allow ruling out the trivial explanation that foci occur at a certain critical mass of Dicer*

      Reply: yes, we will provide this piece of data as stated in reply to point 2.

      *Minor *

        • I believe line 151 " In contrast, Dicer * *knock down had an ANTIVIRAL effect against VSV-GFP infection at the RNA and protein *

      *levels, but no difference in titers was found (Fig. 2H-J)." should be " In contrast, Dicer *

      *knock down had an PROVIRAL effect against VSV-GFP infection at the RNA and protein *

      *levels, but no difference in titers was found (Fig. 2H-J)." *

      Reply: thank you for spotting this error, which was also mentioned by Reviewer 1, we have corrected this in the text.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript by Gaucherand and colleagues, the authors demonstrate that heterologous expression of Myotis myotis Dicer into 293 derivative Dicer KO cells did not produce antiviral effects. The authors further demonstrate that knockdown of Dicer in SV40 immortalized M myotis nasal epithelial cells results in reduced alphavirus infection. Finally, they show a correlation where mmDicer changes subcellular localization co-localizing with likely alphavirus replication foci. The manuscript is clearly written, and the conclusions drawn as stated are accurate.

      Strengths

      • This is an overall topical area of research: how bat antiviral responses differ from other mammals - - with enormous general interest in host-pathogen interfaces, and particular relevance to the role of RNAi.
      • The manuscript is clearly written and does not overstate the conclusions.
      • The team are well-qualified experts in this area with an excellent track record of findings from the Pfeffer lab in the years preceding this work

      Critiques

      Major

      1. Without direct comparison to the other bat species Dicers (especially where RNAi activity has been suggested as antiviral in previous publications) there is little in this paper that can be concluded about global aspects of bat dicer/RNAi. Minor
      2. What rules out that the mmDicer re-localization observed in the immortalized mm nasal epithelial is due simply to greater expression levels over the NoDice cells heterologously expressing mmDicer?
      3. Although partially addressed in the text stating the generally long half-life of miRNAs, it seems the simplest explanation for this observation is due to some activity of a shorter-lived miRNA is required for optimal alphavirus replication is the mm nasal epithelial cells.

      Suggestions that could enhance the magnitude of conclusions that can be drawn from this work.

      Major

      • Making NoDice cells expressing other bat species Dicers, including those with claims that RNAi is antiviral, would address how universal these current observations are to bats/cell lines.
      • Including an immunoblot showing that mm cells express mmDicer no more abundantly than the heterologous NoDice cells would allow ruling out the trivial explanation that foci occur at a certain critical mass of Dicer Minor
      • I believe line 151 " In contrast, Dicer knock down had an ANTIVIRAL effect against VSV-GFP infection at the RNA and protein levels, but no difference in titers was found (Fig. 2H-J)." should be " In contrast, Dicer knock down had an PROVIRAL effect against VSV-GFP infection at the RNA and protein levels, but no difference in titers was found (Fig. 2H-J)."

      Significance

      As written, this work would be significant to aficionados of bat RNAi. With a little extra work, this could have broader significance regarding more global aspect of Dicer in the the bat antiviral response.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study by the Pfeffer lab interrogates the role of Dicer during RNA virus infection in bats. This is an interesting and important topic, as bats are well-documented to be a reservoir of viruses that can target humans. The field of bat immunology is gaining momentum, but there is still a lot to be done. This study is thus particularly timely. It also explores more of a niche pathway when it comes to immunity: antiviral RNAi and its entry point, Dicer. This work comes after two recent studies, cited by the authors (Dai 2024, Owolabi 2025), that also explore this concept. Here though, the Pfeffer lab comes to an opposite conclusion, as their results advocate against the existence of antiviral RNAi in bats. As discussed by the authors, discrepancies between their study and the two others may be linked to differences in experimental systems. It nonetheless brings a novel, interesting take on the topic of Dicer & antiviral RNAi in bats, and will be of interest to immunologists and virologists. Altogether, I find the manuscript well-written and clear. Experiments are to the point and well interpreted. Below are a few suggestions that will help bolster the authors' conclusions.

      Figure 1 relies on transduction of cells and antibiotic selection to obtain mmDicer-expressing cells. Although we would expect that every cell expresses the construct of interest, this is not always the case, depending on the cell type and toxicity of the construct. As the constructs are tagged, I suggest that the authors use flow cytometry to measure expression levels in a single cell manner. While doing so, they can infect with SINV-GFP and correlate GFP signal with construct expression in each cell, providing a more accurate measurement of mmDicer effect on viral infection. Alternatively, the authors could use live microscopy, as done in Fig 2, to obtain similar data.

      For Fig 1C and 1F, it would be great to have growth curves with two different MOIs, instead of a single time point, to ensure that a putative antiviral effect is not missed. Same goes for Fig 2C, especially when the authors document quite a big defect on GFP expression (a proxy for SINV infection) when Dicer is knocked down (Fig 2B). There may be a bigger difference in titers at earlier time points. This matter runs throughout the manuscript. I do not suggest that the authors should provide growth curves every time viral titers are measured, but it is still worth doing it for the 2-3 key experiments of the paper.

      Figure 4, could the authors provide a proof that the Dicer antibody is specific in the bat context? This can be done by staining Dicer in bat cells knocked down for Dicer and infected with SINV. The apparition of foci upon anti-Dicer antibody staining should be abbrogated or severely impaired by the knock-down.

      Fig 5C, please provide a quantification of the images.

      l.263, when comparing this work with the recent publications on bat antiviral RNAi, the authors could also provide the percentage identity between Dicers from different species.

      Significance

      This study by the Pfeffer lab interrogates the role of Dicer during RNA virus infection in bats. This is an interesting and important topic, as bats are well-documented to be a reservoir of viruses that can target humans. The field of bat immunology is gaining momentum, but there is still a lot to be done. This study is thus particularly timely. It also explores more of a niche pathway when it comes to immunity: antiviral RNAi and its entry point, Dicer. This work comes after two recent studies, cited by the authors (Dai 2024, Owolabi 2025), that also explore this concept. Here though, the Pfeffer lab comes to an opposite conclusion, as their results advocate against the existence of antiviral RNAi in bats. As discussed by the authors, discrepancies between their study and the two others may be linked to differences in experimental systems. It nonetheless brings a novel, interesting take on the topic of Dicer & antiviral RNAi in bats, and will be of interest to immunologists and virologists. Altogether, I find the manuscript well-written and clear. Experiments are to the point and well interpreted. Below are a few suggestions that will help bolster the authors' conclusions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Bats acts a reservoir for many viruses. While some of these viruses can be pathogenic for humans and other animals, infected bats tolerate these viruses and show little to no pathogenesis. It is therefore key to characterise which immune pathways are active in bats and how do they differ from other mammals to understand how bats can sustain these virus infections. RNA interference (RNAi) acts as an antiviral mechanism in plants, invertebrates and was recently shown to be active in a cell type-dependent manner as a defence mechanism in mammals. Notably, recent findings show that antiviral RNAi activity is high in cells lines from two bats species (P.alecto and T. brasiliensis) and that this pathway might play an important role in bat viral tolerance. In this study, the authors investigate the antiviral role of Dicer in another bat species, Myotis myotis. First they express M. myotis Dicer (mmDicer) or human Dicer (hDicer) in a human epithelial kidney (HEK) 293T cell line knockout for Dicer (NoDice cells) and show that, in a human cell line, expression of mmDicer or hDicer doesn't restrict infections with either Sindbis virus (SINV) or vesicular stomatitis virus (VSV). The authors then tested the role of endogenous bat Dicer in M. myotis nasal epithelial cells and found that mmDicer has a proviral activity since its knockdown reduced the replication of SINV and Semliki Forest virus (SFV), but not of VSV. The authors also show by small RNA deep sequencing analysis that there was only a modest RNAi signature in both HEK293T and M. myotis infected with SINV suggesting that mmDicer does not have increased RNAi activity compared to human cells. Interestingly, the authors then found that in M. myotis cells infected with SINV, SFV but not VSV, mmDicer accumulates into cytoplasmic foci, which also contain double-stranded RNA (dsRNA) derived from viral replication. Finally, the authors showed that this relocalisation of mmDicer into foci was dependent on host factors from M. myotis cells as there was no change in localisation in SINV-infected HEK 293T NoDice cells complemented with mmDicer.

      Major comments

      • The findings that mmDicer is proviral in bat cells relies exclusively on the observation that the depletion of Dicer in M. myotis cells leads to a reduced accumulation of SFV and SINV at the RNA and protein levels (figure 2). Heterologous expression of mmDicer in HEK 293T NoDice doesn't lead to an increase permissivity to viral infections (figure 1) and the accumulation of Dicer foci is only observed in M. myotis cells but not when mmDicer is expressed in HEK 293 NoDice cells (figure 6). Given that the key finding of this manuscript relies on these knockdown experiments, the authors should ensure that the impact on viral infections is due to the specific silencing of mmDicer and not caused by off-target effects of their siRNA-mediated approach. The authors designed a siRNA pool to efficiently knock-down mmDicer. They should validate their findings by using individual Dicer siRNA and verify whether the decrease SFV/SINV accumulation is observed with at least two individual siRNAs targeting Dicer. It would also strengthen their findings if they could show a complementation experiment in which a mmDicer (designed to not be affected by the siRNA-mediated silencing) is introduced exogenously in Dicer-depleted cells and show that it rescues the observed decrease in viral accumulation to demonstrate that the proviral role is strictly dependent on mmDicer. Alternatively, the authors could consider a CRISPR/Cas9 genome editing approach to knockout Dicer in bat cells to test whether this proviral effect is confirmed.
      • Figure 2: the authors knock-downed Dicer in M. myotis nasal epithelial cells and carried out infections with SINV-GFP and SFV. The authors conclude that Dicer is proviral as its depletion causes an decrease in SINV-GFP and SFV accumulation. While this conclusion is supported by the decrease levels of viral RNA and protein levels upon Dicer depletion (figure 2D, 2E, 2G), the effect on the viral titers is non-significant for both viruses (Figure 2C and 2F) based on the statistical analysis. This reviewer appreciates that the titers are lower upon Dicer knockdown, which support the authors' findings at the viral RNA and protein levels. However, as these results are central to the core message of the manuscript, the authors should provide evidence that this proviral effect observed is statistically significant on viral titers by perhaps providing additional repeats and/or comment on this observation.
      • In figure 4 and 5, the authors nicely show that mmDicer accumulate to cytoplasmic foci in M. myotis cells upon infection with SFV and SINV and these foci co-localise with double-stranded RNA. The authors used a commercial polyclonal antibody against Dicer (A301-937A, Bethyl according to the Material and Methods section) which is specific to human Dicer to carry out their immunostaining in bat cells. The authors should provide evidence that this antibody indeed recognises/crossreacts with mmDicer as well and that the staining shown is indeed specific to mmDicer localisation especially because the heterologous expression of HA-tagged version of mmDicer in HEK 293T NoDice cells did not show this accumulation of cytoplasmic foci. The authors should verify the specificity of their mmDicer immunostaining by performing the same labelling in bat cells in which Dicer is knock-downed (or knock out) by individual and validated siRNA against mmDicer. The decrease signal of bat Dicer staining using the anti-human Dicer antibody would indicate specificity. Another complementary approach would be to test their Dicer staining between HEK NoDice cells (no Dicer present) versus NoDice complemented with with either mmDicer or human Dicer constructs, which would then indicate how much the anti-human Dicer antibody recognises bat Dicer. In addition, the authors should overexpress HA-tagged mmDicer in M. myotis nasal epithelial cells and test whether HA-mmDicer accumulate into foci upon infection using an anti-HA immunostaining. This would confirm that these accumulation into foci indeed is specific to mmDicer but also would reinforce the authors' findings that host factors within bat cells are important for this formation into foci since mmDicer expression in HEK 293T No Dice cells didn't show this phenotype upon infection (figure 6). OPTIONAL: it would be interesting to overexpress HA-tagged human Dicer into M. myotis nasal epithelial cells as well to then test using anti-HA staining whether human Dicer in presence of host factors from the bat can accumulate into cytoplasmic foci or not upon viral infection.
      • This reviewer appreciates that this might be judged as beyond the scope of this study since it is focused on the role of Dicer in M. myotis. However, the observation that mmDicer accumulates into foci containing as well viral dsRNA is very interesting and it would significantly improve the manuscript if the authors would provide further indications that this phenotype is related to the lack of antiviral activity of mmDicer compared to what has been previously shown in other bat species (P.alecto and T. brasiliensis). In other words, is this accumulation of mmDicer into foci responsible for its different impact on virus infection? It would therefore be insightful to compare Dicer localisation upon infection in M. myotis versus P.alecto and/or T. brasiliensis bat cells in which Dicer was shown to be antiviral and test whether this accumulation in foci is only observed in bat cells in which Dicer is proviral (M. myotis) but not in the other bat cells in which Dicer is antiviral (P.alecto and/or T. brasiliensis).
      • OPTIONAL: Given the difference between the provial role of mmDicer compared to the antiviral activity of Dicer in cells from P.alecto and T. brasiliensis bat cells, it would strengthen the authors' findings. if additional experiments would be conducted in parallel using M. myotis, P.alecto and/or T. brasiliensis cells. Notably knocking down Dicer in both M. myotis, P.alecto and/or T. brasiliensis cells, compare the impact on viral infections with SINV, SFV, VSV and correlate any observed difference in phenotype with putative variations in the formation of foci.

      Minor comments

      • Figure 2I: The authors performed a knockdown of Dicer in M. myotis nasal epithelial cells and monitor the impact on VSV-GFP infection. They found that knocking down Dicer leads to an increase in GFP protein and RNA levels suggesting an antiviral role of Dicer while, in contrast, no effect is observed on the production of infectious particles (figure 2H). On the western blot there is only a slight/weak increase of GFP protein level observed upon Dicer knockdown. Yet, the quantification of the band intensity shows a 4-fold increase relative to tubulin and compared to cells treated with siRNA control. This 4-fold increase seems exaggerated given the low increase in the intensity shown on the blot. This discrepancy is most likely due to the lower intensity of tubulin in the western blot analysis of siDicer-treated cells compared to siNeg-treated cells. The authors should reload their western blot with equal amount of protein extract loaded to ensure that the results shown on the western blot are in line with the quantification.
      • Figure 3D: the authors mention that in both HEK293T cells and M. myotis nasal epithelial cells infected with SINV-GFP, there was an enrichment of 22-nucleotides (nt) paired positive and negative sense reads that overlapped with a 2-nt overhang, typical of Dicer cleavage. In Figure 3D, the data shows indeed that the duplexes are enriched for reads of 22-nt but it is unclear how this analysis reveals a 3' 2nt overhang within these duplexes. Can the authors clarify this point and if the data provided in that particular analysis indeed doesn't allow to detect these overhangs, please rephrase accordingly or provide additional analysis to support that point.
      • Typo: page 5, line 152: the authors mention that Dicer knock down had an antiviral effect against VSV-GFP infection at the RNA and protein levels. However, the data in Figure 2I and 2J show an increase in both GFP RNA and proteins levels upon knockdown of Dicer. Although this data suggests that Dicer is antiviral against VSV, the knockdown of Dicer itself is not antiviral but rather proviral/increase virus accumulation. Please rephrase this sentence to avoid confusions.

      Significance

      The findings from this study are interesting as they provide further insights into the role of RNAi towards virus infections. Notably, it highlights a putative proviral role of Dicer in M. myotis bat cells in contrast to the antiviral role in mammals (including other bat species) as well as in plants and invertebrates. Another exciting finding of this study is the observation that mmDicer accumulates in cytoplasmic foci upon viral infection and that these foci also contain viral dsRNA replication intermediates. These accumulation of Dicer into foci only appear in bat cells infected with viruses producing large amounts of dsRNA such as SFV and SINV but not with VSV infection where no dsRNA was detected.

      While these findings are novel and interesting, this study, as it stands, is rather descriptive and doesn't provide mechanistic insights into the proviral activity of mmDicer and its localisation into cytoplasmic foci upon infections. The importance of the authors' findings would greatly improve if there were some experiments addressing whether this localisation of mmDicer into foci is responsible or at least correlate with its proviral activity/its lack of antiviral activity. Comparative studies between M. myotis cells in which Dicer is proviral and/or P.alecto and T. brasiliensis cells where RNAi was previously shown to be antiviral would likely provide key mechanistic insights.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02946

      Corresponding author(s): Margaret, Frame

      Roza, Masalmeh

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We thank the reviewers for recognizing the significance of our work and for their constructive feedback and suggestions, most of which we have implemented in our revised manuscript.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1

      Evidence, reproducibility and clarity

      Review of Masalmeh et al. Title: "FAK modulates glioblastoma stem cell energetics..."

      Previous studies have implicated FAK and the related tyrosine kinase PYK2 in glioblastoma growth, cell migration, and invasion. Herein, using a murine stem cell model of glioblastoma, the authors used CRISPR to inactivate FAK, FAK-null cells selected and cloned, and lentiviral re-expression of murine FAK in the FAK-null cells (termed FAK Rx) was accomplished. FAK-/- cells were shown to possess epithelial characteristics whereas FAK Rx cells expressed mesenchymal markers and increased cell migration/invasion in vitro. Comparisons between FAK-/- and FAK Rx cells showed that FAK re-expressed increased mitochondrial respiration and amino acid uptake. This was associated with FAK Rx cells exhibiting filamentous mitochondrial morphology (potentially an OXPHOS phenotype) and decreased levels of MTFR1L S235 phosphorylation (implicated in mito morphology fragmentation). Mito and epithelial cell morphology of FAK-/- cells was reversed by treatment with Rho-kinase inhibitors that also increased mito metabolism and cell viability. Last, FAK-dependent glioblastoma tumor growth was shown by comparisons of FAK-/- and FAK Rx implantation studies.

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      Some questions that would enhance potential impact. 1. Cell generation. Please describe the analysis of FAK-/- clones in more detail. The "low viability" phenotype needs further explanation with regard to clonal expansion and growth characteristics?

      Response:

      • We included a better description and a supplementary figure in our revised manuscript to indicate that we have examined several FAK -/- clones and confirmed that our observations were not due to clonal variation; multiple clones displayed similar morphological changes (Figure S1D). We also show that the elongated mesenchymal-like morphology was observed at 48 h after nucleofecting the cells with the FAK‑expressing vector, before beginning G418 selection to enrich for cells expressing FAK (Figure S1C). We also included experiments to acutely modulate FAK signalling (detaching and seeding cells on fibronectin) (Figure S2D, E, F and Figure S3) to exclude the possibility that the profound effects are due to protocols/selection we used for generating FAK-deleted cells.
      • Regarding the term "low viability", we have clarified in the text that there is no significant difference in cell number (Figure S1A) or 'cell viability' when it is assessed by trypan blue exclusion (a non-mitochondria-dependent read-out) (Figure S1B) between FAK-expressing FAK Rx and FAK-/- cells cultured for three days under normal conditions. Therefore, we agree the term 'cell viability' in this context could be confusing and have replace "cell viability" with "metabolic activity as measured by Alamar Blue." in Figure 1D and Figure 5B, and the corresponding text in the original manuscript. This wording more accurately reflects the data.

      Figure 1F: need further support of MET change upon FAK KO and EMT reversion.

      Response: We have added a heatmap (Figure S1E) illustrating the changes in protein expression of core-enriched EMT/MET genes products (by proteomics) after FAK gene deletion (EMT genes as defined in Howe et al., 2018) ; this strengthens the conclusion that the MET reversion morphological phenotype is accompanied by recognised MET protein changes.

      Fig. 2: Need further support if FAK effects impact glycolysis or oxidative phosphorylation in particular as implicated by the stem cell model.

      Response: We show that FAK impacts both glycolysis (Figure 2A, 2E, and 2F) and mitochondrial oxidative phosphorylation on the basis of the oxygen consumption rate (OCR) (Figure 2B, and 2D), showing both are contributing pathways to FAK-dependent energy production. We have clarified this in the text.

      Is there a combinatorial potential between FAKi and chemotherapies used for glioblastoma. Need to build upon past studies.

      Response: Yes, previous studies suggest that inhibiting FAK can sensitize GBM cells to chemotherapy (Golubovskaya et al., 2012; Ortiz-Rivera et al., 2023). We have included a paragraph in the discussion section to make sure this is clearer. Although it is not the subject of this study, we appreciate it is useful context.

      The notation of changes in glucose transporter expression should be followed up with regard to the potential that FAK-expressing cells may have different uptake of carbon sources and other amino acids. Altered uptake could be one potential explanation for increase glycolysis and glutamine flux.

      Response: We agree with the reviewer that glucose uptake could be contributing and we include data that 2 glucose transporters are indeed FAK-regulated namely Glucose transporter 1 (GLUT1, encoded by Slc2a1 gene) and Glucose transporter 3 (GLUT 3, encoded by Slc2a3 gene) (shown in Figure S2B and C).

      It would be helpful to support the confocal microscopy of mitos with EM.

      Response:

      We are concerned (and in our experience) that Electron microscopy (EM) may introduce artefacts during sample preparation. In contrast, immunofluorescence sample preparation is less susceptible to artefacts. The SORA system we used is not a conventional point-scanning confocal microscope, but is a super-resolution module based on a spinning disk confocal platform (CSU-W1; Yokogawa) using optical pixel reassignment with confocal detection. This method enhances resolution in all dimensions with resolution in our samples measured at 120nm. This has been instructive in defining a new level of changes in mitochondrial morphology upon FAK gene deletion.

      Lack of FAK expression with increased MTFR1 phosphorylation is difficult to interpret.

      Response: We do not directly show that this phosphorylation event is causal in our experiments; however, we think it important to document this change since it has been published that phosphorylation of MTFR1 has been causally linked to the mitochondrial morphology we observed in other systems (Tilokani et al., 2022).

      Need to have better support between loss of FAK and the increase in Rho signaling. Use of Rho kinase inhibitors is very limited and the context to FAK (and or Pyk2) remains unclear. Past studies have linked integrin adhesion to ECM as a linkage between FAK activation and the transient inhibition of RhoA GTP binding. Is integrin signaling and FAK involved in the cell and metabolism phenotypes in this new model?

      Response: To better support the antagonistic effect of FAK on Rho-kinase (ROCK) signalling, we included a new experiment in which the integrin-FAK signalling pathway has been disrupted by treating FAK WT cells with an agent that causes detachment from the substratum, Accutase, and growing the cells in suspension in laminin-free medium. We present ROCK activity data, as judged by phosphorylated MLC2 at serine 19 (pMLC2 S19), relating this to induced FAK phosphorylation at Y397 (a surrogate for FAK activity) that is supressed after integrin disengagement. These measurements have been compared with conditions whereby integrin-FAK signalling is activated by growing the cells on laminin coated surfaces. We observed a time-dependent decrease in pFAK(Y397) levels (normalised to total FAK) in suspended cells compared to those spread on laminin, while pMLC2(S19) levels increased in a reciprocal manner over time in detached cells relative to spread cells (S4A and B). There is therefore an inverse relationship between integrin-FAK signalling and ROCK-MLC2 activity, consistent with findings from FAK gene deletion experiments. In the former case, we do not rely on gene deletion cell clones.

      Significance

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      __Response: __

      Deleting the gene encoding FAK in mouse embryonic fibroblasts leads to elevated Pyk2 expression (Sieg, 2000). However, in the GBM stem cell model we used here, Pyk2 was not expressed (determined by both transcriptomics and proteomics). We have included Figure S1E to show that PYK2 expression was undetectable in FAK -/- and FAK Rx cells at the RNA level (Figure S1F). We conclude that there is no compensatory increase in Pyk2 upon FAK loss in these cells. In the transformed neural stem cell model of GBM, we do not consistently or robustly detect nuclear FAK.

      Review #2

      Masalmeh and colleagues employ a neural stem/progenitor cell-based glioma model (NPE cells) to investigate the role of Focal Adhesion Kinase (FAK) in GBM, with a focus on potential links between the regulation of morphological/adhesive and metabolic GBM cell properties. For this, the authors employ wt cells alongside newly generated FAK-KO and -reexpressing cells, as well as pharmacological interventions to probe the relevance of specific signaling pathways. The authors´ main claims are that FAK crucially modulates glioma cell morphology, cell-cell and cell-substrate interactions and motility, as well as their metabolism, and that these effects translate to changes to relevant in vivo properties such as invasion and tumor growth.

      My main issues are with the model chosen by the authors.

      As per the methods section, generation of FAK-KO and -"Rx" NPE cells entailed protracted selection/expansion processes, which may have resulted in inadvertent selection for cellular/molecular properties unrelated to the desired one (loss or gain of FAK expression) and which may have had cascading effects on NPE cells. The authors nonetheless repeatedly claim the parameters they quantify, such as mitochondrial or cytoskeletal properties or metabolic features, to have directly resulted from FAK loss or reintroduction. Examples of such causal inferences are to be found in lines 123, 134/135, 165, 181. Such causal claims are, in my view, unsupported.

      Acute perturbation of FAK expression/activity, genetically or pharmacologically, followed by a rapid assessment of the processes under investigation, would be needed to begin to assess causality, even if acute genetic perturbations may be technically challenging as sufficient gene expression reduction or restoration to physiologically relevant levels may be hard to achieve.

      Response:

      We would like to first comment on the model we used here, which we think will clarify the validity of our approach. The model is a transformed stem cell model of GBM that was published in (Gangoso et al., Cell, 2021) and is now used regularly in the GBM field. As mentioned in the response to Reviewer 1, we have added text (page 4 and 5 in the revised manuscript) and a new supplementary figure (Figure S1D) clarifying that the morphological changes we observed were consistent across multiple FAK -/- clones, showing this was not due to any inter-clonal variability. We also added images showing that the morphological changes were apparent at 48 h after nucleofecting FAK -/- cells with the FAK‑expressing vector specifically (not the empty vector), prior to starting G418 selection to enrich for FAK‑expressing cells (Figure S1C), addressing the worry that clonal variation and selection was the cause of the FAK-dependent phenotypes we observed. We believe that our model provides a type of well controlled, clean genetic cancer cell system of a type that is commonly used in cancer cell biology, allowing us to attribute phenotypes to individual proteins.

      We have also carried out a more acute treatment by using the FAK inhibitor VS4718 to perturb FAK kinase activity and assessed the effects on glycolysis and glutamine oxidation after 48h treatment (Figure S2D, E and F). We found that treating the transformed neural stem cells (parental population) with FAK inhibitor (300nM VS4718) decreases glucose incorporation into glycolysis intermediates and glutamine incorporation into TCA cycle intermediates, consistent with a role for FAK's kinase activity in maintaining glycolysis and glutamine oxidation.

      The employed pharmacological modulation of ROCK activity is the only approach that, given the presumably acute nature of the treatment, may have allowed the authors to probe the proposed functional links. The methods section of the manuscript does not however comprise details as to the duration of these treatments, which leaves open the possibility of long-term treatment having been carried out (data shown in Figure 5B refers to 72hr treatment).

      __Response: __

      We have added the duration of the treatment to the Methods section and Figure Legends, to clarify that cells were treated with ROCK inhibitors for 24h, before assessing the effects on mictochondria (Figure 4C, D, S4C and D) and glutamine oxidation (Figure 5A, and S5). For metabolic activity by AlamarBlue assay, cells were treated with ROCK inhibitors for 72h (Figure 5B).

      Even in the case of ROCK inhibitor experiments, it is however unclear if and how the effects on cell morphology and adhesion, mitochondrial organization and metabolic activity may be connected to each other and, if at all, to FAK expression.

      Given the above uncertainties due to the nature of the model and experimental approaches, it is hard to assess the reliability and thus the relevance of the findings.

      Response:

      FAK suppresses ROCK activity (as judged by pMLC2 S19, Figure 4A and B). Treating FAK -/- cells with two different ROCK inhibitors restored mesenchymal-like cell morphology, mitochondrial morphology and glutamine oxidation. As mentioned above, to strengthen our evidence for the antagonistic role of FAK in ROCK-MLC2 signalling, we have now introduced an experiment whereby integrin-FAK signalling was disrupted through treatment with a detachment agent (Accutase), and subsequently maintaining the cells in suspension in laminin-free medium. We assessed pMLC2 S19 levels (a measure of ROCK activity) relating this to FAK phosphorylation that is supressed after integrin disengagement. These results were evaluated relative to spread wild type cells growing on laminin where Integrin-FAK signalling was active (Figure S4A and B). We observed an inverse relationship between Integrin-FAK signalling and ROCK-MLC2 activity in keeping with our conclusions (Figure 4A and B).

      Experimental support for the ability of cell-substrate interaction modulation to concomitantly impact cellular metabolism and motility/invasion would be significant both in terms of advancing our understanding of glioma cell biology and of its translational potential, but the evidence being provided is at best compatible with the proposed model.

      Response: We carried out a new experiment to support the ability of cell-substrate interaction modulation to impact metabolism; specifically, we inhibited cell-substrate interactions by plating the cells on Poly-2-hydroxyethyl methacrylate (Poly 2-HEMA)-coated dishes. This suppressed FAK phosphorylation at Y397, as expected, with concomitant reduction in glutamine utilisation in the TCA cycle (Figure S3A, B and C).

      My background/expertise is in developmental and adult neurogenesis, in vivo modelling of gliomagenesis and cell fate control/reprogramming, with a focus on molecular mechanisms of differentiation and quantitative aspects of lineage dynamics; molecular details of the control of cellular metabolism, cell-cell adhesion and cytoskeletal dynamics are not core expertise of mine.

      We appreciate this reviewer's expertise are not necessarily in the cancer cell biology and genetic intervention aspects of our study. We hope that the explanations we have provided satisfy the reviewer that our conclusions are valid.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Masalmeh and colleagues employ a neural stem/progenitor cell-based glioma model (NPE cells) to investigate the role of Focal Adhesion Kinase (FAK) in GBM, with a focus on potential links between the regulation of morphological/adhesive and metabolic GBM cell properties. For this, the authors employ wt cells alongside newly generated FAK-KO and -reexpressing cells, as well as pharmacological interventions to probe the relevance of specific signaling pathways. The authors´ main claims are that FAK crucially modulates glioma cell morphology, cell-cell and cell-substrate interactions and motility, as well as their metabolism, and that these effects translate to changes to relevant in vivo properties such as invasion and tumor growth. My main issues are with the model chosen by the authors.

      As per the methods section, generation of FAK-KO and -"Rx" NPE cells entailed protracted selection/expansion processes, which may have resulted in inadvertent selection for cellular/molecular properties unrelated to the desired one (loss or gain of FAK expression) and which may have had cascading effects on NPE cells. The authors nonetheless repeatedly claim the parameters they quantify, such as mitochondrial or cytoskeletal properties or metabolic features, to have directly resulted from FAK loss or reintroduction. Examples of such causal inferences are to be found in lines 123, 134/135, 165, 181. Such causal claims are, in my view, unsupported. Acute perturbation of FAK expression/activity, genetically or pharmacologically, followed by a rapid assessment of the processes under investigation, would be needed to begin to assess causality, even if acute genetic perturbations may be technically challenging as sufficient gene expression reduction or restoration to physiologically relevant levels may be hard to achieve.

      The employed pharmacological modulation of ROCK activity is the only approach that, given the presumably acute nature of the treatment, may have allowed the authors to probe the proposed functional links. The methods section of the manuscript does not however comprise details as to the duration of these treatments, which leaves open the possibility of long-term treatment having been carried out (data shown in Figure 5B refers to 72hr treatment). Even in the case of ROCK inhibitor experiments, it is however unclear if and how the effects on cell morphology and adhesion, mitochondrial organization and metabolic activity may be connected to each other and, if at all, to FAK expression.

      Significance

      Given the above uncertainties due to the nature of the model and experimental approaches, it is hard to assess the reliability and thus the relevance of the findings.

      Experimental support for the ability of cell-substrate interaction modulation to concomitantly impact cellular metabolism and motility/invasion would be significant both in terms of advancing our understanding of glioma cell biology and of its translational potential, but the evidence being provided is at best compatible with the proposed model.

      My background/expertise is in developmental and adult neurogenesis, in vivo modelling of gliomagenesis and cell fate control/reprogramming, with a focus on molecular mechanisms of differentiation and quantitative aspects of lineage dynamics; molecular details of the control of cellular metabolism, cell-cell adhesion and cytoskeletal dynamics are not core expertise of mine.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Review of Masalmeh et al.

      Title: "FAK modulates glioblastoma stem cell energetics..."

      Previous studies have implicated FAK and the related tyrosine kinase PYK2 in glioblastoma growth, cell migration, and invasion. Herein, using a murine stem cell model of glioblastoma, the authors used CRISPR to inactivate FAK, FAK-null cells selected and cloned, and lentiviral re-expression of murine FAK in the FAK-null cells (termed FAK Rx) was accomplished. FAK-/- cells were shown to possess epithelial characteristics whereas FAK Rx cells expressed mesenchymal markers and increased cell migration/invasion in vitro. Comparisons between FAK-/- and FAK Rx cells showed that FAK re-expressed increased mitochondrial respiration and amino acid uptake. This was associated with FAK Rx cells exhibiting filamentous mitochondrial morphology (potentially an OXPHOS phenotype) and decreased levels of MTFR1L S235 phosphorylation (implicated in mito morphology fragmentation). Mito and epithelial cell morphology of FAK-/- cells was reversed by treatment with Rho-kinase inhibitors that also increased mito metabolism and cell viability. Last, FAK-dependent glioblastoma tumor growth was shown by comparisons of FAK-/- and FAK Rx implantation studies.

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      Some questions that would enhance potential impact.

      1. Cell generation. Please describe the analysis of FAK-/- clones in more detail. The "low viability" phenotype needs further explanation with regard to clonal expansion and growth characteristics?
      2. Figure 1F: need further support of MET change upon FAK KO and EMT reversion.
      3. Fig. 2: Need further support if FAK effects impact glycolysis or oxidative phosphorylation in particular as implicated by the stem cell model.
      4. Is there a combinatorial potential between FAKi and chemotherapies used for glioblastoma. Need to build upon past studies.
      5. The notation of changes in glucose transporter expression should be followed up with regard to the potential that FAK-expressing cells may have different uptake of carbon sources and other amino acids. Altered uptake could be one potential explanation for increase glycolysis and glutamine flux.
      6. It would be helpful to support the confocal microscopy of mitos with EM.
      7. Lack of FAK expression with increased MTFR1 phosphorylation is difficult to interpret.
      8. Need to have better support between loss of FAK and the increase in Rho signaling. Use of Rho kinase inhibitors is very limited and the context to FAK (and or Pyk2) remains unclear. Past studies have linked integrin adhesion to ECM as a linkage between FAK activation and the transient inhibition of RhoA GTP binding. Is integrin signaling and FAK involved in the cell and metabolism phenotypes in this new model?

      Significance

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

    4. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02946

      Corresponding author(s): Margaret, Frame

      Roza, Masalmeh

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We thank the reviewers for recognizing the significance of our work and for their constructive feedback and suggestions, most of which we have implemented in our revised manuscript.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1

      Evidence, reproducibility and clarity

      Review of Masalmeh et al. Title: "FAK modulates glioblastoma stem cell energetics..."

      Previous studies have implicated FAK and the related tyrosine kinase PYK2 in glioblastoma growth, cell migration, and invasion. Herein, using a murine stem cell model of glioblastoma, the authors used CRISPR to inactivate FAK, FAK-null cells selected and cloned, and lentiviral re-expression of murine FAK in the FAK-null cells (termed FAK Rx) was accomplished. FAK-/- cells were shown to possess epithelial characteristics whereas FAK Rx cells expressed mesenchymal markers and increased cell migration/invasion in vitro. Comparisons between FAK-/- and FAK Rx cells showed that FAK re-expressed increased mitochondrial respiration and amino acid uptake. This was associated with FAK Rx cells exhibiting filamentous mitochondrial morphology (potentially an OXPHOS phenotype) and decreased levels of MTFR1L S235 phosphorylation (implicated in mito morphology fragmentation). Mito and epithelial cell morphology of FAK-/- cells was reversed by treatment with Rho-kinase inhibitors that also increased mito metabolism and cell viability. Last, FAK-dependent glioblastoma tumor growth was shown by comparisons of FAK-/- and FAK Rx implantation studies.

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      Some questions that would enhance potential impact. 1. Cell generation. Please describe the analysis of FAK-/- clones in more detail. The "low viability" phenotype needs further explanation with regard to clonal expansion and growth characteristics?

      Response:

      • We included a better description and a supplementary figure in our revised manuscript to indicate that we have examined several FAK -/- clones and confirmed that our observations were not due to clonal variation; multiple clones displayed similar morphological changes (Figure S1D). We also show that the elongated mesenchymal-like morphology was observed at 48 h after nucleofecting the cells with the FAK‑expressing vector, before beginning G418 selection to enrich for cells expressing FAK (Figure S1C). We also included experiments to acutely modulate FAK signalling (detaching and seeding cells on fibronectin) (Figure S2D, E, F and Figure S3) to exclude the possibility that the profound effects are due to protocols/selection we used for generating FAK-deleted cells.
      • Regarding the term "low viability", we have clarified in the text that there is no significant difference in cell number (Figure S1A) or 'cell viability' when it is assessed by trypan blue exclusion (a non-mitochondria-dependent read-out) (Figure S1B) between FAK-expressing FAK Rx and FAK-/- cells cultured for three days under normal conditions. Therefore, we agree the term 'cell viability' in this context could be confusing and have replace "cell viability" with "metabolic activity as measured by Alamar Blue." in Figure 1D and Figure 5B, and the corresponding text in the original manuscript. This wording more accurately reflects the data.

      Figure 1F: need further support of MET change upon FAK KO and EMT reversion.

      Response: We have added a heatmap (Figure S1E) illustrating the changes in protein expression of core-enriched EMT/MET genes products (by proteomics) after FAK gene deletion (EMT genes as defined in Howe et al., 2018) ; this strengthens the conclusion that the MET reversion morphological phenotype is accompanied by recognised MET protein changes.

      Fig. 2: Need further support if FAK effects impact glycolysis or oxidative phosphorylation in particular as implicated by the stem cell model.

      Response: We show that FAK impacts both glycolysis (Figure 2A, 2E, and 2F) and mitochondrial oxidative phosphorylation on the basis of the oxygen consumption rate (OCR) (Figure 2B, and 2D), showing both are contributing pathways to FAK-dependent energy production. We have clarified this in the text.

      Is there a combinatorial potential between FAKi and chemotherapies used for glioblastoma. Need to build upon past studies.

      Response: Yes, previous studies suggest that inhibiting FAK can sensitize GBM cells to chemotherapy (Golubovskaya et al., 2012; Ortiz-Rivera et al., 2023). We have included a paragraph in the discussion section to make sure this is clearer. Although it is not the subject of this study, we appreciate it is useful context.

      The notation of changes in glucose transporter expression should be followed up with regard to the potential that FAK-expressing cells may have different uptake of carbon sources and other amino acids. Altered uptake could be one potential explanation for increase glycolysis and glutamine flux.

      Response: We agree with the reviewer that glucose uptake could be contributing and we include data that 2 glucose transporters are indeed FAK-regulated namely Glucose transporter 1 (GLUT1, encoded by Slc2a1 gene) and Glucose transporter 3 (GLUT 3, encoded by Slc2a3 gene) (shown in Figure S2B and C).

      It would be helpful to support the confocal microscopy of mitos with EM.

      Response:

      We are concerned (and in our experience) that Electron microscopy (EM) may introduce artefacts during sample preparation. In contrast, immunofluorescence sample preparation is less susceptible to artefacts. The SORA system we used is not a conventional point-scanning confocal microscope, but is a super-resolution module based on a spinning disk confocal platform (CSU-W1; Yokogawa) using optical pixel reassignment with confocal detection. This method enhances resolution in all dimensions with resolution in our samples measured at 120nm. This has been instructive in defining a new level of changes in mitochondrial morphology upon FAK gene deletion.

      Lack of FAK expression with increased MTFR1 phosphorylation is difficult to interpret.

      Response: We do not directly show that this phosphorylation event is causal in our experiments; however, we think it important to document this change since it has been published that phosphorylation of MTFR1 has been causally linked to the mitochondrial morphology we observed in other systems (Tilokani et al., 2022).

      Need to have better support between loss of FAK and the increase in Rho signaling. Use of Rho kinase inhibitors is very limited and the context to FAK (and or Pyk2) remains unclear. Past studies have linked integrin adhesion to ECM as a linkage between FAK activation and the transient inhibition of RhoA GTP binding. Is integrin signaling and FAK involved in the cell and metabolism phenotypes in this new model?

      Response: To better support the antagonistic effect of FAK on Rho-kinase (ROCK) signalling, we included a new experiment in which the integrin-FAK signalling pathway has been disrupted by treating FAK WT cells with an agent that causes detachment from the substratum, Accutase, and growing the cells in suspension in laminin-free medium. We present ROCK activity data, as judged by phosphorylated MLC2 at serine 19 (pMLC2 S19), relating this to induced FAK phosphorylation at Y397 (a surrogate for FAK activity) that is supressed after integrin disengagement. These measurements have been compared with conditions whereby integrin-FAK signalling is activated by growing the cells on laminin coated surfaces. We observed a time-dependent decrease in pFAK(Y397) levels (normalised to total FAK) in suspended cells compared to those spread on laminin, while pMLC2(S19) levels increased in a reciprocal manner over time in detached cells relative to spread cells (S4A and B). There is therefore an inverse relationship between integrin-FAK signalling and ROCK-MLC2 activity, consistent with findings from FAK gene deletion experiments. In the former case, we do not rely on gene deletion cell clones.

      Significance

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      __Response: __

      Deleting the gene encoding FAK in mouse embryonic fibroblasts leads to elevated Pyk2 expression (Sieg, 2000). However, in the GBM stem cell model we used here, Pyk2 was not expressed (determined by both transcriptomics and proteomics). We have included Figure S1E to show that PYK2 expression was undetectable in FAK -/- and FAK Rx cells at the RNA level (Figure S1F). We conclude that there is no compensatory increase in Pyk2 upon FAK loss in these cells. In the transformed neural stem cell model of GBM, we do not consistently or robustly detect nuclear FAK.

      Review #2

      Masalmeh and colleagues employ a neural stem/progenitor cell-based glioma model (NPE cells) to investigate the role of Focal Adhesion Kinase (FAK) in GBM, with a focus on potential links between the regulation of morphological/adhesive and metabolic GBM cell properties. For this, the authors employ wt cells alongside newly generated FAK-KO and -reexpressing cells, as well as pharmacological interventions to probe the relevance of specific signaling pathways. The authors´ main claims are that FAK crucially modulates glioma cell morphology, cell-cell and cell-substrate interactions and motility, as well as their metabolism, and that these effects translate to changes to relevant in vivo properties such as invasion and tumor growth.

      My main issues are with the model chosen by the authors.

      As per the methods section, generation of FAK-KO and -"Rx" NPE cells entailed protracted selection/expansion processes, which may have resulted in inadvertent selection for cellular/molecular properties unrelated to the desired one (loss or gain of FAK expression) and which may have had cascading effects on NPE cells. The authors nonetheless repeatedly claim the parameters they quantify, such as mitochondrial or cytoskeletal properties or metabolic features, to have directly resulted from FAK loss or reintroduction. Examples of such causal inferences are to be found in lines 123, 134/135, 165, 181. Such causal claims are, in my view, unsupported.

      Acute perturbation of FAK expression/activity, genetically or pharmacologically, followed by a rapid assessment of the processes under investigation, would be needed to begin to assess causality, even if acute genetic perturbations may be technically challenging as sufficient gene expression reduction or restoration to physiologically relevant levels may be hard to achieve.

      Response:

      We would like to first comment on the model we used here, which we think will clarify the validity of our approach. The model is a transformed stem cell model of GBM that was published in (Gangoso et al., Cell, 2021) and is now used regularly in the GBM field. As mentioned in the response to Reviewer 1, we have added text (page 4 and 5 in the revised manuscript) and a new supplementary figure (Figure S1D) clarifying that the morphological changes we observed were consistent across multiple FAK -/- clones, showing this was not due to any inter-clonal variability. We also added images showing that the morphological changes were apparent at 48 h after nucleofecting FAK -/- cells with the FAK‑expressing vector specifically (not the empty vector), prior to starting G418 selection to enrich for FAK‑expressing cells (Figure S1C), addressing the worry that clonal variation and selection was the cause of the FAK-dependent phenotypes we observed. We believe that our model provides a type of well controlled, clean genetic cancer cell system of a type that is commonly used in cancer cell biology, allowing us to attribute phenotypes to individual proteins.

      We have also carried out a more acute treatment by using the FAK inhibitor VS4718 to perturb FAK kinase activity and assessed the effects on glycolysis and glutamine oxidation after 48h treatment (Figure S2D, E and F). We found that treating the transformed neural stem cells (parental population) with FAK inhibitor (300nM VS4718) decreases glucose incorporation into glycolysis intermediates and glutamine incorporation into TCA cycle intermediates, consistent with a role for FAK's kinase activity in maintaining glycolysis and glutamine oxidation.

      The employed pharmacological modulation of ROCK activity is the only approach that, given the presumably acute nature of the treatment, may have allowed the authors to probe the proposed functional links. The methods section of the manuscript does not however comprise details as to the duration of these treatments, which leaves open the possibility of long-term treatment having been carried out (data shown in Figure 5B refers to 72hr treatment).

      __Response: __

      We have added the duration of the treatment to the Methods section and Figure Legends, to clarify that cells were treated with ROCK inhibitors for 24h, before assessing the effects on mictochondria (Figure 4C, D, S4C and D) and glutamine oxidation (Figure 5A, and S5). For metabolic activity by AlamarBlue assay, cells were treated with ROCK inhibitors for 72h (Figure 5B).

      Even in the case of ROCK inhibitor experiments, it is however unclear if and how the effects on cell morphology and adhesion, mitochondrial organization and metabolic activity may be connected to each other and, if at all, to FAK expression.

      Given the above uncertainties due to the nature of the model and experimental approaches, it is hard to assess the reliability and thus the relevance of the findings.

      Response:

      FAK suppresses ROCK activity (as judged by pMLC2 S19, Figure 4A and B). Treating FAK -/- cells with two different ROCK inhibitors restored mesenchymal-like cell morphology, mitochondrial morphology and glutamine oxidation. As mentioned above, to strengthen our evidence for the antagonistic role of FAK in ROCK-MLC2 signalling, we have now introduced an experiment whereby integrin-FAK signalling was disrupted through treatment with a detachment agent (Accutase), and subsequently maintaining the cells in suspension in laminin-free medium. We assessed pMLC2 S19 levels (a measure of ROCK activity) relating this to FAK phosphorylation that is supressed after integrin disengagement. These results were evaluated relative to spread wild type cells growing on laminin where Integrin-FAK signalling was active (Figure S4A and B). We observed an inverse relationship between Integrin-FAK signalling and ROCK-MLC2 activity in keeping with our conclusions (Figure 4A and B).

      Experimental support for the ability of cell-substrate interaction modulation to concomitantly impact cellular metabolism and motility/invasion would be significant both in terms of advancing our understanding of glioma cell biology and of its translational potential, but the evidence being provided is at best compatible with the proposed model.

      Response: We carried out a new experiment to support the ability of cell-substrate interaction modulation to impact metabolism; specifically, we inhibited cell-substrate interactions by plating the cells on Poly-2-hydroxyethyl methacrylate (Poly 2-HEMA)-coated dishes. This suppressed FAK phosphorylation at Y397, as expected, with concomitant reduction in glutamine utilisation in the TCA cycle (Figure S3A, B and C).

      My background/expertise is in developmental and adult neurogenesis, in vivo modelling of gliomagenesis and cell fate control/reprogramming, with a focus on molecular mechanisms of differentiation and quantitative aspects of lineage dynamics; molecular details of the control of cellular metabolism, cell-cell adhesion and cytoskeletal dynamics are not core expertise of mine.

      We appreciate this reviewer's expertise are not necessarily in the cancer cell biology and genetic intervention aspects of our study. We hope that the explanations we have provided satisfy the reviewer that our conclusions are valid.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Masalmeh and colleagues employ a neural stem/progenitor cell-based glioma model (NPE cells) to investigate the role of Focal Adhesion Kinase (FAK) in GBM, with a focus on potential links between the regulation of morphological/adhesive and metabolic GBM cell properties. For this, the authors employ wt cells alongside newly generated FAK-KO and -reexpressing cells, as well as pharmacological interventions to probe the relevance of specific signaling pathways. The authors´ main claims are that FAK crucially modulates glioma cell morphology, cell-cell and cell-substrate interactions and motility, as well as their metabolism, and that these effects translate to changes to relevant in vivo properties such as invasion and tumor growth. My main issues are with the model chosen by the authors.

      As per the methods section, generation of FAK-KO and -"Rx" NPE cells entailed protracted selection/expansion processes, which may have resulted in inadvertent selection for cellular/molecular properties unrelated to the desired one (loss or gain of FAK expression) and which may have had cascading effects on NPE cells. The authors nonetheless repeatedly claim the parameters they quantify, such as mitochondrial or cytoskeletal properties or metabolic features, to have directly resulted from FAK loss or reintroduction. Examples of such causal inferences are to be found in lines 123, 134/135, 165, 181. Such causal claims are, in my view, unsupported. Acute perturbation of FAK expression/activity, genetically or pharmacologically, followed by a rapid assessment of the processes under investigation, would be needed to begin to assess causality, even if acute genetic perturbations may be technically challenging as sufficient gene expression reduction or restoration to physiologically relevant levels may be hard to achieve.

      The employed pharmacological modulation of ROCK activity is the only approach that, given the presumably acute nature of the treatment, may have allowed the authors to probe the proposed functional links. The methods section of the manuscript does not however comprise details as to the duration of these treatments, which leaves open the possibility of long-term treatment having been carried out (data shown in Figure 5B refers to 72hr treatment). Even in the case of ROCK inhibitor experiments, it is however unclear if and how the effects on cell morphology and adhesion, mitochondrial organization and metabolic activity may be connected to each other and, if at all, to FAK expression.

      Significance

      Given the above uncertainties due to the nature of the model and experimental approaches, it is hard to assess the reliability and thus the relevance of the findings.

      Experimental support for the ability of cell-substrate interaction modulation to concomitantly impact cellular metabolism and motility/invasion would be significant both in terms of advancing our understanding of glioma cell biology and of its translational potential, but the evidence being provided is at best compatible with the proposed model.

      My background/expertise is in developmental and adult neurogenesis, in vivo modelling of gliomagenesis and cell fate control/reprogramming, with a focus on molecular mechanisms of differentiation and quantitative aspects of lineage dynamics; molecular details of the control of cellular metabolism, cell-cell adhesion and cytoskeletal dynamics are not core expertise of mine.

    6. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Review of Masalmeh et al.

      Title: "FAK modulates glioblastoma stem cell energetics..."

      Previous studies have implicated FAK and the related tyrosine kinase PYK2 in glioblastoma growth, cell migration, and invasion. Herein, using a murine stem cell model of glioblastoma, the authors used CRISPR to inactivate FAK, FAK-null cells selected and cloned, and lentiviral re-expression of murine FAK in the FAK-null cells (termed FAK Rx) was accomplished. FAK-/- cells were shown to possess epithelial characteristics whereas FAK Rx cells expressed mesenchymal markers and increased cell migration/invasion in vitro. Comparisons between FAK-/- and FAK Rx cells showed that FAK re-expressed increased mitochondrial respiration and amino acid uptake. This was associated with FAK Rx cells exhibiting filamentous mitochondrial morphology (potentially an OXPHOS phenotype) and decreased levels of MTFR1L S235 phosphorylation (implicated in mito morphology fragmentation). Mito and epithelial cell morphology of FAK-/- cells was reversed by treatment with Rho-kinase inhibitors that also increased mito metabolism and cell viability. Last, FAK-dependent glioblastoma tumor growth was shown by comparisons of FAK-/- and FAK Rx implantation studies.

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      Some questions that would enhance potential impact.

      1. Cell generation. Please describe the analysis of FAK-/- clones in more detail. The "low viability" phenotype needs further explanation with regard to clonal expansion and growth characteristics?
      2. Figure 1F: need further support of MET change upon FAK KO and EMT reversion.
      3. Fig. 2: Need further support if FAK effects impact glycolysis or oxidative phosphorylation in particular as implicated by the stem cell model.
      4. Is there a combinatorial potential between FAKi and chemotherapies used for glioblastoma. Need to build upon past studies.
      5. The notation of changes in glucose transporter expression should be followed up with regard to the potential that FAK-expressing cells may have different uptake of carbon sources and other amino acids. Altered uptake could be one potential explanation for increase glycolysis and glutamine flux.
      6. It would be helpful to support the confocal microscopy of mitos with EM.
      7. Lack of FAK expression with increased MTFR1 phosphorylation is difficult to interpret.
      8. Need to have better support between loss of FAK and the increase in Rho signaling. Use of Rho kinase inhibitors is very limited and the context to FAK (and or Pyk2) remains unclear. Past studies have linked integrin adhesion to ECM as a linkage between FAK activation and the transient inhibition of RhoA GTP binding. Is integrin signaling and FAK involved in the cell and metabolism phenotypes in this new model?

      Significance

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      RESPONSE TO REVIEWERS

      We thank the reviewers for their thoughtful and constructive feedback, which has been instrumental in improving the overall quality of our manuscript.

      In response, we have undertaken a substantial revision that includes new experimental data, refined analyses, and clearer presentation of our findings. Specifically, we have addressed concerns about RNAi efficiency and protein-level validation, expanded our genetic models to include loss-of-function contexts, and clarified the interpretation of mitochondrial morphology using both confocal and electron microscopy. We also incorporated new data on Cyclin E regulation and mitochondrial membrane potential to strengthen the mechanistic link between dPGC1 depletion and Yki-driven tumorigenesis. These revisions not only address the specific points raised by the reviewers but also enhance the coherence and impact of the study. We are confident that the revised manuscript presents a more robust and compelling case for the role of dPGC1 as a context-dependent tumor suppressor and that it will be of broad interest to the fields of developmental biology, cancer metabolism, and mitochondrial dynamics.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Sew et al. examine the master regulator of mitochondrial biogenesis, dPGC1, in the context of Drosophila wing and larval development. They primarily use confocal imaging to probe the interplay between dPGC1 and an overactive Hippo pathway, driven by overexpression of the main effector protein, Yki. In their study, they find that tumors, driven by overactivity of Yki grow larger when dPGC1 is downregulated, implicating the mitochondrial biogenesis pathway in tumor suppression. Furthermore, in the context of Yki overexpression, they find that levels of Mfn or Opa1 modulate tumor size. Lastly, they show a role of cyclin E in controlling the size of tumors formed by Yki OE + dPGC1 RNAi. The potential role of dPGC1 as a tumor suppressor is interesting because it highlights an emerging recognition of mitochondria in the aetiology of cancer. However, before publication, much of the data in this manuscript should be strengthened by a refinement in the methods/analysis and an increase in orthogonal approaches.

      We addressed concerns regarding RNAi efficiency and wing development by incorporating data from a dPGC1 mutant allele and using a ubiquitous driver for qPCR validation of transgene efficiency. We clarified the rationale for EM use. The manuscript now avoids overinterpretation of mitochondrial morphology and focuses on fusion-specific regulators. We also revised the narrative arc to maintain coherence and added loss-of-function models to support our conclusions.

      Below, we address each of the reviewer’s points in detail.

      Major comments:

      The authors indicate that for example, in lines127-28, that neither downregulating or overexpressing dPGC1 affects wing size. However, the quantification in Fig. 1C shows a significant decrease in wing size following RNAi treatment. This decrease is modest, but it is nevertheless significant. It is worth pointing out, too, that the efficiency of the RNAi in Fig. S1C suggests that the conclusions drawn are premature. While a roughly 55% drop in mRNA levels may be statistically significant, it is unclear whether this drop in transcripts corresponds to a commensurate depletion of protein. Moreover, it is unclear, in this context, how much dPGC1 may indeed be necessary to drive a relatively normal program of mitochondrial biogenesis in wing development. To obtain a clear result, it is necessary to show significant depletion of the dPGC1 protein. (Ultimately, if it is the case that dPGC1 is unnecessary for wing development and function, a more coherent line of inquiry would be to find out the reason for this rather than to pivot the story to studying tumorigenesis in larva.)

      We agree that the interpretation of the RNAi efficiency data requires clarification.

      The qPCR analysis shown in former Fig. S1C was performed using wing discs from flies expressing UAS-dPGC1-RNAi under the control of the MS1096-Gal4 driver. However, as shown in current Fig. 1C, MS1096-Gal4 is not expressed uniformly across the wing disc. Some regions remain RFP-negative, indicating that the RNAi construct is not active in all cells. As a result, the measured mRNA levels likely underestimate the true knockdown efficiency. This is because the qPCR includes mRNA from both RNAi-expressing and non-expressing cells, diluting the apparent reduction in transcript levels.

      To address this limitation and more accurately assess RNAi efficiency, we repeated the qPCR analysis using a ubiquitous driver (actin-Gal4) to ensure uniform expression of the RNAi construct. Under these conditions, we observed a more substantial knockdown, with dPGC1 mRNA levels reduced to approximately 25% of control levels (this is shown in current Fig S2). This result indicates that the RNAi line is more effective than initially suggested by the MS1096-Gal4-based analysis.

      To complement our RNAi-based analysis, we additionally used a mutant strain carrying a characterized allele of dPGC1 (dPGC11, also known as dPGC1KG08646; see FlyBase: https://flybase.org/reports/FBal0148128). This genetically distinct approach allowed us to validate and strengthen our findings regarding dPGC1 function. Flies homozygous for this allele exhibited a modest but statistically significant reduction in both wing disc and adult wing size. These results support the conclusion that dPGC1 is required for normal wing growth and development. The new data are now included in Figure 1 and referenced in the main text (lines 144-153).

      Additionally, as suggested by the reviewer, we have revised the relevant section to maintain a coherent line of inquiry. The updated text can be found in lines 163–172.

      In Figure 3H-K, it is not clear why the authors used electron microscopy to evaluate mitochondrial morphology. The very good confocal images in Figure 3C-G show a clear change in mitochondrial morphology following the knockdown of Mfn, Opa1, and Miro. While it is clear from the electron micrographs in Figure H that the mitochondria are enlarged, it is not obvious that this increase in length is a result of increased mitochondrial fusion. Indeed, if the mean form factor were used to quantify the shape, it is likely that in both conditions, the value would be close to 1, indicating more of a round object, and it not obvious whether there would be a difference between the Yki OE versus the YkI OE + dPGC1 RNAi. Therefore, from this data alone, it cannot be concluded that the YkI OE + dPGC1 RNAi condition leads to mitochondrial hyperfusion.

      Our rationale for including electron microscopy (EM) was to overcome specific limitations in imaging mitochondrial morphology within the main epithelium of the wing disc, where Yki-driven tumors arise. These tumors were generated using ap-Gal4, which drives expression specifically in the main epithelium and is not active in the peripodial membrane. This is an important distinction, as the peripodial membrane—used in Figures 3C–G—has a squamous architecture and larger cytoplasmic volume, making it ideal for high-resolution confocal imaging and for assessing the effects of manipulating dMfn, Opa1, and miro. However, because ap-Gal4 is not expressed in the peripodial membrane, this tissue could not be used to analyze mitochondrial morphology in the actual tumorous context.

      To directly evaluate mitochondria in the main epithelium, we employed EM, which provides the resolution necessary to visualize ultrastructural changes that are not easily captured by confocal microscopy in this densely packed tissue. While EM does not directly measure fusion events, it allowed us to detect changes in mitochondrial size and shape that support our broader findings.

      We acknowledge that mitochondrial enlargement alone does not definitively demonstrate hyperfusion. However, the EM data were interpreted alongside additional evidence: the upregulation of mitochondrial fusion genes (dMfn and Opa1) in Yki + dPGC1-RNAi tumors, and functional data showing that overexpression of these genes promotes fusion in the peripodial membrane. Together, these findings suggest that dPGC1 depletion enhances mitochondrial fusion in Yki-driven tumors.

      To further clarify this point, we also imaged mitochondria in the main epithelium using confocal microscopy. However, the resolution was considerably lower than that achieved with EM, limiting our ability to assess fine mitochondrial structures. We have prepared a representative figure for the reviewer (below), showing representative confocal images of wing discs from three genotypes: (A) ap-Gal4, UAS-GFP (control), (B) ap-Gal4, UAS-Yki, and (C) ap-Gal4, UAS-Yki, UAS-dPGC1-RNAi. We used anti-ATP-synthase (Abcam, ab14748, dilution 1:200), to label the mitochondria for this Figure. Despite the lower resolution, mitochondria in the Yki + dPGC1-RNAi tumors appear elongated (yellow arrows) compared to those in the other conditions, consistent with the changes observed by EM. We believe this example illustrates the limitations of confocal imaging in this tissue and reinforces the need for EM to accurately assess mitochondrial morphology in the tumorous epithelium.

      While our EM analyses reveal mitochondrial enlargement in wing discs co-expressing Yki and PGC1-RNAi, we acknowledge that these structural features alone do not conclusively demonstrate mitochondrial hyperfusion. To address this, we have revised the manuscript to avoid overinterpreting the EM data and instead emphasize the functional relevance of mitochondrial fusion regulators such as dMfn and Opa1 in promoting tumor growth.

      Taken together, the EM analysis provides structural validation in the tumorous epithelium (Fig 4), while the confocal imaging and functional manipulation of fusion genes in the peripodial membrane offer mechanistic insight (Fig 3). This integrated approach strengthens the conclusion that PGC1 depletion in a Yki-overexpressing context promotes changes in mitochondrial morphology and contributes to tumorigenesis, independent of whether these changes reflect hyperfusion.

      Figure 4. refers to changes in mitochondrial fusion and fission in tumor formation; however, the authors do not attempt to alter mitochondrial fission factors, so it is not accurate to mention a role of mitochondrial fission, in this context.

      As we did not directly manipulate fission-related factors in our experiments, we agree that it would be inappropriate to draw conclusions about the role of mitochondrial fission in this context. Our revised figure (current Fig 5) and accompanying text now focus exclusively on the effects of mitochondrial fusion and the genes directly involved in regulating this process.

      It must be noted, too, that the authors have not demonstrated that their genetic interventions have actually affected mitochondrial morphology in these experiments. As noted in the previous figure, the Yki OE + dPGC1 RNAi condition showed enlarged mitochondria, but not necessarily hyperfused organelles. Therefore, the downregulation of Mfn or Opa1 in this set of experiments may not necessarily have altered mitochondrial morphology. Perhaps suppression of Mfn or Opa1 would normalize the areas of these evidently swollen mitochondria, but this is unclear without images. Furthermore, it should be appreciated that both Opa1 and Mfn exhibit pleiotropic attributes - e.g., Opa1 not only regulates IMM fusion, but it also modulates the shape and tightness of cristae membranes, specialized sites of oxidative phosphorylation as well as sequestration of cytochrome c, the release of which influences apoptosis (Frezza et al., 2006). At least in mammalian cells, Mfn2 is thought to regulate contacts between mitochondria and endoplasmic reticulum (Naon et al., 2023), which may serve other functions than OMM fusion, such as stabilization of the MAM.

      To directly address this point, we performed EM to assess mitochondrial ultrastructure in Yki + dPGC1-RNAi wing disc tumors, with and without dMfn1 downregulation, the most upregulated mitochondrial fusion gene in this tumor context. In Yki + dPGC1-RNAi tumors, mitochondria appeared more elongated, consistent with increased fusion. Upon dMfn1 depletion, we observed a dramatic shift in mitochondrial morphology: mitochondria became larger and more rounded, with disrupted cristae and onion-like structures, indicative of compromised mitochondrial integrity and function (see current Fig. 4).

      As the reviewer rightly notes, these morphological changes are consistent with the pleiotropic roles of Mfn and Opa1, which extend beyond outer and inner membrane fusion to include regulation of cristae architecture and ER-mitochondria contacts (Frezza et al., 2006; Naon et al., 2023). We now discuss these broader roles in the revised manuscript (lines 493–497). Taken together, our EM and confocal analyses, combined with targeted genetic manipulations, provide evidence that mitochondrial morphology is indeed altered in response to dPGC1 depletion and fusion gene deregulation in the wing disc.

      Figure 5 highlights a connection between dysregulation of mitochondria and Cyclin E, which allows cells to prematurely enter S phase. The data presented here do not offer clarity on whether the enlargement of the tumors results from increase cellular proliferation and/or cell size. The role of the cell cycle adds a layer of complexity to these results, because it is thought that mitochondria undergo fragmentation during the cell cycle to promote an even distribution of the organelle population after mitosis (Taguchi et al., 2007); however, in this manuscript, the authors contend that the downregulation dPGC1 is promoting mitochondrial hyperfusion. It is unclear how and whether cellular division and proliferation would proceed at an accelerated rate in a situation with mitochondrial hyperfusion.

      To address this point, we started by analyzing whether Yki + dPGC1-RNAi tumors exhibit increased proliferation compared to tumors expressing Yki alone. We quantified mitotic activity using the phospho-Histone H3 (PH3) marker of mitotic cells and observed a significant increase in PH3-positive cells in the Yki + dPGC1-RNAi condition. These results indicate an elevated proliferation rate in these tumors and are now presented in Fig 2O–Q. In the text, can be found in lines 221-228.

      We agree with the reviewer that our findings challenge the conventional view that mitochondrial fragmentation is a prerequisite for mitosis, as we observe increased expression of gene promoting mitochondrial fusion in the context of dPGC1 downregulation alongside signs of accelerated cell cycle entry. It is important to note that we also show that the levels of the oncogene Cyclin E, a key driver of cell cycle progression and S-phase entry, were elevated in Yki + dPGC1-RNAi tumors compared to those expressing Yki alone, suggesting that the increased proliferation observed is at least in part driven by enhanced cycle activity. To further probe Cyclin E’s role, we used the CycE-05306 heterozygous mutant allele, which reduces Cyclin E levels by ~50% without affecting normal development. Notably, this partial reduction strongly suppressed tumor growth in the Yki + dPGC1-RNAi background (Fig 6), underscoring Cyclin E’s functional importance in supporting oncogenic growth in this context.

      These findings support the notion that defects in the expression of mitochondrial genes involved in mitochondrial morphology induced by dPGC1 depletion do not impair but rather coincide with accelerated cell division.

      Minor comments:

      Lines 69-72 contrast the roles of PGC1α and β. It is not clear whether the comparison is of their respective roles in cancer or in normal physiology. In either case, it is important to note that PGC1β has been shown to drive mitochondrial fusion as well as biogenesis through its control of MFN2, among other factors (Liesa et al., 2008).

      In response, we have clarified the comparison between PGC1α and PGC1β in the introduction to specify that it refers to their roles in cancer. Additionally, we now acknowledge that PGC1β has been shown to promote mitochondrial biogenesis and fusion, notably through the regulation of MFN2, as demonstrated by Liesa et al. (2008). This reference has been added to provide a more balanced and accurate representation of PGC1β’s functions. In the text it can be found in lines 77-81.

      Although this study focuses on PGC1, the authors do not seem to site the original literature from the Spiegelman lab.

      In response to the reviewer’s comment, we have added a new section in the introduction that cites key foundational studies from the Spiegelman lab. This addition can be found in the introduction in lines 68-73.

      There are 10-20 grammatical errors throughout the text.

      We apologize for this. We have carefully revised the text, and we are very confident those errors have been corrected.

      **Referee Cross-commenting**

      There is agreement among the referees that the potential role of PGC1 as a tumor suppressor is interesting and significant. However, various aspects of this work require attention prior to publication. For example, there needs to be a complete knock down of PGC1 to come to any conclusion as to its role in wing development. The methods for analyzing mitochondrial morphology need to be clarified and be consistent with standards in the field of mitochondrial dynamics. Also, the authors need to quantify their Western blots to obtain accurate assessments of protein levels. Generally, the study relies too heavily on overexpression experiments; understanding the potential role of mitochondria in regulating the Hippo pathway should include various knockdown and/or knockout models.

      Reviewer #1 (Significance (Required)):

      Overall, the authors show an interesting dampening effect of dPGC1 on growth of Yki-driven tumors. This data could be relevant for elucidating how dysregulation of the Hippo signalling pathway can underlie tumorigenesis.

      The narrative arc of the study, however, appears to lack a focused line of inquiry. Figure 1 highlights an attempt to modulate Drosophila wing size and/or structure by downregulating dPGC1, but to no effect. Although examination of the efficiency of the RNAi revealed that the transcripts were still present in significant quantities; so, the conclusion that dPGC1 is dispensable for wing formation is premature. To have clarity on this point, it would be necessary to completely knockdown the gene, preferably by showing a total loss of protein. This should be feasible for the authors, since they showed Western blotting in Figure 5A. In any event, it seems that this negative data led the authors to study the Hippo pathway in the larval stage. This transition from Figure 1 to 2 seemed somewhat arbitrary and leads to a rather disjointed sense of the main line of inquiry around dPGC1.

      It is important to note, too, that the authors highlight a role of mitochondrial dynamics in the pathway of Yki-driven tumor formation; however, they only directly evaluate mitochondrial dynamics in this context in a single assay, namely, Figure 3H-K, and this quantification is likely inaccurate because the mitochondria in the Yki OE + dPGC1 RNAi condition seem to be substantially enlarged, circular structures. It is critical to keep in mind that mitochondrial enlargement does not necessarily stem from hyperfusion. It could come from a decrease in the activity of Drp1 or result from an imbalance between mitochondrial biogenesis and mitophagy.

      As noted in our responses above, we have addressed these concerns by clarifying the limitations of our mitochondrial morphology analysis. Additionally, we have expanded the discussion (lines 498-504) to explicitly acknowledge that mitochondrial enlargement does not necessarily indicate hyperfusion. In that paragraph, we consider alternative explanations such as reduced fission or imbalances in mitochondrial biogenesis and mitophagy, and we outline the need for future studies using dynamic assays and additional markers to more precisely dissect mitochondrial remodeling in Yki-driven tumors.

      A marked limitation of this study is the overuse of rather artificial manipulations of transcriptional regulatory pathways. The study would benefit a lot from investigation of the loss of function of components of the Hippo pathway rather than just OE of Yki.

      We performed additional experiments using Warts (Wts) mutant clones to assess the role of dPGC1 in a loss-of-function context within the Hippo pathway. While our initial analyses were based on Yki overexpression, which allowed us to robustly probe the interaction between Yki and dPGC1, we agree that this approach may not fully reflect physiological conditions. By generating Wts mutant clones, which endogenously activate Yki through loss of upstream inhibition, we were able to evaluate the impact of dPGC1 depletion in a more physiologically relevant setting. These new results confirm and extend our previous findings, showing that dPGC1 limits tissue overgrowth even when Yki is activated through loss of Wts, thereby strengthening the biological relevance of our conclusions.

      These results are presented in Fig 2F-I. In the text, those results are presented in lines 181-189.

      My expertise is in mitochondrial biology, with specialization in super-resolution imaging, mitochondrial dynamics and membrane architecture. I have also worked in the interface between mitochondrial physiology and cancer. With this perspective, I think that the authors uncover a potentially interesting role of PGC1 as a tumor suppressor.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary In this manuscript the authors the investigate the role of the mitochondrial regulatory transcription factor dPGC1 in tissue growth and oncogenic transformation. They show that dPGC1 limits hyperplasia mediated by overexpression of Yki in the Drosophila wing disc, while having no effect on normal growth. dPGC1 depletion in discs overexpressing Yki results neoplastic overgrowth and hyperfused mitochondria, which was dependent on the increased expression of genes involved in promoting mitochondrial fusion. Additionally, the authors show that dPGC1 limits CycE levels post-transcriptionally in Yki tumors.

      In the revised version of our manuscript, we have clarified the relationship between our findings and prior work by Nagaraj et al., including new experiments that demonstrate the specificity of dPGC1’s role in Yki-driven growth. Specifically, we show that dPGC1 depletion does not enhance tissue overgrowth in EGFR or InR contexts, nor does it affect Yki expression or activity. Furthermore, we tested dPGC1 overexpression in Yki-overexpressing tissues and observed no significant changes in growth or mitochondrial fusion gene expression. Additional controls confirmed that Cyclin E upregulation is specific to the Yki + dPGC1 depletion condition, reinforcing the context-dependent nature of our findings.

      Each of the reviewer’s comments is addressed below.

      Major comments 1) The authors mention several times in passing in the results a manuscript from the Banerjee lab (Nagaraj et al 2012), which shows that many of the genes the authors of the present manuscript show are upregulated upon Yki overexpression + dPGC1-RNAi compared with Yki overexpression alone are in fact upregulated upon Yki overexpression alone compared with control (dMfn/marf, opa1, miro - while interestingly dPGC1 itself is not affected). Nagaraj et al further show that Yki-overexpressing discs have longer mitochondria suggesting increased fusion even in the absence of dPGC1 depletion. The findings from Nagaraj et al should be mentioned explicitly in the introduction and the relationship between this manuscript and the present work clearly outlined in the discussion.

      In the revised manuscript, we have now explicitly referenced the findings of Nagaraj et al. (2012) in the Introduction (lines 106-118), Results (lines 355-360) and Discussion (lines 466-468) sections.

      In the revised Introduction, we summarize their key observations that Yki overexpression alone upregulates mitochondrial fusion genes such as dMfn and Opa1, and leads to mitochondrial elongation, while not affecting dPGC1 expression.

      In the revised Results section, we mention that, building on that work, our study demonstrates that dPGC1 depletion further amplifies this effect, leading to enhanced mitochondrial elongation and tumor growth.

      In the revised Discussion, we now explicitly reference the findings by Nagaraj et al. (2012), which demonstrated that Yki overexpression promotes mitochondrial fusion and upregulates key fusion genes. We build upon this work by showing that dPGC1 depletion in a Yki-overexpressing background further enhances mitochondrial fusion gene expression and tumor growth. This supports a model in which dPGC1 acts as a safeguard against Yki-induced mitochondrial remodeling and oncogenesis, reinforcing its role as a context-dependent tumor suppressor.

      Importantly, we show that this effect is context-dependent and not observed in otherwise normal tissues, highlighting a sensitized mitochondrial response to Yki activation when dPGC1 is lost. These additions help delineate the novel contribution of our study in identifying dPGC1 as a critical modulator of mitochondrial dynamics and tumorigenesis downstream of Yki.

      2) Given that Yki overexpression alone induces mitochondrial fusion and that dMfn/marf and opa1 depletion suppresses Yki-induced overgrowth (Nagaraj et al), does dPGC1 overexpression also suppress Yki-induced overgrowth?

      If so, is this correlated with reduction in dMfn/marf and opa1 compared with Yki overexpression alone?

      In response, we performed additional experiments to assess whether dPGC1 overexpression influences Yki-driven overgrowth. We also analyzed the expression of mitochondrial fusion genes (dMfn and Opa1) in this context. As shown in new Fig. S8, dPGC1 overexpression in Yki-overexpressing wing discs did not significantly affect tissue growth, nor did it alter the mRNA levels of key fusion regulators, dMfn and Opa1. These findings suggest that the transcriptional upregulation of mitochondrial fusion genes observed upon dPGC1 depletion is not a general consequence of altered dPGC1 levels, but rather a specific response that emerges in the context of Yki activation. We now present and discuss these results in the revised manuscript (lines 278-285), highlighting the sensitized nature of mitochondrial remodeling in an oncogenic environment driven by Yki signaling.

      3) One important question raised by this study is: how specific is the effect of dPGC1 depletion to Yki-driven overgrowth? As Yki-driven overgrowth already have increased mitochondrial length, it is possible that Yki-expressing cells are already sensitised to the effects of dPGC1 depletion. Interestingly, Nagaraj et al show that mitochondrial morphology is not affected upon EGFR activation (hyperplasia) or upon scrib and avl depletion (neoplasia). The authors should therefore test if dPGC1 depletion can potentiate the growth of other hyperplasia drivers such as activated EGFR and InR in the wing disc.

      We tested whether the growth-suppressive effect of dPGC1 depletion was specific to Yki-driven overgrowth or could also potentiate tissue growth in other oncogenic contexts. Specifically, we downregulated dPGC1 in wing discs overexpressing either EGFR or InR. In both cases, we did not observe any enhancement of tissue overgrowth upon dPGC1 depletion, in contrast to what we observed in Yki-overexpressing discs. These results suggest that the sensitivity to dPGC1 depletion is specific to Yki-driven overgrowth and is not a general feature of hyperplastic growth induced by other oncogenes.

      These results are shown in Fig S4 and in lines 195-202.

      4) There are a few simple control experiments the authors should provide to clarify the relationship between Yki and dPGC1: - Are Yki levels affected by dPGC1 depletion?

      To address the potential regulation of Yki by dPGC1, we performed quantitative PCR (qPCR) analysis to measure the expression levels of yki and its well-established transcriptional targets—Cyclin E, Diap1, and bantam—in wing discs depleted of dPGC1. As shown in Fig. S3, we did not detect significant changes in the transcript levels of yki or its target genes, suggesting that the enhanced phenotype observed upon dPGC1 depletion is unlikely to be driven by increased Yki expression or activity. These results indicate that dPGC1 does not strongly influence Yki expression or activity. These new results are presented in lines 190-194.

      • Does dPGC1 knockdown alone modify the expression of the genes tested in Fig.3A? In other words, is this upregulation specific of the Yki-overexpression context?

      We have conducted this analysis, and the results are now presented in new Fig S7. While the trend is similar to that observed in tumors with both Yki depletion and dPGC1 depletion, the magnitude of change is smaller compared to the context of Yki overexpression. This is described in the text in lines 273-277.

      • Does dPCG1 knockdown also stabilise CycE in the absence of Yki overexpression or does the stabilisation of CycE occur only in Yki tumors?

      To address this, we examined Cyclin E levels in wing imaginal discs mutant for dPGC1 alone. Our analysis did not reveal any detectable changes in Cyclin E levels under these conditions. These findings suggest that the upregulation of Cyclin E is not a general consequence of dPGC1 loss, but rather a feature specific to the context of Yki overactivation. The corresponding data are now included in Fig S14 of the revised manuscript. In the text, it can be found in lines 442-448.

      5) Figure 3C-G: it is not clear how the authors can quantify the length of 3D structures like mitochondria from 2D TEM images (unless they have done volume reconstruction from consecutive sections) and no details are provided in the methods. The quantification of mitochondrial length has to be performed rigorously as it is a key part of the paper.

      We agree that TEM provides only 2D profiles of 3D mitochondrial structures, and that this does not allow for precise volumetric reconstruction. In our study, we measured the longest axis of mitochondria visible in thin TEM sections, which is a commonly used 2D proxy for mitochondrial length in the literature (e.g., PMID: 36367943 and PMID: 38637532). To avoid misunderstandings, we have clarified in the Material and Methods section that the reported values represent apparent mitochondrial length in 2D sections, not true 3D length. To enhance the accuracy of these estimates, we measured more than three tissues per genotype, multiple regions per tissue, several cells per region, and various fields of view per cell.

      Minor Comments:

      1) Line 51: "Mitochondria are highly dynamics organelles." should be "Mitochondria are highly dynamic organelles."

      We have corrected that mistake. Thanks!

      2) Introduction: the authors should summarise the known physiological functions of PGC1α in order to put their findings in context.

      We have added a section in the introduction (lines 66-81) summarizing the known physiological functions of PGC1α

      3) lines: 121-3: "...depletion of dPGC1...did not have a major impact on adult wing size and shape (Fig 1B, C)." There is a small but statistically significant difference so the authors should state this in the text.

      We have revised the text to acknowledge that dPGC1 depletion leads to a modest but statistically significant reduction in wing size. In addition to the original analysis, we have now included further experiments to strengthen this point. Specifically, we analyzed wings from flies homozygous for the dPGC11 allele (also known as dPGC1KG08646; see FlyBase: https://flybase.org/reports/FBal0148128) and confirmed a small but significant reduction in both wing disc and adult wing size compared to controls (this can be found in Fig. 1 and Fig. S1). These results support the conclusion that, although dPGC1 is dispensable for viability and gross morphology, it contributes to normal wing growth. These new results can be found in lines 144-153.

      4) Figure 5A (Cyclin E western blot): the authors should show molecular weight markers. In the revised version of our manuscript, we are including the molecular markers as indicated by the reviewer. These can be found in Fig S12.

      Reviewer #2 (Significance (Required)):

      The manuscript by Sew et al builds on the previous work by Nagaraj et al to explore the role of mitochondrial function in tumors driven by disruption of the Hippo pathway. In particular, the authors identify dPGC1 as a transcription factor that limits Yki-driven mitochondrial fusion and tissue growth. Interestingly, they further show that Yki/PGC1-depleted tumors are highly sensitive to Cyclin E levels, due to post-transcriptional Cyclin E increase. These results further our knowledge of how Yki drives growth and how mitochondria participate in oncogenic transformation. With appropriate revision as outlined above (for example exploring whether the mechanism proposed is Yki-specific), the manuscript will be of broad interest to developmental and cancer biologists.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript presents compelling evidence that dPGC1 acts as a context-dependent tumor suppressor in Drosophila by modulating mitochondrial dynamics and limiting Yorkie (Yki)-induced oncogenic growth. By leveraging the Drosophila wing imaginal disc as a model, the authors investigate how dPGC1 depletion exacerbates Yki-driven tissue overgrowth, mitochondrial hyperfusion, Cyclin E upregulation, and DNA damage, leading to tumorigenesis. The study provides valuable insights into the interplay between mitochondrial dynamics and cancer, with implications for understanding metabolic regulation in oncogenesis. While the findings are significant and well-aligned with the field, certain aspects of the experimental design, data presentation, and mechanistic insights require further attention to enhance clarity, reproducibility, and impact. Below, I outline my major concerns and recommendations.

      We addressed concerns about RNAi efficiency and protein-level validation with new qPCR data and mutant analysis. We provided EM and confocal evidence of mitochondrial changes. We clarified non-autonomous effects and quantified Mmp1 and F-actin and added data on miro and Opa1 manipulations. Cyclin E quantification was expanded using multiple Western replicates and a validated mutant allele, and we included new data on mitochondrial membrane potential to assess functional consequences.

      Our detailed responses to each point raised by the reviewer are provided below.

      Major Points

      1. One point is the knock-down efficiency of dPGC1 on the mRNA level, which is between 30 to >50% (Fig. S1C). This is not too strong, so the question arises how severly the protein levels are affected. If possible, an antibody staining with quantification should be performed. From these data it cannot be concluded dPGC1 is not required for normal development, half the dose could be sufficient. How do wings look like when the ap-GAL4 driver is used for dPGC1 knock-down, as this is the driver used in the subsequent experiments? Reviewer 1 also raised concerns about the potential inefficiency of the RNAi treatment in revealing a function during normal wing growth. We agree with both reviewers that the interpretation of the RNAi efficiency data requires clarification.

      The qPCR analysis shown in former Fig. S1C was performed using wing discs from flies expressing UAS-dPGC1-RNAi under the control of the MS1096-Gal4 driver. However, as shown in current Fig. 1C, MS1096-Gal4 is not expressed uniformly across the wing disc. Some regions remain RFP-negative, indicating that the RNAi construct is not active in all cells. As a result, the measured mRNA levels likely underestimate the true knockdown efficiency. This is because the qPCR includes mRNA from both RNAi-expressing and non-expressing cells, diluting the apparent reduction in transcript levels.

      To address this limitation and more accurately assess RNAi efficiency, we repeated the qPCR analysis using a ubiquitous driver (actin-Gal4) to ensure uniform expression of the RNAi construct. Under these conditions, we observed a more substantial knockdown, with dPGC1 mRNA levels reduced to approximately 25% of control levels (this is shown in current Fig S2). This result indicates that the RNAi line is more effective than initially suggested by the MS1096-Gal4-based analysis.

      To complement our RNAi-based analysis, we additionally used a mutant strain carrying a characterized allele of dPGC1 (dPGC11, also known as dPGC1KG08646; see FlyBase: https://flybase.org/reports/FBal0148128). This genetically distinct approach allowed us to validate and strengthen our findings regarding dPGC1 function. Flies homozygous for this allele exhibited a modest but statistically significant reduction in both wing disc and adult wing size. These results support the conclusion that dPGC1 is required for normal wing growth and development. The new data are now included in Figure 1 and referenced in the main text (lines 144-151).

      Unfortunately, we cannot perform antibody staining due to the unavailability of antibodies against dPGC1.

      How does the wing disc look like when dPGC1 is overepressed together with Yki?

      In response, we performed additional experiments to assess whether dPGC1 overexpression influences Yki-driven overgrowth. We also analyzed the expression of mitochondrial fusion genes (dMfn and Opa1) in this context. As shown in new Fig. S8, dPGC1 overexpression in Yki-overexpressing wing discs did not significantly affect tissue growth, nor did it alter the mRNA levels of key fusion regulators, dMfn and Opa1. These findings suggest that the transcriptional upregulation of mitochondrial fusion genes observed upon dPGC1 depletion is not a general consequence of altered dPGC1 levels, but rather a specific response that emerges in the context of Yki activation. We now present and discuss these results in the revised manuscript (lines 278-285), highlighting the sensitized nature of mitochondrial remodeling in an oncogenic environment driven by Yki signaling.

      In Fig 2D (but also in Fig. 2C) not only cells in the dorsal but also in the ventral comparmtent seem to overproliferate. Either this is a mis-conception or it is a non-autonomous effect from interfering with Yki and dPGC1 in the vertrnal compartment. In either cases, this has to be clarified.

      Ventral cells are not labelled by GFP. Fig 3D shows a tumor in which GFP-negative cells are not present, suggesting that they are not overproliferating but rather being eliminated. This phenomenon is consistent with cell competition, a well-characterized process in which transformed or tumorigenic cells outcompete and eliminate neighboring wild-type cells. We have previously described this behavior in wing disc tumors (PMID: 26853367; DOI: 10.1016/j.cub.2015.12.042), and it likely contributes to the expansion of the tumor mass by removing surrounding normal tissue also in this context.

      In Fig. 2F-H quantification of Mmp1 and F-actin is missing. Mmp1 is a JNK target, so the authors could do in addition an anti-phospho JNK antibody staining.

      In response, we have performed those quantifications. They are now included in Fig 2M, N.

      In Fig. 3: how does the mitochondrial network look like in the wing disc periopodial epithelium using the Gug>Yki+dPGC1 genotype? Is it similar to Gug>dMfn or Gug>miro?

      We attempted to perform this analysis; however, Yki overexpression under the control of Gug-GAL4 resulted in larval lethality, likely due to GAL4 activity in essential tissues such as the central nervous system. As a result, we were only able to induce transgene expression for 24 hours before lethality occurred.

      At this early point, no detectable changes in mitochondrial morphology were observed in the peripodial membrane, likely because the duration of transgene expression was insufficient to elicit phenotypic alterations in this specific tissue. Therefore, while we aimed to compare this genotype to Gug>dMfn and Gug>miro, the technical limitations prevented a conclusive analysis.

      We have prepared a representative figure for the reviewer (below), showing representative confocal images of wing discs showing mito-GFP and Dapi in the three genotypes indicated in the Fig.

      In Fig. 3I: what is really the mitochondrion? It would be good to outline the region(s) that was/were measured.

      To improve clarity, we have repeated the electron microscopy (EM) analysis and now provide representative images that more clearly illustrate mitochondrial morphology in the different genotypes analyzed. These updated images presented in Fig 4 better highlight the structural alterations observed upon genetic manipulation and help clarify the basis for our morphological assessments.

      We have extended our analysis and have assessed mitochondrial ultrastructure in Yki + dPGC1-RNAi wing disc tumors, with and without dMfn1 downregulation—the most upregulated mitochondrial fusion gene in this tumor context. In Yki + dPGC1-RNAi tumors, mitochondria appeared more elongated, consistent with increased fusion. Upon dMfn1 depletion, we observed a dramatic shift in mitochondrial morphology: mitochondria became larger and more rounded, with disrupted cristae and onion-like structures, indicative of compromised mitochondrial integrity and function (see new Fig 4).

      A quantification of RNAi and overexpression efficiencies of the different transgenes in Fig. 3 is required.

      To assess the efficiency of RNAi-mediated knockdown and transgene overexpression, we performed quantitative PCR (qPCR) using the ubiquitous Actin-Gal4 driver. While we acknowledge that this driver does not replicate the spatial specificity of the periodic membrane Gal4 driver used in the experiments shown in Figure 3 (Gug-Gal4), the latter targets a very limited number of cells within the imaginal disc, making reliable qPCR quantification unfeasible.

      Using Actin-Gal4 allows us to obtain a relative and informative measure of transgene efficiency across the different constructs. These data confirm effective knockdown and overexpression of the relevant genes and are now included in Figure S2.

      In Fig. 4: what is the phenotype when miro is over-expressed in combination with Yki? Or when it is knocked down in the ap>Yki-dPGC1 background? This was the gene tested in Fig. 3 with a clear mitochondrial phenotype

      To address whether miro contributes to Yki-mediated tumor growth, we performed the requested experiments and now include the results in the revised manuscript (see updated Results section, lines 374-377, and new Fig. S11).

      Our data show that overexpression of miro in combination with Yki does not lead to a significant increase in tissue growth or tumor-like phenotypes, in contrast to the effects observed with dMfn or Opa1 overexpression. Similarly, knockdown of miro in the ap>Yki-dPGC1-RNAi background did not suppress tumor growth, indicating that miro is not required for the enhanced proliferation observed in this context.

      These findings suggest that, although miro influences mitochondrial morphology in normal wing discs (as shown in Fig. 3), its role in tumorigenesis is distinct from that of dMfn and Opa1. We have revised the manuscript to clarify the gene-specific contributions of mitochondrial fusion regulators to Yki-driven tumorigenesis. This distinction underscores the complexity of mitochondrial dynamics and highlights that not all fusion-related genes exert the same functional impact in oncogenic settings.

      How does the mitochondrial morphology in the wing disc peripodial epithelium look like in Gug>Opa1RNAi or Gug>Opa1 discs?

      To assess the impact of Opa1 on mitochondrial morphology in the peripodial epithelium of the wing disc, we used the Gug-GAL4 driver to either overexpress or knock down Opa1. Our analysis revealed that Opa1 overexpression led to slightly elongated mitochondria, but did not result in extensive network formation, suggesting a modest enhancement of inner membrane fusion. In contrast, Opa1 knockdown caused clear mitochondrial fragmentation, closely resembling the phenotype observed upon dMfn depletion. These results shown in Fig 3 are consistent with the distinct roles of Opa1 and dMfn in regulating mitochondrial fusion: Opa1 primarily modulates inner membrane fusion and cristae architecture, while dMfn drives outer membrane fusion and network connectivity.

      The corresponding data are presented in Figure 3F, G, and quantified in Figure S9, alongside experiments manipulating other genes involved in mitochondrial dynamics.

      Why have the authors switched between the ap>Yki+dPGCRNAi and the ap>Yki+dPGC1shRNA lines? It would be important to have this series of experiments in the same backgrounds, as KD efficiencies are different (Fig. S1C).

      The primary reason for switching between the dPGC1-RNAi and dPGC1-shRNA lines was practical: the chromosomal insertion sites of the transgenes made certain genetic combinations more feasible with one line over the other. This flexibility significantly facilitated our experimental design and analysis.

      To address concerns regarding knockdown efficiency, we performed a comparative analysis using the ubiquitous actin-GAL4 driver, rather than MS1096-GAL4, which exhibits patchy and dynamic expression in the wing imaginal disc. This allowed us to obtain a more consistent and interpretable measure of mRNA downregulation for both transgenes. Our results show that both lines achieve comparable levels of knockdown, as shown in Figure S2.

      Fig. 5A: proper quantification of Western Blot signals is required. I do not agree that Cyclin E protein levels are elevated in ap>Yki or ap>Yki+dPGC1 discs. Even at the mRNA levels the increase in expression is rather weak. From these results nothing can be concluded.

      We have repeated the Western blot analysis using seven independent membranes to ensure robust quantification of Cyclin E levels in ap>Yki and ap>Yki+dPGC1-RNAi wing discs (Fig 6).

      Although the increase in Cyclin E protein levels is subtle, it is consistent across replicates and statistically significant. We have now included the quantification of these Western blot signals in the revised Figure 6, which supports the conclusion that Cyclin E levels are elevated in ap>Yki+dPGC1 discs.

      We hope this additional data addresses the reviewer’s concern and strengthens the interpretation of our results.

      Knock-down efficiencies for dap and CycE needs to be quantifiec (Fig. 5H-N). Although the rescue experiment with CycE knock down is from the phenotype convincing, it is nonetheless puzzling, as CycE is accodring to Fig. 5A+B hardly upregulated. An independent CycE RNAi line would be useful.

      We have quantified the knockdown efficiency of the dap-RNAi line, and the results are included in Figure S13.

      Regarding Cyclin E, we would like to clarify that we did not use an RNAi line in this experiment. Instead, we employed the CycE-05306 mutant allele in a heterozygous background, which is expected to reduce Cyclin E levels by approximately 50%. The CycE-05306 allele in Drosophila melanogaster is a loss-of-function allele of the Cyclin E gene. This allele carries a P-element insertion in the first intron of the CycE gene, which disrupts normal transcription and reduces Cyclin E expression. In a heterozygous background, as used in your experiments, CycE-05306/+ is expected to reduce Cyclin E levels by approximately 50%, which is typically sufficient to observe genetic interactions or sensitized phenotypes without affecting normal development. This makes it a valuable tool for studying gene dosage effects, particularly in tumor models where Cyclin E activity may be rate-limiting.

      Importantly, this partial reduction does not impair normal tissue growth, but it strongly limits tumor growth in the context of Yki overexpression combined with dPGC1 downregulation, as shown in Figure 6. This selective sensitivity highlights the functional importance of Cyclin E in supporting oncogenic growth driven by Yki and dPGC1 depletion. We believe this provides compelling evidence for Cyclin E’s role in this tumor model.

      Reviewer #3 (Significance (Required)):

      Strengths and Limitations of the Study Strengths Innovative Focus on Mitochondrial Dynamics and Oncogenesis: The study provides compelling evidence linking mitochondrial dynamics, particularly hyperfusion, to tumorigenesis in Drosophila. The identification of dPGC1 as a context-dependent tumor suppressor adds novel insights into the interplay between metabolism and oncogenesis. Comprehensive Use of Drosophila as a Model System: The study leverages the genetic tractability of Drosophila, allowing precise manipulation of mitochondrial regulators and signaling pathways. The use of wing imaginal discs as a model for tumor growth is well-established and appropriate. Integration of Morphological and Genetic Data: The manuscript combines confocal imaging, electron microscopy, and genetic tools to demonstrate the role of dPGC1 in regulating mitochondrial dynamics, Cyclin E levels, and tissue overgrowth. Relevance to Cancer Biology: The findings address key hallmarks of cancer, including deregulated metabolism, genomic instability, and cell cycle misregulation. The study's exploration of these processes in a simple model organism provides a strong basis for translating findings to mammalian systems.

      Limitations Validation of RNAi and Overexpression Efficiency: The knockdown efficiency of dPGC1 on the mRNA level is only moderate (30-50%), and protein-level validation is missing. Without this, the study cannot conclusively demonstrate the role of dPGC1 in normal development or tumorigenesis. Incomplete Mechanistic Insights: The manuscript identifies Cyclin E as a potential driver of tumor growth but does not adequately explore how mitochondrial hyperfusion leads to Cyclin E regulation (e.g., post-transcriptional mechanisms or protein stability). Inconsistencies in Experimental Backgrounds: The study uses different RNAi/shRNA lines and driver combinations inconsistently across experiments, making it difficult to compare results directly. This variability undermines the robustness of the conclusions. Limited Functional Analysis of Mitochondria: While mitochondrial morphology is well-characterized, functional assays (e.g., membrane potential or ATP production) are missing. These would confirm the impact of hyperfusion on cellular energetics and oncogenesis.

      In the revised manuscript, we have addressed each of the concerns raised.

      In addition to that, in the revised version of the manuscript, we have included new experiments to assess mitochondrial functionality in tumors co-expressing Yki and dPGC1-RNAi. Specifically, we analyzed the Mitochondrial Membrane Potential (MMP). We used TMRE staining to evaluate MMP, a key indicator of mitochondrial integrity and oxidative phosphorylation capacity. Our analysis revealed no significant differences in MMP between Yki tumors and Yki + dPGC1-RNAi tumors, suggesting that mitochondrial membrane potential is preserved despite the observed morphological abnormalities. These results are shown in Fig S6. In the text it is discussed in lines 233-243.

      Contribution to Existing Literature The study makes a significant contribution to the growing body of literature on the metabolic regulation of cancer by identifying dPGC1 as a tumor suppressor modulating mitochondrial dynamics. Previous work has established the dual roles of mammalian PGC1α in promoting or suppressing cancer depending on context. This study adds depth by demonstrating similar context-dependent effects in a simpler model organism, facilitating further exploration of the molecular mechanisms involved.

      By linking mitochondrial fusion, Yki signaling, and Cyclin E regulation, the manuscript aligns with and expands upon research on Hippo pathway regulation, cancer metabolism, and mitochondrial biology. The findings highlight the importance of integrating metabolic and signaling networks in understanding oncogenesis.

      Community Selection The current form of the manuscript is best suited for a specialized audience, particularly mitochondrial biologists, Drosophila researchers, and Hippo pathway specialists. To engage a broader community, additional work linking these findings to mammalian models or human cancer biology would be necessary.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript presents compelling evidence that dPGC1 acts as a context-dependent tumor suppressor in Drosophila by modulating mitochondrial dynamics and limiting Yorkie (Yki)-induced oncogenic growth. By leveraging the Drosophila wing imaginal disc as a model, the authors investigate how dPGC1 depletion exacerbates Yki-driven tissue overgrowth, mitochondrial hyperfusion, Cyclin E upregulation, and DNA damage, leading to tumorigenesis. The study provides valuable insights into the interplay between mitochondrial dynamics and cancer, with implications for understanding metabolic regulation in oncogenesis. While the findings are significant and well-aligned with the field, certain aspects of the experimental design, data presentation, and mechanistic insights require further attention to enhance clarity, reproducibility, and impact. Below, I outline my major concerns and recommendations.

      Major Points

      1. One point is the knock-down efficiency of dPGC1 on the mRNA level, which is between 30 to >50% (Fig. S1C). This is not too strong, so the question arises how severly the protein levels are affected. If possible, an antibody staining with quantification should be performed. From these data it cannot be concluded dPGC1 is not required for normal development, half the dose could be sufficient. How do wings look like when the ap-GAL4 driver is used for dPGC1 knock-down, as this is the driver used in the subsequent experiments?
      2. How does the wing disc look like when dPGC1 is overepressed together with Yki?
      3. In Fig 2D (but also in Fig. 2C) not only cells in the dorsal but also in the ventral comparmtent seem to overproliferate. Either this is a mis-conception or it is a non-autonomous effect from interfering with Yki and dPGC1 in the vertrnal compartment. In either cases, this has to be clarified.
      4. In Fig. 2F-H quantification of Mmp1 and F-actin is missing. Mmp1 is a JNK target, so the authors could do in addition an anti-phospho JNK antibody staining.
      5. In Fig. 3: how does the mitochondrial network look like in the wing disc periopodial epithelium using the Gug>Yki+dPGC1 genotype? Is it similar to Gug>dMfn or Gug>miro?
      6. In Fig. 3I: what is really the mitochondrion? It would be good to outline the region(s) that was/were measured.
      7. A quantification of RNAi and overexpression efficiencies of the different transgenes in Fig. 3 is required.
      8. In Fig. 4: what is the phenotype when miro is over-expressed in combination with Yki? Or when it is knocked down in the ap>Yki-dPGC1 background? This was the gene tested in Fig. 3 with a clear mitochondrial phenotype. How does the mitochondrial morphology in the wing disc peripodial epithelium look like in Gug>Opa1RNAi or Gug>Opa1 discs? Why have the authors switched between the ap>Yki+dPGCRNAi and the ap>Yki+dPGC1shRNA lines? It would be important to have this series of experiments in the same backgrounds, as KD efficiencies are different (Fig. S1C).
      9. Fig. 5A: proper quantification of Western Blot signals is required. I do not agree that Cyclin E protein levels are elevated in ap>Yki or ap>Yki+dPGC1 discs. Even at the mRNA levels the increase in expression is rather weak. From these results nothing can be concluded.
      10. Knock-down efficincies for dap and CycE needs to be quantifiec (Fig. 5H-N). Although the rescue experiment with CycE knock down is from the phenotype convincing, it is nonetheless puzzling, as CycE is accodring to Fig. 5A+B hardly upregulated. An independent CycE RNAi line would be useful.

      Significance

      Strengths and Limitations of the Study

      Strengths

      Innovative Focus on Mitochondrial Dynamics and Oncogenesis: The study provides compelling evidence linking mitochondrial dynamics, particularly hyperfusion, to tumorigenesis in Drosophila. The identification of dPGC1 as a context-dependent tumor suppressor adds novel insights into the interplay between metabolism and oncogenesis. Comprehensive Use of Drosophila as a Model System: The study leverages the genetic tractability of Drosophila, allowing precise manipulation of mitochondrial regulators and signaling pathways. The use of wing imaginal discs as a model for tumor growth is well-established and appropriate. Integration of Morphological and Genetic Data: The manuscript combines confocal imaging, electron microscopy, and genetic tools to demonstrate the role of dPGC1 in regulating mitochondrial dynamics, Cyclin E levels, and tissue overgrowth. Relevance to Cancer Biology: The findings address key hallmarks of cancer, including deregulated metabolism, genomic instability, and cell cycle misregulation. The study's exploration of these processes in a simple model organism provides a strong basis for translating findings to mammalian systems.

      Limitations

      Validation of RNAi and Overexpression Efficiency: The knockdown efficiency of dPGC1 on the mRNA level is only moderate (30-50%), and protein-level validation is missing. Without this, the study cannot conclusively demonstrate the role of dPGC1 in normal development or tumorigenesis. Incomplete Mechanistic Insights: The manuscript identifies Cyclin E as a potential driver of tumor growth but does not adequately explore how mitochondrial hyperfusion leads to Cyclin E regulation (e.g., post-transcriptional mechanisms or protein stability). Inconsistencies in Experimental Backgrounds: The study uses different RNAi/shRNA lines and driver combinations inconsistently across experiments, making it difficult to compare results directly. This variability undermines the robustness of the conclusions. Limited Functional Analysis of Mitochondria: While mitochondrial morphology is well-characterized, functional assays (e.g., membrane potential or ATP production) are missing. These would confirm the impact of hyperfusion on cellular energetics and oncogenesis.

      Contribution to Existing Literature

      The study makes a significant contribution to the growing body of literature on the metabolic regulation of cancer by identifying dPGC1 as a tumor suppressor modulating mitochondrial dynamics. Previous work has established the dual roles of mammalian PGC1α in promoting or suppressing cancer depending on context. This study adds depth by demonstrating similar context-dependent effects in a simpler model organism, facilitating further exploration of the molecular mechanisms involved.

      By linking mitochondrial fusion, Yki signaling, and Cyclin E regulation, the manuscript aligns with and expands upon research on Hippo pathway regulation, cancer metabolism, and mitochondrial biology. The findings highlight the importance of integrating metabolic and signaling networks in understanding oncogenesis.

      Community Selection

      The current form of the manuscript is best suited for a specialized audience, particularly mitochondrial biologists, Drosophila researchers, and Hippo pathway specialists. To engage a broader community, additional work linking these findings to mammalian models or human cancer biology would be necessary.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this manuscript the authors the investigate the role of the mitochondrial regulatory transcription factor dPGC1 in tissue growth and oncogenic transformation. They show that dPGC1 limits hyperplasia mediated by overexpression of Yki in the Drosophila wing disc, while having no effect on normal growth. dPGC1 depletion in discs overexpressing Yki results neoplastic overgrowth and hyperfused mitochondria, which was dependent on the increased expression of genes involved in promoting mitochondrial fusion. Additionally, the authors show that dPGC1 limits CycE levels post-transcriptionally in Yki tumors.

      Major comments

      1. The authors mention several times in passing in the results a manuscript from the Banerjee lab (Nagaraj et al 2012), which shows that many of the genes the authors of the present manuscript show are upregulated upon Yki overexpression + dPGC1-RNAi compared with Yki overexpression alone are in fact upregulated upon Yki overexpression alone compared with control (dMfn/marf, opa1, miro - while interestingly dPGC1 itself is not affected). Nagaraj et al further show that Yki-overexpressing discs have longer mitochondria suggesting increased fusion even in the absence of dPGC1 depletion. The findings from Nagaraj et al should be mentioned explicitly in the introduction and the relationship between this manuscript and the present work clearly outlined in the discussion.
      2. Given that Yki overexpression alone induces mitochondrial fusion and that dMfn/marf and opa1 depletion suppresses Yki-induced overgrowth (Nagaraj et al), does dPGC1 overexpression also suppress Yki-induced overgrowth? If so, is this correlated with reduction in dMfn/marf and opa1 compared with Yki overexpression alone?
      3. One important question raised by this study is: how specific is the effect of dPGC1 depletion to Yki-driven overgrowth? As Yki-driven overgrowth already have increased mitochondrial length, it is possible that Yki-expressing cells are already sensitised to the effects of dPGC1 depletion. Interestingly, Nagaraj et al show that mitochondrial morphology is not affected upon EGFR activation (hyperplasia) or upon scrib and avl depletion (neoplasia). The authors should therefore test if dPGC1 depletion can potentiate the growth of other hyperplasia drivers such as activated EGFR and InR in the wing disc.
      4. There are a few simple control experiments the authors should provide to clarify the relationship between Yki and dPGC1:
        • Are Yki levels affected by dPGC1 depletion?
        • Does dPGC1 knockdown alone modify the expression of the genes tested in Fig.3A? In other words, is this upregulation specific of the Yki-overexpression context?
        • Does dPCG1 knockdown also stabilise CycE in the absence of Yki overexpression or does the stabilisation of CycE occur only in Yki tumors?
      5. Figure 3C-G: it is not clear how the authors can quantify the length of 3D structures like mitochondria from 2D TEM images (unless they have done volume reconstruction from consecutive sections) and no details are provided in the methods. The quantification of mitochondrial length has to be performed rigorously as it is a key part of the paper.

      Minor Comments:

      1. Line 51: "Mitochondria are highly dynamics organelles." should be "Mitochondria are highly dynamic organelles."
      2. Introduction: the authors should summarise the known physiological functions of PGC1α in order to put their findings in context.
      3. lines: 121-3: "...depletion of dPGC1...did not have a major impact on adult wing size and shape (Fig 1B, C)." There is a small but statistically significant difference so the authors should state this in the text.
      4. Figure 5A (Cyclin E western blot): the authors should show molecular weight markers.

      Significance

      The manuscript by Sew et al builds on the previous work by Nagaraj et al to explore the role of mitochondrial function in tumors driven by disruption of the Hippo pathway. In particular, the authors identify dPGC1 as a transcription factor that limits Yki-driven mitochondrial fusion and tissue growth. Interestingly, they further show that Yki/PGC1-depleted tumors are highly sensitive to Cyclin E levels, due to post-transcriptional Cyclin E increase. These results further our knowledge of how Yki drives growth and how mitochondria participate in oncogenic transformation. With appropriate revision as outlined above (for example exploring whether the mechanism proposed is Yki-specific), the manuscript will be of broad interest to developmental and cancer biologists.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Sew et al. examine the master regulator of mitochondrial biogenesis, dPGC1, in the context of Drosophila wing and larval development. They primarily use confocal imaging to probe the interplay between dPGC1 and an overactive Hippo pathway, driven by overexpression of the main effector protein, Yki. In their study, they find that tumors, driven by overactivity of Yki grow larger when dPGC1 is downregulated, implicating the mitochondrial biogenesis pathway in tumor suppression. Furthermore, in the context of Yki overexpression, they find that levels of Mfn or Opa1 modulate tumor size. Lastly, they show a role of cyclin E in controlling the size of tumors formed by Yki OE + dPGC1 RNAi. The potential role of dPGC1 as a tumor suppressor is interesting because it highlights an emerging recognition of mitochondria in the aetiology of cancer. However, before publication, much of the data in this manuscript should be strengthened by a refinement in the methods/analysis and an increase in orthogonal approaches.

      Major comments:

      The authors indicate that for example, in lines127-28, that neither downregulating or overexpressing dPGC1 affects wing size. However, the quantification in Fig. 1C shows a significant decrease in wing size following RNAi treatment. This decrease is modest, but it is nevertheless significant. It is worth pointing out, too, that the efficiency of the RNAi in Fig. S1C suggests that the conclusions drawn are premature. While a roughly 55% drop in mRNA levels may be statistically significant, it is unclear whether this drop in transcripts corresponds to a commensurate depletion of protein. Moreover, it is unclear, in this context, how much dPGC1 may indeed be necessary to drive a relatively normal program of mitochondrial biogenesis in wing development. To obtain a clear result, it is necessary to show significant depletion of the dPGC1 protein. (Ultimately, if it is the case that dPGC1 is unnecessary for wing development and function, a more coherent line of inquiry would be to find out the reason for this rather than to pivot the story to studying tumorigenesis in larva.)

      In Figure 3H-K, it is not clear why the authors used electron microscopy to evaluate mitochondrial morphology. The very good confocal images in Figure 3C-G show a clear change in mitochondrial morphology following the knockdown of Mfn, Opa1, and Miro. While it is clear from the electron micrographs in Figure H that the mitochondria are enlarged, it is not obvious that this increase in length is a result of increased mitochondrial fusion. Indeed, if the mean form factor were used to quantify the shape, it is likely that in both conditions, the value would be close to 1, indicating more of a round object, and it not obvious whether there would be a difference between the Yki OE versus the YkI OE + dPGC1 RNAi. Therefore, from this data alone, it cannot be concluded that the YkI OE + dPGC1 RNAi condition leads to mitochondrial hyperfusion.

      Figure 4. refers to changes in mitochondrial fusion and fission in tumor formation; however, the authors do not attempt to alter mitochondrial fission factors, so it is not accurate to mention a role of mitochondrial fission, in this context. It must be noted, too, that the authors have not demonstrated that their genetic interventions have actually affected mitochondrial morphology in these experiments. As noted in the previous figure, the Yki OE + dPGC1 RNAi condition showed enlarged mitochondria, but not necessarily hyperfused organelles. Therefore, the downregulation of Mfn or Opa1 in this set of experiments may not necessarily have altered mitochondrial morphology. Perhaps suppression of Mfn or Opa1 would normalize the areas of these evidently swollen mitochondria, but this is unclear without images. Furthermore, it should be appreciated that both Opa1 and Mfn exhibit pleiotropic attributes - e.g., Opa1 not only regulates IMM fusion, but it also modulates the shape and tightness of cristae membranes, specialized sites of oxidative phosphorylation as well as sequestration of cytochrome c, the release of which influences apoptosis (Frezza et al., 2006). At least in mammalian cells, Mfn2 is thought to regulate contacts between mitochondria and endoplasmic reticulum (Naon et al., 2023), which may serve other functions than OMM fusion, such as stabilization of the MAM.

      Figure 5 highlights a connection between dysregulation of mitochondria and Cyclin E, which allows cells to prematurely enter S phase. The data presented here do not offer clarity on whether the enlargement of the tumors results from increase cellular proliferation and/or cell size. The role of the cell cycle adds a layer of complexity to these results, because it is thought that mitochondria undergo fragmentation during the cell cycle to promote an even distribution of the organelle population after mitosis (Taguchi et al., 2007); however, in this manuscript, the authors contend that the downregulation dPGC1 is promoting mitochondrial hyperfusion. It is unclear how and whether cellular division and proliferation would proceed at an accelerated rate in a situation with mitochondrial hyperfusion.

      Minor comments:

      Lines 69-72 contrast the roles of PGC1α and β. It is not clear whether the comparison is of their respective roles in cancer or in normal physiology. In either case, it is important to note that PGC1β has been shown to drive mitochondrial fusion as well as biogenesis through its control of MFN2, among other factors (Liesa et al., 2008).

      Although this study focuses on PGC1, the authors do not seem to site the original literature from the Spiegelman lab.

      There are 10-20 grammatical errors throughout the text.

      Referee Cross-commenting

      There is agreement among the referees that the potential role of PGC1 as a tumor suppressor is interesting and significant. However, various aspects of this work require attention prior to publication. For example, there needs to be a complete knock down of PGC1 to come to any conclusion as to its role in wing development. The methods for analyzing mitochondrial morphology need to be clarified and be consistent with standards in the field of mitochondrial dynamics. Also, the authors need to quantify their Western blots to obtain accurate assessments of protein levels. Generally, the study relies too heavily on overexpression experiments; understanding the potential role of mitochondria in regulating the Hippo pathway should include various knockdown and/or knockout models.

      Significance

      Overall, the authors show an interesting dampening effect of dPGC1 on growth of Yki-driven tumors. This data could be relevant for elucidating how dysregulation of the Hippo signalling pathway can underlie tumorigenesis.

      The narrative arc of the study, however, appears to lack a focused line of inquiry. Figure 1 highlights an attempt to modulate Drosophila wing size and/or structure by downregulating dPGC1, but to no effect. Although examination of the efficiency of the RNAi revealed that the transcripts were still present in significant quantities; so, the conclusion that dPGC1 is dispensable for wing formation is premature. To have clarity on this point, it would be necessary to completely knockdown the gene, preferably by showing a total loss of protein. This should be feasible for the authors, since they showed Western blotting in Figure 5A. In any event, it seems that this negative data led the authors to study the Hippo pathway in the larval stage. This transition from Figure 1 to 2 seemed somewhat arbitrary and leads to a rather disjointed sense of the main line of inquiry around dPGC1.

      It is important to note, too, that the authors highlight a role of mitochondrial dynamics in the pathway of Yki-driven tumor formation; however, they only directly evaluate mitochondrial dynamics in this context in a single assay, namely, Figure 3H-K, and this quantification is likely inaccurate because the mitochondria in the Yki OE + dPGC1 RNAi condition seem to be substantially enlarged, circular structures. It is critical to keep in mind that mitochondrial enlargement does not necessarily stem from hyperfusion. It could come from a decrease in the activity of Drp1 or result from an imbalance between mitochondrial biogenesis and mitophagy.

      A marked limitation of this study is the overuse of rather artificial manipulations of transcriptional regulatory pathways. The study would benefit a lot from investigation of the loss of function of components of the Hippo pathway rather than just OE of Yki.

      My expertise is in mitochondrial biology, with specialization in super-resolution imaging, mitochondrial dynamics and membrane architecture. I have also worked in the interface between mitochondrial physiology and cancer. With this perspective, I think that the authors uncover a potentially interesting role of PGC1 as a tumor suppressor.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC- 2025-03073

      Corresponding author(s): Shaul Yogev

      1. General Statements [optional]

      We kindly thank our reviewers for their enthusiasm, thoughtful feedback, and constructive suggestions on how to strengthen our manuscript. Below, we provide a point-by-point response to reviewer comments and outline the experiments we will do to address every concern that has been raised.

      2. Description of the planned revisions

      • *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This interesting study uses an unbiased genetic screen in C. elegans to identify SAX-1/NDR kinase as a regulator of dendritic branch elimination. Loss of SAX-1 results in an excess branching phenotype that is striking and highly penetrant. The authors identify several additional regulators of branch elimination (SAX-2, MOB-1, RABI-1, RAB-11.2) by using a candidate genetic screen aimed at factors that interact physically or genetically with SAX-1. They propose that SAX-1 acts by promoting membrane retrieval based on the nature of these interactors and the results of an imaging-based in vivo assay for endocytic puncta.

      Major comments.

      1. My biggest concern is that the phenotypes are only observed in temperature-sensitive dauer-constitutive mutant backgrounds, and not in wild-type dauers. That is, wild-type animals exiting dauer do not require SAX-1 for dendrite elimination. While this does not undermine the importance of the results, it does require more explanation. The authors write that "the requirement for sax-1... relies on specific physiological states of the dauer stage," but I do not understand what this means. Are they saying that daf-7 and daf-2 dauers are in a different "physiological state" than wild-type dauers? In what way? What is the evidence for this? A more rigorous explanation is needed. We agree that this is puzzling, and we thank the reviewer for recognizing that this does not undermine the importance of the results. There is ample evidence that daf-2 and daf-7 differ from starvation-induced dauers. For example, a recent preprint finds that the transcriptomes of these two mutants at dauer cluster much closer to each other than to starvation-induced dauers (Corchado et al. 2024). Older work has noted other differences, such as the time the dauer entry decision is made (Swanson and Riddle 1981), the synchronicity of dauer exit, the ability to force dauer entry in daf-d mutants, as well as additional dauer-unrelated phenotypes (reviewed in Karp 2018). We agree with the reviewer that this merits further clarifications and will perform the experiments suggested by the reviewer below:

      To me, the simplest genetic explanation is that daf-7 and daf-2 are partially required for branch retraction in a manner redundant with sax-1, and the ts mutants are not fully wild-type at 15C. Thus, the sax-1 requirement is revealed only in these mutant backgrounds. Can the authors examine starvation-induced dauers of daf-7 or daf-2 raised continuously at 15C?

      We will do this experiment.

      daf-7 and daf-2 ts strains can form "partial dauers" that have a dauer-like appearance but are not SDS resistant. Could the difference between partial dauers and full dauers account for the difference in sax-1-dependence? The authors could use SDS selection of the daf-7 strain at 25C to ensure they are examining full dauers.

      We tested daf-7 mutants with 1% SDS when we set up the system – they are fully dauer at 25°C and are SDS sensitive after exit. We will repeat this important control with daf-7; sax-1 double mutants.

      The Bargmann lab has created a daf-2 FLP-OUT strain (ky1095ky1087) that allows cell-type-specific removal of daf-2. Could this be used to test for a cell-autonomous role of daf-2 in IL2Q related to branch elimination?

      We can attempt this experiment. However, since IL2 promoters turn on prior to dauer, the interpretation would not be straightforward – it would be hard to exclude that a cell autonomous defect in dauer entry does not account for the IL2 dauer exit phenotype, even if branching appears normal.

      These ideas are not a list of specific experiments the authors need to complete, rather they are meant to illustrate some possible approaches to the question. Whatever approach they use, it is important for them to more rigorously explain why SAX-1 is not required for branch removal in wild-type animals.

      We completely agree. We will carry out the 15°C experiment, examine morphological characteristics and test SDS resistance. In addition, we will test neuronal markers that differ between dauers and non-dauers to determine whether the mutants are full or partial dauers at the relevant timepoints.

      The SAX-2 localization (Fig. 4) and endocytosis assay (Fig. 6) results were not clear to me from the data shown. Overall a more rigorous analysis and presentation of the data would be important to make these conclusions convincing. This may involve refining the data presentation in the figures, modifying the claims (e.g., "we propose" vs "we find"), or saving some of the data to be more fully explored in a future paper. In my view, these figures are the biggest weak point of the manuscript and also are not important for the central conclusions (which are well supported and convincing), indeed these results are barely mentioned in the Abstract or last paragraph of Introduction.

      We agree that the analysis and presentation of Figures 4 and 6 need to be improved. The presentation has already been updated, and the figures are clearer now. In the revision, we will increase sample size to provide stronger conclusions, consolidate some of the analysis and further improve presentation. While we agree with the reviewer that conclusions from these figures are not as strong as those drawn from genetic experiments, they do complement and support the conclusions of those other figures.

      • In Fig. 4D, why is SAX-2 visible throughout the entire neuron and why is the "punctum" marked with an arrow also seen in the tagRFP channel? One gets the impression that some of the puncta may be background, bleed-through, or artifacts due to cell varicosities.

      There is no bleed-through: this is most evident by looking at the brightest signals in the cell body (now labelled with an asterisk in a zoomed-out image) and noting that they do not bleed between channels. In sax-1 mutants, the SAX-2::GFP puncta are very obvious and distinguishable from the tagRFP channel. In control, SAX-2::GFP is very faint in the dendrite, so we increased the contrast to allow visualization. The reviewer is correct that under these conditions, some puncta look like the cytosolic fill. In the revision, we will re-analyze the data and will not consider these as bona-fide SAX-2 puncta, but rather cytosolic SAX-2 that accumulates due to constrictions and varicosities in the dendrite.

      • Related to both Fig. 4 and Fig. 6, where does SAX-1 localize in IL2Q in dauer and post-dauer? Does its expression or localization change during branch retraction? Does it co-localize with SAX-2 or endocytic puncta?

      We generated an endogenously tagged sax-1 with a 7xspGFP11 tag; however, this was below detection in the IL2s. For the revisions, we can test an overexpressed cDNA construct.

      **Referee cross-commenting**

      I think we all touched on similar points. I wanted to follow up on Reviewer 3's comment, "Is the failure to eliminate branches an indication of incomplete dauer recovery? Do sax-1 mutants retain additional characteristics of dauer morphology in post dauer adults." I thought this was an excellent point. It made me wonder if that might explain why the defect is only seen in daf-7 and daf-2 mutant backgrounds - maybe these strains retain partial dauer traits even after exit. Is there a specific experiment that they could do? Did you have specific characteristics of dauer morphology in mind for them to check? (Ideally something in the nervous system that can be scored quantitatively.)

      Please see response to point #1 regarding experiments we will do to confirm the “dauer state” of daf-7 and daf-7; sax-1 double mutants.

      Reviewer #1 (Significance (Required)):

      A major strength of this work is the pioneering use of a novel system to study neuronal branch retraction. C. elegans has provided a powerful model for studying how dendrite branches form, but much less attention has been paid to how excess neuronal branches are removed. The post-dauer remodeling of IL2Q neurons provides an exciting and dramatic physiological example to explore this question.

      This paper is notable for taking the first steps towards developing this innovative model. It does exactly what is needed at the outset of a new exploration - a forward genetic screen to discover the main regulators of the process. Using a combination of classical and modern genetic approaches, the authors bootstrap their way to a sizeable list of factors and a solid understanding of the properties of this system, for example that retraction of higher vs lower order dendrites show different genetic requirements.

      We thank the reviewer for recognizing the novelty and significance of our work.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors establish C. elegans IL2 neurons as a system in which to study dendrite pruning. They use the system to perform a genetic screen for pruning regulators and find an allele of sax-1. Unexpectedly sax-1 is only required for post-dauer pruning in two different genetic backgrounds that induce dauer formation, but not starvation-induced dauer formation. Sax-1/NDR kinase reduction has previously been associated with increased outgrowth and branching in other systems, so this is a new role for this protein. However, the authors show that proteins that work with Sax-1 in other systems, like sax-2/fry, also play a role in this pathway. The genetic experiments are beautiful and the findings are all clearly explained and strongly supported. The authors also examine sax-2 localization, which localizes sax-1 in other systems, and show it in puncta in dendrites that increase with dauer exit, consistent with function at the time of pruning. They also show that membrane trafficking regulators associated with NDR kinases function in the same pathway here, hinting that endocytosis may play a role during pruning as in Drosophila. The link to endocytosis was a little weak (see Major point below). Overall, this study describes a new system to study pruning and identifies NDR/fry/Rabs as regulators of pruning during dauer exit. The work is very high quality and both the imaging and genetics are extremely well done.

      We thank the reviewer for their positive assessment of the manuscript.

      Major points

      1. The only place where there were any questions about the data was the last figure (6G and I). Here they use uptake of GFP secreted from muscle as a readout of endocytosis in IL2 neurons. They nicely show that more internalized puncta accumulate as animals exit dauer. The claim that this is reduced in sax-1 mutants doesn't seem to match the images shown well. In the image there are many more puncta in the GFP channel and much more accumulation of the RFP-tagged receptor everywhere. It seems like some additional analysis of this data is important to fully capture what is going on and whether this really represents an endocytic defect. We agree and will provide additional data in Figure 6. The specific discrepancy between the image and the quantification is because we showed a single focal plane rather than a projection. This does not capture all the puncta in a neurite. The current version shows a projection, making it evident that the mutants has fewer puncta compared to the control.

      Reviewer #2 (Significance (Required)):

      Neurite pruning is important in all animals with neurons. Genetic approaches have primarily been applied to the problem using Drosophila, so identifying a new model system in which to study it is an important step. Using this system, a pathway known to function in a different context is linked to pruning. Thus the study provides new insights into both pruning and this pathway.

      We thank the reviewer for the positive assessment of our study’s significance.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      Summary: Figueroa-Delgado et al. use a C. elegans neuro plasticity model to examine how dendrites are eliminated upon recovery from the stress induced larval stage, dauer. The authors performed a mutagenesis screen to identify novel regulators of dendrite elimination and revealed some surprising results. Branch elimination mechanism varies between 2{degree sign}, 3{degree sign}, and 4{degree sign} branches. The NDR kinase, SAX-1 and it's interactors (SAX-2 and MOB-2) are required for elimination of second and third order branches but not fourth order branches. Interestingly they showed that branch elimination varies depending on the stimulus of dendrite outgrowth such that the NDR kinase is required for branch elimination after genetically inducing the dauer stage but is not required if dauers are produced through food deprivation. The authors go a step further to include a small candidate screen looking at various pathways of membrane remodeling and identify additional regulators of dendrite elimination related to membrane trafficking including RABI-1, RAB-8, RAB-10, and RAB-11.2.

      We thank the reviewer for their time and suggestions below

      Major comments:

      • While I find the data promising and exciting, several of the experiments have concerningly low sample sizes. Fig 3G, Fig 4G, Fig 5J and L, and Fig 6I all contain data sets that are fewer than 10 animals. Sample sizes should be stated specifically in the figure legends for all data represented in the graphs. We thank the reviewer for finding the data exciting. We agree that the sample sizes in some panels is low and will increase it in the revised version. Sample sizes are now specifically listed in the figure legends.

      • All statements based on data not shown should be amended to include the data as a supplemental figure or edited to omit the statement based on withheld data. We agree. Some “not shown” data are already added to the current version of the manuscript and the rest will be added to the fully revised version, or the statements will be omitted.

      • Rescue experiments (Fig 2J) should demonstrate failure to rescue from neighboring tissue types (hypodermis and muscle) to conclude cell autonomous rescue rather than a broadly acting factor. Thank you for the suggestion. We will use a hypodermal promoter and a muscle promoter driving SAX-1 cDNA expression to strengthen the claim of cell autonomy.

      • Fig 4 needs quantification of higher order branches and SAX-2 proximity to branch nodes as these are discussed in the text. We will add this quantification.

      Minor comments:

      • Fig 1C-F, It appears like the shy87 allele produces animals of significantly different body sizes. It would improve rigor to normalize the dendrite coverage to body size in the quantification. We do not see a biologically meaningful size difference between shy87 and control, it may be the specific image shown. We will confirm this by measuring animal size for the final revision.

      • Is the failure to eliminate branches an indication of incomplete dauer recovery? Do sax-1 mutants retain additional characteristics of dauer morphology in post dauer adults. This important point was also raised by Reviewer 1. We will test SDS sensitivity, morphological markers, and molecular markers to determine the dauer “state” of the mutants used in this study. The results will be included in the final revision.

      • The text references multiple transgenic lines tested in Fig 2I-J but only one line is shown. Additional lines were visually examined under a fluorescent compound microscope but not imaged or quantified. We will add this quantification to the final revision.

      • Fig 4F, Additional timepoints would enhance the sax-1 localization result and might provide insight into mechanism of action for sax-1. We will add the localization in post-dauer adults.

      • Fig 6I Control and sax-1(ky491) example images should be provided in the supplement. We will add these images to the final revision.

      **Referee cross-commenting**

      I agree that we shared many of the same concerns.

      There are several general assays for dauer characteristics that could be used here to determine if the post-dauer animals retain other characteristics of the dauer stage in addition to IL2 branches (SDS resistance, alae remodeling, pharyngeal bulb morphology, nictation behavior). The nictation behavior has been connected very nicely with IL2 neurons (Junho Lee's group). Additionally, FLP dendrites occupy the same space as the IL2 branches and outgrowth in post-dauers occurs in coordination with IL2 branch elimination - this might be another optional experiment, to check if FLP growth is impeded by persistent IL2 branches. All of these could be quantified similar to how the authors have already established with their IL2 model (FLP dendrite branches) or with a binary statistic.

      Please see responses to Reviewer 1 and 3 above for the list of experiments to determine whether the animals fail to completely enter or exit dauer.

      Reviewer #3 (Significance (Required)):

      SIGNIFICANCE ============ These results describe a new role for the NDR kinase complex in dendrite pruning that has clinical significance to our understanding of human brain development and human health concerns in which pruning is dysregulated, such as observed in the case of autism. The authors use an established neuro-plasticity, C. elegans model (Schroeder et al. 2013) which provides a tractable and reproduceable platform for discovering the mechanism of dendrite pruning. These results would influence future work in the fields of cell biology of the neuron and disease models of brain development.

      My expertise is in the field of C. elegans neuroscience and stress biology and have sufficient expertise to evaluate all aspects of this work.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1

      • In Fig. 4C, the distinction between puncta in the primary or higher-order dendrites is not clear to me, and several puncta that I would have scored as primary are marked as higher-order.

      We apologize for a mistake in the arrowhead color and overall presentation of this figure. It has been fixed in the current version.

      • Related to this, in Fig. 4B are the two arrows meant to be white as in the top panel, or yellow as in the bottom panel?

      We thank Reviewer #1 for their observation, and we apologize for our oversight. We fixed this in the current version.

      • In Fig. 4, where in the head are we looking? It would help to show a more low-magnification view of the entire cell.

      We added zoomed-out images and indicated where the zoomed in insets are taken from. We thank the reviewer for helping us improve the clarity of the data.

      • The main sax-1 phenotype is increased SAX-2 puncta in dauer, but the branch retraction defect is in post-dauers. How is this relevant to the phenotype?

      This is a very good point. The increase in SAX-2 puncta in sax-1 mutants is stronger during dauer-exit than in dauer, consistent with this being the time when SAX-1 functions. We agree that some earlier activity of SAX-1 cannot be excluded, and we do not assume that the effect on SAX-2 completely accounts for the pruning defects. This is now acknowledged in the text. However, given that both proteins function together in pruning, and given that the effect is strongest during dauer exit, we do believe that this data is informative and worth showing.


      • The number of SAX-2 puncta in sax-1 mutants decreases almost to normal in post dauers. Is there a correlation between the number of remaining branches and the number of SAX-2 puncta? That is, do the many wild-type animals with "excess" SAX-2 puncta also fail to retract branches?

      There is no correlation. In other words, the number of SAX-2 puncta does not instruct the extent of pruning. Please note the quantifications underestimate the number of SAX-2 puncta in the mutants, since they were only done on the primary dendrite. This is necessary because the mutant and control have different arbor size, so only branch order that can be appropriately compared are primary dendrites.

      • The control post-dauer data in Fig. 4F and 4H are identical (re-used data) but the corresponding control dauer data in Fig. 4F and 4G are different. What is going on here?

      We thank the reviewer for raising this point and apologize for the oversight in data presentation. In the revised manuscript, we now show all control and experimental data integrated into a single graph, ensuring that each dataset is represented accurately to provide a comparison between dauer and post dauer recovery conditions.


      • Why are sample sizes so small for both strains in Fig. 4G compared to Fig. 4F and 4H?

      We sincerely apologize for this mistake, some of the data was erroneously grouped in the original submission. The revised version contains an updated number of neurons, presented on the same graph, and in the final revision we will further increase sample size. We apologize again for this error.

      • In Fig. 6C, why are the tagRFP (blue) puncta larger than the neurite? Aren't these meant to represent vesicles inside the surrounding neurite? One gets the impression that this is bleed-through from the GFP channel.

      Based on EM, both an endocytic punctum and the diameter of the neuron are smaller than a single pixel. The apparent difference in size in fluorescence microscopy is because the puncta are brighter (they contain more membrane) and thus appear larger. In the current version, the improved presentation of the figure contains zoomed out images that clearly show that there is no bleed-through.

      • In Fig. 6E and 6F, why are there no tagRFP (blue) puncta? Is CD8 not endocytosed at all if it lacks the nanobody sequence? One would expect the tagRFP (blue) signal to be the same in both strains and simply to lack yellow if the nanobody is not present.

      CD8 lacks clear endocytosis motifs, which is why it is advantageous for labelling neurites and testing endocytosis when paired with an endocytic signal (Lee and Luo 1999; Kozik et al. 2010). Conversely, extracellular GFP binding to a membrane GFP antibody can induce endocytosis (for example, see (Tang et al., 2020)), likely by inducing clustering, although we are not familiar with work that explored the mechanism. In the updated version we included a rare example of an mCD8 punctum.

      • The authors report a decrease in endocytic events in sax-1, but qualitatively it looks like there are vastly more puncta inside the neuron in Fig. 6H than in 6G.

      We apologize for the presentation in the original version of Figure 6. This impression was because we showed single focal planes that only captured some of the signal. In the revised version we show projections, which makes it evident that there are fewer endocytic events in the mutant.

      • In Fig. 6E and 6H, why are there so many GFP (yellow) puncta outside the neuron? What are these structures and why are they absent in the strain with the nanobody?

      These puncta are secreted or muscle-associated GFP that has not been internalized by IL2Q neurons. They are present in all strains in this figure, this can be clearly seen in the zoomed-out images that have been added to the updated figure.

      • What is the large central blue structure in Fig. 6H - is this the soma? - and why are puncta in this region not counted?

      This is indeed the soma. In the updated version this can be clearly seen in the zoom-out. The large puncta in the soma were not counted because they may arise from the fusion of an unknown number of smaller puncta, and their precise number cannot be determined at the resolution of fluorescence microscopy.

      • minor: there is text reading "40-" in the bottom panel of Fig. 6H. It is visible when printed but not on screen - adjust levels in Photoshop to reveal it.

      We thank the reviewer for catching this oversight, it is now fixed.

      Minor points:

      1. At several points the authors emphasize the relationship of neurite remodeling to stress, e.g. Abstract and Discussion: "we adapted C. elegans IL2 sensory dendrites as a model [of...] stress-mediated dendrite pruning". It seems unnecessary and potentially misleading to treat this as a neuronal stress response. First, it conflates organismal and cellular stress - there is no reason to think that IL2 neurons are under cellular stress in dauer. In fact parasitic nematodes go through dauer-like stages as part of healthy development and probably have similar remodeling of IL2. Second, dendrite pruning occurs during dauer exit, which is the opposite of a stress response - it reflects a return to favorable conditions. We agree. We modified the abstract and discussion to avoid conflating organismal stress (the alleviation of which is relevant for triggering pruning) and cellular stress. Thank you for pointing this out.

      In Fig. 1A, C. elegans is shown going directly from L1 to dauer in response to unfavorable conditions, which is incorrect. Animals proceed through L2 (in many cases actually an alternative L2d pre-dauer) and then molt into dauer (an alternative L3 stage) after completing L2.

      We updated the schematic to include the L2d stage where commitment to dauer entry or resumption to reproductive development is made.

      In Fig. 1B, please check if it is correct that hypodermis contacts the pharynx basement membrane as drawn. The schematic in the top panel makes it look like there is a single secondary branch and the quaternary branches are similar in length to the primary dendrite. The schematic in the bottom panel makes it look like the entire neuron is a small fraction of the length of the pharynx. Could these be drawn closer to scale?

      The hypodermis does contact the pharynx basement membrane. We redrew the schematic for clarity.

      Reviewer #2

      For context, it might be helpful to know whether branching of other dendrites is increased in sax-1 mutants (as expected based on phenotypes in other animals) or decreased like IL2 neurons.

      We examined the branching pattern of PVD, a polymodal nociceptive neuron (new Supplemental Figure 3). We find no significant difference between control and sax-1 or sax-2 mutants, suggesting that these genes function in the context of pruning. Recent work (Zhao et al. 2022) confirms that sax-1 is not required for PVD branching.

      Minor:

      "shy87 mutant dauers showed a minor reduction in secondary and tertiary branches compared to control (Figure 1G). These results indicate that shy87 is specifically required for the elimination of dauer-generated dendrite branches." Maybe temper the specificity claim some as the reduction in branches is definitely there.

      We agree, the claim was tempered.

      "three complimentary approaches" should be complementary

      Thank you for noticing. We fixed this.

      "In control animals, SAX-2 was mostly concentrated in the cell body (data not shown)" It might be nice to include some overview images that show the cell body for completeness.

      We added zoomed-out images to the revised figure, thank you for the suggestion.

      Reviewer #3


      Minor comments:


      • Fig 1G-H, are shy87 second and third order branch counts statistically different between dauer and post dauer adults? This comparison would strengthen the claim that these order branches fail to eliminate all together rather than undergo a partial elimination. We added this to Figure S2. The shy87 mutants show a complete failure in eliminating secondary branches (i.e. no difference between dauer and post-dauer) and a strong but incomplete defect in eliminating tertiary branches.

      • Fig 4B-E Indicate branch order in the images, this is unclear and a point that is focused on in the text. Done.

      • Discussion of Fig 1G from the text claims that shy87 is specifically required for branch elimination yet the data shows significant defects in branch outgrowth as well. This raises the question, are the branches abnormally stabilized that results in early underdevelopment and late atrophy? Authors should acknowledge alternative hypotheses. We agree and will revise the text accordingly. The difference between shy87 and control dauers, while statistically significant, is relatively minor and can only be detected by careful quantification, it is not apparent from looking at the images (in contrast for example to rab-8 and rab-10 mutants, where we acknowledge in the text that their branching defects might affect subsequent pruning.

      • Authors reference a branch elimination process but don't outline what this would entail and where their results fit in. We apologize for being unclear. Given that sax-1 and sax-2 function together, one would intuitively expect to see SAX-2 being reduced in sax-1 mutants, yet the opposite is observed. On potential explanation is that SAX-1 does not directly control SAX-2 abundance, but that clearance of SAX-2 is part of the pruning process that both proteins regulate. This would explain the enrichment of SAX-2 in sax-1 mutants. However, additional models cannot be excluded, and we acknowledge this in the revised text.

      References:

      Corchado, Johnny Cruz, Abhishiktha Godthi, Kavinila Selvarasu, and Veena Prahlad. 2024. “Robustness and Variability in Caenorhabditis Elegans Dauer Gene Expression.” Preprint, bioRxiv, August 26. https://doi.org/10.1101/2024.08.15.608164.

      Karp, Xantha. 2018. “Working with Dauer Larvae.” WormBook, August 9, 1–19. https://doi.org/10.1895/wormbook.1.180.1.

      Kozik, Patrycja, Richard W Francis, Matthew N J Seaman, and Margaret S Robinson. 2010. “A Screen for Endocytic Motifs.” Traffic (Copenhagen, Denmark) 11 (6): 843–55. https://doi.org/10.1111/j.1600-0854.2010.01056.x.

      Lee, T., and L. Luo. 1999. “Mosaic Analysis with a Repressible Cell Marker for Studies of Gene Function in Neuronal Morphogenesis.” Neuron 22 (3): 451–61.

      Swanson, M. M., and D. L. Riddle. 1981. “Critical Periods in the Development of the Caenorhabditis Elegans Dauer Larva.” Developmental Biology 84 (1): 27–40. https://doi.org/10.1016/0012-1606(81)90367-5.

      Tang, Rui, Christopher W Murray, Ian L Linde, et al. n.d. “A Versatile System to Record Cell-Cell Interactions.” eLife 9: e61080. https://doi.org/10.7554/eLife.61080.

      Zhao, Ting, Liying Guan, Xuehua Ma, Baohui Chen, Mei Ding, and Wei Zou. 2022. “The Cell Cortex-Localized Protein CHDP-1 Is Required for Dendritic Development and Transport in C. Elegans Neurons.” PLOS Genetics 18 (9): e1010381. https://doi.org/10.1371/journal.pgen.1010381.


      4. Description of analyses that authors prefer not to carry out

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Figueroa-Delgado et al. use a C. elegans neuro plasticity model to examine how dendrites are eliminated upon recovery from the stress induced larval stage, dauer. The authors performed a mutagenesis screen to identify novel regulators of dendrite elimination and revealed some surprising results. Branch elimination mechanism varies between 2{degree sign}, 3{degree sign}, and 4{degree sign} branches. The NDR kinase, SAX-1 and it's interactors (SAX-2 and MOB-2) are required for elimination of second and third order branches but not fourth order branches. Interestingly they showed that branch elimination varies depending on the stimulus of dendrite outgrowth such that the NDR kinase is required for branch elimination after genetically inducing the dauer stage but is not required if dauers are produced through food deprivation. The authors go a step further to include a small candidate screen looking at various pathways of membrane remodeling and identify additional regulators of dendrite elimination related to membrane trafficking including RABI-1, RAB-8, RAB-10, and RAB-11.2.

      Major comments:

      • While I find the data promising and exciting, several of the experiments have concerningly low sample sizes. Fig 3G, Fig 4G, Fig 5J and L, and Fig 6I all contain data sets that are fewer than 10 animals. Sample sizes should be stated specifically in the figure legends for all data represented in the graphs.
      • All statements based on data not shown should be amended to include the data as a supplemental figure or edited to omit the statement based on withheld data.
      • Rescue experiments (Fig 2J) should demonstrate failure to rescue from neighboring tissue types (hypodermis and muscle) to conclude cell autonomous rescue rather than a broadly acting factor.
      • Fig 4 needs quantification of higher order branches and SAX-2 proximity to branch nodes as these are discussed in the text.

      Minor comments:

      • Fig 1C-F, It appears like the shy87 allele produces animals of significantly different body sizes. It would improve rigor to normalize the dendrite coverage to body size in the quantification.
      • Fig 1G-H, are shy87 second and third order branch counts statistically different between dauer and post dauer adults? This comparison would strengthen the claim that these order branches fail to eliminate all together rather than undergo a partial elimination.
      • Discussion of Fig 1G from the text claims that shy87 is specifically required for branch elimination yet the data shows significant defects in branch outgrowth as well. This raises the question, are the branches abnormally stabilized that results in early underdevelopment and late atrophy? Authors should acknowledge alternative hypotheses.
      • Is the failure to eliminate branches an indication of incomplete dauer recovery? Do sax-1 mutants retain additional characteristics of dauer morphology in post dauer adults.
      • The text references multiple transgenic lines tested in Fig 2I-J but only one line is shown.
      • Fig 4B-E Indicate branch order in the images, this is unclear and a point that is focused on in the text.
      • Fig 4F, Additional timepoints would enhance the sax-1 localization result and might provide insight into mechanism of action for sax-1.
      • Authors reference a branch elimination process but don't outline what this would entail and where their results fit in.
      • Fig 6I Control and sax-1(ky491) example images should be provided in the supplement.

      Referee cross-commenting

      I agree that we shared many of the same concerns.

      There are several general assays for dauer characteristics that could be used here to determine if the post-dauer animals retain other characteristics of the dauer stage in addition to IL2 branches (SDS resistance, alae remodeling, pharyngeal bulb morphology, nictation behavior). The nictation behavior has been connected very nicely with IL2 neurons (Junho Lee's group). Additionally, FLP dendrites occupy the same space as the IL2 branches and outgrowth in post-dauers occurs in coordination with IL2 branch elimination - this might be another optional experiment, to check if FLP growth is impeded by persistent IL2 branches. All of these could be quantified similar to how the authors have already established with their IL2 model (FLP dendrite branches) or with a binary statistic.

      Significance

      These results describe a new role for the NDR kinase complex in dendrite pruning that has clinical significance to our understanding of human brain development and human health concerns in which pruning is dysregulated, such as observed in the case of autism. The authors use an established neuro-plasticity, C. elegans model (Schroeder et al. 2013) which provides a tractable and reproduceable platform for discovering the mechanism of dendrite pruning. These results would influence future work in the fields of cell biology of the neuron and disease models of brain development.

      My expertise is in the field of C. elegans neuroscience and stress biology and have sufficient expertise to evaluate all aspects of this work.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, the authors establish C. elegans IL2 neurons as a system in which to study dendrite pruning. They use the system to perform a genetic screen for pruning regulators and find an allele of sax-1. Unexpectedly sax-1 is only required for post-dauer pruning in two different genetic backgrounds that induce dauer formation, but not starvation-induced dauer formation. Sax-1/NDR kinase reduction has previously been associated with increased outgrowth and branching in other systems, so this is a new role for this protein. However, the authors show that proteins that work with Sax-1 in other systems, like sax-2/fry, also play a role in this pathway. The genetic experiments are beautiful and the findings are all clearly explained and strongly supported. The authors also examine sax-2 localization, which localizes sax-1 in other systems, and show it in puncta in dendrites that increase with dauer exit, consistent with function at the time of pruning. They also show that membrane trafficking regulators associated with NDR kinases function in the same pathway here, hinting that endocytosis may play a role during pruning as in Drosophila. The link to endocytosis was a little weak (see Major point below). Overall, this study describes a new system to study pruning and identifies NDR/fry/Rabs as regulators of pruning during dauer exit. The work is very high quality and both the imaging and genetics are extremely well done.

      Major points

      1. The only place where there were any questions about the data was the last figure (6G and I). Here they use uptake of GFP secreted from muscle as a readout of endocytosis in IL2 neurons. They nicely show that more internalized puncta accumulate as animals exit dauer. The claim that this is reduced in sax-1 mutants doesn't seem to match the images shown well. In the image there are many more puncta in the GFP channel and much more accumulation of the RFP-tagged receptor everywhere. It seems like some additional analysis of this data is important to fully capture what is going on and whether this really represents an endocytic defect.
      2. For context, it might be helpful to know whether branching of other dendrites is increased in sax-1 mutants (as expected based on phenotypes in other animals) or decreased like IL@ neurons.

      Minor:

      "shy87 mutant dauers showed a minor reduction in secondary and tertiary branches compared to control (Figure 1G). These results indicate that shy87 is specifically required for the elimination of dauer-generated dendrite branches." Maybe temper the specificity claim some as the reduction in branches is definitely there.

      "three complimentary approaches" should be complementary

      "In control animals, SAX-2 was mostly concentrated in the cell body (data not shown)" It might be nice to include some overview images that show the cell body for completeness.

      Significance

      Neurite pruning is important in all animals with neurons. Genetic approaches have primarily been applied to the problem using Drosophila, so identifying a new model system in which to study it is an important step. Using this system, a pathway known to function in a different context is linked to pruning. Thus the study provides new insights into both pruning and this pathway.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This interesting study uses an unbiased genetic screen in C. elegans to identify SAX-1/NDR kinase as a regulator of dendritic branch elimination. Loss of SAX-1 results in an excess branching phenotype that is striking and highly penetrant. The authors identify several additional regulators of branch elimination (SAX-2, MOB-1, RABI-1, RAB-11.2) by using a candidate genetic screen aimed at factors that interact physically or genetically with SAX-1. They propose that SAX-1 acts by promoting membrane retrieval based on the nature of these interactors and the results of an imaging-based in vivo assay for endocytic puncta.

      Major comments.

      1. My biggest concern is that the phenotypes are only observed in temperature-sensitive dauer-constitutive mutant backgrounds, and not in wild-type dauers. That is, wild-type animals exiting dauer do not require SAX-1 for dendrite elimination.

      While this does not undermine the importance of the results, it does require more explanation. The authors write that "the requirement for sax-1... relies on specific physiological states of the dauer stage," but I do not understand what this means. Are they saying that daf-7 and daf-2 dauers are in a different "physiological state" than wild-type dauers? In what way? What is the evidence for this? A more rigorous explanation is needed.

      To me, the simplest genetic explanation is that daf-7 and daf-2 are partially required for branch retraction in a manner redundant with sax-1, and the ts mutants are not fully wild-type at 15C. Thus, the sax-1 requirement is revealed only in these mutant backgrounds. Can the authors examine starvation-induced dauers of daf-7 or daf-2 raised continuously at 15C?

      daf-7 and daf-2 ts strains can form "partial dauers" that have a dauer-like appearance but are not SDS resistant. Could the difference between partial dauers and full dauers account for the difference in sax-1-dependence? The authors could use SDS selection of the daf-7 strain at 25C to ensure they are examining full dauers.

      The Bargmann lab has created a daf-2 FLP-OUT strain (ky1095ky1087) that allows cell-type-specific removal of daf-2. Could this be used to test for a cell-autonomous role of daf-2 in IL2Q related to branch elimination?

      These ideas are not a list of specific experiments the authors need to complete, rather they are meant to illustrate some possible approaches to the question. Whatever approach they use, it is important for them to more rigorously explain why SAX-1 is not required for branch removal in wild-type animals. 2. The SAX-2 localization (Fig. 4) and endocytosis assay (Fig. 6) results were not clear to me from the data shown. Overall a more rigorous analysis and presentation of the data would be important to make these conclusions convincing. This may involve refining the data presentation in the figures, modifying the claims (e.g., "we propose" vs "we find"), or saving some of the data to be more fully explored in a future paper. In my view, these figures are the biggest weak point of the manuscript and also are not important for the central conclusions (which are well supported and convincing), indeed these results are barely mentioned in the Abstract or last paragraph of Introduction.

      • In Fig. 4, where in the head are we looking? It would help to show a more low-magnification view of the entire cell.
      • In Fig. 4D, why is SAX-2 visible throughout the entire neuron and why is the "punctum" marked with an arrow also seen in the tagRFP channel? One gets the impression that some of the puncta may be background, bleed-through, or artifacts due to cell varicosities.
      • In Fig. 4C, the distinction between puncta in the primary or higher-order dendrites is not clear to me, and several puncta that I would have scored as primary are marked as higher-order.
      • Related to this, in Fig. 4B are the two arrows meant to be white as in the top panel, or yellow as in the bottom panel?
      • The main sax-1 phenotype is increased SAX-2 puncta in dauer, but the branch retraction defect is in post-dauers. How is this relevant to the phenotype?
      • The number of SAX-2 puncta in sax-1 mutants decreases almost to normal in post dauers. Is there a correlation between the number of remaining branches and the number of SAX-2 puncta? That is, do the many wild-type animals with "excess" SAX-2 puncta also fail to retract branches?
      • The control post-dauer data in Fig. 4F and 4H are identical (re-used data) but the corresponding control dauer data in Fig. 4F and 4G are different. What is going on here?
      • Why are sample sizes so small for both strains in Fig. 4G compared to Fig. 4F and 4H?
      • In Fig. 6C, why are the tagRFP (blue) puncta larger than the neurite? Aren't these meant to represent vesicles inside the surrounding neurite? One gets the impression that this is bleed-through from the GFP channel.
      • In Fig. 6E and 6F, why are there no tagRFP (blue) puncta? Is CD8 not endocytosed at all if it lacks the nanobody sequence? One would expect the tagRFP (blue) signal to be the same in both strains and simply to lack yellow if the nanobody is not present.
      • In Fig. 6E and 6H, why are there so many GFP (yellow) puncta outside the neuron? What are these structures and why are they absent in the strain with the nanobody?
      • What is the large central blue structure in Fig. 6H - is this the soma? - and why are puncta in this region not counted?
      • The authors report a decrease in endocytic events in sax-1, but qualitatively it looks like there are vastly more puncta inside the neuron in Fig. 6H than in 6G.
      • minor: there is text reading "40-" in the bottom panel of Fig. 6H. It is visible when printed but not on screen - adjust levels in Photoshop to reveal it.
      • Related to both Fig. 4 and Fig. 6, where does SAX-1 localize in IL2Q in dauer and post-dauer? Does its expression or localization change during branch retraction? Does it co-localize with SAX-2 or endocytic puncta?

      Minor points:

      1. At several points the authors emphasize the relationship of neurite remodeling to stress, e.g. Abstract and Discussion: "we adapted C. elegans IL2 sensory dendrites as a model [of...] stress-mediated dendrite pruning". It seems unnecessary and potentially misleading to treat this as a neuronal stress response. First, it conflates organismal and cellular stress - there is no reason to think that IL2 neurons are under cellular stress in dauer. In fact parasitic nematodes go through dauer-like stages as part of healthy development and probably have similar remodeling of IL2. Second, dendrite pruning occurs during dauer exit, which is the opposite of a stress response - it reflects a return to favorable conditions.
      2. In Fig. 1A, C. elegans is shown going directly from L1 to dauer in response to unfavorable conditions, which is incorrect. Animals proceed through L2 (in many cases actually an alternative L2d pre-dauer) and then molt into dauer (an alternative L3 stage) after completing L2.
      3. In Fig. 1B, please check if it is correct that hypodermis contacts the pharynx basement membrane as drawn. The schematic in the top panel makes it look like there is a single secondary branch and the quartenary branches are similar in length to the primary dendrite. The schematic in the bottom panel makes it look like the entire neuron is a small fraction of the length of the pharynx. Could these be drawn closer to scale?

      Referee cross-commenting

      I think we all touched on similar points. I wanted to follow up on Reviewer 3's comment, "Is the failure to eliminate branches an indication of incomplete dauer recovery? Do sax-1 mutants retain additional characteristics of dauer morphology in post dauer adults." I thought this was an excellent point. It made me wonder if that might explain why the defect is only seen in daf-7 and daf-2 mutant backgrounds - maybe these strains retain partial dauer traits even after exit. Is there a specific experiment that they could do? Did you have specific characteristics of dauer morphology in mind for them to check? (Ideally something in the nervous system that can be scored quantitatively.)

      Significance

      A major strength of this work is the pioneering use of a novel system to study neuronal branch retraction. C. elegans has provided a powerful model for studying how dendrite branches form, but much less attention has been paid to how excess neuronal branches are removed. The post-dauer remodeling of IL2Q neurons provides an exciting and dramatic physiological example to explore this question.

      This paper is notable for taking the first steps towards developing this innovative model. It does exactly what is needed at the outset of a new exploration - a forward genetic screen to discover the main regulators of the process. Using a combination of classical and modern genetic approaches, the authors bootstrap their way to a sizeable list of factors and a solid understanding of the properties of this system, for example that retraction of higher vs lower order dendrites show different genetic requirements.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their positive comments. Our manuscript is to our knowledge the first to investigate the role of VAIL (V-ATPase—ATG16L1 induced LC3 lipidation), a form of CASM (Conjugation of ATG8s to single membranes) in SARS-CoV-2 replication. We demonstrate that SARS-CoV-2 Envelope (E) induces VAIL and this contributes to viral replication, including by using a reverse genetics system to make an E mutant virus. There have been many high quality studies examining the role of canonical autophagy in SARS-CoV-2 replication and our manuscript does not argue that all or even most LC3 lipidation during infection is via VAIL. We will try to make this point more clearly in the text. We do not think this detracts from the novelty and importance of our manuscript.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      • Figueras-Novoa et al present a short report demonstrating the induction of LC3 lipidation on single membranes by SARS-CoV-2 through a noncanonical autophagy pathway referred to as VAIL. The authors utilize elegant genetic tools to show that the induction of LC3 lipidation upon viral infection is mainly due to VAIL rather than canonical autophagy. They demonstrate that the activity of the viral E protein that can cause neutralization of acidic vesicles leads to the activation of non-canonical LC3 lipidation on single membranes. Interestingly, the authors also conclude that the impairment of VAIL leads to a reduction of viral load as a result of a defect in later stages of viral infection, although the underlying mechanism was not further explored. *

      • Overall, this is an elegant and well controlled study that provides a clear conclusion. I only have some minor comments.*

      We thank the reviewer for their assessment of our manuscript.

      In some experiments, LC3 lipidation does not appear to be fully disrupted upon VAIL inhibition (e.g. Fig.'s 1H, 3D, 4A). As other labs have shown that SARS-CoV2 blocks autophagic flux, this could be further clarified in this manuscript as both VAIL and autophagy may be co-induced upon viral infection.

      We agree with the reviewer that there is a contribution of canonical macroautophagy to the LC3B lipidation observed in SARS-CoV-2. We will extend the discussion in the manuscript to clarify this point for the readers.

      Can the authors test the induction of LC3 lipidation in cells expressing K490 mutant of ATG16L1 in ATG16L1 KO cells to compare them with ATG16L1-ATG13 double knockouts?

      The western blot in figure 3F (quantified in Figure 3G) shows LC3B lipidation in response to E expression in ATG16L1-ATG13 double knock out cells reconstituted with wild type ATG16L1 but not in cells complimented with ATG16L1 K490A mutant. We agree that the referee’s suggestion to perform these experiments in the context of infection would be informative. However in spite of numerous attempts, we have so far been unable to generate a cell clone fully devoid of ATG16L1 in a cell line that can be productively infected with SARS-CoV-2. For reasons unclear to us there appears to be a very low level of residual ATG16L1 activity despite multiple different CRISPR/Cas9 targeting attempts. The suggested complementation experiments might still be informative in the context of low level ATG16L1 expression so we will pursue this. Alternatively, as a contingency we can try to produce SARS-CoV-2 infectable cells with mutations in ATG16L1’s binding partner V1H, this interaction is required for VAIL. A further contingency could be to assess LC3B lipidation during infection and treatment with a Vps34 inhibitor, which inhibits canonical autophagy.

      Minor points: * * The difference between Fig. 1F&G is unclear and why the authors are including both analyses. Similarly figures 4G&H.

      We included both metrics to show that the decrease in LC3B lipidation in cells expressing SopF during infection is robust and observed in two separate readouts. While spot area measures the area of infected cells covered by GFP-LC3B fluorescence, spot intensity is a reading of the intensity of the area defined in an infected cell as being LC3 positive. Theoretically, these measurements could change in different ways. For example, if the same amount of lipidated LC3 were to distribute over a larger area of the cell. We prefer to keep both measurements in the manuscript.

      The authors should show boxed colocalisation of all images, including negative controls. For examples, the authors have shown boxed magnifications in only the lowest panel in Figure 2A but not the upper two panels. Figures 4E&F should include boxed examples. This serves to clarify both positive and negative colocalisation events.

      Boxed magnifications will be added to all images.

      • Reviewer #1 (Significance (Required)): *

      • Overall an elegant and well controlled study demonstrating the induction of non-canonical LC3 conjugation on single membranes (VAIL) during SARS-CoV2 infection. A further exploration of canonical autophagy (as previously published by others) in addition to VAIL would enhance this study.*

      As the reviewer noted, several excellent studies have explored canonical autophagy during SARS-CoV-2 infection, many of which we cite in our manuscript. Our focus, however, is to demonstrate that SARS-CoV-2 E induces LC3 lipidation via VAIL. We believe that exploring the diverse roles of canonical autophagy mechanisms in SARS-CoV-2 infection is beyond the scope of this study.

      *This study is of interest to researchers studying autophagy, viruses, immunology, single membrane LC3 lipidation, and lysosomes as well as potentially clinicians treating SARS-CoV2 infecteted individuals. *

      • This reviewer is experienced in autophagy research.*

      We thank the reviewer for this assessment of our manuscript.

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)): *

      • Major Comments *

      • Figure 1D does not very clearly show an overlap between V1D and LC3B. Both proteins seem broadly present across the cell and there is no easily identifiable change in V1D distribution upon infection. As such the overlay may be purely stochastic. The authors should quantify the observed co-localization events across multiple cells and biological replicates and compare them to other protein(s) with a similar cellular distribution pattern.*

      We agree there is no obvious change in V1D staining on infection. The images in Figure 1D are purely intended to illustrate that LC3 and the V-ATPase can colocalise, not to demonstrate a change in V-ATPase distribution or to suggest a direct interaction. We will make this point more clearly in the text. We will also carry out analyses of the kind (see also response to the first two Minor Comments). We would be happy to provide an alternative method of visualising the V-ATPase (we could use any suitable antibody to the V-ATPase, or the bacterial effector SidK) if required. In response to reviewer 3’s comments, we will carry out a pull-down experiment to test the association of the V-ATPase and ATG16L1 during E expression, as this is a key interaction during VAIL activation.

      Based on Figure 2F the authors suggest that virus entry is unaffected by the inhibition of VAIL in early timepoints. However, according to the figure legend, the timepoint used is 7hpi, while 2D uses 24hpi. Some SARS-CoV-2 papers suggest 7-10 hours is sufficient time to release new virions (Ban-On et al., 2020). As such 7hpi can not necessarily be seen as an early time point. Did the authors test earlier ones? Also, based on this, would it be possible that the effects observed at 24hpi are actually secondary infections, meaning that the virus utilizes pathway components for virion production and a lack thereof reduces infectivity of newly formed virions? In this case it would be interesting to set up an assay that can distinguish between primary and secondary infection to study both individually more closely.

      Whereas 7 hours may be sufficient to release new virions, it is not sufficient to establish infections in other cells – this is why we chose that time point. The observation that there is no difference in the percentage of infected cells at 7 h p.i. (figure 2F) led us to suggest that viral entry is unaffected . We then confirmed this through the pseudovirus assay in Figure 2G, where no difference is found between SopF and mCherry expressing cells. For this assay, GFP-expressing, replication incompetent, lentiviral particles pseudotyped with Spike from different SARS-CoV-2 lineages were used to transduce mCherry and SopF expressing cells. A change in the percentage of GFP-positive cells would indicate an effect on viral entry, but no such change was observed in SopF-expressing cells.

      We agree with the reviewer that the effects observed at 24 hpi are likely due to a defect in subsequent rounds of infection, since no difference was observed at 7 hpi or with our pseudovirus assay. We will attempt to make this point in the text as clearly as possible.

      The authors nicely show in their study an involvement of VAIL in SARS-CoV-2 mediated LC3 lipidation. However, the observed effects are relatively moderate in several experiments, indicating that there may be another contributor to the observed phenotype. It would be nice to highlight this in the discussion and debate potential mechanisms that are causing the observed effects during infection.

      We agree with the reviewer’s analysis. We have discussed the contribution of canonical autophagy in the second paragraph of the discussion, but we will expand on this in a revised manuscript. E expression levels are moderate during infection, other structural proteins such as N and M are present in much higher amounts. Since E is the key protein in VAIL initiation, a moderate effect of VAIL inhibition in perhaps expected. Nonetheless this still plays a crucial role in the viral life cycle.

      *Minor Comments *

      • The re-localization events shown in Fig 3A should be quantified.*

      This quantification of GFP-LC3 relocalisation will be carried out and included.

      • The co-localization events displayed in Fig 4A should be quantified.*

      The quantification of V1D, E and GFP-LC3 will be carried out and included.

      For Figure 2H-K the authors perform KDs of ATG16L1 and ATG13. While the results for the two specific proteins are certainly convincing, the authors would strengthen their argument by testing additional proteins in the autophagy pathway to support their claim that VAIL but not autophagy affects protein abundance of N (OPTIONAL).

      As discussed in response to reviewer 1, we will attempt to infect ATG16L1 KO cells reconstituted with a K490A ATG16L1 mutant, which is an established tool and has been validated to be deficient in VAIL but not canonical autophagy.

      ***Referee cross-commenting** *

      • Overall I agree with the comments of my co-reviewers and I think the suggested experiments/comments are sensible. *
      • I in part already eluted to it my analysis, but I tend to agree with reviewer 3 on the limited effect VAIL seems to have on LC3b lipidation.*

      As outlined above in response to reviewer 1 and below to reviewer 3, we agree that there is a modest contribution of VAIL to overall LC3 lipidation, which correlates with a modest amount of E expression in SARS-CoV-2 infection. VAIL is clearly important for the viral life cycle, thus whatever the proportion of LC3 lipidation attributable to this pathway it must be biologically significant.

      *Reviewer #2 (Significance (Required)): *

      • While previous publications have shown interaction between SARS-CoV2 and autophagy, the authors of this manuscript demonstrate that V-ATPase-ATG16L1 induced LC3 lipidation (VAIL) is activated during infection and affects viral replication. *

      • This study provides an interesting new aspect to host-SARS_CoV-2 interactions. *

      • The manuscript is of interest for people studying virus-host cell interaction, as well as for researchers in the fields of infectious diseases, specifically SARS-CoV2, and autophagy/VAIL*.

      We thank the reviewer for their assessment of our manuscript.

      R*eviewer #3 (Evidence, reproducibility and clarity (Required)): *

      • The interaction of SARS-CoV-2 with canonical autophagy has been well documented. However, whether SARS-CoV-2 infection induces and benefits from non-canonical autophagy is unclear. In this manuscript, the authors demonstrated that SARS-CoV-2 infection induces V-ATPase-ATG16L1-induced LC3 lipidation (VAIL), a form of non-canonical autophagy in which LC3 is conjugated to single membranes. The SARS-CoV-2 envelope protein, through its ion channel activity, triggers the V-ATPase proton pump and induces VAIL during SARS-CoV-2 infection. Inhibiting VAIL during SARS-CoV-2 infection with SopF, a Salmonella effector, attenuates SARS-CoV-2 egress. *

      • While these findings are interesting and demonstrate that SARS-CoV-2 infection triggers VAIL for its own benefit, the mechanism by which VAIL promotes SARS-CoV-2 replication remains unclear. Moreover, the contribution of VAIL to LC3 lipidation during SARS-CoV-2 infection appears to be minimal, as blocking VAIL through SoPF expression only marginally reduced LC3B lipidation (Fig. 1H). Therefore, the contribution of VAIL to LC3 lipidation during SARS-CoV-2 infection is minimal.*

      We thank the reviewer for their assessment of our manuscript. As we have already alluded to in our response, we agree that only part of the LC3 lipidation observed during infection can be attributed to VAIL. There is a reproducible effect on viral replication which we have demonstrated in multiple ways, therefore the contribution of VAIL is of biological importance.

      *Comments: *

      • The authors show that the ion channel activity of E is essential for VAIL induction during SARS-CoV-2 infection. Since V-ATPase recruits the ATG16L complex to induce VAIL, and to clarify how SARS-CoV-2 infection triggers VAIL, the authors should examine whether SARS-CoV-2 infection or the expression of E induces V-ATPase-ATG16L interaction and whether this interaction is disrupted when SopF is expressed.*

      We agree with the reviewer that this would be an informative experiment. We can carry out this experiment in an E expression system, rather than infection. This is due to the difficulty of getting enough material to carry out this kind of pull-down experiment in infected cells (at the time of writing these experiments still have to be carried out in CL3).

      • Since the authors suggest that expression of SopF attenuates viral exit, one would expect that the number of N-positive cells will increase in SopF-expressing cells compared to the mCherry control cells. However, as shown in Figure 2D, this is not the case. Could the authors discuss why N-positive cells will be reduced in SopF-expressing cells when viral egress is impeded in these cells*?

      This is a reflection of multi-cycle kinetics. N is still very strongly expressed in infected cells, even after virions have egressed. SARS-CoV-2 can infect VAIL-deficient cells and expresses the same levels of N prior to subsequent rounds of infection (at 7 hours after infection for example). Egress in VAIL-deficient, SopF-expressing cells is defective. Therefore, fewer cells will be infected in subsequent rounds of infection in SopF expressing cells, resulting in fewer N-positive cells in the SopF expressing cell population (most obvious after 24 hours).

      Figure 2H. The authors show that knockdown of ATG16L1 reduces the expression of N during SARS-CoV-2 infection compared to the controls. To confirm that knockdown of ATG16L1, which is required for both canonical autophagy and VAIL, reduces N staining via VAIL, the authors should examine the impact of SopF expression on N levels in ATG16L KD cells. This experiment will confirm if the reduction in N staining in ATG16L1 KD cells is due to VAIL.

      As stated in the response to reviewer 1, we can attempt this experiment in an ATG16L1 KO system complemented with K490A ATG16L1, which is deficient in VAIL and not canonical autophagy.

      • Figure 2J. The quality of the Western blot data is poor.*

      In this western the exposure is deliberately turned up to show that minimal ATG13 was left after knock down. We will also show the full blot with less exposure – this will demonstrate high quality.

      Also, N appears as a single band in Figure 2J, but appears as double bands in Figures 2A and H. Could the authors explain this?

      An extra band can be seen in 2J for N. However, as the reviewer points out, the intensity of the lower band is fainter than in 2A or 2H. The biology of SARS-CoV-2 N is interesting and complicated, with different truncated isoforms and phosphorylation patterns observed (see for example Mears et al., 2025 PMID:39836705). We observed changes in abundance of the second band between experiments, but this did not obviously depend on VAIL. We therefore consider this to be beyond the scope of this investigation.

      *Reviewer #3 (Significance (Required)): *

      • This manuscript proposes a role for VAIL in LC3 lipidation during SARS-CoV-2 infection. While the findings are interesting, VAIL only marginally contributes to LC3 lipidation during SARS-CoV-2 infection. Therefore, the significance of VAIL to LC3B lipidation during SARS-CoV-2 infection is unclear.*

      Our experiments show unambiguously that VAIL contributes to viral replication. Therefore even if As alluded to above, we do not think a further investigation of canonical macroautophagy and SARS-CoV-2 would enhance the quality of our manuscript. We will try to make our description of the contribution of macroautophagy clearer in the revised manuscript (without providing a full literature review). We also do not think that exploring the nature of the multiple N bands on western blot is within the scope of this paper.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The interaction of SARS-CoV-2 with canonical autophagy has been well documented. However, whether SARS-CoV-2 infection induces and benefits from non-canonical autophagy is unclear. In this manuscript, the authors demonstrated that SARS-CoV-2 infection induces V-ATPase-ATG16L1-induced LC3 lipidation (VAIL), a form of non-canonical autophagy in which LC3 is conjugated to single membranes. The SARS-CoV-2 envelope protein, through its ion channel activity, triggers the V-ATPase proton pump and induces VAIL during SARS-CoV-2 infection. Inhibiting VAIL during SARS-CoV-2 infection with SopF, a Salmonella effector, attenuates SARS-CoV-2 egress.

      While these findings are interesting and demonstrate that SARS-CoV-2 infection triggers VAIL for its own benefit, the mechanism by which VAIL promotes SARS-CoV-2 replication remains unclear. Moreover, the contribution of VAIL to LC3 lipidation during SARS-CoV-2 infection appears to be minimal, as blocking VAIL through SoPF expression only marginally reduced LC3B lipidation (Fig. 1H). Therefore, the contribution of VAIL to LC3 lipidation during SARS-CoV-2 infection is minimal.

      Comments:

      The authors show that the ion channel activity of E is essential for VAIL induction during SARS-CoV-2 infection. Since V-ATPase recruits the ATG16L complex to induce VAIL, and to clarify how SARS-CoV-2 infection triggers VAIL, the authors should examine whether SARS-CoV-2 infection or the expression of E induces V-ATPase-ATG16L interaction and whether this interaction is disrupted when SopF is expressed.

      Since the authors suggest that expression of SopF attenuates viral exit, one would expect that the number of N-positive cells will increase in SopF-expressing cells compared to the mCherry control cells. However, as shown in Figure 2D, this is not the case. Could the authors discuss why N-positive cells will be reduced in SopF-expressing cells when viral egress is impeded in these cells?

      Figure 2H. The authors show that knockdown of ATG16L1 reduces the expression of N during SARS-CoV-2 infection compared to the controls. To confirm that knockdown of ATG16L1, which is required for both canonical autophagy and VAIL, reduces N staining via VAIL, the authors should examine the impact of SopF expression on N levels in ATG16L KD cells. This experiment will confirm if the reduction in N staining in ATG16L1 KD cells is due to VAIL.

      Figure 2J. The quality of the Western blot data is poor. Also, N appears as a single band in Figure 2J, but appears as double bands in Figures 2A and H. Could the authors explain this?

      Significance

      This manuscript proposes a role for VAIL in LC3 lipidation during SARS-CoV-2 infection. While the findings are interesting, VAIL only marginally contributes to LC3 lipidation during SARS-CoV-2 infection. Therefore, the significance of VAIL to LC3B lipidation during SARS-CoV-2 infection is unclear.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Major Comments

      Figure 1D does not very clearly show an overlap between V1D and LC3B. Both proteins seem broadly present across the cell and there is no easily identifiable change in V1D distribution upon infection. As such the overlay may be purely stochastic. The authors should quantify the observed co-localization events across multiple cells and biological replicates and compare them to other protein(s) with a similar cellular distribution pattern.

      Based on Figure 2F the authors suggest that virus entry is unaffected by the inhibition of VAIL in early timepoints. However, according to the figure legend, the timepoint used is 7hpi, while 2D uses 24hpi. Some SARS-CoV-2 papers suggest 7-10 hours is sufficient time to release new virions (Ban-On et al., 2020). As such 7hpi can not necessarily be seen as an early time point. Did the authors test earlier ones? Also, based on this, would it be possible that the effects observed at 24hpi are actually secondary infections, meaning that the virus utilizes pathway components for virion production and a lack thereof reduces infectivity of newly formed virions? In this case it would be interesting to set up an assay that can distinguish between primary and secondary infection to study both individually more closely.

      The authors nicely show in their study an involvement of VAIL in SARS-CoV-2 mediated LC3 lipidation. However, the observed effects are relatively moderate in several experiments, indicating that there may be another contributor to the observed phenotype. It would be nice to highlight this in the discussion and debate potential mechanisms that are causing the observed effects during infection.

      Minor Comments

      The re-localization events shown in Fig 3A should be quantified.

      The co-localization events displayed in Fig 4A should be quantified.

      For Figure 2H-K the authors perform KDs of ATG16L1 and ATG13. While the results for the two specific proteins are certainly convincing, the authors would strengthen their argument by testing additional proteins in the autophagy pathway to support their claim that VAIL but not autophagy affects protein abundance of N (OPTIONAL).

      Referee cross-commenting

      Overall I agree with the comments of my co-reviewers and I think the suggested experiments/comments are sensible. I in part already eluted to it my analysis, but I tend to agree with reviewer 3 on the limited effect VAIL seems to have on LC3b lipidation.

      Significance

      While previous publications have shown interaction between SARS-CoV2 and autophagy, the authors of this manuscript demonstrate that V-ATPase-ATG16L1 induced LC3 lipidation (VAIL) is activated during infection and affects viral replication.

      This study provides an interesting new aspect to host-SARS_CoV-2 interactions.

      The manuscript is of interest for people studying virus-host cell interaction, as well as for researchers in the fields of infectious diseases, specifically SARS-CoV2, and autophagy/VAIL.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Figueras-Novoa et al present a short report demonstrating the induction of LC3 lipidation on single membranes by SARS-CoV-2 through a noncanonical autophagy pathway referred to as VAIL. The authors utilize elegant genetic tools to show that the induction of LC3 lipidation upon viral infection is mainly due to VAIL rather than canonical autophagy. They demonstrate that the activity of the viral E protein that can cause neutralization of acidic vesicles leads to the activation of non-canonical LC3 lipidation on single membranes. Interestingly, the authors also conclude that the impairment of VAIL leads to a reduction of viral load as a result of a defect in later stages of viral infection, although the underlying mechanism was not further explored.

      Overall, this is an elegant and well controlled study that provides a clear conclusion. I only have some minor comments.

      In some experiments, LC3 lipidation does not appear to be fully disrupted upon VAIL inhibition (e.g. Fig.'s 1H, 3D, 4A). As other labs have shown that SARS-CoV2 blocks autophagic flux, this could be further clarified in this manuscript as both VAIL and autophagy may be co-induced upon viral infection. Can the authors test the induction of LC3 lipidation in cells expressing K490 mutant of ATG16L1 in ATG16L1 KO cells to compare them with ATG16L1-ATG13 double knockouts?

      Minor points:

      The difference between Fig. 1F&G is unclear and why the authors are including both analyses. Similarly figures 4G&H.

      The authors should show boxed colocalisation of all images, including negative controls. For examples, the authors have shown boxed magnifications in only the lowest panel in Figure 2A but not the upper two panels. Figures 4E&F should include boxed examples. This serves to clarify both positive and negative colocalisation events.

      Significance

      Overall an elegant and well controlled study demonstrating the induction of non-canonical LC3 conjugation on single membranes (VAIL) during SARS-CoV2 infection. A further exploration of canonical autophagy (as previously published by others) in addition to VAIL would enhance this study.

      This study is of interest to researchers studying autophagy, viruses, immunology, single membrane LC3 lipidation, and lysosomes as well as potentially clinicians treating SARS-CoV2 infecteted individuals.

      This reviewer is experienced in autophagy research.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We appreciate the constructive and supportive feedback on our manuscript. All three reviewers acknowledged the significance and novelty of our work on bacterial telomere protection. In response to their suggestions, we have conducted the requested experiments and revised the manuscript accordingly. These changes have enhanced the rigor of our study and clarified our interpretations and explanations.

      Moreover, we characterized an additional truncation mutant of TelN (TelN Δ445–631), which lacks the two C-terminal domains. Despite this deletion, the mutant retained protection activity (Supplementary Figure S4B), indicating that the remaining regions of the protein are sufficient to confer efficient protection in this assay.

      Finally, we removed three sequence alignments (previously Supplementary Figures S6A and S7), as we recognized that the high degree of sequence divergence could hinder proper alignment and potentially lead to misinterpretation.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study addresses how the bacterial telomere protein TelN protects telomere ends against the action of the Mre11-Rad50 nuclease (MR). This protection is essential for the stability of hairpin-ended linear plasmid and chromosomes in bacteria but had not been explored before. The authors demonstrate that TelN is necessary and sufficient to block MR-dependent DNA cleavage when bound to its specific telomere sequence. By combining elegant genetics and biochemical approaches, it convincingly shows that TelN-dependent inhibition likely involves a specific interaction between TelN and the MR complex. The manuscript is well written, easy to read and focused on the relevant information. The claims and the conclusions are supported by the data. There is no over-interpretation.

      Comments: - Figure 1B, unnormalized transformation efficiency would be useful to show in SI

      The unnormalized B. subtilis transformation efficiency has now been added as new figure panel S1B.

      • Figures 2B, 2C, 3C, 3D, 4C, 5A and 5B: quantification of independent experiments should be added

      While these DNA protection experiments show a clearly reproducible pattern of DNA degradation, the exact response to TelN titration varies somewhat between experimental replicates. We initially included the quantification of remaining full-length DNA because the corresponding band is hard to discern in the gel image due to pixel saturation. However, we realize now that this may mislead readers to think that the degradation occurs always with the exact same dosage response.

      To avoid this, we have decided to remove the quantification and instead show the relevant part of the gel also at higher contrast to better visualize the loss of full-length DNA due to DNA degradation. In addition, we have included replicate experiments carried out at the same MR concentration (125 nM M₂R₂) or at higher concentration (500 nM M₂R₂) in the supplementary material. These examples demonstrate the general reproducibility of the assay.

      **Referee cross-commenting**

      Perfect for me. It seems that there is a consensus.

      Reviewer #1 (Significance (Required)):

      This pioneering study provides a very strong basis for a new understanding of telomeres in bacteria and offers fascinating evolutionary perspectives when compared to similar mechanisms active at telomeres in eukaryotic cells.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The paper is well-presented and well-written throughout. The paper shows convincingly that TelN protects hairpin DNA ends from the activity of SbcCD, presumably providing a protection mechanism for N15 phage DNA in vivo. Furthermore, this protection activity is shown not to require the catalytic (resolvase) activity of TelN, nor its poorly characterised C-terminal domain. The paper also suggests that this inhibition acts both at the level of competition for the DNA hairpin end and at the level of a direct protein:protein interaction between TelN and MR. An (acknowledged) weakness is that there is no real insight into the protein:protein interaction suggested by the experiments shown in Figure 5. Ideally, the protein:protein interaction interface would be identified and mutations in this interface would be shown to reduce hairpin protection.

      Specific comments/questions

      (1) What pathway (in vivo) leads to inactivation of linear hairpin DNA - one suspects that cleavage by SbcCD at the hairpins is probably not the full story. Presumably SbcCD cleavage facilitates further processing by other long range resection systems such as RecBCD, Exo1, RecQ/J etc. Would it be appropriate to view the hairpin as an adaption to protect against these nucleases, which then must be complemented with a mechanism to suppress SbcCD?

      The reviewer's suggestion that hairpin ends represent a first layer of adaptation against nucleolytic processing is compelling. Hairpin structures inherently resist many exonucleases due to their covalently closed nature (absence of free 3’ or 5’ ends) but remain vulnerable to MR processing (Connelly et al, 1998, 1999; Saathoff et al, 2018). This creates a scenario where effective telomere protection requires both the structural barrier provided by the hairpin and an active mechanism to suppress MR activity. We have added this perspective to the relevant paragraph in the discussion.

      (2) Section starting "Direct inhibition of MR by TelN in vitro". What is the word direct supposed to convey here? To me it suggests that the inhibition is via direct interaction of TelN with MR (rather than, for example, a result of competition for the hairpin DNA end) which is not shown here. Suggest either defining or removing the word direct. This point gains more importance considering that differentiating between inhibition mechanisms becomes a focus of later parts of the paper.

      By "direct inhibition," we meant that TelN blocks MR nuclease activity without requiring additional cofactors, as demonstrated in this minimal reaction system containing only TelN, MR complex, DNA substrate, and ATP. To avoid ambiguity, we have reworded the corresponding headline and paragraph.

      (3) Figure 2B - Why no control lane without MR? - this is a basic control to show that he degradation we are seeing in the absence of TelN is MR-dependent. Formally, as shown, the degradation could be caused by the ATP stock.


      We have now included ATP-only control lanes (without MR complex), which show no substrate degradation, confirming that ATP stocks do not contain contaminating nucleases and that the observed degradation is indeed MR-dependent. These controls are included in the supplementary data (Figure S3A) along with additional replicate experiments. Notably, the dose-dependent protection observed at low TelN concentrations (where MR activity is not fully inhibited) provides additional evidence for the specificity of the MR-TelN interaction system, as non-specific nuclease contamination would result in complete substrate degradation regardless of TelN concentration.

      (4) Why not use B. subtilis SbcCD for the species specificity experiment? Also, is it not surprising that TelN yielded zero protection against MRX given that the DNA sequence specificity experiments above suggest competition for DNA substrate is part of the inhibition mechanism?


      We agree that this would be a great addition. We attempted but were unable to purify active B. subtilis SbcCD protein despite multiple attempts. The yeast MRX experiment serves the same purpose of demonstrating species specificity and represents a more evolutionarily distant comparison, which strengthens our conclusions about bacterial-specific inhibition.

      (5) If the authors felt it appropriate, I thought there was scope for further discussion/introductory material. There are strong parallels here with mechanisms used by phage to protect themselves from the activities of RecBCD, which include both proteins that protect DNA ends like T4 gene 2, we well as proteins that bind directly to RecBCD to inactivate it like lambda Gam. As such, the work here will appeal as much to those interested in bacterial defence systems / phage:host interactions as it does to those interested in telomere biology. Especially significant is the inhibition of DNA end processing factors by lambda Gam since this protein is reported to interact with both RecBCD and SbcCD (PMID: 2531105).

      We agree that there are obvious parallels between lambda Gam and TelN as counter-defence factors. This was likely largely missed in previous work because the telomere resolution activity of TelN masked its function in counter-defence. We have added a statement on this matter at the end of the discussion.

      (6) Just a gripe really: it seems to be 'de rigeur' at the moment to re-name bacterial proteins for their human orthologues, presumably to elevate the perceived importance of the work(?), but it is not a practice I think is terribly helpful as it causes issues when searching literature. Minimally it would be great if the authors could ensure they add SbcCD as a keyword for search purposes.

      We appreciate the reviewer's concern about nomenclature inconsistencies in the literature. We have chosen MR over SbcCD as a more generic term that covers eukaryotes, archaea and lately also bacteria and will hopefully contribute to a more consistent terminology in the literature across the domains of life in the future. Our choice to use "Mre11-Rad50" (MR) for the E. coli SbcCD complex is also consistent with prominent recent publications (Käshammer et al., 2019; Gut et al., 2022), explicitly referring to the E. coli system as "Mre11-Rad50" while acknowledging the bacterial designation. To link to previous literature, we made sure that both "SbcCD" and "Mre11-Rad50" are mentioned in the abstract. And, as suggested, we have now also added “SbcCD” to our keyword list to facilitate comprehensive literature searches.

      **Referee cross-commenting**

      I have nothing to add. The reviewers' comments are all broadly positive and consistent.

      Reviewer #2 (Significance (Required):

      This is an excellent paper unveiling a phage encoded "counter-defence" mechanism designed to protect phage DNA from degradation. It will be of special interest to those studying telomere biology of phage:host interactions.



      Reviewer #3

      The authors investigate how the N15 phage protelomerase TelN protects linear chromosomes that terminate in hairpin structures (a sort of telomere). In E. coli and B. subtilis cells, removal or truncation of telN reduces transformation/survival of linear DNA, whereas complementation with full-length or a catalytically inactive TelN restores viability, consistent with TelN playing a nonenzymatic capping function.

      In vitro, TelN binds hairpin substrates with moderate affinity and protects them from the nuclease activity of the Mre11/Rad50 complex. The authors propose that TelN originated as an early, sequence specific barrier against MR mediated DNA end processing, establishing fundamental principles of telomere protection that persist from bacteria to eukaryotes.

      Major comments:

      The manuscript convincingly shows that TelN can functionally block the Mre11Rad50 (MR) nuclease on a hairpin DNA end in a sequence specific manner (suggesting a physical interaction), but it doesn't directly demonstrate this. A simple pull-down or equilibrium binding method would be useful in proving a physical interaction.

      We agree that this would be a valuable addition to the study. We have made several attempts to detect direct interaction by co-immunoprecipitation. However, without success so far. We do not have sufficient material for equilibrium binding methods (yet).__ ____ __


      The MR complex requires ATP hydrolysis for resection of DNA ends. It would be a nice addition to the manuscript if the effect of TelN of Rad50 ATPase activity was tested.


      We have tested the effect of TelN on Rad50 ATPase activity and found no significant impact under our experimental conditions, possible in line with the lack of stable interaction.

      The bar plot on Fig 3B indicates that the experiments are performed in triplicate. The statistical significance of the differences between conditions should be determined. The same general comment could be made regarding the quantification of the polyacrylamide gels - how reproducible are these values?


      We performed paired t-test analysis for the following figures and now indicate the p-values wherever significant (below 0.05): Figures 1D, 1E, 3B, 4B and S4B. We used paired t-tests to generally compare linear vs circular plasmid transformation efficiency for each condition. In Figure 4B, which included two different linear DNA constructs, we compared the two linear DNA constructs directly to each other. [Given that our experimental design included multiple control conditions with known expected outcomes to validate assay performance, rather than many independent exploratory comparisons, we report uncorrected p-values as the primary analysis. The inclusion of multiple controls with predictable outcomes reduces the likelihood of false positive interpretations.]

      As stated in response to reviewer 1, while the exact values for the DNA degradation profile vary somewhat between experiments (likely due to variations in band quantification – see also response to comment below), the general trends are robust as for example indicated by similar experiments performed with higher MR concentration (500 nM instead of 125 nM M₂R₂ concentrations for all TelN variants) demonstrating reproducibility across different conditions. For Figure 5, however, we are unable to provide additional repeat experiments due to limitations in reagent availability. Considering the robust effect seen with Ec MR controls and the presence of multiple samples in the dilution series, we are nevertheless confident about the conclusion.

      Minor comments:

      A better explanation of how the gels were quantified should be provided. Were the products included in the analysis, or was it just the decrease in the substrate band that was measured?

      As also stated above, we have removed the band quantification and instead show the bands also at different contrast settings.

      In our original approach, gel band quantification was performed using ImageQuant TL software (version 8.2.0, GE Healthcare). For each gel, individual lanes were defined using either fixed-width boundaries (95-103 pixels) or automatic edge detection, depending on the gel quality and band definition. Band volumes were calculated using rolling ball background subtraction (radius 180 pixels) with automatic band detection. Substrate degradation was assessed by measuring the integrated density (volume) of the remaining full-length (or near full-length) substrate bands under different treatment conditions. The band volume values were plotted directly to compare substrate levels across treatment groups.

      We now present the data as two gel panels: an exposure showing the full reaction profile, and another exposure focusing on the substrate bands to clearly demonstrate dose-dependent protection. Additional replicate experiments including ATP-only controls (confirming no contamination from ATP stocks) and experiments at 500 nM M₂R₂ concentrations, are provided in the supplementary data. This approach provides more direct visualization of the biological phenomenon with comprehensive control validation.

      I felt like the Results jump rather abruptly from B. subtilis chromosome assays to E. coli plasmid experiments. Maybe the addition of a few linking sentences would improve this transition.


      Upon re-reading the manuscript we agree with this assertion and have added further information to provide a smoother transition.

      A comment on the stoichiometry of TelN and genome ends during phage replication would be useful.

      Our in vitro data suggest that effective protection can be achieved at relatively low TelN:DNA ratios in vitro, consistent with the notion of formation of stable, protective nucleoprotein structures. We unfortunately do not currently have information on the copy number of TelN per cell or per hairpin end. It is not easy to obtain reliable values for these numbers. However, we can speculate that multiple TelN proteins are present due to the presence of three copies of a DNA sequence motif (binding to CTD1) in each telomeric DNA, consistent with the formation of stable, protective nucleoprotein structures.

      Reviewer #3 (Significance (Required)):

      General assessment:

      Strengths: A nice combination of genetics and biochemistry convincingly demonstrates that TelN protects linear chromosomes/replicons from MR-dependent degradation independent of its cleavage-ligase activity. It does this by binding to the hairpin DNA ends in a sequence specific fashion and the species specificity suggests a direct physical interaction, which likely inhibits the nuclease activity of the MR complex

      Limitations: The lack of characterization of the putative physical interaction between TelN and the MR complex is considered a weakness.

      Advance: The manuscript fills in a mechanistic gap between protelomerase-mediated telomere formation and maintenance by demonstrating a protective/capping role. This is the first quantitative analysis of DNA-end protection from MR nuclease activity by TelN.

      Audience: Readers interested in bacterial chromosome biology, DNA repair, the parallels to eukaryotic shelterin will be interesting to the broader telomere and genome stability communities.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors investigate how the N15 phage protelomerase TelN protects linear chromosomes that terminate in hairpin structures (a sort of telomere). In E. coli and B. subtilis cells, removal or truncation of telN reduces transformation/survival of linear DNA, whereas complementation with full‑length or a catalytically inactive TelN restores viability, consistent with TelN playing a non‑enzymatic capping function.

      In vitro, TelN binds hairpin substrates with  moderate affinity and protects them from the nuclease activity of the Mre11/Rad50 complex. The authors propose that TelN originated as an early, sequence‑specific barrier against MR‑mediated DNA end processing, establishing fundamental principles of telomere protection that persist from bacteria to eukaryotes.

      Major comments:

      The manuscript convincingly shows that TelN can functionally block the Mre11‑Rad50 (MR) nuclease on a hair‑pin DNA end in a sequence specific manner (suggesting a physical interaction), but it doesn't directly demonstrate this. A simple pull-down or equilibrium binding method would useful in proving a physical interaction.

      The MR complex requires ATP hydrolysis for resection of DNA ends. It would be a nice addition to the manuscript if the effect of TelN of Rad50 ATPase activity was tested.

      The bar plot on Fig 3B indicates that the experiments are performed in triplicate. The statistical significance of the differences between conditions should be determined. The same general comment could be made regarding the quantification of the polyacrylamide gels - how reproducible are these values?

      Minor comments:

      A better explanation of how the gels were quantified should be provided. Were the products included in the analysis, or was it just the decrease in the substrate band that was measured?

      I felt like the Results jump rather abruptly from B. subtilis chromosome assays to E. coli plasmid experiments. Maybe the addition of a few linking sentences would improve this transition.

      A comment on the stoichiometry of TelN and genome ends during phage replication would be useful.

      Significance

      General assessment:

      Strengths: A nice combination of genetics and biochemistry convincingly demonstrates that TelN protects linear chromosomes/replicons from MR-dependent degradation independent of its cleavage-ligase activity. It does this by binding to the hairpin DNA ends in a sequence specific fashion and the species specificity suggests a direct physical interaction, which likely inhibits the nuclease activity of the MR complex

      Limitations: The lack of characterization of the putative physical interaction between TelN and the MR complex is considered a weakness.

      Advance: The manuscript fills in a mechanistic gap between protelomerase‑mediated telomere formation and maintenance by demonstrating a protective/capping role. This is the first quantitative analysis of DNA-end protection from MR nuclease activity by TelN.

      Audience: Readers interested in bacterial chromosome biology, DNA repair, the parallels to eukaryotic shelterin will be interesting to the broader telomere and genome‑stability communities.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper is well-presented and well-written throughout. The paper shows convincingly that TelN protects hairpin DNA ends from the activity of SbcCD, presumably providing a protection mechanism for N15 phage DNA in vivo. Furthermore, this protection activity is shown not to require the catalytic (resolvase) activity of TelN, nor its poorly characterised C-terminal domain. The paper also suggests that this inhibition acts both at the level of competition for the DNA hairpin end and at the level of a direct protein:protein interaction between TelN and MR. An (acknowledged) weakness is that there is no real insight into the protein:protein interaction suggested by the experiments shown in Figure 5. Ideally, the protein:protein interaction interface would be identified and mutations in this interface would be shown to reduce hairpin protection.

      Specific comments/questions

      (1) What pathway (in vivo) leads to inactivation of linear hairpin DNA - one suspects that cleavage by SbcCD at the hairpins is probably not the full story. Presumably SbcCD cleavage facilitates further processing by other long range resection systems such as RecBCD, Exo1, RecQ/J etc. Would it be appropriate to view the hairpin as an adaption to protect against these nucleases, which then must be complemented with a mechanism to suppress SbcCD?

      (2) Section starting "Direct inhibition of MR by TelN in vitro". What is the word direct supposed to convey here? To me it suggests that the inhibition is via direct interaction of TelN with MR (rather than, for example, a result of competition for the hairpin DNA end) which is not shown here. Suggest either defining or removing the word direct. This point gains more importance considering that differentiating between inhibition mechanisms becomes a focus of later parts of the paper.

      (3) Figure 2B - Why no control lane without MR? - this is a basic control to show that he degradation we are seeing in the absence of TelN is MR-dependent. Formally, as shown, the degradation could be caused by the ATP stock.

      (4) Why not use B. subtilis SbcCD for the species specificity experiment? Also, is it not surprising that TelN yielded zero protection against MRX given that the DNA sequence specificity experiments above suggest competition for DNA substrate is part of the inhibition mechanism?

      (5) If the authors felt it appropriate, I thought there was scope for further discussion/introductory material. There are strong parallels here with mechanisms used by phage to protect themselves from the activities of RecBCD, which include both proteins that protect DNA ends like T4 gene 2, we well as proteins that bind directly to RecBCD to inactivate it like lambda Gam. As such, the work here will appeal as much to those interested in bacterial defence systems / phage:host interactions as it does to those interested in telomere biology. Especially significant is the inhibition of DNA end processing factors by lambda Gam since this protein is reported to interact with both RecBCD and SbcCD (PMID: 2531105).

      (6) Just a gripe really: it seems to be 'de rigeur' at the moment to re-name bacterial proteins for their human orthologues, presumably to elevate the perceived importance of the work(?), but it is not a practice I think is terribly helpful as it causes issues when searching literature. Minimally it would be great if the authors could ensure they add SbcCD as a keyword for search purposes.

      Referee cross-commenting

      I have nothing to add. The reviewers comments are all broadly positive and and consistent.

      Significance

      This is an excellent paper unveiling a phage encoded "counter-defence" mechanism designed to protect phage DNA from degradation. It will be of special interest to those studying telomere biology of phage:host interactions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study addresses how the bacterial telomere protein TelN protect telomere ends against the action of the Mre11-Rad50 nuclease (MR). This protection is essential for the stability of hairpin-ended linear plasmid and chromosomes in bacteria but had not been explored before. The authors demonstrates that TelN is necessary and sufficient to block MR-dependent DNA cleavage when bound to its specific telomere sequence. By combining elegant genetics and biochemical approaches, it convincingly shows that TelN-dependent inhibition likely involves a specific interaction between TelN and the MR complex. The manuscript is well written, easy to read and focused on the relevant information. The claims and the conclusions are supported by the data. There is no over-interpretation.

      Comments:

      • Figure 1B, unnormalized transformation efficiency would be useful to show in SI
      • Figures 2B, 2C, 3C, 3D, 4C, 5A and 5B: quantification of independent experiments should be added

      Referee cross-commenting

      Perfect for me. It seems that there is a consensus.

      Significance

      This pioneering study provides a very strong basis for a new understanding of telomeres in bacteria and offers fascinating evolutionary perspectives when compared to similar mechanisms active at telomeres in eukaryotic cells.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are very grateful for the positive feedback from all three reviewers. Below, we address each point in detail and outline proposed experiments and revision plans, with changes indicated by an underscore.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this paper "Magnesium depletion unleashes two unusual modes of colistin resistance with different fitness costs," the authors examine how Pseudomonas aeruginosa evolves resistance to colistin, a last-resort antibiotic for multidrug-resistant Gram-negative infections. Although colistin resistance is a major clinical challenge, its underlying mechanisms, particularly under nutrient-limited conditions typical of infections, are not fully understood. The study shows that under low magnesium (Mg²_⁺_) conditions-mimicking infection or biofilm stress-P. aeruginosa can develop colistin resistance via two distinct genetic pathways, each with unique fitness costs. The first involves mutations in genes such as htrB2 and lpxO2, granting strong resistance but compromising the outer membrane and increasing susceptibility to other antibiotics. The second involves regulatory mutations (e.g., in the oprH/phoP/phoQ promoter) that confer resistance with minimal membrane defects and generally lower fitness costs. These resistance strategies lead to different trade-offs: membrane-compromising mutations reduce bacterial fitness without colistin, while regulatory mutations typically avoid these penalties, with context-dependent effects. The study underscores clinical relevance, noting that in infections-such as in cystic fibrosis-other microbes like Candida albicans may deplete magnesium, indirectly promoting resistance evolution. Overall, this work offers important insights into antibiotic resistance in nutrient-stressed, polymicrobial environments, highlighting how magnesium availability shapes resistance evolution and fitness costs. The findings suggest new avenues for therapeutic intervention and call for a reevaluation of antibiotic strategies in nutrient-competitive infection settings.

      Work is timely and important. Colistin resistance represents an urgent threat as colistin is a last-resort antibiotic used against multidrug-resistant Gram-negative pathogens. Insights into mechanisms evolving under nutrient limitation are highly relevant given the prevalence of such environmental conditions during infection and microbial biofilm growth. The study reveals two previously uncharacterized pathways to colistin resistance in P. aeruginosa triggered by magnesium (Mg²_⁺_) depletion, each with distinct genetic signatures and trade-offs. This finding directly impacts the understanding of polymicrobial infection dynamics, especially where magnesium sequestration by fungi/ or other microbes may occur. The identification of fitness costs and pleiotropic effects associated with specific resistance mutations provides crucial guidance for clinicians considering antibiotic stewardship and combination therapy strategies.

      __

      We thank the reviewer for their summary of our study and its potential impact.

      __Drawbacks

      • Experimental scope: While the study is comprehensive for P. aeruginosa, the broader applicability to other Gram-negative pathogens is not directly tested.__


      In our revision, we now explicitly point out that the magnesium limitation we have observed broadly applies to Gram-negative bacteria, as we demonstrated in our previous PLOS Biology paper. Therefore, we expect the same themes (and even genes, which are broadly conserved) to apply to Gram-negative bacteria in general. However, a full-fledged experimental study of other Gram-negative pathogens is outside the scope of our current study, which required a 90-day experimental evolution.

      __Strengths

      • Experimental evolution: This work uses laboratory evolution under controlled Mg²_⁺_-limited conditions to simulate selection pressures relevant to infection microenvironments. • Genetics: Systematic identification and functional validation of key mutations-particularly in htrB2, lpxO2, and the oprH/phoP/phoQ promoter-give mechanistic depth to the findings. • Two distinct resistance modes: Evidence for (i) one pathway leading to colistin resistance via htrB2 mutations, resulting in high resistance but significant membrane integrity loss and increased susceptibility to other antibiotics. (ii) a second pathway providing resistance without compromising membrane integrity, highlighting evolutionary flexibility and ecological implications. • Fitness assessments: measurement of the costs associated with each resistance strategy, both in terms of membrane integrity and susceptibility to other agents. • Relevance: Connection to natural scenarios, such as magnesium sequestration by fungi (e.g. Candida albicans) in polymicrobial environments, underscores the ecological and clinical significance. • This manuscript is well written with clearly logical hypothesis testing__


      We thank the reviewer for their appraisal, especially for recognizing the rigor and broader biological implications of our study.

      __Drawbacks

      • Experimental scope: While the study is comprehensive for P. aeruginosa, the broader applicability to other Gram-negative pathogens is not directly tested.__

      We agree with the reviewer's point about broader applicability in other Gram-negative bacteria, as many of the lipid A biosynthesis genes are conserved among diverse bacterial lineages. We will include this point in our revised Discussion to suggest relevance to other Gram-negative bacteria:

      "We previously showed that magnesium sequestration by fungi applies not only to P. aeruginosa but to other Gram-negative bacteria as well (ref). Our current study lays a foundation for developing evolution-guided strategies to combat multidrug-resistant P. aeruginosa and other Gram-negative bacteria that can also acquire colistin resistance. Since many other antibiotic mechanisms are similarly dependent on metal ions (refs), our work suggests that nutritional competition for metal ions may alter initial antibiotic resistance in Gram-negative bacteria and potentiate new evolutionary pathways of antibiotic resistance."

      • __ Mechanistic depth: Some inferred mechanisms (e.g., the precise molecular impact of late-occurring adaptive mutations) merit deeper biochemical analysis.__ We will emphasize in our Revision that the MS data of endpoint clones and triple mutants reveal that their lipid A structures are identical. This suggests that the role of other late-occurring mutations in enhancing resistance is likely through lipid A-independent pathways.

      • __ Results Lines 414- 423: While correlation is most what makes sense for some drugs, causality is implied (membrane defects increase susceptibility), but could be strengthened by directly measuring antibiotic uptake (e.g., fluorescence) or membrane permeability for these 3 antibiotics.__ We thank the reviewer for highlighting the issue of causality. For the three antibiotics tested, the most direct way to measure their effect is by measuring their impact on bacterial growth directly, which is what we have done. Our membrane permeability assay using NpN uptake operates under the same conditions suggested by the reviewer and directly measures molecular uptake. Moreover, only fluorescently labeled vancomycin is commercially available among the three antibiotics tested. Since it binds to the cell wall, its utility to measure membrane defects is more limited than the NpN assay we have already used. However, in response to this comment, we will make clear in our revision that we infer that increased susceptibility to other antibiotics is due to their increased membrane permeability.

      __ o Effect is mild and mostly not significant. It is also not clear whether authors only tested a handful of mutants shown in Fig. 7B-D or whether other clones were also tested. The sample of endpoints (P2, P5, P8) covers well-characterized lineages, but additional evolved clones or a broader panel could boost generality about other antibiotics. The authors note "significantly lower MICs" statistical treatment is implied; explicit statistical values and replicate numbers should be given in the text or figures.__

      We slightly disagree with the reviewer that the results are not significant. Even two-to-three-fold differences in MICs translate to large differences in microbial competition. These three endpoint clones are representative of all eight evolved strains after 90-day evolution experiments. Moreover, we will emphasize in the Revision that we have tested all the mutations found in the endpoint clones; we know what these are from whole genome sequencing of multiple endpoint clones. In addition, we will explicitly state the p-value in the legend of Figure 7.

      • __ The structural or physiological nature of "mild" vs. "severe" membrane defects could be better defined/quantified.__ Although we agree with the reviewer's suggestion, the variability of the SEM assay makes the classification of membrane defects based on cell morphology hard to quantify. We therefore only use the SEM images as representative of the various defects observed. For a more quantitative assay of the membrane defects, we instead rely on the standard NpN uptake assay to quantify membrane permeability as a quantifiable readout for membrane defects.

      • __ Quantitative limits: Authors should add in the discussion that statistical robustness could be strengthened-for example, by including longer-term evolutionary predictions.__ We are not sure what the reviewer means and so cannot address this point completely. We ask the reviewer to rephrase this point, and we will address it to the best of our abilities.

      • __ in vivo relevance: While the ecological context is discussed, direct in vivo confirmation (e.g., in animal infection models) of the observed resistance trajectories would increase translational impact and relevance.__ We agree with the reviewer's point. However, it is not trivial to directly perform evolution experiments of microbes in animal models. There are only a handful of labs worldwide that have working CF-relevant animal models. However, the colistin resistance mutations we identified provide a tool to look deeper into how colistin-resistant P. aeruginosa can evolve in vivo.

      • __ Some sections are repetitive or overly detailed; condense where possible (especially on mutation lists and background for each claim).__

      We will condense our manuscript as the reviewer suggested in our revision. Adding a graphical summary as suggested will also allow us to be more succinct in our description.

      __Other comments

      • Authors should provide clarification on how the Mg²_⁺_ concentrations used in vitro compare to those found in clinically relevant infection settings. This would be helpful to enhance significance.__

      We thank the reviewer for raising this good point. Based on our previous work, we know the Mg2+ levels in our model (0.3-0.45mM) are within the physiological range of Mg2+ in infection settings (0.1-0.8mM). We will highlight this point in the introduction.

      • __ Authors should explicitly report statistical methods (e.g., types of tests, adjustments for multiple comparisons) in figure legends for reproducibility.__

      We will include the details of our statistical tests in each panel of figures both in the main text and the supplement.

      • __ Nomenclature for key mutations and their position within the genetic context (e.g., htrB2 mutation specifics) could be more detailed in figures or supplemental materials.__

      We will name each of the particular mutations tested to be specific about the nature of all the evolved mutations in our figure legends.

      • __ The manuscript could benefit from a graphical summary illustrating the two distinct evolutionary pathways and their respective fitness landscapes.__ We thank the reviewer for this suggestion to enhance the clarity of our work. We will make a new graphical summary highlighting two different evolutionary pathways as a new figure.

      • __ A brief discussion of therapeutic implications-such as combining colistin with agents that target membrane integrity-would help bridge the gap from mechanism to clinical management.__ In our discussion, we have suggested that collateral sensitivity (line 446-453) and PhoPQ kinase inhibitors (line 512-515) could be exploited to combat colistin resistance. To make this point more clearly, we will slightly expand our Discussion to include the therapeutic implications of our study.

      • __ Additional discussion on whether the fitness costs are reversible or can be compensated by further adaptation would be valuable for long-term dynamics.__ We thank the reviewer for raising this interesting point. The evolution trajectory of P8 suggests that fitness costs can be compensated by later-occurring mutations during evolution. We will further discuss this point to highlight the importance of understanding the mutational dynamics of antibiotic resistance evolution.

      • __ It would be valuable for the authors to comment on, or further analyze, whether there is a direct association between specific fitness costs and sensitivity to other antibiotics. Such information could inform on evolutionary constraints and possible trade-offs relevant to clinical settings.__

      We will include a supplemental figure showing the correlation between fitness costs and antibiotic susceptibility for P2, P5, and P8.

      __ Main figures and support for claims

      The main and supplementary figures comprehensively illustrate the evolutionary trajectories, genetic bases, and phenotypic outcomes associated with colistin resistance under magnesium depletion in P. aeruginosa. The figures effectively detail: • Genetic pathways involved including the experimental evolution design (colistin selection under Mg²_⁺_ depletion), whole-genome sequencing results, and timelines of observed mutations (e.g., in htrB2, lpxO2, oprH/phoP/phoQ promoter, PA4824). • Phenotypes and biochemical analyses such as lipid A structure (via mass spectrometry), minimum inhibitory concentration (MIC) assays, and epistasis analyses between mutations are depicted. • Fitness trade-offs are demonstrated using bacterial survival, membrane integrity (e.g., scanning electron microscopy images), membrane permeability assays (NPN uptake), and competitive fitness assays. • Mechanistic claims about the necessity of early mutations, the requirement of the PhoPQ pathway at different evolutionary stages, and the fitness cost imposed by certain resistance mutations. To further enhance the rigor and clarity of the manuscript, the authors should implement the following improvements: • Labelling consistency: In some instances, figure legends could provide more granular detail about specific mutations (e.g., positions of amino acid changes). • Graphical summary: A schematic summary figure that visually integrates the three main evolutionary resistance trajectories, the mutational order, corresponding lipid A changes, and fitness costs, would enhance readability. • Replicates: Plots should more thoroughly indicate the number of replicates and show individual data points (not just means {plus minus} SD), add number of replicates in each experiment. • Supplementary: figures referenced in the text (e.g., lipid A structures or mutation reversion outcomes) should be made more prominent or better cross-referenced from the main results section. Authors should highlight when supplementary data provide critical functional confirmation (e.g., confirming mutation function or fitness reversal).__

      We thank the reviewer for their appreciation of our work and constructive feedback.

      __Statistics

      The authors have appropriately incorporated statistical analyses throughout the figures. To enhance the robustness and credibility of their findings, authors should also cross-check • Tests in legends: Every figure and supplementary figure should clearly state the type of statistical test used, how many biological replicates, and any corrections for multiple comparisons.__

      As mentioned above, we will provide more details about the statistical tests of each panel.

      • __ Effect sizes: Where appropriate, reporting effect sizes-rather than just p values-would contextualize the biological impact.__ We agree with the reviewer; we will mention the magnitude of MIC changes in the corresponding figure legends.

      • __ Raw data accessibility: For full transparency, consider sharing underlying raw data and analysis scripts.

      __ We will provide the raw data of each panel.

      __Overall, the main and supplementary figures effectively illustrate and substantiate the key claims-particularly the alternative molecular pathways, phenotypic trade-offs, and the role of environmental magnesium in mediating colistin resistance. Statistical analysis is generally robust and appropriately presented throughout, though improvements could include more explicit reporting, additional controls, and accessible raw data. The visual and quantitative data in the figures provide support for the authors' conclusions about the evolution of antibiotic resistance under nutrient limitation in microbial environments. Understanding these alternative pathways is important for designing better treatment strategies and for predicting how resistance might evolve under varying clinical and environmental conditions.

      __

      We thank the reviewer for their positive assessment.

      __ Reviewer #1 (Significance (Required)):

      Overall, this work offers important insights into antibiotic resistance in nutrient-stressed, polymicrobial environments, highlighting how magnesium availability shapes resistance evolution and fitness costs. The findings suggest new avenues for therapeutic intervention and call for a reevaluation of antibiotic strategies in nutrient-competitive infection settings.__

      We sincerely thank the reviewer for constructive and thoughtful feedback and the acknowledgement of our figure presentation and experimental design. We feel very encouraged by the reviewer's perspective that our study provides unique insights into resistance evolution in polymicrobial environments and may inform therapeutic strategies.

      __My expertise: Gut microbiome, gut microbiota resilience, ecology, and evolution in microbial communities, antimicrobial resistance, high-throughput drug-bacteria interactions

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The paper by Hsieh and colleagues unravels the molecular basis of colistin resistance in Pseudomonas aeruginosa under low magnesium (Mg2+) conditions. Colistin is a last resort antibiotic that compromises bacterial cell wall integrity. Bacteria can respond (phenotypically and genotypically) to colistin by modifying membrane-anchored lipopolysaccharides. Mg2+ depletion can trigger similar responses. In their study, Hsieh et al. find that Mg2+ depletion (induced by a co-infecting fungal pathogen, Candida albicans) leads to evolutionary trajectories and resistance mechanisms that differ from those observed under Mg-rich conditions. The authors conducted a series of detailed genetic, chemical and fitness-based experiments to elucidate the molecular, physiological and evolutionary basis of these new resistance mechanisms.__


      We thank the reviewer for their summary of our study.__

      Major comments: __ 1. The authors reconstituted key mutations observed during experimental evolution in the ancestral background. Moreover, they took clones from the final stage of the evolution experiment and restored the ancestral state of the mutated genes. This dual approach is extremely strong and allows to decipher the causal effects of colistin resistance. I like to applaud the authors for this rigorous approach.


      We thank the reviewer's appreciation about the rigor and comprehensive analyses of our study.

      2. I understand that this work focusses on evolved mutants isolated from a previous experiment. The focus is on Mg2+ limitation. However, it would still have been nice to include a characterised colistin resistent strain featuring more standard resistance mechanisms. How different would such a strain be in the analyses shown in Fig. 3? Would morphological changes (Fig. 5A), fitness trade-offs (Fig. 6) and collateral sensitivity (Fig. 7) also occur in such a mutant. I do not regard it as imperative to include data from such a strain. But putting the new data into context (at least in the discussion) would clearly increase the overall impact of this work.

      We thank the reviewer for raising this fascinating and vital point. We will address the point in our Revision using the monoculture (high Mg2+) evolved strains, which acquired many known mutations for colistin resistance, as our reference. We will provide a supplemental figure about the membrane permeability, fitness costs, and collateral sensitivity of monoculture evolved strains. We will also contrast their difference from co-culture evolved strains in the revised Discussion.__


      1. I recommend to discuss the findings in the context of the work conducted by Jochumsen et al. 2016 Nature Communications https://doi.org/10.1038/ncomms13002. To me, this is one of the most insightful papers on the genetic basis and epistasis of colistin resistance.__

      We thank the reviewer for pointing out this important reference. We will include this reference and its findings in the Discussion.

      __Minor comments:

      1. First section of results and Fig. 1. It is unclear what parts are repetition from the ref. 37 and what is new. Please clarify.__

      We thank the reviewer for this suggestion. Figures 1A and 1B summarize the previous paper; all other panels are new data. We will make this clear in the revised text and figure legend.

      5. MIC-data (e.g. Fig. 2) come in discrete categories (based on the underlying dilution series). This comes with some challenges for statistical analysis. First, linear models like ANOVAs are based on normally distributed residuals. This is violated with discrete data distributions. Second, there is often no within-treatment variation (e.g., Fig. 2B), which makes statistical analyses obsolete. These points need to be addressed. Moreover, how is it possible to have subtle variations in MIC (e.g., Fig. 2A, P2 endpoint clone) with classic dilution series (as indicated on the y-axis, 128, 256, 512)? Please explain.

      We agree with the reviewer that statistical analysis of MIC data is not straightforward. ANOVAs are not well-suited for this type of discrete data, and the lack of variation within replicates reduces the power of non-parametric tests such as the Mann-Whitney U test. To improve the statistical reporting of MIC data, we will apply non-parametric tests and include effect size measurements, as recommended by Reviewer 1.

      Moreover, the design of dilution series may underestimate the true nature of antibiotic susceptibility. To address these issues, we have also performed survival assays to assess colistin resistance in both the endpoint and reversion strains; we will also include statistics to assess the significance of their different survival frequencies.

      We thank the reviewer for highlighting the point about subtle variations in a classical dilution series. Our endpoint strains grew robustly in media containing 192 μg/mL colistin-the highest concentration used in our evolution experiment. To more accurately determine and compare their maximum MICs, we expanded the colistin concentration range using finer fold increases (1.5×, 2×, 2.5×, 3×, 3.5×, and 4×) from 192 to 768 μg/mL. We will update these details in the Materials & Methods.

      __ Lines 264-269. This analysis focusses on enzyme impairment. However, mutations could also change enzyme activity. Could any of these mutations have such an effect?__

      The answer is "yes". As evolved strains with lpxA mutation still have lipid A, we suspect this mutation does not altogether abolish lipid A synthesis. However, this mutation could affect the amount of lipid A or change enzyme specificity. These are interesting ideas for further investigation, but they fall beyond the scope of our current study. We will, however, include the requested detail in the discussion.

      __ Figure 5A. Some arrows seem to be out of place and point at void spaces. Please check.__

      We thank the reviewer for pointing out this error, which we will correct.

      8. The use of polymyxin B is not well justified (Fig. 5 and Fig. S13). Did the authors aim to test whether there is cross-resistance to other antimicrobial peptides?

      We will more clearly justify our choice of using polymyxin B for directly assaying binding of polymyxin antibiotics to bacterial cells using fluorescence-labeled polymyxins, since no such reagents exist for colistin and since previous studies (including ours) have shown similarity of susceptibility to colistin and polymyxin B:

      "Although P2 and P5 endpoint clones have more permeable membranes, they exhibited greater resistance to polymyxin antibiotics, including colistin (polymyxin E) (Fig. 5D), and polymyxin B (Fig. S13A) than WT cells. To investigate how membrane-compromised cells gain increased resistance to antibiotics that target the outer membrane, we used dansyl-labeled polymyxin B [51] to quantify the binding of polymyxins to P. aeruginosa; dansyl-labeled polymyxin fluoresces upon binding the hydrophobic portion of bacterial membranes. We used polymyxin B binding as a surrogate for how bacterial cells bind to all polymyxin antibiotics, including colistin."

      __ Line 564. Please indicate the dilution factor used.__

      Thank you for pointing out this inadvertent omission. We will update our Materials & Methods accordingly, as in response to the Reviewer 2's comment 5.

      __Reviewer #2 (Significance (Required)):

      This is a very strong and well designed study. It provides novel and relevant insights into the resistance mechanisms against an important last resort antibiotic.__

      We sincerely thank the reviewer for their thoughtful summary and generous evaluation of our work.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript reports on biologically interesting and clinically-relevant findings, that upon passaging in the presence of spent media from C. albicans, P. aeruginosa develops resistance to colistin through lipid A modifications. The authors thoroughly characterize novel lipid A structures seen in their resistant mutants, and test a variety of genetically constructed mutants to determine the contributions of specific mutant alleles to resistance.__

      We thank the reviewer for the appreciation of our experimental design and comprehensive genetic and biochemical analyses of our evolved strains.

      However, additional experiments are needed to demonstrate the specific role and necessity of the lipid modifications for colistin resistance.

      We are also grateful for the reviewer's feedback and constructive criticisms to improve the clarity and impact of our manuscript. We have listed detailed responses to the reviewer below.

      1. __ Evidence that the lipid A mutations are causal for colistin resistance is sparse:
      2. Both the htrB2 mutations (in P2 and P5) are posited to be loss-of-function alleles. However, the phenotypes of the individual alleles are different (shown in Fig 2A and 2B). While the mutation in P2 shows a ~2x increase in resistance, the mutation in P5 does not. Thus it is not clear that the specific lipid A modifications seen in the htrB2 mutants are sufficient to confer colistin resistance. Can the authors test a clean deletion mutant of htrB2? Further, reversion of the htrB2 mutation in P2 has only a mild effect on colistin resistance, while reversion in P5 leads to a ~3-4x reduction in colistin resistance (Fig. S3), once again making it hard to parse out the exact effect of the lipid A modifications seen in the htrB2 mutants.
      3. Similarly, a single lpxO2 mutation does not have any effect on colistin resistance (in P5), indicating that the modifications seen in this mutant are not sufficient to lead to resistance.__ We thank the reviewer for making this suggestion. The reviewer is correct that a clean deletion will directly assess the effects of htrB2 mutations. We will make htrB2 deletion in WT and the triple mutants and endpoint clones of P2 and P5 to check the effect of htrB2 deletion on colistin resistance.

      Additionally, as Reviewer 2 pointed out, both mutation reconstruction and reversion experiments are required for understanding the roles of each mutation and interactions among different mutations in contributing to resistance. Combining all the results of htrB2 and lpxO2 mutations in these two orthogonal genetic experiments, it is the synergistic interactions among these mutations that lead to enhanced resistance after evolution. This explains why we saw genetic background effects of htrB2 mutation (P2 vs P5) and why each single mutation is required for resistance but doesn't contribute to resistance significantly by itself.

      - In P8, the effect of a single lpxA mutation is not tested. Further, the resistance of a P-oprH + lpxA mutant is the same as that of just the P-oprH mutant, indicating that the lpxA mutation likely does not directly alter colistin resistance. It is possible that mutations in lpxA were selected to compensate for fitness defects resulting from the other mutations, or for adaptation to some other component of the media conditions.

      This is an excellent suggestion. We will assess the MIC and fitness of reconstructed strains with the lpxA mutation to update the role of this mutation.

      - While reversion of the htrB2 and lpxO2 mutations do lead to ~3-4x reduced resistance in P5 indicating some contribution of these mutations, it is specific to this population, and thus not clear whether it is due to the specific lipid A modifications (some of which are seen in the other populations too). A specific combination of lipid A modifications may confer colistin resistance, but this needs to be demonstrated by generating just those clean deletion mutants and showing an effect on resistance.


      In response to this comment and comment 1, we will make lpxO2 deletions in WT, the triple mutant and the endpoint clone of P5 to test colistin resistance. However, our results of reverting single htrB2 or lpxO2 mutation to WT are robust and use two independent assays, including the standard MIC test and colistin survival assay. So, we are confident that each mutation is necessary for enhancing colistin resistance.

      __ Overall, given the high levels of colistin resistance still exhibited by single mutant revertants (Fig. S3) and the absence of double or triple revertants, it is hard to come to any conclusions regarding causality. This is especially the case for P8 but also true of P2 and P5. What are the other mutations in these populations, and what role do they play in colistin resistance?__

      We respectfully disagree with the reviewer on this point. One point that we have made and will re-emphasize in our Revision is that we have assayed all the mutations in these populations; this is one of the advantages of our experimental evolution and genome sequencing strategy. All the mutations that could play a role in colistin resistance have therefore been tested. Furthermore, due to genetic epistasis of mutations in different evolutionary lineages, we do not necessarily expect that a single revertant would altogether abolish colistin resistance, as has been demonstrated in several previous studies. As Reviewer 2 pointed out, combining mutation reconstruction and reversion is the best way to establish causality, and we have done so. Therefore, it is not correct to say that we cannot come to 'any conclusions regarding causality'.

      __ Figure 4 is titled "The PhoPQ pathway synergizes with early-arising mutations to confer colistin resistance.", but instead what this figure shows is that the mutation upstream of oprH increases PhoP activity. I'm not sure what the synergy here is. The same is true for the section starting on line 276. Further, the first sentence of that section states "We next investigated why the mutations conferring robust colistin resistance in low Mg2+ conditions are not observed in Mg2+ replete conditions.". However, there are no experiments there testing whether the mutations conferred resistance in Mg2+ conditions, instead the authors just test whether the mutations they are studying increase PhoP activity, and require PhoPQ to confer resistance.__

      We thank the reviewer for raising this point. We apologize for the unclear writing. We will use this opportunity to improve the clarity of this section by rewriting it to focus on two points: 1. Evolved resistance is PhoPQ-dependent, instead of PmrAB-dependent. 2. Two lineages evolved enhanced resistance by boosting PhoPQ activity in both high and low Mg2+ conditions. We will also remove the statement highlighted by the reviewer from this section that obfuscates the motivation of this section. We feel this approach will more clearly show how lipid A-related mutations contribute to resistance in low Mg2+.

      __ The authors claim that the identified mutations did not appear in the high magnesium conditions because they had a fitness cost under those conditions, but figure 6A shows that the evolved strains have fitness costs in low magnesium conditions as well. Further, the authors suggest that because the studied mutations act via increased PhoPQ activity, they do not lead to resistance under high magnesium conditions (lines 376-379). However, the increased PhoPQ activity is mediated by the P-oprH mutation in the isolates which likely increases PhoPQ activity even in high magnesium conditions. Overall, it is not clear why the mutations in the low magnesium condition were not selected for under high magnesium conditions.__

      The reviewer is correct about the fitness cost in high Mg2+ and low Mg2+ conditions. These fitness experiments were carried out in the absence of colistin, which explains the finding that there are fitness defects in both conditions. As is well known, evolution for antibiotic resistance will ultimately select for resistant mutants, despite their fitness costs. In contrast, colistin MIC of these endpoint strains in high Mg2+ conditions was still much lower than the colistin concentration we applied during evolution (Fig. S15), indicating it is much less likely for these mutations to be selected for in high Mg2+. We will clarify this point in our revised Results and Discussion.

      We agree with the reviewer about the P-oprH mutations (PhoPQ expression) and will note that, unlike the other mutations, it is not clear why these emerge only in the low Mg2+ condition.

      __ The authors used C. albicans spent BHI media as their low magnesium condition, but this condition has a lot of other C. albicans metabolites that may be affecting the results. It is possible that what the authors are observing is not related to magnesium at all, and the authors should test the phenotypes in normal BHI medium depleted for magnesium or some defined medium where magnesium levels can be controlled.__

      We thank the reviewer for mentioning this important point. In our prior PLOS Biology paper (https://doi.org/10.1371/journal.pbio.3002694.g005), we demonstrated that supplementing Mg2+ in evolved co-culture populations reduces colistin resistance, suggesting this evolved resistance is Mg2+ dependent. We also know that the MIC of our endpoint strains in C. albicans-spent BHI with supplemented Mg2+ (MIC of all three endpoint clones is less than 48 mg/mL colistin) is much lower than in C. albicans-spent BHI. We will mention this detail in the paper and include the data in our revision if the reviewer and editor require it.

      Other comments: - The authors use MIC assays as well as % survival to measure resistance against colistin, and sometimes use both in the same figure (e.g. Figure 2). This makes direct comparisons difficult. It would be better to consistently use one assay, preferably the MIC, at least in all the main figures. If the survival data needs to be included, it could go in the supplementary figures.

      We thank the reviewer for this suggestion. We will move the MIC data of mutation-reversion strains to the main Fig. 2D-F.

      - While the mutations seen in the low and high magnesium conditions were shown in the previous manuscript, given the extensive dissection here, it would be useful for readers if the authors gave some details about the serial passaging and evolution experiment, identification of mutations, and some mention of what mutations were seen in high Mg populations.

      We will add these details in the introduction.

      - Given that oprH is present in an operon, it would be more accurate to call that mutation as being in the promoter of the oprH-phoP-phoQ operon rather than it being an oprH mutation (at least in the text, e.g. lines 127-129).

      We agree. We will change this as the reviewer requested.

      - Unlike what is stated on lines 287-290, deletion of oprH in P2 leads to a greater than 2x reduction in colistin MIC, suggesting that OprH is playing a role (albeit a smaller role than phoP) - Line 50 has a typo, remove "160". - Line 122: Specify which Pa and Ca strain backgrounds were used. - Line 132: Were representative isolates derived from terminal passages? This should be defined.


      We will change these points according to the reviewer's suggestions; we thank them for these suggestions.

      - Line 215-219: It is interesting that Pa WT grown in spent medium additionally results in lipid A that is hexa-acylated. Is this sufficient to alter colistin resistance on its own?

      We find that WT PAO1 in low Mg2+ conditions has PagP-mediated acylation, which can slightly increase colistin resistance, but not to the extent of resistance as our evolved strains.

      - It would be useful to see a PCA plot for the samples shown in figures S6 and S7.

      We will include such a plot in Figures S6 and S7

      - Fig. S11: What are the colistin MICs of pmrA and phoP deletions in the WT background?

      MIC of pmrA and phoP deletions in WT is 1.5ug/mL. We will include these data in the Revision.

      - Instead of qualitative data, can the authors quantify cell length and perhaps some measure of cell shape (instead of just showing images in Fig. 5A and S12).

      We thank the reviewer for raising this point. A similar comment was raised by Reviewer 1. As it's challenging to quantify membrane changes from the morphological data obtained through SEM (a point which we will now clarify in our Revision), we used a quantifiable NpN uptake assay to quantify membrane defects of our evolved strains.

      - What is the WT MIC in high magnesium conditions? Please show that in Fig. S15.


      We will include this detail in Fig. S15

      - I am not an expert in lipid modifications and structures, but in figure S5, P2 and P4 show high peaks with lower m/z that seem specific to low magnesium conditions, but they are not labeled or discussed. What are these peaks?

      We thank the reviewer for bringing up this concern. The unlabeled lipids in these spectra are cardiolipin, not lipid A. These peaks are present in all the samples, and the reason they appear larger in the P1 and P4 low magnesium conditions is that both spectra are scaled to the relative intensity of one another. It is important to note that MALDI-TOF MS is not a quantitative technique, and the relative intensity of the peak heights between two samples should not be used to compare the amounts of lipids in one sample versus another. Therefore, we cannot say that these lipids are present in greater quantities in low magnesium conditions versus high magnesium conditions.

      - Lines 357-358 state that "mutant cells minimally bind polymyxin B (Fig. S13B)", but the figure shows increased binding compared to the WT. The legend of the figure also says something similar. Are the phoP pmrA mutants expected to bind more polymyxin B because they can't modify lipid A?

      We thank the reviewer for pointing out this substantial error. We will change 'minimally bind' to 'demonstrate increased binding'.

      - Given the fitness defects in just regular medium, is the data shown in Figure 7 specific collateral sensitivity to the antibiotics tested? Are there other conditions where P2 and P5 do not show increased sensitivity?

      These are all the antibiotics we have tested. It is conceivable that P2 and P5 might not show increased sensitivity to other antibiotics that use the same mode of action as colistin or polymyxin B.

      __Reviewer #3 (Significance (Required)):

      This study aims to dissect novel mechanisms of colistin resistance in P. aeruginosa that arise upon passaging in C. albicans spent media. While the authors identify novel lipid A modifications associated with the evolved strains, the significance of the modifications for resistance, and the mechanisms for why these evolutionary trajectories were not selected for in high magnesium are not clear from the data presented.__

      We thank the reviewer for recognizing the integrity of our work and for the constructive feedback on improving the clarity of our writing. We understand that some concerns may stem from a lack of clarity in our original submission, but that additional genetic experiments are necessary. We have already identified all mutations that arose independently across different lineages and characterized their contributions to resistance, which we believe supports a robust inference of causality. To strengthen our conclusions, we will incorporate additional experiments, including htrB2 deletion, lpxO2 deletion, and lpxA mutation, to better dissect the roles of these genes and mutations in colistin resistance. We hope this revision plan will ameliorate the reviewer's concerns.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript reports on biologically interesting and clinically-relevant findings, that upon passaging in the presence of spent media from C. albicans, P. aeruginosa develops resistance to colistin through lipid A modifications. The authors thoroughly characterize novel lipid A structures seen in their resistant mutants, and test a variety of genetically constructed mutants to determine the contributions of specific mutant alleles to resistance. However, additional experiments are needed to demonstrate the specific role and necessity of the lipid modifications for colistin resistance.

      1. Evidence that the lipid A mutations are causal for colistin resistance is sparse:
        • Both the htrB2 mutations (in P2 and P5) are posited to be loss-of-function alleles. However, the phenotypes of the individual alleles are different (shown in Fig 2A and 2B). While the mutation in P2 shows a ~2x increase in resistance, the mutation in P5 does not. Thus it is not clear that the specific lipid A modifications seen in the htrB2 mutants are sufficient to confer colistin resistance. Can the authors test a clean deletion mutant of htrB2? Further, reversion of the htrB2 mutation in P2 has only a mild effect on colistin resistance, while reversion in P5 leads to a ~3-4x reduction in colistin resistance (Fig. S3), once again making it hard to parse out the exact effect of the lipid A modifications seen in the htrB2 mutants.
        • Similarly, a single lpxO2 mutation does not have any effect on colistin resistance (in P5), indicating that the modifications seen in this mutant are not sufficient to lead to resistance.
        • In P8, the effect of a single lpxA mutation is not tested. Further, the resistance of a P-oprH + lpxA mutant is the same as that of just the P-oprH mutant, indicating that the lpxA mutation likely does not directly alter colistin resistance. It is possible that mutations in lpxA were selected to compensate for fitness defects resulting from the other mutations, or for adaptation to some other component of the media conditions.
        • While reversion of the htrB2 and lpxO2 mutations do lead to ~3-4x reduced resistance in P5 indicating some contribution of these mutations, it is specific to this population, and thus not clear whether it is due to the specific lipid A modifications (some of which are seen in the other populations too). It is possible that a specific combination of lipid A modifications confers colistin resistance, but this needs to be demonstrated by generating just those clean deletion mutants and showing an effect on resistance.
      2. Overall, given the high levels of colistin resistance still exhibited by single mutant revertants (Fig. S3) and the absence of double or triple revertants, it is hard to come to any conclusions regarding causality. This is especially the case for P8 but also true of P2 and P5. What are the other mutations in these populations, and what role do they play in colistin resistance?
      3. Figure 4 is titled "The PhoPQ pathway synergizes with early-arising mutations to confer colistin resistance.", but instead what this figure shows is that the mutation upstream of oprH increases PhoP activity. I'm not sure what the synergy here is. The same is true for the section starting on line 276. Further, the first sentence of that section states "We next investigated why the mutations conferring robust colistin resistance in low Mg2+ conditions are not observed in Mg2+ replete conditions.". However, there are no experiments there testing whether the mutations conferred resistance in Mg2+ conditions, instead the authors just test whether the mutations they are studying increase PhoP activity, and require PhoPQ to confer resistance.
      4. The authors claim that the identified mutations did not appear in the high magnesium conditions because they had a fitness cost under those conditions, but figure 6A shows that the evolved strains have fitness costs in low magnesium conditions as well. Further, the authors suggest that because the studied mutations act via increased PhoPQ activity, they do not lead to resistance under high magnesium conditions (lines 376-379). However, the increased PhoPQ activity is mediated by the P-oprH mutation in the isolates which likely increases PhoPQ activity even in high magnesium conditions. Overall, it is not clear why the mutations in the low magnesium condition were not selected for under high magnesium conditions.
      5. The authors used C. albicans spent BHI media as their low magnesium condition, but this condition has a lot of other C. albicans metabolites that may be affecting the results. It is possible that what the authors are observing is not related to magnesium at all, and the authors should test the phenotypes in normal BHI medium depleted for magnesium or some defined medium where magnesium levels can be controlled.

      Other comments:

      • The authors use MIC assays as well as % survival to measure resistance against colistin, and sometimes use both in the same figure (e.g. Figure 2). This makes direct comparisons difficult. It would be better to consistently use one assay, preferably the MIC, at least in all the main figures. If the survival data needs to be included, it could go in the supplementary figures.
      • While the mutations seen in the low and high magnesium conditions were shown in the previous manuscript, given the extensive dissection here, it would be useful for readers if the authors gave some details about the serial passaging and evolution experiment, identification of mutations, and some mention of what mutations were seen in high Mg populations.
      • Given that oprH is present in an operon, it would be more accurate to call that mutation as being in the promoter of the oprH-phoP-phoQ operon rather than it being an oprH mutation (at least in the text, e.g. lines 127-129).
      • Unlike what is stated on lines 287-290, deletion of oprH in P2 leads to a greater than 2x reduction in colistin MIC, suggesting that OprH is playing a role (albeit a smaller role than phoP)
      • Line 50 has a typo, remove "160".
      • Line 122: Specify which Pa and Ca strain backgrounds were used.
      • Line 132: Were representative isolates derived from terminal passages? This should be defined.
      • Line 215-219: It is interesting that Pa WT grown in spent medium additionally results in lipid A that is hexa-acylated. Is this sufficient to alter colistin resistance on its own?
      • It would be useful to see a PCA plot for the samples shown in figures S6 and S7.
      • Fig. S11: What are the colistin MICs of pmrA and phoP deletions in the WT background?
      • Instead of qualitative data, can the authors quantify cell length and perhaps some measure of cell shape (instead of just showing images in Fig. 5A and S12).
      • What is the WT MIC in high magnesium conditions? Please show that in Fig. S15.
      • I am not an expert in lipid modifications and structures, but in figure S5, P2 and P4 show high peaks with lower m/z that seem specific to low magnesium conditions, but they are not labeled or discussed. What are these peaks?
      • Lines 357-358 state that "mutant cells minimally bind polymyxin B (Fig. S13B)", but the figure shows increased binding compared to the WT. The legend of the figure also says something similar. Are the phoP pmrA mutants expected to bind more polymyxin B because they can't modify lipid A?
      • Given the fitness defects in just regular medium, is the data shown in Figure 7 specific collateral sensitivity to the antibiotics tested? Are there other conditions where P2 and P5 do not show increased sensitivity?

      Significance

      This study aims to dissect novel mechanisms of colistin resistance in P. aeruginosa that arise upon passaging in C. albicans spent media. While the authors identify novel lipid A modifications associated with the evolved strains, the significance of the modifications for resistance, and the mechanisms for why these evolutionary trajectories were not selected for in high magnesium are not clear from the data presented.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The paper by Hsieh and colleagues unravels the molecular basis of colistin resistance in Pseudomonas aeruginosa under low magnesium (Mg2+) conditions. Colistin is a last resort antibiotic that compromises bacterial cell wall integrity. Bacteria can respond (phenotypically and genotypically) to colistin by modifying membrane-anchored lipopolysaccharides. Mg2+ depletion can trigger similar responses. In their study, Hsieh et al. find that Mg2+ depletion (induced by a co-infecting fungal pathogen, Candida albicans) leads to evolutionary trajectories and resistance mechanisms that differ from those observed under Mg-rich conditions. The authors conducted a series of detailed genetic, chemical and fitness-based experiments to elucidate the molecular, physiological and evolutionary basis of these new resistance mechanisms.

      Major comments:

      1. The authors reconstituted key mutations observed during experimental evolution in the ancestral background. Moreover, they took clones from the final stage of the evolution experiment and restored the ancestral state of the mutated genes. This dual approach is extremely strong and allows to decipher the causal effects of colistin resistance. I like to applaud the authors for this rigorous approach.
      2. I understand that this work focusses on evolved mutants isolated from a previous experiment. The focus is on Mg2+ limitation. However, it would still have been nice to include a characterised colistin resistent strain featuring more standard resistance mechanisms. How different would such a strain be in the analyses shown in Fig. 3? Would morphological changes (Fig. 5A), fitness trade-offs (Fig. 6) and collateral sensitivity (Fig. 7) also occur in such a mutant. I do not regard it as imperative to include data from such a strain. But putting the new data into context (at least in the discussion) would clearly increase the overall impact of this work.
      3. I recommend to discuss the findings in the context of the work conducted by Jochumsen et al. 2016 Nature Communications https://doi.org/10.1038/ncomms13002. To me, this is one of the most insightful papers on the genetic basis and epistasis of colistin resistance.

      Minor comments:

      1. First section of results and Fig. 1. It is unclear what parts are repetition from the ref. 37 and what is new. Please clarify.
      2. MIC-data (e.g. Fig. 2) come in discrete categories (based on the underlying dilution series). This comes with some challenges for statistical analysis. First, linear models like ANOVAs are based on normally distributed residuals. This is violated with discrete data distributions. Second, there is often no within-treatment variation (e.g., Fig. 2B), which makes statistical analyses obsolete. These points need to be addressed. Moreover, how is it possible to have subtle variations in MIC (e.g., Fig. 2A, P2 endpoint clone) with classic dilution series (as indicated on the y-axis, 128, 256, 512)? Please explain.
      3. Lines 264-269. This analysis focusses on enzyme impairment. However, mutations could also change enzyme activity. Could any of these mutations have such an effect?
      4. Figure 5A. Some arrows seem to be out of place and point at void spaces. Please check.
      5. The use of polymyxin B is not well justified (Fig. 5 and Fig. S13). Did the authors aim to test whether there is cross-resistance to other antimicrobial peptides?
      6. Line 564. Please indicate the dilution factor used.

      Significance

      This is a very strong and well designed study. It provides novel and relevant insights into the resistance mechanisms against an important last resort antibiotic.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper "Magnesium depletion unleashes two unusual modes of colistin resistance with different fitness costs," the authors examine how Pseudomonas aeruginosa evolves resistance to colistin, a last-resort antibiotic for multidrug-resistant Gram-negative infections. Although colistin resistance is a major clinical challenge, its underlying mechanisms, particularly under nutrient-limited conditions typical of infections, are not fully understood.

      The study shows that under low magnesium (Mg²⁺) conditions-mimicking infection or biofilm stress-P. aeruginosa can develop colistin resistance via two distinct genetic pathways, each with unique fitness costs. The first involves mutations in genes such as htrB2 and lpxO2, granting strong resistance but compromising the outer membrane and increasing susceptibility to other antibiotics. The second involves regulatory mutations (e.g., in the oprH/phoP/phoQ promoter) that confer resistance with minimal membrane defects and generally lower fitness costs. These resistance strategies lead to different trade-offs: membrane-compromising mutations reduce bacterial fitness without colistin, while regulatory mutations typically avoid these penalties, with context-dependent effects. The study underscores clinical relevance, noting that in infections-such as in cystic fibrosis-other microbes like Candida albicans may deplete magnesium, indirectly promoting resistance evolution. Overall, this work offers important insights into antibiotic resistance in nutrient-stressed, polymicrobial environments, highlighting how magnesium availability shapes resistance evolution and fitness costs. The findings suggest new avenues for therapeutic intervention and call for a reevaluation of antibiotic strategies in nutrient-competitive infection settings.

      Work is timely and important. Colistin resistance represents an urgent threat as colistin is a last-resort antibiotic used against multidrug-resistant Gram-negative pathogens. Insights into mechanisms evolving under nutrient limitation are highly relevant given the prevalence of such environmental conditions during infection and microbial biofilm growth. The study reveals two previously uncharacterized pathways to colistin resistance in P. aeruginosa triggered by magnesium (Mg²⁺) depletion, each with distinct genetic signatures and trade-offs. This finding directly impacts the understanding of polymicrobial infection dynamics, especially where magnesium sequestration by fungi/ or other microbes may occur. The identification of fitness costs and pleiotropic effects associated with specific resistance mutations provides crucial guidance for clinicians considering antibiotic stewardship and combination therapy strategies.

      Strengths

      • Experimental evolution: This work uses laboratory evolution under controlled Mg²⁺-limited conditions to simulate selection pressures relevant to infection microenvironments.
      • Genetics: Systematic identification and functional validation of key mutations-particularly in htrB2, lpxO2, and the oprH/phoP/phoQ promoter-give mechanistic depth to the findings.
      • Two distinct resistance modes: Evidence for (i) one pathway leading to colistin resistance via htrB2 mutations, resulting in high resistance but significant membrane integrity loss and increased susceptibility to other antibiotics. (ii) a second pathway providing resistance without compromising membrane integrity, highlighting evolutionary flexibility and ecological implications.
      • Fitness assessments: measurement of the costs associated with each resistance strategy, both in terms of membrane integrity and susceptibility to other agents.
      • Relevance: Connection to natural scenarios, such as magnesium sequestration by fungi (e.g. Candida albicans) in polymicrobial environments, underscores the ecological and clinical significance.
      • This manuscript is well written with clearly logical hypothesis testing

      Drawbacks

      • Experimental scope: While the study is comprehensive for P. aeruginosa, the broader applicability to other Gram-negative pathogens is not directly tested.
      • Mechanistic depth: Some inferred mechanisms (e.g., the precise molecular impact of late-occurring adaptive mutations) merit deeper biochemical analysis.
      • Results Lines 414- 423: While correlation is most what makes sense for some drugs, causality is implied (membrane defects increase susceptibility), but could be strengthened by directly measuring antibiotic uptake (e.g., fluorescence) or membrane permeability for these 3 antibiotics.
        • Effect is mild and mostly not significant. It is also not clear whether authors only tested a handful of mutants shown in Fig. 7B-D or whether other clones were also tested. The sample of endpoints (P2, P5, P8) covers well-characterized lineages, but additional evolved clones or a broader panel could boost generality about other antibiotics. The authors note "significantly lower MICs" statistical treatment is implied; explicit statistical values and replicate numbers should be given in the text or figures.
      • The structural or physiological nature of "mild" vs. "severe" membrane defects could be better defined/quantified.
      • Quantitative limits: Authors should add in the discussion that statistical robustness could be strengthened-for example, by including longer-term evolutionary predictions.
      • in vivo relevance: While the ecological context is discussed, direct in vivo confirmation (e.g., in animal infection models) of the observed resistance trajectories would increase translational impact and relevance.
      • Some sections are repetitive or overly detailed; condense where possible (especially on mutation lists and background for each claim).

      Other comments

      • Authors should provide clarification on how the Mg²⁺ concentrations used in vitro compare to those found in clinically relevant infection settings. This would be helpfu to enhance significance.
      • Authors should explicitly report statistical methods (e.g., types of tests, adjustments for multiple comparisons) in figure legends for reproducibility.
      • Nomenclature for key mutations and their position within the genetic context (e.g., htrB2 mutation specifics) could be more detailed in figures or supplemental materials.
      • The manuscript could benefit from a graphical summary illustrating the two distinct evolutionary pathways and their respective fitness landscapes.
      • A brief discussion of therapeutic implications-such as combining colistin with agents that target membrane integrity-would help bridge the gap from mechanism to clinical management.
      • Additional discussion on whether the fitness costs are reversible or can be compensated by further adaptation would be valuable for long-term dynamics.
      • It would be valuable for the authors to comment on, or further analyze, whether there is a direct association between specific fitness costs and sensitivity to other antibiotics. Such information could inform on evolutionary constraints and possible trade-offs relevant to clinical settings.

      Main figures and support for claims

      The main and supplementary figures comprehensively illustrate the evolutionary trajectories, genetic bases, and phenotypic outcomes associated with colistin resistance under magnesium depletion in P. aeruginosa. The figures effectively detail: - Genetic pathways involved including the experimental evolution design (colistin selection under Mg²⁺ depletion), whole-genome sequencing results, and timelines of observed mutations (e.g., in htrB2, lpxO2, oprH/phoP/phoQ promoter, PA4824). - Phenotypes and biochemical analyses such as lipid A structure (via mass spectrometry), minimum inhibitory concentration (MIC) assays, and epistasis analyses between mutations are depicted. - Fitness trade-offs are demonstrated using bacterial survival, membrane integrity (e.g., scanning electron microscopy images), membrane permeability assays (NPN uptake), and competitive fitness assays. - Mechanistic claims about the necessity of early mutations, the requirement of the PhoPQ pathway at different evolutionary stages, and the fitness cost imposed by certain resistance mutations. To further enhance the rigor and clarity of the manuscript, the authors should implement the following improvements: - Labelling consistency: In some instances, figure legends could provide more granular detail about specific mutations (e.g., positions of amino acid changes). - Graphical summary: A schematic summary figure that visually integrates the three main evolutionary resistance trajectories, the mutational order, corresponding lipid A changes, and fitness costs, would enhance readability. - Replicates: Plots should more thoroughly indicate the number of replicates and show individual data points (not just means {plus minus} SD), add number of replicates in each experiment. - Supplementary: figures referenced in the text (e.g., lipid A structures or mutation reversion outcomes) should be made more prominent or better cross-referenced from the main results section. Authors should highlight when supplementary data provide critical functional confirmation (e.g., confirming mutation function or fitness reversal).

      Statistics

      The authors have appropriately incorporated statistical analyses throughout the figures. To enhance the robustness and credibility of their findings, authors should also cross-check - Tests in legends: Every figure and supplementary figure should clearly state the type of statistical test used, how many biological replicates, and any corrections for multiple comparisons. - Effect sizes: Where appropriate, reporting effect sizes-rather than just p values-would contextualize the biological impact. - Raw data accessibility: For full transparency, consider sharing underlying raw data and analysis scripts.

      Overall, the main and supplementary figures effectively illustrate and substantiate the key claims-particularly the alternative molecular pathways, phenotypic trade-offs, and the role of environmental magnesium in mediating colistin resistance. Statistical analysis is generally robust and appropriately presented throughout, though improvements could include more explicit reporting, additional controls, and accessible raw data. The visual and quantitative data in the figures provide support for the authors' conclusions about the evolution of antibiotic resistance under nutrient limitation in microbial environments. Understanding these alternative pathways is important for designing better treatment strategies and for predicting how resistance might evolve under varying clinical and environmental conditions.

      Significance

      Overall, this work offers important insights into antibiotic resistance in nutrient-stressed, polymicrobial environments, highlighting how magnesium availability shapes resistance evolution and fitness costs. The findings suggest new avenues for therapeutic intervention and call for a reevaluation of antibiotic strategies in nutrient-competitive infection settings.

      My expertise:

      Gut microbiome, gut microbiota resilience, ecology, and evolution in microbial communities, antimicrobial resistance, high-throughput drug-bacteria interactions

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      *The authors have a longstanding focus and reputation on single cell sequencing technology development and application. In this current study, the authors developed a novel single-cell multi-omic assay termed "T-ChIC" so that to jointly profile the histone modifications along with the full-length transcriptome from the same single cells, analyzed the dynamic relationship between chromatin state and gene expression during zebrafish development and cell fate determination. In general, the assay works well, the data look convincing and conclusions are beneficial to the community. *

      Thank you for your positive feedback.

      *There are several single-cell methodologies all claim to co-profile chromatin modifications and gene expression from the same individual cell, such as CoTECH, Paired-tag and others. Although T-ChIC employs pA-Mnase and IVT to obtain these modalities from single cells which are different, could the author provide some direct comparisons among all these technologies to see whether T-ChIC outperforms? *

      In a separate technical manuscript describing the application of T-ChIC in mouse cells (Zeller, Blotenburg et al 2024, bioRxiv, 2024.05. 09.593364), we have provided a direct comparison of data quality between T-ChIC and other single-cell methods for chromatin-RNA co-profiling (Please refer to Fig. 1C,D and Fig. S1D, E, of the preprint). We show that compared to other methods, T-ChIC is able to better preserve the expected biological relationship between the histone modifications and gene expression in single cells.

      *In current study, T-ChIC profiled H3K27me3 and H3K4me1 modifications, these data look great. How about other histone modifications (eg H3K9me3 and H3K36me3) and transcription factors? *

      While we haven't profiled these other modifications using T-ChIC in Zebrafish, we have previously published high quality data on these histone modifications using the sortChIC method, on which T-ChIC is based (Zeller, Yeung et al 2023). In our comparison, we find that histone modification profiles between T-ChIC and sortChIC are very similar (Fig. S1C in Zeller, Blotenburg et al 2024). Therefore the method is expected to work as well for the other histone marks.

      *T-ChIC can detect full length transcription from the same single cells, but in FigS3, the authors still used other published single cell transcriptomics to annotate the cell types, this seems unnecessary? *

      We used the published scRNA-seq dataset with a larger number of cells to homogenize our cell type labels with these datasets, but we also cross-referenced our cluster-specific marker genes with ZFIN and homogenized the cell type labels with ZFIN ontology. This way our annotation is in line with previous datasets but not biased by it. Due the relatively smaller size of our data, we didn't expect to identify unique, rare cell types, but our full-length total RNA assay helps us identify non-coding RNAs such as miRNA previously undetected in scRNA assays, which we have now highlighted in new figure S1c .

      *Throughout the manuscript, the authors found some interesting dynamics between chromatin state and gene expression during embryogenesis, independent approaches should be used to validate these findings, such as IHC staining or RNA ISH? *

      We appreciate that the ISH staining could be useful to validate the expression pattern of genes identified in this study. But to validate the relationships between the histone marks and gene expression, we need to combine these stainings with functional genomics experiments, such as PRC2-related knockouts. Due to their complexity, such experiments are beyond the scope of this manuscript (see also reply to reviewer #3, comment #4 for details).

      *In Fig2 and FigS4, the authors showed H3K27me3 cis spreading during development, this looks really interesting. Is this zebrafish specific? H3K27me3 ChIP-seq or CutTag data from mouse and/or human embryos should be reanalyzed and used to compare. The authors could speculate some possible mechanisms to explain this spreading pattern? *

      Thanks for the suggestion. In this revision, we have reanalysed a dataset of mouse ChIP-seq of H3K27me3 during mouse embryonic development by Xiang et al (Nature Genetics 2019) and find similar evidence of spreading of H3K27me3 signal from their pre-marked promoter regions at E5.5 epiblast upon differentiation (new Figure S4i). This observation, combined with the fact that the mechanism of pre-marking of promoters by PRC1-PRC2 interaction seems to be conserved between the two species (see (Hickey et al., 2022), (Mei et al., 2021) & (Chen et al., 2021)), suggests that the dynamics of H3K27me3 pattern establishment is conserved across vertebrates. But we think a high-resolution profiling via a method like T-ChIC would be more useful to demonstrate the dynamics of signal spreading during mouse embryonic development in the future. We have discussed this further in our revised manuscript.

      Reviewer #1 (Significance (Required)):

      *The authors have a longstanding focus and reputation on single cell sequencing technology development and application. In this current study, the authors developed a novel single-cell multi-omic assay termed "T-ChIC" so that to jointly profile the histone modifications along with the full-length transcriptome from the same single cells, analyzed the dynamic relationship between chromatin state and gene expression during zebrafish development and cell fate determination. In general, the assay works well, the data look convincing and conclusions are beneficial to the community. *

      Thank you very much for your supportive remarks.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *Joint analysis of multiple modalities in single cells will provide a comprehensive view of cell fate states. In this manuscript, Bhardwaj et al developed a single-cell multi-omics assay, T-ChIC, to simultaneously capture histone modifications and full-length transcriptome and applied the method on early embryos of zebrafish. The authors observed a decoupled relationship between the chromatin modifications and gene expression at early developmental stages. The correlation becomes stronger as development proceeds, as genes are silenced by the cis-spreading of the repressive marker H3k27me3. Overall, the work is well performed, and the results are meaningful and interesting to readers in the epigenomic and embryonic development fields. There are some concerns before the manuscript is considered for publication. *

      We thank the reviewer for appreciating the quality of our study.

      *Major concerns: *

        • A major point of this study is to understand embryo development, especially gastrulation, with the power of scMulti-Omics assay. However, the current analysis didn't focus on deciphering the biology of gastrulation, i.e., lineage-specific pioneer factors that help to reform the chromatin landscape. The majority of the data analysis is based on the temporal dimension, but not the cell-type-specific dimension, which reduces the value of the single-cell assay. *

      We focused on the lineage-specific transcription factor activity during gastrulation in Figure 4 and S8 of the manuscript and discovered several interesting regulators active at this stage. During our analysis of the temporal dimension for the rest of the manuscript, we also classified the cells by their germ layer and "latent" developmental time by taking the full advantage of the single-cell nature of our data. Additionally, we have now added the cell-type-specific H3K27-demethylation results for 24hpf in response to your comment below. We hope that these results, together with our openly available dataset would demonstrate the advantage of the single-cell aspect of our dataset.

      1. *The cis-spreading of H3K27me3 with developmental time is interesting. Considering H3k27me3 could mark bivalent regions, especially in pluripotent cells, there must be some regions that have lost H3k27me3 signals during development. Therefore, it's confusing that the authors didn't find these regions (30% spreading, 70% stable). The authors should explain and discuss this issue. *

      Indeed we see that ~30% of the bins enriched in the pluripotent stage spread, while 70% do not seem to spread. In line with earlier observations(Hickey et al., 2022; Vastenhouw et al., 2010), we find that H3K27me3 is almost absent in the zygote and is still being accumulated until 24hpf and beyond. Therefore the majority of the sites in the genome still seem to be in the process of gaining H3K27me3 until 24hpf, explaining why we see mostly "spreading" and "stable" states. Considering most of these sites are at promoters and show signs of bivalency, we think that these sites are marked for activation or silencing at later stages. We have discussed this in the manuscript ("discussion"). However, in response to this and earlier comment, we went back and searched for genes that show H3K27-demethylation in the most mature cell types (at 24 hpf) in our data, and found a subset of genes that show K27 demethylation after acquiring them earlier. Interestingly, most of the top genes in this list are well-known as developmentally important for their corresponding cell types. We have added this new result and discussed it further in the manuscript (Fig. 2d,e, , Supplementary table 3).

      *Minors: *

        • The authors cited two scMulti-omics studies in the introduction, but there have been lots of single-cell multi-omics studies published recently. The authors should cite and consider them. *

      We have cited more single-cell chromatin and multiome studies focussed on early embryogenesis in the introduction now.

      *2. T-ChIC seems to have been presented in a previous paper (ref 15). Therefore, Fig. 1a is unnecessary to show. *

      Figure 1a. shows a summary of our Zebrafish TChIC workflow, which contains the unique sample multiplexing and sorting strategy to reduce batch effects, which was not applied in the original TChIC workflow. We have now clarified this in "Results".

      1. *It's better to show the percentage of cell numbers (30% vs 70%) for each heatmap in Figure 2C. *

      We have added the numbers to the corresponding legends.

      1. *Please double-check the citation of Fig. S4C, which may not relate to the conclusion of signal differences between lineages. *

      The citation seems to be correct (Fig. S4C supplements Fig. 2C, but shows mesodermal lineage cells) but the description of the legend was a bit misleading. We have clarified this now.

      *5. Figure 4C has not been cited or mentioned in the main text. Please check. *

      Thanks for pointing it out. We have cited it in Results now.

      Reviewer #2 (Significance (Required)):

      *Strengths: This work utilized a new single-cell multi-omics method and generated abundant epigenomics and transcriptomics datasets for cells covering multiple key developmental stages of zebrafish. *

      *Limitations: The data analysis was superficial and mainly focused on the correspondence between the two modalities. The discussion of developmental biology was limited. *

      *Advance: The zebrafish single-cell datasets are valuable. The T-ChIC method is new and interesting. *

      *The audience will be specialized and from basic research fields, such as developmental biology, epigenomics, bioinformatics, etc. *

      *I'm more specialized in the direction of single-cell epigenomics, gene regulation, 3D genomics, etc. *

      Thank you for your remarks.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *This manuscript introduces T‑ChIC, a single‑cell multi‑omics workflow that jointly profiles full‑length transcripts and histone modifications (H3K27me3 and H3K4me1) and applies it to early zebrafish embryos (4-24 hpf). The study convincingly demonstrates that chromatin-transcription coupling strengthens during gastrulation and somitogenesis, that promoter‑anchored H3K27me3 spreads in cis to enforce developmental gene silencing, and that integrating TF chromatin status with expression can predict lineage‑specific activators and repressors. *

      *Major concerns *

      1. *Independent biological replicates are absent, so the authors should process at least one additional clutch of embryos for key stages (e.g., 6 hpf and 12 hpf) with T‑ChIC and demonstrate that the resulting data match the current dataset. *

      Thanks for pointing this out. We had, in fact, performed T-ChIC experiments in four rounds of biological replicates (independent clutch of embryos) and merged the data to create our resource. Although not all timepoints were profiled in each replicate, two timepoints (10 and 24hpf) are present in all four, and the celltype composition of these replicates from these 2 timepoints are very similar. We have added new plots in figure S2f and added (new) supplementary table (#1) to highlight the presence of biological replicates.

      2. *The TF‑activity regression model uses an arbitrary R² {greater than or equal to} 0.6 threshold; cross‑validated R² distributions, permutation‑based FDR control, and effect‑size confidence intervals are needed to justify this cut‑off. *

      Thank you for this suggestion. We did use 10-fold cross validation during training and obtained the R2 values of TF motifs from the independent test set as an unbiased estimate. However, the cutoff of R2 > 0.6 to select the TFs for classification was indeed arbitrary. In the revised version, we now report the FDR-adjusted p-values for these R2 estimates based on permutation tests, and select TFs with a cutoff of padj supplementary table #4 to include the p-values for all tested TFs. However, we see that our arbitrary cutoff of 0.6 was in fact, too stringent, and we can classify many more TFs based on the FDR cutoffs. We also updated our reported numbers in Fig. 4c to reflect this. Moreover, supplementary table #4 contains the complete list of TFs used in the analysis to allow others to choose their own cutoff.

      3. *Predicted TF functions lack empirical support, making it essential to test representative activators (e.g., Tbx16) and repressors (e.g., Zbtb16a) via CRISPRi or morpholino knock‑down and to measure target‑gene expression and H3K4me1 changes. *

      We agree that independent validation of the functions of our predicted TFs on target gene activity would be important. During this revision, we analysed recently published scRNA-seq data of Saunders et al. (2023) (Saunders et al., 2023), which includes CRISPR-mediated F0 knockouts of a couple of our predicted TFs, but the scRNAseq was performed at later stages (24hpf onward) compared to our H3K4me1 analysis (which was 4-12 hpf). Therefore, we saw off-target genes being affected in lineages where these TFs are clearly not expressed (attached Fig 1). We therefore didn't include these results in the manuscript. In future, we aim to systematically test the TFs predicted in our study with CRISPRi or similar experiments.

      4. *The study does not prove that H3K27me3 spreading causes silencing; embryos treated with an Ezh2 inhibitor or prc2 mutants should be re‑profiled by T‑ChIC to show loss of spreading along with gene re‑expression. *

      We appreciate the suggestion that indeed PRC2-disruption followed by T-ChIC or other forms of validation would be needed to confirm whether the H3K27me3 spreading is indeed causally linked to the silencing of the identified target genes. But performing this validation is complicated because of multiple reasons: 1) due to the EZH2 contribution from maternal RNA and the contradicting effects of various EZH2 zygotic mutations (depending on where the mutation occurs), the only properly validated PRC2-related mutant seems to be the maternal-zygotic mutant MZezh2, which requires germ cell transplantation (see Rougeot et al. 2019 (Rougeot et al., 2019)) , and San et al. 2019 (San et al., 2019) for details). The use of inhibitors have been described in other studies (den Broeder et al., 2020; Huang et al., 2021), but they do not show a validation of the H3K27me3 loss or a similar phenotype as the MZezh2 mutants, and can present unwanted side effects and toxicity at a high dose, affecting gene expression results. Moreover, in an attempt to validate, we performed our own trials with the EZH2 inhibitor (GSK123) and saw that this time window might be too short to see the effect within 24hpf (attached Fig. 2). Therefore, this validation is a more complex endeavor beyond the scope of this study. Nevertheless, our further analysis of H3K27me3 de-methylation on developmentally important genes (new Fig. 2e-f, Sup. table 3) adds more confidence that the polycomb repression plays an important role, and provides enough ground for future follow up studies.

      *Minor concerns *

      1. *Repressive chromatin coverage is limited, so profiling an additional silencing mark such as H3K9me3 or DNA methylation would clarify cooperation with H3K27me3 during development. *

      We agree that H3K27me3 alone would not be sufficient to fully understand the repressive chromatin state. Extension to other chromatin marks and DNA methylation would be the focus of our follow up works.

      *2. Computational transparency is incomplete; a supplementary table listing all trimming, mapping, and peak‑calling parameters (cutadapt, STAR/hisat2, MACS2, histoneHMM, etc.) should be provided. *

      As mentioned in the manuscript, we provide an open-source pre-processing pipeline "scChICflow" to perform all these steps (github.com/bhardwaj-lab/scChICflow). We have now also provided the configuration files on our zenodo repository (see below), which can simply be plugged into this pipeline together with the fastq files from GEO to obtain the processed dataset that we describe in the manuscript. Additionally, we have also clarified the peak calling and post-processing steps in the manuscript now.

      *3. Data‑ and code‑availability statements lack detail; the exact GEO accession release date, loom‑file contents, and a DOI‑tagged Zenodo archive of analysis scripts should be added. *

      We have now publicly released the .h5ad files with raw counts, normalized counts, and complete gene and cell-level metadata, along with signal tracks (bigwigs) and peaks on GEO. Additionally, we now also released the source datasets and notebooks (.Rmarkdown format) on Zenodo that can be used to replicate the figures in the manuscript, and updated our statements on "Data and code availability".

      *4. Minor editorial issues remain, such as replacing "critical" with "crucial" in the Abstract, adding software version numbers to figure legends, and correcting the SAMtools reference. *

      Thank you for spotting them. We have fixed these issues.

      Reviewer #3 (Significance (Required)):

      The method is technically innovative and the biological insights are valuable; however, several issues-mainly concerning experimental design, statistical rigor, and functional validation-must be addressed to solidify the conclusions.

      Thank you for your comments. We hope to have addressed your concerns in this revised version of our manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript introduces T‑ChIC, a single‑cell multi‑omics workflow that jointly profiles full‑length transcripts and histone modifications (H3K27me3 and H3K4me1) and applies it to early zebrafish embryos (4-24 hpf). The study convincingly demonstrates that chromatin-transcription coupling strengthens during gastrulation and somitogenesis, that promoter‑anchored H3K27me3 spreads in cis to enforce developmental gene silencing, and that integrating TF chromatin status with expression can predict lineage‑specific activators and repressors.

      Major concerns

      1. Independent biological replicates are absent, so the authors should process at least one additional clutch of embryos for key stages (e.g., 6 hpf and 12 hpf) with T‑ChIC and demonstrate that the resulting data match the current dataset.
      2. The TF‑activity regression model uses an arbitrary R² {greater than or equal to} 0.6 threshold; cross‑validated R² distributions, permutation‑based FDR control, and effect‑size confidence intervals are needed to justify this cut‑off.
      3. Predicted TF functions lack empirical support, making it essential to test representative activators (e.g., Tbx16) and repressors (e.g., Zbtb16a) via CRISPRi or morpholino knock‑down and to measure target‑gene expression and H3K4me1 changes.
      4. The study does not prove that H3K27me3 spreading causes silencing; embryos treated with an Ezh2 inhibitor or prc2 mutants should be re‑profiled by T‑ChIC to show loss of spreading along with gene re‑expression.

      Minor concerns

      1. Repressive chromatin coverage is limited, so profiling an additional silencing mark such as H3K9me3 or DNA methylation would clarify cooperation with H3K27me3 during development.
      2. Computational transparency is incomplete; a supplementary table listing all trimming, mapping, and peak‑calling parameters (cutadapt, STAR/hisat2, MACS2, histoneHMM, etc.) should be provided.
      3. Data‑ and code‑availability statements lack detail; the exact GEO accession release date, loom‑file contents, and a DOI‑tagged Zenodo archive of analysis scripts should be added.
      4. Minor editorial issues remain, such as replacing "critical" with "crucial" in the Abstract, adding software version numbers to figure legends, and correcting the SAMtools reference.

      Significance

      The method is technically innovative and the biological insights are valuable; however, several issues-mainly concerning experimental design, statistical rigor, and functional validation-must be addressed to solidify the conclusions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Joint analysis of multiple modalities in single cells will provide a comprehensive view of cell fate states. In this manuscript, Bhardwaj et al developed a single-cell multi-omics assay, T-ChIC, to simultaneously capture histone modifications and full-length transcriptome and applied the method on early embryos of zebrafish. The authors observed a decoupled relationship between the chromatin modifications and gene expression at early developmental stages. The correlation becomes stronger as development proceeds, as genes are silenced by the cis-spreading of the repressive marker H3k27me3. Overall, the work is well performed, and the results are meaningful and interesting to readers in the epigenomic and embryonic development fields. There are some concerns before the manuscript is considered for publication.

      Major concerns:

      1. A major point of this study is to understand embryo development, especially gastrulation, with the power of scMulti-Omics assay. However, the current analysis didn't focus on deciphering the biology of gastrulation, i.e., lineage-specific pioneer factors that help to reform the chromatin landscape. The majority of the data analysis is based on the temporal dimension, but not the cell-type-specific dimension, which reduces the value of the single-cell assay.
      2. The cis-spreading of H3K27me3 with developmental time is interesting. Considering H3k27me3 could mark bivalent regions, especially in pluripotent cells, there must be some regions that have lost H3k27me3 signals during development. Therefore, it's confusing that the authors didn't find these regions (30% spreading, 70% stable). The authors should explain and discuss this issue.

      Minors:

      1. The authors cited two scMulti-omics studies in the introduction, but there have been lots of single-cell multi-omics studies published recently. The authors should cite and consider them.
      2. T-ChIC seems to have been presented in a previous paper (ref 15). Therefore, Fig. 1a is unnecessary to show.
      3. It's better to show the percentage of cell numbers (30% vs 70%) for each heatmap in Figure 2C.
      4. Please double-check the citation of Fig. S4C, which may not relate to the conclusion of signal differences between lineages.
      5. Figure 4C has not been cited or mentioned in the main text. Please check.

      Significance

      Strengths: This work utilized a new single-cell multi-omics method and generated abundant epigenomics and transcriptomics datasets for cells covering multiple key developmental stages of zebrafish. Limitations: The data analysis was superficial and mainly focused on the correspondence between the two modalities. The discussion of developmental biology was limited.

      Advance: The zebrafish single-cell datasets are valuable. The T-ChIC method is new and interesting.

      The audience will be specialized and from basic research fields, such as developmental biology, epigenomics, bioinformatics, etc.

      I'm more specialized in the direction of single-cell epigenomics, gene regulation, 3D genomics, etc.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors have a longstanding focus and reputation on single cell sequencing technology development and application. In this current study, the authors developed a novel single-cell multi-omic assay termed "T-ChIC" so that to jointly profile the histone modifications along with the full-length transcriptome from the same single cells, analyzed the dynamic relationship between chromatin state and gene expression during zebrafish development and cell fate determination. In general, the assay works well, the data look convincing and conclusions are beneficial to the community.

      There are several single-cell methodologies all claim to co-profile chromatin modifications and gene expression from the same individual cell, such as CoTECH, Paired-tag and others. Although T-ChIC employs pA-Mnase and IVT to obtain these modalities from single cells which are different, could the author provide some direct comparisons among all these technologies to see whether T-ChIC outperforms?

      In current study, T-ChIC profiled H3K27me3 and H3K4me1 modifications, these data look great. How about other histone modifications (eg H3K9me3 and H3K36me3) and transcription factors?

      T-ChIC can detect full length transcription from the same single cells, but in FigS3, the authors still used other published single cell transcriptomics to annotate the cell types, this seems unnecessary?

      Throughout the manuscript, the authors found some interesting dynamics between chromatin state and gene expression during embryogenesis, independent approaches should be used to validate these findings, such as IHC staining or RNA ISH?

      In Fig2 and FigS4, the authors showed H3K27me3 cis spreading during development, this looks really interesting. Is this zebrafish specific? H3K27me3 ChIP-seq or CutTag data from mouse and/or human embryos should be reanalyzed and used to compare. The authors could speculate some possible mechanisms to explain this spreading pattern?

      Significance

      The authors have a longstanding focus and reputation on single cell sequencing technology development and application. In this current study, the authors developed a novel single-cell multi-omic assay termed "T-ChIC" so that to jointly profile the histone modifications along with the full-length transcriptome from the same single cells, analyzed the dynamic relationship between chromatin state and gene expression during zebrafish development and cell fate determination. In general, the assay works well, the data look convincing and conclusions are beneficial to the community.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02879 Corresponding author(s): Matteo Allegretti; Alia dos Santos

      1. General Statements

      In this study, we investigated the effects of paclitaxel on both healthy and cancerous cells, focusing on alterations in nuclear architecture. Our novel findings show that:

      • Paclitaxel-induced microtubule reorganisation during interphase alters the perinuclear distribution of actin and vimentin. The formation of extensive microtubule bundles, in paclitaxel or following GFP-Tau overexpression, coincides with nuclear shape deformation, loss of regulation of nuclear envelope spacing, and alteration of the nuclear lamina.

      • Paclitaxel treatment reduces Lamin A/C protein levels via a SUN2-dependent mechanism. SUN2, which links the lamina to the cytoskeleton, undergoes ubiquitination and consequent degradation following paclitaxel exposure.

      • Lamin A/C expression, frequently dysregulated in cancer cells, is a key determinant of cellular sensitivity to, and recovery from, paclitaxel treatment.

      Collectively, our data support a model in which paclitaxel disrupts nuclear architecture through two mechanisms: (i) aberrant nuclear-cytoskeletal coupling during interphase, and (ii) multimicronucleation following defective mitotic exit. This represents an additional mode of action for paclitaxel beyond its well-established mechanism of mitotic arrest.

      We thank the reviewers for their time and constructive feedback. We have carefully considered all comments and have carried out a full revision. The updated manuscript now includes additional data showing:

      • Overexpression of microtubule-associated protein Tau causes similar nuclear aberration phenotypes to paclitaxel. This supports our hypothesis that increased microtubule bundling directly leads to nuclear disruption in paclitaxel during interphase.

      • Paclitaxel's effects on nuclear shape and Lamin A/C and SUN2 expression levels occur independently of cell division.

      • Reduced levels of Lamin A/C and SUN2 upon paclitaxel treatment occur at the protein level via ubiquitination of SUN2.

      • The effects of paclitaxel on the nucleus are conserved in breast cancer cells.

      Full Revision

      We have also edited our text and added further detail to clarify points raised by the reviewers. We believe that our revised manuscript is overall more complete, solid and compelling thanks to the reviewers' comments.

      1. Point-by-point description of the revisions

      Reviewer #1 Evidence, reproducibility and clarity

      This description of the down-regulation of the expression of lamin A/C upon treatment with paclitaxel and its sensitivity to SUN2 is quite interesting but still somehow preliminary. It is unclear whether this effect involves the regulation of gene expression, or of the stability of the proteins. How SUN2 mediates this effect is still unknown.

      We thank the reviewer for this valuable comment. To elucidate the mechanism behind the decrease in Lamin A/C and SUN2 levels, we have now performed several additional experiments. First, we performed RT-qPCR to quantify mRNA levels of these genes, relative to the housekeeping gene GAPDH (Supplementary Figure 3B and O). The levels of SUN2 and LMNA mRNA remained the same between control and paclitaxel-treated cells, indicating that this effect instead occurs at the protein level. We have also tested post-translational modifications as a potential regulatory mechanism for Lamin A/C and SUN2. In addition to the phosphorylation of Ser404 which we had already tested (Supplementary Figure 3C), we have now included additional Phos-tag gel and Western blotting data showing that the overall phosphorylation status of Lamin A/C is not affected by paclitaxel (Supplementary Figure 3E and F). We also pulled-down Lamin A/C from cell lysates and then Western blotted for polyubiquitin and acetyl-lysine, which showed that the ubiquitination and acetylation states of Lamin A/C are also not affected by paclitaxel (Supplementary Figure 3G-I). However, Western blots for polyubiquitin of SUN2 pulled down from cell lysates showed that paclitaxel treatment results in significant SUN2 ubiquitination (Figure 3M and N). Therefore, we propose that the downregulation of SUN2 following paclitaxel treatment occurs by ubiquitin-mediated proteolysis.

      The roles of free tubulins and polymerized microtubules, and thus the potential role of paclitaxel, need to be uncovered.

      We addressed this important point by using an alternative method to stabilise/bundle microtubules in interphase, namely by overexpressing GFP-Tau, as suggested by reviewer 2. Following GFP- Tau overexpression, large microtubule bundles were observed throughout the cytoplasm (Figure 4A), and this resulted in a significant decrease in nuclear solidity (Figure 4B). Furthermore, in cells where microtubule bundles extensively contacted the nucleus, the nuclear lamina became unevenly distributed and appeared patchy (Figure 4C). This supports our hypothesis that the aberrations to nuclear shape and Lamin A/C localisation in paclitaxel-treated cells are due to the presence of microtubules bundles surrounding the nucleus.

      The doses of paclitaxel at which occur the effects described in the paper are not fully consistent with all the conclusions. Most experiments have been done at 5 nM. However, at this dose the effect of lamin A/C over or down expression on the growth (differences in the slopes of the curves in Figure 4A) are not fully convincing and not fully consistent with the clear effect on viability as well (in addition, duration of treatments before assessing vialbility are not specified). At 1 nM, cell growth is reduced and the rescuing effect of lamin over-expression is much more clear (Fig 4A), and the nucleus deformation clear (Fig 2A) but this dose has no effect on lamin A/C expression (Fig 3C), which questions how lamins impact nucleus shape and cell survival. Cytoskeleton reorganisation in these conditions is not described although it could clarify the respective role of force production (suggested in figure 1) and nuclei resistance (shown in figure 2) in paclitaxel sensitivity.

      We thank the reviewer for raising this important point. We have addressed this by conducting additional repeats for the cell confluency measurements to increase the statistical power of our experiments (Figure 5A). Our data now show that GFP-lamin A/C had a statistically significant effect on rescuing cell growth at both 1 nM and 5 nM paclitaxel, while Lamin A/C knockdown exacerbated the inhibition of cell growth at 5 nM paclitaxel but not 1 nM paclitaxel (Figure 5A). In addition, we note that the duration of paclitaxel treatment before assessing viability was specified in the figure legend: "Bar graph comparing cell viability between wild-type (red), GFP-Lamin A/C overexpression (green), and Lamin A/C knockdown (blue) cells following 20 h incubation in 0, 1, 5, or 10 nM paclitaxel." We also repeated cell viability analysis after 48 h incubation in paclitaxel instead of 20 h to allow for a longer time for differences to take effect (Figure 5B).

      We also added figures showing the cytoskeletal reorganisation at both 1 and 10 nM in addition to 0 and 5 nM (Supplementary Figure 1A) showing that microtubule bundling and condensation of actin into puncta correlated with increased paclitaxel concentration. Vimentin colocalised well with microtubules at all concentrations.

      We have also included in our results section further clarification for the use of 5nM paclitaxel in this study. The new section reads as follows: "Experiments were performed at 5 nM paclitaxel (with additional experiments to determine dose relationships at 1 and 10 nM) because this aligns with previous studies7,14,24. Furthermore, previous analysis of patient plasma reveals that typical concentrations are within the low nanomolar range8, and concentrations of 5-10 nM are required in cell culture to reach the same intracellular concentrations observed in vivo in patient tumours9. This aligns with in vitro cytotoxic studies of paclitaxel in eight human tumour cell lines which show that paclitaxel's IC50 ranges between 2.5 and 7.5 nM41."

      Finally, although the absence of role of mitotic arrest is clear from the data, the defective reorganisation of the nucleus after mitosis still suggest that the effect of paclitaxel is not independent of mitosis.

      We thank the reviewer for pointing out the need for clarification in the wording of our manuscript. We have reworded the title and relevant sections of our abstract, introduction, and discussion to make it clearer that the effects of paclitaxel on the nucleus are due to a combination of aberrant nuclear cytoskeletal coupling during interphase and multimicronucleation following mitotic slippage. We have also added additional data in support of the effect of paclitaxel on nuclear architecture during interphase. For this, we used serum-starved cells (which divide only very slowly such that the majority of cells do not pass through mitosis during the 16 h incubation in paclitaxel [Supplementary Figure 2D]). Our new data confirmed that paclitaxel's effects on nuclear solidity, and Lamin A/C and SUN2 proteins levels can occur independently of cell division (Figure 2C; Figure 3H-J). Finally, when we overexpressed GFP-Tau (as discussed above) we observed similar aberrations to nuclear solidity and Lamin A/C localisation. This indicates that these effects occur due to microtubule bundling in interphase, especially as in our study GFP-Tau did not lead to multimicronucleation or appear to affect mitosis (Figure 4).

      Below are the main changes to the text regarding the interphase effect of paclitaxel:

      • Title: "Paclitaxel compromises nuclear integrity in interphase through SUN2-mediated cytoskeletal coupling"

      • Abstract: "Overall, our data supports nuclear architecture disruption, caused by both aberrant nuclear-cytoskeletal coupling during interphase and exit from defective mitosis, as an additional mechanism for paclitaxel beyond mitotic arrest."

      • Introduction: "Here we propose that cancer cells have increased vulnerability to paclitaxel both during interphase and following aberrant mitosis due to pre-existing defects in their NE and nuclear lamina."

      • Discussion: "Overall, our work builds on previous studies investigating loss of nuclear integrity as an anti-cancer mechanism of paclitaxel separate from mitotic arrest14,20,21. We propose that cancer cells show increased sensitivity to nuclear deformation induced by aberrant nuclear-cytoskeletal coupling and multimicronucleation following mitotic slippage. Therefore, we conclude that paclitaxel functions in interphase as well as mitosis, elucidating how slowly growing tumours are targeted."

      minor: a more thorough introduction of known data about dose response of cells in culture and in vivo would help understanding the range of concentrations used in this study.

      As mentioned above, we have now included additional information in our Results section to clarify our paclitaxel dose range: "Experiments were performed at 5 nM paclitaxel (with additional experiments to determine dose relationships at 1 and 10 nM) because this aligns with previous studies7,14,24. Furthermore, previous analysis of patient plasma reveals that typical concentrations are within the low nanomolar range8, and concentrations of 5-10 nM are required in cell culture to reach the same intracellular concentrations observed in vivo in patient tumours9. This aligns with in vitro cytotoxic studies of paclitaxel in eight human tumour cell lines which show that paclitaxel's IC50 ranges between 2.5 and 7.5 nM41."

      Significance

      In this manuscript, Hale and colleagues describe the effect of paclitaxel on nucleus deformation and cell survival. They showed that 5nM of paclitaxel induces nucleus fragmentation, cytoskeleton reorganisation, reduced expression of LaminA/C and SUN2, and reduced cell growth and viability. They also showed that these effects could be at least partly compensated by the over-expression of lamin A/C. As fairly acknowledged by the authors, the induction of nuclear deformation in paclitaxel-treated cells, and the increased sensitivity to paclitaxel of cells expressing low level of lamin A/C are not novel (reference #14). Here the authors provided more details on the cytoskeleton changes and nuclear membrane deformation upon paclitaxel treatment. The effect of lamin A/C over and down expression on cell growth and survival are not fully convincing, as further discussed below. The most novel part is the observation that paclitaxel can induce the down-regulation of the expression of lamin A/C and that this effect is mediated by SUN2.

      We appreciate the reviewer's summary and thank them for their time. We believe our comprehensive revisions have addressed all comments, strengthening the manuscript and making it more robust and compelling.

      Reviewer #2 Evidence, reproducibility and clarity This study investigates the effects of the chemotherapeutic drug paclitaxel on nuclear-cytoskeletal coupling during interphase, claiming a novel mechanism for its anti-cancer activity. The study uses hTERT-immortalized human fibroblasts. After paclitaxel exposure, a suite of state- of-the-art imaging modalities visualizes changes in the cytoskeleton and nuclear architecture. These include STORM imaging and a large number of FIB-SEM tomograms.

      We thank the reviewer for the summary and for highlighting our efforts in using the latest imaging technical advances.

      Major comments:

      The authors make a major claim that in addition to the somewhat well-described mechanism of paclitaxel on mitosis, they have discovered 'an alternative, poorly characterised mechanism in interphase'.

      However, none of the data proves that the effects shown are independent of mitosis. To the contrary, measurements are presented 48 hours after paclitaxel treatment starts, after which it can be assumed that 100% of cells have completed at least one mitotic event. The appearance of micronuclei evidences this, as discussed by the authors shortly. It looks like most of the results shown are based on botched mitosis or, more specifically, errors on nuclear assembly upon exit from mitosis rather than a specific effect of paclitaxel on interphase. The readouts the authors show just happen to be measurements while the cells are in interphase.

      Alternative hypotheses are missing throughout the manuscript, and so are critical controls and interpretations.

      We thank the reviewer for highlighting the lack of clarity in our wording. We have revised the title, abstract and relevant sections of the introduction and discussion to clarify our message that the effects of paclitaxel on the nucleus arise from a combination of aberrant nuclear-cytoskeletal coupling during interphase and multimicronucleation following exit from defective mitosis. We have also included additional data where we used slow-dividing, serum-starved cells (under these conditions, the majority of cells do not undergo mitosis during the 16 h incubation in paclitaxel [Supplementary Figure 2D]). Our new data show that even in these cells there is a clear effect of paclitaxel on nuclear solidity, and Lamin A/C and SUN2 protein levels, further supporting our hypothesis that these phenotypes can occur independently of cell division (Figure 2C; Figure 3H-J). Furthermore, we performed additional experiments where we used overexpression of GFP-Tau as an alternative method of stabilising microtubules in interphase and observed similar aberrations to nuclear solidity and Lamin A/C localisation. As GFP-Tau overexpression did not lead to micronucleation or appear to affect mitosis, these data support the hypothesis that nuclear aberrations occur due to microtubule bundling in interphase (Figure 4). We discuss these experiments in more detail below. Finally, we have reworded the introduction to better introduce alternative hypotheses and mechanisms for paclitaxel's activity.

      The authors claim that 'Previously, the anti-cancer activity of paclitaxel was thought to rely mostly on the activation of the mitotic checkpoint through disruption of microtubule dynamics, ultimately resulting in apoptosis.' The authors may have overlooked much of the existing literature on the topic, including many recent manuscripts from Xiang-Xi Xu's and another lab.

      We would like to note that the paper from Xiang-Xi Xu's lab (Smith et al, 2021) was cited in our original manuscript (reference 14 in both the original and revised manuscripts). We have now also included additional review articles from the Xiang-Xi Xu lab (PMID:36368286 20 and PMID: 35048083 21). Furthermore, we have clarified the wording in both the introduction and discussion to better reflect the current understanding of paclitaxel's mechanism and alternative hypotheses.

      The data, e.g. in Figure 1, does not hold up to the first alternative hypothesis, e.g. that paclitaxel stabilizes microtubules and that excessive mechanical bundling of microtubules induces major changes to cell shape and mechanical stress on the nucleus. Even the simplest controls for this effect (the application of an alternative MT stabilizing drug or the overexpression of an MT stabilizer, e.g., tau).

      We thank the reviewer for suggesting this control experiment using the microtubule stabiliser Tau. We have now included these experiments in the revised version of the manuscript (Figure 4). The overexpression of GFP-Tau supports our hypothesis that cytoskeletal reorganisation in paclitaxel exerts mechanical stress on the nucleus during interphase, resulting in nuclear deformation and aberrations to the nuclear lamina. In particular, GFP-Tau overexpression resulted in large microtubule bundles throughout the cytoplasm (Figure 4A). Notably, in cells where these bundles extensively contacted the nucleus, we observed a significant decrease in nuclear solidity (Figure 4B) accompanied by changes in nuclear lamina organisation, including a patchy lamina phenotype, similar to that induced by paclitaxel (Figure 4C).

      The focus on nuclear lamina seems somewhat arbitrary and adjacent to previously published work by other groups. What would happen if the authors stained for focal adhesion markers? There would probably be a major change in number and distribution. Would the authors conclude that paclitaxel exerts a specific effect on focal adhesions? Or would the conclusion be that microtubule stabilization and the following mechanical disruption induce pleiotropic effects in cells? Which effects are significant for paclitaxel function on cancer cells?

      We thank the reviewer for raising important points regarding the specificity of paclitaxel's effects. We agree that microtubule stabilisation can induce myriad cellular changes, including alterations to focal adhesions and other cytoskeletal components. Our focus on Lamin A/C and nuclear morphology is grounded both in the established clinical relevance of nuclear mechanics in cancer and builds on mechanistic work from other groups.

      Lamin A/C expression is commonly altered in cancer, and nuclear morphology is frequently used in cancer diagnosis35. Lamin A/C also plays a crucial role in regulating nuclear mechanics32 and, importantly, determines cell sensitivity to paclitaxel14. However, the mechanism by which Lamin A/C determines sensitivity of cancer cells to paclitaxel is unclear.

      Our data are consistent with Lamin A/C being a determinant of paclitaxel survival sensitivity. We also provide evidence that paclitaxel itself reduces Lamin A/C protein levels and disrupts its organisation at the nuclear envelope. We directly link these effects to microtubule bundling around the nucleus and degradation of force-sensing LINC component SUN2, highlighting the importance of nuclear architecture and mechanics to overall cellular function. Furthermore, we show that recovery from paclitaxel treatment depends on Lamin A/C expression levels. This has clinical relevance, as unlike cancer cells, healthy tissue with non-aberrant lamina would be able to selectively recover from paclitaxel treatment.

      Minor comments:

      While I understand the difficulty of the experiments and the effort the authors have put into producing FIB-SEM tomograms, I am not sure they are helping their study or adding anything beyond the light microscopy images. Some of the images may even be in the way, such as supplementary Figure 6, which lacks in quality, controls, and interpretation. Do I see a lot of mitochondria in that slice?

      We agree with the reviewer that Supplementary Figure 6 does not add significant value to the manuscript and thank the reviewer for pointing this out. We have removed it from the manuscript accordingly.

      I may have overlooked it, but has the number of cells from which lamellae have been produced been stated?

      We thank the reviewer for pointing out the missing information. For our cryo-ET experiments, we collected data from 9 lamellae from paclitaxel-treated cells and 6 lamellae from control cells, with each lamella derived from a single cell. This information has now been added to the figure legend (Figure 2F).

      Significance

      The significance of studying the effect of paclitaxel, the most successful chemotherapy drug, should be broad and of interest to basic researchers and clinicians.

      As outlined above, I believe that major concerns about the design and interpretation of the study hamper its significance and advancements.

      We appreciate the reviewer's concerns and have performed major revisions to strengthen the significance of our study. Specifically, we conducted two key sets of experiments to validate our original conclusions: serum starvation to control for the effects of cell division, and overexpression of the microtubule stabiliser Tau to demonstrate that paclitaxel can affect the nucleus via its microtubule bundling activity in interphase.

      By elucidating the mechanistic link between microtubule stabilisation and nuclear-cytoskeletal coupling, our findings contribute to our understanding of paclitaxel's multifaceted actions in cancer cells.

      My areas of expertise could be broadly defined as Cell Biology, Cytoskeleton, Microtubules, and Structural Biology.

      Reviewer #3 Evidence, reproducibility and clarity The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years.

      We thank the reviewer for the positive feedback.

      Although similar ideas are published, which may be suitable to be cited? • Paclitaxel resistance related to nuclear envelope structural sturdiness. Smith ER, Wang JQ, Yang DH, Xu XX. Drug Resist Updat. 2022 Dec;65:100881. doi: 10.1016/j.drup.2022.100881. Epub 2022 Oct 15. PMID: 36368286 Review. • Breaking malignant nuclei as a non-mitotic mechanism of taxol/paclitaxel. Smith ER, Xu XX. J Cancer Biol. 2021;2(4):86-93. doi: 10.46439/cancerbiology.2.031. PMID: 35048083 Free PMC article.

      We thank the reviewer for bringing to our attention these important review articles. In our initial manuscript, we only cited the original paper (14, also reference 14 in the original manuscript). We have now included citations to the suggested publications (20,21).

      We would also like to emphasise how our manuscript distinguishes itself from the work of Smith et al.14,20,21:

      • Cell-type focus: In their study 14, Smith et al. examined the effect of paclitaxel on malignant ovarian cancer cells and proposed that paclitaxel's effects on the nucleus are limited to cancer cells. However, our data extends these findings by demonstrating paclitaxel's effects in both cancerous and non-cancerous backgrounds.

      • Cytoskeletal reorganisation: Smith et al. show reorganisation of microtubules in paclitaxel-treated cells14. Our data show re-organisation of other cytoskeletal components, including F-actin and vimentin.

      • Multimicronucleation: Smith et al. propose that paclitaxel-induced multimicronucleation occurs independently of cell division14. Although we observe progressive nuclear abnormalities during interphase over the course of paclitaxel treatment, our data do not support this conclusion; we find that multimicronucleation occurs only following mitosis.

      • Direct link between microtubule bundling and nuclear aberrations: We show that nuclear aberrations caused by paclitaxel during interphase (distinct from multimicronucleation) are directly linked to microtubule bundling around the nucleus, suggesting they result from mechanical disruption and altered force propagation.

      • Lamin A/C regulation: Consistent with Smith et al.14, we show that Lamin A/C depletion leads to increased sensitivity to paclitaxel treatment. However, we further demonstrate that paclitaxel itself leads to reduced levels of Lamin A/C and that this effect occurs independently of mitosis and is mediated via force-sensing LINC component SUN2. Upon SUN2 knockdown, Lamin A/C levels are no longer affected by paclitaxel treatment.

      • Recovery: Finally, our work reveals that cells expressing low levels of Lamin A/C recover less efficiently after paclitaxel removal. This might help explain how cancer cells could be more susceptible to paclitaxel.

      Only one cell line was used in all the experiments? "Human telomerase reverse transcriptase (hTERT) immortalised human fibroblasts" ? The cells used are not very relevant to cancer cells (carcinomas) that are treated with paclitaxel. It is not clear if the observations and conclusions will be able to be generalized to cancer cells.

      We thank the reviewer for this comment. Our initial study aimed to understand the effects of paclitaxel on nuclear architecture in non-aberrant backgrounds. To show that the observed effects of paclitaxel are also applicable to cancer cells, we have now repeated our main experiments using MDA-MB-231 human breast cancer cells (Supplementary Figure 1B; Supplementary Figure 3P-T). Similar to our findings in human fibroblasts, paclitaxel treatment of MDA-MB-231 led to cytoskeletal reorganisation (Supplementary Figure 1B), a decrease in nuclear solidity (Supplementary Figure 3P), aberrant (patchy) localisation of Lamin A/C (Supplementary Figure 3Q), and a reduction in Lamin A/C and SUN2 levels (Supplementary Figure 3R-T).

      "Fig. 1. (B) STORM imaging of α-tubulin immunofluorescence in cells fixed after 16 h incubation in control media or 5 nM paclitaxel. Lower panels show α-tubulin clusters generated with HDBSCAN analysis. Scale bars = 10 μm." It needs explanation of what is meaning of the different color lines in the lower panels, just different filaments?

      We have added further detail to the figure legend for clarification: "Lower panels show α-tubulin clusters generated with HDBSCAN analysis. Different colours distinguish individual α-tubulin clusters, representing individual microtubule filaments or filament bundles."

      Generally, the figures need additional description to be clear.

      We have added further clarification and detail to our figure legends.

      "Figure 3 - Paclitaxel results in aberrations to the nuclear lamina." The sentence seems not to be well constructed. "Paclitaxel treatment causes ..."?

      We changed this sentence to: "Figure 3 - Paclitaxel treatment results in aberrant organisation of the nuclear lamina and decreased Lamin A/C levels via SUN2."

      Lamin A and C levels are different in different images (Fig. 3B, H): some Lamin A is higher, and sometime Lamin C is higher? This may possibly due to culture condition or subtle difference in sample handling?.

      We thank the reviewer for pointing this out and we agree that the ratio of Lamin A to Lamin C can vary with culture conditions. To confirm that paclitaxel treatment reduces total Lamin A/C levels regardless of this ratio, we repeated the Western blot analysis in three additional biological replicates using cells in which Lamin C levels exceeded Lamin A levels. These experiments confirmed a comparable decrease in total Lamin A/C levels. Figure 3B and 3C have been updated accordingly.

      Also, the effect on Lamin A/C and SUN2 levels are not significant of robust.

      Decreased Lamin A/C and SUN2 levels following paclitaxel treatment were consistently seen across three or more biological repeats (Figure 3B-C), and this could be replicated in a different cell type (MDA-MB-231) (Supplementary Figure 3R-T). Furthermore, Western blotting results are consistent with the patchy Lamin A/C distribution observed using confocal and STORM following paclitaxel treatment (Figure 3A; Supplementary Figure 3A), where Lamin A/C appears to be absent from discrete areas of the lamina.

      Any mechanisms are speculated for the reason for the reduction?

      We have now included additional data which aims to shed light on the mechanism behind the decrease in Lamin A/C and SUN2 levels following paclitaxel treatment. We found that SUN2 is selectively degraded during paclitaxel treatment. Immunoprecipitation of SUN2 followed by Western blotting against Polyubiquitin C showed increased SUN2 ubiquitination in paclitaxel (Figure 3M and N). Furthermore, in our original manuscript, we showed that Lamina A/C levels remained unaltered during paclitaxel treatment in cells where SUN2 had been knocked down. We propose that changes in microtubule organisation affect force propagation to Lamin A/C specifically via SUN2 and that this leads to Lamina A/C removal and depletion. Future work will be needed to fully understand this mechanism.

      In addition to the findings described above, we report no significant changes in mRNA levels for LMNA or SUN2 in paclitaxel (Supplementary Figure 3B and O). Phos-tag gels followed by Western blotting analysis for Lamin A/C also did not detect changes to the overall phosphorylation status of Lamin A/C due to paclitaxel treatment. This is in agreement with our initial data showing no changes to Lamin A/C Ser 404 phosphorylation levels (Supplementary Figure 3E and F). Finally, Lamin A/C immunoprecipitation experiments followed by Western blotting for Polyubiquitin C and acetyl-lysine showed no significant changes in the ubiquitination and acetylation state of Lamin A/C in paclitaxel-treated cells (Supplementary Figure 3G-I).

      Also, the about 50% reduction in protein level is difficult to be convincing as an explanation of nuclear disruption.

      The nuclear lamina and LINC complex proteins play a critical role in regulating nuclear integrity, stiffness and mechanical responsiveness to external forces28,31-33,54,75, as well as in maintaining the nuclear intermembrane distance69,74. In particular, SUN-domain proteins physically bridge the nuclear lamina to the cytoskeleton through interactions with Nesprins, thereby preserving the perinuclear space distance30,69,74. Mutations in Lamins have been shown to disrupt chromatin organization, alter gene expression, and compromise nuclear structural integrity, and experiments with LMNA knockout cells reveal that nuclear mechanical fragility is closely coupled to nuclear deformation47. Furthermore, nuclear-cytoskeletal coupling is essential during processes such as cell migration, where cells undergo stretching and compression of the nucleus; weakening or loss of the lamina in such cases compromises cell movement47,73. In our work, we show that alterations to nuclear Lamin A/C and SUN2 by paclitaxel treatment coincide with nuclear deformations (Figure 2A-D, F, G; Figure 3A-D, F, G; Supplementary Figure 3A, P-T) and that these deformations are reversible following paclitaxel removal (Supplementary Figure 4B-D). Our experiments also demonstrate that Lamin A/C expression levels significantly influence cell growth, cell viability, and cell recovery in paclitaxel (Figure 5). Therefore, drawing on current literature and our results, we propose that, during interphase, paclitaxel induces severe nuclear aberrations through the combined effects of: i) increased cytoskeletal forces on the NE caused by microtubule bundling; ii) loss of ~50% Lamin A/C and SUN2; iii) reorganisation of nucleo-cytoskeletal components.

      Significance

      The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years.

      The data may be improved to provide stronger support.

      Additional cell lines (of cancer or epithelial origin) may be repeated to confirm the generality of the observation and conclusions.?

      We thank the reviewer for the feedback and valuable suggestions. In response, we have included experiments using human breast cancer cell line MDA-MB-231 to further corroborate our findings and interpretations. We believe these additions have improved the clarity, robustness and impact of our manuscript, and we are grateful for the reviewer's contributions to its improvement.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years. Although similar ideas are published, which may be suitable to be cited?

      • Paclitaxel resistance related to nuclear envelope structural sturdiness. Smith ER, Wang JQ, Yang DH, Xu XX. Drug Resist Updat. 2022 Dec;65:100881. doi: 10.1016/j.drup.2022.100881. Epub 2022 Oct 15. PMID: 36368286 Review.
      • Breaking malignant nuclei as a non-mitotic mechanism of taxol/paclitaxel. Smith ER, Xu XX. J Cancer Biol. 2021;2(4):86-93. doi: 10.46439/cancerbiology.2.031. PMID: 35048083 Free PMC article.

      Only one cell line was used in all the experiments? "Human telomerase reverse transcriptase (hTERT) immortalised human fibroblasts" ? The cells used are not very relevant to cancer cells (carcinomas) that are treated with paclitaxel. It is not clear if the observations and conclusions will be able to be generalized to cancer cells.

      "Fig. 1. (B) STORM imaging of α-tubulin immunofluorescence in cells fixed after 16 h incubation in control media or 5 nM paclitaxel. Lower panels show α-tubulin clusters generated with HDBSCAN analysis. Scale bars = 10 μm." It needs explanation of what is meaning of the different color lines in the lower panels, just different filaments?

      Generally, the figures need additional description to be clear.

      "Figure 3 - Paclitaxel results in aberrations to the nuclear lamina." The sentence seems not to be well constructed. "Paclitaxel treatment causes ..."?

      Lamin A and C levels are different in different images (Fig. 3B, H): some Lamin A is higher, and sometime Lamin C is higher? This may possibly due to culture condition or subtle difference in sample handling?. Also, the effect on Lamin A/C and SUN2 levels are not significant of robust. Any mechanisms are speculated for the reason for the reduction? Also, the about 50% reduction in protein level is difficult to be convincing as an explanation of nuclear disruption.

      Significance

      The manuscript presents interesting new ideas for the mechanism of an old drug, taxol, which has been studied for the last 40 years.

      The data may be improved to provide stronger support.

      Additional cell lines (of cancer or epithelial origin) may be repeated to confirm the generality of the observation and conclusions.?

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study investigates the effects of the chemotherapeutic drug paclitaxel on nuclear-cytoskeletal coupling during interphase, claiming a novel mechanism for its anti-cancer activity. The study uses hTERT-immortalized human fibroblasts. After paclitaxel exposure, a suite of state-of-the-art imaging modalities visualizes changes in the cytoskeleton and nuclear architecture. These include STORM imaging and a large number of FIB-SEM tomograms.

      Major comments:

      The authors make a major claim that in addition to the somewhat well-described mechanism of paclitaxel on mitosis, they have discovered 'an alternative, poorly characterised mechanism in interphase'.

      However, none of the data proves that the effects shown are independent of mitosis. To the contrary, measurements are presented 48 hours after paclitaxel treatment starts, after which it can be assumed that 100% of cells have completed at least one mitotic event. The appearance of micronuclei evidences this, as discussed by the authors shortly. It looks like most of the results shown are based on botched mitosis or, more specifically, errors on nuclear assembly upon exit from mitosis rather than a specific effect of paclitaxel on interphase. The readouts the authors show just happen to be measurements while the cells are in interphase.

      Alternative hypotheses are missing throughout the manuscript, and so are critical controls and interpretations.

      The authors claim that 'Previously, the anti-cancer activity of paclitaxel was thought to rely mostly on the activation of the mitotic checkpoint through disruption of microtubule dynamics, ultimately resulting in apoptosis.' The authors may have overlooked much of the existing literature on the topic, including many recent manuscripts from Xiang-Xi Xu's and another lab.

      The data, e.g. in Figure 1, does not hold up to the first alternative hypothesis, e.g. that paclitaxel stabilizes microtubules and that excessive mechanical bundling of microtubules induces major changes to cell shape and mechanical stress on the nucleus. Even the simplest controls for this effect (the application of an alternative MT stabilizing drug or the overexpression of an MT stabilizer, e.g., tau).

      The focus on nuclear lamina seems somewhat arbitrary and adjacent to previously published work by other groups. What would happen if the authors stained for focal adhesion markers? There would probably be a major change in number and distribution. Would the authors conclude that paclitaxel exerts a specific effect on focal adhesions? Or would the conclusion be that microtubule stabilization and the following mechanical disruption induce pleiotropic effects in cells? Which effects are significant for paclitaxel function on cancer cells?

      Minor comments:

      While I understand the difficulty of the experiments and the effort the authors have put into producing FIB-SEM tomograms, I am not sure they are helping their study or adding anything beyond the light microscopy images. Some of the images may even be in the way, such as supplementary Figure 6, which lacks in quality, controls, and interpretation. Do I see a lot of mitochondria in that slice?

      I may have overlooked it, but has the number of cells from which lamellae have been produced been stated?

      Significance

      The significance of studying the effect of paclitaxel, the most successful chemotherapy drug, should be broad and of interest to basic researchers and clinicians.

      As outlined above, I believe that major concerns about the design and interpretation of the study hamper its significance and advancements.

      My areas of expertise could be broadly defined as Cell Biology, Cytoskeleton, Microtubules, and Structural Biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This description of the down-regulation of the expression of lamin A/C upon treatment with paclitaxel and its sensitivity to SUN2 is quite interesting but still somehow preliminary. It is unclear whether this effect involves the regulation of gene expression, or of the stability of the proteins. How SUN2 mediates this effect is still unknown. The roles of free tubulins and polymerized microtubules, and thus the potential role of paclitaxel, need to be uncovered.

      The doses of paclitaxel at which occur the effects described in the paper are not fully consistent with all the conclusions. Most experiments have been done at 5 nM. However, at this dose the effect of lamin A/C over or down expression on the growth (differences in the slopes of the curves in Figure 4A) are not fully convincing and not fully consistent with the clear effect on viability as well (in addition, duration of treatments before assessing vialbility are not specified). At 1 nM, cell growth is reduced and the rescuing effect of lamin over-expression is much more clear (Fig 4A), and the nucleus deformation clear (Fig 2A) but this dose has no effect on lamin A/C expression (Fig 3C), which questions how lamins impact nucleus shape and cell survival. Cytoskeleton reorganisation in these conditions is not described although it could clarify the respective role of force production (suggested in figure 1) and nuclei resistance (shown in figure 2) in paclitaxel sensitivity.

      Finally, although the absence of role of mitotic arrest is clear from the data, the defective reorganisation of the nucleus after mitosis still suggest that the effect of paclitaxel is not independent of mitosis.

      minor: a more thorough introduction of known data about dose response of cells in culture and in vivo would help understanding the range of concentrations used in this study.

      Significance

      In this manuscript, Hale and colleagues describe the effect of paclitaxel on nucleus deformation and cell survival. They showed that 5nM of paclitaxel induces nucleus fragmentation, cytoskeleton reorganisation, reduced expression of LaminA/C and SUN2, and reduced cell growth and viability. They also showed that these effects could be at least partly compensated by the over-expression of lamin A/C. As fairly acknowledged by the authors, the induction of nuclear deformation in paclitaxel-treated cells, and the increased sensitivity to paclitaxel of cells expressing low level of lamin A/C are not novel (reference #14). Here the authors provided more details on the cytoskeleton changes and nuclear membrane deformation upon paclitaxel treatment. The effect of lamin A/C over and down expression on cell growth and survival are not fully convincing, as further discussed below. The most novel part is the observation that paclitaxel can induce the down-regulation of the expression of lamin A/C and that this effect is mediated by SUN2.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This manuscript addresses the question of whether inhibitors of the phosphatases Eya1-4 and of the kinase PLK1 provide an effective therapeutic approach to a range of cancers. Both Eyas and PLK1 have well documented roles in development, and have been implicated in a subset of tumors. Moreover, the authors have previously shown that PLK1 is a substrate of Eya phosphatase activity. Building on these previous findings, the authors assess the possibility of combining an Eya inhibitor, benzarone, with a PLK1 inhibitor, BI2536.

      There are several concerns with the study: 1. The authors suggest that these two drugs are synergistic. Synergy is usually taken as indicative of a greater than additive effect of the two drugs. The ZIP synergy score tested here indicates that the combination of the two drugs has a synergy score between 0 and 10 (figure1, and figure 5). According to "Synergy Finder", "A ZIP synergy score of greater than 10 often indicates a strong synergistic effect, while a score less than -10 suggests a strong antagonistic effect. Scores between -10 and 10 are typically considered additive or near-additive." The data in figure 2 on mitotic cell fraction and on cell death also seems to be more of an additive effect of the two drugs than synergy. The data in figure 3 are also additive effects on RAD51. Therefore a conclusion that "These data indicate that the drug combination was broadly synergistic" seems unwarranted.

      There is a general lack of nomenclature standardisation for defining synergy. Furthermore, multiple synergy models exist, with discrepancies between them. However, as the reviewer states, the prevailing view is that synergy is a combination effect that is stronger than the additive effect of the two drugs. Synergy scores derived from dose-response matrices using different synergy scoring models with scores that fall above 5 are considered truly synergistic (Malyutina A et al., 2019). To strengthen our conclusion of synergy between PLK1 and EYA inhibitors, we have calculated synergy scores using additional synergy models for both benzarone + BI2536 and benzarone + volasertib in H4 and T98G cell lines. Specifically, we find robust synergy (>5) using ZIP, HSA and Bliss calculations with the Benzarone + BI2536 drug combination in H4 cells and with Benzarone + Volasertib in H4 and T98G cells. Synergy scores for Benzarone + BI2536 fell just below 5 in T98G cells. These data are now included in Supplemental Fig S1G of the revised manuscript.

      The discovery of synergistic drug combinations can be further strengthened by evaluating synergy across multiple cellular models. In this study, we have tested a total of 27 different cancer models that universally support synergy.

      Regarding the phenotypic outcomes (mitotic cell fraction, cell death, RAD51 foci), we agree that the observed effects are additive. This is consistent with overall synergistic effects on viability being caused by a combination of additive mechanistic effects. We have amended the text in the revised manuscript to clarify this point.

      There was no statistical difference in the synergy scores of the "high expressing" versus "low expressing cells". So the conclusion that the drug combination "t was effective at lower doses in cell lines with high levels of EYA1 and/or EYA4" seems unwarranted based on the data. Moreover, since there was no statistical difference in synergy between high and low expressing cells, stating that "the potential utility of the combination treatment depends on the specific overexpression of EYA1 and/or EYA4 in cancer cells," seems unwarranted by the data.

      Synergy scores quantify the interaction between drugs, but do not capture absolute treatment effectiveness or dose sensitivity, both of which are crucial for therapeutic considerations. We have included the following sentence in the revised manuscript to clarify this distinction: “While synergy scores did not significantly differ between high and low EYA expressors, high EYA1/4 expression was associated with increased sensitivity to the combination treatment at lower doses, as evidenced by decreased cell viability.” We have also amended the conclusions in the Abstract and Discussion to reflect that the potential utility of the combination therapy in EYA1/4-high cancers is supported by potency rather than synergy scores alone.

      Benzarone and benzbromarone and their derivatives have been shown to bind and inhibit Eya phosphatases, albeit at fairly high doses. However, these two compounds also have a number of other, unrelated targets. The only demonstration that Eyas are a target of benzarone in this study are the CETSA data in supplemental figure 1. The data here seem to represent an n of 1, with no error bars shown. Even more importantly, there is no control. Looking at the blot of actin, it seems as if there may be a benzarone- temperature effect on this protein as well. It would be very helpful to show some evidence that knockdown of Eya similarly synergizes with the PLK1 inhibitor, show data that benzarone is in fact inhibiting Eya activity in these cells by looking at known targets (ie the carboxyterminal tyrosine of H2AX), and other evidence of specificity.

      The specificity of benzarone to the EYA proteins has been demonstrated previously using both in vitro phosphatase assays and the assessment of EYA-mediated pathways (Tadjuidje et al., 2012; Wang et al., 2021; Nelson et al., 2024). These publications have been cited in the manuscript. In addition, benzarone produces phenotypes consistent with the known functions of the EYAs (ie, reduction of PLK1 activity, reduction in RAD51 foci, G2/M arrest, and apoptosis). To further validate EYA target specificity, we have performed viability assays on control and EYA4-depleted HeLa, H4 and T98G cells in response to BI2536 treatment, demonstrating EYA4 depletion-mediated sensitization to BI2536. These data are now included in Fig 1H of the revised manuscript.

      To strengthen our CETSA data, we have now included: (i) densitometry of actin, demonstrating a lack of benzarone-temperature effect, (ii) CETSA analysis for an additional cell line (T98G), demonstrating enhanced thermal stability of the EYAs in the presence of benzarone, and (iii) CETSA analysis of an additional protein (BUB1) to demonstrate target specificity. These data are now included in Supplemental Fig S1E and F of the revised manuscript.

      The proteomic and transcriptomic data of cell lines that were vulnerable to the combination of BI2536 and benzarone implicate overall changes in chromatin with sensitivity. These findings call into question the idea that these two compounds are acting selectively on PLK1 and Eyas. The authors don't really provide any model for explaining this correlation of Nurd complex components with targeting Eyas and PLK1.

      The proteomic and transcriptomic data demonstrate that sensitivity to the combination treatment is associated with higher expression of NuRD complex members and other chromatin regulators. This suggests that cell lines with certain chromatin configurations might be more susceptible to the combined inhibition of PLK1 and EYA. This does not undermine the demonstrated on-target effects of the two compounds, but rather suggests a potential contextual dependence of drug efficacy on chromatin state. Our data thereby implicate NuRD complex expression as a predictive biomarker for tumours that are likely to respond to EYA and PLK1 combination therapy. This has now been clarified in the discussion section of the revised manuscript.

      Specificity of antibodies: I would like to see validation of the Eya antibodies, given the difficulty with such reagents in the field.

      All EYA antibodies have now been validated by western blot analysis following siRNA-mediated depletion. These data are presented in Supplemental Fig S1A of the revised manuscript.

      Reviewer #1 (Significance (Required)):

      New therapies targeting glioblastoma would be welcome. It is not clear that the combination tested here is an effective approach to therapy. It would be necessary to know the targets of the combination and understand the mechanism so that the approach could be pursued further,

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This study explores the sensitivity of cancer cell lines, particularly GBM cells, to dual inhibition of EYA and PLK1, aiming to uncover the connection between these pathways and the cancer stem cell state. Additionally, it investigates whether the NuRD complex modulates GBM cell responses to EYA and PLK1 inhibition. While the findings are interesting, further clarification is needed to establish the mechanistic links between EYA, PLK1, and NuRD, as well as a stronger rationale for their targeted inhibition in GBM therapy- this can be better clarified.

      Some key comments and recommendations: The findings demonstrate that the combination of Benzarone (EYAi) and volasertib (PLKi) significantly reduced cell proliferation in H4 and T98G GBM cell lines, both of which show high expression of EYA. In contrast, the low EYA-expressing A172 cells exhibited limited response. A possible explanation is the inherently slower proliferation rate of A172 cells, which may reduce their dependence on G2/M arrest, thereby diminishing the impact of PLK1i. Does A172 line show a similar growth or cell division rate to H4 and T98G lines.

      A172 cells have a slower proliferation rate than H4 or T98G cells, which may diminish their response to EYA/PLK1 inhibitors. However, in this study we have tested a total of 15 cancer cell lines and 12 GBM stem cell line models. No clear correlation between cell growth rate and sensitivity was observed. As a specific example, the low EYA expressing SJSA-1 cell line has a high proliferation rate but is a low responder to EYA1/PLK1 inhibitors.

      Additionally, although protein expression levels of EYA were assessed across these cell lines, the activity and expression levels of PLK1 were not fully characterized. Since PLK1 is a crucial regulator of mitotic entry and DNA damage repair, its activity across cell lines may contribute to the observed variations in drug sensitivity. Could the authors investigate levels of PLK in these cell lines?

      To address this point, we compared PLK1 expression levels across the panel of cancer cell lines used in our study. These data are now included in Supplemental Fig S1D of the revised manuscript, and show that PLK1 levels are comparable across the cell lines, indicating that baseline PLK1 abundance does not fully explain the observed differential sensitivity.

      The study describes the combination treatment as synergistic in H4 and T98G cells, however this synergy is unclear in Fig 2A and Supplemental Fig S2A. The data suggest that H4 and T98G cells exhibit sensitivity to either EYA or PLK1 inhibition alone, with combined treatment showing enhanced effects rather than synergy. This distinction is evident as BI2536 alone induces robust G2/M arrest with decreased G1 and S phase cells. To validate these findings, combination treatment should be tested in additional GBM cell lines. Additionally, repeating FUCCI cell cycle assays in A172 and H4 cells, particularly in H4, where increased γH2AX and phospho-H3 were detected in response to individual inhibitors, would provide more definitive insights into treatment-induced cell cycle dynamics.

      We agree that several of the phenotypic outcomes, for example G2/M arrest (Fig 2A) and micronuclei formation (Supplemental Fig S2A), produce additive rather than synergistic effects in the combination treated cells. The major claim of the study is that the combination treatment results in potent loss of cell viability in EYA1/EYA4 overexpressing cancer cell models. This is consistent with a combination of additive mechanistic effects causing overall synergistic effects on cancer cell viability. We have clarified this point in the revised manuscript.

      We have previously struggled to get adequate FUCCI sensor expression in H4 cells. However, to address this point, we have quantified cell cycle phase distribution in H4 cells treated with benzarone, BI2536, and the drug combination, using our quantitative image-based cytometry data (Fig 3A, B). These data demonstrate an accumulation of H4 cells in G2/M following combination treatment, consistent with the FUCCI data from T98G cells. Cell cycle dynamics of H4 cells are now included in Supplemental Fig S2A of the revised manuscript.

      A notable inconsistency: Figure 1 utilizes volasertib, whereas Figure 2 employs BI2536. Given that both inhibitors target PLK1 why these specific inhibitors were chosen for each experiment.

      This is not the case. To clarify, BI2536 is used in both Fig 1 and 2. Volasertib is used in Supplemental Fig S1 to reproduce the synergy matrix, thereby demonstrating consistent results with a second PLK1 inhibitor.

      The observation of increased Rad52 foci and sister chromatid exchange (SCE) upon EYA and PLK1 inhibition (Figure 3) is interesting. These findings suggest that dual inhibition impairs homologous recombination (HR), reinforcing the role of EYA and PLK1 in maintaining genomic stability.

      We agree.

      Figure 4 suggests that SJH1 cells, with low EYA expression, exhibit increased sensitivity to EYA inhibition - does this cell line show high expression of PLK or NuRD?

      To clarify, Fig 4 shows that SJH1 cells, which display moderate levels of EYA expression, are highly sensitive to EYA/PLK1 inhibition. Consistent with the observed positive correlation between NuRD protein expression and EYA/PLK1 inhibitor sensitivity, SJH1 cells exhibit the highest levels of NuRD components relative to the other GBM stem cell lines. Expression levels of NuRD components across the slightly sensitive, moderately sensitive, and highly sensitive GBM stem cell lines from publicly available proteomic data and western blot analysis have now been included in Supplemental Fig S5A and B of the revised manuscript, further demonstrating this positive correlation.

      It seems like EYA1 (HW1) and EY4 (SB2B and PB1) expression levels are better predictors of sensitivity to treatment, but not EYA2 and 3 (which is high in H4)- can the authors comment on this?

      Overall, EYA1 and EYA4 expression levels are the major predictors of EYA/PLK1 inhibitor sensitivity in both the cancer cell lines (Fig 1) and the GBM stem cell models (Fig 4). EYA3 levels are also positively associated with sensitivity in the GBM stem cell models, but not in the cancer cell lines. Despite being consistently high, EYA2 expression levels were not associated with sensitivity in either model. These intricacies are likely to reflect functional differences between the proteins, and their ability to form different sub-complexes with each other. We have now clarified these points in the discussion of the revised manuscript.

      Reviewer #2 (Significance (Required)):

      It remains unclear whether NuRD complex involvement is independent of EYA expression levels. Since EYA and PLK1 regulate cell cycle progression and DNA repair, further investigation is needed to delineate their connection to NuRD-mediated chromatin remodeling and differentiation programs. Overall, this study provides some interesting evidence for targeting transcriptional and mitotic vulnerabilities in GBM but requires further validation of synergistic mechanisms, differential inhibitor effects, and NuRD complex involvement in regulating the EYA-PLK1 axis.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript extends the findings of the interactions between EYA family members and PLK1. The idea to combine EYA inhibitors and PLK1 inhibitors is a thoughtful approach. The effects on proliferation and DNA damage are useful. This effort is a combination of preclinical efforts and some mechanistic efforts and will require additional efforts to support the conclusions drawn.

      Major concerns: 1. The preclinical studies will absolutely require in vivo studies. All brain tumor treatments are limited by delivery across the blood-brain barrier. It is critical to have intracranial survival studies to support the significance of the findings.

      In this study, we have focused on in vitro models including cancer cell lines, GBM stem cell models and 3D tumor spheroids, to establish proof-of-principle as well as mechanistic insight for combined EYA/PLK1 inhibition. We recognize that blood-brain-barrier penetration and therapeutic efficacy in vivo are key translational steps; however, we feel that benzarone is a suboptimal drug candidate for in vivo evaluation. Future development of second-generation EYA inhibitors with higher potency, improved selectivity, and better blood-brain-barrier permeability, is currently underway by ourselves and other groups. These compounds are likely to be more suitable for future in vivo studies, including pharmacokinetic profiling, blood-brain-barrier penetration assays, and orthotopic intracranial tumour models to assess their therapeutic potential more rigorously.

      Likewise, cancer stem cell studies require in vivo studies.

      As outlined above, we feel that in vivo studies fall beyond the scope of this study.

      The proper studies of sphere formation would include in vitro limiting dilution assays. I would suggest greater depth in stem cell and differentiation marker studies to understand what the connection to stemness is.

      The limiting dilution assay is used to measure the self-renewal potential of cancer stem cells, and would be used in this context to determine whether the treatments impact cellular differentiation. This is not the focus of this study. Rather, we are interested in comparing drug sensitivity in cancer stem cells versus differentiated cancer cells. Nevertheless, this is a great suggestion for future investigation as part of a more detailed evaluation of stemness and how these drugs and drug combinations impact self-renewal.

      DNA damage responses differ between cancer stem cells and differentiated tumor cells. I would suggest comparison of effects between matched cells with different cell states.

      We agree that cancer stem cells and their differentiated counterparts often display distinct DNA damage responses. We have tried to mimimise the impact of these differences on the overall conclusions by using multiple cancer cell lines and GBM stem models. To address this comment, we performed western blot analysis of DNA damage response proteins in matched PB1 stem cells and differentiated cells, demonstrating comparable expression of DNA damage response proteins. These data have now been included in Supplemental Fig S5C of the revised manuscript.

      While the inhibitors used may have general specificity for the molecular targets, I would suggest that the authors use genetic loss-of-function and gain-of-function studies to validate the findings. It is particularly important because the primary targets do not predict treatment responses. I would suggest that rescues with PLK1 phosphorylation mutants would be helpful.

      Our data demonstrate that EYA expression levels are predictive of treatment response in both cancer cell lines and GBM stem cell models. To further validate EYA target specificity, we have used a genetic loss-of-function approach. Specifically, we performed viability assays on control and EYA4-depleted HeLa, H4 and T98G cells in response to BI2536 treatment, demonstrating EYA4 depletion-mediated sensitization to BI2536. These data are now included in Fig 1H of the revised manuscript.

      We have previously performed comprehensive rescue experiments with PLK1 phosphorylation mutants (Fig 5C–K; Nelson et al., Nat. Commun. 2024). These experiments demonstrated that cell death in response to EYA depletion or inhibition is attributable to the phosphorylation status of pY445 on PLK1, with an accumulation of Y445 phosphorylation reducing PLK1 activity and functionality, culminating in the potent induction of mitotic cell death.

      Figure 5 should be performed with several lines across different response groups.

      Our study currently includes cell viability and proliferation data from multiple models including 15 cancer cell lines and 12 GBM stem cell line models, spanning different EYA expression levels, and displaying varying sensitivities to both single agents and the EYA/PLK1 combination treatment. We then narrowed the number of models significantly for follow-up analysis. In Fig 5, we selected the highly sensitive PB1 GBM stem cell line based on its ability to form and grow as spheroids. While we appreciate the suggestion to expand these analyses to additional lines, we would like to respectfully decline growing additional spheroids at this time due to limitations inherent in the expansion of these models. We believe that the current dataset adequately demonstrates the reproducibility and relevance of our findings across different response groups.

      The molecular associations are currently just associations. I would suggest greater analysis using genetic manipulation to test causation.

      To address this concern, we have performed additional experiments using siRNA-mediated knockdown of EYA4 in HeLa, H4 and T98G cells. These experiments demonstrate that depletion of EYA4 sensitizes cells to PLK1 inhibition, mimicking the effects observed with pharmacological EYA inhibition. These data have been included in Fig 1H of the revised manuscript, and provide additional functional evidence supporting a causal relationship between EYA activity and sensitivity to PLK1 inhibition.

      Figure 6 should be better developed to include protein testing and validation.

      To address this point, expression levels of NuRD components have been compared using publicly available proteomic datasets and western blot analysis across the slightly sensitive, moderately sensitive and highly sensitive GBM stem cell lines, supporting a positive correlation with sensitivity. These data have been included in Supplemental Fig S5A and B of the revised manuscript.

      Reviewer #3 (Significance (Required)):

      This is a modest advance in understanding how EYA family members may function with PLK1.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript extends the findings of the interactions between EYA family members and PLK1. The idea to combine EYA inhibitors and PLK1 inhibitors is a thoughtful approach. The effects on proliferation and DNA damage are useful. This effort is a combination of preclinical efforts and some mechanistic efforts and will require additional efforts to support the conclusions drawn.

      Major concerns:

      1. The preclinical studies will absolutely require in vivo studies. All brain tumor treatments are limited by delivery across the blood-brain barrier. It is critical to have intracranial survival studies to support the significance of the findings.

      2. Likewise, cancer stem cell studies require in vivo studies.

      3. The proper studies of sphere formation would include in vitro limiting dilution assays. I would suggest greater depth in stem cell and differentiation marker studies to understand what the connection to stemness is.

      4. DNA damage responses differ between cancer stem cells and differentiated tumor cells. I would suggest comparison of effects between matched cells with different cell states.

      5. While the inhibitors used may have general specificity for the molecular targets, I would suggest that the authors use genetic loss-of-function and gain-of-function studies to validate the findings. It is particularly important because the primary targets do not predict treatment responses. I would suggest that rescues with PLK1 phosphorylation mutants would be helpful.

      6. Figure 5 should be performed with several lines across different response groups.

      7. The molecular associations are currently just associations. I would suggest greater analysis using genetic manipulation to test causation.

      8. Figure 6 should be better developed to include protein testing and validation.

      Significance

      This is a modest advance in understanding how EYA family members may function with PLK1.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study explores the sensitivity of cancer cell lines, particularly GBM cells, to dual inhibition of EYA and PLK1, aiming to uncover the connection between these pathways and the cancer stem cell state. Additionally, it investigates whether the NuRD complex modulates GBM cell responses to EYA and PLK1 inhibition. While the findings are interesting, further clarification is needed to establish the mechanistic links between EYA, PLK1, and NuRD, as well as a stronger rationale for their targeted inhibition in GBM therapy- this can be better clarified.

      Some key comments and recommendations:

      • The findings demonstrate that the combination of Benzarone (EYAi) and volasertib (PLKi) significantly reduced cell proliferation in H4 and T98G GBM cell lines, both of which show high expression of EYA. In contrast, the low EYA-expressing A172 cells exhibited limited response. A possible explanation is the inherently slower proliferation rate of A172 cells, which may reduce their dependence on G2/M arrest, thereby diminishing the impact of PLK1i. Does A172 line show a similar growth or cell division rate to H4 and T98G lines.

      • Additionally, although protein expression levels of EYA were assessed across these cell lines, the activity and expression levels of PLK1 were not fully characterized. Since PLK1 is a crucial regulator of mitotic entry and DNA damage repair, its activity across cell lines may contribute to the observed variations in drug sensitivity. Could the authors investigate levels of PLK in these cell lines?

      • The study describes the combination treatment as synergistic in H4 and T98G cells, however this synergy is unclear in Figure 2A and EV 2A. The data suggest that H4 and T98G cells exhibit sensitivity to either EYA or PLK1 inhibition alone, with combined treatment showing enhanced effects rather than synergy. This distinction is evident as BI2536 alone induces robust G2/M arrest with decreased G1 and S phase cells. To validate these findings, combination treatment should be tested in additional GBM cell lines. Additionally, repeating FUCCI cell cycle assays in A172 and H4 cells, particularly in H4, where increased γH2AX and phospho-H3 were detected in response to individual inhibitors, would provide more definitive insights into treatment-induced cell cycle dynamics.

      • A notable inconsistency: Figure 1 utilizes volasertib, whereas Figure 2 employs BI2536. Given that both inhibitors target PLK1 why these specific inhibitors were chosen for each experiment.

      • The observation of increased Rad52 foci and sister chromatid exchange (SCE) upon EYA and PLK1 inhibition (Figure 3) is interesting. These findings suggest that dual inhibition impairs homologous recombination (HR), reinforcing the role of EYA and PLK1 in maintaining genomic stability.

      • Figure 4 suggests that SJH1 cells, with low EYA expression, exhibit increased sensitivity to EYA inhibition - does this cell line show high expression of PLK or NuRD?

      • It seems like EYA1 (HW1) and EY4 (SB2B and PB1) expression levels are better predictors of sensitivity to treatment, but not EYA2 and 3 (which is high in H4)- can the authors comment on this?

      Significance

      It remains unclear whether NuRD complex involvement is independent of EYA expression levels. Since EYA and PLK1 regulate cell cycle progression and DNA repair, further investigation is needed to delineate their connection to NuRD-mediated chromatin remodeling and differentiation programs.

      Overall, this study provides some interesting evidence for targeting transcriptional and mitotic vulnerabilities in GBM but requires further validation of synergistic mechanisms, differential inhibitor effects, and NuRD complex involvement in regulating the EYA-PLK1 axis.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript addresses the question of whether inhibitors of the phosphatases Eya1-4 and of the kinase PLK1 provide an effective therapeutic approach to a range of cancers. Both Eyas and PLK1 have well documented roles in development, and have been implicated in a subset of tumors. Moreover, the authors have previously shown that PLK1 is a substrate of Eya phosphatase activity. Building on these previous findings, the authors assess the possibility of combining an Eya inhibitor,benzarone, with a PLK1 inhibitor, BI2536.

      There are several concerns with the study:

      1. The authors suggest that these two drugs are synergistic. Synergy is usually taken as indicative of a greater than additive effect of the two drugs. The ZIP synergy score tested here indicates that the combination of the two drugs has a synergy score between 0 and 10 (figure1, and figure 5) . According to "Synergy Finder" , "A ZIP synergy score of greater than 10 often indicates a strong synergistic effect, while a score less than -10 suggests a strong antagonistic effect. Scores between -10 and 10 are typically considered additive or near-additive." The data in figure 2 on mitotic cell fraction and on cell death also seems to be more of an additive effect of the two drugs than synergy. The data in figure 3 are also additive effects on RAD51. Therefore a conclusion that "These data indicate that the drug combination was broadly synergistic" seems unwarranted. Indeed, the data form

      2. There was no statistical difference in the synergy scores of the "high expressing" versus "low expressing cells". So the conclusion that the drug combination "t was effective at lower doses in cell lines with high levels of EYA1 and/or EYA4" seems unwarranted based on the data. Moreover, since there was no statistical difference in synergy between high and low expressing cells, stating that "the potential utility of the combination treatment depends on the specific overexpression of EYA1 and/or EYA4 in cancer cells," seems unwarranted by the data.

      3. Benzarone and benzbromarone and their derivatives have been shown to bind and inhibit Eya phosphatases, albeit at fairly high doses. However, these two compounds also have a number of other, unrelated targets. The only demonstration that Eyas are a target of benzarone in this study are the CETSA data in supplemental figure 1. The data here seem to represent an n of 1, with no error bars shown. Even more importantly, there is no control. Looking at the blot of actin, it seems as if there may be a benzarone- temperature effect on this protein as well. It would be very helpful to show some evidence that knockdown of Eya similarly synergizes with the PLK1 inhibitor, show data that benzarone is in fact inhibiting Eya activity in these cells by looking at known targets (ie the carboxyterminal tyrosine of H2AX), and other evidence of specificity.

      4. The proteomic and transcriptomic data of cell lines that were vulnerable to the combination of BI2536 and benzarone implicate overall changes in chromatin with sensitivity. These findings call into question the idea that these two compounds are acting selectively on PLK1 and Eyas. The authors don't really provide any model for explaining this correlation of Nurd complex components with targeting Eyas and PLK1.

      5. Specificity of antibodies: I would like to see validation of the Eya antibodies, given the difficulty with such reagents in the field.

      Significance

      New therapies targeting glioblastoma would be welcome. It is not clear that the combination tested here is an effective approach to therapy. It would be necessary to know the targets of the combination and understand the mechanism so that the approach could be pursued further,

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for taking the time to review our manuscript and for providing valuable comments on how to improve it. We are pleased to see that both reviewers recognize the novelty and importance of our study, its conceptual advance and potential clinical significance. They also noted the novelty and value of our functional mechanistic approach using epigenetic editing. Below, we provide a point-by-point response to their questions and points raised. The changes introduced in response to their feedback are highlighted in yellow in the revised manuscript file.

      Point-by-point description of the revisions

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Summary This study by Prada et al. aimed to explore DNA methylation and gene expression in primary EpCAMhigh/PDPNlow cells, consisting of for (probably) the largest part of AT2 cells, to understand the molecular mechanisms behind the impaired regeneration of alveolar epithelial progenitor cells in COPD. They found that higher or lower promoter methylation in COPD-associated cells was inversely correlated with changes in gene expression, with interferon signaling emerging as one of the most upregulated pathways in COPD. IRF9 was identified as the master regulator of interferon signaling in COPD. Targeted DNA demethylation of IRF9 in an A549 cell line resulted in a robust activation of its downstream target genes, including OAS1, OAS3, PSMB8, PSMB9, MX2 and IRF7, demonstrating that demethylation of IRF9 is sufficient to activate the IFN signaling pathway, validating IRF9 as a master regulator of IFN signaling in (alveolar) epithelial cells.

      Major comments:

      • To remove airways (and blood vessels) completely from the lung tissue is difficult, if not impossible. This means that the assumption that the sorted EpCAMpos/PDPNlow cells primarily consisted of AT2 cells remains valid only if a quantitative analysis is conducted on the proportion of HT2-280pos cells in all samples in cytospins to exclude any significant contamination from bronchial epithelial cells. If authors cannot demonstrate >95% pure HT-280-positive cells, then the key conclusions suggesting that the epigenetic regulation of the IFN pathway might be crucial in AT2 progenitor cell regeneration could also potentially apply to bronchial progenitor cells. In addition, if >95% purity cannot be demonstrated, the data should be adjusted to account for differences in cell type composition.

      __Response: __

      We thank the reviewer for raising this important point. Although, as pointed out by the reviewer, we cannot guarantee that our sorted cells do not contain a minor contamination from respiratory / terminal bronchial cells, we carefully selected donors, tissue regions, and sorting strategy to ensure the highest possible enrichment of AT2 cells, as we explain below. We have now expanded the methods and results section and covered this point in the manuscript discussion.

      • The lung tissue pieces we received were distal, as evidenced by the presence of pleura. We collected representative tissue pieces for histology to validate sample quality. Our protocol includes a dissection of all visible airways and vessels using a dissecting microscope, which were cryopreserved separately from distal parenchyma. Hence, the starting material for tissue dissociation was depleted from airways and vessels. The importance of vessel/airway removal for enrichment of distal alveolar cells was established by Tata's group (PMID: 35712012).
      • We selected the AT2 sorting protocol (EpCAMpos/PDPNlow) based on previous publications that used tissue from both healthy and COPD lungs to separate AT2 cells from AT1 and airway basal cells, as AT1 and basal cells are both PDPNhigh (PMID: 22033268, PMID: 23117565; PMID: 35078977). This protocol was favoured due to the lack of information about HT2-280 expression and distribution in COPD lungs.
      • The sort quality for each sample was assessed by the FACS analysis (back sorting) of the sorted cells, where we observed 95-97% purity (EpCAMpos/PDPNlow, __ 1G __shown below). In addition, we validated the sorting protocol and high AT2 enrichment from both no COPD and COPD tissues by immunostaining the FACS-sorted cells with HT2-280, an AT2 marker widely used in the field (strategy suggested by the reviewer) and observed that close to 100% of cells were positive for this marker (__Fig. 1H __shown below). However, we could not do it retrospectively for those patients, where we didn't have enough material. Sorting primary AT2 from small tissue pieces is challenging, and we need at least 20.000 cells to obtain high-quality methylation & RNA-seq data.
      • AT2 marker genes (ABCA3, LPCAT1, LAMP3 and the surfactant genes SFTPA2, SFTPB and SFTPC) were among the top highly expressed genes in our RNA-seq data and were not significantly changed in COPD (please see expression data in __ S2A__ in the manuscript, and below for convenience), as well as Table 6, providing further evidence that the sorted cells carry a strong AT2 transcriptional signature. Fig. 1G* FACS plot examples showing the analysis of sorted AT2 cells (back sorting) from control (blue) and COPD (green) donors displayed over total cell lung suspensions (grey) H Representative IF staining of HT2-280 expression in sorted AT2 cells from no COPD (top) and COPD (bottom) donors. Nuclei (blue) were stained with DAPI, scale bars=20µm __Fig. S2A __Normalized read counts from RNA-seq data for AT2-specific genes in sorted AT2 cells from each donor (dots). Data points represent normalised counts from no COPD (blue), COPD I (light green) and COPD II-IV (dark green). Group median is shown as a black bar. *

      • In agreement with a previous study which profiled bulk AT2 using expression arrays (PMID: 23117565), we also observed upregulation of IFN signaling pathway in COPD AT2s. The enrichment of IFNα/β signature was also observed in COPD in the inflammatory AT2 cluster (iAT2) in a recent scRNA-seq study (PMID: 36108172). As part of the revision, we compared the IFN gene signature identified in our bulk AT2 RNA-seq with a recent scRNA-seq study (published after the submission of our manuscript, PMID: 39147413) that profiled EpCAMpos cells from COPD and non-smoker donor lungs. We observed an upregulation of our IFN signature genes in AT2 in COPD (mostly in AT2c and rbAT2 subsets), suggesting that similar signatures were observed in COPD AT2s in this dataset as well (please see __ S4E-F__ below). ____Figure S4E Expression values for the indicated genes of the IFN pathway from an external scRNA-seq dataset of AT2 cells from COPD patients and healthy controls (Hu et al, 2024). Y-axis shows log-normalized gene expression levels. F. Combined gene set score of the genes shown in (E) in different subsets of AT2 cells from Hu et al, 2024. The IFN signature genes were identified in our integrative analysis of TWGBS and RNA-seq in sorted AT2 cells.

      • We have also carefully examined DNA methylation profiles across all samples. The density plots of our T-WGBS DNA methylation data are very similar among the individual samples in all 3 groups, indicating that the sorted cells consist mostly of a single cell type, as there are no obvious intermediate (25-75%) methylation peaks, as observed in cell mixtures ( 2A and the panel below). No reference DNA methylation profiles are available for respiratory or terminal bronchial cells; hence, we cannot compare how epigenetically different these cells would be from AT2 nor perform a deconvolution for potential minor contamination with distal airway cells. *Figure: DNA methylation density plots of sorted EpCAMpos/PDPNneg cells from no COPD (blue, n=3), COPD I (light green, n=3) and COPD II-IV (dark green, n=5) showing a homogeneous methylation pattern and low abundance at intermediate (25%-75%) methylation values across all profiled samples, indicating that the sorted cells were mostly of a single cell type. *

      • We have now added a sentence to the limitations section of the discussion to cover that point specifically. CHANGES IN THE MANUSCRIPT:

      AT2 cells were isolated by fluorescence-activated cell sorting (FACS) from cryopreserved distal lung parenchyma, depleted of visible airways and vessels of three no COPD controls, three COPD I and five COPD II-IV patients as previously described (24, 52, 53)

      The isolated cells were positive for HT2-280, a known AT2 marker (54)*, as confirmed by immunofluorescence (Fig. 1H), validating the identity and high enrichment of the isolated AT2 populations. ** *

      *Known AT2-specific genes, including ABCA3, LAMP3 and surfactant genes (SFTPA2, SFTPB and SFTPC) were among the top highly expressed genes and were not significantly changed in COPD AT2s (Fig. S2A, Table 6), further confirming the AT2-characteristic transcriptional signature of our isolated cells. *

      However, 5-AZA is a global demethylating agent, and the observed effects may not be direct. To validate the epigenetic regulation of central AT2 pathways further, we took advantage of locus-specific epigenetic editing technology *(73). We focused on the IFN pathway because it was the most significantly enriched Gene Ontology (GO) term in our integrative analysis of TWGBS and RNA-seq data. Several IFN pathway members had associated hypomethylated DMRs within promoter-proximal regions and concomitant increased gene expression (Fig. 4C and S2C). Additionally, we confirmed the elevated expression of IFN-related genes with associated DMRs identified in our study in AT2 cells and AT2 cell subclusters from a recently published scRNA-seq cohort (74) (Fig. S4E-F). *

      We observed upregulation of multiple IFN genes in AT2 in COPD, consistent with a previous expression array study (24). IFNα/β signaling was also enriched in COPD patients in the inflammatory AT2 cluster (iAT2) in a recent scRNA-seq study (84) and our INF signature genes were also upregulated in AT2c and AT2rb subsets in COPD, identified by another scRNA-seq study recently (74)*. ** *

      Finally, despite careful removal of airways from distal lung tissue using a dissecting microscope, we cannot exclude the presence of some terminal/respiratory bronchiole cells in our FACS-isolated EpCAMpos/PDPNlow population. Recent scRNA-seq studies provided an unprecedented resolution and identified several epithelial subpopulations and transitional cells residing in the terminal/respiratory bronchioles and alveoli, including respiratory airway secretory cells (93), terminal airway-enriched secretory cells (28), terminal bronchiole-specific alveolar type-0 (AT0) (70), and emphysema-specific AT2 cells (74). These cells may contribute to alveolar repair in healthy and COPD lungs; however, with our bulk DNA methylation and RNA-seq study, we are unable to resolve all these subpopulations. Future development of single-cell methylation and non-reference-based algorithms for DNA methylation deconvolution will enable deeper epigenetic phenotyping of specific AT2 and bronchiolar cell subsets.

      (Methods) Validation of IFN gene upregulation in a published scRNA-seq dataset

      scRNA-seq data from (74), generously provided by M. Köningshoff, were processed using the default Seurat workflow (117). Expression of IFN-related genes was extracted and plotted as log-normalised gene expression levels in AT2 cells from control and COPD donors. Seurat's AddModuleScore() function was used to compute a gene set score for a custom IFN program using the genes listed in __Fig. S4E __and to analyse the IFN gene set scores in AT2 cell subclusters identified in (74). Briefly, average gene expression scores were computed for the gene set of interest, and the expression of control features (randomly selected) was subtracted as described in (118).

      Fig. S4E and F: E. Expression values for the indicated genes of the IFN pathway from an external scRNA-seq dataset of AT2 cells from COPD patients and healthy controls (74). Y-axis shows log-normalized gene expression levels. F. Combined gene set score of the genes shown in (E) in different subsets of AT2 cells from (74). The IFN signature genes were identified in our integrative analysis of TWGBS and RNA-seq in sorted AT2 cells.

      • The overrepresentation of several keratins (KRT5, KRT14, KRT16, KRT17), mucins (MUC12, MUC13, MUC16, MUC20) and the transcription factor FoxJ1 is now attributed by the authors to a possible dysregulation of AT2 identity and differentiation in COPD (lines 282 - 284) where they cite refs 28, 69, 70. Authors try to support this with IF double stains for KRT5 and HT-280 to identify co-expression of KRT5 and HT2-280 in lung tissue (Figure S2H). However, the evidence for the co-expression of both markers could be presented more convincingly.

      __Response: __

      We found the potential co-expression of airway and alveolar markers in COPD lungs interesting and hence included it in the original manuscript. The initial discovery came from our bulk RNA-seq data, where we observed upregulation of several genes typically found in more proximal airways in COPD (mentioned above by the reviewer). Of note, some of them (e.g., FoxJ1) are expressed at very low levels. Following reviewer's comments, to validate possible colocalization of AT2 and airway markers on protein level, we performed further IF analysis. We took Z-stack images to demonstrate the co-localization of HT2-280 and Krt5 more convincingly and co-stained the same tissue regions with SCGB3A2 (a TASC/distal airway cell marker, PMID 36796082). Even though these are rare events, we were able to reproduce the existence of HT2-280/Krt5 positive, SCGB3A2 negative cells in the alveoli of COPD patients on the protein level (__Fig. S2H __and panels below). Although interesting, we decided to keep this finding in the supplement and did not include it in the discussion to focus the story on the epigenetic regulation of the IFN pathway, which is the main discovery of our study. We will investigate this observation in future studies.

      Figure S2H and here: Examples of HT2-280/Krt5 double positive cells. Top, immunofluorescence staining of the alveolar region of a COPD II donor showing the existence of AT2 cells (HT2-280 positive (red), which are SCGB3A2 negative (green, left) but KRT5 positive (green, right). In conclusion, double-positive HT2-280/KRT5 cells are rare but present in the alveoli of COPD patients. Magnification: 20x. Scale bar: 50 µm. Bottom, Z-stack images highlighting HT2-280 (red) and KRT5 (green) double-positive cells at 63x magnification. Scale bar: 5 µm.

      CHANGES IN THE MANUSCRIPT:

      In addition, we observed an upregulation of several keratins (KRT5, KRT14, KRT16, KRT17) and mucins (MUC12, MUC13, MUC16, MUC20), suggesting a potential dysregulation of alveolar epithelial cell differentiation programs in COPD (Table 6, Fig. S2F). Immunofluorescence staining confirmed the presence of KRT5-positive cells in the distal lung in COPD and identified cells positive for both KRT5 and HT2-280 (Fig. S2H). Collectively, these results indicate a dysregulation of stemness and identity in the alveolar epithelial cells in COPD.

      Fig. S2H legend: The zoomed-in panel (right corner, bottom) demonstrates the presence of rare HT2-280/KRT5 double-positive cells in the alveoli of COPD patients.* Slides were counterstained with DAPI, scale bars = 50µm, 20µm or 5µm, as displayed in images. *

      • Double staining for KRT5 and HT2-280 did highlight the proximity of both cell types in lung tissue, underscoring the challenge of removing airways (including the smaller and terminal bronchi) from the tissue. In addition, HT-280/KRT5 co-expression is not consistent with recent studies from refs 28, 69, 70 where other markers for distal airway cell transition, such as SCGB3A2 and BPIFB1, have been demonstrated, which were not investigated in this study.

      Response:

      We provided a general overview of the different signatures observed in our data, but we could not validate every deregulated pathway or gene. We include the relevant tables detailing all differentially expressed genes and differentially methylated regions to enable and encourage the community to follow up on the data in subsequent studies.

      As demonstrated above, we detect the co-occurrence of HT2-280/KRT5 staining on the protein level in the same cells in the alveoli of COPD patients. We would like to emphasize that alveolar epithelial cell identity in CODP lungs has not been investigated in detail on the protein or RNA level, and HT2-280/KRT5 co-expression/co-localization has not been directly tested in the studies mentioned by the reviewer since, among other reasons, the gene encoding HT2-280 has not been identified. Notably, a recent study (published after the submission of our manuscript) focusing on enriched epithelial cells from the distal lungs of COPD patients (PMID 35078977), identified an emphysema-specific AT2 subtype co-expressing the AT2 marker SFTPC and distal airway cell transition marker SCGB3A2, indicating that disease-specific AT2 populations with possible co-occurrence of AT2 and airway markers exist. In our dataset, SCGB3A2 was not deregulated (log2 fold change=0.22, adj p-value= 0.47), as shown in Table 6, and the HT2-280/Krt5 positive cells were negative for SCGB3A2 in our IF staining (see above).

      BPIFB1 is one of the antimicrobial peptides genes with an associated DMR and is significantly upregulated in COPD cells in our study (log2 fold change=1.17, adj p-value=0.0016), as shown in the supplementary figure Fig S4C and here below for convenience.

      Figure S4C Fold-change in gene expression of BPIFB1 in AT2 cells in COPD (RNA-seq) and A549 cells treated with 0.5µM AZA (RT-qPCR) compared to control samples. Left, RNA-seq data from AT2 cells (no COPD, blue, n=3; COPD II-IV, green, n=5). Right, A549 treated with AZA (orange, n=3) compared to control DMSO-treated cells (grey, n=3). The group median is shown as a black bar.

      • The small (and not evenly divided) sample size of both COPD and non-COPD specimens may lead to a higher risk for false positive results as adjustments for multiple testing typically rely on the number of comparisons, and small sample sizes may not provide enough data points to adequately control for this.

      __Response: __

      We acknowledge the problem of testing for multiple traits with relatively small numbers of samples. The availability of donor tissue, especially from non-COPD and COPD-I donors, was limited, and we applied very strict donor matching and quality control criteria for sample inclusion to avoid additional variability and confounding factors. The importance of strict quality control in selecting appropriate control samples was highlighted in our previous study (PMID: 33630765), where we demonstrated that approximately 50% of distal lung tissue from cancer patients with normal spirometry has pathological changes. Hence, we believe that the quality of the tissue was paramount to the reliability of the data. Strict quality control and sample matching for multiple parameters, including age, BMI, smoking status and smoking history (critical for DNA methylation studies), and cancer type (for background tissue), is a key strength of our approach, but it inevitably limited our sample size.

      First, all samples were cryopreserved and then processed in parallel in groups of 1 non-COPD and 2-3 COPD samples. This process included tissue dissociation, FACS sorting, back sorting (always), and immunofluorescence staining (when enough material was available). Cell pellets were stored at -80{degree sign}C until the entire cohort was ready for sequencing. This was done to limit the potential variation introduced by processing and sorting. RNA and DNA isolations were performed in parallel for all the sorted cell pellets, which were then sequenced as a single batch.

      During data analysis, we applied stringent cutoffs for DMR detection to reduce the risk of false positives due to multiple comparisons and a small sample size. Specifically, we filtered for regions with at least 10% methylation difference and containing at least 3 CpGs. Additionally, we applied a non-parametric Wilcoxon test using average DMR methylation levels to remove potentially false-positive regions, as the t-statistic is not well suited for non-normally distributed values, as expected at very low/high (close to 0% / 100%) methylation levels. A significance level of 0.1 has been used. Therefore, we are confident that the rigorous analysis and strict criteria applied in this study allowed us to detect trustworthy DMRs that we could further functionally validate using epigenetic editing. All the details of the DMR analysis are provided in the methods section. To address this point and limitation, we have added the following paragraphs in the discussion section of the manuscript:

      CHANGE IN THE MANUSCRIPT:

      *The strengths of our study include the use of purified human alveolar type 2 epithelial progenitor cells from a well-matched and carefully validated cohort of human samples, including mild and severe COPD patients, providing high relevance to human COPD. *

      However, we acknowledge several limitations of our study that warrant further investigation. First, the sample size was small. The use of strict quality criteria for donor selection limited the available samples, particularly for the ex-smoker control group. This resulted in an unequal distribution of COPD and control samples. This impacts the power of statistical analysis, particularly in the WGBS analysis, where millions of regions genome-wide are tested. Nevertheless, the clear negative correlation between promoter methylation and corresponding gene expression highlights the robustness of the DMR selection. Additionally, we were able to experimentally validate interferon-associated DMRs using epigenetic editing, highlighting the power of integrated epigenetic profiling in identifying disease-relevant regulators.

      __Minor suggestions for improvement __

      __Introduction __ • In general, refer to the actual experimental studies rather than review papers where appropriate.

      Response:

      We have now carefully checked all the references and amended them to refer to experimental studies when required.

      • Clearly specify whether a study was conducted in mice or humans, as this distinction is crucial for understanding the relevance of the findings to COPD.

      __Response: __

      All our experiments were performed with human lung cells and tissues. No mouse samples were used. As suggested, we have now clearly stated that our study was performed using human tissue samples and cells in different parts of the manuscript, including the discussion, where we now explicitly highlight the strengths and limitations of our study.

      CHANGES IN THE MANUSCRIPT:

      ...we generated whole-genome DNA methylation and transcriptome maps of sorted human primary alveolar type 2 cells (AT2) at different disease stages.

      However, the regulatory circuits that drive aberrant gene expression programs in human AT2 cells in COPD are poorly understood

      Therefore, we set out to profile DNA methylation of human AT2 cells at single CpG-resolution across COPD stages.

      ...*suggesting that aberrant epigenetic changes may drive COPD phenotypes in human AT2. *

      To identify genome-wide DNA methylation changes associated with COPD in purified human AT2 cells...

      The similarity of the methylation and gene expression profiles in the PCAs suggested that epigenetic and transcriptomic changes in human AT2 cells during COPD might be interrelated ...

      *In this work, we demonstrate that genome-wide DNA methylation changes occurring in human AT2 cells may drive COPD pathology by dysregulating key pathways that control inflammation, viral immunity and AT2 regeneration. *

      *Using high-resolution epigenetic profiling, we uncovered widespread alterations of the DNA methylation landscape in human AT2 cells in COPD that were associated with global gene expression changes. *

      *Currently, it is unclear how cigarette smoking leads to changes in DNA methylation patterns in human AT2 *

      The strengths of our study include the use of purified human alveolar epithelial progenitor cells from a well-matched and carefully validated cohort of human samples, including mild and severe COPD patients, providing high relevance to human COPD.

      __Methods __ • Line 473, here is meant 3 ex-smoker controls instead of smoker controls?

      __Response: __

      All donors (no COPD and COPD) used in our study are ex-smokers. Matching the samples with regard to smoking status and history is critical for epigenetic studies, as cigarette smoke profoundly affects DNA methylation genome-wide (PMID: 38199042, PMID: 27651444). This has now been clarified in the revised manuscript.

      CHANGE IN THE MANUSCRIPT____:

      Of note, we included only ex-smokers in our profiling to avoid acute smoking-induced inflammation as a confounding factor (50)*. *

      Importantly, we matched the smoking status and smoking history of all donors, which is key in epigenetic studies, as cigarette smoking profoundly impacts the DNA methylation landscape of tissues (96).

      In total, 3 ex-smoker controls (no COPD), 3 mild COPD donors ex-smokers (GOLD I, COPD I) and 5 moderate-to-severe COPD donors ex-smokers (GOLD II-IV, COPD II-IV) were profiled (Fig. 1A-C, Table 1)

      __Discussion __ • A list of limitation should be added to the discussion. One is the use of the alveolar cell line A549, which produces mucus, a characteristic more commonly associated with bronchial epithelial cells. (ref 43)l530:

      __Response: __

      The profiling was performed using purified primary human alveolar epithelial progenitor cells. For technical reasons, A549 cells were only used for validation of the results using epigenetic editing. The A549 phenotype depends on the growth medium used, in our case, Ham's F-12 medium, which is recommended for long-term A549 culture and promotes multilamellar body formation and differentiation toward an AT2-like phenotype (PMID: 27792742)__. __We are developing epigenetic editing technology for use in primary lung cells; however, the approach currently relies on the high efficiency of transient transfections, which cannot yet be achieved with primary adult AT2 cells. We were positively surprised by how well the methylation data obtained from patient AT2s translated into mechanistic insights when using A549 cells, despite being a cancer cell line. This suggests that the fundamental mechanisms of epigenetic regulation of IRF9 and the IFN signaling pathway are conserved between A549 and primary AT2 cells.

      • Another limitation to consider is that cells were isolated primarily from individuals with lung cancer, except for patients with COPD stage IV. In particular as COPD stage II and IV samples were taken together. And discuss the small and unevenly divided sample size

      __Response: __

      We thank the reviewer for bringing up this important point, which we carefully considered when designing our study. To match our samples across the cohort, all the no-COPD, COPD I, and two of the COPD II-IV samples were obtained from cancer resections. In addition to other characteristics, like age, BMI and smoking status, we also matched the donors by cancer type (all profiled donors had squamous cell carcinoma). We collected lung tissue as far away from the carcinoma as possible and sent representative pieces for histological analysis by an experienced lung pathologist to confirm the absence of visible tumours. In addition, to ensure that our data represents COPD-relevant signatures, we intentionally included samples from three COPD donors undergoing lung resections (without a cancer background) in the profiling.

      Following the reviewer's suggestion, to investigate the potential impact of non-cancer samples on driving the observed differences, we carefully checked the PCAs for both DNA methylation and RNA-seq. We could not identify a clear separation of no-cancer COPD samples from the cancer COPD samples (or other cancer samples) in any examined PCs, indicating no cofounding effect of cancer background in the samples. We observed that one sample contributing to PC2 is a non-cancer sample, but this was a rather sample-specific effect, as the other two non-cancer samples clustered together with the other severe COPD samples with a cancer background. Notably, in our DNA methylation data, we do not observe typical features of cancer methylomes, like global loss of DNA methylation or aberrant methylation of CpG islands (e.g., in tumour suppressor genes) (see Fig 2A), further suggesting that we do not "pick up" confounding cancer signatures in our data.

      Following the comments from both reviewers, to clarify that point, we added the information about cancer and non-cancer samples to the PCA figures for DNA methylation (new Fig. 2B) and RNA-seq (new Fig. 3A) data in the revised manuscript, as shown below

      CHANGE IN THE MANUSCRIPT____:

      COPD samples from donors with a cancer background clustered together with the COPD samples from lung resections, confirming that we detected COPD-relevant signatures (Fig. 2B).

      Fig.2B* Principal component analysis (PCA) of methylation levels at CpG sites with > 4-fold coverage in all samples. COPD I and COPD II-IV samples are represented in light and dark green triangles, respectively, and no COPD samples as blue circles. COPD samples without a cancer background are displayed with a black contour. The percentage indicates the proportion of variance explained by each component. *

      Unsupervised principal component analysis (PCA) on the top 500 variable genes revealed a clear influence of the COPD phenotype in separating no COPD and COPD II-IV samples, as previously observed with the DNA methylation analysis, irrespective of the cancer background of COPD samples (Fig.3A, Fig. S2B).

      *Principal component analysis (PCA) of 500 most variable genes in RNA-seq analysis. PCA 1 and 2 are shown in Fig.3A, PCA 1 and 4 in Fig.S2B. COPD I and COPD II-IV samples are represented in light and dark green triangles, respectively, and no COPD samples as blue circles. COPD samples without a cancer background are displayed with a black contour. The percentage indicates the proportion of variance explained by each component. *

      __Response: __

      We thank the reviewer for suggestions on how to improve the discussion of our manuscript. We have now added a strength/limitation section to our discussion and included the points suggested by both reviewers.

      CHANGE IN THE MANUSCRIPT____:

      The strengths of our study include the use of purified human alveolar epithelial progenitor cells from a well-matched and carefully validated cohort of human samples, including mild and severe COPD patients, providing high relevance to human COPD. Importantly, we matched the smoking status and smoking history of all donors, which is key in epigenetic studies, as cigarette smoking profoundly impacts the DNA methylation landscape of tissues (96). With the first genome-wide high-resolution methylation profiles of isolated cells across COPD stages, we offer novel insights into the epigenetic regulation of gene expression in epithelial progenitor cells in COPD, expanding our understanding of how alterations in regulatory regions and specific genes could contribute to disease development. We identified IRF9 as a key IFN transcription factor regulated by DNA methylation. Notably, by targeting IRF9 through epigenetic modifications, we modulated the activity of the IFN pathway, which plays a crucial role in the immune response and lung tissue regeneration. Epigenetic editing techniques could offer a novel therapeutic strategy for COPD by downregulating IFN pathway activation and promoting the regeneration of epithelial progenitor cells in the lungs. Further preclinical and clinical studies are needed to validate the efficacy and safety of epigenetic editing approaches in COPD treatment (33)*. *

      *However, we acknowledge several limitations to our study that warrant further investigation. First is the small sample size and replication difficulty due to the lack of available data, common challenges for studies working with sparse human material and hard-to-purify cell populations. The use of strict quality criteria in donor selection limited the available samples, especially for the ex-smoker control group, leading to an unequal distribution of COPD and control samples. Overall, this impacts the power of statistical analysis, especially in the WGBS analysis, where millions of regions genome-wide are tested. Nevertheless, the clear negative correlation of promoter methylation to the corresponding gene expression highlights the robustness of the DMR selection. Furthermore, we could experimentally validate interferon-associated DMRs using epigenetic editing, highlighting the power of integrated epigenetic profiling for the discovery of disease-relevant regulators. *

      Overall, we detected a higher number of correlated DMR-DEG associations using our simple promoter-proximal linkage compared to the GeneHancer approach. Assigning enhancers to their target genes with high confidence is a complex and challenging task. Enhancers are often located far from the genes they regulate and can interact with their target genes through three-dimensional chromatin loops. Furthermore, enhancers can operate in a highly context-dependent manner, with the same enhancer regulating different genes depending on the cell type, developmental stage, or environmental signals. Determining which enhancer is active under specific conditions remains a hurdle in the field, especially since the AT2-specific chromatin profiles of enhancer marks are not yet available.

      In addition, while WGBS provides unprecedented resolution and high coverage of the DNA methylation sites across the genome, it does not allow distinguishing 5-methylcytosine from 5-hydroxymethylcytosine. Therefore, we cannot exclude that some methylated sites we detected are 5-hydroxymethylated. However, as 5-hydroxymethylcytosine is present at very low levels in the lung tissue (97)*, its effect is likely marginal. *

      Finally, despite careful removal of airways from distal lung tissue using a dissecting microscope, we cannot exclude the presence of some terminal/respiratory bronchiole cells in our FACS-isolated EpCAMpos/PDPNlow population. Recent scRNA-seq studies provided an unprecedented resolution and identified several epithelial subpopulations and transitional cells residing in the terminal/respiratory bronchioles and alveoli, including respiratory airway secretory cells (93), terminal airway-enriched secretory cells (28), terminal bronchiole-specific alveolar type-0 (AT0) (70), and emphysema-specific AT2 cells (74). These cells may contribute to alveolar repair in healthy and COPD lungs; however, with our bulk DNA methylation and RNA-seq study, we are unable to resolve all these subpopulations. Future development of single-cell methylation and non-reference-based algorithms for DNA methylation deconvolution will enable deeper epigenetic phenotyping of specific AT2 and bronchiolar cell subsets.

      __References __ • Check references. For instance, there is no reference in the text to ref 43.

      • Align format of references

      __Response: __

      We thank the reviewer for spotting this inconsistency. We have carefully checked and aligned the format of all references. The (old) reference 43 is now mentioned in the discussion part.

      __Reviewer #1 (Significance (Required)): __

      The strength of this study lies in its focus on the molecular mechanisms underlying the impaired regeneration of epithelial progenitor cells in COPD. The discovery of IRF9, which regulates IFN signaling and is prominently upregulated in COPD, together with the convincing validation of the epigenetic control of the IFN pathway by targeted DNA demethylation of the IRF9 gene, adds significant value to the COPD research field.

      Main limitations of the study are the relatively small sample size of both COPD and non-COPD specimens and the claim that the sorted EpCAMpos/PDPNlow cells primarily consisted of AT2 cells.

      __- Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. __

      The nature and significance of the advance in epigenetic editing of IRF9 in COPD can be described as both conceptual and potentially clinical:

      Conceptual Advance: The epigenetic editing of IRF9 enhances our understanding of the molecular mechanisms underlying COPD pathogenesis. By targeting IRF9 through epigenetic modifications, researchers were able to modulate the activity of the IFN pathway, which plays a crucial role in the immune response and lung tissue regeneration. This approach offers insights into the epigenetic regulation of gene expression in epithelial progenitor cells in COPD and expands our understanding of how alterations in specific gene methylation could contribute to disease progression.

      Clinical Significance: The potential clinical significance of epigenetic editing of IRF9 lies in its implications for COPD therapy. If successful, epigenetic editing techniques could offer a novel therapeutic strategy for COPD by downregulating IFN pathway activation and promoting regeneration of epithelial progenitor cells in the lungs. Obviously, further preclinical and clinical studies are needed to validate the efficacy and safety of epigenetic editing approaches in COPD treatment.

      __Response: __We thank the reviewer for recognising the importance of our study, its conceptual advance and potential clinical significance. We are pleased to see that the reviewer highlights the promise of epigenetic editing in both furthering our basic understanding of molecular mechanisms of chronic diseases and its future potential as a therapeutic strategy.

      __- Place the work in the context of the existing literature (provide references, where appropriate). __ Few experimental papers have been published on epigenetic editing in lung diseases, with limited research available beyond the study referenced in citation 43. Song J, Cano-Rodriquez D, Winkle M, Gjaltema RA, Goubert D, Jurkowski TP, Heijink IH, Rots MG, Hylkema MN. Targeted epigenetic editing of SPDEF reduces mucus production in lung epithelial cells. Am J Physiol Lung Cell Mol Physiol. 2017 Mar 1;312(3):L334-L347. doi: 10.1152/ajplung.00059.2016. Epub 2016 Dec 23. PMID: 28011616.

      Response:

      We thank the reviewer for recognising the uniqueness and novelty of our study and the lack of research on the functional understanding of DNA methylation in the context of lung and lung diseases.

      - State what audience might be interested in and influenced by the reported findings.

      This study is of broad interest to researchers investigating the pathogenesis and treatment of COPD.

      __- Define your field of expertise with a few keywords to help the authors contextualize your point of view. __

      Expertise in: Lung pathology, Immunology, COPD, Epigenetics

      - Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Less expertise in: Epigenetic Editing

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      __Summary: __

      This study aim to understand the molecular mechanisms underlying dysfunction in AT2 cells in COPD, by profiling bulk genome wide DNA methylation using Tagmentation-based whole-genome bisulfite sequencing (T-WGBS) and RNA sequencing in selectively sorted primary AT2 cells. The study stands out in it's sequencing breadth and use of an incredibly difficult cell population, and has the potential to add substantially to our mechanistic understanding of epigenetic contributions to COPD. A further highlight is the concluding aspect of the study where the authors undertook targeted modification of specific CpG methylation, provided direct, site-specific evidence for transcriptional regulation by CpG methylation.

      Response:

      We thank the reviewer for recognizing the conceptual and methodological advance of our study and for noting the value of our functional mechanistic approach.

      __Major comments: __

      The authors clearly show that there is DNA methylation alteration in AT2 cells from COPD individuals that links functional to gene expression at some level. However, I think the statement "to identify genome-wide changes associated with COPD development and progression..." and similar other references to disease development understanding is not accurate given the DNA methylation primary comparison is between control and moderate to severe COPD, with no temporal detail or evidence that they drive progression rather than are a result of COPD development. The paragraph starting on line 186 where this is a addressed to some extent is quite vague and doesn't really provide confidence that DNAm dysregulation occurs at an early stage in this context. This can be addressed by changing the focus/style of the text.

      __Response: __

      Thank you for raising this point. We agree with the reviewer that our cross-sectional study describes the association of methylation changes with either COPD I or more established disease (COPD II-IV) and that the observed changes may be either the driver or a result of COPD development. This has been clarified in the revised manuscript, and we removed the statements about disease initiation and progression. This is an important point; hence, we added an extra line to the discussion to make that clear.

      __CHANGE IN THE MANUSCRIPT____: __

      Therefore, we set out to profile DNA methylation of human AT2 cells at single CpG-resolution across COPD stages to identify epigenetic changes associated with disease and combine this with RNA-seq expression profiles.

      To identify epigenetic changes associated with COPD, we collected lung tissue from patients with different stages of COPD,

      ....to identify methylation changes associated with mild disease, we included TWGBS data from AT2 isolated from COPD I patients (n=3) in the analysis.

      Currently, we do not know whether the identified DNA methylation changes are the cause or the consequence of the disease process and not much is known about the correlation of DNA methylation with disease severity.

      *However, our study is cross-sectional, our cohort included only 3 COPD I donors, and we did not have any follow-up data on the patients, so future large-scale profiling of mild disease (or even pre-COPD cohorts) in an extended patient cohort will be crucial for a better understanding of early disease and its progression trajectories. *

      __Results comments and suggestions: __

      For the integrated analysis, there is a focus on DMRs in promoters with very little analysis on other regions. The paragraph starting on line 317 describes some analysis on enhancers but is very brief, doesn't include information on how many/which DMRs were included, making it hard to interpret the impact of the 147 DMRs and 93 genes identified - is this nearly all DMRs and genes analysed or very few? A comparison to the promoter analysis would be of interest. Especially as the targeted region followed up with lovely functional assessment in the last sections is a gene body DMR, not a promoter DMR.

      __Response: __

      We thank the reviewer for pointing out the importance of changes in enhancers. We agree that extending the enhancer analysis is very interesting. However, assigning enhancers to their target genes with high confidence is a complex and challenging task. Enhancers are often located far from the gene they regulate, sometimes spanning hundreds of kilobases. They can interact with their target genes through three-dimensional chromatin loops, potentially bypassing nearby genes to activate more distant ones, making it difficult to confidently link specific enhancers to their target genes. Furthermore, enhancers can operate in a highly context-dependent manner. The same enhancer can regulate different genes depending on the cell type, developmental stage, or environmental signals. Another challenge is that enhancers often work in clusters or "enhancer landscapes," where multiple enhancers contribute to the regulation of a single gene. Disentangling the contribution of individual enhancers within such clusters and determining which enhancer is active under specific conditions remains an ongoing hurdle in the field, especially since the AT2-specific chromatin profiles of enhancer marks are not yet available.

      One approach we tried to account for more distal regulatory regions was to assign DMRs to the nearest gene with a maximum distance of up to 100 kb using GREAT (Genomic Regions Enrichment of Annotations Tool) and simultaneously perform gene enrichment analysis of the associated genes. The old Figure S1C (now S1D) shows the top 10 enriched terms of either hyper- or hypomethylated DMRs, and Table 4 shows the full list of enriched terms. However, in this analysis, we did not integrate the results of the RNA-seq analysis. To demonstrate that we can correlate methylation with gene expression associations in this analysis, we then took a closer look at the WNT/b-catenin pathway, which contains 147 DMRs associated with 93 genes from the respective pathway (old Figure S3D, now S3G). Here, we showed that distal DMRs up to 100 kb away from the TSS show a high correlation with gene expression. We are including the two figures below for convenience:

      *Left panels, functional annotation of genes located next to hypermethylated (top) and hypomethylated (bottom) DMRs using GREAT. Hits were sorted according to the binominal adjusted p-value and the top 10 hits are shown. The adjusted p-value is indicated by the color code and the number of DMR associated genes is indicated by the node size. Right panel, scatter plot showing distal DMR-DEG pairs associated with Wnt-signaling. Pairs were extracted from GREAT analysis (hypermethylated, DMR-DEG distance Following the reviewer's suggestion, we have now extended the enhancer analysis using the GeneHancer database, the most comprehensive, integrated resource of enhancer/promoter-gene associations. We used the GeneHancer version 5.14, which annotates 392,372 regulatory genomic elements (GeneHancer element) on the hg19 reference genome. Of the 25,028 DMRs, 18,289 DMRs (73% of all DMRs) coincided with at least one GeneHancer element, resulting in 19,661 DMR-GeneHancer associations. Next, we extracted the GeneHancer elements associated with protein-coding or long-non-coding RNAs genes, which left us with 2,144 DMR-GeneHancer associations. Next, we used only high-scoring gene GeneHancer associations ("Elite"), leaving 1,485 DMR-GeneHancer associations. Of those, we selected the GeneHancer elements, which are linked to genes differentially expressed in our RNA-seq analysis resulting in a final table of 376 DMR-GeneHancer associations (Table 9 DMR_DEG_GeneHancer, Tab 2). Similar to the promoter-proximal analysis, we analysed the correlation of expression and methylation changes of the DMR-GeneHancer associations, demonstrating a high number of negatively and positively correlated events (Fig.S3D). Finally, we performed the gene enrichment analysis for positively and negatively correlating genes. We detected significant GO term enrichments only for negatively correlating genes (Fig.S3E and Table 10_Enrichment_results, Tab2).

      CHANGE IN THE MANUSCRIPT

      To harness the full resolution of our whole-genome DNA methylation data, we extended the analysis beyond promoter-proximal regions and assessed how epigenetic changes in distal regulatory regions (enhancers) may relate to transcriptional differences in COPD. As the assignment of enhancer elements to the corresponding genes is challenging, we tried two different approaches. First, we used the GeneHancer database (72) to link DMRs to regulatory genomic elements (GeneHancer element). Of the 25,028 DMRs, 18,289 DMRs (73%) coincided with at least one GeneHancer element. Of those 2,144 DMR-GeneHancer associations were linked either to protein-coding or lncRNA genes. Next, we filtered for high-scoring gene GeneHancer associations ("Elite"), leaving 1,485 DMR-GeneHancer Elite associations. Of those, we selected the GeneHancer elements, which are linked to genes differentially expressed in our RNA-seq analysis, resulting in 376 DMR-GeneHancer associations (Table 9). Similar to the promoter-proximal analysis, we assessed the correlation of expression and methylation changes of the DMR-GeneHancer associations, demonstrating a high proportion of negatively and positively correlated events (Fig. S3E). Finally, we performed gene enrichment analysis for positively and negatively correlated genes. We detected significant GO term enrichments for negatively correlating genes only (Fig. S3F and Table 10), with the most pronounced term "regulation of tumor necrosis factor". In an alternative approach, we linked proximal and distal (within 100 kb from TSS) DMRs to the next gene using GREAT (57) (Fig S1C, Table 4) *and calculated Spearman correlation between DMRs and associated DEGs__. 147 DMRs were associated with high correlation rates with 93 genes from the WNT/β-catenin pathway (Fig. S3G)__, suggesting that DNA methylation may also drive the expression of genes of the WNT/β-catenin family. *

      Figure S3E and F: E. Spearman correlation between gene expression and DMR methylation of DMRs assigned to gene regulatory elements using the GeneHancer database. F. GO-Term over-representation analysis of DEGs negatively correlated to DMRs in gene regulatory elements. The adjusted p-value is indicated by the color code and the percentage number of associated DEGs is indicated by the node size.

      (Methods) For enhancer analysis, the GeneHancer database version 5.14, which annotates 392,372 regulatory genomic elements (GeneHancer element) on the hg19 reference genome, was used (72). Of the 25,028 DMRs 18,289 DMRs coincided with at least one GeneHancer element, resulting in 19,661 DMR-GeneHancer associations. Next, the GeneHancer elements were filtered for association with protein-coding or long-non-coding RNAs genes and high-scoring gene GeneHancer associations ("Elite"), leaving 1,485 DMR-GeneHancer associations. Of those, the GeneHancer elements were selected, which are linked to differentially expressed genes in COPD resulting in a final table of 376 DMR-GeneHancer associations. Similar to the promoter-proximal analysis, the Spearman correlation of expression and methylation changes of the DMR-GeneHancer associations was assessed. GO gene enrichment analysis for positively and negatively correlating genes was done using Metascape (111).

      A comparison to the promoter analysis would be of interest.

      Response:

      We detected more highly correlated (|correlation coefficient| > 0.5) DMR-DEG associations using our simple promoter proximal linkage (n=643) in comparison with the GeneHancer approach comprising annotated enhancer elements (n=327/2,144). Gene enrichment results pointed to the interferon pathway, which we could confirm using epigenetic editing. This pathway was not present in the GeneHancer analysis, indicating that regulation of the IFN pathway may be controlled by proximal elements.

      CHANGE IN THE MANUSCRIPT____:

      Overall, we detected a higher number of correlated DMR-DEG associations using our simple promoter-proximal linkage compared to the GeneHancer approach. Assigning enhancers to their target genes with high confidence is a complex and challenging task. Enhancers are often located far from the genes they regulate and can interact with their target genes through three-dimensional chromatin loops. Furthermore, enhancers can operate in a highly context-dependent manner, with the same enhancer regulating different genes depending on the cell type, developmental stage, or environmental signals. Determining which enhancer is active under specific conditions remains a hurdle in the field, especially since the AT2-specific chromatin profiles of enhancer marks are not yet available.

      Especially as the targeted region followed up with lovely functional assessment in the last sections is a gene body DMR, not a promoter DMR.

      Response:

      We thank the reviewer for bringing up that point. To clarify, we defined the promoter regions for the analysis as regions located {plus minus} 6 kb (upstream and downstream) from the transcriptional start site (TSS). Since the term "promoter" often refers to the region upstream of the transcriptional start site, its use may have been misleading. For clarity, we changed the text correspondingly to __promoter proximal methylation __and explained in the methods how the regions for analysis were defined.

      __CHANGE IN THE MANUSCRIPT____: __

      "DMR association per gene promoter" was changed to "Gene promoter proximal DMRs"

      Fig. S3B: "DMR in promoter" was changed to "promoter proximal DMR(s)"

      "by DNA methylation changes in promoters" was changed to "by DNA methylation changes in promoter proximity"

      "regulated by promoter methylation" was changed to "regulated by promoter-proximal methylation"

      "analysis of the promoter DMRs" was changed to "analysis of the promoter-proximal DMRs"

      "between promoter methylation" was changed to "between promoter proximal methylation"

      Cytoscape was used to analyse negatively or positively correlated DMR DEG pairs. ClueGO (v2.5.6) analysis was conducted using all DEG associated with a promoter proximal DMR (+/- 6 kb from TSS) and the Spearman correlation coefficient 0.5 (112).

      • Lines 299-301 - I'm not sure the graph in Fig S3A support the conclusion that there was a preferential negative relationship between DNAm and gene expression. Looks like there are a substantial number of cases where a positive relationship is observed and this needs to be acknowledged.

      Response:

      In this part, we refer to Fig S3C. In the left panel, downregulated genes clearly show higher counts for the hypermethylated DMRs, whereas the hypomethylated DMRs are enriched at upregulated genes (right panel), indicating a preference for negative correlation: lower methylation, higher gene expression. If there were no preference, we would expect a 50:50 ratio of hypo- and hypermethylated DMRs, and we observed a 77:23 ratio. Nevertheless, we agree that there is a substantial number of cases (n=151) with a high positive correlation, which we now highlight in the text. For clarity, we also modified the figure legend to indicate that a stacked histogram is represented in the panel.

      __CHANGE IN THE MANUSCRIPT____: __

      L303: Interestingly, 23.5% of the identified DMR DEG pairs (n=151) showed a positive correlation between gene expression and DNA methylation.

      *Figure legend in Fig. S3C was changed to: C Stacked histogram showing location of hyper- and hypomethylated DMRs relative to the TSS of DEGs in downregulated (left) and upregulated (right) genes. *

      • Line 307 - what are the "analysed DEGs"? Are they the methylation associated genes?

      Response:

      Those are the DEGs we identified in RNA-seq analysis. To clarify, we changed the text to "identified DEGs".

      __CHANGE IN THE MANUSCRIPT____: __

      • "analysed DEGs" was changed to "identified DEGs"*

      • Line 307-309 - "Among the analyzed DEGs, 76.5% (492) displayed a negative correlation (16.8% of the total DEGs), indicating a possible direct regulation by DNA methylation, while 23.5% (151) showed a positive correlation between gene expression and DNA methylation" - are the authors suggesting the positive correlation doesn't indicate direct regulation?

      __Response: __

      Thank you for highlighting this point. We did not intend to suggest that negative correlation indicates direct regulation, while positive correlation suggests a lack thereof. To clarify that point, we have reformulated this sentence.

      __CHANGE IN THE MANUSCRIPT____: __

      Among the identified DEGs, 76.5% (n=492) displayed a negative correlation (16.8% of the total DEGs), consistent with a repressive role of promoter DNA methylation. Interestingly, 23.5% of the identified DEG (n=151) showed a positive correlation between gene expression and DNA methylation.

      • Line 313 - why did the authors focus on only negatively correlated genes to identify their top dysregulated pathway of IFN signalling? Why not do pathway analysis on the DNAm associated genes separately to identify DNAm associated pathways?

      Response:

      We have also performed a pathway enrichment analysis using the positively correlated genes but did not identify any significantly enriched pathways/process/terms. When we examined the top hit of the gene set enrichment analysis, the interferon signaling pathway, we observed only negatively correlated DMR gene associations (Fig. 5B). Therefore, we decided to use only the negatively correlated DMRs, as using all correlated genes would give a higher background and dilute our results.

      CHANGE IN THE MANUSCRIPT____:

      Cytoscape was used to analyse negatively or positively correlated DMR DEG pairs. ClueGO (v2.5.6) analysis was conducted using all DEG associated with a promoter proximal DMR (+/- 6 kb from TSS) and the Spearman correlation coefficient 0.5 (113).

      • A comparison of the gene expression data with previous data in AT2 cell/single cell data would strengthen the gene expression section.

      __Response: __

      We compared our gene expression signatures with the study of Fujino et al., who profiled sorted AT2 cells (EpCAMhighPDPNlow) from COPD/controls using expression arrays (PMID: 23117565). Consistent with our study, the authors also observed the upregulation of interferon signalling (among other pathways) in COPD AT2s. However, no raw data was available in the published manuscript for a more in-depth analysis.

      Several recent scRNA-seq studies identified transcriptional signatures of COPD and control cells (e.g., PMIDs: 36108172, 35078977, 36796082, 39147413__). However, most studies did not match the smoking status of the control and COPD donors and looked at the whole lung tissue, with limited power to detect gene expression changes in distal alveolar cells. It is difficult to directly compare our data to the gene expression data from non-smokers vs COPD patients, as cigarette smoking profoundly remodels the epigenome and transcriptional signatures of cells. In addition, differences in technologies and depth of sequencing make such comparisons challenging. However, one study (PMID: 36108172) performed scRNA-seq analysis on 3 non-smokers, 4 ex-smokers and 7 COPD ex-smoker lungs. Despite relatively limited coverage of epithelial cells in the dataset (We also compared the main AT2 IFN signature identified in the integration of our DNA methylation in promoter-proximal regions and RNA-seq with a recent study (published after the submission of our manuscript, PMID: 39147413) that profiled EpCAMpos cells from COPD and control lungs (non-smokers) using scRNA-seq. We observed an upregulation of our IFN signature genes in AT2 in COPD (specifically in AT2-c and rbAT2 subsets), suggesting that similar signatures were observed in this dataset as well. However, ex-smokers were not included in this study, making direct comparisons difficult. We have now included the panels shown below as __Figure S4E and S4F:

      Figure S4E and F: Expression values for the indicated genes of the IFN pathway from an external scRNA-seq dataset of AT2 cells from COPD patients and healthy controls (74). Y-axis shows log-normalized gene expression levels. F. Combined gene set score of the genes shown in (E) in different subsets of AT2 cells from (74)*. The IFN signature genes were identified in our integrative analysis of TWGBS and RNA-seq in sorted AT2 cells. *

      CHANGES IN THE MANUSCRIPT:

      However, 5-AZA is a global demethylating agent, and the observed effects may not be direct. To validate the epigenetic regulation of central AT2 pathways further, we took advantage of locus-specific epigenetic editing technology (73). We focused on the IFN pathway because it was the most significantly enriched Gene Ontology (GO) term in our integrative analysis of TWGBS and RNA-seq data. Several IFN pathway members had associated hypomethylated DMRs within promoter-proximal regions and concomitant increased gene expression (Fig. 4C and Fig.S2C). Additionally, we confirmed the elevated expression of IFN-related genes with associated DMRs identified in our study in AT2 cells and AT2 cell subclusters from a recently published scRNA-seq cohort (74)* (Fig. S4E-F). *

      (Methods) Validation of IFN gene upregulation in a published scRNA-seq dataset

      scRNA-seq data from (74), generously provided by M. Köningshoff, were processed using the default Seurat workflow (117). Expression of IFN-related genes was extracted and plotted as log-normalised gene expression levels in AT2 cells from control and COPD donors. Seurat's AddModuleScore() function was used to compute a gene set score for a custom IFN program using the genes listed in __Fig. S4E __and to analyse the IFN gene set scores in AT2 cell subclusters identified in (74). Briefly, average gene expression scores were computed for the gene set of interest, and the expression of control features (randomly selected) was subtracted as described in (118).

      Fig. S4 E and F. E. Expression values for the indicated genes of the IFN pathway from an external scRNA-seq dataset of AT2 cells from COPD patients and healthy controls (74). Y-axis shows log-normalized gene expression levels. F. Combined gene set score of the genes shown in (E) in different subsets of AT2 cells from (74). The IFN signature genes were identified in our integrative analysis of TWGBS and RNA-seq in sorted AT2 cells. __ __

      • The paragraph starting on line 173 feels a little redundant when we know there is RNA available to test if the differential DNAm links to altered gene expression - this selected of example regions/genes would be better placed after the gene expression has been reported, at which point you could say whether the linked genes displayed altered transcription.

      Response:

      The current structure (with DNA methylation, followed by RNA-seq and integration) is intentional and serves several important purposes. As this is the first genome-wide high-resolution COPD DNA methylation study of AT2, we aimed to describe the methylation landscape independently of gene expression (noting the limitation of current understanding of how DNA methylation regulates expression). This early focus on DMRs lays clear groundwork by highlighting potential regulatory elements and pathways that could be disrupted, independent of or even before corroborative transcriptional data. Additionally, positioning these examples early in the narrative helps to frame subsequent gene expression analyses. Once RNA data are introduced later, the reader can directly compare the methylation patterns with transcriptional outcomes, thereby enhancing the overall story. In other words, by first showcasing disease-relevant methylation changes, we underscore a hypothesis that these epigenetic modifications are functionally meaningful. The later integration of gene expression data then serves as a confirmatory or complementary layer, rather than the sole basis for inferring biological significance. This is important as we still do not fully understand the function of DNA methylation outside promoters, and its role is also important for splicing, 3D genome organisation, non-coding RNA regulation, enhancer regulation, etc.

      • Similarly, the TF enrichment analysis is great but maybe would have added value to be done on DNA regions later shown to be linked to differential expression - was there different enrichment at DNA regions that are vs are not associated with altered expression? And could you test in vitro whether changing methylation of DNA (maybe a blunt too like 5-aza would be ok) alters TF binding (cut+run/ChIP?). Furthermore, it would be interesting to understand the TF sensitivity analysis within the context of positive versus negative DNA methylation:gene expression correlations.

      Response:

      As suggested by the reviewer, we now performed the TF enrichment analysis using the DMRs with a high correlation (|correlation coefficient|>0.5) between methylation and expression (Figure S3D) and expanded the method section to include TF analysis. We observed ETS domain motifs enriched at hypomethylated regions. They prefer unmethylated DNA (MethylMinus) and are therefore expected to bind with higher affinity to the respective DMRs in COPD. We agree with the reviewer that further verifying altered TF binding using cut&run or ChIP assays would be very interesting, but it is out of the scope of this manuscript. Such analysis is technically very challenging to perform with low numbers of primary AT2 cells and will be the focus of our follow-up mechanistic studies.

      CHANGE IN THE MANUSCRIPT____:

      Additionally, motif analysis of DMRs that were highly correlated (|Spearman correlation coefficient| > 0.5) with DEGs revealed a prominent enrichment of the cognate motif for ETS family transcription factors, such as ELF5, SPIB, ELF1 and ELF2 at hypomethylated DMRs (Fig. S3D). Interestingly, SPIB was shown to facilitate the recruitment of IRF7, activating interferon signaling (71)*, and our WGBS data uncovers SPIB motifs at hypomethylated DMRs, which aligns with its binding preferences at unmethylated DNA (methyl minus, Fig. S3D). *

      Figure S3D: Enrichment of methylation-sensitive binding motifs at hypo- (right) and hypermethylated (left) DMRs, using DMRs with a high correlation (|Spearman correlation coefficient| > 0.5) between methylation and gene expression. Methylation-sensitive motifs were derived from Yin et al (64). Transcription factors, whose binding affinity is impaired upon methylation of their DNA binding motif, are shown in red (Methyl Minus), and transcription factors, whose binding affinity upon CpG methylation is increased, are shown in blue (Methyl Plus).

      (Methods) To obtain information about methylation-dependent binding for transcription factor motifs which are enriched at DMRs, the results of a recent SELEX study (64)* were integrated into the analysis. They categorised transcription factors based on the binding affinity of their corresponding DNA motif to methylated or unmethylated motifs. Those whose affinity was impaired by methylation were categorised as MethylMinus, while those whose affinity increased were categorised as MethylPlus. A motif database of 1,787 binding motifs with associated methylation dependency was constructed. The log odds detection threshold was calculated for the HOMER motif search as follows. Bases with a probability > 0.7 got a score of log(base probability/0.25); otherwise, the score was set to 0. The final threshold was calculated as the sum of the scores of all bases in the motif. Motif enrichment analysis was carried out against a sampled background of 50,000 random regions with matching GC content using the findMotifsGenome.pl script of the HOMER software suite, omitting CG correction and setting the generated SELEX motifs as the motif database. *

      __Methods: __ • The authors should include more detail of the TWGBS rather than directing the reader to a previous publication. Also DNA concentration post bisulfite conversion would be a useful metric to provide.

      __Response: __

      Following the suggestion, we have now expanded the details of TWGBS in the methods part of the manuscript. Due to limited space, we did not include a detailed protocol but instead referred to a published step-by-step protocol (55). Of note, we do not measure DNA concentration post-bisulfite conversion but consistently use the starting input of 30 ng of genomic DNA across all samples.

      __CHANGE IN THE MANUSCRIPT____: __

      (Methods): 15 pg of unmethylated DNA phage lambda was spiked in as a control for bisulfite conversion. Tagmentation was performed in TAPS buffer using an in-house purified Tn5 assembled with load adapter oligos (55) at 55 {degree sign}C for 8 min. Tagmentation was followed by purification using AMPure beads, oligo replacement and gap repair as described (55). Bisulfite treatment was performed using EZ DNA Methylation kit (Zymo) following the manufacturer's protocol.

      *The T-WGBS library preparations were performed for all donors in parallel and sequenced in a single batch to minimize batch effects and technical variability. *

      • Differential DNA methylation analysis: It is stated that DNA regions had to contain 3 CpG sites but was this within a defined DNA size range?

      Response:

      The maximum distance between individual CpGs within DMR was set to 300 bp. To clarify, we added that information to the methods part.

      __CHANGE IN THE MANUSCRIPT____: __

      *"regions with at least 10% methylation difference and containing at least 3 CpGs with a maximum distance of 300 bp between them. *

      • Refence genome only provided for RNAseq not TWGBS?

      __Response: __We used hg19 as the reference genome. The information on the reference genome for DNA methylation analysis was provided in the methods L574 (original manuscript_: "The reads were aligned to the transformed strands of the hg19 reference genome using BWA MEM")

      • The tables do not appear in the PDF and I struggled to tally to the "Dataset" files provided if that is what they were referring to?

      Response:

      Full tables (uploaded as Datasets in the manuscript central due to their size) were uploaded together with the manuscript files. They are quite large and will not convert to pdf, so they may not have been included in the merged pdf file. We assume that they should be available to the reviewers with the other files and will clarify that with the editorial staff in the resubmission cover letter.

      • For the gene expression analysis, can it be made clearer that a full analysis was done on COPD I samples. It is a little confusing to the reader as this was not done for DNAm so might be assumed the same targeted analysis on only genes found to be differentially expressed between control and COPD II-IV, but that cannot be the case as an overlap of COPD1 vs COPD II-IV genes if provided. For this overlap, do genes show the same effect direction?

      __Response: __

      To clarify, for the RNA-seq analysis, we performed DEG analysis for no-COPD versus COPD II-IV, as well as no-COPD versus COPD I. We then took all differentially expressed genes (presented in the Venn diagram) and plotted them for all samples as a heatmap. To split the genes into groups displaying similar effect directions, we applied a clustering approach and identified 3 main signatures. Cluster 3 primarily comprises genes unique to COPD I samples, which are associated with the adaptive immune system and hemostasis (Fig. 4E). In the other two clusters, we mainly observe a transitioning pattern from control to severe COPD samples, correlating with the FEV1 values of the patients. This has now been clarified in the manuscript.

      • Replication is difficult on these studies as the samples are so difficult to come by. Also limited by sample size for the same reason. It doesn't mean the study is not worth doing and the data are still valuable. However, it may be pertinent to include technical validation of a few regions of interest, acknowledge the limitation (along side strengths) in the discussion, and perhaps provide actual p value rather than blanket Response:

      We thank the reviewer for acknowledging the replication challenges for studies working with sparse human material and hard-to-purify cell populations. Following the reviewer's suggestion, we have now included a strengths and limitations section in the discussion where we summarised the points highlighted by both reviewers.

      Regarding technical validation, we would like to note that the whole genome bisulfite sequencing (WGBS) technology, as well as the tagmentation-based WGBS (T-WGBS), have been validated in the past few years in several publications (e.g., PMID: 24071908) and shown to yield reliable DNA methylation quantification in comparison to other technologies (PMID: 27347756). For us, technical validation using alternative methods (e.g. bisulfite sequencing or pyrosequencing) is difficult as it requires significantly more input DNA than the low-input T-WGBS we have performed and obtaining sufficient amounts of material from primary human AT2 cells (especially from severe COPD) is not possible with the size of tissue we can access. However, while establishing the T-WGBS for this project, we initially validated our approach using Mass Array, a sequencing-independent method. For this, we performed T-WGBS on the commercially available smoker and COPD lung fibroblasts and selected 9 regions with different methylation levels for validation using a Mass Array. We obtained an excellent correlation between both methods, providing technical validation of T-WGBS and our analysis workflow. This validation was published in our earlier manuscript (PMID: 37143403), but we provided the data below for convenience.

      Scatter plots showing correlation of average methylation obtained with T-WGBS and Mass Array from COPD and smoker fibroblasts. Each dot represents one region with varying methylation levels. The blue diagonal represents the linear regression. Shaded areas are confidence intervals of the correlation coefficient at 95%. Correlation coefficients and P values were calculated by the Pearson correlation method.

      To enable further validation and follow-up by the community, we included the full list of DMRs, associated p-values and additional information for DNA methylation analysis (DMR width, n.CpGs, MethylDiff, etc) in Table 3 (Table_3_wgbs_dmr_info.xlsx) and the information about DEGs from RNA-seq in Table 6 (Table_6_RNAseq_DEG_info.xlsx).

      • It isn't clear to me if DNA and RNA are from the same cells? The results say "cells matching those used for T-WGBS" but the methods suggest separate extractions so not the same cells? If they are not the same cells a comment on the implications of this should be included in the discussion for example, potentially some differences in cell type composition, storage time etc.

      Response:

      Lung tissue samples were freshly cryopreserved, and H&E slides derived from exemplary pieces of the tissue analyzed. Once we had a group of at least 3 samples comprising one non-COPD and 2 COPD samples, we processed them in parallel to limit sorting variation between control and disease samples. The sorted cells were counted, aliquoted and pelleted at 4{degree sign}C before flash freezing and storing at -80{degree sign}C. The storage time of the cell pellets varied between the donors. RNA and DNA were isolated from cell pellets collected from the same FACS sorting experiment; therefore, we do not expect differences in cell type composition. In addition, RNA and DNA isolation were performed for all sorted pellets in parallel. All library preparations for TWGBS and RNA-seq were performed for all donors in parallel and sequenced in a single batch to minimise batch effects and technical variability. This has now been clarified in the methods part of the manuscript.

      __CHANGE IN THE MANUSCRIPT____: __

      To minimize potential technical bias, samples from no COPD and COPD donors were processed in parallel in groups of 3 (one no COPD and 2 COPD samples).

      RNA and genomic DNA for RNA-seq and TWGBS were isolated from identical aliquots of sorted cell pellets.

      Genomic DNA was extracted from 1-2x104 sorted alveolar epithelial cells isolated from cryopreserved lung parenchyma from 11 different donors in parallel using QIAamp Micro Kit

      The TWGBS library preparations were performed for all donors in parallel and sequenced in a single batch to minimize batch effects and technical variability.* *

      RNA was isolated from flash-frozen pellets of 2x104 sorted AT2 cells from 11 different donors in parallel.

      The RNA-seq library preparation for all donors was performed in parallel and all samples were sequenced in a single batch to minimize batch effects and technical variability.

      • Line 193 the authors say "Since DMRs were overrepresented at cis-regulatory sites...." - "cis" needs to be defined. If you link DNAm regions to gene via "closest gene" does this not automatically mean you're outputs will be cis? Just needs better definition/explanation.

      Response:

      The term "cis‐regulatory sites" in our manuscript is intended to denote regulatory elements-such as enhancers, promoters, and other nearby control regions-that reside on the same chromosome and close to the genes they regulate. While it's true that linking a DMR to its closest gene captures a cis association, our phrasing emphasises that the DMRs are enriched specifically at these functional regulatory elements (Fig. 2E) rather than being randomly distributed. This usage aligns with established conventions in the field. To avoid any misunderstandings, we have now changed the term to gene regulatory sites.

      __CHANGE IN THE MANUSCRIPT____: __

      *We changed the "cis-regulatory sites" to "gene regulatory sites" *

      __Minor comments: __

      Line 157: "we identified site-specific differences....". Change to region specific?

      Response:

      This has now been corrected as suggested.

      Line 102-103: needs a reference for the statement "Alterations in DNA methylation patterns have been implicated......"

      Response:

      Following the reviewer's suggestion, we added the relevant references (34-36) to this statement.

      Line 266 - what does "strong dysregulation" mean? Large fold change, very significant?

      Response:

      We removed the word "strong" from this sentence.

      Lines 423-425 - statement needs a reference

      Response:

      Following the reviewer's suggestion, we added the relevant reference to this statement.

      Line 428 - word missing between "epigenetic , we"?

      Response:

      This has now been corrected. The text reads: "Through treatment with a demethylating drug and targeted epigenetic editing, we demonstrated the ability to modulate..."

      Prior studies are well references, text and figures are clear and accurate.

      __Reviewer #2 (Significance (Required)): __

      This study has several strengths:

      1) Sample collection and characterisation. AT2 cells are incredibly hard to come by and the authors should be commended to generating the samples. However, proximity to cancer is always a potential issue, especially in epigenetic studies. Is it feasible to include any analysis to show the samples derived from those with cancer don't drive the changes observed? Even a high level PCA or an edit of fig 2A with non-cancer in a different colour in supplemental - looks like there is one outlier, is that a non-cancer? Or a correlation of change in beta between control and cancer/COPD and control and non-cancer:COPD (for want a better phrase!). just an indicator that the non-cancer COPD samples are not driving differences.

      Response:

      We thank the reviewer for highlighting the value of generating data from hard-to-work-with AT2 populations and bringing up the important point of cancer proximity, which we considered very carefully when designing our study. To match our samples across the cohort, all the no-COPD, COPD I, and two of the COPD II-IV distal lung samples were obtained from cancer resections. In addition to other characteristics, like age, BMI and smoking status, we also matched the donors by cancer type (all profiled donors had squamous cell carcinoma). We collected lung tissue as far away from the carcinoma as possible and sent representative pieces for histological analysis by an experienced lung pathologist to confirm the absence of visible tumours. In addition, to ensure that our data represents COPD-relevant signatures, we intentionally included samples from three COPD donors undergoing lung resections (without a cancer background) in the profiling.

      Following the reviewer's suggestion, to investigate the potential impact of non-cancer samples on driving the observed differences, we carefully checked the PCAs for both DNA methylation and RNA-seq. We could not identify a clear separation of no-cancer COPD samples from the cancer COPD samples (or other cancer samples) in any examined PCs, indicating no cofounding effect of cancer samples. We observed that one sample contributing to PC2 is a non-cancer sample, but this was a rather sample-specific effect, as the other two non-cancer samples clustered together with the other severe COPD samples with a cancer background. Notably, in our DNA methylation data, we do not observe typical features of cancer methylomes, like global loss of DNA methylation or aberrant methylation of CpG islands (e.g., in tumour suppressor genes) (see Fig. 2A), further suggesting that we do not "pick up" confounding cancer signatures in our data.

      Following the comments from both reviewers, to clarify that point, we added the information about cancer and non-cancer samples to the PCA figures for DNA methylation (new Fig. 2B) and RNA-seq (new Fig. 3A) data in the revised manuscript, as shown below

      CHANGE IN THE MANUSCRIPT____:

      COPD samples from donors with a cancer background clustered together with the COPD samples from lung resections, confirming that we detected COPD-relevant signatures (Fig. 2B).

      Fig. 2B.* Principal component analysis (PCA) of methylation levels at CpG sites with > 4-fold coverage in all samples. COPD I and COPD II-IV samples are represented in light and dark green triangles, respectively, and no COPD samples as blue circles. COPD samples without a cancer background are displayed with a black contour. The percentage indicates the proportion of variance explained by each component. *

      Unsupervised principal component analysis (PCA) on the top 500 variable genes revealed a clear influence of the COPD phenotype in separating no COPD and COPD II-IV samples, as previously observed with the DNA methylation analysis, irrespective of the cancer background of COPD samples (Fig.3A, Fig. S2B).

      *Principal component analysis (PCA) of 500 most variable genes in RNA-seq analysis. PCA 1 and 2 are shown in Fig.3A, PCA 1 and 4 in Fig.S2B. COPD I and COPD II-IV samples are represented in light and dark green triangles, respectively, and no COPD samples as blue circles. COPD samples without a cancer background are displayed with a black contour. The percentage indicates the proportion of variance explained by each component. *

      2) This is the first time DNAm has been profiled in AT2 cells. It is incredibly difficult, valuable and novel data that will increase the fields capability technically, their understanding of functional mechanisms and potential translation considerably. It's audience will be primarily translational respiratory however the fundamental science aspect of gene expression regulation by DNA methylation with have wider reach across developmental and disease science.

      Response:

      We thank the reviewer for recognising the uniqueness and novelty of our study and highlighting the value and potential impact of our datasets for the lung field.

      3) the functional analysis using targeted CRISPR-Cas9 is very well done and adds impact.

      Response:

      We thank the reviewer for recognising the strengths and added value of the functional analysis using epigenetic editing.

      __Potential weaknesses/areas for development __

      I feel the main weakness is the in the section integrating DNA methylation and gene expression. The rationale for a focus on various aspects, for example inversely related DNAm/gene expression pairs, the IFN pathway and IRF9, are not clear. Also further understanding of the differences between DNAm associated genes and non-DNAm associated genes could be expanded, at the pathway level, TF regulation level, effect size level (are DNAm associated changes to gene expression larger, enriched for earlier differential expression)

      Response:

      Our rationale for focusing on the inversely related DNAm/gene expression pairs in promoter proximal is purely data-driven, as they represent the biggest group in our data (Fig. 4A-B). Among those negatively correlated genes, we observed the strongest enrichment for the IFN pathway (Fig. C), making it an obvious, data-driven target for further studies. The negative correlation of expression and methylation for IFN pathway genes could be validated in 5-AZA assays in A549 cells (Fig. 5A). Next, we made an interaction network analysis showing IRF9 and STAT2 as master regulators (Fig. 5B) of the negatively correlated IFN genes. As IRF9 itself displayed a negative correlation between DNA methylation and expression (Fig. 5C), we used the associated DMR for further epigenetic editing (Fig. 5D-E). We performed the additional requested analyses of the enhancer-associated changes and genes, as described above. We fully agree with the reviewer that our data sets are a great resource and can be further used to elaborate on other relationships of DNA methylation and RNA expression or other pathways, but this is out of the scope of this study. To enable further studies by the research community, we provide all necessary information about DMRs and DEGs in the associated supplementary tables and the raw data through the EGA, as well as the CRISPRa editing assay.

      The authors could comment on potential masking of differences between 5hmC and mC and the implications it may have

      Response:

      We thank the reviewer for bringing up this important point. Indeed, bisulfite sequencing cannot differentiate between methylated and hydroxymethylated cytosines; hence, some of the methylated sites may be hydroxymethylated. However, the overall levels of hydromethylation in differentiated adult tissues are very low (except for the brain), orders of magnitude lower compared to DNA methylation. Following the reviewer's suggestion, we have added a sentence in the limitation section of the discussion to clarify that point.

      __CHANGE IN THE MANUSCRIPT: __

      In addition, while WGBS provides unprecedented resolution and high coverage of the DNA methylation sites across the genome, it does not allow distinguishing 5-methylcytosine from 5-hydroxymethylcytosine. Therefore, we cannot exclude that some methylated sites we detected are 5-hydroxymethylated. However, the 5-hydroxymethylcytosine is present at very low levels in the lung tissue (97)*. ** *

      Furthermore, while the rationale for looking at DMRs is clear, especially given the sample number, I am interested to understand what proportion of the assayed CpGs "fit" within the cut off stipulations of the DMR analysis - that is, is their potentially COPD effects at sparse CpG regions/individual CpG sites that are not being identified. A comment on this would be useful and seems the strength of profiling genome wide. I'm happy genome wide is beneficial it just feels a little circular that the authors have chosen whole genome to avoid the bias of the Illumina array and a focus on promotors, but have primarily reported promoter DNAm. This caught my attention again in the discussion where the authors state that cis-regulatory regions were also identified in their fibroblast data .....is this finding a factor of the analysis performed? (also a comparison of regions Identified in AT2 cells versus fibroblasts would be really interesting for a future paper)

      Response:

      We decided to focus our analysis on regions rather than individual CpG sites when looking at differential methylation, as DNA methylation is spatially correlated, and methylation changes in larger regions are more likely to have a biological function. Extending the analysis to single CpG sites would require a higher number of samples for a reliable analysis compared to the DMR analysis (as mentioned by the reviewer).

      Of note, we addressed the platform comparison between Illumina array technology and WGBS in our previous fibroblast study (PMID: 37143403), where we compared our WGBS data with the published 450k array data of COPD parenchymal fibroblasts (Clifford et al., 2018). We observed only a marginal overlap between the CpGs from our DMRs and the CpGs probes available on the array (which was due to the differences in technologies used and the limited coverage of the 450K array in comparison to our genome-wide approach, in which we covered 18 million CpGs). Out of the 6279 DMRs identified in our fibroblast study, only 1509 DMRs overlapped with at least one CpG probe on the 450K array, and after removing low-quality CpGs from the array data, only 1419 DMRs were left. This comparison highlighted the increased resolution of the WGBS compared to Illumina arrays.

      The reason why we focused on promoter proximal DMRs are the following: 1) the assignment of the enhancer elements in AT2 to the corresponding gene is still too inaccurate in the absence of AT2 specific enhancer chromatin maps 2) regulation at enhancers by DNA methylation might be more complex and might change (increase or attenuate) binding affinities of certain transcription factors (Fig.2H), which might lead to gene expression changes or 3) methylation changes might be an indirect effect of differential TF binding PMID: 22170606). However, we agree with the reviewer that despite these limitations, expanding the analysis beyond promoters adds value to the manuscript; hence, as described above, we expanded the analysis of non-promoter regions, including enhancers, in the revised manuscript.

      We thank the reviewer for the suggestion to compare the regions identified in AT2 cells and fibroblasts in a future paper.

      My expertise:Respiratory, cell biology, epigenetics.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This study aim to understand the molecular mechanisms underlying dysfunction in AT2 cells in COPD, by profiling bulk genome wide DNA methylation using Tagmentation-based whole-genome bisulfite sequencing (T-WGBS) and RNA sequencing in selectively sorted primary AT2 cells. The study stands out in it's sequencing breadth and use of an incredibly difficult cell population, and has the potential to add substantially to our mechanistic understanding of epigenetic contributions to COPD. A further highlight is the concluding aspect of the study where the authors undertook targeted modification of specific CpG methylation, provided direct, site-specific evidence for transcriptional regulation by CpG methylation.

      Major comments:

      The authors clearly show that there is DNA methylation alteration in AT2 cells from COPD individuals that links functional to gene expression at some level. However, I think the statement "to identify genome-wide changes associated with COPD development and progression..." and similar other references to disease development understanding is not accurate given the DNA methylation primary comparison is between control and moderate to severe COPD, with no temporal detail or evidence that they drive progression rather than are a result of COPD development. The paragraph starting on line 186 where this is a addressed to some extent is quite vague and doesn't really provide confidence that DNAm dysregulation occurs at an early stage in this context. This can be addressed by changing the focus/style of the text.

      Results comments and suggestions:

      For the integrated analysis, there is a focus on DMRs in promoters with very little analysis on other regions. The paragraph starting on line 317 describes some analysis on enhancers but is very brief, doesn't include information on how many/which DMRs were included, making it hard to interpret the impact of the 147 DMRs and 93 genes identified - is this nearly all DMRs and genes analysed or very few? A comparison to the promoter analysis would be of interest. Especially as the targeted region followed up with lovely functional assessment in the last sections is a gene body DMR, not a promoter DMR.

      • Lines 299-301 - I'm not sure the graph in Fig S3A support the conclusion that there was a preferential negative relationship between DNAm and gene expression. Looks like there are a substantial number of cases where a positive relationship is observed and this needs to be acknowledged.

      • Line 307 - what are the "analysed DEGs"? Are they the methylation associated genes?

      • Line 307-309 - "Among the analyzed DEGs, 76.5% (492) displayed a negative correlation (16.8% of the total DEGs), indicating a possible direct regulation by DNA methylation, while 23.5% (151) showed a positive correlation between gene expression and DNA methylation" - are the authors suggesting the positive correlation doesn't indicate direct regulation?

      • Line 313 - why did the authors focus on only negatively correlated genes to identify their top dysregulated pathway of IFN signalling? Why not do pathway analysis on the DNAm associated genes separately to identify DNAm associated pathways?

      • A comparison of the gene expression data with previous data in AT2 cell/single cell data would strengthen the gene expression section.

      • The paragraph starting on line 173 feels a little redundant when we know there is RNA available to test if the differential DNAm links to altered gene expression - this selected of example regions/genes would be better placed after the gene expression has been reported, at which point you could say whether the linked genes displayed altered transcription.

      • Similarly, the TF enrichment analysis is great but maybe would have added value to be done on DNA regions later shown to be linked to differential expression - was there different enrichment at DNA regions that are vs are not associated with altered expression? And could you test in vitro whether changing methylation of DNA (maybe a blunt too like 5-aza would be ok) alters TF binding (cut+run/ChIP?). Furthermore it would be interesting to understand the TF sensitivity analysis within the context of positive versus negative DNA methylation:gene expression correlations.

      Methods:

      • The authors should include more detail of the TWGBS rather than directing the reader to a previous publication. Also DNA concentration post bisuphite conversion would be a useful metric to provide.

      • Differential DNA methylation analysis: It is stated that DNA regions had to contain 3 CpG sites but was this within a defined DNA size range?

      • Refence genome only provided for RNAseq not TWGBS?

      • The tables do not appear in the PDF and I struggled to tally to the "Dataset" files provided if that is what they were referring to?

      • For the gene expression analysis, can it be made clearer that a full analysis was done on COPD I samples. It is a little confusing to the reader as this was not done for DNAm so might be assumed the same targeted analysis on only genes found to be differentially expressed between control and COPD II-IV, but that cannot be the case as an overlap of COPD1 vs COPD II-IV genes if provided. For this overlap, do genes show the same effect direction?

      • Replication is difficult on these studies as the samples are so difficult to come by. Also limited by sample size for the same reason. It doesn't mean the study is not worth doing and the data are still valuable. However, it may be pertinent to include technical validation of a few regions of interest, acknowledge the limitation (along side strengths) in the discussion, and perhaps provide actual p value rather than blanket < p 0.1, seems very lenient but may all be super significant (this may already be in the tables I wasn't able to find).

      • It isn't clear to me if DNA and RNA are from the same cells? The results say "cells matching those used for T-WGBS" but the methods suggest separate extractions so not the same cells? If they are not the same cells a comment on the implications of this should be included in the discussion for example, potentially some differences in cell type composition, storage time etc.

      • Line 193 the authors say "Since DMRs were overrepresented at cis-regulatory sites...." - "cis" needs to be defined. If you link DNAm regions to gene via "closest gene" does this not automatically mean you're outputs will be cis? Just needs better definition/explanation.

      Minor comments:

      • Line 157: "we identified site-specific differences....". Change to region specific?

      • Line 102-103: needs a reference for the statement "Alterations in DNA methylation patterns have been implicated......"

      • Line 266 - what does "strong dysregulation" mean? Large fold change, very significant?

      • Lines 423-425 - statement needs a reference

      • Line 428 - word missing between "epigenetic , we"?

      • Prior studies are well references, text and figures are clear and accurate.

      Significance

      This study has several strengths:

      1) Sample collection and characterisation. AT2 cells are incredibly hard to come by and the authors should be commended to generating the samples. However, proximity to cancer is always a potential issue, especially in epigenetic studies. Is it feasible to include any analysis to show the samples derived from those with cancer don't drive the changes observed? Even a high level PCA or an edit of fig 2A with non-cancer in a different colour in supplemental - looks like there is one outlier, is that a non-cancer? Or a correlation of change in beta between control and cancer/COPD and control and non-cancer:COPD (for want a better phrase!). just an indicator that the non-cancer COPD samples are not driving differences.

      2) This is the first time DNAm has been profiled in AT2 cells. It is incredibly difficult, valuable and novel data that will increase the fields capability technically, their understanding of functional mechanisms and potential translation considerably. It's audience will be primarily translational respiratory however the fundamental science aspect of gene expression regulation by DNA methylation with have wider reach across developmental and disease science.

      3) the functional analysis using targeted CRISPR-Cas9 is very well done and adds impact.

      Potential weaknesses/areas for development:

      I feel the main weakness is the in the section integrating DNA methylation and gene expression. The rationale for a focus on various aspects, for example inversely related DNAm/gene expression pairs, the IFN pathway and IRF9, are not clear. Also further understanding of the differences between DNAm associated genes and non-DNAm associated genes could be expanded, at the pathway level, TF regulation level, effect size level (are DNAm associated changes to gene expression larger, enriched for earlier differential expression) The authors could comment on potential masking of differences between 5hmC and mC and the implications it may have

      Furthermore, while the rationale for looking at DMRs is clear, especially given the sample number, I am interested to understand what proportion of the assayed CpGs "fit" within the cut off stipulations of the DMR analysis - that is, is their potentially COPD effects at sparse CpG regions/individual CpG sites that are not being identified. A comment on this would be useful and seems the strength of profiling genome wide. I'm happy genomewide is beneficial it just feels a little circular that the authors have chosen whole genome to avoid the bias of the Illumina array and a focus on promotors, but have primarily reported promoter DNAm. This caught my attention again in the discussion where the authors state that cis-regulatory regions were also identified in their fibroblast data ..... is this finding a factor of the analysis performed? (also a comparison of regions Id'ed in AT2 cells versus fibroblasts would be really interesting for a future paper)

      My expertise: Respiratory, cell biology, epigenetics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This study by Prada et al. aimed to explore DNA methylation and gene expression in primary EpCAMhigh/PDPNlow cells, consisting for (probably) the largest part of AT2 cells, to understand the molecular mechanisms behind the impaired regeneration of alveolar epithelial progenitor cells in COPD. They found that higher or lower promoter methylation in COPD-associated cells was inversely correlated with changes in gene expression, with interferon signaling emerging as one of the most upregulated pathways in COPD. IRF9 was identified as the master regulator of interferon signaling in COPD. Targeted DNA demethylation of IRF9 in an A549 cell line resulted in a robust activation of its downstream target genes, including OAS1, OAS3, PSMB8, PSMB9, MX2 and IRF7, demonstrating that demethylation of IRF9 is sufficient to activate the IFN signaling pathway, validating IRF9 as a master regulator of IFN signaling in (alveolar) epithelial cells.

      Major comments:

      • To remove airways (and blood vessels) completely from the lung tissue is difficult, if not impossible. This means that the assumption that the sorted EpCAMpos/PDPNlow cells primarily consisted of AT2 cells remains valid only if a quantitative analysis is conducted on the proportion of HT2-280pos cells in all samples in cytospins to exclude any significant contamination from bronchial epithelial cells. If authors cannot demonstrate >95% pure HT-280-positive cells, then the key conclusions suggesting that the epigenetic regulation of the IFN pathway might be crucial in AT2 progenitor cell regeneration could also potentially apply to bronchial progenitor cells. In addition, if >95% purity can not be demonstrated, the data should be adjusted to account for differences in cell type composition.

      • The overrepresentation of several keratins (KRT5, KRT14, KRT16, KRT17), mucins (MUC12, MUC13, MUC16, MUC20) and the transcription factor FoxJ1 is now attributed by the authors to a possible dysregulation of AT2 identity and differentiation in COPD (lines 282 - 284) where they cite refs 28, 69, 70. Authors try to support this with IF double stains for KRT5 and HT-280 to identify co-expression of KRT5 and HT2-280 in lung tissue (Figure S2H). However, the evidence for the co-expression of both markers could be presented more convincingly.

      • Double staining for KRT5 and HT2-280 did highlight the proximity of both cell types in lung tissue, underscoring the challenge of removing airways (including the smaller and terminal bronchi) from the tissue. In addition, HT-280/KRT5 co-expression in not consistent with recent studies from refs 28, 69, 70 where other markers for distal airway cell transition, such as SCGB3A2 and BPIFB1, have been demonstrated, which were not investigated in this study.

      • The small (and not evenly divided) sample size of both COPD and non-COPD specimens may lead to a higher risk for false positive results as adjustments for multiple testing typically rely on the number of comparisons, and small sample sizes may not provide enough data points to adequately control for this.

      Minor comments:

      Introduction:

      • In general, refer to the actual experimental studies rather than review papers where appropriate.

      • Clearly specify whether a study was conducted in mice or humans, as this distinction is crucial for understanding the relevance of the findings to COPD.

      Methods:

      • Line 473, here is meant 3 ex-smoker controls instead of smoker controls?

      Discussion:

      • A list of limitation should be added to the discussion. One is the use of the alveolar cell line A549, which produces mucus, a characteristic more commonly associated with bronchial epithelial cells. (ref 43)

      • Another limitation to consider is that cells were isolated primarily from individuals with lung cancer, except for patients with COPD stage IV. In particular as COPD stage II and IV samples were taken together.

      • And discuss the small and unevenly divided sample size

      References:

      • Check references. For instance, there is no reference in the text to ref 43.

      • Align format of references

      Significance

      The strength of this study lies in its focus on the molecular mechanisms underlying the impaired regeneration of epithelial progenitor cells in COPD. The discovery of IRF9, which regulates IFN signaling and is prominently upregulated in COPD, together with the convincing validation of the epigenetic control of the IFN pathway by targeted DNA demethylation of the IRF9 gene, adds significant value to the COPD research field.

      Main limitations of the study are the relatively small sample size of both COPD and non-COPD specimens and the claim that the sorted EpCAMpos/PDPNlow cells primarily consisted of AT2 cells.

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The nature and significance of the advance in epigenetic editing of IRF9 in COPD can be described as both conceptual and potentially clinical: Conceptual Advance: The epigenetic editing of IRF9 enhances our understanding of the molecular mechanisms underlying COPD pathogenesis. By targeting IRF9 through epigenetic modifications, researchers were able to modulate the activity of the IFN pathway, which plays a crucial role in the immune response and lung tissue regeneration. This approach offers insights into the epigenetic regulation of gene expression in epithelial progenitor cells in COPD and expands our understanding of how alterations in specific gene methylation could contribute to disease progression. Clinical Significance: The potential clinical significance of epigenetic editing of IRF9 lies in its implications for COPD therapy. If successful, epigenetic editing techniques could offer a novel therapeutic strategy for COPD by downregulating IFN pathway activation and promoting regeneration of epithelial progenitor cells in the lungs. Obviously, further preclinical and clinical studies are needed to validate the efficacy and safety of epigenetic editing approaches in COPD treatment.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      Few experimental papers have been published on epigenetic editing in lung diseases, with limited research available beyond the study referenced in citation 43. Song J, Cano-Rodriquez D, Winkle M, Gjaltema RA, Goubert D, Jurkowski TP, Heijink IH, Rots MG, Hylkema MN. Targeted epigenetic editing of SPDEF reduces mucus production in lung epithelial cells. Am J Physiol Lung Cell Mol Physiol. 2017 Mar 1;312(3):L334-L347. doi: 10.1152/ajplung.00059.2016. Epub 2016 Dec 23. PMID: 28011616.

      • State what audience might be interested in and influenced by the reported findings.

      This study is of broad interest to researchers investigating the pathogenesis and treatment of COPD.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view.

      Expertise in: Lung pathology, Immunology, COPD, Epigenetics

      • Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Less expertise in: Epigenetic Editing

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: findings and key conclusions Epithelial cell competition in larval imaginal discs involves signaling with the Sas ligand and Ptd10D receptor. In wild type cells both are typically found at the apical surface, but relocalize to the lateral cortex at the winner-loser interface. Ptd10D activation leads to reduced Ras signaling, increased pro-apoptotic Jnk signaling and consequently the elimination of loser cells. In the manuscript the authors address the role of the actin cytoskeleton in the context of the signaling controlling cell elimination in Drosophila larval eye imaginal discs. They interfere by clonal overexpression of the guanyl nucleotide exchange factor RhoGEF2 (RG2), which has previously been shown to induce dominant gain-of-function phenotypes by activation of Rho signaling. In this context the requirement of and genetic interactions with the other pathways implicated in cell elimination is tested. They find that RG2 induced cell elimination depends on PtD10D, Hippo signaling and Crumbs.

      Major comments: claims and conclusions The experimental setting, using clonal analysis in imaginal discs, is straight-forward and well-established, including quantification of clone size and comparison of phenotypes. The presented data are of high quality and thus the direct conclusions are fully supported by the data as long as they refer to the actual experimental interference. What is not supported by the data is the generalization of the conclusions, i. e. that RG2 overexpression would be equivalent to Actin cytoskeletal deregulation. This equivalence is expressed in the title "Actin cytoskeletal deregulation, caused by RhoGEF2 overexpression.." and the summary " that actin cytoskeleton deregulated cells (as induced by RhoGEF2 overexpression (RhoGEF2OE))...". In my view such an equivalence is not justified. There is no doubt that RG2 overactivation affects the actin cytoskeleton in multiple ways, such as contractility via MyoII or polymerization via Dia, among others. There is also no doubt that other pathways are also directly or indirectly affected beside the actin cytoskeleton. The authors do not present data showing the specificity of RG2 overexpression. For example, the authors could investigate the phenotype and genetic interaction with an alternative way of interference, independent of RG2, of the actin cytoskeleton to support their conclusion. There is a second assumption, which may not be justified, that the function of the cytoskeleton would be generally downstream of cell polarity, see abstract l24 "triggering cytoskeletal deregulation (which occurs downstream of cell polarity disruptions)..". There are certainly cytoskeletal activities such as cell shape changes that mediate the execution of cell elimination. However interfering with the cortical cytoskeleton also affect the distribution of cortical polarity proteins. The authors do not present data to demonstrate the specificity of RG2 overexpression concerning a function downstream of cell polarity.

      Response: We apologise for our phrasing of the title and the sentence in the summary that suggests that it is the actin cytoskeleton disruption caused by RhoGEF2 overexpression that is responsible for the effects on cell competition. We have rephrased the title and edited the text to avoid such an inference.

      With regard to the reviewer’s second concern regarding the link between cell polarity disruption and actin cytoskeletal deregulation, there is indeed evidence that this occurs.

      There are numerous examples of how cell polarity regulators affect the actin cytoskeleton in both Drosophila and mammalian cells (reviewed by Humbert et al., 2015, DOI 10.1007/978-3-319-14463-4_4). Indeed, in our previous paper (Brumby et al., 2011. PMID: 21368274), we found genetic evidence that the knockdown of the polarity regulator, dlg, cooperates with activated Ras (RasACT) to produce a hyperplastic eye phenotype, and that this phenotype is rescued by knockdown of actin cytoskeletal regulators like RhoGEF2 or Rho. This data suggests that these actin cytoskeleton regulators act downstream of cell polarity disruption to cooperate with RasACT. Furthermore, another study has shown that the activation of Myosin II is increased in scrib mutants and impairs Hippo pathway signaling, and is also required for the cooperation of scrib mutants with RasACT (Külshammer, et al., 2013. PMID: 23239028). Consistent with this finding, we have previously shown that RhoGEF2 acts via Rho, Rok, and Myosin II activation in cooperation with RasACT (Khoo et al., 2013. PMID: 23324326). Furthermore, another cell polarity regulator, Lgl, binds to and negatively regulates Myosin II function in Drosophila (Strand et al., 1994. PMID: 7962095; Betschinger et al., 2005. PMID: 15694314). Moreover, Drosophila Scrib and Dlg bind to GUK-holder/NHS1 (Nance–Horan syndrome-like 1), which is a regulator of the WAVE/SCAR-ARP2/3-branched F-actin pathway, and this interaction is required for epithelial tissue development (Caria et al., 2018. PMID: 29378849). Thus, although cell polarity gene loss can affect the actin cytoskeleton by different means, and RhoGEF2 can activate Rho to regulate various actin cytoskeletal effectors (Limk, Dia, PKN, Rok), what they have in common is the activation of Myosin II. To make this clearer, we have now added brief sections to the introduction and Discussion highlighting and contextualising evidence for the effect of cell polarity disruption on the actin cytoskeleton.

      Reviewer #1 (Significance (Required)):

      The study establishes genetic interactions and dependencies concerning cell elimination following a very specific experimental interference of RG2 overexpression. It remains unclear, however, to which degree these genetic interactions contribute to controlling cell competition in situations that are physiologically relevant. The generalization of RG2 overexpression as a specific test the function of the actin cytoskeleton is an interpretation not supported by the presented data and the experimental set up.

      Response: Although RhoGEF2 overexpression does lead to actin cytoskeletal disruption via Rho effectors, the reviewer is correct that we do not know whether it is the actin cytoskeleton disruption per se that is involved in triggering cell competition. We have edited the text accordingly.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In the manuscript "Actin cytoskeletal deregulation, caused by RhoGEF2 overexpression, induces cell competition dependent on Ptp10D, Crumbs, and the Hippo signaling pathway", Natasha et al. investigate how actin cytoskeletal deregulation drives cell competition in the Drosophila eye disc. By overexpressing RhoGEF2 to induce cytoskeletal disruption and utilizing genetic knockdowns of various candidate genes, the authors examine the spatial distribution and interaction between normal and deregulated cell populations. Their findings demonstrate that cell competition and clone elimination in this context are dependent on sas-Ptp10D, scrib, and components of the Hippo signaling pathway. The study is well executed and provides a potentially impactful contribution to the field. The experimental design is solid, and the conclusions are generally well supported by the data. Only minor revisions are needed to strengthen clarity and presentation. Specific suggestions and comments regarding significance are listed below.

      Major comments: There is a discrepancy between the representative images (Fig. 3A-C′) and the quantification in Fig. 3J. The statistical analysis may be limited by small sample size or suboptimal test choice (e.g., Kruskal-Wallis vs. ANOVA). Increasing the sample size and reassessing the statistical approach could strengthen this otherwise well-executed section.

      Response: The data is normally distributed, so we have repeated the analysis using a one-way ANOVA (instead of the Kruskal-Wallis test – Initially we used this one because of the small sample number, but the data is normally distributed, and so a one-way ANOVA is appropriate). From examining all the images again, we can ascertain that there is indisputably less active caspase-3 staining in RhoGEF2-OE Ptp10D-KD compared to RhoGEF2-OE Dicer2. We have selected a more suitable image that better represents this snapshot of active caspase-3 staining in RhoGEF2-OE Ptp10D-KD. Also, a more representative control image is now shown, where some baseline active caspase-3 staining is present.

      A minor concern relates to the interpretation and consistency of the statistical analyses used. For example, in Figure 5I, both the Kruskal-Wallis test and an unpaired t-test were used, with the authors stating that the t-test was applied specifically to compare wild-type and crb-/- clones (p = 0.0147). However, in the adjacent panel (Figure 5J), only a one-way ANOVA was used. This inconsistency may give the impression that the choice of statistical test in Figure 5I was influenced by a lack of significance with the Kruskal-Wallis test, rather than by experimental design. Unifying the statistical approach within related panels would improve clarity and minimize potential reader misinterpretation. Additionally, some of the statistical tests applied may not fully align with the underlying data distributions. Statistical methods used in parts of the manuscript may need to be reevaluated, and the rationale for their selection should be clarified in the text.

      Response: We have checked the data carefully, plotted all the individual data sets in R, and the data is not normally distributed. Therefore, conducting a Kruskal-Wallis test is the best approach. This analysis shows that there is no significant difference between crb-/- and WT in our experimental setting. However, there is a slight trend towards increased crb-/- clone size. We have added a more detailed description of the statistical methods used in different situations in the Materials and Methods section.

      In the section on how crb-/- affects actin distribution and accumulation within the tissue (Figure 6H′ and Supplementary Figure 5), it appears that F-actin may accumulate more prominently in cytoplasmic regions rather than at cell-cell junctions under crb-/- conditions. However, due to the current level of magnification, it is difficult to determine the precise subcellular localization. Although this question is somewhat tangential to the main focus of the manuscript and not essential for publication, it could be valuable, if the authors included a few higher-magnification images showing F-actin distribution in RhoGEF2OE Dicer2, RhoGEF2OE Ptp10D KD, and RhoGEF2OE crb-/- conditions. Including these in the supplementary figures could help clarify how actin cytoskeletal regulation is affected.

      Response: We have added zoomed-in images to Figs 6G and 6H to show the effect on F-actin more clearly. It is possible that F-actin may be more prominent in the cytoplasm in crb-/- clones, however further experiments would be needed to provide more evidence for this, which are unfortunately beyond the scope of our capabilities at this time.

      In Figure 6H′ the Diap1 signal in the RhoGEF2OE condition appears non-uniform, with noticeably weaker intensity on the left side of the image and stronger signal on the right. This asymmetry is not observed in the RhoGEF2OE crb-/- condition shown in Figure 6K′. It is unclear whether this pattern reflects a biological phenomenon consistently observed in RhoGEF2OE tissues or if it might result from technical factors such as uneven mounting or imaging. To prevent potential misinterpretation, we recommend clarifying this point, providing additional representative images if available, or replacing the current image with one that more clearly reflects the typical expression pattern.

      Response: We assume the reviewer means Fig 6J, and we have replaced the image with a more representative one.

      In Fig. 3B′, cleaved Caspase-3 appears localized to specific regions at the WT/RhoGEF2OE interface, suggesting spatial bias in Ptp10D-dependent elimination. This raises important questions about what determines regional susceptibility-are certain tissue conditions or cell states more prone to apoptosis in this context? Figure 3 raises the question of whether RhoGEF2OE-induced, actin-deregulated clones undergo dynamic changes, such as expanding or regressing, over the course of the larval stage. Such temporal variability could influence GFP⁺ clone size and the expression of apoptotic markers like cleaved Caspase-3 and Diap1. The stated use of the L3 stage, which spans ~48 hours (Tennessen & Thummel, 2011), lacks sufficient temporal resolution. Clarifying the timing of dissection and fixation relative to clone induction would improve interpretation of clone behavior and marker dynamics.

      Response: While the reviewer raises an interesting question about spatial and temporal sensitivities to apoptosis upon genetic perturbations, we have conducted all of our experiments on samples obtained from the wandering L3 stage. We have added the following text to the Materials and Methods to make it clearer: “Wandering third-instar larvae (L3) were picked for all experiments, and for each experiment all larvae were of equivalent size.”.

      Minor comments: GFP signal appears weaker in the wild-type group compared to experimental conditions, raising the question of whether image processing (e.g., contrast and color balance) was applied uniformly and if this difference reflects true variation in expression.

      Response: Yes, images were always identically processed. We have stated in the Materials and Methods imaging section: “Laser intensity and gain was unchanged within each experimental group”.

      For Figures 2, 3, and 5, including representative images for each eye phenotype category would clarify the scoring criteria. In Figure 5, the use of a "2.5" category in the main figure should be explained-does it correspond to category 3 or indicate an intermediate phenotype?

      Response: Apologies for this error, and thanks to the reviewer for highlighting this. The “2.5” rating was a mistake based on a previous classification scale we used, and we have changed 2.5 to 3 in the graph. We have also included a new supplementary figure explaining our rankings (Supp Fig 10).

      In Figure 5I, the y-axis range (0-150%) is broader than needed; adjusting it to 0-100% would better reflect the data and improve clarity.

      Response: We have edited the Fig 5I graph accordingly.

      The sentence from line 343- 348 is long and challenging to follow.

      Response: We have reworded the sentence.

      Missing the Figure number on Line 286.

      Response: We have added the Figure number.

      Reviewer #2 (Significance (Required)):

      Significance: This study is well executed and rigorously addresses previously reported variations in phenotypic outcomes across laboratories. Beyond clarifying the role of Ptp10D in cell competition, the authors establish RhoGEF2 overexpression as a reliable method to induce cell competition and identify key molecular players involved in this process. This work represents a meaningful advance by introducing novel approaches and deepening understanding of known factors in clone elimination. The mosaic RhoGEF2 overexpression technique developed in this study provides a valuable tool for investigating cell-cell interactions at the tissue level, with broad applicability in basic research. This approach holds particular promise for probing.

      Response: We thank the reviewer for their support of the significance and quality of our manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In the manuscript "Actin cytoskeletal deregulation, caused by RhoGEF2 overexpression, induces cell competition dependent on Ptp10D, Crumbs, and the Hippo signaling pathway", Natasha et al. investigate how actin cytoskeletal deregulation drives cell competition in the Drosophila eye disc. By overexpressing RhoGEF2 to induce cytoskeletal disruption and utilizing genetic knockdowns of various candidate genes, the authors examine the spatial distribution and interaction between normal and deregulated cell populations. Their findings demonstrate that cell competition and clone elimination in this context are dependent on sas-Ptp10D, scrib, and components of the Hippo signaling pathway. The study is well executed and provides a potentially impactful contribution to the field. The experimental design is solid, and the conclusions are generally well supported by the data. Only minor revisions are needed to strengthen clarity and presentation. Specific suggestions and comments regarding significance are listed below.

      Major comments:

      There is a discrepancy between the representative images (Fig. 3A-C′) and the quantification in Fig. 3J. The statistical analysis may be limited by small sample size or suboptimal test choice (e.g., Kruskal-Wallis vs. ANOVA). Increasing the sample size and reassessing the statistical approach could strengthen this otherwise well-executed section. A minor concern relates to the interpretation and consistency of the statistical analyses used. For example, in Figure 5I, both the Kruskal-Wallis test and an unpaired t-test were used, with the authors stating that the t-test was applied specifically to compare wild-type and crb-/- clones (p = 0.0147). However, in the adjacent panel (Figure 5J), only a one-way ANOVA was used. This inconsistency may give the impression that the choice of statistical test in Figure 5I was influenced by a lack of significance with the Kruskal-Wallis test, rather than by experimental design. Unifying the statistical approach within related panels would improve clarity and minimize potential reader misinterpretation. Additionally, some of the statistical tests applied may not fully align with the underlying data distributions. Statistical methods used in parts of the manuscript may need to be reevaluated, and the rationale for their selection should be clarified in the text. In the section on how crb-/- affects actin distribution and accumulation within the tissue (Figure 6H′ and Supplementary Figure 5), it appears that F-actin may accumulate more prominently in cytoplasmic regions rather than at cell-cell junctions under crb-/- conditions. However, due to the current level of magnification, it is difficult to determine the precise subcellular localization. Although this question is somewhat tangential to the main focus of the manuscript and not essential for publication, it could be valuable, if the authors included a few higher-magnification images showing F-actin distribution in RhoGEF2OE Dicer2, RhoGEF2OE Ptp10D KD, and RhoGEF2OE crb-/- conditions. Including these in the supplementary figures could help clarify how actin cytoskeletal regulation is affected. In Figure 6H′, the Diap1 signal in the RhoGEF2OE condition appears non-uniform, with noticeably weaker intensity on the left side of the image and stronger signal on the right. This asymmetry is not observed in the RhoGEF2OE crb-/- condition shown in Figure 6K′. It is unclear whether this pattern reflects a biological phenomenon consistently observed in RhoGEF2OE tissues or if it might result from technical factors such as uneven mounting or imaging. To prevent potential misinterpretation, we recommend clarifying this point, providing additional representative images if available, or replacing the current image with one that more clearly reflects the typical expression pattern. In Fig. 3B′, cleaved Caspase-3 appears localized to specific regions at the WT/RhoGEF2OE interface, suggesting spatial bias in Ptp10D-dependent elimination. This raises important questions about what determines regional susceptibility-are certain tissue conditions or cell states more prone to apoptosis in this context? Figure 3 raises the question of whether RhoGEF2OE-induced, actin-deregulated clones undergo dynamic changes, such as expanding or regressing, over the course of the larval stage. Such temporal variability could influence GFP⁺ clone size and the expression of apoptotic markers like cleaved Caspase-3 and Diap1. The stated use of the L3 stage, which spans ~48 hours (Tennessen & Thummel, 2011), lacks sufficient temporal resolution. Clarifying the timing of dissection and fixation relative to clone induction would improve interpretation of clone behavior and marker dynamics.

      Minor comments:

      GFP signal appears weaker in the wild-type group compared to experimental conditions, raising the question of whether image processing (e.g., contrast and color balance) was applied uniformly and if this difference reflects true variation in expression. For Figures 2, 3, and 5, including representative images for each eye phenotype category would clarify the scoring criteria. In Figure 5, the use of a "2.5" category in the main figure should be explained-does it correspond to category 3 or indicate an intermediate phenotype? In Figure 5I, the y-axis range (0-150%) is broader than needed; adjusting it to 0-100% would better reflect the data and improve clarity. The sentence from line 343- 348 is long and challenging to follow. Missing the Figure number on Line 286.

      Significance

      This study is well executed and rigorously addresses previously reported variations in phenotypic outcomes across laboratories. Beyond clarifying the role of Ptp10D in cell competition, the authors establish RhoGEF2 overexpression as a reliable method to induce cell competition and identify key molecular players involved in this process. This work represents a meaningful advance by introducing novel approaches and deepening understanding of known factors in clone elimination. The mosaic RhoGEF2 overexpression technique developed in this study provides a valuable tool for investigating cell-cell interactions at the tissue level, with broad applicability in basic research. This approach holds particular promise for probing

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: findings and key conclusions

      Epithelial cell competition in larval imaginal discs involves signaling with the Sas ligand and Ptd10D receptor. In wild type cells both are typically found at the apical surface, but relocalize to the lateral cortex at the winner-loser interface. Ptd10D activation leads to reduced Ras signaling, increased pro-apoptotic Jnk signaling and consequently the elimination of loser cells. In the manuscript the authors address the role of the actin cytoskeleton in the context of the signaling controlling cell elimination in Drosophila larval eye imaginal discs. They interfere by clonal overexpression of the guanyl nucleotide exchange factor RhoGEF2 (RG2), which has previously been shown to induce dominant gain-of-function phenotypes by activation of Rho signaling. In this context the requirement of and genetic interactions with the other pathways implicated in cell elimination is tested. They find that RG2 induced cell elimination depends on PtD10D, Hippo signaling and Crumbs.

      Major comments: claims and conclusions

      The experimental setting, using clonal analysis in imaginal discs, is straight-forward and well-established, including quantification of clone size and comparison of phenotypes. The presented data are of high quality and thus the direct conclusions are fully supported by the data as long as they refer to the actual experimental interference. What is not supported by the data is the generalization of the conclusions, i. e. that RG2 overexpression would be equivalent to Actin cytoskeletal deregulation. This equivalence is expressed in the title "Actin cytoskeletal deregulation, caused by RhoGEF2 overexpression.." and the summary " that actin cytoskeleton deregulated cells (as induced by RhoGEF2 overexpression (RhoGEF2OE))...". In my view such an equivalence is not justified. There is no doubt that RG2 overactivation affects the actin cytoskeleton in multiple ways, such as contractility via MyoII or polymerization via Dia, among others. There is also no doubt that other pathways are also directly or indirectly affected beside the actin cytoskeleton. The authors do not present data showing the specificity of RG2 overexpression. For example, the authors could investigate the phenotype and genetic interaction with an alternative way of interference, independent of RG2, of the actin cytoskeleton to support their conclusion.<br /> There is a second assumption, which may not be justified, that the function of the cytoskeleton would be generally downstream of cell polarity, see abstract l24 "triggering cytoskeletal deregulation (which occurs downstream of cell polarity disruptions)..". There are certainly cytoskeletal activities such as cell shape changes that mediate the execution of cell elimination. However interfering with the cortical cytoskeleton also affect the distribution of cortical polarity proteins. The authors do not present data to demonstrate the specificity of RG2 overexpression concerning a function downstream of cell polarity.

      Significance

      The study establishes genetic interactions and dependencies concerning cell elimination following a very specific experimental interference of RG2 overexpression. It remains unclear, however, to which degree these genetic interactions contribute to controlling cell competition in situations that are physiologically relevant. The generalization of RG2 overexpression as a specific test the function of the actin cytoskeleton is an interpretation not supported by the presented data and the experimental set up.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Cells need to adjust their gene expression pattern, including nutrient transporters and enzymes to process the available nutrient. How cells maintain the coordination between these processes is one of the most critical questions in biology. In this work authors elegantly combined a range of relevant experimental techniques, ranging from time-lapse microscopy, microfluidics, and mathematical modelling to address this question. Combining these methods, authors proposed a push-pull like mechanism, involving two pairs of repressors (Mth1, Std1 and Migs) in the glucose sensing network. In budding yeast there are multiple hexose transporter genes with varying affinity and transport rate. Authors postulated that on sensing glucose, cells switch between expressing high affinity glucose transporters (when extracellular glucose is low), and low affinity glucose transporters (in high extracellular glucose), and these processes are mediated by the pairs of repressors as mentioned earlier. Following the expressing patterns of fluorescently tagged hexose transporters and varying the extracellular glucose concentrations in media, authors proposed that pairs of repressors switch their activity depending on extracellular glucose level, and which is matched by the promoters of the hexose transporter genes to achieve optimality of glucose transport.

      This study is elegantly designed and addressed an interesting question. The mechanism (push-pull involving two pairs of repressors) is plausible and justified by the data. Authors also presented a mathematical model and made predictions, which are also verified. We will recommend the publication of this work with minor modifications.

      Major comments:

      This study is well designed and experiments performed accordingly. We have only minor comments for revision.

      Minor comments:

      1. Although authors covered a wide array of literature, but while discussing tradeoffs and nutrient sensing, it will be good to include bacterial growth law and related literature, and physiological level tradeoffs should be discussed. Moreover, authors vouched that the push-pull mechanism helps to circumvent the rate-affinity tradeoff of the transporter, whereas expressing genes to more precisely corelate with the extracellular glucose level brings out physiological optimality. This rate-affinity tradeoff and its physiological role should be discussed clearly.
      2. Authors described the ALCATRAS device in their previous publication, but for better clarity, a supplementary figure with schematic diagram and experimental plan should be included.
      3. Microscopic images of transporter expression pattern should be shown as kymographs in the supplementary, in this version of the manuscript plots from processed microscopy images are shown only.
      4. GFP was used to tag HXT1-7 as mentioned by the authors and expression of these genes are evaluated in separate experiments. We suggest including a schematic diagram describing the experimental design while using the microfluidic device and the experimental plan should be written in more detail in general. We found this part confusing. Did authors considered tagging two separate transporters with different fluorescent tag from either end of the affinity spectrum and showing the expression pattern in one experiment? Authors mentioned co expression of receptors at a particular glucose concentration over time, is this inferred from separate timelapse experiments? This need to be more clearly stated.
      5. Please mark the second phase of media glucose concentration in panel 1C, 1% glucose phase is marked, please mark the other phases for clarity.
      6. For the repressors to sense glucose and to initiate the push pull mechanism, there should be baseline glucose flux, which is not clearly mentioned in the manuscript. Authors mentioned that minimal intracellular glucose in absence of extracellular glucose and deployed a logistic function to increase intracellular glucose. The baseline glucose level is crucial, and authors should comment on this. Also, glucose mediated protection of HXT4 should be discussed in this context.
      7. Figure 3B and 3C, details of the error bars should be mentioned in the figure legend.

      Referee cross-commenting

      All other reviewers also identified this study insightful and interesting, similar to our comments. We also agree with the suggestions made by other reviewers. Suggested changes and modifications can be addressed within a month as mentioned by most of the reviewers. Excellent point raised by other reviewers on technicalities and addressing those points will improve the readability of this work even more.

      Significance

      General assessment:

      Use of innovative microfluidics platform to trap mother cells and following the gene expression pattern by fluorescence microscopy and combining the experimental approach with mathematical model are the strengths of this work. Whereas the proposed push-pull mechanism is not generalizable to other carbons. Model is merely used to fit the data, rather than making interesting predictions. Also how does the mechanism holds when cells are switched from other nutrient sources is also not clear in this work, which are the limitations of this work.

      Advance

      This work involves experimental technique and mathematical model to test the hypothesis. Use of custom-built microfluidics set up and live cell imaging to track gene expression levels in varying nutrient condition. This study links single cell level gene expression pattern to model and predict system level behavior. Nutrient sensing and subsequent rearrangement of gene regulatory network is an important question to address, and the proposed push-pull mechanism in this study adds up to the existing body of literature.

      Audience:

      This work is interdisciplinary and researchers across multiple fields will be interested in this work, including researchers interested in microbial nutrient sensing, systems biology, topology of gene regulatory network, metabolism, and general microbiology.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The yeast Saccharomyces cerevisiae possesses a large family of hexose transporters, the HXT genes. Some of these transporters play known roles in transport to feed metabolism while others seem to respond to glucose levels but have differing cellular functions, acting more as sensors than as drivers of carbon and energy production. The authors use single cell fluidics to monitor steady state expression of specific transporters under controlled glucose levels. The authors then used published information on the regulatory network of HXT4 gene expression to predict expression levels and confirm the role of the prior identified regulators. Thus, this work confirms prior work as to the levels of substrate leading to optimized expression of transporters and confirms the role of the identified regulatory network. The fact that the main single cell fluidics findings confirm the prior culture analyses affirms the utility of the prior work.

      Major Comments:

      1. The analysis uses protein expression levels (HXT4-GFP) as a proxy for transcriptional regulation. This study assumes no regulation of protein expression beyond transcription under steady state conditions. This seems like a reasonable assumption. However, for the dynamic change analysis (Figure 1 C, lines 70-78) loss of GFP-tagged protein from a single cell would be due not just to absence of transcription but also to differential rates of endocytosis and degradation, which could vary across the different HXTs. In cell populations the plasma membrane composition of the bud can be dynamically different from that of the mother cell and will reflect changes in transcription patterns. Meaning that cells with buds might have reduced expression due to the presence of the bud versus non-budding cells. And if buds are washed away during the time course of the experiment this could impact assessment of GFP signal - I am assuming controls were done to address this and should be included in the presentation. Did the authors consider this in their experimental design and interpretation?
      2. The modeling was based upon the assumption of the validity of prior work and observations and authors show that models based upon that prior knowledge work to explain the single cell data. One wonders what perturbing prior modes of action would do to fit the data. That is, if the role of one regulator was downplayed or modified in concert with another would data still fit in a reasonable way? My concern again is that loss of signal (protein) is equated exclusively with transcription and not post-transcriptional regulation. This timeline in 1C and in fig 2 of 20 hours certainly would accommodate post-transcriptional regulation of protein levels.
      3. Lines 142-150: two models are proposed: Std1 activating Snf1 with std1 deletion therefore hyperactivating Mig1. The second model is for Std1 to repress Mig2 with deletion of std1 then leading to hyperactivation of Mig2. It seems this could be directly tested using multiple deletant strains, or modified repressor proteins. For example, is the effect lost in a std1 mig1 double mutant?
      4. Lines 121-122 the comment that comparing expressing GFP from the HXT4 promoter to GFP tagged HXT4 protein allows glucose to protect HXT4 from degradation needs to be explained.
      5. Line 180-186: this is an important analysis - I assume binding sites for repressors/inducers of the HXT genes have been mapped -then the comparison to known promoter structure (lines 214-246) is a great test of the model. It seems the finding are consistent with previously published data on differential regulation of these promoters in full-culture studies.
      6. Lines 293-299: one thing the authors should highlight is the contrast between these single cell studies and prior population studies that are influenced strongly by the heterogeneity between bud and mother cell plasma membrane composition. The mother cell can of course benefit from the differential expression in the daughter cell and the daughter cell benefits from the differential composition of the mother cell. This study shows that mother cells adapt membrane composition as well, but perhaps the potential role of cell membrane protein turnover should also be included.

      no Minor Comments

      Significance

      It has been known for quite some time that glucose transport in the yeast Saccharomyces cerevisiae is dynamically regulated to optimize sugar depletion to sugar metabolism. This intricate system involves a family of hexose transporters of differing affinities for substrate, the timing and level of expression of which is regulated by both eternal hexose levels and internal ability to metabolize keeping cytoplasmic sugar levels low. Since facilitated diffusion systems can transport in both directions, the consumption of substrate assures the direction of uptake will be dominant. The authors demonstrate in this paper that differential expression of the known major regulators of HXT gene expression work in concert to adjust the expression patterns of transporters of differing affinities leading to optimization of hexose uptake. The study monitored changes in single cells and findings confirm prior work conducted in cell populations. One assumption has always been that the mother cell might "sacrifice" itself by not being able to dynamically clear the membrane of environmentally unmatched hexose transporters relying on the altered membrane composition of the bud. This work's focus on "mother cells" demonstrates that regulation still occurs if cells are allowed to reach a steady state. The timeline may be slower than bud adaptation, but these authors confirm that mother cells respond dynamically to glucose levels.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This is a very insightful work showing how to disentangle one of the most complex transcriptional networks in yeast (S. cerevisiae) by combining single-cell dynamics, dynamical-systems modeling, Bayesian-style inference, and genetic perturbations. The authors tackle a problem that has eluded quantitative resolution for over two decades-how yeast regulates its seven primary glucose importer genes (HXT1-HXT7) in response to both steady and temporally changing extracellular [glucose]. Their integrated experimental-theoretical approach delivers the most satisfying mechanistic and quantitative explanation to date, and I enthusiastically recommend this manuscript for publication.

      Yeast relies on seven passive hexose transporters (Hxt1-Hxt7) to import glucose, its preferred sugar; deleting all seven abolishes growth on glucose. The underlying regulatory network is exceptionally intricate, reflecting yeast's evolutionary priority for glucose. Two membrane sensors-Snf3 (high affinity) and Rgt2 (low affinity)-detect extracellular glucose and thereby inactivate two co-repressors, Mth1 and Std1, which modulate the DNA-binding factor Rgt1. Concurrently, intracellular glucose activates the SNF1 kinase, phosphorylating and exporting the repressor Mig1, while Mth1/Std1 also govern the transcription and stability of Mig2, another DNA-binding repressor. Together, Rgt1, Mig1, and Mig2 integrate these inputs to control HXT promoter activity (Fig. 2A). Importantly, Mth1 and Std1 do not directly bind to DNA and this complication - the protein-protein interaction that one cannot get from DNA sequence - is just one source of difficulty that the authors overcame.

      To map the network's behavior, the authors used microfluidic "cages" housing single cells expressing GFP-tagged HXTs, monitoring fluorescence under three constant glucose levels-low (0.01%), medium (0.1%), and high (1%) (Fig. 1B-C). The authors confirm that steady-state Hxt abundances rank by transporter affinity. But the more important and surprising discovery is that when the cells were subjected to gradual glucose up-shifts and down-shifts, they discovered that some transporters transiently spike only when [glucose] rises and others only when [glucose] falls (Fig. 1C and Fig. S1F). This discovery establishes that the HXT network not only "senses" the absolute external [glucose] concentration but also the direction of the temporal change in external [glucose].

      To understand how the regulatory network yields such intricate temporal changes in HXT expression, the authors first focused on the medium-affinity transporter, Hxt4. Targeted knockouts of Mig1/Mig2 versus Mth1/Std1 confirmed that Hxt4 dynamics arise from differential repressor kinetics. To formalize these findings, the authors built an ODE model grounded in literature-based constraints (pg. 13 of the Supplement) with explicit separation of repressor timescales. They rigorously fit the model to wild-type and knockout time series-exploring parameter sensitivity in depth (Fig. S5).

      The authors discovered that their model and experiments converged on a push-pull mechanism: fast-acting Mig1/Mig2 dominate during glucose up-shifts, while slower Mth1/Std1 govern down-shifts, determining whether each HXT gene is repressed or de-repressed (i.e., "who gets there first"). Extending this analysis across all seven HXTs via approximate Bayesian computation revealed the most likely repressor-promoter interactions for each transporter, reducing a vast parameter space to unique or small sets of plausible regulatory schemes. The authors thus revealed what could be happening and which regulations are improbable - a more nuanced and comprehensive view than giving just one outcome for each HXT.

      Overall, this work represents a role model - textbook-worthy - for quantitative systems biology. Beyond the rigor and novelty of its findings, the authors explain complex mathematical concepts with clarity, and the narrative flows logically from experiment to model to inference. This study provides a definitive mechanistic resolution of the HXT network and establishes a broadly applicable framework for dissecting dynamic and complex gene circuits.

      Major points:

      I don't recommend any new experiments or modeling; the major claims are already well supported by the data and models. Below are comments and questions intended to improve clarity and facilitate the reader's understanding. Please feel free to disregard any that you find not sensible or beyond the scope of the current work.

      1. Preconditioning (Fig. 1B-C): What medium were cells in immediately before t = 0? Were they in log-phase or stationary-phase growth just prior to the glucose addition?
      2. Transporter ranking in medium glucose: In the medium [glucose] regime, why is a low-affinity Hxt the second-most highly expressed, rather than the next-highest-affinity transporter? Could co-expression of multiple affinities (e.g., as a bet-hedging strategy) be advantageous? The Discussion section already mentions bet-hedging but I think you could further discuss ideas such as evolutionarily trained "Pavlovian" response or what the 2nd-ranking says about what the yeast anticipates as an upcoming change in the environment.
      3. Defining low/medium/high regimes: Low = 0.01%, Medium = 0.1%, and High = 1%. This is indeed in line with the standard classification of [glucose] in the literature regarding HXTs. But how might your results change at intermediate concentrations - those between these three levels. Using the model, could you comment on whether HXT expression dynamics "sharply" change as a function of either the [glucose]/time or the final concentration of [glucose] after the ramping-up phase?
      4. Rate-affinity trade-off (Lines 18-20): Give a brief explanation of the rate-affinity trade-off. Why does higher affinity necessarily entail a lower maximal transport rate (Vₘₐₓ) for passive transporters? Perhaps you can give an intuitive explanation backed by mass-action kinetics (e.g., to attain a higher affinity, the glucose-binding pocket on Hxt cannot be flipping rapidly back-and-forth between facing cytoplasm and extracellular space -- the binding pocket must allow sufficient time for molecule to find and bind it).
      5. Single-transporter expression (Lines 39-40): It's unclear to me why cells would express only the "optimal" Hxt and suppress all others. For instance, a bet-hedging strategy might favor simultaneous expression of multiple affinities. Consider revising these lines or adding a brief explanation. Related to above is a subtle point I think that was glossed over: there must be a fitness cost associated with making too many copies of Hxtn. After all, why not make as many transporters as possible? Is the cell operating near the upper limit of Hxt abundance, beyond which there's a fitness cost? Is there a pareto-optimal-type front in the space of expression level and another axis? I think this could go into the Discussion section.
      6. Hxt5 exception (Fig. 1B): Although Hxt5 follows a distinct regulatory scheme, it is most highly expressed at medium [glucose] (0.1%), consistent with its affinity like the other Hxts. I think you could mention this in lines 51-58.
      7. Glucose-ramp details (Fig. 1C; Lines 66-67): You state that [glucose] rises from 0 to 1 % over 15 min and reaches 1 % at t = 3 h. However, the actual ramp slope ([glucose]/time) and when the [glucose] starts to increase from zero aren't specified. The Hxt5-GFP behavior and differing Hxt6/7 levels at t = 0 vs. t = 20 h suggest the ramp may begin later than t = 0. Please clarify these details in the caption and main text, and consider adding a [glucose] vs. time schematic above the panel in Fig. 1C (like in Fig. 1B).
      8. Pre-t < 0 incubation (Fig. 1C): Related to point 1, how long were the cells incubated in pyruvate (or other medium) before t < 0? The Hxt6-GFP level at t = 20 h does not match that at t = 0; what is the timescale for Hxt6-GFP and Hxt7-GFP decay to steady state after glucose removal?
      9. Hxt-GFP localization: Does the reported Hxt#-GFP level include fluorescence from both the plasma membrane and internal compartments (e.g., vacuole)? Clarifying which pools of fluorescence are quantified would help interpretation, even if they don't change the main conclusions are unchanged.
      10. Predominantly transcriptional" wording (Lines 90-92): The phrase "...the regulation is predominantly transcriptional" should specify that it refers to the induction of HXT4 transcription during glucose down-ramping, rather than the subsequent decrease in Hxt4-GFP. The experiments do not rule out post-translational regulation (e.g., endocytosis) once glucose levels fall below a threshold.
      11. Glucose "protection" of Hxt4 (Lines 121-122): The statement "we allowed glucose to protect Hxt4 from degradation" is unclear. First, Hxt4-GFP likely degrades at a different rate than free GFP-you could estimate its half-life from Fig. S3. Second, please explain precisely what "protection" means in the model or experiment.
      12. Quantifying repressor kinetics (Lines 158-162): The push-pull mechanism is compelling, but it would be helpful to report the quantitative separation of timescales-e.g., how much faster do Mig1/Mig2 respond compared to Mth1/Std1? Including fold-difference would strengthen this explanation.
      13. Mechanism of repressor regulation (Lines 197-213): Be clearer about whether and how changes in extracellular glucose alter the expression levels of Mth1, Std1, Mig1, and Mig2, as opposed to modulating say, how Mth1 and Std1 bind to Rgt2 protein. I think you could be clearer here about which regulatory steps (transcriptional, post-translational, or binding-affinity changes) are assumed in the model and supported by the data.

      Minor points:

      1. Abstract: Original: "...how an HXT for a medium-affinity transporter can be made to respond like the HXTs for the other transporters." Suggestion: "...how the gene-expression regulation of a medium-affinity HXT can be rewired to respond like that of any other HXT." (You might also generalize beyond "medium-affinity" if the converse holds.)
      2. Lines 64-66: Please emphasize that the "synthetic complete medium" used for pre-conditioning contains no glucose.
      3. Line 143: The phrase "low expression of the std1\Delta strain in glucose" is ambiguous-low expression of which gene or reporter? Please specify.
      4. Line 240: Change "should weakened" to "should weaken."
      5. Fig. S9 caption (typo) Change "Rtg1 sites are..." to "Rgt1 sites are...."

      Hyun Youk.

      Referee cross-commenting

      I agree with the other reviewers' comments. The other reviewers noticed important points I have missed. But like them, I'm still supportive of the work being published with < 1 month spent on revision. I still don't recommend any further experiments or modeling.

      Significance

      This is a very insightful work showing how to disentangle one of the most complex transcriptional networks in yeast (S. cerevisiae) by combining single-cell dynamics, dynamical-systems modeling, Bayesian-style inference, and genetic perturbations. The authors tackle a problem that has eluded quantitative resolution for over two decades-how yeast regulates its seven primary glucose importer genes (HXT1-HXT7) in response to both steady and temporally changing extracellular [glucose]. Their integrated experimental-theoretical approach delivers the most satisfying mechanistic and quantitative explanation to date, and I enthusiastically recommend this manuscript for publication via Review Commons.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03083 Corresponding author(s): David Fay General Statements [optional] This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We greatly appreciate the input of the four reviewers, all of whom carried out a careful reading of our manuscript, provided useful suggestions for improvements, and were enthusiastic about the study including its thoroughness and utility to the field. Because the reviewers required no additional experiments, we were able to address their comments in writing.

      However, in response to a comment from reviewer #4 we decided to add an additional new biological finding to our study given that our functional validation of proximity labeling targets was not extensive. Namely, we now show that a missense mutation affecting BCC-1, one of the top NEKL-MLT interactors identified by our proximity labeling screen, is a causative mutation (together with catp-1) in a strain isolated through a forward genetic screen for suppressors of nekl molting defects (new Fig 9C). This finding, combined with our genetic enhancer tests, further strengthens the functional relevance of proteins identified though our proximity labeling approach and highlights the synergy of proteomics combined with classical genetics.

      Positive statements from reviewers include: Reviewer #1: Overall, this is an outstanding study that will be of great interest to those interested in using proximity labeling to identify interactors of their favorite protein. The experiments are well executed and the data presented in a mostly clear manner.

      Reviewer #2: The key conclusions are convincing, and the work is rigorous. The work provides a clear roadmap to reproducing the data. The experiments are adequately replicated, and statistical analysis is adequate... In many papers, TurboID seems very trivial but this paper clearly highlights the limitations and will be an invaluable resource for labs that want to get proximity labeling established in their labs.

      Reviewer #3: Overall, the claims are solid and conclusions supported. The data and methods are substantial to enable reproducibility in other labs. The experiments have been repeated multiple times with particular attention to statistical analysis. ...This manuscript represents a methodological advance that will likely become an oft-cited reference for members of the C. elegans community and a springboard for other basic biomedical scientists wanting to adapt rigorous proximity labeling techniques to their system.

      Reviewer #4: Fay et al. present a solid, clear and comprehensive BioID-based proteomics study that takes into account and discusses decisive aspects for the (re)production and analysis of high-quality TurboID-based mass spectrometry data. Claims and conclusions are generally well and sufficiently supported by the presented data and illustrated with figures (throughout the text as well as with plenty of supplementary data)... Basic consideration and thoughts for the experimental design and MS data analysis are given in detail and can serve as another guideline for future studies.

      Based on these reviews and comments, we believe that our manuscript is suitable for publication in a high-impact journal. 1. Point-by-point description of the revisions This section is mandatory. Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      *Proximity labeling has become a powerful tool for defining protein interaction networks and has been utilized in a growing number of multicellular model systems. However, while such an approach can efficiently generate a list of potential interactors, knowledge of the most appropriate controls and standardized metrics to judge the quality of the data are lacking. The study by Fay systematically investigates these questions using the C. elegans NIMA kinase family members NEKL-2 and NEKL-2 and their known binding partners MLT-2, MLT-3 and MLT-4. The authors perform eight TurboID experiments each with multiple NEKL and MLT proteins and explore general metrics for assessing experimental outcomes as well as how each of the individual metrics correlates with one another. They also compare technical and biological replicates, explore strategies for identifying false positives and investigate a number of variations in the experimental approach, such as the use of N- versus C-terminal tags, depletion of endogenous biotinylated proteins, combining auxin-inducible degradation, and the use of gene ontology analysis to identify physiological interactors. Finally, the authors validate their findings by demonstrating that a number of the candidate identified functionally interact with NEKL-2 or components of the WASH complex. *

      Overall this is an outstanding study that will be of great interest to those interested in using proximity labeling to identify interactors of their favorite protein. The experiments are well executed and the data presented in a mostly clear manner. I really like this study (particularly because I plan to do a proximity labeling study of my own), but I did come away less than impressed with some of the analysis. This is a data-dense manuscript, and it appears to me that the authors tried to cover so much ground that in some cases very little insight was provided. For instance, the authors promote the use of data independent acquisition (DIA) as compared to the more commonly used data dependent acquisition (DDA). However the authors do not provide any analysis to indicate one approach is better than the other. Likewise the combined use of auxin-induced degradation and proximity labeling is explored but there is very little to take away from these experiments. Despite these issues, I am very enthusiastic about the study as a whole. Below I list major and minor concerns.

      Major concerns * 1. My biggest issue with the manuscript is that a lot is made of the use of data independent acquisition (DIA) as compared to the more commonly used data dependent acquisition (DDA). The authors perform experiments using DIA and DDA approaches but do not directly compare the outcomes. As a result there is really no way to know if one approach is better than the other. I would suggest the authors either perform the necessary analysis to compare the two approaches or tone down their promotion of DIA.* We agree and have scaled back any statements comparing DDA to DIA as our manuscript did not address this directly. We also now point out this caveat in our closing thoughts section, while referencing other studies that compared the two (lines 926-929). Our main point was to convey that DIA worked well for our proximity labeling studies but has seen little use by the model organism field. Surprising (to us), DIA was also considerably less expensive than DDA options.

      2. Line 75, The authors promote the use of data-independent acquisition (DIA) without defining what this approach is and how it differs from the more conventional data-dependent acquisition. As a non-mass spectroscopist, I found myself with lots of question concerning DIA, what it is and how it differs from DDA. I think it would really be helpful to expand the description of DIA and its comparison with DDA in the introduction. As non-mass-spectroscopists ourselves, we understand the reviewer's point. Because the paper is quite long, we were trying to avoid non-essential information. We have now added some information to explain some of the key differences between DDA and DIA. We have also included references for readers who may want to learn more. (lines 77-80)

      Minor concerns: * Line 92 typo. I believe the authors meant to say NEKL-2-MLT-2-MLT-4. * Corrected. (line 95)

      Line169. Is exogenous the correct word to use here? It suggests that you are talking about non-worm proteins, but I know you are not. Corrected. Changed to "Moreover, the detection of biotinylated proteins may be difficult if the bait-TurboID fusion is expressed at low levels..." (line 181).

      Line 177 typo (D) should be (C). Corrected. (line 1122)

      Figure 1C: Lucky Charms may sue you for infringement of their trademarked marshmallow treats. Thank you for picking up on this. The authors accept full responsibility for any resulting lawsuits.

      Figure 1D. The NEKL-2::TurboID band is indicated with a green triangle in the figure but the figure legend states that green triangles indicate mNG::TurboID control. I know this triangle is a shade off the triangle that indicates mNG::TurboID but it's really hard to see the difference. All of the differently colored triangles in panel F are unnecessary. I would either just pick one color for all non-control bait proteins or better yet, only use a triangle to point to bands that are not obvious. For instance I don't need the triangles that point to NEKL-2 -3 and -4 fusion proteins. These are just distracting. We understand the reviewer's point. We colored the triangles to match the colors used for the proteins in the figures. We have now added "bright green triangles with white outlines" (Fig 1 legend) to indicate the Pdpy-7::mNG::TurboID control" and changed triangles in the corresponding figures. Although we would be fine with removing or changing the triangles, we think that they may aid somewhat with clarity.

      Line: 316: Conceivably, another factor that could contribute to the counterintuitive upregulation of some proteins in the N2 samples is related to the fusion proteins that are being expressed in the TurboID lines. A partially functional bait protein (one with a level of activity similar to nekl-2(fd81) that may not result in an obvious phenotype) could directly or indirectly affect gene expression leading to lower levels of a subset of proteins in the TurboID samples. The same could be said for fusion proteins with a gain-of-function effect. This is an interesting idea, and we tested this possibility by looking for consistent overlap between N2-up proteins between biological replates of individual bait proteins. We now include a representative Venn diagram in S3C Fig to highlight this comparison. In summary, although we cannot rule out this possibility, our analysis did not support the widespread occurrence of this effect in our study. We also made certain that our statement regarding N2 up proteins was not too definitive. (lines 285-288)

      *Fig 3 B-E. I am a little confused how the data in these graphs is normalized. For instance, I would have expected that for NEKL-3 in panel B, that the normalized (log2) intensity value in N2 be set at 0 as it is for NEKL-2. Maybe I just don't have enough information on how these plots were generated. * The difference is that in the N2 sample, NEKL-3 was detected but NEKL-2 was not. The numbers themselves are assigned by the Spectronaut software used to quantify the DIA results but are not meaningful beyond indicating relative amounts (intensity values) of a given protein within an individual biological experiment. We've added some lines to the figure legend to make this clearer. (lines 1165-1169)

      *Figure 6C legend is not correct. * Corrected. (line 1214)

      Line 575: Figure reference should be Fig. S5G. The authors should check to make sure all references to supplemental figures include correct panel information. Corrected. (line 464) In addition, we have now gone through the manuscript and added panel numbers references where applicable. Note that the addition of a new supplemental file has shifted the numbering.

      Line 576. The authors reference a study by Artan and colleagues and report a weak correlation between their study and that of Artan. They reference figure S4 but it should be Fig S5H. Apologies and many thanks to the reviewer for catching these errors. (line 464)

      Line 652. The authors note that numerous proteins were present at substantially reduced levels in the mNG::TurboID samples and suggest that sticky proteins may have been outcompeted or otherwise excluded from beads incubated with the mNG::TurboID lysates. Why would sticky proteins only be a problem in these samples? The reasoning is not clear to me. The idea was that in the sample with very high levels of biotinylated proteins (mNG::TurboID), the surface of the beads might become saturated with high-affinity biotinylated proteins. This could prevent or out complete the binding of random proteins that are not biotinylated but nevertheless have some affinity to the beads ("sticky" proteins). We have reworded this section to make this clearer. (lines 546-550)

      Line 745: The term "bait overlaps" is a bit vague. Ultimately, I figured out what it meant but it was not immediately obvious. We have changed this to "overlap between baits" and made this section clearer. (line 624-628)

      *S7B Fig. Why is actin missing from the eluate? * In S7B we refer to the purified eluate as the "eluate", which may have caused some confusion. In other sections of the manuscript, we refer to the bead-bound proteins as the "purified eluate" (Figs 1 and 5). For the purified eluate a portion of the streptavidin beads are boiled in sample buffer to elute the bound proteins before running a western. Actin would not be expected in these samples because it's (presumably) not biotinylated in our samples and doesn't detectably bind the beads. This result was seen in all relevant westerns in S1 Data. For consistency, however, we've gone through all our files to make sure we consistently use the term "purified eluate" versus "eluate", which is less specific.

      L*ine 873: The authors state the extent of overlap in GO terms between the various experiments and provide percentages. I tried to extract this information from Figure 8C and came up with different values. For instance, in the case of Molecular Function, they state that they observed a 54% overlap between NEKL-2 and NEKL-3 but in the Venn diagram in Figure 8C I see that the NEKL-2 and NEKL-3 experiments had 71 (25+46) GO terms in common. Out of 98 GO terms for NEKL-2 or 104 for NEKL-3 the percentage I got is closer to 72. Am I analyzing this correctly? * Thanks for checking this. We believe our method for calculating the percent overlap is correct. In the case of NEKL-2/NEKL-3 overlap for Molecular Function, there are 131 total unique terms, of which 71 overlap, giving a 54% overlap. In the case of NEKL-2/NEKL-3 overlap for Biological Process, however, we made an error in arithmetic (415 unique, 239 overlap), such that the correct percentage is 58%, which we have corrected in the text.

      *Reviewer #1 (Significance (Required)): *

      *Overall this is an outstanding study that will be of great interest to those interested in using proximity labeling to identify interactors of their favorite protein. The experiments are well executed and the data presented in a mostly clear manner. I really like this study (particularly because I plan to do a proximity labeling study of my own), but I did come away less than impressed with some of the analysis. This is a data-dense manuscript, and it appears to me that the authors tried to cover so much ground that in some cases very little insight was provided. For instance, the authors promote the use of data independent acquisition (DIA) as compared to the more commonly used data dependent acquisition (DDA). However the authors do not provide any analysis to indicate one approach is better than the other. Likewise the combined use of auxin-induced degradation and proximity labeling is explored but there is very little to take away from these experiments. Despite these issues, I am very enthusiastic about the study as a whole. *

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)): *

      *This study expanded the use of data-independent acquisition-mass spectrometry (DIA-MS) in TurboID proximity-labeling proteomics to identify novel interactors of NEKL-2, NEKL-3, MLT-2, MLT-3, and MLT-4 complexes in C. elegans. The authors described several useful metrics to evaluate the quality of TurboID experiments, such as using the percentage of upregulated genes, the percentage of proteins present only in bait-TurboID experiments as compared to N2 controls, and the percentage of endogenously biotinylated carboxylases as internal controls. Further, the authors introduced methodological variability across 23 TurboID experiments and evaluated any improvement to the resulting data, such as N-terminally tagging bait proteins with TurboID, depleting endogenous carboxylases, and auxin-inducible degradation of known complex members. Finally, this study identified the kinase folding chaperone CDC-37 and the WASH complex component DDL-2 as novel interactors with the NEKL-MLT complexes through an RNAi-based enhancer approach following their identification by TurboID. *

      Major comments: * The key conclusions are convincing, and the work is rigorous. The work provides a clear roadmap to reproducing the data. The experiments are adequately replicated, and statistical analysis is adequate. We only have minor comments.*

      Minor comments: * •In the western blot in Fig 1 why does the mNG::Turbo have two bands? * Thank you for point this out. To our knowledge this is a breakdown product that was especially prevalent in replicate 3 (also see S1 Data), which we chose to shown because all the NEKL-MLTs were clearly visible in this western. The expected size of the mNeonGreen::TurboID (including linker and tags) is ~68 kDa and our blots are roughly consistent those of Artan et al., (2001). This lower band was not evident in Exp 8. We have now included a statement in the figure legend to indicate that the upper band is the full-length protein whereas the lower band is likely to be a breakdown product (lines 1141-1142).

      •Fig 2B is difficult to parse as a reader. Columns labeled "Upreg," "Downreg," "TurboID only," "N2 only," "Filter-1," "Filter-2," and "Epi %" could be moved to Supplemental. Fold change vs N2 could be represented as a bar chart, allowing for trends between fold change and the metrics Upreg %, Turbo %, and Carboxylase % to be seen more clearly. Further, rows headed "Carboxylase depletion," "DDA," and "Auxin treated" could be presented as separate panels to better match the distinct points made in the text. After serious consideration we have made several changes including the addition of S2 Fig, which may provide readers with a better visual representation of the bait and prey fold changes observed in all our experiments. However, we feel that the detailed data embedded in Fig 2 is the most concise and accurate means by which to convey our full results and is key to our methodological conclusions. As such we did not want to relegate this information to a supplemental table. We note that this figure was not found to be problematic by other reviewers, although we do understand the points made by this reviewer.

      •Line 179: in vivo should be italicized Because journals differ in their stylistic practices, we are currently waiting before doing our final formatting. We did keep our use of Latin phrases consistently non-italicized in the draft.

      •Lines 215-217: The comparison between Western blot expression levels and prior fluorescent reporter levels is unclear. Could be reformatted to make it clearer that relative expression of the different NEKL-MLTs in this study is consistent with prior data. We reformatted this sentence to improve clarity. (lines 205-207)

      *•Lines 267-268: The final line of the passage is unclear and can be removed. * This sentence has been removed.

      •Lines 311-313: This study is able to use the recovery of bait and known interactor proteins as internal controls to determine the quality of each experiment, but this may not always be the case for other users' experiments. The authors should comment on how Upreg %, a value influenced by many factors, can actually be used as a quality check when a bait protein has no known interactors. We have added language to highlight this point. (lines 344-348)

      *•Line 702: There is a [new REF] that should be removed * As described above, we have now included this finding on bcc-1 as part of this manuscript (Fig 9C).

      •The approach used mixed stage animals, but some genes oscillate or are transiently expressed. Please discuss cost-benefit of mixed stage vs syncing. This is an important point. We have added a discussion on the benefits and drawbacks of using mixed stages to the discussion. (lines 901-911)

      *•Authors were working on hypodermally expressed proteins. It would be valuable to discuss what tissues are amenable to TurboID. Ie are the cases where there are few cells (anchor cell, glial sockets, etc) that it will be extremely challenging to perform this technique * We agree that certain tissues/proteins will not be amenable to proximity labeling. We believe that we have addressed this point together with the above comment throughout the manuscript and now on lines 936-940.

      •Authors mention approaches such as nanobodies, split Turbo. Based on their experiences it would be valuable to add Discussion on strengths and weaknesses of these approaches to guide folks considering TurboID and DIA-MS experiments in C. elegans Because we have not tested these methods, we feel that we cannot provide a great deal of insight into these alternate approaches. We mention and reference these methods in the introduction so that readers are aware of them.

      *Reviewer #2 (Significance (Required)): *

      •Advance in technique: This study expands the use cases of data-independent acquisition MS method (DIA-MS) in C. elegans, which fragments all ions independent of the initial MS1 data. The benefits of this approach include better reproducibility across technical replicates and better recovery of low abundance peptides, which are critical for advancing our ability to capture weak and transient interactions.

      •The use of DIA-MS in this study has improved our understanding of the partners of these NEKL-MLTs in membrane trafficking, molting, and cell adhesion within the epidermis.

      •In many papers, TurboID seems very trivial but this paper clearly highlights the limitations and will be an invaluable resource for labs that want to get proximity labeling established in their labs.

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)): *

      *Summary: *

      Fay and colleagues perform a series of proximity labeling experiments in C. elegans followed by thorough and rational analysis of the resulting biotinylated proteins identified by LC-MS/MS. The overall goals of the study are to evaluate different techniques and provide practical guidance on how to achieve success. The major takeaways are that integration of data-independent acquisition (DIA) along with comparison of endogenously tagged TurboID alleles to soluble TurboID expressed in the same tissue results in improved detection of bona-fide interactors and reduced numbers of false-positives.

      *Major comments: *

      Overall the claims are solid and conclusions supported. The data and methods are substantial to enable reproducibility in other labs. The experiments have been repeated multiple times with particular attention to statistical analysis. I have no major concerns with the manuscript and focus primarily on improving the accessibility of this important contribution to the scientific community. As such, I suggest that the authors:

      1) Provide more explanation of and rationale for using DIA. This is not yet a standard technique and most basic biomedical scientists will be unaware of the jargon. As I expect many labs in the C. elegans community and beyond will be interested in the guidance provided in this manuscript, the introduction offers a great opportunity to bring the reader up to speed, as opposed to sending them to the complicated proteomics analysis literature. We have added some additional context (lines 77-80) as well as new references. We note that getting into the technical differences between DIA and DDA, beyond what we briefly mention, would take a substantial amount of space, may not be of interest to many readers, and can be found through standard internet and (sigh) AI-based searches.

      *2) Provide a better overview of the various protocols tested (Experiments 1-8). Maybe at the beginning of the results, and maybe with an accompanying schematic. As currently written, it is difficult to figure out details regarding how the experiments vary and why. * We have now added a short paragraph to better inform the reader at the front end regarding the major experiments. (lines 139-146).

      3) As to be expected, expression of TurboID tags at endogenous levels via low abundance proteins in a complex multicellular system results in somewhat weak signals that flirt with the limit of detection. Perhaps by combining tagged alleles within the same complex (NEKL-3/MLT-3 or NEKL-2/MLT-2/MLT-4) the signals could be boosted? Tandem tags, either on one end or multiple ends of proteins might help as well. As the authors point out, a benefit of tagging the two NEKL-MLT complexes is that there are strong loss-of-function phenotypes (lethal molting defects) to help evaluate whether a tagging strategy results in a non-functional complex. THESE EXPERIMENTS ARE OPTIONAL and might simply be discussed at the authors discretion. These are interesting ideas that we have now incorporated into our discussion. (lines 936-940)

      *Minor Comments: *

      *1) Figure 3A is cropped on the right. * Thank you for catching this. Corrected.

      *2) Better define [new REF] on line 702. * We have added new results (Fig 9C), obviating the need for this reference.

      ***Referee cross-comments** *

      Overall, I am in agreement with, and supportive of, the other reviewers' comments.

      *Reviewer #3 (Significance (Required)): *

      *Significance: *

      Proximity labeling is often proposed as a technique to determine interaction networks of proteins in vivo, but in practice it remains challenging for most labs to execute a successful experiment, especially within the context of multicellular model organisms. Fay and colleagues provide a much needed roadmap for how to best approach proximity labeling experiments in C. elegans that will likely apply to other model systems.

      They establish a rigorous approach by choosing to endogenously tag components of two essential NEKL-MLT complexes required for C. elegans molting. These complexes are relatively low abundance as they are only expressed in a single cell type, the hyp7 epidermal syncytium. In addition, as inactivation of any member of the complexes results in molting defects, they have a powerful selection for functional tags. Thus, they have set a high bar for themselves in order to discern whether a given variation on the experimental approach results in improved detection of interactors and fewer false positives.

      *Potential areas for improvement include lowering the expression level of the skin-specific soluble TurboID used to determine non-specific biotinylation events. This control results in much higher levels of biotinylation compared to the TurboID-tagged NEKL-MLT alleles and likely affects their analysis, which they openly admit. In addition, to reduce the high level of background biotinylation signals generated by endogenous carboxylases, they adopt a depletion strategy pioneered by other researchers but this does not offer major improvements in detection of specific signals. The source of these conflicting results remains to be determined. It is also curious that auxin-inducible degradation of components of the NEKL-MLT complexes did not robustly alter the resulting biotinylating capacity of other members. This approach should be evaluated in subsequent studies. Finally, as mentioned in Major Comment #3 (above), it would be interesting to see if combining TurboID tags within the same complex might improve signal-to-background ratios. *

      This manuscript represents a methodological advance that will likely become an oft-cited reference for members of the C. elegans community and a springboard for other basic biomedical scientists wanting to adapt rigorous proximity labeling techniques to their system. I am a cell biologist that uses a variety of genetic, molecular and biochemical approaches, mostly centered around C. elegans. I have used LC/MS-MS in our studies but have relatively little expertise in evaluating all aspects of proteomic pipelines.

      *Reviewer #4 (Evidence, reproducibility and clarity (Required)): *

      *Fay et al. describe an extensive proximity labeling BioID study in C. elegans with TurboID and DIA-LCMS analysis. They chose the NEKL-2/3 kinases and their known interactors MLT-2/3/4 as TurboID-fused bait proteins (C- and partially N-terminal fusions encoded from CRISPR-mediated genome edited genes). With eight biological replicates (and three to four technical replicates each) and with the unmodified wildtype or mNeonGreen-TurboID expressing worms as controls, a comprehensive dataset was generated. Although starting from quite different abundances of the bait-fusions within the cell lysates all bait proteins and known complex-binding partners were convincingly enriched with capturing streptavidin beads after only one hour of incubation with the lysate. This confirms the general applicability of TurboID-BioID approach in C. elegans. The BioID method typically gives rise to large proteomics datasets (up to more than thousand proteins identified after biotin capture) with several tens to hundreds enriched proteins (against negative control strains) as potential proteins that localize proximal to the bait-TurboID protein. However, substantial variations of candidates between biological replicates are frequently observed in BioID experiments. The authors scrutinized their dataset towards indicative metrics, filters and cutoffs in order to separate high-confidence from low-confidence candidates. With the workflow applied the authors melt down the number of candidates to 15 proteins that were grouped in four functional groups reasonably associated to NEKL-MLT function. *

      Successful BioID experiments depend on reliable enrichment quantification with mass spectrometry using control cell lines that require a carefully bait-tailored design. Those must adequately express TurboID controls matching the abundance of the bait-TurboID fusion protein and its biotinylation activity. After affinity capture, sample preparation and LCMS data acquisition there is no silver bullet towards the identification true bait neighbors. Fay et al. elaborately describe their considerations and workflow towards high-confidence candidates. The workflow considered (i) data analysis with Volcano plots to account for statistical reproducibility of biological replicates against negative controls, (ii) fraction of proteins only detected in the positive or negative controls thus evading the fold-enrichment quantification approach, (iii) evaluation of variations in carboxylase enrichment as a measure for variations in the general biotin capture quality between experiments, (iv) an assessment of technical reproducibility with scatter plots and Venn diagrams, (v) exclusion of potentially false positives, e.g. promiscuously biotinylated non-proximal proteins, through comparisons with control worms expressing a non-localized mNeonGreen-TurboID fusion protein, (vi) batch effects, (vii) the impact of endogenous biotinylated carboxylases through depletion, (viii) gene ontology analysis of enriched proteins, (ix) weighing data according to the quality of individual experiments according to the afore mentioned metrics, and finally (x) genetic interaction studies to functionally associate high-confidence candidates with the bait.

      *Major comments: *

      Fay et al. present a solid, clear and comprehensive BioID-based proteomics study that takes into account and discusses decisive aspects for the (re)production and analysis of high-quality TurboID-based mass spectrometry data. Claims and conclusions are generally well and sufficiently supported by the presented data and illustrated with figures (throughout the text as well as with plenty of supplementary data). However, although the authors claim to seek for substrates of the kinase complex they drew no further attention to the phosphorylation status of the captured proteins. Haven't the MS data been analyzed in this respect? Information regarding this issue would enhance the manuscript. Data generation and method description appear reproducible for readers. Also, the statistical analyses appear adequate. The authors should also consider to deposit their MS raw and analysis data in a public repository (e.g. PRIDE) for future reviewing processes and as reference data for readers and followers. Our raw MS data have been deposited by the Arkansas Proteomics Facility. I have followed up to ensure that they are publicly available.

      *Minor comments: *

      The authors should combine supplementary data files to reduce the number of single files readers have to deal with. We have combined these files as suggested.

      The authors should avoid the term "upregulation" or "increased biotinylation" when capture enrichment is meant. We agree with reviewer's point. We now use the terms enriched versus reduced or up versus down, depending on the context, and clearly define these terms. These changes have been incorporated throughout the manuscript.

      *Reviewer #4 (Significance (Required)): *

      The manuscript presents a robust BioID proteomics screening for co-localizing proteins of NEKL-2/3 kinases and their known interactors MLT-2/3/4. The ongoing validation of their functional interactions and whether the protein candidates reflect phosphorylation substrates or else remains elusive and is announced for upcoming manuscripts. The knowledge gain in terms of molecular mechanisms with NEKL-2/3 MLT-2/3/4 involvement in C. elegans is therefore limited to a table of - promising - interacting candidates that have to be studied further. Information about the phosphorylation status of the captured proteins from the MS data are not given. However, knowing the protein candidates will be of interest for groups working with these complexes (or the identified potentially interacting proteins) either in C. elegans or any other organism. Also, in-depth proteomics screenings with novel approaches such as BioID have to be established for individual organisms. For C. elegans there is only one prior BioID publication (Holzer et al. 2022). Many of the aspects discussed here have also been addressed earlier for BioIDs in other organisms and are not principally new. However, the presented study can be of conceptual interest for labs delving into or entangled with the BioID method in C. elegans or other organisms. The study addresses especially proteomics groups working on protein-protein interactions using proximity labeling/MS approaches. Basic consideration and thoughts for the experimental design and MS data analysis are given in detail and can serve as another guideline for future studies.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Fay et al. describe an extensive proximity labeling BioID study in C. elegans with TurboID and DIA-LCMS analysis. They chose the NEKL-2/3 kinases and their known interactors MLT-2/3/4 as TurboID-fused bait proteins (C- and partially N-terminal fusions encoded from CRISPR-mediated genome edited genes). With eight biological replicates (and three to four technical replicates each) and with the unmodified wildtype or mNeonGreen-TurboID expressing worms as controls, a comprehensive dataset was generated. Although starting from quite different abundances of the bait-fusions within the cell lysates all bait proteins and known complex-binding partners were convincingly enriched with capturing streptavidin beads after only one hour of incubation with the lysate. This confirms the general applicability of TurboID-BioID approach in C. elegans. The BioID method typically gives rise to large proteomics datasets (up to more than thousand proteins identified after biotin capture) with several tens to hundreds enriched proteins (against negative control strains) as potential proteins that localize proximal to the bait-TurboID protein. However, substantial variations of candidates between biological replicates are frequently observed in BioID experiments. The authors scrutinized their dataset towards indicative metrics, filters and cutoffs in order to separate high-confidence from low-confidence candidates. With the workflow applied the authors melt down the number of candidates to 15 proteins that were grouped in four functional groups reasonably associated to NEKL-MLT function.

      Successful BioID experiments depend on reliable enrichment quantification with mass spectrometry using control cell lines that require a carefully bait-tailored design. Those must adequately express TurboID controls matching the abundance of the bait-TurboID fusion protein and its biotinylation activity. After affinity capture, sample preparation and LCMS data acquisition there is no silver bullet towards the identification true bait neighbors. Fay et al. elaborately describe their considerations and workflow towards high-confidence candidates. The workflow considered (i) data analysis with Volcano plots to account for statistical reproducibility of biological replicates against negative controls, (ii) fraction of proteins only detected in the positive or negative controls thus evading the fold-enrichment quantification approach, (iii) evaluation of variations in carboxylase enrichment as a measure for variations in the general biotin capture quality between experiments, (iv) an assessment of technical reproducibility with scatter plots and Venn diagrams, (v) exclusion of potentially false positives, e.g. promiscuously biotinylated non-proximal proteins, through comparisons with control worms expressing a non-localized mNeonGreen-TurboID fusion protein, (vi) batch effects, (vii) the impact of endogenous biotinylated carboxylases through depletion, (viii) gene ontology analysis of enriched proteins, (ix) weighing data according to the quality of individual experiments according to the afore mentioned metrics, and finally (x) genetic interaction studies to functionally associate high-confidence candidates with the bait.

      Major comments:

      Fay et al. present a solid, clear and comprehensive BioID-based proteomics study that takes into account and discusses decisive aspects for the (re)production and analysis of high-quality TurboID-based mass spectrometry data. Claims and conclusions are generally well and sufficiently supported by the presented data and illustrated with figures (throughout the text as well as with plenty of supplementary data). However, although the authors claim to seek for substrates of the kinase complex they drew no further attention to the phosphorylation status of the captured proteins. Haven't the MS data been analyzed in this respect? Information regarding this issue would enhance the manuscript. Data generation and method description appear reproducible for readers. Also, the statistical analyses appear adequate. The authors should also consider to deposit their MS raw and analysis data in a public repository (e.g. PRIDE) for future reviewing processes and as reference data for readers and followers.

      Minor comments:

      The authors should combine supplementary data files to reduce the number of single files readers have to deal with. The authors should avoid the term "upregulation" or "increased biotinylation" when capture enrichment is meant.

      Significance

      The manuscript presents a robust BioID proteomics screening for co-localizing proteins of NEKL-2/3 kinases and their known interactors MLT-2/3/4. The ongoing validation of their functional interactions and whether the protein candidates reflect phosphorylation substrates or else remains elusive and is announced for upcoming manuscripts. The knowledge gain in terms of molecular mechanisms with NEKL-2/3 MLT-2/3/4 involvement in C. elegans is therefore limited to a table of - promising - interacting candidates that have to be studied further. Information about the phosphorylation status of the captured proteins from the MS data are not given. However, knowing the protein candidates will be of interest for groups working with these complexes (or the identified potentially interacting proteins) either in C. elegans or any other organism. Also, in-depth proteomics screenings with novel approaches such as BioID have to be established for individual organisms. For C. elegans there is only one prior BioID publication (Holzer et al. 2022). Many of the aspects discussed here have also been addressed earlier for BioIDs in other organisms and are not principally new. However, the presented study can be of conceptual interest for labs delving into or entangled with the BioID method in C. elegans or other organisms. The study addresses especially proteomics groups working on protein-protein interactions using proximity labeling/MS approaches. Basic consideration and thoughts for the experimental design and MS data analysis are given in detail and can serve as another guideline for future studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Fay and colleagues perform a series of proximity labeling experiments in C. elegans followed by thorough and rational analysis of the resulting biotinylated proteins identified by LC-MS/MS. The overall goals of the study are to evaluate different techniques and provide practical guidance on how to achieve success. The major takeaways are that integration of data-independent acquisition (DIA) along with comparison of endogenously tagged TurboID alleles to soluble TurboID expressed in the same tissue results in improved detection of bona-fide interactors and reduced numbers of false-positives.

      Major comments:

      Overall the claims are solid and conclusions supported. The data and methods are substantial to enable reproducibility in other labs. The experiments have been repeated multiple times with particular attention to statistical analysis. I have no major concerns with the manuscript and focus primarily on improving the accessibility of this important contribution to the scientific community. As such, I suggest that the authors:

      1. Provide more explanation of and rationale for using DIA. This is not yet a standard technique and most basic biomedical scientists will be unaware of the jargon. As I expect many labs in the C. elegans community and beyond will be interested in the guidance provided in this manuscript, the introduction offers a great opportunity to bring the reader up to speed, as opposed to sending them to the complicated proteomics analysis literature.
      2. Provide a better overview of the various protocols tested (Experiments 1-8). Maybe at the beginning of the results, and maybe with an accompanying schematic. As currently written, it is difficult to figure out details regarding how the experiments vary and why.
      3. As to be expected, expression of TurboID tags at endogenous levels via low abundance proteins in a complex multicellular system results in somewhat weak signals that flirt with the limit of detection. Perhaps by combining tagged alleles within the same complex (NEKL-3/MLT-3 or NEKL-2/MLT-2/MLT-4) the signals could be boosted? Tandem tags, either on one end or multiple ends of proteins might help as well. As the authors point out, a benefit of tagging the two NEKL-MLT complexes is that there are strong loss-of-function phenotypes (lethal molting defects) to help evaluate whether a tagging strategy results in a non-functional complex. THESE EXPERIMENTS ARE OPTIONAL and might simply be discussed at the authors discretion.

      Minor Comments:

      1. Figure 3A is cropped on the right.
      2. Better define [new REF] on line 702.

      Referee cross-comments

      Overall, I am in agreement with, and supportive of, the other reviewers' comments.

      Significance

      Proximity labeling is often proposed as a technique to determine interaction networks of proteins in vivo, but in practice it remains challenging for most labs to execute a successful experiment, especially within the context of multicellular model organisms. Fay and colleagues provide a much needed roadmap for how to best approach proximity labeling experiments in C. elegans that will likely apply to other model systems.

      They establish a rigorous approach by choosing to endogenously tag components of two essential NEKL-MLT complexes required for C. elegans molting. These complexes are relatively low abundance as they are only expressed in a single cell type, the hyp7 epidermal syncytium. In addition, as inactivation of any member of the complexes results in molting defects, they have a powerful selection for functional tags. Thus, they have set a high bar for themselves in order to discern whether a given variation on the experimental approach results in improved detection of interactors and fewer false positives.

      Potential areas for improvement include lowering the expression level of the skin-specific soluble TurboID used to determine non-specific biotinylation events. This control results in much higher levels of biotinylation compared to the TurboID-tagged NEKL-MLT alleles and likely affects their analysis, which they openly admit. In addition, to reduce the high level of background biotinylation signals generated by endogenous carboxylases, they adopt a depletion strategy pioneered by other researchers but this does not offer major improvements in detection of specific signals. The source of these conflicting results remains to be determined. It is also curious that auxin-inducible degradation of components of the NEKL-MLT complexes did not robustly alter the resulting biotinylating capacity of other members. This approach should be evaluated in subsequent studies. Finally, as mentioned in Major Comment #3 (above), it would be interesting to see if combining TurboID tags within the same complex might improve signal-to-background ratios.

      This manuscript represents a methodological advance that will likely become an oft-cited reference for members of the C. elegans community and a springboard for other basic biomedical scientists wanting to adapt rigorous proximity labeling techniques to their system. I am a cell biologist that uses a variety of genetic, molecular and biochemical approaches, mostly centered around C. elegans. I have used LC/MS-MS in our studies but have relatively little expertise in evaluating all aspects of proteomic pipelines.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study expanded the use of data-independent acquisition-mass spectrometry (DIA-MS) in TurboID proximity-labeling proteomics to identify novel interactors of NEKL-2, NEKL-3, MLT-2, MLT-3, and MLT-4 complexes in C. elegans. The authors described several useful metrics to evaluate the quality of TurboID experiments, such as using the percentage of upregulated genes, the percentage of proteins present only in bait-TurboID experiments as compared to N2 controls, and the percentage of endogenously biotinylated carboxylases as internal controls. Further, the authors introduced methodological variability across 23 TurboID experiments and evaluated any improvement to the resulting data, such as N-terminally tagging bait proteins with TurboID, depleting endogenous carboxylases, and auxin-inducible degradation of known complex members. Finally, this study identified the kinase folding chaperone CDC-37 and the WASH complex component DDL-2 as novel interactors with the NEKL-MLT complexes through an RNAi-based enhancer approach following their identification by TurboID.

      Major comments:

      The key conclusions are convincing, and the work is rigorous. The work provides a clear roadmap to reproducing the data. The experiments are adequately replicated, and statistical analysis is adequate. We only have minor comments.

      Minor comments:

      • In the western blot in Fig 1 why does the mNG::Turbo have two bands?
      • Fig 2B is difficult to parse as a reader. Columns labeled "Upreg," "Downreg," "TurboID only," "N2 only," "Filter-1," "Filter-2," and "Epi %" could be moved to Supplemental. Fold change vs N2 could be represented as a bar chart, allowing for trends between fold change and the metrics Upreg %, Turbo %, and Carboxylase % to be seen more clearly. Further, rows headed "Carboxylase depletion," "DDA," and "Auxin treated" could be presented as separate panels to better match the distinct points made in the text.
      • Line 179: in vivo should be italicized
      • Lines 215-217: The comparison between Western blot expression levels and prior fluorescent reporter levels is unclear. Could be reformatted to make it clearer that relative expression of the different NEKL-MLTs in this study is consistent with prior data.
      • Lines 267-268: The final line of the passage is unclear and can be removed.
      • Lines 311-313: This study is able to use the recovery of bait and known interactor proteins as internal controls to determine the quality of each experiment, but this may not always be the case for other users' experiments. The authors should comment on how Upreg %, a value influenced by many factors, can actually be used as a quality check when a bait protein has no known interactors.
      • Line 702: There is a [new REF] that should be removed
      • The approach used mixed stage animals, but some genes oscillate or are transiently expressed. Please discuss cost-benefit of mixed stage vs syncing.
      • Authors were working on hypodermally expressed proteins. It would be valuable to discuss what tissues are amenable to TurboID. Ie are the cases where there are few cells (anchor cell, glial sockets, etc) that it will be extremely challenging to perform this technique
      • Authors mention approaches such as nanobodies, split Turbo. Based on their experiences it would be valuable to add Discussion on strengths and weaknesses of these approaches to guide folks considering TurboID and DIA-MS experiments in C. elegans

      Significance

      • Advance in technique: This study expands the use cases of data-independent acquisition MS method (DIA-MS) in C. elegans, which fragments all ions independent of the initial MS1 data. The benefits of this approach include better reproducibility across technical replicates and better recovery of low abundance peptides, which are critical for advancing our ability to capture weak and transient interactions.
      • The use of DIA-MS in this study has improved our understanding of the partners of these NEKL-MLTs in membrane trafficking, molting, and cell adhesion within the epidermis.
      • In many papers, TurboID seems very trivial but this paper clearly highlights the limitations and will be an invaluable resource for labs that want to get proximity labeling established in their labs
    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Proximity labeling has become a powerful tool for defining protein interaction networks and has been utilized in a growing number of multicellular model systems. However, while such an approach can efficiently generate a list of potential interactors, knowledge of the most appropriate controls and standardized metrics to judge the quality of the data are lacking. The study by Fay systematically investigates these questions using the C. elegans NIMA kinase family members NEKL-2 and NEKL-2 and their known binding partners MLT-2, MLT-3 and MLT-4. The authors perform eight TurboID experiments each with multiple NEKL and MLT proteins and explore general metrics for assessing experimental outcomes as well as how each of the individual metrics correlates with one another. They also compare technical and biological replicates, explore strategies for identifying false positives and investigate a number of variations in the experimental approach, such as the use of N- versus C-terminal tags, depletion of endogenous biotinylated proteins, combining auxin-inducible degradation, and the use of gene ontology analysis to identify physiological interactors. Finally, the authors validate their findings by demonstrating that a number of the candidate identified functionally interact with NEKL-2 or components of the WASH complex.

      Overall this is an outstanding study that will be of great interest to those interested in using proximity labeling to identify interactors of their favorite protein. The experiments are well executed and the data presented in a mostly clear manner. I really like this study (particularly because I plan to do a proximity labeling study of my own), but I did come away less than impressed with some of the analysis. This is a data-dense manuscript, and it appears to me that the authors tried to cover so much ground that in some cases very little insight was provided. For instance, the authors promote the use of data independent acquisition (DIA) as compared to the more commonly used data dependent acquisition (DDA). However the authors do not provide any analysis to indicate one approach is better than the other. Likewise the combined use of auxin-induced degradation and proximity labeling is explored but there is very little to take away from these experiments. Despite these issues, I am very enthusiastic about the study as a whole. Below I list major and minor concerns.

      Major concerns

      1. My biggest issue with the manuscript is that a lot is made of the use of data independent acquisition (DIA) as compared to the more commonly used data dependent acquisition (DDA). The authors perform experiments using DIA and DDA approaches but do not directly compare the outcomes. As a result there is really no way to know if one approach is better than the other. I would suggest the authors either perform the necessary analysis to compare the two approaches or tone down their promotion of DIA.
      2. Line 75, The authors promote the use of data-independent acquisition (DIA) without defining what this approach is and how it differs from the more conventional data-dependent acquisition. As a non-mass spectroscopist, I found myself with lots of question concerning DIA, what it is and how it differs from DDA. I think it would really be helpful to expand the description of DIA and its comparison with DDA in the introduction.

      Minor concerns:

      Line 92 typo. I believe the authors meant to say NEKL-2-MLT-2-MLT-4.

      Line169. Is exogenous the correct word to use here? It suggests that you are talking about non-worm proteins, but I know you are not.

      Line 177 typo (D) should be (C).

      Figure 1C: Lucky Charms may sue you for infringement of their trademarked marshmallow treats.

      Figure 1D The NEKL-2::TurboID band is indicated with a green triangle in the figure but the figure legend states that green triangles indicate mNG::TurboID control. I know this triangle is a shade off the triangle that indicates mNG::TurboID but it's really hard to see the difference. All of the differently colored triangles in panel F are unnecessary. I would either just pick one color for all non-control bait proteins or better yet, only use a triangle to point to bands that are not obvious. For instance I don't need the triangles that point to NEKL-2 -3 and -4 fusion proteins. These are just distracting.

      Line: 316: Conceivably, another factor that could contribute to the counterintuitive upregulation of some proteins in the N2 samples is related to the fusion proteins that are being expressed in the TurboID lines. A partially functional bait protein (one with a level of activity similar to nekl-2(fd81) that may not result in an obvious phenotype) could directly or indirectly affect gene expression leading to lower levels of a subset of proteins in the TurboID samples. The same could be said for fusion proteins with a gain-of-function effect.

      Fig 3 B-E. I am a little confused how the data in these graphs is normalized. For instance, I would have expected that for NEKL-3 in panel B, that the normalized (log2) intensity value in N2 be set at 0 as it is for NEKL-2. Maybe I just don't have enough information on how these plots were generated.

      Figure 6C legend is not correct.

      Line 575: Figure reference should be Fig. S5G. The authors should check to make sure all references to supplemental figures include correct panel information.

      Line 576. The authors reference a study by Artan and colleagues and report a weak correlation between their study and that of Artan. They reference figure S4 but it should be Fig S5H.

      Line 652. The authors note that numerous proteins were present at substantially reduced levels in the mNG::TurboID samples and suggest that sticky proteins may have been outcompeted or otherwise excluded from beads incubated with the mNG::TurboID lysates. Why would sticky proteins only be a problem in these samples? The reasoning is not clear to me.

      Line 745: The term "bait overlaps" is a bit vague. Ultimately, I figured out what it meant but it was not immediately obvious.

      S7B Fig. Why is actin missing from the eluate?

      Line 873: The authors state the extent of overlap in GO terms between the various experiments and provide percentages. I tried to extract this information from Figure 8C and came up with different values. For instance, in the case of Molecular Function, they state that they observed a 54% overlap between NEKL-2 and NEKL-3 but in the Venn diagram in Figure 8C I see that the NEKL-2 and NEKL-3 experiments had 71 (25+46) GO terms in common. Out of 98 GO terms for NEKL-2 or 104 for NEKL-3 the percentage I got is closer to 72. Am I analyzing this correctly?

      Significance

      Overall this is an outstanding study that will be of great interest to those interested in using proximity labeling to identify interactors of their favorite protein. The experiments are well executed and the data presented in a mostly clear manner. I really like this study (particularly because I plan to do a proximity labeling study of my own), but I did come away less than impressed with some of the analysis. This is a data-dense manuscript, and it appears to me that the authors tried to cover so much ground that in some cases very little insight was provided. For instance, the authors promote the use of data independent acquisition (DIA) as compared to the more commonly used data dependent acquisition (DDA). However the authors do not provide any analysis to indicate one approach is better than the other. Likewise the combined use of auxin-induced degradation and proximity labeling is explored but there is very little to take away from these experiments. Despite these issues, I am very enthusiastic about the study as a whole.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: The authors present ASPEN - a tool for allelic imbalance estimation in haplotype-resolved single-cell RNA-seq data. Besides the mean of the allelic ratio, ASPEN manages to assess its under- and overdispersion as well as perform group-level comparisons. Dr. Wong with colleagues applied ASPEN to the simulated and publicly available single-cell data from mouse brain organoids and T cells. They showed a general applicability of the tool to this type of data, compared it with scDALI in terms of statistical power, and made numerous conclusions regarding the allele-specific regulation of housekeeping and cell-specific gene expression in general and during cell differentiation, as well as identified examples of X inactivation, imprinting and random monoallelic expression.

      Major comments:

      1. Considering biological insights, the authors focus on genes with the allelic imbalance variance being lower than expected based on the gene expression level, and find them being enriched by the processes essential for cell integrity. I am curious if the variation depends on the number of available cells as well, i.e. housekeeping genes may be more stably expressed from cell to cell. In this context, the authors can compare their results with the stably expressed genes from Lin et al. [https://doi.org/10.1093/gigascience/giz106].
      2. Continuing with the concerns regarding gene expression level changes, authors do not provide information about the differential expression of their findings. Even where they mention "F1 hybrids revealed 33 genes with significant changes in mean allelic expression and 193 with dynamic variance, independent of total expression changes (Supp. Fig. 3B; Supp. Table 4)" in "Allelic variance reveals transcriptional plasticity across cell states" I could not find the relevant info in the corresponding Figure and Supplementary table. Furthermore, it was shown that low number of cells and gene expression level can affect allelic imbalance estimates as well as lead to false positive random monoallelic expression [https://doi.org/10.1371/journal.pcbi.1008772]. The authors admit it but do not properly discuss how it is related to their RME examples. Are they lowly expressed and/or detected in a limited number of cells?
      3. The histogram provided in Figure 5C suggests the general RME preference towards maternal (C57BL/6J) haplotype. Can it be caused by the reference mapping bias? The authors suggest the total shifts of a null allelic mean, 0.52 for T cells and 0.54 for brain organoid, being the result of a reference mapping bias. However, using parental genomes should have eliminated this problem unless a substantial part of individual variants were missed due to the strict quality filters.
      4. Among the genes demonstrating a dynamic allelic imbalance variance during early neurogenesis, the authors found several examples involved in autism spectrum disorders and neuroanatomical phenotypes in mice. They suggest the temporal modulation of variance as a possible regulatory mechanism which may be perturbed in disease states. However, it is hard to estimate the significance of this finding without any enrichment tests. How many disease relevant genes among those with dynamic variance can be expected by chance?

      Minor comments:

      1. Methods would definitely benefit from proofreading, e.g. there are mistakes in the beta-binomial distribution formula, log-transformed gene-level dispersion distribution (it does not follow N(0,1) with zero mean) and gamma likelihood function. Is rho a shape parameter instead of a rate? Specifically, I suggest describing the equitations from the "Bayesian shrinkage implementation" section in more detail. Why does the formula for corrected theta provided in the article deviate from the one presented on github https://github.com/ewonglab/ASPEN/blob/main/R/allelic_imbalance.R, i.e. "thetaCorrected = N/(N-K) * (theta + theta_smoothed(delta/(N-K)))/(1 + (delta/(N-K)))" where K = 1, instead of "thetaCorrected = (N-1)/N * (theta + theta_smootheddelta)/(1 + delta)"? Both gamma and rho also deviate from the script as far as I understood. Moreover, a few steps from the Methods remained unclear to me. First, does ASPEN apply a fixed theta threshold (i.e. of 0.001 from the manual or 0.005 from the article) or performs a more sophisticated MAD-based procedure? Does ASPEN obtain the stabilized thetas using N = 20 and theta = 10, followed by ML to correct both parameters and recalculate the posterior dispersion? Why do tests for static and dynamic allelic variance use different gene-level thetas, stabilized and non-stabilized ones? Does it affect the sensitivity and specificity of group-level analysis?
      2. Besides formulas, there are minor mistakes throughout the text as well. As such, I assume the sentence "In the dyn-mean test, the dispersion parameter (set to the stabilized group-level value)" from the "Detecting dynamic changes" section should include global dispersion, not the one estimated on the group-level. In the section "Allelic variance reveals transcriptional plasticity across cell states" FDR threshold of 0.5 is mentioned instead of 0.05. Figure captions also contain minor mistakes such as "Genes below the dashed line were excluded from the trend modelling" from Figure 4 which corresponds to B instead of C.
      3. Why does Figure 5B contain missing allelic ratio estimations? If it is due to the expression filters, please mention it in the caption.
      4. Given the principles of the dynamic tests, I would suggest calling them "differential", "ANOVA-like" or "group-level" instead of dynamic, since there is no actual possibility to account for the continuous changes over time.
      5. The example of differential variance from Figure 6D is not very clear to me and Supplementary Figure 5C does not help. I suggest adding histograms to emphasize changes in the allelic imbalance variation.
      6. The authors managed to uniquely map and unambiguously assign 20-38% of total reads. The weighted allocation procedure from Choi et al. [https://doi.org/10.1038/s41467-019-13099-0] might help to increase the total coverage.
      7. The discrete low dispersion values in Figures 2, 3A, 4B, 5C and 6A possibly stem from rounding to 4 decimal places. I suggest increasing the accuracy to improve the visual clarity.
      8. The sentence "Of these, 27 were X-linked, consistent with random X-inactivation dynamics in female cells, and five (Bex2, Ndufb11, Pcsk1n, Sh3bgrl, Uba1) displayed signatures of incomplete X inactivation, by demonstrating largely monoallelic expression in each cell" in the "Monoallelic expression reveals regulatory complexity" section should be rephrased to reflect the proportion of cells demonstrating both alleles expressed.

      Significance

      Nowadays the allele-specific gene expression analysis using single-cell RNA-seq data is widely used to study allele-specific bursting [https://doi.org/10.1186/s13059-017-1200-8], imprinting, X chromosome inactivation [https://doi.org/10.1038/s42003-022-03087-4] and other processes [https://doi.org/10.1016/j.tig.2024.07.003].

      1. My field of expertise mostly includes bioinformatic analysis of allele-specific expression and gene regulation using bulk sequencing data. However, to the best of my knowledge, there are three publicly available modern solutions allowing to assess the allelic imbalance using single-cell gene expression data: scDALI published in January 2022 [https://doi.org/10.1186/s13059-021-02593-8], Airpart published in May 2022 [https://doi.org/10.1093/bioinformatics/btac212] and DAESC published in 2023 [https://doi.org/10.1038/s41467-023-42016-9], with the latter not being mentioned by the authors.
      2. While the authors used simulations to compare ASPEN to scDALI-Hom in terms of sensitivity, I could not find any specificity estimates. The reasons for the statement "ASPEN demonstrated high sensitivity (98%) and specificity (92%) with a low false positive rate (<12%), confirming its capacity to distinguish distinct modes of regulatory variation during lineage differentiation (Fig. 4G)" are also unclear to me since Figure 4G only demonstrates a true positive rate in test and control simulations. Should not FPR be equal to 1 - specificity?
      3. Moreover, I suggest authors compare ASPEN to Airpart and DAESC along with scDALI as it can underline the scenarios where ASPEN is the best or the only option. Moreover, all these tools can estimate either heterogenous (scDALI-Het) or dynamic (Airpart, DAESC) allelic imbalance which can be compared to the allelic variance and group-level tests, respectively.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors introduce ASPEN (Allele-Specific Parameter Estimation in scRNA-seq), a statistical framework designed to model cis-regulatory variation in single-cell RNA-sequencing data, and demonstrate that ASPEN effectively detects cell state-specific allelic imbalances. Using simulated datasets, the authors show that ASPEN outperforms existing methods (e.g., scDALI) in both sensitivity and specificity. Furthermore, they demonstrate that ASPEN can be used to further dissect allelic imbalance, enabling the identification of random monoallelic expression (RME), gene expression pulsing, and dynamic regulatory shifts.

      My main concerns are:

      • Framework similarity with scDALI: The ASPEN framework shares many conceptual similarities with scDALI. It is not clear why ASPEN significantly outperforms scDALI. The authors should elaborate more clearly on the differences between the two approaches and provide a detailed explanation for the observed improvements.
      • Scalability and runtime: The manuscript does not report computational performance metrics (e.g., runtime, memory usage), which would be important for users planning to apply ASPEN to large-scale datasets.
      • Comparison to additional tools: While the comparison to scDALI is appropriate, including benchmarking against other recent allele-specific methods (e.g., SCALE, AirPart) would strengthen the evaluation and broaden its relevance.
      • User guidance: A figure or supplementary table summarizing required inputs, preprocessing steps, recommended parameters, and filtering strategies would be highly beneficial for potential users.
      • Time-series smoothing: The manuscript would benefit from a clearer explanation of how time-series smoothing is implemented within ASPEN, particularly in dynamic cell state contexts.

      Significance

      The ASPEN framework is useful for identifying single cell ASE and related analysis, which currently is under developed. It is timely and the framework is rigorous and flexible and driving by the data.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting paper, which introduces a new approach and software ASPEN for analysis of allele-specific gene expression, which is applied to transcriptomes of F1 hybrids of mouse lines. The manuscript introduces an interesting statistical technique, which up to my knowledge is correct and brings about new biological results, identifying genes with systematically decreased or increased expression variance and statle allelic expression ratio, which seems to be controlled by the regulatory machinery.

      The manuscript has some shortcomings in presentation, it is written very concisely, especially in its methods part, and is somewhat difficult to follow.

      I'm not sure that the authors make the correct claim in the manuscript. The title and the abstract says that the manuscript discusses the cis-regulatory heterogeneity, but in fact there is very little in the manuscript about gene regulation per ce. The study demonstrates that allele specific expression is controlled by some yet unknown mechanisms, rather than a product of technical noise and then presents a number of examples of different pathways which the increased and decreased allele specific variance. Also the manuscript presents several examples of shifts in the variance of particular genes in temporal development.

      Yet, the manuscript tells virtually nothing about regulation, thus the conclusion that 'ASPEN enables the interrogation of cis regulatory effects on gene expression' is not justified in its literal terms; what ASPEN does it quantifies the allele-specific transcription activity effects in a single cell transcriptomics experiment. Mechanistically the observed effects can be explained by any regulatory effect like DNA methylation, chromatin structure or whatever. To prove that cis-regulatory effects are important here the authors need to show the allele specific nature of transcription factor binding (for instance by showing the TF binding motifs destroyed/created by variants). It is more difficult to take into account the chromatin effects without ATAC experiments but it might be that ATAC-seq experiments are available for parental line and there is a differential DNA accessibility in the locality of genes of interest. I think only with such mechanistic illustrations one can conclude that cis-regulatory interactions play a major role here.

      As an other option, the authors may publish the study per se but with a changed title, the abstract and the discussion, formulating it in a more phenomenological way.

      Minor note

      In Figures 2-5 the low variance genes are shown with dots occupying lines parallele to x axis. This can be related to some wrong digitising of variance or to a low numbers of reads contributing to the variance. Please double check.

      Significance

      The paper introduces a new interesting statistical approach for quantifying allele specific transcription from the single cell data, using Bayesian shrinkage technique similar to that used in edgeR. The paper has clear biological meaning demonstrating that there are genes with a decreased variability in gene expression. I believe, the paper draws attention to the interesting area of facts and as such may be published.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03031

      Corresponding author(s): Lara-Pezzi, Enrique and Gómez-Gaviro, María Victoria

      1. General Statements [optional]

      Dear Editors,

      Following the review of our article entitled "Loss of the alternative calcineurin variant CnAβ1 enhances brown adipocyte differentiation and drives metabolic overactivation through FoxO1 activation", we propose below a number of experiments to be performed in order to address the issues raised by the reviewers.

      While we acknowledge the limitations of the full CnAβ1 knockout mouse and we unfortunately lack a tissue-specific knockout mouse, we believe that the proposed new experiments together with the (abundant) existing information in the paper will help clarify the concerns raised by the reviewers.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      *The current study examines the metabolic phenotype of mice lack the calcineurin variant CnAb1 (CnAb1KO). On a high fat diet, CnAb1KO mice gain less weight compared to WT controls, which is accompanied by improvements in obesity-related metabolic dysfunction, such as glucose/insulin intolerance and hyperlipidemia. The authors attribute most of the observed phenotypes to enhanced brown fat function, notably fatty acid catabolism and the thermogenic capacity. Mechanistically, the authors propose that CnAb1KO increases FoxO1 transcriptional activity, as a result of reduced mTOR/Akt signaling, which in turn mediates the hyper-catabolism of BAT in CnAb1KO mice. *

      * Major comments: *

      *Q1. The main issue of the study is it's not hypothesis driven. Based on high fat diet-induced metabolic phenotype of the whole body CnAb1KO mice, the authors put together a mechanism focusing on potential roles of CnAb1 in BAT functions that affect systemic metabolic homeostasis. However, the rationales to establish this link were based largely on correlative results and at times incorrect data interpretation (for instance, using the expression of Myf5 and Pax7 as markers for brown adipocyte differentiation). The sequential event from CnAb1 loss of function to reduced mTOR signaling and increased FoxO1 activity (or conversely, how CnAb1 increases mTOR signaling to reduce FoxO1 activity) has not been mechanistically characterized. There are also no studies to explain how FoxO1 is involved in brown fat differentiation and hyper-catabolism of BAT downstream of the CnAb1-mTOR pathway. In addition, the UCP-1 FoxO1KO experiment in Fig. 6 fails to provide strong evidence to support the claim. Thus, there are many gaps between the observed phenotype and the proposed mechanism. *

      A1. We thank the reviewer for the insightful comments. We agree with the reviewer that, historically, this project did not originally focus on the BAT. Instead, we arrived at the BAT after ruling out other possibilities to explain the reduced body weight observed in these animals, together with the reduced body temperature after starvation, which was our first observation. While the BAT involvement was not our first hypothesis a priori, we do not agree that this would invalidate or reduce the interest of our work. While our initial evidence may have been correlative at first, the FoxO1 BAT-specific knockout experiments and the AAV/Ucp1-Cre CnAβ1 expression restoration experiments prove that the BAT is indeed involved in the phenotype observed in CnAβ1Δi12 (KO) mice. It is likely that other organs may be also involved (since the phenotype is not fully prevented by the BAT-specific approaches) but the BAT is definitely involved.

      To further substantiate the involvement of the BAT in the improved metabolic phenotype observed in CnAβ1Δi12 mice, we propose to perform BAT transplantation, monitoring body weight over 8 weeks following transplantation. If successful, BAT transplantation from CnAβ1Δi12 mice into WT mice should improve their metabolic response to high-fat diet (HFD), thereby reinforcing the role of the BAT in these mice.

              In addition, we propose to measure the __*levels of so-called batokines*__ FGF21, VEGFA, IL6, and also of 12,13-diHOME in BAT and serum from 12-week-old chow and HFD mice.
      
              With regards to Pax7 and Myf5, while we agree that these are common precursors to other lineages (skeletal muscle), we show in Fig. S1E additional differentiation markers such as Cox2 and Cpt1b. __*The 5 markers assessed showed an increase in *____*CnAβ1Δi12 mice, pointing towards a cell-autonomous effect of the absence of CnAβ1 on the BAT*__. Nevertheless, to further substantiate the accelerated differentiation of brown preadipocytes in the absence of CnAβ1, we propose to __*measure the expression of additional BAT markers*__ (although they are not exclusive of BAT), such as Ucp1, Prdm16, PPARγ, and AdipoQ in brown preadipocytes isolated from 6–8-week-old mice.
      
              With regards to the activation of mTOR (specifically mTORC2) by CnAβ1, we published this in previous papers from our group: Gómez-Salinero et al (Cell Chem Biol, 2016), Felkin et al (Circulation, 2011), Lara-Pezzi et al (J Cell Biol 2007), Padrón-Barthe et al (J Am Coll Cardiol 2018). The mechanism involves the interaction between CnAβ1 and mTORC2 in cellular membranes. Knockdown of CnAβ1 results in mTORC2 mislocalisation and Akt inhibition. In addition, we show in Fig. 6C in this paper that PTEN inhibition reduces the improved differentiation of BAT adipocytes from CnAβ1Δi12 mice, further involving the Akt pathway in the observed phenotype. Furthermore, Fig. 6 shows a significant increase in body weight and BAT weight in BAT-specific FoxO1 knockout CnAβ1Δi12 mice, together with a significant decrease in different Pnpla1, Irf4, and Bcat2 expression. While we agree that the reversal of the phenotype is only partial, the effect of knocking out FoxO1 in the BAT of CnAβ1Δi12 mice is both statistically significant and biologically relevant. We would be happy to provide additional information at the Editors’ request. In addition, we propose to carry out __BAT preadipocyte differentiation experiments comparing cells isolated from CnAβ1Δi12 mice to those isolated from CnAβ1Δi12 mice with BAT-specific FoxO1 knockout__.
      

      Q2. A second issue is that most of the phenotypes can be explained by the difference in weight gain. With the available data, it's difficult to pinpoint the tissue origin(s) mediating the weight gain/loss phenotype. The authors would first need to generate a BAT-CnAb1KO mouse line to convincingly show a main role for BAT CnAb1 in systemic metabolic homeostasis. There are also many problems with data presentations/interpretations of the metabolic phenotyping studies. For example, Fig. 1A shows that CnAb1KO mice are about 5 g lighter than controls. However, Fig. 1G indicates a 10 g difference in fat mass. The EM images in Fig. 3B are of poor quality, which seems to suggest that HFD fed CnAb1KO mice have the highest mitochondrial density. Lastly, in Fig. 4C/D, the authors interpret the reduced FFA and glycerol levels in CnAb1KO after b3-agonist injection as increased fatty acid burning by BAT, which is incorrect. If anything, the reduced glycerol release in the KO mice would suggest a reduction in lipolysis. However, the most likely explanation is that WT mice have more fat mass and as such, more fat hydrolysis.

      A2. While we agree with the reviewer that some of the features may be explained by reduced body weight gain (reduced WAT weight, for instance), many other changes showed by CnAβ1Δi12 mice cannot be explained by reduced body weight gain alone, including higher expression of differentiation markers in BAT, higher number of mitochondria in BAT, or improved cold-tolerance, among others. Therefore, we respectfully disagree with the reviewer’s opinion.

      Unfortunately, we do not have a tissue-specific CnAβ1 knockout mouse and we cannot commit to having one in the short term. While we acknowledge the limitations of using a full knockout mouse, we provided several pieces of evidence that the BAT is involved in the observed phenotype, as pointed out in the discussion: 1) Placing CnAβ1Δi12 mice in thermoneutral conditions mitigated the weight loss. 2) Reintroducing CnAβ1 in BAT with a CnAβ1-overexpressing virus partially prevented the weight loss. 3) Minimal changes in mitochondrial gene expression were observed in skeletal muscle and liver, suggesting that the phenotype is primarily driven by alterations in BAT. 4) BAT adipocytes from CnAβ1Δi12 mice differentiated more effectively than those from wild type mice, suggesting a cell-autonomous effect. While a direct effect of CnAβ1 on WAT cannot be entirely ruled out, our results strongly suggest that loss of CnAβ1 in BAT is a major contributor to the observed metabolic changes.

              With regards to Fig. 1E, this is an estimation of fat weight from __MRI__ images. We agree with the reviewer that this is obviously wrong and we will __revise this quantification__. We propose to __add measurements of subcutaneous WAT__, which we also have, to further support the difference observed in eWAT.
      
              With regards to Fig. 3B, we agree that some of the individual figures may have been poorly chosen, but the graph in Fig. 3C (which quantifies the electron microscopy pictures) clearly shows that the reduction in mitochondria in WT mice as a result of HFD feeding is prevented in CnAβ1Δi12 mice. Fig. 3C does not show an increase in mitochondria with HFD, as implied by the reviewer based on Fig. 3B. We propose to __provide adequate panels for Fig. 3B that better reflect the averages shown in Fig. 3C__.
      
              Regarding Fig. 4C and D, we thank the reviewer for this correction, which we agree with. We still believe that the BAT of CnAβ1Δi12 mice is burning fat more effectively than that of WT mice, but we agree that these experiments are not the proof of this claim. We will__ move or remove panels C and D from Fig. 4__ and focus this figure on thermogenic capacity.
      
              To assess systemic lipolysis, we will __measure in vivo serum levels of NEFA__ (non-esterified fatty acids) __and glycerol__ in 12-week-old mice fed a HFD. Additionally, to evaluate BAT lipolytic activation, we will perform __BAT explant and *ex vivo* experiments__ to determine the lipolysis rate. This should provide valuable information supporting the role of the BAT in the observed phenotype in CnAβ1Δi12 mice.
      

      *Q3. The authors should take a fresh, unbiased look at existing data, form a testable hypothesis and design a series of new experiments (including new tissue-specific KO mice) to assess the function of CnAb1 in BAT or other tissues responsible for the metabolic phenotype. If BAT is indeed involved, the authors need to mechanistically determine the role of CnAb1 in brown adipocyte differentiation vs BAT function and explain why the ratio of CnAb1/CnAb2 ratio matters in this context, as this is the basis for the entire study. A revision addressing main issues of the manuscript will not likely to be completed in a typical revision time (e.g. 3 months). *

      A3. As explained above, unfortunately we do not have tissue-specific CnAβ1 knockout mice. If the Editors consider that this is essential for resubmission of a revised article, we are afraid that we cannot comply. This said, we believe that our manuscript contains relevant data about metabolic regulation by the CnAβ1 calcineurin isoform that are new and relevant to the field.

              Our data provide clear evidence that the BAT is indeed involved in the phenotype observed in CnAβ1Δi12 mice, as explained in our previous answers above. It may not be the *only* tissue involved, but it is most definitely involved. The BAT transplant experiments will add further evidence of this.
      
              We already show evidence of the role of CnAβ1 (or rather, its absence) in the differentiation of BAT pre-adipocytes (Fig. S1E and Fig. 6C) and we will __provide additional evidence through the proposed new experiments__. Similarly, we provide evidence of the role of CnAβ1 in BAT weight, transcriptional profile, lipid content, and number of mitochondria. Also here, we believe that __the proposed experiments will reinforce this aspect of the paper__.
      

      Reviewer #1 (Significance (Required)):

      *Q4. The thermogenic capacity of brown and beige adipocytes has shown promise as a means to reduce fat burden to treat obesity and related metabolic diseases. Identification of brown/beige adipocyte promoting mechanisms may provide druggable targets for therapeutic development. As such, the topic and findings of the current study would be of interest to researchers in the metabolism and drug development fields. The weakness of the study is that it's descriptive and the authors jump to conclusions without strong supporting evidence. Most of the metabolic phenotypes associated with CnAb1KO mice are likely secondary to the weight difference. The rationale to focus on BAT is not well justified. A well-thought-out approach would be needed to identify the tissue origins mediating the metabolic phenotypes of CnAb1KO mice and to dissect the underlying mechanisms. *

      *Reviewer's field of expertise: adipose tissue biology, systemic metabolic regulation, immunometabolism *

      A4. We agree with the reviewer about the potential relevance of our findings. The shortcomings pointed out in this comment have been addressed above. Overall, we thank the reviewer for their thorough review of our ms.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *The manuscript entitled « Loss of the alternative calcineurin variant CnAβ1 enhances brown adipocyte differentiation and drives metabolic overactivation through FoxO1 activation » by Dr Lara-Pezzi and colleagues describes the role of the calcium/calmodulin dependent serine/threonine phosphatase catalytic subunit calcineurin variant CnAß1 in brown adipose tissue physiology and function. Through the use of global CnAß1 KO mice, the authors show that these mice are resistant to diet-induced obesity, have increased thermogenesis due to increased mitochondrial activity, decreased body weight, improved glucose homeostasis, increased fatty acid oxidation. The authors also demonstrate that these effect are mostly mediated through improved brown adipose tissue (BAT) function, through increased Foxo1 activation in BAT. Genetic deletion of Foxo1 in BAT resulted in increased body weight and impaired mitochondrial gene expression. In addition, the authors also correlate their findings to potential CNAß1 polymorphism from the UK biobank associated to improved metabolic traits in humans (blood glucose mainly). *

      Although interesting, the conclusion are not always supported by the data. The manuscript requires additional experiments to further consolidate their claims.

      *Q1. It should be mentioned that all experiments are performed in global CnAβ1 KO mice. Thus, it is difficult to assess the cell-autonomous role if this protein in BAT function (even if an AAV9 driving CnAβ1 expression is used; or if other tissues have been studied). This should be discussed at least as a limitation of the study, except if floxed mice are available. *

      A1. We thank the reviewer for the positive comments about our work.

      Unfortunately, we do not have a tissue-specific CnAβ1 knockout mouse. However, we believe we provide abundant evidence of the involvement of the BAT in the phenotype observed in CnAβ1Δi12 mice, including the following: 1) Placing CnAβ1Δi12 mice in thermoneutral conditions mitigated the weight loss. 2) Reintroducing CnAβ1 in BAT with a CnAβ1-overexpressing virus partially prevented the weight loss. 3) Minimal changes in mitochondrial gene expression were observed in skeletal muscle and liver, suggesting that the phenotype is primarily driven by alterations in BAT. 4) BAT adipocytes from CnAβ1Δi12 mice differentiated more effectively than those from wild type mice, suggesting a cell-autonomous effect. While a direct effect of CnAβ1 on WAT cannot be entirely ruled out, our results strongly suggest that loss of CnAβ1 in BAT is a major contributor to the observed metabolic changes.

      This said, we fully agree with the reviewer to acknowledge in the discussion the limitation of using a full knockout mouse for this study.

      Q2. Is there good antibodies for CnAβ1? The protein levels of the protein should be shown in, at least, adipose tissues of WT and KO mice under chow and HFD.

      A2. There is no good antibody against CnAβ1. The main reason is that the C-ter domain of this isoform is not very immunogenic. We did try to generate an antibody, but we got no immune response against the unique C-ter domain. We do have an old antibody generated against CnAβ1 years ago. We propose to try to perform WB and immunohistochemistry in WT and ____CnAβ1Δi12 mice. However, we need to be clear that we cannot make any commitments towards these results, since the antibody may not work. In any case, we believe that the RT-PCR results, which clearly discriminate both isoforms, are very clear.

      *Q3. A general comment is that most of the conclusions are drawn from qRT-PCR data. It lacks functional experiments that may reinforce the conclusion. For example, did the authors measure mitochondrial function in BAT of WT and KO mice using different substrate (fatty acids, glucose, ...)? *

      A3. We thank the reviewer for this suggestion and we therefore propose to include in the revised paper measurements of mitochondrial activity with different substrates in WT and ____CnAβ1Δi12 mice.

      *Q4. Lack of validation of the mouse model used (CnAβ1 expression in BAT upon AAV9 over expression confirmed? What about the other tissues?). *

      A4. We showed in Fig. 5E the increase in CnAβ1 expression in the BAT of Ucp1-Cre mice infected with the floxed AAV-CnAβ1 virus. We propose to include similar expression analyses in other tissues.

      Reviewer #2 (Significance (Required)):

      Q5. This is a novel study addressing the role of CnAβ1 in energy homeostasis, more specifically in BAT function. This study reports for the first time the role of CnAβ1 in energy homeostasis, with new mechanistic insights related to the crosstalk between CnAβ1 and Foxo1.

      The authors have previously described the role of this protein in cardiac function. There are not a lot of publications describing the function of this protein, thus this study may be interested for the community working on diabetes/obesity/cardio-metabolic field.

      *Limitations : see below (lack of functional data, ...). *

      A5. We thank the reviewer for these comments, with which we agree.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      • *

      • *

      4. Description of analyses that authors prefer not to carry out

      As much as we would like to have a tissue-specific CnAβ1 knockout mouse, the reality is that we do not have it. In any case, we believe that our paper provides a considerable amount of data that is relevant to the field.

      We remain open to incorporating the suggested experiments, or others, should they be considered necessary to further strengthen the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript entitled « Loss of the alternative calcineurin variant CnAβ1 enhances brown adipocyte differentiation and drives metabolic overactivation through FoxO1 activation » by Dr Lara-Pezzi and colleagues describes the role of the calcium/calmodulin dependent serine/threonine phosphatase catalytic subunit calcineurin variant CnAß1 in brown adipose tissue physiology and function. Through the use of global CnAß1 KO mice, the authors show that these mice are resistant to diet-induced obesity, have increased thermogenesis due to increased mitochondrial activity, decreased body weight, improved glucose homeostasis, increased fatty acid oxidation. The authors also demonstrate that these effect are mostly mediated through improved brown adipose tissue (BAT) function, through increased Foxo1 activation in BAT. Genetic deletion of Foxo1 in BAT resulted in increased body weight and impaired mitochondrial gene expression. In addition, the authors also correlate their findings to potential CNAß1 polymorphism from the UK biobank associated to improved metabolic traits in humans (blood glucose mainly).

      Although interesting, the conclusion are not always supported by the data. The manuscript requires additional experiments to further consolidate their claims.

      It should be mentioned that all experiments are performed in global CnAβ1 KO mice. Thus, it is difficult to assess the cell-autonomous role if this protein in BAT function (even if an AAV9 driving CnAβ1 expression is used; or if other tissues have been studied). This should be discussed at least as a limitation of the study, except if floxed mice are available. Is there good antibodies for CnAβ1? The protein levels of the protein should be shown in, at least, adipose tissues of WT and KO mice under chow and HFD.

      A general comment is that most of the conclusions are drawn from qRT-PCR data. It lacks functional experiments that may reinforce the conclusion. For example, did the authors measure mitochondrial function in BAT of WT and KO mice using different substrate (fatty acids, glucose, ...) ? Lack of validation of the mouse model used (CnAβ1 expression in BAT upon AAV9 over expression confirmed? What about the other tissues?).

      Significance

      This is a novel study addressing the role of CnAβ1 in energy homeostasis, more specifically in BAT function. This study reports for the first time the role of CnAβ1 in energy homeostasis, with new mechanistic insights related to the crosstalk between CnAβ1 and Foxo1.

      The authors have previously described the role of this protein in cardiac function. There are not a lot of publications describing the function of this protein, thus this study may be interested for the community working on diabetes/obesity/cardio-metabolic field.

      Limitations: see below (lack of functional data, ...).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The current study examines the metabolic phenotype of mice lack the calcineurin variant CnAb1 (CnAb1KO). On a high fat diet, CnAb1KO mice gain less weight compared to WT controls, which is accompanied by improvements in obesity-related metabolic dysfunction, such as glucose/insulin intolerance and hyperlipidemia. The authors attribute most of the observed phenotypes to enhanced brown fat function, notably fatty acid catabolism and the thermogenic capacity. Mechanistically, the authors propose that CnAb1KO increases FoxO1 transcriptional activity, as a result of reduced mTOR/Akt signaling, which in turn mediates the hyper-catabolism of BAT in CnAb1KO mice.

      Major comments:

      1. The main issue of the study is it's not hypothesis driven. Based on high fat diet-induced metabolic phenotype of the whole body CnAb1KO mice, the authors put together a mechanism focusing on potential roles of CnAb1 in BAT functions that affect systemic metabolic homeostasis. However, the rationales to establish this link were based largely on correlative results and at times incorrect data interpretation (for instance, using the expression of Myf5 and Pax7 as markers for brown adipocyte differentiation). The sequential event from CnAb1 loss of function to reduced mTOR signaling and increased FoxO1 activity (or conversely, how CnAb1 increases mTOR signaling to reduce FoxO1 activity) has not been mechanistically characterized. There are also no studies to explain how FoxO1 is involved in brown fat differentiation and hyper-catabolism of BAT downstream of the CnAb1-mTOR pathway. In addition, the UCP-1 FoxO1KO experiment in Fig. 6 fails to provide strong evidence to support the claim. Thus, there are many gaps between the observed phenotype and the proposed mechanism.
      2. A second issue is that most of the phenotypes can be explained by the difference in weight gain. With the available data, it's difficult to pinpoint the tissue origin(s) mediating the weight gain/loss phenotype. The authors would first need to generate a BAT-CnAb1KO mouse line to convincingly show a main role for BAT CnAb1 in systemic metabolic homeostasis. There are also many problems with data presentations/interpretations of the metabolic phenotyping studies. For example, Fig. 1A shows that CnAb1KO mice are about 5 g lighter than controls. However, Fig. 1G indicates a 10 g difference in fat mass. The EM images in Fig. 3B are of poor quality, which seems to suggest that HFD fed CnAb1KO mice have the highest mitochondrial density. Lastly, in Fig. 4C/D, the authors interpret the reduced FFA and glycerol levels in CnAb1KO after b3-agonist injection as increased fatty acid burning by BAT, which is incorrect. If anything, the reduced glycerol release in the KO mice would suggest a reduction in lipolysis. However, the most likely explanation is that WT mice have more fat mass and as such, more fat hydrolysis.
      3. The authors should take a fresh, unbiased look at existing data, form a testable hypothesis and design a series of new experiments (including new tissue-specific KO mice) to assess the function of CnAb1 in BAT or other tissues responsible for the metabolic phenotype. If BAT is indeed involved, the authors need to mechanistically determine the role of CnAb1 in brown adipocyte differentiation vs BAT function and explain why the ratio of CnAb1/CnAb2 ratio matters in this context, as this is the basis for the entire study. A revision addressing main issues of the manuscript will not likely to be completed in a typical revision time (e.g. 3 months).

      Significance

      The thermogenic capacity of brown and beige adipocytes has shown promise as a means to reduce fat burden to treat obesity and related metabolic diseases. Identification of brown/beige adipocyte promoting mechanisms may provide druggable targets for therapeutic development. As such, the topic and findings of the current study would be of interest to researchers in the metabolism and drug development fields. The weakness of the study is that it's descriptive and the authors jump to conclusions without strong supporting evidence. Most of the metabolic phenotypes associated with CnAb1KO mice are likely secondary to the weight difference. The rationale to focus on BAT is not well justified. A well-thought-out approach would be needed to identify the tissue origins mediating the metabolic phenotypes of CnAb1KO mice and to dissect the underlying mechanisms.

      Reviewer's field of expertise: adipose tissue biology, systemic metabolic regulation, immunometabolism

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      REVIEWER 1

      This is an important and solid study that identified sequences that can improve circRNA translation and that as or more importantly are very short and hence are suitable for generating of efficient protein expressing circRNAs. This manuscript fills an important gap in the field, and it is highly significant. The study is well controlled, the rationale clear and the results conclusive with no major flaws.

      • While this is a minor concern as the vector has been used before, it will greatly improve the quality of the paper if the authors could just verify that the vector only generates circRNA molecules and not linear concatenamers. To do so the authors can focus only in their control and the most optimal transcripts and perform northern blot or well controlled RNAseR experiments to show that all RNA molecules containing the back splicing junction are circular We thank the reviewer for raising this point. As suggested, we performed RNaseR resistance assays on our three most efficient candidates driving cGFP translation (VCIP, T3-glo, and T3-U3) to confirm that all derived RNA molecules containing the back-splicing junction are circular. As proof of this, cGFP proved strongly resistant to RNase R (new Fig. S1N), confirming its circular structure. We further ruled out the possibility that molecules other than the circRNA encoding GFP serve as templates for translation from our vectors. Specifically, ad hoc PCR amplifications performed for this purpose (new Fig. S1M) showed no bands that would indicate the presence of concatemers. Indeed, ad hoc PCR amplifications (new Fig. S1M) revealed no bands indicative of concatemer formation. The primers used and the expected sizes of the amplicons are schematically represented in new Fig. S1M. In brief, we used a divergent primers set spanning the BSJ (3-4) to specifically detect the mature circRNA and a set of convergent primers (1-2) pairing on the GFP ORF, thus detecting both the circRNA and its linear precursor as well as the putative concatemer expected. Although a ~1 kb band was expected if a trans-splicing by-product was present, no such band was observed (new Fig. S1M). Moreover, RT-PCR amplification of the cGFP back-splice junction was markedly more efficient when reverse transcription was primed with random hexamers than with oligo(dT), priming total RNA or preferentially polyA+ RNA, respectively. These results are expected for a circRNA, as also indicated by the fact that the circZNF609 positive control behaves in a similar manner. Collectively, these results confirm the circular nature of our transcript and exclude translation originating from possible concatemers.

      • These results are shown in new Fig. S1M and S1N and described in the text as follows: Importantly, we ruled out the possibility that templates other than the GFP-encoding circRNA drive translation from our best performing constructs (V-cGFP, T3-glo-cGFP and T3-U3-cGFP). Ad hoc PCRs amplifications (Fig. S1M) revealed no bands indicative of concatemer formation. The left panel of Fig. S1M schematically illustrates the primer sets and expected amplicons sizes. In particular, we used a divergent primers set spanning the BSJ (3-4) to specifically detect the mature circRNA and a set of convergent primers (1-2) pairing on the GFP ORF detecting both the circRNA and its linear precursor as well as the putative concatemer expected. Although a ~1 kb band was expected if a trans-splicing by-product was present, no such band was observed. Moreover, RT-PCR amplification of the cGFP back-splice junction was markedly more efficient when reverse transcription was primed with random hexamers than with oligo(dT), priming total RNA or preferentially polyA+ RNA, respectively (Fig. S1M). These results are consistent with the circularity of the transcripts tested and coherent with the results obtained for circZNF609, used as control (Fig. S1M). Finally, cGFP resulted resistant to RNAseR treatment (Fig. S1N), further supporting its circular nature.”*

      • There is a repetition of the world "a" in the abstract. We thank the reviewer for the attention paid to our text, we removed the extra “a” from the abstract.

      • All circRNA translation studies should be cited when describing translation of circRNAs. We thank the reviewer for the suggestions, we corrected the mistake present in the text and included extra referenced about circRNA translation.

      *Specifically, we included: *

      • Fan, X., Yang, Y., Chen, C. et al. Pervasive translation of circular RNAs driven by short IRES-like elements. Nat Commun 13, 3751 (2022). https://doi.org/10.1038/s41467-022-31327-y
      • Chen CK et al. Structured elements drive extensive circular RNA translation. Mol Cell. 2021 Oct 21; 81(20):4300-4318.e13.doi: 10.1016/j.molcel.2021.07.042. Epub 2021 Aug 25. PMID: 34437836; PMCID: PMC8567535.
      • Obi P, Chen YG. The design and synthesis of circular RNAs. Methods. 2021 Dec;196:85-103. doi: 10.1016/j.ymeth.2021.02.020. Epub 2021 Mar 2. PMID: 33662562; PMCID: PMC8670866.
      • Fukuchi, K., Nakashima, Y., Abe, N. et al. Internal cap-initiated translation for efficient protein production from circular mRNA. Nat Biotechnol (2025). https://doi.org/10.1038/s41587-025-02561-8
      • Du, Y., Zuber, P.K., Xiao, H. et al. Efficient circular RNA synthesis for potent rolling circle translation. Nat. Biomed. Eng 9, 1062–1074 (2025). https://doi.org/10.1038/s41551-024-01306-3
      • Wang F, Cai G, Wang Y, Zhuang Q, Cai Z, Li Y, Gao S, Li F, Zhang C, Zhao B, Liu X. Circular RNA-based neoantigen vaccine for hepatocellular carcinoma immunotherapy. MedComm (2020). 2024 Jul 29;5(8):e667. doi: 10.1002/mco2.667. PMID: 39081513; PMCID: PMC11286538.
      • Andries O, Mc Cafferty S, De Smedt SC, Weiss R, Sanders NN, Kitada T. N(1)-methylpseudouridine-incorporated mRNA outperforms pseudouridine-incorporated mRNA by providing enhanced protein expression and reduced immunogenicity in mammalian cell lines and mice. J Control Release. 2015 Nov 10;217:337-44. doi: 10.1016/j.jconrel.2015.08.051. Epub 2015 Sep 3. PMID: 26342664.
      • Yang Y, Fan X, Mao M, Song X, Wu P, Zhang Y, Jin Y, Yang Y, Chen LL, Wang Y, Wong CC, Xiao X, Wang Z. Extensive translation of circular RNAs driven by N6-methyladenosine. Cell Res. 2017 May;27(5):626-641. doi: 10.1038/cr.2017.31. Epub 2017 Mar 10. PMID: 28281539; PMCID: PMC5520850. REVIEWER 2

      Circular RNAs (circRNAs) have attracted significant interest due to their unique properties, which make them promising tools for expressing exogenous proteins of therapeutic value. However, several limitations must be addressed before circRNAscan become a biologically and economically viable platform for the biotech industry.One of the main challenges is the reliance on large, highly structured sequences withinternal ribosome entry site (IRES) activity to initiate translation of the downstream open reading frame. In this study, the authors propose an alternative strategy that combines the 5′ untranslated region (5′UTR) of a previously characterized natural circRNA(circZNF609) with a short 13-nt nucleotide sequence shown to act as a translational enhancer. By evaluating the activity of various constructs containing a reporter geneacross multiple cell lines, they identify the most efficient and compact sequence, 63-nt long, capable of boosting translation within a circular RNA context.

      Major Comments:

      • This study is well-executed and relies on standard in vitro molecular biology techniques, which are adequate to support the conclusions drawn. *We thank the reviewer for the very positive opinion on the execution of our study. *

      • The experimental procedures are clearly described, and the statistical analyses have been performed according to accepted standards. *We thank the reviewer for the very positive comment about the analyses we performed. *

      Minor Comments:

      • The manuscript would greatly benefit from a comprehensive revision to improve clarity and language. Involving a native English speaker during the editing process could significantly enhance the manuscript's readability and overall quality. The Results section would benefi t from closer attention, as certain parts of the description are attimes confusing and could be clarifi ed for better reader comprehension. We thank the reviewer for the input. We performed a huge revision of the text to improve language quality and enhance readability. We extended the descriptions in the results sections in order to explicit and clarify our data.

      • The references should be carefully reviewed for accuracy and consistency-forinstance, references 9 and 10 appear to require correction or clarifi cation. We thank the reviewer for the careful reading of our paper. We amended the reference section, and we expanded it.

      Reviewer #2 (Significance (Required)):

      This study addresses a critical bottleneck in RNA therapeutics. The use of the proposed short sequences could significantly enhance the in vivo activity of protein-encoding circular RNAs. A highly efficient, compact translational enhancer has thepotential to substantially improve the therapeutic applicability of circRNAs and broaden their range of applications. Given the potential utility of these findings, we would anticipate pursuing intellectual property (IP) protection. To further strengthen the study, future work should include additional data on polysome association and a detailed analysis of the secondary structure of the 66-nt enhancer sequence. This work should be of broad interest to molecular biologists working on RNA biology, translation, and RNA-based therapeutics. I expect the identified sequence will betested by multiple laboratories to evaluate its strength and versatility, further underscoring the potential impact of this study. For context, I am actively engaged in research on non-coding RNAs.

      • *

      REVIEWER 3

      In this brief report, the authors take advantage of circular RNA expression plasmids to define elements that can be used to enable efficient translation. They test a handful of known IRES elements as well as short translation enhancing elements (TEEs) for their ability to promote translation of circular GFP and c-ZNF609 reporters. They focus on one particular element that is of a short length and seems to work as well as longer IRES elements. My major concern relates to possible alternative sources of the translated proteins, which the authors have not ruled out (see below). I find themanuscript to be too preliminary in its current state.

      • Work from the Meister group (Ho-Xuan et al 2020 Nucleic Acids Res 48:10368) has shown that apparent translation from circRNA over-expression plasmids is not from circular RNAs, but instead from trans-splicing linear by-products. The authors have not ruled out such alternative explanations here, e.g. by using deletion constructs that prevent backsplicing. We thank the reviewer for raising this point. *We ruled out the possibility that molecules other than the circRNA encoding GFP serve as templates for translation from our vectors. Specifically, ad hoc PCR amplifications performed for this purpose (new Fig. S1M) showed no bands that would indicate the presence of concatemers. Indeed, ad hoc PCR amplifications (new Fig. S1M) revealed no bands indicative of concatemer formation. The primers used and the expected sizes of the amplicons are schematically represented in new Fig. S1M. In particular, we used a divergent primers set spanning the BSJ (3-4) to specifically detect the mature circRNA and a set of convergent primers (1-2) pairing on the GFP ORF detecting both the circRNA and its linear precursor as well as the putative concatemer expected. Although a ~1 kb band was expected if a trans-splicing by-product was present, no such band was observed. Moreover, RT-PCR amplification of the cGFP back-splice junction was markedly more efficient when reverse transcription was primed with random hexamers than with oligo(dT), priming total RNA or preferentially polyA+ RNA, respectively. These results are expected for a circRNA, as also indicated by the fact that the circZNF609 positive control behaves in a similar manner. Collectively, these results confirmed the circular nature of our transcript and excluded translation originating from possible concatemers. *

      These results are shown in new Fig. S1M and S1N and described in the text as follows: Importantly, we ruled out the possibility that templates other than the GFP-encoding circRNA drive translation from our top constructs (V-cGFP, T3-glo-cGFP and T3-U3-cGFP). Ad hoc PCRs amplifications (Fig. S1M) revealed no bands indicative of concatemer formation. The left panel of Fig. S1M schematically illustrates the primer sets and expected amplicons sizes. In brief, we used a divergent primers set spanning the BSJ (3-4) to specifically detect the mature circRNA and a set of convergent primers (1-2) pairing on the GFP ORF detecting both the circRNA and its linear precursor as well as the putative concatemer expected. Although a ~1 kb band was expected if a trans-splicing by-product was present, no such band was observed (new Fig. S1M). Moreover, RT-PCR amplification of the cGFP back-splice junction was markedly more efficient when reverse transcription was primed with random hexamers than with oligo(dT), priming total RNA or preferentially polyA+ RNA, respectively (Fig. S1M). These results are consistent with the circularity of the transcripts tested (Fig. S1M). Importantly, cGFP PCR amplifications showed similar results as a validated endogenous circRNA, namely circZNF609, used as control (Fig. S1M, right panel), confirming the circular nature of cGFP. Finally, cGFP resulted resistant to RNAseR treatment (Fig. S1N), further supporting its circular nature.”* *

      • Echoing the point above, the overall results would be stronger if the authors couldconfirm IRES activity using highly pure, in vitro transcribed RNAs that are transfected into cells * We thank the reviewer for this suggestion. Unfortunately, we are currently unable to produce synthetic circular molecules in-house, and the cost and time for purchasing synthetic ones are prohibitive. Nevertheless, we have performed the experiments described above to ensure the circularity of the transcripts tested.*

      • The authors should also confirm their IRES activity using standard dual luciferase reporter (linear) constructs which have long been a standard approach in the field. We thank the reviewer for raising this point. As recommended, we cloned our three best candidates (VCIP, T3-glo, and T3-U3) into the pRL-TK/pGL3 dual-luciferase vector to assess their IRES activity (producing the vectors VCIP-Luc, T3-glo-Luc, and T3-U3-Luc), transfected them into RD cells, and, after 24 h of incubation, measured luciferase activity to assess the IRES performance of each candidate. From our analyses, VCIP and T3-U3 confirmed their IRES activity, although showing different relative efficiency, whereas T3-glo was inactive in the linear luciferase context. This finding is consistent with previous observations (Legnini et al., 2017) showing that the performance of IRES sequences in a linear luciferase reporter may differ from their activity when driving translation from a circRNA template. Overall, these results highlight the need for further investigation into the sequences and contexts specifically governing circRNA translation, rather than relying solely on knowledge derived from linear RNAs. *The results are shown below. We did not include them in the text to not overcomplicate the readability. However, we are happy to add and discuss them if required. *

      ***

      ***

      Bar plot representing the relative luciferase activity deriving from VCIP-Luc (“V”), T3-glo-Luc (“T3-glo”), and T3-U3-Luc (“T3-U3”)*. Dual luciferase assay was performed and Renilla luciferase activity from each candidate was normalized against the Firefly luciferase. An empty ptKRL-pgl3 vector was used as reference. The ratio of each sample versus its experimental control was tested by two-tailed Student’s t test. * indicates a Student’s t test-derived p-value * *

      • Methods, Plasmids Construction Section: Rather than including long lists of oligos and forcing a reader to figure out the final product that was cloned, it would be more intuitive if the authors provided the full sequences of the ORF and IRES sequencesthat were tested. We thank the reviewer for the comment, we added the sequences to the methods (Supplementary Table 1).

      • The manuscript needs extensive English editing. Parts of it are also formatted in anunusual style, especially the introduction where it seems like each paragraph is a single sentence. As requested by the reviewer, we edited the text to make the language and content more accessible to readers.

      • References included by the authors are selective and surprisingly do not include Chen et al (2021) Mol Cell 20:4300-4318 which already defined IRES elements for circRNAs that are fairly small. *Thank you for pointing this out. We have now cited the elegant work of Chen et al. (2021, Mol Cell 20:4300–4318) in the revised manuscript. While Chen and colleagues screened IRES-like elements of roughly 200 nt, our study was designed to uncover an even more minimal motif. The elements we report are therefore markedly shorter, highlighting a complementary, rather than overlapping, aspect of IRES available for driving circRNA translation. However, we now refer to Chen et al. in our text. *

      • Error bars in Fig 2, especially Fig 2B, are huge. It seems impossible to make any conclusion given the large variety across these experiments. Thank you for your input. Although the error bars appear relatively large, the overall conclusions remain robust, as also noted by the other reviewers: both T3-glo and T3-U3 are intrinsically compact elements, yet they drive translation as efficiently as larger canonical IRESs. The error bars largely reflect the inherent variability of transient transfection assays, which naturally increases with the number of constructs examined. To strengthen our dataset without discarding existing replicates, we chose not to repeat experiments in the previously tested lines. Instead, we assessed our vectors in an additional model, the D283 medulloblastoma cell line. In this setting, we unexpectedly observed that the EMCV IRES surpasses the VCIP IRES, opposite to what we saw in the other lines, yet even here the short elements we identified remain strong competitors (new Fig. 2C, S2G, S2H). The evaluation of multiple CDSs across several cell lines, make our findings to be solid and well supported.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this brief report, the authors take advantage of circular RNA expression plasmids to define elements that can be used to enable efficient translation. They test a handful of known IRES elements as well as short translation enhancing elements (TEEs) for their ability to promote translation of circular GFP and c-ZNF609 reporters. They focus on one particular element that is of a short length and seems to work as well as longer IRES elements. My major concern relates to possible alternative sources of the translated proteins, which the authors have not ruled out (see below). I find the manuscript to be too preliminary in its current state.

      Major comments:

      • Work from the Meister group (Ho-Xuan et al 2020 Nucleic Acids Res 48:10368) has shown that apparent translation from circRNA over-expression plasmids is not from circular RNAs, but instead from trans-splicing linear by-products. The authors have not ruled out such alternative explanations here, e.g. by using deletion constructs that prevent backsplicing.
      • Echoing the point above, the overall results would be stronger if the authors could confirm IRES activity using highly pure, in vitro transcribed RNAs that are transfected into cells.
      • The authors should also confirm their IRES activity using standard dual luciferase reporter (linear) constructs which have long been a standard approach in the field.
      • Methods, Plasmids Construction Section: Rather than including long lists of oligos and forcing a reader to figure out the final product that was cloned, it would be more intuitive if the authors provided the full sequences of the ORF and IRES sequences that were tested.
      • The manuscript needs extensive English editing. Parts of it are also formatted in an unusual style, especially the introduction where it seems like each paragraph is a single sentence.
      • References included by the authors are selective and surprisingly do not include Chen et al (2021) Mol Cell 20:4300-4318 which already defined IRES elements for circRNAs that are fairly small.
      • Error bars in Fig 2, especially Fig 2B, are huge. It seems impossible to make any conclusion given the large variety across these experiments.

      Minor comments:

      • Provide a reference for the claim in the introduction that "the smaller the RNA to be circularized, the greater the circularization efficiency".
      • Supplemental Table: Please clarify what each qPCR primer was used for. E.g. what does "49 hung rev" refer to?
      • Fig 1C should be explained better. What do the numbers in white refer to? In the main text, it is written that "Furthermore, we also added TEEs elements upstream the VCIP IRES" but Fig 1C suggests they were inserted downstream.

      Referees cross-commenting

      I stand by my comments regarding the need for the authors to perform additional controls and validation.

      Significance

      This work is most relevant for researchers aiming to use circular RNAs as therapeutic modalities to express proteins. Defining optimal methods, including IRES elements, that enable maximal translational output would be helpful. Note, however, that this is far from the first study to look for IRES elements in circular RNAs (e.g. Chen et al (2021) Mol Cell 20:4300-4318) which did it in a much more extensive manner.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Circular RNAs (circRNAs) have attracted significant interest due to their unique properties, which make them promising tools for expressing exogenous proteins of therapeutic value. However, several limitations must be addressed before circRNAs can become a biologically and economically viable platform for the biotech industry. One of the main challenges is the reliance on large, highly structured sequences with internal ribosome entry site (IRES) activity to initiate translation of the downstream open reading frame. In this study, the authors propose an alternative strategy that combines the 5′ untranslated region (5′UTR) of a previously characterized natural circRNA (circZNF609) with a short 13-nt nucleotide sequence shown to act as a translational enhancer. By evaluating the activity of various constructs containing a reporter gene across multiple cell lines, they identify the most efficient and compact sequence, 63-nt long, capable of boosting translation within a circular RNA context.

      Major Comments:

      • This study is well-executed and relies on standard in vitro molecular biology techniques, which are adequate to support the conclusions drawn.
      • The experimental procedures are clearly described, and the statistical analyses have been performed according to accepted standards.

      Minor Comments:

      • The manuscript would greatly benefit from a comprehensive revision to improve clarity and language. Involving a native English speaker during the editing process could significantly enhance the manuscript's readability and overall quality. The Results section would benefit from closer attention, as certain parts of the description are at times confusing and could be clarified for better reader comprehension.
      • The references should be carefully reviewed for accuracy and consistency-for instance, references 9 and 10 appear to require correction or clarification.

      Significance

      This study addresses a critical bottleneck in RNA therapeutics. The use of the proposed short sequences could significantly enhance the in vivo activity of protein-encoding circular RNAs. A highly efficient, compact translational enhancer has the potential to substantially improve the therapeutic applicability of circRNAs and broaden their range of applications.

      Given the potential utility of these findings, we would anticipate pursuing intellectual property (IP) protection.

      To further strengthen the study, future work should include additional data on polysome association and a detailed analysis of the secondary structure of the 66-nt enhancer sequence.

      This work should be of broad interest to molecular biologists working on RNA biology, translation, and RNA-based therapeutics. I expect the identified sequence will be tested by multiple laboratories to evaluate its strength and versatility, further underscoring the potential impact of this study.

      For context, I am actively engaged in research on non-coding RNAs.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript entitled "A Short 63-Nucleotide Element Promotes Efficient circRNA Translation", Biagi et al. aim to identify sequences and layouts that would allow high expression of proteins from an engineered circular RNA (circRNA). Briefly, the authors utilize a circRNA-producing plasmid that produces a GFP protein encoded across the splice junction when translated and test different IRESs in combination with Translation Enhancing Element (TEEs). While performing these experiments they found that a short sequence containing the TEE (13-glo) is enough to promote significant levels of translation while keeping the size of the circRNA small. The authors then tested whether the presence of a spacer could help improving translation and identified a 50base sequence that in combination with the TEE can promote very efficient translation. The authors then went on and showed that this element can promote the translation from a circRNA expressing another protein (in this case was a circRNA-encoded peptide), demonstrating the versatility of this approach. Moreover, the authors showed that their approach can promote translation in other cell lines.

      This is an important and solid study that identified sequences that can improve circRNA translation and that as or more importantly are very short and hence are suitable for generating of efficient protein expressing circRNAs. This manuscript fills an important gap in the field, and it is highly significant. The study is well controlled, the rationale clear and the results conclusive with no major flaws. While this is a minor concern as the vector has been used before, it will greatly improve the quality of the paper if the authors could just verify that the vector only generates circRNA molecules and not linear concatamers. To do so the authors can focus only in their control and the most optimal transcripts and perform northern blot or well controlled RNAseR experiments to show that all RNA molecules containing the back splicing junction are circular.

      Minor comments:

      • There is a repetition of the world "a" in the abstract.
      • All circRNA translation studies should be cited when describing translation of circRNAs.

      Significance

      While other studies have identified sequences that can drive circRNA translation, this study has done a great job identifying a very short sequence and additional requirements for optimal translation. This is an important study that will be of high interest for the molecular, cell biology and general biology communities.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      A detailed response to the reviewer comments has been uploaded as a separate file. It contains several embedded figures that cannot be shown through this posting option

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Hamadou, Alunno et al. have found evidence for the notion that although translational regulation plays a key role in determining cell behavior, few studies have explored how single nucleotide polymorphisms (SNPs) affect mRNA translation. They developed a method to analyze allele-specific expression in both total and polysome-associated mRNA using RNA-seq data from HCT116 cells. This approach revealed 40 potential "tranSNPs"-SNPs linked to differences in translation between alleles. One SNP, rs1053639 (T/A) in the 3' untranslated region of the DDIT4 gene, was found to influence translation: the T allele was more often associated with polysomes. Cells engineered to carry the TT genotype produced more DDIT4 protein than those with the AA genotype, especially when exposed to stressors like Thapsigargin or Nutlin that boost DDIT4 transcription. The authors found that the RNA-binding protein RBMX mediates this allele-specific protein expression. Knocking down RBMX in TT cells lowered DDIT4 protein levels to those seen in AA cells. Functionally, TT cells suppressed mTORC1 activity more effectively under ER stress, whereas AA cells had a growth advantage in cell culture and in zebrafish models. In human cancer data from TCGA, individuals with the AA genotype had poorer outcomes under a recessive genetic model.

      The manuscript needs major revision due to additional data interpretation, lack of statistical analysis, and lack of mechanistic and causal insights. The paper is overall correlative and descriptive and has not enough data to claim a translation regulation aspect of DDIT4 and the protein product to cause the observed genotypic differences stemming from a SNP in the 3' UTR. The paper reads as a collection of individual findings that do not seem to be very cohesive and ranges from polysome-seq, RBP binding, ER stress, mTOR activity, cellular co-culture tumor models and zebrafish tumor models. I wish the authors would have focused on one aspect and described one finding well. Without addressing these fundamental concerns, the study's core claims regarding p53-dependent responses in cancer remain unsubstantiated. Overall, this reviewer supports the publication in a Review Commons journal dependent on that the points of criticism are adequately addressed in the course of a major revision.

      Major comments:

      1. Fig.1: The presentation of the location of the tranSNPs in the target mRNAs from polysome data should be presented in a schematic in Fig.1. It should be emphasized; what fold change was considered relevant to select mRNA targets. Do SNPs overlap other regulatory element in the 3' UTRs of the mRNA targets?
      2. Fig.2: If mRNA steady-state levels and protein levels are not affected by the SNP, what mechanism can be assumed for translation? Can you perform luciferase reporter mRNA experiments with the different SNPS under ER/thapsigargin stress conditions? Can you isolate the region that has the SNP and show that the effect on translation is local?
      3. Fig.3: Given the subtle differences in polysome association of mRNA distributions in the mutants, the polysomes need quantifications of the area under the curve in 3 categories: sub-polysomal, light and heavy polysomes. The overall decreased translation of all 3 mRNAs in tg-stress cells of the AA SNPs needs to be explained. This effect is not specific to DDIT4.
      4. Fig.4: The cherry-picking based on CLIP data of RBMX needs to be addressed more. A pulldown of all 3 identified RBPs needs to be done to determine if RBMX is the strongest regulator of DDIT4 via the 3' UTR. The EMSA in (A) needs to be quantified to determine the Kd. In (C) the RBMX is mainly nuclear which does not align with the translation effect on DDIT4 mRNA. Please explain. The effect on localization upon RBMX on DDIT4 protein seems subtle. Are there more dominant mechanisms at play for translation regulation other than via RBMX?
      5. Fig.5: How do you interpret the TT-specific effect on mTOR activity? Is there a link between RBMX binding, DDIT4 protein levels/activity and mTOR? The stats in (F) are missing.
      6. Fig.6: The rationale for these sets of experiments is not clear. Is it expected that the DDIT4 protein alone and its regulation through the AA phenotype is affecting global translation? Thapsigargin is a global ER stress but the expectation is not that DDIT4 itself is such a strong global regulator. This figure can move to the supplement.
      7. Fig.7: The data in (A) is very clear, can you expand a bit on that how translation regulation of the genotypes in co-culture can have such a strong effect? The data in (C) needs to be reevaluated with stats as there does not seem to be a strong difference.
      8. Fig.8: How much is the AA-induced tumor growth in zebrafish comparable to a co-culture tumour model? Again, how are the DDIT4 proteins levels derived from AA related and responsible for this?

      Minor comment:

      1. The manuscript is littered with non-intuitive abbreviations that make the figures less accessible without reading all main text. Please simplify and reduce abbreviations.

      Significance

      The manuscript needs major revision due to additional data interpretation, lack of statistical analysis, and lack of mechanistic and causal insights. The paper is overall correlative and descriptive and has not enough data to claim a translation regulation aspect of DDIT4 and the protein product to cause the observed genotypic differences stemming from a SNP in the 3' UTR. The paper reads as a collection of individual findings that do not seem to be very cohesive and ranges from polysome-seq, RBP binding, ER stress, mTOR activity, cellular co-culture tumor models and zebrafish tumor models. I wish the authors would have focused on one aspect and described one finding well. Without addressing these fundamental concerns, the study's core claims regarding p53-dependent responses in cancer remain unsubstantiated. Overall, this reviewer supports the publication in a Review Commons journal dependent on that the points of criticism are adequately addressed in the course of a major revision.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Hamadou et al. describe the functional characterization of a 3'UTR SNP (rs1053639) in the DDIT4 gene that influences mRNA localization and translation. The authors use polysome profiling, isogenic HCT116 clones, and molecular assays to link the SNP to allele-specific protein expression, proposing a mechanistic role for RBMX and potentially m6A. The manuscript is clearly written and presents compelling evidence to support the authors conclusion.

      Major Comments:

      1. The comparison between TT and AA clones relies on a very limited number of HCT116-derived edited lines. The possibility that the observed differences in DDIT4 translation are due to clonal artifacts cannot be excluded. The authors could partially address this by transfecting the luciferase reporters carrying the A or T allele into both AA and TT clones to assess whether genotype-specific effects persist independently of clone background.
      2. All functional assays are restricted to HCT116 cells. It is essential that key findings, such as especially allele-specific effects on protein levels and mRNA localization, are validated in at least one additional cell line to generalize the findings.
      3. While TT and AA clones show differences in DDIT4 protein levels, the downstream biological effects (e.g., in co-culture or zebrafish xenografts) are modest and not clearly attributable to DDIT4 expression. The authors should strengthen this connection by manipulating DDIT4 expression (e.g., knockdown or overexpression) in both genotypic backgrounds to determine whether the observed growth or localization phenotypes are DDIT4-dependent.

      Minor Comments:

      1. Fig4B: IgG controls for the RIP-qPCR are missing.
      2. Figure 7C is not properly aligned and the total proportion of cells is not 100%.
      3. The discussion section, while informative, is overly long and could be more concise and focused to improve readability and impact.

      Significance

      The authors present a novel and sound pipeline to identify SNPs that regulate mRNA translation using allelic differences in polysome association. Using this approach, they focus on rs1053639 in the 3'UTR of DDIT4 and provide convincing evidence of its impact on mRNA localization and protein expression in HCT116 cells. While the molecular findings are robust, the biological consequences appear relatively modest, and the proposed clinical relevance remains speculative at this stage.

      Overall, the study will be of primary interest to a specialized audience of researchers in the fields of post-transcriptional regulation, RNA biology, and functional genomics. The proof-of-concept framework may also attract broader interest for its potential applications in understanding non-coding genetic variation in cancer biology.

      Reviewer expertise: p53 biology, molecular cancer biology

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study investigates the role of a 3'UTR SNP variant in DDIT4 mRNA on allele specific expression at post transcriptional level. The authors have previously developed an experimental approach to identify differences in allele specific transcript distribution in polysomes vs. This was done using polysome profiling combined with RNA-seq analysis of polysome associated and total RNA fractions. This systematic approach identified 40 candidate transcripts exhibiting differential polysome association between reference and variant alleles, indicating post transcriptional effects. Focusing on DDIT4, the study demonstrated that the SNP variant alters subcellular mRNA localization patterns between cytoplasm and nucleus through an impaired interaction with a specific RNA binding protein. Since DDIT4 functions as a negative regulator of mTORC1 signalling, the study examined the mTOR pathway status in homozygous reference and variant genotypes. Using genome-edited cell lines revealed enhanced proliferative capacity of the homozygous AA variant in both co-culture assays and zebrafish xenograft models. I agree with the authors that we don't know much about allele specific effects on mRNA translation mechanisms. However this study doesn't provide much evidence for translational effects either because the differences appear to be mostly due to the impaired export of the variant RNA from the nucleus. Irrespective, the findings are very important as they show how genetic variants in non-coding regions can result in changes of expression at posttranscriptional level.<br /> A comprehensive suite of experimental approaches was utilized to systematically assess both the SNP's impact on mRNA translation and the gene specific functional consequences for DDIT4. The manuscript is well written and presents the work with great clarity.

      Major comments

      "HCT116, about 11% of genes with analyzable heterozygous SNPs show a difference in AF between paired total and polysome-bound mRNAs, suggesting allele-specific post-transcriptional and translational control." For the remaining candidate transcripts that did not undergo targeted experimental validation like DDIT4, it remains possible that the observed allele specific translational effects could be attributed to other SNPs located elsewhere within these transcripts or to combinatorial effects involving multiple variants. Have the authors considered this possibility? The authors employed RNA probes designed to mimic the secondary structures of the T and A alleles of endogenous DDIT4 mRNA. Could you clarify the exact composition of these probes, do they contain a partial DDIT4 3'UTR sequence? Is it possible that the probes lack critical sequences required for complete protein recognition? Figure 3A - the authors suggest that "in the mock condition, AA cells showed a slight reduction in translation efficiency for the DDIT4 mRNA, as revealed by higher relative abundance in lighter polysomes (fraction 9)" I am not convinced that this is the case, first because the number of ribosomes per mRNA doesn't necessarily reflect translation efficiency and also the TT seems to have increased monosome fraction, and overall to me the profile suggests of slightly reduced translation for TT. Was the nucleotide sequence of the binding site of RBMX determined and if so is this sequence present within the DDIT4 3'UTR?

      Minor

      Could the authors maybe define what is meant by "analyzable" SNPs or genes? What was the rationale for the selection of HCT116 cells, from a quick search it appears that DDIT4 effects on mTORC1 inhibition could be cell type specific ("mediates mTORC1 inhibition in fibroblasts and thymocytes, but not in hepatocytes"), have the authors considered other cell types Results section 2: Editing of HCT116 cell... I appreciate the clear methodological explanations provided in this section; however, the manuscript might benefit from more concise organization with substantial portions of this descriptive content relocated to the Methods section. Regarding statistical presentation, I recommend reporting exact significance values rather than using threshold indicators (ns, , *, etc.). This approach provides more informative and transparent statistical reporting as differences between "non-significant" and "significant" designations can be minimal neighbouring p-values that fall on opposite sides of arbitrary thresholds and may be misleadingly interpreted. For instance in Figure 2D, the comparison between TT and AA genotypes may approach statistical significance, and displaying the actual p-values would allow readers to better assess the strength of evidence. Fig 3 What is the significance of the control mRNAs? According to the plots it seems as if these also have variants TT/AA? Figure 5A why does AA clone 6 look so different on the gel? "rs1053639 genotype, a relatively common SNP" - what is the estimated frequency of the SNP?

      Significance

      It is a substantial study and a very interesting story. The findings will be of interest for a broad audience, because it combines elements of basic research and clinical significance. The work allows for interpretation of an allele specific genomic variant outside of the coding region and it reveals the importance of similar characterisation of other SNPs.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to the Reviews

      We thank the reviewers for their input and detailed feedback, which has helped us improve both the manuscript and the Microscopy Nodes software. Based on the comments, we have implemented new features, currently available as version 2.2.1 of Microscopy Nodes. We have edited the text and figures of the manuscript to reflect these changes and add clarification where needed.

      Reviewer #1

      Evidence, reproducibility and clarity

      *The work by Gros et al. presents a paper introducing Microscopy Nodes, a new plugin for Blender 3D visualization software designed to import and visualize multi-dimensional (up to 5D) light and electron microscopy datasets. Given that Blender is not directly suited for such tasks, this plugin significantly simplifies the process, making its visualization engine accessible to a wide range of researchers without prior knowledge of Blender. The plugin supports importing volumes and labels from generic TIF or modern OME-Zarr image formats and includes supplementary video tutorials on YouTube to facilitate basic understanding of the visualization workflows.

      Major comments: - The manuscript suggests that Microscopy Nodes can easily handle large datasets, as evidenced by the showcases. However, in my personal tests, I was unable to import a moderate TIF stack of about 5GB, which is considerably smaller than the showcased datasets. Post-import, a data cube was displayed, but the Blender interface became unresponsive. The manuscript should include a section stating limitations and addressing issues and providing suggestions for visualization of large datasets.*

      We want to thank the reviewer for this valuable comment, which led us to find a core issue in Blender’s large data handling. Specifically, Blender’s rasterized pipeline causes issues with > 4 GiB of data loaded. This issue does not occur in the raytraced (Cycles) renderer, which is why we had not previously encountered it.

      To address this, we have extended the reloading workflow of Microscopy Nodes to provide a workaround for this. If the data is larger than 4 Gibibytes (GiB) (per timepoint, or per timepoint per channel), Microscopy Nodes now automatically downsamples these data during import. While using these downsampled options is recommended for adjusting the visualization settings, the user can then still make their animation and reload their data to the largest scale for the final render by using the raytraced (Cycles) renderer. Additionally, we have raised this bug with the core Blender developers, and hope to work this out in the long term (blender/blender#136263).

      We reflect these changes in the manuscript in the segment:

      “Blender currently has a notable limitation that its default ‘quick’ rasterized rendering engines (such as ‘EEVEE’, but also the viewport ‘Surface’ and ‘Wireframe’ modes) do not support more than 4 Gibibytes (GiB) of volumetric data. The raytracing render mode ‘Cycles’, however, can handle large volumetric data. To allow users with large data to flexibly use Microscopy Nodes, we implemented a reloading scheme, where one first loads a smaller version of the data (under 4 GiB per timeframe for all loaded channels combined) - and only upon final render in Cycles, exchange it for the full/larger scale copy (Fig 3A). This downscaling of data offers additional benefits as it allows for fast adjustment of the render settings on e.g. a personal computer which can eventually be transferred to a larger workstation or HPC cluster for the final render at full resolution. This feature is critical as working in Cycles with larger files requires sufficient RAM to fit the (temporary) VDB files comfortably. For example, multiple figures in this manuscript were made on a 32GB RAM M1 Macbook Pro (Fig 1A, Video SV1, Fig 1D, Figure 2A-D, Fig S2A-B), but for larger data or long movies the movies were made on workstations or prepared on a laptop and then transferred to an HPC cluster for final rendering.”

      * - The feature of importing Zarr-datasets over HTTP is great, but the import process was very slow in my tests, even on a robust network. For reference, loading 1.8 GB of the PRPE1_4x dataset at s1 level took 52 minutes. This raises concerns about potential code issues and general usability of the suggested workflow.*

      We believe that this loading time may have been caused by the same issue that plagued all of our datasets of >4GB outside of the raytraced mode, as we have not seen loading issues like that. Moreover, Microscopy Nodes now supports Zarr version to Zarr 3/OME-Zarr 0.5, which allows ‘sharded’ Zarr datasets, which should be even faster at loading large blocks of data at the same time, as Microscopy Nodes does.

      - The onsite documentation is a bit outdated and fails to fully describe the plugin settings.

      We have updated our documentation to offer new written tutorials, which include full start-up tutorials, but also for some key extra instructions.

      - The YouTube tutorials feature an outdated version of the plugin, which could confuse the general microscopy audience. These should be updated to better align with the current plugin functionality. Additionally, using smaller, easily accessible datasets for these tutorials would improve user testing experiences. Hosting complete (downsampled) demo project folder on platforms like zenodo.org could also enhance usability of such tutorials.

      We have made a new series of YouTube tutorials that align with the current interface of Microscopy Nodes. These tutorials include public datasets, allowing users to follow along easily. We have chosen to also retain the older tutorials for users running legacy versions of the plugin, as they cover different workflows.

      - The manuscript describes a novel dataset used in Fig. 2, but no reference is provided. Additionally, practical implementation of the coloring description for Fig. 2D can be unclear for inexperienced users, necessitating either step-by-step instructions or the provision of downsampled Blender files to aid understanding.

      We have now shared the OME-Zarr address in the text (https://uk1s3.embassy.ebi.ac.uk/idr/share/microscopynodes/FIBSEM_dino_masks.zarr), and included this both in the manuscript and the tutorials. Additionally, to guide the implementation and explain the logic behind the coloring we introduced additional panels in Fig S1 and Fig S2 to showcase the shader setups used for this image.

      [OPTIONAL] When importing labels, they can be assigned to individual materials only if initially split into multiple color channels. It would be great if the same logic is implemented when those materials are provided as indices within a single color channel. There can be a switch to define the logic used during the import process: e.g. the current one, when the objects are just colored based on a color map, or when they are arranged as individual materials as done when labels are imported from multiple color channels.

      We agree with the reviewer and to address this concern with the update to version 2.2, we have implemented a new colorpicking system (See Fig 3B, inset 3, Fig 3C), this allows users to choose between a single color, various continuous, or categorical color maps.

      Minor comments: - The manuscript shows nice visualizations of time series, light, and electron microscopy datasets, but in its current state, it is targeted more for light microscopy, where the signal is white. On the other hand, many EM datasets are rendered in inverted contrast (TEM-like), where the signal is black. To render such volume properly, it is needed to go into the Shading tab and flip the color ramp. Would it be possible to perhaps define the data type during import to accommodate various data types or perhaps select the flipped color ramp when the emission mode is switched off? It could make it easier for inexperienced EM users to use the plugin.

      To address this, we include new default settings, with ‘invert colormaps on load’ option in the preferences, and default colors per channel (See Fig S4). We have also implemented a new color picking system in version 2.2 (See Fig 3B, inset 3, Fig 3C) that hopefully makes it easier before and after load to change colors.

      - It was not completely clear to me whether it is possible to render a single/multiple EM slices using the inverted (TEM-like) contrast. For example, XY, XZ, YZ ortho slices across the volume. The manuscript contains: "This visualization is also supported in Blender, allowing for arbitrary selections of viewing angles (Fig 2B).", but it is not clear how to achieve that.

      We introduced an additional explanation in Fig S1A and added a separate density window in the default shader to make this opaque view easier. To get a single slicing plane, users can reduce the scale of the slicing cube in one axis, at it is now also explained in Fig S2B.

      - In 3D microscopy, it is quite common to have data with anisotropic voxels. As a result, the surfaces may require smoothing. I was not able to quickly find a way to smooth the surfaces (at least smooth modifiers for surfaces did not work for me). Is it possible to apply smoothing during the import of labels, or alternatively, smoothing of the generated surfaces can be a topic for an additional YouTube video.

      The smoothness of the loaded masks can be indirectly affected in the preferences by changing the mesh resolution (changing the relative amount of vertices per pixel), but can be further affected by operations such as the Blender “Smooth” or e.g. the “Smooth by Laplacian” modifiers. To guide the users in doing so, we have included instructions for smoothing in the written tutorials on the website https://aafkegros.github.io/MicroscopyNodes/tutorials/surface_smoothing/ .

      - It is also typical to have somewhat custom color maps for materials. It would be great if the plugin remembers the previously used color map for labels.

      We have implemented new Preference settings, which include default colors and colormaps per channel, improving customization and reproducibility. This new option is described in Figure S4.

      * - The pixel size edit box rounds up the values to 2 digits after the dot. Could it be changed to accommodate 3 or 4 digits as the units are um.*

      Blender’s interface truncates the display, but stores higher-precision values internally, and become visible when users click or edit the values. We have added support for alternative pixel units to reduce the impact of the truncation.

      - Import is not working when: - Start Blender - Select Data storage: with project - Overwrite files: on, set env: on, chunked: on - Select a file to import - Save Blender file - Pressing the Load button gives an error: "Empty data directory - please save the project first before using With Project saving."

      We thank the reviewer for finding this bug which is now fixed in version 2.2.

      - I was not able to play the downloaded supplementary video 3 using my VLC media player, while it was working fine in a browser. The video can be opened but looks distorted and heavily zoomed in. It may need to be re-saved from a video editor.

      We have recompiled this video.

      - References 12 and 16 are URL links instead of proper references to articles.

      Thanks for catching this mistake in our bibliography. We have corrected this.

      Significance

      *This work effectively bridges a gap in the availability of tools for 3D microscopy dataset visualization. While many visualization programs exist, the high-quality ones are often expensive and thus not accessible to all researchers. The integration of Blender with Microscopy Nodes democratizes access to high-quality 3D visualization, enabling researchers to explore datasets and models from multiple perspectives, potentially leading to new discoveries and enhancing the understanding of key study findings. Despite its limitations, my experience with the plugin was engaging and useful. I would like to thank the authors for such useful work!

      Limitations: - There remains a steep learning curve associated with using Microscopy Nodes, primarily due to Blender's complexity. More comprehensive tutorials could help mitigate this. - The conversion of imported images to Blender's internal 32-bit format results in a 4x increase in data size for 8-bit datasets. - Managing moderate-sized volumes (5-10 GB) can be challenging without clear strategies for effective handling. - The import of Zarr-datasets over the net is notably slow.

      Audience: The plugin is suitable for a broad audience with a basic understanding of 3D visualization concepts, providing a solid foundation for exploring Blender's extensive features and options for optimal visualizations.

      Reviewer expertise: Light microscopy, electron microscopy, image segmentation and analysis, software development, no experience with Blender*

      Reviewer #2

      *Evidence, reproducibility and clarity *

      *Summary:

      The article introduces Microscopy Nodes, a Blender add-on designed to simplify the loading and visualization of 3D microscopy data. It supports TIF and OME-Zarr images, handling datasets with up to five dimensions. The authors present different visualization modes, including volumetric rendering, isosurfaces, and label masks, demonstrating the application in light and electron microscopy. They provide examples using expansion microscopy, electron microscopy, and real-time imaging, highlighting how the tool enhances scientific communication and interactive visualization.

      Comments:

      However, some key aspects could be improved to enhance usability and reproducibility:

      Example datasets: The images used in the YouTube tutorials were not accessible, making it difficult to reproduce the workflows shown in the figures and tutorials. It would be helpful if the authors provided direct links to the datasets or ensured that the same examples used in the tutorials were readily available for replication.*

      We created new and updated tutorials and for all new tutorials, the data is now easily available from an S3 server.

      Input file specifications: The article does not clearly detail how input files should be formatted. Many users will pre-visualize images in Fiji to convert their original images to a compatible format. It would be beneficial to specify which formats are supported for hyperstack creation, including details on bit depth, dimension ordering, label formats, and metadata compatibility, if applicable.

      We have added new documentation on this on the website and in the manuscript. The addon can take 8, 16, and 32 bit data, and any dimension order (with the letters tzcyx) and pixel size. Dimension order and pixel size can be edited in the GUI. This is reflected in the manuscript in the rewritten section in Design and Implementation:

      “It can handle 8bit to 32bit integer and floating point data, although all data types will be resaved into 32bit floating point VDB files, which can cause temporary files to take up more space than the original. Microscopy Nodes loads 2D to 5D files of containing data across time, z, y, x and channels, in arbitrary order (can be remapped in the user interface as well, Fig 3B, inset 2). To focus on relevant data, users can clip the time axis, which can be useful for long videos.”

      * Hardware requirements: The article does not discuss RAM or hardware constraints in detail. In testing, attempting to load two images into the same project caused the program to freeze (tested on Mac M1). Specifying hardware requirements and limitations would help users manage expectations when working with large datasets.*

      We have since found a limitation in the Blender engine that indeed limits the amount of data loaded (see also comment by Reviewer 1). Currently, rasterized engines are capped at 4 GiB, and only the raytraced engine can handle larger data. As such, the Microscopy Nodes pipeline, where one works with small images until it is time to render a final version, and the data is only exchanged for the final render, is still viable. To make this easier, we now also included optional downscaling for Tif images. This is described in the rewritten section on Design and Implementation:

      “Blender currently has a notable limitation that its default ‘quick’ rasterized rendering engines (such as ‘EEVEE’, but also the viewport ‘Surface’ and ‘Wireframe’ modes) do not support more than 4 Gibibytes (GiB) of volumetric data. The raytracing render mode ‘Cycles’, however, can handle large volumetric data. To allow users with large data to flexibly use Microscopy Nodes, we implemented a reloading scheme, where one first loads a smaller version of the data (under 4 GiB per timeframe for all loaded channels combined) - and only upon final render in Cycles, exchange it for the full/larger scale copy (Fig 3A). This downscaling of data offers additional benefits as it allows for fast adjustment of the render settings on e.g. a personal computer which can eventually be transferred to a larger workstation or HPC cluster for the final render at full resolution. This feature is critical as working in Cycles with larger files requires sufficient RAM to fit the (temporary) VDB files comfortably. For example, multiple figures in this manuscript were made on a 32GB RAM M1 Macbook Pro (Fig 1A, Video SV1, Fig 1D, Figure 2A-D, Fig S2A-B), but for larger data or long movies the movies were made on workstations or prepared on a laptop and then transferred to an HPC cluster for final rendering.”

      Significance

      *General Assessment:

      One of the major strengths of this work is its seamless compatibility with Blender, a powerful and widely used animation and 3D rendering tool. Integrating advanced visualization techniques from the animation and graphics industry into scientific imaging opens new possibilities for presenting complex microscopy data in an intuitive and accessible way. Additionally, the support for OME-Zarr is particularly valuable, as this format represents a major shift in bioimaging towards scalable, cloud-compatible, and standardized data storage solutions. The adoption of OME-Zarr facilitates large-scale data handling and improves interoperability across imaging platforms, making this integration a significant step forward for the field. Overall, the greatest strength of the tool lies in its flexibility for rendering microscopy data, but its accessibility for users without Blender experience might be a challenge.

      Advance in the Field This work introduces a novel solution to the visualization challenges in microscopy by leveraging Blender's advanced rendering capabilities.

      Audience This paper will be of interest to: Bioimage researchers seeking to enhance their microscopy data visualization. Image analysis tool developers interested in integrating advanced visualization into their workflows.

      Field of Expertise This review is based on expertise in image analysis, segmentation, and 3D biological data visualization.*

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The paper "Microscopy Nodes: Versatile 3D Microscopy Visualization with Blender" presents an easy and accessible approach for microscopists and microscopy users to visualize their data in a different and more controlled way. The authors have developed a plug-in script that enables the integration of complex 3D datasets into Blender, a widely used software for 3D visualization and illustration. By leveraging Blender's advanced rendering engine, the plug-in provides greater control over the scene, enviromint and presentation of the 3D data.

      I believe that this development, especially when combined with additional analysis tools can be of a great value for microscopist and advanced users to presenting their 3D data sets.

      However, at this stage, the paper does not seem to fully demonstrate the benefits of using Microscopy Nodes. To enhance the paper impact, it would be helpful for the authors to further emphasize and provide examples of how Blender's rendering specifically improves data presentation and, in turn, enhances the understanding of the data compared to existing solutions. Specifically, the authors claim at the end of the introduction that their development provides powerful tools for high-quality, visually compelling presentations, enabling "more effective communication of 3D biological data." I believe this statement should be supported by a figure comparing currently available visualization methods and demonstrating how using Blender enhances data presentation and by which enhances the communication of the results. *

      *Additionally, at the end of the first paragraph of the results, the authors say: "These options allow us to combine the data and its analyzed interpretation in the same representation with Microscopy Nodes." However, this capability already exists in currently available software. Aside from now being able to achieve this in Blender, what additional benefits does it offer? *

      We now include a new Table 1, to showcases which requirements for visualizing complex biological data are available in different visualization software, and discuss this in the text:

      “Although several tools for 3D visualization of bioimages already exist and offer essential features for microscopy data (Table 1), many are proprietary, and open-source alternatives often struggle to deliver a comprehensive user experience, such as advanced animation and annotation controls. Proprietary solutions may offer some of these capabilities, but they are frequently limited by licensing costs, platform restrictions, and a lack of customizability. In contrast, Blender is a mature, well-supported open-source platform with a large community of developers that excels in both animation and visualization. By integrating microscopy-specific functionality through Microscopy Nodes, Blender becomes a uniquely powerful solution that bridges the gap between high-end graphics capabilities and the specialized needs of bioimage visualization.”

      Additionally, we attempted to remake Figure 2C and 2D in the EM-field standard software Amira, but were not able to. This is because without an advanced light scattering algorithm, it is very hard to see the depth in the nucleus, and the semi-transparent masks do show each other behind them, but cannot interact with the volume.

      We chose not to include this in the actual manuscript, as we are not experts at the Amira software, and will, by the nature of this manuscript, present a challenge that Blender is especially good at, such as here the combination of scattering light and semitransparent masks.

      * In the last sentence of the second paragraph of the results, it is stated: "Blender powered by Microscopy Nodes: the ability to combine microscopy data with any 3D illustration in the same 3D environment." Could you please elaborate on the accuracy of the models that can be built and provide guidelines for achieving this using the data coordinates imported by Microscopy Nodes? If the illustrations are purely freehand and do not require specific accuracy, it would be helpful to clarify the advantages of creating them within the same environment rather than separately, as many scientists currently do. Additionally, if the inclusion of 3D model illustrations is one of the key advantages of using Blender, I believe it would be beneficial to present this in a figure rather than only in the supplementary video. *

      We thank the reviewer for this comment and agree that in the previously submitted version of Microscopy Nodes, it was very difficult to align objects accurately, as the coordinate space was not transparent. A hurdle in this was the fact that Blender only works well with the unit ‘meters’. To address this issue, we now provide a choice of mapping the physical size to meters, as shown in the new interface (See Fig 3B, inset 5). Here the user can choose from the default ‘px -> cm’ (this will always look fine for a quick look) to options such as ‘nm -> m’ or ‘µm -> m’, which, combined with the new choice for adjusting the object origin upon load, allow users to treat the Blender coordinate space as based on the actual physical scales. Additionally, other Blender addons, such as Molecular Nodes (Reference 25 of the manuscript), also allow for accurate localization for cryo-EM datasets.

      We appreciate the note that we should more clearly display the ability to show our illustrations and the data together in the figure and have added a visualization to show this in Figure 1C.

      * Reviewer #3 (Significance (Required)):

      The significance of the paper at this stage is primarily technical and mainly relevant to the field of microscopy

      My field of expertise is microscopy and 3D visualization of models using mainly Maya3D and AMIRA.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The paper "Microscopy Nodes: Versatile 3D Microscopy Visualization with Blender" presents an easy and accessible approach for microscopists and microscopy users to visualize their data in a different and more controlled way. The authors have developed a plug-in script that enables the integration of complex 3D datasets into Blender, a widely used software for 3D visualization and illustration. By leveraging Blender's advanced rendering engine, the plug-in provides greater control over the scene, enviromint and presentation of the 3D data.

      I believe that this development, especially when combined with additional analysis tools can be of a great value for microscopist and advanced users to presenting their 3D data sets.

      However, at this stage, the paper does not seem to fully demonstrate the benefits of using Microscopy Nodes. To enhance the paper impact, it would be helpful for the authors to further emphasize and provide examples of how Blender's rendering specifically improves data presentation and, in turn, enhances the understanding of the data compared to existing solutions.

      Specifically, the authors claim at the end of the introduction that their development provides powerful tools for high-quality, visually compelling presentations, enabling "more effective communication of 3D biological data." I believe this statement should be supported by a figure comparing currently available visualization methods and demonstrating how using Blender enhances data presentation and by which enhances the communication of the results.

      Additionally, at the end of the first paragraph of the results, the authors say: "These options allow us to combine the data and its analyzed interpretation in the same representation with Microscopy Nodes." However, this capability already exists in currently available software. Aside from now being able to achieve this in Blender, what additional benefits does it offer?

      In the last sentence of the second paragraph of the results, it is stated: "Blender powered by Microscopy Nodes: the ability to combine microscopy data with any 3D illustration in the same 3D environment." Could you please elaborate on the accuracy of the models that can be built and provide guidelines for achieving this using the data coordinates imported by Microscopy Nodes? If the illustrations are purely freehand and do not require specific accuracy, it would be helpful to clarify the advantages of creating them within the same environment rather than separately, as many scientists currently do. Additionally, if the inclusion of 3D model illustrations is one of the key advantages of using Blender, I believe it would be beneficial to present this in a figure rather than only in the supplementary video.

      Significance

      The significance of the paper at this stage is primarily technical and mainly relevant to the field of microscopy

      My field of expertise is microscopy and 3D visualization of models using mainly Maya3D and AMIRA.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The article introduces Microscopy Nodes, a Blender add-on designed to simplify the loading and visualization of 3D microscopy data. It supports TIF and OME-Zarr images, handling datasets with up to five dimensions. The authors present different visualization modes, including volumetric rendering, isosurfaces, and label masks, demonstrating the application in light and electron microscopy. They provide examples using expansion microscopy, electron microscopy, and real-time imaging, highlighting how the tool enhances scientific communication and interactive visualization.

      Comments:

      However, some key aspects could be improved to enhance usability and reproducibility:

      Example datasets: The images used in the YouTube tutorials were not accessible, making it difficult to reproduce the workflows shown in the figures and tutorials. It would be helpful if the authors provided direct links to the datasets or ensured that the same examples used in the tutorials were readily available for replication.

      Input file specifications: The article does not clearly detail how input files should be formatted. Many users will pre-visualize images in Fiji to convert their original images to a compatible format. It would be beneficial to specify which formats are supported for hyperstack creation, including details on bit depth, dimension ordering, label formats, and metadata compatibility, if applicable.

      Hardware requirements: The article does not discuss RAM or hardware constraints in detail. In testing, attempting to load two images into the same project caused the program to freeze (tested on Mac M1). Specifying hardware requirements and limitations would help users manage expectations when working with large datasets.

      Significance

      General Assessment:

      One of the major strengths of this work is its seamless compatibility with Blender, a powerful and widely used animation and 3D rendering tool. Integrating advanced visualization techniques from the animation and graphics industry into scientific imaging opens new possibilities for presenting complex microscopy data in an intuitive and accessible way. Additionally, the support for OME-Zarr is particularly valuable, as this format represents a major shift in bioimaging towards scalable, cloud-compatible, and standardized data storage solutions. The adoption of OME-Zarr facilitates large-scale data handling and improves interoperability across imaging platforms, making this integration a a significant step forward for the field. Overall, the greatest strength of the tool lies in its flexibility for rendering microscopy data, but its accessibility for users without Blender experience might be a challenge.

      Advance in the Field

      This work introduces a novel solution to the visualization challenges in microscopy by leveraging Blender's advanced rendering capabilities.

      Audience

      This paper will be of interest to: Bioimage researchers seeking to enhance their microscopy data visualization. Image analysis tool developers interested in integrating advanced visualization into their workflows.

      Field of Expertise

      This review is based on expertise in image analysis, segmentation, and 3D biological data visualization.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The work by Gros et al. presents a paper introducing Microscopy Nodes, a new plugin for Blender 3D visualization software designed to import and visualize multi-dimensional (up to 5D) light and electron microscopy datasets. Given that Blender is not directly suited for such tasks, this plugin significantly simplifies the process, making its visualization engine accessible to a wide range of researchers without prior knowledge of Blender. The plugin supports importing volumes and labels from generic TIF or modern OME-Zarr image formats and includes supplementary video tutorials on YouTube to facilitate basic understanding of the visualization workflows.

      Major comments:

      • The manuscript suggests that Microscopy Nodes can easily handle large datasets, as evidenced by the showcases. However, in my personal tests, I was unable to import a moderate TIF stack of about 5GB, which is considerably smaller than the showcased datasets. Post-import, a data cube was displayed, but the Blender interface became unresponsive. The manuscript should include a section stating limitations and addressing issues and providing suggestions for visualization of large datasets.
      • The feature of importing Zarr-datasets over HTTP is great, but the import process was very slow in my tests, even on a robust network. For reference, loading 1.8 GB of the PRPE1_4x dataset at s1 level took 52 minutes. This raises concerns about potential code issues and general usability of the suggested workflow.
      • The onsite documentation is a bit outdated and fails to fully describe the plugin settings.
      • The YouTube tutorials feature an outdated version of the plugin, which could confuse the general microscopy audience. These should be updated to better align with the current plugin functionality. Additionally, using smaller, easily accessible datasets for these tutorials would improve user testing experiences. Hosting complete (downsampled) demo project folder on platforms like zenodo.org could also enhance usability of such tutorials.
      • The manuscript describes a novel dataset used in Fig. 2, but no reference is provided. Additionally, practical implementation of the coloring description for Fig. 2D can be unclear for inexperienced users, necessitating either step-by-step instructions or the provision of downsampled Blender files to aid understanding.

      [OPTIONAL] When importing labels, they can be assigned to individual materials only if initially split into multiple color channels. It would be great if the same logic is implemented when those materials are provided as indices within a single color channel. There can be a switch to define the logic used during the import process: e.g. the current one, when the objects are just colored based on a color map, or when they are arranged as individual materials as done when labels are imported from multiple color channels.

      Minor comments:

      • The manuscript shows nice visualizations of time series, light, and electron microscopy datasets, but in its current state, it is targeted more for light microscopy, where the signal is white. On the other hand, many EM datasets are rendered in inverted contrast (TEM-like), where the signal is black. To render such volume properly, it is needed to go into the Shading tab and flip the color ramp. Would it be possible to perhaps define the data type during import to accommodate various data types or perhaps select the flipped color ramp when the emission mode is switched off? It could make it easier for inexperienced EM users to use the plugin.
      • It was not completely clear to me whether it is possible to render a single/multiple EM slices using the inverted (TEM-like) contrast. For example, XY, XZ, YZ ortho slices across the volume. The manuscript contains: "This visualization is also supported in Blender, allowing for arbitrary selections of viewing angles (Fig 2B).", but it is not clear how to achieve that.
      • In 3D microscopy, it is quite common to have data with anisotropic voxels. As a result, the surfaces may require smoothing. I was not able to quickly find a way to smooth the surfaces (at least smooth modifiers for surfaces did not work for me). Is it possible to apply smoothing during the import of labels, or alternatively, smoothing of the generated surfaces can be a topic for an additional YouTube video.
      • It is also typical to have somewhat custom color maps for materials. It would be great if the plugin remembers the previously used color map for labels.
      • The pixel size edit box rounds up the values to 2 digits after the dot. Could it be changed to accommodate 3 or 4 digits as the units are um.

      • Import is not working when:

      • Start Blender
      • Select Data storage: with project
      • Overwrite files: on, set env: on, chunked: on
      • Select a file to import
      • Save Blender file
      • Pressing the Load button gives an error: "Empty data directory - please save the project first before using With Project saving."
      • I was not able to play the downloaded supplementary video 3 using my VLC media player, while it was working fine in a browser. The video can be opened but looks distorted and heavily zoomed in. It may need to be re-saved from a video editor.
      • References 12 and 16 are URL links instead of proper references to articles.

      Significance

      This work effectively bridges a gap in the availability of tools for 3D microscopy dataset visualization. While many visualization programs exist, the high-quality ones are often expensive and thus not accessible to all researchers. The integration of Blender with Microscopy Nodes democratizes access to high-quality 3D visualization, enabling researchers to explore datasets and models from multiple perspectives, potentially leading to new discoveries and enhancing the understanding of key study findings. Despite its limitations, my experience with the plugin was engaging and useful. I would like to thank the authors for such useful work!

      Limitations:

      • There remains a steep learning curve associated with using Microscopy Nodes, primarily due to Blender's complexity. More comprehensive tutorials could help mitigate this.
      • The conversion of imported images to Blender's internal 32-bit format results in a 4x increase in data size for 8-bit datasets.
      • Managing moderate-sized volumes (5-10 GB) can be challenging without clear strategies for effective handling.
      • The import of Zarr-datasets over the net is notably slow.

      Audience: The plugin is suitable for a broad audience with a basic understanding of 3D visualization concepts, providing a solid foundation for exploring Blender's extensive features and options for optimal visualizations.

      Reviewer expertise: Light microscopy, electron microscopy, image segmentation and analysis, software development, no experience with Blender

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      In Arabidopsis, DNA demethylation is catalyzed by a family of DNA glycosylases including DME, ROS1, DML2, and DML3. DME activity in the central cell leads to the hypomethylation of maternal alleles in endosperm. While ROS1, DML2, and DML3 function in vegetative tissues to prevent spreading DNA methylation from TE boundaries, their function in the endosperm was unclear.<br /> Using whole genome methylome analysis, the authors showed that ROS1 prevents hypermethylation of paternal alleles in the endosperm thus promotes epigenetic symmetry between maternal and paternal genomes.<br /> The approach and experimental desighs are appropriate, and the key conclusions are adequately supported by the results.<br /> However, there is not sufficient evidence to support the claim that DME demethylates the maternal allele at ROS1-dependent biallelically-demethylated regions. To clarify the issue, the authors could analyze if there is an overlap between DMRs identified in ros1 endosperm and those identified in dme endosperm using published data. If there is any, the authors could show a genome browser example of DMR including dme data.

      Response: Thank you for your insight on our work. To address your concern and further test our model that DME prevents methylation of the maternal allele at regions where ROS1 is prevents methylation of the paternal allele, we turned to the allele-specific bisulfite-sequencing data published in Ibarra et al 2012. These data were from endosperm isolated at 7-8 DAP from aborting seeds of dme-2 +/- (Col-gl) plants pollinated by L_er_. Our analysis of these data is now included in Figures 6 and 7 and Supplemental Figures 13-17. We show that when the loss-of-function allele dme-2 is inherited maternally, average methylation of the maternal allele increases at ROS1-dependent regions (in the revised version of the paper now referred to as ROS1 paternal, DME maternal regions) from less than 10% CG methylation to approximately 40% CG methylation (Fig. 6D), consistent with our previous analysis using the non-allelic Hsieh et al 2009 data (now moved to Supplemental Figure 15). These results thus provide additional evidence that DME removes maternal allele methylation at regions where ROS1 removes paternal allele methylation (compare Fig. 6B and 6D). We included relevant genome browser examples in Figure 7E and Supplemental Figure 14. In the revised version, the relationship between ROS1 and DME is further expanded upon in the text.

      Reviewer #1 (Significance):

      Endosperm is a tissue unique to flowering plants. Though it is an ephemeral tissue, the endosperm plays essential roles for seed development and germination. The endosperm is also the site genomic imprinting occurs, and it has a distinct epigenomic landscape. This work provides a new insight that ROS1 may antagonize imprinted gene expression in the endosperm. However, it was not shown whether imprinted gene expression is indeed affected in ros1, or whether the ros1 mutation has phenotypic consequences. These results would be useful to discuss the evolution and significance of genomic imprinting.

      Response: We agree that the biological significance of ROS1-mediated paternal allele demethylation is presently unknown. We performed RNA-seq on wild-type and ros1 3C and 6C endosperm nuclei, but these data were unfortunately not of high enough quality to include in the manuscript. In the Discussion we suggest that disrupting ROS1-mediated paternal allele demethylation might lead to a gain of imprinting over evolutionary time. In future work we are planning to address potential relationships to gene imprinting using a molecular, RNA-sequencing approach as well as an evolutionary comparative approach. As expected, given the expectation that imprinted genes are associated with a parent-of-origin specific epigenetic mark, we did not find any relationship between known imprinted genes and ROS1-dependent regions that are biallelically-demethylated regions in wild-type endosperm (see lines 362-372).

      Reviewer #2 (Evidence, reproducibility and clarity):

      SUMMARY

      Hemenway and Gehring present evidence that the paternal genome in Arabidopsis endosperm is demethylated at several hundred loci by the DNA glycosylase/lyase ROS1. The evidence is primarily based on analysis of DNA methylation of ros1 mutants and of hybrid crosses where each parental genome can be differentiated by SNPs. I have some comments/questions/concerns, two of them potentially serious, but I think Hemenway and Gehring can address them through additional analyses of data that they already have available and a bit of clarification in writing.

      Response: Thank you for your thoughtful review of this study. Your insight and suggestions have helped add clarity to the paper.

      MAJOR COMMENTS:

      1. Could the excess methylation in ros1-3 relative to ros1-7 shown in Figures 1A and 1C be explained by a second mutation in the ros1-3 background that elevates methylation at some loci? Any mutation that increased RdDM at these loci, for example could have this effect. This could confound the identification and interpretation of biallelicly demethylated loci.

      Response: We propose a simpler explanation for the additional hypermethylation observed in ros1-3: ros1-3 is a loss-of-function (null) allele whereas ros1-7 is likely a hypomorphic allele. For clarity, we have added a diagram of all of the alleles used in this study as Supplemental Figure 1B. The ros1-3 allele was first described in Penterman et al, PNAS, 2007. It is a T-DNA insertion allele that was isolated in the Ws accession and then backcrossed 6 times to Col-0, greatly minimizing the risk of unlinked secondary mutations being present. There is no genetic evidence that there is another T-DNA insertion in this line. The ros1-7 allele was described in Williams et al, Plos Genet, 2015. It was isolated from the Arabidopsis Col-0 TILLING population and is missense mutation (E956K) in a residue in the glycosylase domain that is conserved among the four DNA glycosylases. It is known that ROS1 transcripts are produced from the ros1-7 allele (Williams et al 2015). We observe less hypermethylation in the ros1-7 background compared to the ros1-3 background, and thus propose that the ros1-7 allele is a hypomorphic allele of ROS1. The use of two independent ros1 mutant alleles for initial endosperm methylation profiling strengthens the findings of our study. Importantly, regions that are hypermethylated in ros1-3 are also hypermethylated in ros1-7, but to a lesser extent, and vice versa (Fig 1D, Supplemental Figs. 3 and 4).

      We also use a third allele in this study, ros1-1, which is a nonsense allele in the C24 accession. Notably, we find that the regions are demethylated on both maternal and paternal alleles in wild-type C24 gain DNA methylation primarily on the paternal allele in ros1-1 endosperm (Figure 4C,D and Supplemental Figure 10). This is discussed further in response to your second point.

      Given these lines of evidence, a gain-of-function mutation in a methylation pathway, like RdDM, in the ros1-3 background is an unlikely explanation for increased hypermethylation compared to ros1-7. The use of three independent ros1 alleles for methylation profiling, all of which lead to the same conclusions, is a major strength of our study.

      1. It appears that the main focus of the manuscript, the existence of loci that are paternally demethylated by ROS1, is supported by a set of 274 DMRs. This is a small number relative to the size of the genome and raises suspicions of rare false positives. Even the most stringent p-values that DMR-finding tools report do not guarantee that the DMRs are actually reproducible in an independent experiment. Demonstrating overlap between these 274 DMRs and an independently defined set using a different WT control and different ros1 allele would suffice to remove this concern. It appears that authors already have the needed raw data with ros1-1 and ros1-7 alleles.

      Response: First, we should clarify that paternal demethylation by ROS1 is supported by more than the 274 DMRs. All ros1 CG hyperDMRs show an increase in paternal allele methylation in ros1 (Fig. 4B,D). The 274 DMRs are a distinct subset defined as having less methylation on the maternal allele than the paternal allele in ros1 endosperm and where there is no maternal allele hypomethylation in wild-type endosperm (refer to Fig. 5B).

      We agree with your sentiments about DMR-finders and we are cautious of relying exclusively on DMR calls when making conclusions. We verify the nature of identified DMRs using metaplots and weighted average comparisons throughout the paper, which we think increases confidence in the conclusions and goes beyond a simple DMR-calling approach.

      We argue that we have replicated the major conclusion of the paper, that ROS1 prevents paternal allele hypermethylation at target regions in the endosperm, in the following ways:

      1. In the dataset without allelic-specific methylation information (Figures 1-3), we found that both ros1-3 and ros1-7 CG hyperDMRs have a limited capacity for hypermethylation in the endosperm relative to leaf or sperm (Table 1, Fig 3, Supplemental Fig. 4). In the allele-specific dataset, ros1-3 CG hyperDMRs were revealed to have particularly low maternal mCG relative to paternal mCG in ros1 mutant endosperm (Fig 4A-B, Supplemental Fig. 10).
      2. We found that ros1-3 and ros1-1 hyperDMRs, which we identified using non-allelic data, are biased for paternal allele hypermethylation in the endosperm of F1 hybrids (Fig 4B,D). The replicability of the paternal bias in hypermethylation in both ros1-3 in the Col-0 ecotype and ros1-1 in the C24 ecotype is a critical result, and we have moved the ros1-1 hyperDMR plots from the supplement to main figure 4C-D in the revised version of the manuscript as a result of your comment.
      3. The 274 DMRs identified as “biallelically-demethylated, ROS1-dependent” are by definition replicated between reciprocal cross directions. (Note that we now refer to these regions as ROS1 paternal, DME maternal regions in the revision.) Regions in this category had to be called as maternally-hypomethylated in both ros1-1 x ros1-3 and ros1-3 x ros1-1 endosperm. These regions also had to not be identified as maternally-hypomethylated in both C24 x Col-0 and Col-0 x C24. We hope this is clarified for readers by Table 1, which we have included based on your suggestion in comment #3, as well as other clarifying edits we made in this section of the paper.comparisons between maternal and paternal methylation in endosperm, DMRs defined by comparison between mutants and wildtype, and more. These need clearer descriptions of which sets are being referred to throughout the main text and in figure legends. A table summarizing them might help (not in the supplement). Use of consistent and precisely defined terms would help. Stating the number of DMRs along with the name for each set would help a lot, even though this would make for some redundancy. (The number of DMRs in each set not only helps with interpretation but also act as a sort of ID). The reason I put this as a major concern is because the text and figures are difficult to understand, and it is currently hard to evaluate both the results and the authors' conclusions from those results.

      Response: Thank you for your feedback and suggestions. We have edited the main text so that only one descriptive name is used for each DMR type throughout the paper. We have also renamed regions for greater clarity. The previous “ROS1-independent, maternally demethylated regions” are now referred to as “DME maternal regions”. The previous “ROS1-_independent, biallelically-demethylated regions” are now referred to as “_ROS1 paternal, DME maternal regions”. These changes provide greater clarity and also emphasize the role of DME at regions that are paternally hypermethylated in ros1. We have added Table 1 to summarize the DMR classes of interest.

      MINOR COMMENTS

      1. The sRNA results in Figure 2B are difficult to interpret because they do not reveal anything about the number of TEs that have siRNAs overlapping them or their flanks. While the magnitude of some of the highest endosperm sRNA peaks is higher than the embryo peaks, that could be explained by a small number of TEs with large numbers of sRNAs. To make this result more interpretable, we also need some information about how many TEs have a significant number of sRNAs associated with them in endosperm and embryo in each region (e.g., middle, 5', 3', and flanks of TEs). What a "significant number of sRNAs" is would be up to the authors to decide based on the distribution of sRNA counts they observe for TEs. Perhaps the top quartile of TEs? Combined with the same analysis done in parallel with non-ROS1 target TEs, this would reveal whether there is any evidence for ROS1 counteracting sRNA-driven methylation spread from TEs.

      Response: Thank you for the suggestion. We now present these data and the data for individual TEs underlying the metaplots in Supplemental Figure 7. As suggested by the reviewer, ROS1 TEs do not have uniformly higher levels of sRNA in their flanks in the endosperm compared to the embryo. We have modified our interpretations accordingly.

      1. The statement "we are likely underestimating the true degree of differential methylation among genotypes" should be validated and partially quantified using a methylation metaplot like Figure 2A, but substitute DMRs for TEs. Related to that, Figure 1B needs an indicator of scale in bp.

      Response: We have now included a methylation metaplot over ros1-3 hyperDMRs and ros1-7 hyperDMRs as Supplemental Figure 3 These plots show that indeed there is additional hypermethylation in DMR-proximal regions. We have added a scale bar to Figure 1B and other browser examples in the paper.

      1. The statement "Over half of ROS1 target regions identified in the ros1-3 mutant endosperm were within 1 kb or intersecting a TE (Fig. 1D)" is hard to interpret without some kind of ROS1 non-target regions or whole-genome control comparison. How different are the numbers in Fig. 1D from a random expectation?

      Response: We have now included a control for random regions in Figure 1E. We define these as regions where there was sufficient methylation data coverage and a low enough methylation level in wild-type to detect hypermethylation if it existed.

      1. The sentence at line 262 is confusing. Is the comparison between dme mutant and ros1 mutant or between different types of regions? And it appears that the comparison value is missing in the "3-5% CG methylation gain..." e.g., "3-5% CG methylation vs 10-20%" or something like that.

      Response: This section has been re-written as we now focus on allele-specific dme endosperm methylation data for our comparisons.

      1. The dme mutant data in Figure 5C appear to be key to the model in Figure 7. The relative impact of the dme mutant in the two types of regions should be quantified.

      Response: Thank you for this comment. To further probe our model that DME prevents hypermethylation of the maternal allele at regions where ROS1 is preventing hypermethylation of the paternal allele, we turned to the allele-specific bisulfite-sequencing data published in Ibarra et al 2012 (see also response to reviewer #1). Using these data, we show that when the loss-of-function allele dme-2 is inherited maternally, ROS1 paternal, DME maternal regions (previous referred to as ROS1-_dependent, biallelically-demethylated regions) are CG hypermethylated on the maternal allele (Figure 6D). Thus, these results both replicate the observations made with the Hsieh et al 2009 data, and provide additional evidence that _DME prevents maternal allele hypermethylation at regions were ROS1 is preventing paternal allele hypermethylation. These results have replaced the Hsieh et al 2009 results in Figure 6, and we have moved the analysis of Hsieh et al 2009 data to Supplemental Figure 15.

      1. Looks like sRNA methods are missing.

      Response: Thank you for identifying this. We previously included the reference for the analyzed dataset we used and the method for plotting under an unclear section header. These methods are now in the section “Analysis of average methylation and 24-nt sRNA patterns for features of interest”, and we have added additional reference to the specific dataset we used.

      1. Supplemental Figure 1 is hard to interpret since it only list gene IDs, not gene names.

      Response: As suggested, we have added gene names to this figure.

      The last comments are suggestions for increasing the impact of this study:

      1. Figure 2A and 3B suggest that ROS1 target TEs show demethylation in their flanks but not in the TE themselves. This is an interesting result. If it is true, more DMRs would be expected in the ROS1 target flanks than in the ROS1 target TEs. Reporting how many ROS1 target TEs have DMRs in them and what proportion have DMRs in their flanking 1-Kb regions would answer this question. Given the significance of this result, it also deserves a bit more context: Is the magnitude of increased methylation flanking TEs in ros1 mutant endosperm different than in ros1 mutant leaves or other tissue? Does methylation in TE flanks behave the way in dme mutant endosperm?

      Response: We define “ROS1 target TEs” (now referred to more simply as ROS1 TEs) as TEs within 1kb or intersecting a ros1-3 hyperDMR. Consistent with your interpretation, 80% of the TEs in this category do not have a DMR overlapping them, instead they have a TE within 1kb. We now mention this in the text on line 150.

      The total level of DNA methylation at ROS1 TEs is lower in the endosperm than in leaf, as DNA methylation levels are overall lower in endosperm than in leaf. The magnitude of increased methylation flanking TEs in ros1 mutant endosperm is not different between the two tissues. This is observable in Supplemental Fig. 5 in the revised version of the paper, and we report this result in the revised text. In the revision we also present methylation profiles of DME TEs in WT and ros1 endosperm (Fig. 7B-D). DME TEs are hypomethylated in both the body and flanks in WT and ros1.

      1. The idea of biallelic demethylation has been theoretically suggested in maize to explain weak overlap between endosperm DMRs and imprinting (Gent et al 2022). If that were true in Arabidopsis, then ROS1 target, biallelicly demethylated loci would be less likely to have imprinted expression than maternally demethylated loci. This prediction could be tested using available data in Arabidopsis.

      Response: Indeed, as you hypothesize, there are no known imprinted genes (Pignatta et al 2014) associated with biallelically-demethylated, ROS1-dependent regions (now referred to as ROS1 paternal, DME maternal regions). Expectedly, there are imprinted genes associated with maternally-demethylated regions (now referred to as DME regions). 23 imprinted genes identified in the Pignatta et al 2014 study are within 1 kb or intersecting a DME region. This is discussed on lines 364-374.

      1. There is currently no evidence for biological significance of biallelicly demethylated loci. Knowing where they are in the genome might give some hints. A figure like Fig. 1D but specifically showing the biallelicly demethylated DMRs would be valuable.

      Response: This is now included in Figure 7A.

      1. It is hard to make the comparisons between genotypes and parental genomes in Figure 6 and know what they mean. Maybe a different way of displaying the data would help. Or maybe even a different labeling system could make it a little more accessible.

      Response: We have revised this figure (now Fig. 8) in the following ways, which we believe address your comments and clarify the main conclusions:

      Figure 8C is now a boxplot comparing methylation of the paternal allele of ROS1 paternal, DME maternal regions (previously referred to as biallelically-demethylated, ROS1-dependent regions) across endosperm ROS1 genotypes. This plot shows increased methylation of paternal alleles when the paternal parent is a ros1 mutant, regardless of whether the resultant F1 endosperm is homozygous or heterozygous for ros1 (columns 3, 4, 6).

      Figure 8B remains as a scatterplot, where we can observe significant correlation between individual ROS1 paternal, DME maternal regions in homozygous ros1 endosperm and heterozygous ros1/+ endosperm. Note that paternal allele methylation is higher in homozygous ros1 endosperm for most regions.

      Reviewer #2 (Significance):

      Demethylation of the maternal genome in endosperm has been the subject of much research because it can result in genomic imprinting of gene expression. The enzymes responsible, DNA glycosylases/lyases, also demethylate DNA in other cell types as well, where DNA methylation is not confined to one parental genome (biallelic or biparental as opposed to uniparental demethylation). To the best of my knowledge, the extent or even existence of biallelelic demethylation in endosperm has not been studied until now (except for a superficial look in a bioRxiv preprint, https://www.biorxiv.org/content/10.1101/2024.07.31.606038v1). Hemenway and Gehring have carried out a thoughtful and detailed analysis of the topic in Arabidopsis at least as far as it depends on the DNA glycosylase ROS1.

      A limitation is that the study design would miss biallelic demethylation by any of the other three DNA glycosylases in Arabidopsis. A second limitation is that there is no clear biological significance, just some conjecture about evolution. Nonetheless, given the novelty of the topic, biological significance may follow.

      The audience for biallelic DNA demethylation in Arabidopsis endosperm is certainly in the "specialized" category, but its relevance to the larger topic of gene regulation in endosperm will attract a larger audience.

      Response: With regard to the other demethylases, note that we also profiled methylation in ros1 dml2 dml3 triple mutant endosperm. We did not find evidence for many DMRs that were present in the triple mutant that were not present in the ros1 single mutant. We do not rule out a function for DML2 or DML3 in the endosperm, but this is not observed at the level of bulk endosperm.

      The reviewer is correct that we have shown a molecular phenotype (paternal allele hypermethylation) and not a developmental or morphological phenotype. A function that occurs in one parent but not the other is, to us, exciting. Our thoughts about how this finding might relate to imprinting are indeed speculative, but not wildly so.

      Reviewer #3 (Evidence, reproducibility and clarity):

      DNA demethylases play a key role in DNA methylation patterning during flowering plant reproduction. The demethylase DME, in particular, is critical for proper endosperm development. While the function of DME in endosperm development has been explored, the contributions of the other demethylases in the same family, ROS1, DML2 and DML3 in Arabidopsis, have not yet been investigated. In vegetative tissues, ROS1 prevents hypermethylation of some loci. In this work, Hemenway and Gehring explore whether ROS1, DML2 and DML3 also affect DNA methylation patterns in endosperm. Using EM-seq of sorted endosperm nuclei, they show that loss of ROS1 indeed causes hypermethylation of a number of loci, particularly the flanks of methylated transposons, while loss of DML2 and DML3 has minimal additional effect. By obtaining allele-specific EM-seq data through crosses of Col and C24, the authors show that ros1 endosperm hypermethylation is mostly restricted to the paternal allele. The authors propose that at some sites, ROS1 helps bring down paternal methylation levels to match maternal methylation levels, which are typically reduced in endosperm due to DME activity in the female gametophyte prior to fertilization. In a ros1 mutant with paternal hypermethylation, these sites become differentially methylated on the maternal and paternal alleles, resembling imprinted loci. This work convincingly establishes a function for ROS1 in DNA methylation patterning in endosperm. However, I struggled with the clarity of the writing and reasoning in a few places, and would suggest clarification of a few points and additional analyses below.

      Response: Thank you for your thoughtful review of our paper. Your questions and suggestions have been invaluable in revising the work.

      I think making a few simple changes to streamline nomenclature would improve readability. For example, in the section starting on line 129, the same set of genomic features are called ROS1 target-proximal TEs, TEs that are near a ROS1 target region, and ROS1 target-associated TE regions. Also for example in line 254 "regions that are maternally-demethylated in wild-type endosperm, and are not dependent on ROS1 for proper demethylation" - are these the same as the "ROS1-independent, maternally-demethylated" regions in Fig. 5a? Given how complex these terms are, being consistent throughout the manuscript really helps the reader.

      Response: We edited the text and figures so that only one descriptive name is used for each DMR class or region throughout the paper. Thank you for this feedback; these edits have made the paper much clearer.

      Is there any notable effect of ros1 on gene expression in endosperm? Endosperm is a terminal tissue, so maintaining DNA methylation boundaries as ROS1 does in vegetative tissues seems less important. It begs the question of why ROS1 is doing this in endosperm, is it just because it's there, or is there an endosperm-specific function? Exploring effects on imprinting would be particularly interesting (does loss of ROS1 'create' imprinted loci at these newly asymmetrically methylated sites?) but probably beyond the scope of the present work.

      Response: We agree, the question of the functional consequence of ROS1 activity in the endosperm is something we are keen to address in future work. We performed RNA-seq on wild-type and ros1 3C and 6C endosperm nuclei, but these data were unfortunately not of high enough quality to include in the manuscript. We are in particular interested in this question you have proposed – if loss of ROS1 can ‘create’ imprinted loci. We are planning to address this both using a molecular, RNA-sequencing approach as well as an evolutionary comparative approach. This is an important and exciting future direction.

      Is DME expressed in sperm, or is expression of DME affected in ros1 sperm or endosperm? One other explanation for ros1 hypermethylation occurring primarily on the paternal allele is that, potentially, DME can substitute for ROS1 in the central cell where DME is already very active, but not in sperm cells. Related, how well expressed is ROS1 vs. DME in sperm cells?

      Response: This is an important series of questions, and something we are very interested in as well. Studies of Arabidopsis pollen have shown that both ROS1 and DME, while they prevent some hypermethylation in sperm, are more active in the vegetative nucleus of pollen than in sperm. ROS1 is expressed at a low level in the microspore and bicellular pollen and DME is expressed at a low level throughout pollen development. We have included Supplemental Fig. 17 with available expression data to make this point in the paper. Likely, any effects of loss of ROS1 or DME on sperm DNA methylation are inherited from precursor cells (Ibarra et al 2012, Calarco et al 2012, Khouider et al 2021). Your proposal that perhaps DME can sub in for ROS1 in the central cell but not in sperm is intriguing. Unfortunately there’s not enough data in the central cell to convincingly address this at this time.

      To investigate the relationship between DME and ROS1 in the male germline, we used the bisulfite-sequencing data generated in sperm cells in Khouider et al 2021. We calculated average DNA methylation levels in dme/+, ros1, dme/+;ros1, and wild-type Col-0 sperm cells at ROS1 paternal, DME maternal regions, shown in Supplemental Fig. 18A. We observed little increase in mCG methylation in dme/+ sperm relative to wild-type Col-0 sperm. This is consistent with your proposed model that DME is unable to demethylate these regions outside of the female germline. As expected, there is increased mCG in ROS1 paternal, DME maternal regions in ros1-3 mutant sperm relative to wild-type Col-0 sperm. DME maternal regions are highly methylated in wild-type Col-0 sperm.

      Fig 2b shows that ROS1 target-associated TEs are enriched for sRNAs in endosperm relative to embryo, whereas the reverse is true for non-ROS1-assoc TEs. Since TEs are not always well annotated and some may be missing from this analysis, what about trying the reverse analysis - are regions enriched for 24nt sRNAs in endosperm significantly hypermethylated in ros1 endosperm? All regions or only some?

      Response: We performed an analysis to address your inquiry and observed a low magnitude increase in DNA methylation in ros1 mutant endosperm at regions defined by Erdmann et al as more sRNA producing in the endosperm relative to the embryo (endosperm DSRs). Endosperm DSRs are generally lowly methylated in wild-type endosperm, as was observed originally in Erdmann et al 2017. Small increases in DNA methylation are observed at endosperm DSRs in all sequence contexts in ros1 endosperm. Overall, this is consistent with ROS1 targets being a subset of sRNA-producing regions in the endosperm. This analysis is now included in Supplemental Fig. 7C.

      What is the relationship between previously-defined DME targets and ROS1 targets identified in this paper? DME tends to target small euchromatic TE bodies, whereas Fig. 3 suggests that ROS1 helps prevent methylation spreading on the outer edges of the TEs, rather than in the TE body. Do all DME targets tend to be adjacent to or flanked by ROS1 target sites? Or are the TEs affected by DME (in body) and by ROS1 (at edges) largely nonoverlapping? Fig. 5a suggests that the ROS1-dependent, biallelically-demethylated sites are both DME and ROS1 targets, but how often do these really appear to overlap? More than by chance?

      Response: We have sought to address your comments through a series of analyses that we have included in Fig. 7 and Supplemental Fig. 16. We found that ROS1 paternal, DME maternal regions (formerly referred to as ROS1-dependent, biallelically-demethylated regions) and DME maternal regions (formerly referred to as ROS1-independent, maternally-demethylated regions) do not occupy the same genomic regions. However, we do observe some evidence for ROS1 activity in flanking regions of DME targets (Fig. 6A, Fig. 7B-D). To look at TEs specifically, as you suggest, we first identified TEs that were within 1kb or intersecting a DME maternal region. Based on our characterization of these regions, we assume these to be DME-targeted TEs. We then performed ends analysis to see if there was evidence of ROS1 activity at the ends of these TEs. Indeed, at a global level there is a slight hypermethylation of the paternal allele in a ros1 mutant at the end of these DME TEs (Fig. 7B). To better visualize how many DME TEs are showing ROS1 activity at their ends, we then plotted the difference between the median ros1-3 methylation and median Col-0 values in the non-allelic endosperm for each TE in a clustered heatmap (Fig. 7C). The parent-of-origin data does not have enough coverage for clustering in this way, so we used the non-allelic data. A small fraction of “DME TEs” gain methylation in the ros1 mutant endosperm relative to wild-type (Fig. 7C-D).

      Are the TEs whose boundaries are demethylated by ROS1 more likely to be expressed in vegetative or endosperm tissues than TEs not affected by loss of ROS1? Expressed TEs likely produce more sRNAs, which would increase RdDM in a way that might need to be more actively countered by ROS1 than transcriptionally silent or evolutionarily older TEs.

      Response: This is an interesting line of inquiry, although perhaps out of the scope of our present study. It has been shown that TEs demethylated by ROS1 are targeted by the RdDM pathway in Arabidopsis vegetative tissue (Tang et al 2016). Using data from Erdmann et al 2017, we looked at 24 nt sRNAs at ROS1-TEs in the endosperm and embryo (Supplemental Fig. 7). sRNA production at ROS1 TE-flanking regions is observed in both embryo and endosperm, but clearly not all ROS1 TEs produce 24 nt sRNA production in the seed. Future work comparing sRNA profiles in a ros1 mutant to those of wild-type could inform our understanding of TE spreading in a ros1 mutant, as would a comprehensive analysis of TE expression, again in both a ros1 mutant and in wild-type. It’s unclear to us if the endosperm would be the most informative or useful tissue to perform such analyses in.

      Fig6 - as noted in the text, one way to test whether demethylation by ROS1 occurs before or after fertilization is to provide functional ROS1 through only one parent via reciprocal WT x ros-1 crosses, so that the endosperm always has ROS1 but either sperm or central cell does not, and see if this can rescue the paternal hypermethylation. If ROS1 acts prior to fertilization, then paternal ROS1 will rescue ros1 hypermethylation, but maternal ROS1 won't. If after fertilization, then either maternally or paternally supplied ROS1 will rescue the hypermethylation phenotype (assuming both are well expressed). Thus, to distinguish the two, it is sufficient to test whether maternally supplied ROS1 in an otherwise mutant background can rescue the hypermethylation phenotype, which is what is shown in Fig. 6. However, I think it's also important to show that paternally supplied ROS1 can also rescue the hypermethylation phenotype, which is not currently shown. The plots showing no effect on maternal mCG aren't as informative, since maternal methylation levels are mostly unaffected by ros1 anyway. Instead of comparing pairs of samples in a scatterplot, it might be clearer to show paternal mCG across all four comparisons (WT x WT, WT x ros1, ros1 x WT, and ros1 x ros1) side by side in a heatmap, using clustering to group similar behavior.

      Response: We have revised this figure, now Fig. 8, in the following ways, which we believe addresses your comments and clarify the main conclusions (see same response to reviewer 2 for point 14):

      Figure 8B remains as a scatterplot, where we observe significant correlation between individual ROS1 paternal, DME maternal regions in homozygous ros1 endosperm and heterozygous ros1/+ endosperm. Note that paternal allele methylation is higher in homozygous ros1 endosperm for most regions.

      Figure 8C is now a boxplot comparing methylation of the paternal allele of ROS1 paternal, DME maternal regions (previously referred to as biallelically-demethylated, ROS1-dependent regions) across endosperm ROS1 genotypes. This plot shows increased methylation of paternal alleles when the paternal parent is a ros1 mutant, regardless of whether the resultant F1 endosperm is homozygous or heterozygous for ros1 (columns 3, 4, 6).

      I would also suggest including a little more information in the main plots rather than only in the figure legends. For example, in Fig 2 including a label of 'ROS1-associated TE' for the two plots on the left, and 'TEs not associated with ROS1' on the right. Or for example in Fig. 3a indicating 'ros1-3 CG hyperDMRs' somewhere on the plot. This would just help make the figures easier to read at a glance. Please add common gene names to figures, instead just the ATG gene ID (Fig. S1a).

      Response: Thank you for this feedback, we have made the suggested edits and additional edits of a similar nature.

      Minor:<br /> - Fig. 1E is referenced in the text before Fig. 1D<br /> - Fig. S4 and S5 - there are more lines in the plot than the 6 genotypes listed in the legend, do these represent different replicates? If so that should be noted in the legend<br /> - Fig. 1B has no color legend for the different methylation sequence contexts (looks like same as 1A,C but should indicate either in plot or legend)<br /> - Line 42 should be "correspond to TE ends"<br /> - Line 93 "Based on previous studies..." should have references to those studies<br /> - When referring to the protein (rather than the genetic locus or mutant), ROS1 should not be italicized - for example line 130<br /> - Line 150 "we conclude that the loss"<br /> - Should add a y=x line to scatterplots, like those in Fig. 6<br /> - In fig. 1d, it's hard to evaluate the significance of the overlap of ROS1 targets with genes and TEs. Comparing these numbers to a control where the ROS1 targets have been randomly shuffled would help.

      Response: We have made edits and additions where requested.

      Reviewer #3 (Significance):

      In this work, Hemenway and Gehring explore whether ROS1, DML2 and DML3 also affect DNA methylation patterns in endosperm. Using EM-seq of sorted endosperm nuclei, they show that loss of ROS1 indeed causes hypermethylation of a number of loci, particularly the flanks of methylated transposons, while loss of DML2 and DML3 has minimal additional effect. By obtaining allele-specific EM-seq data through crosses of Col and C24, the authors show that ros1 endosperm hypermethylation is mostly restricted to the paternal allele. The authors propose that at some sites, ROS1 helps bring down paternal methylation levels to match maternal methylation levels, which are typically reduced in endosperm due to DME activity in the female gametophyte prior to fertilization. In a ros1 mutant with paternal hypermethylation, these sites become differentially methylated on the maternal and paternal alleles, resembling imprinted loci. This work convincingly establishes a function for ROS1 in DNA methylation patterning in endosperm. However, I struggled with the clarity of the writing and reasoning in a few places, and would suggest clarification of a few points and additional analyses.

      Response: Thank you for your comments. We have worked on streamlining the text and analysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      DNA demethylases play a key role in DNA methylation patterning during flowering plant reproduction. The demethylase DME, in particular, is critical for proper endosperm development. While the function of DME in endosperm development has been explored, the contributions of the other demethylases in the same family, ROS1, DML2 and DML3 in Arabidopsis, have not yet been investigated. In vegetative tissues, ROS1 prevents hypermethylation of some loci. In this work, Hemenway and Gehring explore whether ROS1, DML2 and DML3 also affect DNA methylation patterns in endosperm. Using EM-seq of sorted endosperm nuclei, they show that loss of ROS1 indeed causes hypermethylation of a number of loci, particularly the flanks of methylated transposons, while loss of DML2 and DML3 has minimal additional effect. By obtaining allele-specific EM-seq data through crosses of Col and C24, the authors show that ros1 endosperm hypermethylation is mostly restricted to the paternal allele. The authors propose that at some sites, ROS1 helps bring down paternal methylation levels to match maternal methylation levels, which are typically reduced in endosperm due to DME activity in the female gametophyte prior to fertilization. In a ros1 mutant with paternal hypermethylation, these sites become differentially methylated on the maternal and paternal alleles, resembling imprinted loci. This work convincingly establishes a function for ROS1 in DNA methylation patterning in endosperm. However, I struggled with the clarity of the writing and reasoning in a few places, and would suggest clarification of a few points and additional analyses below.

      I think making a few simple changes to streamline nomenclature would improve readability. For example, in the section starting on line 129, the same set of genomic features are called ROS1 target-proximal TEs, TEs that are near a ROS1 target region, and ROS1 target-associated TE regions. Also for example in line 254 "regions that are maternally-demethylated in wild-type endosperm, and are not dependent on ROS1 for proper demethylation" - are these the same as the "ROS1-independent, maternally-demethylated" regions in Fig. 5a? Given how complex these terms are, being consistent throughout the manuscript really helps the reader.

      Is there any notable effect of ros1 on gene expression in endosperm? Endosperm is a terminal tissue, so maintaining DNA methylation boundaries as ROS1 does in vegetative tissues seems less important. It begs the question of why ROS1 is doing this in endosperm, is it just because it's there, or is there an endosperm-specific function? Exploring effects on imprinting would be particularly interesting (does loss of ROS1 'create' imprinted loci at these newly asymmetrically methylated sites?) but probably beyond the scope of the present work.

      Is DME expressed in sperm, or is expression of DME affected in ros1 sperm or endosperm? One other explanation for ros1 hypermethylation occurring primarily on the paternal allele is that, potentially, DME can substitute for ROS1 in the central cell where DME is already very active, but not in sperm cells. Related, how well expressed is ROS1 vs. DME in sperm cells?

      Fig 2b shows that ROS1 target-associated TEs are enriched for sRNAs in endosperm relative to embryo, whereas the reverse is true for non-ROS1-assoc TEs. Since TEs are not always well annotated and some may be missing from this analysis, what about trying the reverse analysis - are regions enriched for 24nt sRNAs in endosperm significantly hypermethylated in ros1 endosperm? All regions or only some?

      What is the relationship between previously-defined DME targets and ROS1 targets identified in this paper? DME tends to target small euchromatic TE bodies, whereas Fig. 3 suggests that ROS1 helps prevent methylation spreading on the outer edges of the TEs, rather than in the TE body. Do all DME targets tend to be adjacent to or flanked by ROS1 target sites? Or are the TEs affected by DME (in body) and by ROS1 (at edges) largely nonoverlapping? Fig. 5a suggests that the ROS1-dependent, biallelically-demethylated sites are both DME and ROS1 targets, but how often do these really appear to overlap? More than by chance?

      Are the TEs whose boundaries are demethylated by ROS1 more likely to be expressed in vegetative or endosperm tissues than TEs not affected by loss of ROS1? Expressed TEs likely produce more sRNAs, which would increase RdDM in a way that might need to be more actively countered by ROS1 than transcriptionally silent or evolutionarily older TEs.

      Fig6 - as noted in the text, one way to test whether demethylation by ROS1 occurs before or after fertilization is to provide functional ROS1 through only one parent via reciprocal WT x ros-1 crosses, so that the endosperm always has ROS1 but either sperm or central cell does not, and see if this can rescue the paternal hypermethylation. If ROS1 acts prior to fertilization, then paternal ROS1 will rescue ros1 hypermethylation, but maternal ROS1 won't. If after fertilization, then either maternally or paternally supplied ROS1 will rescue the hypermethylation phenotype (assuming both are well expressed). Thus, to distinguish the two, it is sufficient to test whether maternally supplied ROS1 in an otherwise mutant background can rescue the hypermethylation phenotype, which is what is shown in Fig. 6. However, I think it's also important to show that paternally supplied ROS1 can also rescue the hypermethylation phenotype, which is not currently shown. The plots showing no effect on maternal mCG aren't as informative, since maternal methylation levels are mostly unaffected by ros1 anyway. Instead of comparing pairs of samples in a scatterplot, it might be clearer to show paternal mCG across all four comparisons (WT x WT, WT x ros1, ros1 x WT, and ros1 x ros1) side by side in a heatmap, using clustering to group similar behavior.

      I would also suggest including a little more information in the main plots rather than only in the figure legends. For example, in Fig 2 including a label of 'ROS1-associated TE' for the two plots on the left, and 'TEs not associated with ROS1' on the right. Or for example in Fig. 3a indicating 'ros1-3 CG hyperDMRs' somewhere on the plot. This would just help make the figures easier to read at a glance. Please add common gene names to figures, instead just the ATG gene ID (Fig. S1a).

      Minor:

      • Fig. 1E is referenced in the text before Fig. 1D
      • Fig. S4 and S5 - there are more lines in the plot than the 6 genotypes listed in the legend, do these represent different replicates? If so that should be noted in the legend
      • Fig. 1B has no color legend for the different methylation sequence contexts (looks like same as 1A,C but should indicate either in plot or legend)
      • Line 42 should be "correspond to TE ends"
      • Line 93 "Based on previous studies..." should have references to those studies
      • When referring to the protein (rather than the genetic locus or mutant), ROS1 should not be italicized - for example line 130
      • Line 150 "we conclude that the loss"
      • Should add a y=x line to scatterplots, like those in Fig. 6
      • In fig. 1d, it's hard to evaluate the significance of the overlap of ROS1 targets with genes and TEs. Comparing these numbers to a control where the ROS1 targets have been randomly shuffled would help.

      Significance

      In this work, Hemenway and Gehring explore whether ROS1, DML2 and DML3 also affect DNA methylation patterns in endosperm. Using EM-seq of sorted endosperm nuclei, they show that loss of ROS1 indeed causes hypermethylation of a number of loci, particularly the flanks of methylated transposons, while loss of DML2 and DML3 has minimal additional effect. By obtaining allele-specific EM-seq data through crosses of Col and C24, the authors show that ros1 endosperm hypermethylation is mostly restricted to the paternal allele. The authors propose that at some sites, ROS1 helps bring down paternal methylation levels to match maternal methylation levels, which are typically reduced in endosperm due to DME activity in the female gametophyte prior to fertilization. In a ros1 mutant with paternal hypermethylation, these sites become differentially methylated on the maternal and paternal alleles, resembling imprinted loci. This work convincingly establishes a function for ROS1 in DNA methylation patterning in endosperm. However, I struggled with the clarity of the writing and reasoning in a few places, and would suggest clarification of a few points and additional analyses

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Hemenway and Gehring present evidence that the paternal genome in Arabidopsis endosperm is demethylated at several hundred loci by the DNA glycosylase/lyase ROS1. The evidence is primarily based on analysis of DNA methylation of ros1 mutants and of hybrid crosses where each parental genome can be differentiated by SNPs. I have some comments/questions/concerns, two of them potentially serious, but I think Hemenway and Gehring can address them through additional analyses of data that they already have available and a bit of clarification in writing.

      Major comments:

      1. Could the excess methylation in ros1-3 relative to ros1-7 shown in Figures 1A and 1C be explained by a second mutation in the ros1-3 background that elevates methylation at some loci? Any mutation that increased RdDM at these loci, for example could have this effect. This could confound the identification and interpretation of biallelicly demethylated loci.
      2. It appears that the main focus of the manuscript, the existence of loci that are paternally demethylated by ROS1, is supported by a set of 274 DMRs. This is a small number relative to the size of the genome and raises suspicions of rare false positives. Even the most stringent p-values that DMR-finding tools report do not guarantee that the DMRs are actually reproducible in an independent experiment. Demonstrating overlap between these 274 DMRs and an independently defined set using a different WT control and different ros1 allele would suffice to remove this concern. It appears that authors already have the needed raw data with ros1-1 and ros1-7 alleles.
      3. Because of the multiple sets of DMRs identified and used throughout the paper, it is hard to follow which one is which. There are DMRs defined solely by one sequence context, DMRs defined by all three contexts merged, DMRs defined by comparisons between maternal and paternal methylation in endosperm, DMRs defined by comparison between mutants and wildtype, and more. These need clearer descriptions of which sets are being referred to throughout the main text and in figure legends. A table summarizing them might help (not in the supplement). Use of consistent and precisely defined terms would help. Stating the number of DMRs along with the name for each set would help a lot, even though this would make for some redundancy. (The number of DMRs in each set not only helps with interpretation but also act as a sort of ID). The reason I put this as a major concern is because the text and figures are difficult to understand, and it is currently hard to evaluate both the results and the authors' conclusions from those results.

      Minor comments

      1. The sRNA results in Figure 2B are difficult to interpret because they do not reveal anything about the number of TEs that have siRNAs overlapping them or their flanks. While the magnitude of some of the highest endosperm sRNA peaks is higher than the embryo peaks, that could be explained by a small number of TEs with large numbers of sRNAs. To make this result more interpretable, we also need some information about how many TEs have a significant number of sRNAs associated with them in endosperm and embryo in each region (e.g., middle, 5', 3', and flanks of TEs). What a "significant number of sRNAs" is would be up to the authors to decide based on the distribution of sRNA counts they observe for TEs. Perhaps the top quartile of TEs? Combined with the same analysis done in parallel with non-ROS1 target TEs, this would reveal whether there is any evidence for ROS1 counteracting sRNA-driven methylation spread from TEs.
      2. The statement "we are likely underestimating the true degree of differential methylation among genotypes" should be validated and partially quantified using a methylation metaplot like Figure 2A, but substitute DMRs for TEs. Related to that, Figure 1B needs an indicator of scale in bp.
      3. The statement "Over half of ROS1 target regions identified in the ros1-3 mutant endosperm were within 1 kb or intersecting a TE (Fig. 1D)" is hard to interpret without some kind of ROS1 non-target regions or whole-genome control comparison. How different are the numbers in Fig. 1D from a random expectation?
      4. The sentence at line 262 is confusing. Is the comparison between dme mutant and ros1 mutant or between different types of regions? And it appears that the comparison value is missing in the "3-5% CG methylation gain..." e.g., "3-5% CG methylation vs 10-20%" or something like that.
      5. The dme mutant data in Figure 5C appear to be key to the model in Figure 7. The relative impact of the dme mutant in the two types of regions should be quantified.
      6. Looks like sRNA methods are missing.
      7. Supplemental Figure 1 is hard to interpret since it only list gene IDs, not gene names.

      The last comments are suggestions for increasing the impact of this study:<br /> 11. Figure 2A and 3B suggest that ROS1 target TEs show demethylation in their flanks but not in the TE themselves. This is an interesting result. If it is true, more DMRs would be expected in the ROS1 target flanks than in the ROS1 target TEs. Reporting how many ROS1 target TEs have DMRs in them and what proportion have DMRs in their flanking 1-Kb regions would answer this question. Given the significance of this result, it also deserves a bit more context: Is the magnitude of increased methylation flanking TEs in ros1 mutant endosperm different than in ros1 mutant leaves or other tissue? Does methylation in TE flanks behave the way in dme mutant endosperm?<br /> 12. The idea of biallelic demethylation has been theoretically suggested in maize to explain weak overlap between endosperm DMRs and imprinting (Gent et al 2022). If that were true in Arabidopsis, then ROS1 target, biallelicly demethylated loci would be less likely to have imprinted expression than maternally demethylated loci. This prediction could be tested using available data in Arabidopsis.<br /> 13. There is currently no evidence for biological significance of biallelicly demethylated loci. Knowing where they are in the genome might give some hints. A figure like Fig. 1D but specifically showing the biallelicly demethylated DMRs would be valuable.<br /> 14. It is hard to make the comparisons between genotypes and parental genomes in Figure 6 and know what they mean. Maybe a different way of displaying the data would help. Or maybe even a different labeling system could make it a little more accessible.

      Significance

      Demethylation of the maternal genome in endosperm has been the subject of much research because it can result in genomic imprinting of gene expression. The enzymes responsible, DNA glycosylases/lyases, also demethylate DNA in other cell types as well, where DNA methylation is not confined to one parental genome (biallelic or biparental as opposed to uniparental demethylation). To the best of my knowledge, the extent or even existence of biallelelic demethylation in endosperm has not been studied until now (except for a superficial look in a bioRxiv preprint, https://www.biorxiv.org/content/10.1101/2024.07.31.606038v1). Hemenway and Gehring have carried out a thoughtful and detailed analysis of the topic in Arabidopsis at least as far as it depends on the DNA glycosylase ROS1.

      A limitation is that the study design would miss biallelic demethylation by any of the other three DNA glycosylases in Arabidopsis. A second limitation is that there is no clear biological significance, just some conjecture about evolution. Nonetheless, given the novelty of the topic, biological significance may follow.

      The audience for biallelic DNA demethylation in Arabidopsis endosperm is certainly in the "specialized" category, but its relevance to the larger topic of gene regulation in endosperm will attract a larger audience.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In Arabidopsis, DNA demethylation is catalyzed by a family of DNA glycosylases including DME, ROS1, DML2, and DML3. DME activity in the central cell leads to the hypomethylation of maternal alleles in endosperm. While ROS1, DML2, and DML3 function in vegetative tissues to prevent spreading DNA methylation from TE boundaries, their function in the endosperm was unclear.

      Using whole genome methylome analysis, the authors showed that ROS1 prevents hypermethylation of paternal alleles in the endosperm thus promotes epigenetic symmetry between maternal and paternal genomes.<br /> The approach and experimental desighs are appropriate, and the key conclusions are adequately supported by the results.

      However, there is not sufficient evidence to support the claim that DME demethylates the maternal allele at ROS1-dependent biallelically-demethylated regions. To clarify the issue, the authors could analyze if there is an overlap between DMRs identified in ros1 endosperm and those identified in dme endosperm using published data. If there is any, the authors could show a genome browser example of DMR including dme data.

      Significance

      Endosperm is a tissue unique to flowering plants. Though it is an ephemeral tissue, the endosperm plays essential roles for seed development and germination. The endosperm is also the site genomic imprinting occurs, and it has a distinct epigenomic landscape. This work provides a new insight that ROS1 may antagonize imprinted gene expression in the endosperm. However, it was not shown whether imprinted gene expression is indeed affected in ros1, or whether the ros1 mutation has phenotypic consequences. These results would be useful to discuss the evolution and significance of genomic imprinting.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer comment: *“The authors did not clarify whether the observed protection to PTZ-induced convulsions after mild TBI is due to the reduced size of gap junctions and/or increased activity in hemichannels.” And “The super-resolution imaging only assesses Cx43 gap junction plaque size and density but not the non-junctional portion of Cx43.” *

      Response and planned revision: To determine whether seizure protection in Cx43 S368A mice is due to reduced gap junction plaque density or reduced hemichannel function, we will conduct solubility assays to assess the ratio of insoluble (junctional) to soluble (cytoplasmic/hemichannel) Cx43 in Cx43S368A and C57BL/6 control mice after TBI/sham (as in Fig. 2A-D currently only in C57BL/6 control mice). In parallel, we will perform EtBr uptake assays in acute brain slices from Cx43S368A and C57BL/6 control animals to assess hemichannel function.

      Additionally, we will include super-resolution images without background subtraction, which show diffuse staining indicative of soluble Cx43. Of note, even at super-resolution individual gap junctions or hemichannels cannot be resolved. They appear as diffuse signal (currently not visible in our super-resolution images due to image deconvolution and background substration performed to isolate Cx43 plaques). Super-resolution imaging was used to count Cx43 gap junction plaque densities and size. Cx43 gap junction plaques are dense accruals of Cx43 immunostaining reminiscent functional and closed gap junctions. Complimentary experiments measured soluble (cytoplasmic Cx43 and hemichannels) and insoluble Cx43 (gap junctions) using biochemistry (Fig. 2A-D).

      Reviewer comment: “The immunofluorescent images for Fig. 2E and Fig. 5 were not counterstained for astrocytes or cell membrane. How can the authors be sure that these are expressed by astrocytes and not other cells in the brain?”

      Response and planned revision: Cx43 is predominantly expressed in astrocytes, with expression levels 10–100 times higher than in brain endothelial cells (e.g., Zhang et al., 2014; Vanlandewijck et al., Nature, 2018). As shown in Supplementary Fig. 2, our immunohistochemistry data reveal no overlap between Cx43 and endothelial cell markers, confirming that our staining protocol does not detect Cx43 in endothelial cells. Instead, the apparent localization of Cx43 along blood vessels reflects expression in astrocytic endfeet, which closely ensheath the vasculature. To further support this conclusion, we will conduct quantitative co-localization analyses of Cx43 with markers for neurons, microglia, oligodendrocytes, and NG2 glia in both Cx43S368A and C57BL/6 control mice. Additionally, we will include plots generated from publicly available single-cell RNA sequencing datasets to show that Cx43 mRNA is highly enriched in astrocytes and present at much lower levels in endothelial cells of the brain vasculature.

      • *

      Reviewer comment about developmental contributions to the phenotype of Cx43 S368A animals.

      Response: We cannot exclude a potential developmental component to the observed seizure protection in Cx43S368A mice. We included discussion of this possibility in the revised manuscript.

      Reviewer comments indicative of a lack of clarity around rationale and intent of specific experiments.

      Response: We thoroughly revised the Results section to explicitly state the rationale and purpose of each experiment. For example:

      Reviewer comment: “The immunofluorescent images for Fig. 1D and E were taken at low resolution compared to the Cx43 puncta size. This does not allow accurate quantification of the Cx43 GJs or HCs.”

      Response: The purpose of this experiment was to assess the heterogeneity of Cx43 expression (both junctional and non-junctional portions) with spatial resolution across a larger brain area. Complementary experiments here are quantification of protein amounts using western blot (Fig. 1B), quantification of junctional versus non-junctional Cx43 using the solubility assay and quantification of Cx43 plaques using super-resolution imaging (Fig. 2).

      Reviewer comment: “TBI did not change Cx43 plaque size or density (Fig. 5). What was the rationale for examining the effects in the S368A mutant?”

      Response: We found an increase in phosphorylated Cx43 at ____S____368 after TBI and Cx43__S368A mutants are protected from seizures after administration of PTZ suggesting an important role for this specific Cx43 phosphorylation site in pathology. __We discussed in the manuscript that “in cardiovascular infection/disease has demonstrated maintenance of gap junction coupling (Gy et al., 2011; Padget et al., 2024) while reduced hemichannel opening probability was reported (Hirschhäuser et al., 2021) in Cx43S368A mice”, suggesting that the protective phenotype is likely due to modification of either Cx43 gap junctions or hemichannels. However, functional consequences on Cx43 biology upon phosphorylation at S368 or lack thereof in the Cx43S368A mutant remain unexplored in the brain. Cx43 plaque size and density are reflective of Cx43 gap junctions and was therefore examined in Cx43S368A mice to reveal potential mechanism by which this mouse mutant is protected from seizures (even in the absence of TBI).

      Reviewer comment: * “The IC50 for Tat-Gap19 for Cx43 HC is ~7 μM (Tocris). How can using it at 2 μM be effective?”*

      Response: We reviewed our lab records and confirmed that 2 μM was a typographical error. The actual concentration used was 200 μM. This is consistent with the dose-response literature for astrocytes (e.g., Walrave et al., Glia 2018; Abudara et al., Front. Cell. Neurosci. 2014). We now included these references in the manuscript.

      Reviewer comment: “Unclear whether mice in Fig. 4C received TBI.”

      Response: We clarified that these mice were naïve, i.e. not subjected to TBI or sham procedures. This is now explicitly stated in both the Methods and the Results.

      Reviewer comment: “CBX or Tat-Gap19 do not affect the phosphorylation state of Cx43.”

      Response: We clarified that we used CBX and Tat-Gap19 as established gap junction and hemichannel blockers, irrespective of phosphorylation state. We now noted that Tat-GAP19 is a Cx43 mimetic peptide to specifically block Cx43 hemichannels.

      Reviewer comment: “It is unclear whether the EtBr quantification in Fig. 3D is for S100β+ astrocytes.”

      Response: We clarified that the quantification in Fig. 3D was performed exclusively in S100β+ astrocytes. Although neurons may take up EtBr under inflammatory conditions, they do not express Cx43 (as will be shown in Fig. 1 and Supplementary Data).

      Reviewer comment: “I believe that the 'W.' in ref 'W. Chen et al., 2018' is unnecessary.”

      Response: We will use the journal citation style implemented by a reference manager in the final version of the manuscript.

      Reviewer request to include two references related to phosphorylation and hemichannel permeability and the role of gap junctional coupling in epilepsy.

      Response: The PNAS reference was added to the manuscript.

      That reduction in gap junctional communication is a relevant factor in epilepsy is discussed in the introduction where we also cite original literature of the authors of the proposed review article: “Many pathologies (Gajardo-Gómez et al., 2017; Masaki, 2015; Orellana et al., 2011; Sarrouilhe et al., 2017; Vis et al., 1998; Wang et al., 2018), including traumatic brain injury (TBI) (B. Chen et al., 2017; W. Chen et al., 2019; Wu et al., 2013; Xia et al., 2024) and acquired epilepsy (Bedner et al., 2015; Deshpande et al., 2017; Walrave et al., 2018) present with altered Cx43 regulation, and are often equated with GJ dysfunction.”

      We feel that citing the original manuscripts more accurately reflect the current knowledge around the role of Cx43 in the context of epilepsy and other pathologies. Reader’s access to the original literature also highlights the gaps in knowledge more precisely that this manuscript seeks to close.

      Reviewer comment: “I think the data of this manuscript is missing a control animal that would present all the compensation changes that occur during development that occur in mice carrying the mutated Cx43. Alternatively, a doable experiment would be the use of inducible KO/KI.”

      Response: Previous studies investigating the role of Cx43 in neuronal excitability have primarily used full or conditional knockout models, as described in our introduction. Interestingly, these studies report that global deletion of Cx43 increases seizure susceptibility. However, such models eliminate all Cx43-dependent functions—both junctional and non-junctional—making it difficult to pinpoint the specific mechanisms underlying the observed effects. They do not distinguish whether increased excitability results from loss of gap junction coupling, disruption of hemichannel function, or depletion of cytoplasmic Cx43 signaling. In contrast, our current study does not aim to eliminate Cx43, but instead employs a targeted approach to interrogate the functional significance of a regulatory phosphorylation site, S368. This site is dynamically phosphorylated following TBI and has been previously associated—albeit only through correlative data—with seizure activity and other neuropathologies. By isolating the contribution of this post-translational modification while preserving overall Cx43 expression, our study provides novel mechanistic insight into how phosphorylation modulates Cx43 function and astrocyte-mediated regulation of brain excitability.

      We appreciate the thoughtful suggestion to generate a conditional knock-in model to isolate developmental from acute effects of the Cx43 S368A mutation. However, the GJA1 gene locus is not amenable to this type of targeting (we explored this possibility with a . We also considered AAV-mediated CRISPR/dCas9 editing as an alternative, but current limitations in CNS transduction efficiency, promoter specificity, and guide RNA availability for precise point mutation insertion make this approach similarly unfeasible at this stage. Thus, while we acknowledge the developmental caveat (which we now discuss in the manuscript), the current manuscript provides novel and meaningful insight into the role of the Cx43S368 regulatory phosphorylation site in the context of astrocyte biology and seizure susceptibility and forms a strong foundation for future studies.

      Thank you again for the opportunity to revise and strengthen our manuscript. We believe these planned experiments and clarifications address the reviewers' concerns in a thorough and scientifically rigorous manner.