7,787 Matching Annotations
  1. Last 7 days
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Major comments: 1) The study focused on regulatory activity in a lung-derived cellular setting and was well executed. However, the degree that non-coding variation in lung elements, particularly alveolar basal epithelial cells, modeled by A549 cells, contributes genetic risk for COVID19 severity is unclear. Especially as non-coding variants in other contexts such as immune cells have been shown to be enriched for disease risk. To strengthen the choice for the A549 cellular context, the authors can assess enrichment for COVID19 severity heritability using stratified LD-score regression (PMID: 26414678) using A549/lung epithelium chromatin data (ATAC-seq, CHIP-seq) to check for enrichment polygenic signal or if the lung associated-risk is focused on a restricted set of genome-wide significant signals.

      The reviewer is correct that most analysis of non-coding variants to date has been in immune cells, as is the case for many GWAS studies. However, severe COVID-19 affects many systems, especially the lung alveolar epithelium, and so there is a pressing need for functional genomic studies that go beyond immune cells. We chose A549 due to its lung origin, experimental tractability, and availability of published datasets. While enrichment such as S-LDSC suggested by the reviewer would be a good indication to screen for cell types with enrichment in e.g. open chromatin, many of our STARR-seq hits were found in closed chromatin and so would have been missed by such an analysis. To further emphasize the importance of type II alveolar epithelial cells in severe COVID-19 progression, we added the following to the manuscript: * "Cell death and the innate immune response of type II alveolar epithelial cells, which also function as progenitors for type I epithelial cells, are the main driver of alveolar damage and acute respiratory distress syndrome in coronavirus infection (Bridges et al 2022 PMID 34404754; Qian et al 2013 PMID: 23418343)."*

      2) It would strengthen the manuscript to compare the results to prior analyses where overlap exists (eg PMID:36763080). Particularly it would be informative to address if nominated variants for signals have different variants operating in different cell types. Also, one prominent variant, rs17713054, that had previously been nominated to operate in lung through in silico predictions and CRISPR perturbations (PMID:34737427) appears to be non-significant in this STARR-seq analysis. Was a different variant nominated at this locus? Could the authors expand on methodological differences that could explain this difference?

      rs17713054 (chr3:45,818,159:G>A) was nominated by Downes et al (2021) based on in silico predictions. While the authors found rs17713054 resides in open chromatin and is a chromatin accessibility (ca)QTL, the variant did not validate in CRISPR perturbations. Deletion of the putative enhancer encompassing rs17713054 across 4 cell lines led to no detectible changes in expression of the predicted target gene LZFTL1. The lack of H3K27ac at the putative enhancer led the authors to conclude that this enhancer is not active in any of the lung epithelial cell lines tested, consistent with our STARR-seq results which suggest that rs17713054 is inactive in A549 cells.

      We nominated 6 amVars at the LZFTL1 locus (Table 1) and propose there are multiple functional variants with small effect sizes operating at this locus which together significantly contribute to risk. We have included an additional supplemental figure panel (Fig. S2H) showing a genome browser view of these variants. As suggested by the reviewer, we also compared our results to Jagoda et al. That study only reported allele-specific change and not baseline activity, it is thus possible that very weak signal (below our thresholds) can show up as allele-specific. This appears to be the case for at least one variant (rs35454877) which we call as inactive but nevertheless has a significant allele-specific activity (mpralm padj

      3) Given a subset of the prioritized variants originate from the credible set, were the amVars enriched in terms of posterior inclusion probability than the tested set? This technical information could be valuable for interpreting fine-mapping efforts.

      We did not observe an enrichment of posterior inclusion probabilities (PIPs) for the amVars or active variants compared to inactive variants. One reason could be that we primarily find variants in weak enhancers with moderate effect sizes which may be too subtle to be attributed a high PIP by GWAS due to insufficient statistical power. It is also possible that variants with a high PIP are active in other cell types. Fine-mapped variant sets already contain variants likely to be functional, so observing no difference between already statistically likely functional variants is perhaps not surprising. Another study testing melanoma risk variants similarly observed no statistically significant differences in PIPs between MPRA functional and non-functional variants (Long et al 2022 PMID: 36423637). We have included a supplemental plot of the PIP scores (Fig. S2G) for inactive, active and amVars and added this analysis in the first results section (see revised manuscript lines 181-186).

      4). Similarly, for the eQTL comparisons, what proportion of the amVar/eQTL pairs are directionally consistent (e.g. increased activity/increased expression)?

      For the 29 amVars, there are a total of 4689 combined eQTLs across all GTEx tissues. When filtering for lung, there are 180 eQTLs across the 29 amVars, whereby only 17/29 amVars have eQTLs in lung. For 16 of these 17 amVars, there is at least 1 eQTL in lung that is directionally concordant - listed in Table 1. Notably, however, almost all variants which have lung eQTLs with concordant direction also have lung eQTLs with discordant direction, suggesting the effects may be more complicated. When considering all lung eQTLs in GTEx v11, amVars were surprisingly enriched for discordant direction of effect (see figure below, left). However, we noticed this signal was driven entirely by the variants in the H2 haplotype block (as proposed by the reviewer in question 5), which includes many genes with varying effects which may be unrelated to our amVars. When excluding chr17, no enrichment was seen (see figure below, right). There was also no significant correlation between the effect size magnitude of eQTL and STARR-seq. Therefore, globally comparing amVars and eQTLs was not informative per se. We emphasize that we have few amVars (29), which makes subtle correlations/enrichments less likely to be detectable. Siraj et al. (2026) (PMID: 41741648), testing a much larger variant set than ours and in multiple cell lines, observed weak correlation between MPRA allelic effects and eQTL normalized effect size (Spearman;s p = 0.35), although these libraries were selected to include only fine-mapped eQTLs in high PIP, in comparison to our libraries which also include a large number of additional variants in LD. Overall, this suggests eQTL effect size is not a strong predictor for variant effects observed by MPRAs. We have included a discussion about this (see revised manuscript lines 186-190).

      5) Several of the variants implicated by STARR-seq, including several of the pairs with non-additive activity were associated with the MAPT locus. This locus has a common 900kb inversion in Europeans (PMID: 15654335), were these variants linked to the same H1/H2 haplotype?

      Indeed, all five prioritised variant pairs, as well as 5 amVars and 2 further variant pairs showing STARR-seq activity at this locus (Table 1, Table 2), are linked to the same (H2) haplotype. More specifically, all variants show high LD with the H2 haplotype-tagging SNP rs8070723‐G in European ancestry (r2 > 0.73) and are not linked to one of the H1 haplotype-tagging SNPs (rs242557-A, r2

      6) Were the variants with non-additive effects analyzed for transcription factor motifs?

      We looked for both motifs (FIMO and motifbreakR) and predictions of contribution scores using Malinois and AlphaGenome in the non-additive combinations, without finding any evidence for synergistic binding/activity. For example, see below the non-additive example in Fig. 3C (rs77819001_rs76667867), where the total activity prediction by Malinois is low (0.10-0.14), and there is no evidence of non-additive contribution scores as expected from the STARR-seq results. Because of the few examples, we cannot determine whether this is due to a systematic inability of the models to predict non-additivity, and therefore we chose not to present them. For transparency, we added the following sentence to the results: "For prioritized, non-additive variants pairs neither model identified an impact on transcription factor motifs that could explain the observed non-additivity. However, the few examples preclude drawing any general conclusions regarding the ability of these models to detect non-additivity."

      Minor comments: 1) The sentence in the methods, Variant selection and design section, "the 95% credible set from the second GenOMICC releases containing causal variants to 95% statistical probability," is somewhat unclear. Given that the next sentence describes the 99% credible set, the authors should use more consistent terminology.

      For the 3rd release we used the 99% credible set of variants to increase comprehensiveness of our library, meaning the list of variants contains causal variants to 99% statistical probability. In contrast, for the first and second release we used the 95% credible set as is the standard for fine-mapped variants. We clarified the phrasing in the methods as follows: "Fine-mapped severe COVID-19 risk variants encompassing causal variants to 95% statistical probability (95% credible set) from the first and second GenOMICC release2 and a more comprehensive 99% credible set of variants from the third GenOMICC release2 were included in the STARR-seq library."

      2) Some text in supplemental figures (Fig S6) is too small to be legible. Please either remove or adjust the figure size.

      We removed the variant IDs from figure panels S6A and S6B to aid readability.

      Reviewer #2

      Major comments: 1). The major conclusion is well supported by the main data presented; but additional clarification and extension in the discussion part may be helpful to determine the potential impacts and application of such conclusions especially related splicing isoform changes regulated by potential functional variants. "OPTIONAL" CRISPR editing for a couple of selected genes/variants will be helpful to confirm effects of these novel pathways.

      We thank the reviewer for the positive appraisal.

      Regarding splicing isoform changes, rs2297480 lies within the promoter-proximal region of two alternative FDPS isoforms which lack the penultimate exon encoding part of the catalytic domain. Therefore, we propose the variant could increase expression of a non-enzymatically functional FDPS isoform, which may compete with the functional isoform for substrate binding, thereby decreasing overall FDPS enzymatic activity. There are other examples of such "promoter usage" QTLs. We have rephrased this section and included references to studies supporting such a situation at other loci *"While speculative, global analyses have found examples where enhancer/promoter variants are proposed to lead to isoform expression changes (so called promoter usage QTLs), which may have disease implications (PMID 36037215, 30618377)" *

      While CRISPR editing could be interesting, it would require extensive additional resources and is outside of the scope of the current manuscript. As a significant proportion of our amVars are not within accessible chromatin nor overlapping active chromatin modifications, we expect these to be functional in a different cell type rather than A549. Identifying a suitable cell line for CRISPR editing would therefore be non-trivial. Furthermore, the small effect size of our hits suggests seeing clear effects of single variants on transcription may be hard, as genes can be controlled by multiple enhancers simultaneously.

      Minor comments: 1, please be specific about the proportion here in the text "Similarly, the proportion of active candidate sequences overlapping predicted ENCODE CREs in A549 cells was increased compared to inactive sequences (Fig. S2F)."

      We added the specific proportions to the text: "Similarly, the proportion of active candidate sequences (39.3%) overlapping predicted ENCODE CREs in A549 cells was increased compared to inactive sequences (23.2%) (Fig. S2E)."

      2, Out of 29 variants showing allele-specific effects, how many of them are close to the known TSS of candidate genes. Is IFNA is the only gene nearby these 29 variants?

      rs7041102/rs7040981 reside ~4.5-5.5kb from the TSSs of IFNA10 and IFNA16. More gene-proximal variants include rs2297490, residing within in the first intron of the reference FDPS isoform and in the promoter-proximal region of three further FDPS isoforms (discussed above). In addition, rs145951274 resides in the first intron of HCG27 and rs3130925 in the first intron of the reference MICB isoform and within the promoter region of an alternative MICB isoform. We have added information on the distance to the closest TSS for all 29 amVars in Table 1 and indicated whether the variant is intronic.

      3, out of 166 variants, what are genes with TSS closer to the 166 variants. It will be helpful to have a table or list of these genes since their promoter close to the significant variants

      Of 166 active variants, 20 are within 1kb of the nearest TSS. Of those, 16 occur in the 1kb upstream or 100bp downstream of the TSS, the other 4 variants are within 1kb downstream. We have added this information as a new column to Supplementary Table 3 now showing the distance to the nearest TSS for all single variants tested, and we have modified figure S2F (previously S2E) which now shows the comparison of TSS proximity for inactive, active, and amVars.

      4, Do these 16 combinations of variants pairs have genetic interaction in the population levels, i.e. epistasis?

      This is an intriguing point, but challenging to test and beyond the scope of this work. 12/16 variant pairs are in very high or perfect LD (Table 2) and therefore either both risk variants or neither will co-occur in the population. We therefore cannot test if two variants have epistatic (beyond additive) effects in the population, nor can we directly link individual variants to a biological phenotype and therefore test for epistatic effects on phenotypes. We are limited to testing for beyond additive, i.e. epistatic, regulatory effects in the context of the STARR-seq assay, which we show in Fig 3B.

      5, It needs more clarification why the risk allele of rs2297480 at the FDPS locus is associated with increased enhancer activity and decreased levels or activity of FDPS?

      We have addressed this point under major comment 1 in the context of enhancer/promoter-driven isoform switches as plausible disease mechanism.

      To clarify, the rs2297480 risk allele showed increased enhancer activity by STARR-seq. The variant lies within the promoter-proximal region of two alternative FDPS isoforms which lack the penultimate exon encoding part of the catalytic domain. Therefore, we propose the variant could increase expression of a non-enzymatically functional FDPS isoform, thereby decreasing overall FDPS enzymatic activity as the enzymatically inert isoform may compete with the functional isoform for substrate binding. We emphasize that this possible mechanism at FDPS is speculative.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This Manuscript by Dr. Weykopf and Friman et al applied STARR-Seq method to screen and identify function variants that determine the severity of COVID-19 patients in lung epithelial cell lines followed by functional validation. Both additive and non additive effects of various regulatory variants were evaluated. Furthermore machine learning models were applied to interpret allele specific variant effects. This is a pioneering work to identify functional variants on GWAS loci associated with severe COVID-19 with solid methods and modeling. sufficient literatures were cited and discussed. The major limitations were well discussed.

      Significance

      Major comments

      The major conclusion is well supported by the main data presented; but additional clarification and extension in the discussion part may be helpful to determine the potential impacts and application of such conclusions especially related splicing isoform changes regulated by potential functional variants. "OPTIONAL" CRISPR editing for a couple of selected genes/variants will be helpful to confirm effects of these novel pathways. .

      Minor comments

      1. please be specific about the proportion here in the text "Similarly, the proportion of active candidate sequences 153 overlapping predicted ENCODE CREs in A549 cells was increased compared to 154 inactive sequences (Fig. S2F)."
      2. Out of 29 variants showing allele-specific effects, how many of them are close to the known TSS of candidate genes. Is IFNA is the only gene nearby these 29 variants?
      3. out of 166 variants, what are genes with TSS closer to the 166 variants. It will be helpful to have a table or list of these genes since their promoter close to the signficant variants
      4. Do these 16 combinations of variants pairs have genetic interaction in the population levels, i.e. epistasis?
      5. It needs more clarification why the risk allele of rs2297480 at the FDPS locus is associated with increased enhancer activity and decreased levels or activity of FDPS?
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      To investigate the contribution of non-coding GWAS variants linked to COVID19 severity across individuals, Weykopf and colleagues assayed allelic differences in reporter activity using STARR-seq in the A549 lung epithelial cancer cell line. The authors prioritize 4,894 COVID19 severity associated variants from several GWAS for functional screening. Of these, 166 of the variants designed elements displayed significant enhancer activity, and 29 of these also displayed allele-modulated activity. These results were further contextualized through comparisons with lung-cell relevant chromatin marks. Additionally, a subset of variants in close proximity were analyzed for non-additive effects in the reporter assay. In addition, results were compared results to predictions using deep modeling approaches such as AlphaGenome and Malinois. This approach allows for a systematic characterization of these variants and expands on previous work focused on narrower sets of variants. While well written and results are presented clearly, several additions could help with placing results in context.

      Major comments:

      • The study focused on regulatory activity in a lung-derived cellular setting and was well executed. However, the degree that non-coding variation in lung elements, particularly alveolar basal epithelial cells, modeled by A549 cells, contributes genetic risk for COVID19 severity is unclear. Especially as non-coding variants in other contexts such as immune cells have been shown to be enriched for disease risk. To strengthen the choice for the A549 cellular context, the authors can assess enrichment for COVID19 severity heritability using stratified LD-score regression (PMID: 26414678) using A549/lung epithelium chromatin data (ATAC-seq, CHIP-seq) to check for enrichment polygenic signal or if the lung associated-risk is focused on a restricted set of genome-wide significant signals.
      • It would strengthen the manuscript to compare the results to prior analyses where overlap exists (eg PMID:36763080). Particularly it would be informative to address if nominated variants for signals have different variants operating in different cell types. Also, one prominent variant, rs17713054, that had previously been nominated to operate in lung through in silico predictions and CRISPR perturbations (PMID:34737427) appears to be non-significant in this STARR-seq analysis. Was a different variant nominated at this locus? Could the authors expand on methodological differences that could explain this difference?
      • Given a subset of the prioritized variants originate from the credible set, were the amVars enriched in terms of posterior inclusion probability than the tested set? This technical information could be valuable for interpreting fine-mapping efforts. Similarly, for the eQTL comparisons, what proportion of the amVar/eQTL pairs are directionally consistent (e.g. increased activity/increased expression)?
      • Several of the variants implicated by STARR-seq, including several of the pairs with non-additive activity were associated with the MAPT locus. This locus has a common 900kb inversion in Europeans (PMID: 15654335), were these variants linked to the same H1/H2 haplotype?
      • Were the variants with non-additive effects analyzed for transcription factor motifs?

      Minor comments:

      • The sentence in the methods, Variant selection and design section, "the 95% credible set from the second GenOMICC releases containing causal variants to 95% statistical probability," is somewhat unclear. Given that the next sentence describes the 99% credible set, the authors should use more consistent terminology.
      • Some text in supplemental figures (Fig S6) is too small to be legible. Please either remove or adjust the figure size.

      Significance

      The authors performed a systematic evaluation of COVID19 risk variants in a lung relevant cell line. This study expands the number of variants tested as well as explores them in the lung cellular context. As this study did not filter tested variants allowing for comprehensive integration with chromatin annotations and computational predictions. This provides nominates a short list of lung relevant variants for further investigation. This paper will be of interest to genetics community interested in basic research in COVID19 severity.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __ __We thank all reviewers for the valuable feedback and critical insight on our study. We acknowledge the concern that the manuscript, in its initial form, appeared descriptive and did not provide the mechanistic insight inferred from the current data. In the revised manuscript, we will (i) more clearly delineate what mechanistic inferences can be drawn from the existing data, (ii) expand our discussion of the caspase-independent mechanisms, and (iii) incorporate additional experiments/analyses aimed at identifying downstream effectors that mediate the observed phenotypes. In this revision plan, we have included six new figures addressing some of the major issues raised by reviewers.

      1. Specifically, to address questions about mechanistic insight, we generated stable ACSL1:HaloTag expressing hESCs. Currently presented as Figure 1A for reviewers____. __ACSL1 is a critical enzyme that catalyzes the first step of fatty acid oxidation at the outer mitochondrial membrane. Our previous analysis and work from the Opferman lab demonstrated that ACSL1 contains a BH3-like domain. Thus, we examined the effects of MCL-1 inhibition on the mitochondrial localization of this enzyme. Our findings pinpoint that MCL-1 inhibition is causing the displacement of ACSL1 from the mitochondria (__Figures 1B-C for reviewers). Our interpretations of the effects of MCL-1 inhibition are 2-fold: 1) as we show in our data, MCL-1 inhibition causes disruption of the mitochondrial cristae, altering the microenvironment for fatty acid oxidation, and 2) as seen in cancer cells, the MCL-1 inhibitor may also displace ACSL1 from the mitochondria. In the new version of the manuscript, we will focus on these 2 mechanisms as mechanistic outcomes of MCL-1 inhibition.
      2. We have included data of cells treated with Perhexilin (CPT1/2 inhibitor), and Etomoxir (CPT1a inhibitor) (Figure 2 for reviewers). This experiment determines whether direct perturbation the FAO pathway mimics the effects of the MCL-1i.
      3. We have assayed the effects of MCL-1 inhibition on oxygen consumption rates in NPCs. Currently presented as Figure 3 for reviewers.
      4. We will perform MCL-1:MICOS proximity ligation assays and/or immunoprecipitation assays to determine whether MCL-1 inhibitors disrupt the association of MCL-1 with MICOS. Preliminary data suggesting an association (albeit, very weak) are shown in Figure 4 for reviewers. __Reviewer #1____ (Evidence, reproducibility and clarity (Required)): __

      Summary: This study claims that beyond its canonical anti-apoptotic function, MCL-1 has essential non-apoptotic roles in human neurodevelopment. Pharmacologic inhibition of MCL-1 in human neural stem cells disrupts mitochondrial inner membrane architecture by destabilizing cristae and the OPA1-MICOS complex, leading to swollen mitochondria with disorganized cristae. These structural defects impair fatty acid oxidation and lipid droplet homeostasis, linking cristae integrity to metabolic competence. Independently of apoptosis or proliferation, MCL-1 inhibition selectively depletes intermediate neural progenitors, indicating a direct role in lineage progression. Overall, the work positions MCL-1 as a key regulator of mitochondrial structure-metabolism coupling that instructs neural progenitor identity and human neurogenesis.

      Overall: The study does a good job of using (in most assays) caspase inhibition (e.g., QVD treatment) to block apoptotic responses induced by MCL-1 inhibition. As a result, many of the phenotypes caused by inhibition are likely to be independent of caspase activation. As a result, this manuscript would be of interest to researchers that study the topics of the BCL-2 family and cell death signaling, mitochondrial bioenergetics and dynamics, neurodevelopment, and cellular metabolism. However, as currently presented the manuscript is only descriptive and lacks mechanistic insight.

      We thank Reviewer 1 for the insightful evaluation of our work. We are encouraged that the reviewer finds the study relevant to investigators in the fields of BCL-2 family biology, mitochondrial dynamics and bioenergetics, neurodevelopment, and cellular metabolism. We also thank the reviewer for pointing out the need to increase the mechanistic insight of our findings. As mentioned above, in the revised manuscript, we are proposing to address this.

      Major Concerns:

      1) The authors only use a single MCL-1 inhibitor and never use other non-targeting BH3-mimetics (such as venetoclax) as negative controls. This seems like a missed opportunity to demonstrate that the phenotypes observed are MCL-1 dependent.

      This is an excellent point. We will include venetoclax (ABT-199) to examine their effect on intermediate progenitors (TBR2 +) and early born neurons (BIII tubulin +).

      2) There is no mechanism proposed in this study other than reliance upon QVD as not affecting the phenotypes. As submitted, the manuscript only can speculate that these phenotypes are due to non-apoptotic roles of MCL-1 inhibition. The authors have missed an opportunity to explore MCL-1's non-apoptotic functions directly.

      Mechanistically, we propose MCL-1 is acting in 2 ways: 1) as we show in our data, MCL-1 inhibition causes disruption of the mitochondrial cristae, altering the microenvironment for fatty acid oxidation, and 2) as seen in cancer cells, MCL-1 inhibitors may also displace ACSL1 from the mitochondria.

      In the past few weeks, since receiving the initial reviews, we have focused on testing the 2nd possibility, since the accumulation of lipids was also seen in cancer cells (see PMID: 38503284). We have successfully generated stable ACSL1:HaloTag expressing hESCs (Figure 1A for reviewers). Our findings included here, ACSL1 is displaced from the mitochondria by MCL-1 inhibition in NPCs (Figures 1B-C for reviewers).

      Other concerns exist that weaken the impact of the study.

      1. Figure 1 should include the fact that QVD inhibition (shown in Sup Fig 2) does not obviate the phenotype induced by pharmacological inhibition of MCL-1 on mitochondrial morphology. We would like to clarify that QVD does prevent the phenotypes induced by MCL-1 inhibition on mitochondrial morphology. In Fig1B, we report an increase in volume and surface area at 24h and 48h along with a decrease in mitochondrial content at 48h when NPCs were treated with MCL-1i only. However, NPCs co-treated with QVD in Supp Fig 2B did not exhibit any significant morphological phenotypes on average or at min/max values. Reviewer 1 may be referring to Fig 1B’s corresponding min/max values presented in Supp Fig 2A where we reported an increase in __max __volume.

      Figure #

      Volume

      Surface Area

      Fig 1B (MCL-1i only, avg values)

      Increase (avg vol)

      increase (avg)

      Supp Fig 2B (MCL-1i+QVD)

      no change

      no change

      Supp Fig 2A (MCL-1i only, max/min values)

      increase (max vol)

      no change (max)

      For clarity, we will move Supplementary Fig 2A into Supplementary Fig 1.

      Figure 2 would benefit from evidence that caspase inhibition does not repress the phenotype on mitochondrial cristae morphology (volume and area). Furthermore, the FIB-SEM data are very hard to appreciate as the size precludes visualization of individual mitochondria.

      While we included the visualization of the segmented mitochondria and cristae (Figure 2C), as well as snapshots through the z-stack for segmented cristae only (Figure 2E) and segmented mitochondria separately (Supp Figure 3A) in the original manuscript, we are also now attaching the FIB-SEM 3D reconstruction videos (New Supplementary Videos 1-2 for reviewers) (1. Mito and cristae, 2. Cristae only, 3. Mito only) for ease of visualization purposes.

      Figure 3 reports that MIC60 and OPA1 appear to be downregulated in response to MCL-1 inhibition, but these appear to be more significant only when QVD is added. Why would the phenotype be obscured in the non-QVD setting (Fig. 2B&C). How does MCL-1 inhibition lead to changes in MIC60/MICOS/OPA1? This seems quite preliminary at this point.

      In Figures 3B and 3C, we report decreased protein levels of short-form OPA1 and MIC10 only, not MIC60. We argue that our data with QVD shows that the cell death function of MCL-1 (i.e., inhibiting cell death effectors from initiating the caspase cascade) is not the main trigger of the phenotypes we report (cristae dysregulation and fatty acid oxidation disruption), however, cells without a functional cristae and/or defects in FAO, may not be able to survive long-term. Thus, QVD treatment preserves these cells that may not survive the dismantling of such an essential structure. To confirm this, we have performed immunofluorescence of cleaved caspase 3 (Figure 5 for reviewers). These results show that indeed MCL-1 inhibition at the time points of our study doesn’t result in increased activation of Caspase-3. We reported similar results of MCL-1 inhibition in oligodendrocyte precursor cells (Gil and Hanna et al., Glia, 2025, PMID: 41420072)

      The loss of MIC60 and OPA1 should repress electron transport chain function, are such impacts observed in the cultured cells? This could be shown by assessing oxygen consumption, etc. Such data would enhance the authors' conclusion that MCL-1 inhibition leads to defects in mitochondrial physiology*. *

      We completely agree with this comment by Reviewer 1. In our revision, we will include an assessment of mitochondrial oxygen consumption rate, using the Seahorse analyzer (mitochondrial stress test), of NPCs treated with MCL-1i. Preliminary data (n=3) are currently presented as Figure 3 for reviewers. Interestingly, these data show a more nuanced cellular response. Consistent with our conclusion that MCL-1 inhibition does not cause apoptotic cell death, MCL-1i did not affect mitochondrial respiration at baseline. The specific deficits appear in spare respiratory capacity and maximal respiration, meaning cells can sustain routine mitochondrial function but lose the ability to respond to increased energetic demand. This suggests MCL-1 loss creates a mitochondrial reserve deficiency rather than a generalized bioenergetic failure. The results with caspase inhibitors show a near-zero OCR across both 24h and 48h timepoints, and significant reductions in maximal respiration, spare respiratory capacity, and non-mitochondrial OCR. Remarkably, these conditions are not detrimental to newborn neurons, as shown in Figure 7. This is very interesting because it suggests that, under severe bioenergetic failure, neural stem cells (PAX6+) can differentiate into newborn neurons in a TBR2-independent manner. More relevant to this study, our results unequivocally demonstrate that TBR2-positive cells depend on the non-apoptotic function of MCL-1

      In Figure 4, the differences between transcripts (qPCR data) and protein (immunoblot) data are often confusing and not well explained. Why do the authors propose that mRNA expression is decreasing whereas the protein expression is increasing? Example CPT1. Furthermore, it is unclear what these data mean functionally? Is this reflective of enhanced lipid oxidation or simply a response to inhibition of fatty acid oxidation? Clarification of the impact of these findings is necessary.

      We agree with Reviewer 1 that the results could be hard to interpret. However, the effects of MCL-1 inhibitors on the transcription of fatty acid oxidation genes have been widely cited by the work of Opferman and Walensky (PMID: 36198266). We speculate that the effects on transcription are triggered by mitochondrial signaling. The mechanistic insight into this phenomenon would be an interesting next step.

      In the case of CPT1, we addressed this comment and found that the difference is due to differential expression of isoforms The RT-qPCR shown in Figure 4, is on CPT1c, whereas the western blot is on CPT1a. Unfortunately, after trying several products, we determined that there are no good antibodies for CPT1c. Thus, since we can’t compare gene and protein expression, we will include CPT1a RT-qPCR data to complement the western blot.

      The increase in lipid droplet number induced by MCL-1 inhibition has been previously documented, but it is unclear whether this increase is related to an inability to oxidize lipid (defective fatty acid oxidation) that leads to increases in the cellular abundance or whether this indicates that MCL-1 inhibition leads to enhanced storage. Do other inhibitors of fatty acid oxidation lead to similar increases in lipid droplet size and abundance? Does QVD inhibition affect this phenotype?

      This is a great point raised by Reviewer 1, and one we have also wondered about. We conducted an experiment using C16 BODIPY to address this point (Figure 6 for Reviewers). We observed no changes in C16 lipid droplet accumulation in count, volume, or surface area when cells were treated with MCL-1 inhibitor for 24 hours total with or without a starvation period in the last 6 hours of treatment. However, we observed significant pan-lipid droplet accumulation in the same conditions. This contrast suggests that FAO of exogenous LC-fatty acids is not reliant on MCL-1. This finding does not discount from the requirement of MCL-1 for other FAO processes especially given the major limitation of how much C16 BODIPY (fluorescent palmitate) can be administered to the cells (10µM) which was 10-fold less than what we exogenously supplied to the cells for the pan-BODIPY experiment (100µM, see Figure 5). It is entirely possible that this small dose was not enough to detect any lipid droplet accumulation.

      We have now also included experiments using etomoxir and perhexiline to assess their effects on TBR2/PAX6 (Figure 2 for reviewers). The results indicate that inhibiting the FAO pathway does not fully mimic the effects of MCL-1i on TBR2. However, we show that MCL-1i displaces ACSL1 from the mitochondria, a step that is upstream of CPT1/2. We suggest a model in which the coordinated non-apoptotic function of MCL-1 at the outer mitochondrial membrane promotes ACSL1 activity and, in the inner mitochondrial membrane, regulates mitochondrial cristae morphology. While our data point to this model, we are limited by the tools to investigate it further, but it will be a great direction for future experiments.

      For Figure 6, while these data may be very meaningful, as presented they are very hard to appreciate. Insets that show the neuronal populations would help to convey the point that the differentiation is impacted. Also, are there other methods that could confirm these observations (qPCR to show changes in differentiation).

      We agree with Reviewer 1. In the new version of the manuscript, we will include panels that zoom into the cell populations we quantified. The current panels will go to a new Supplemental figure. We will also add the TUBB3 to the qPCR panel in the new version.

      Figure 7 is also very hard to appreciate. What is the reader to see? Can these be quantified? It seems that QVD may be rescuing in this figure, does this suggest that MCL-1 inhibition might be inducing death. All of this needs to be quantified.

      We will provide quantification of BIII tubulin branching, and it will be included next to the images provided.

      BCL-XL has also been implicated in affecting mitochondrial electron transport chain function (See PMID: 19255249, 21926988, 21987637). Can BCL-XL inhibitors affect any of the phenotypes associated here?

      We will include experiments to test the effect of BCL-2 and BCL-XL inhibitors on TBR2 cells to address this comment.

      Please be carefully avoid using the term "MCL-1 loss", when talking about pharmacological inhibition. Only genetic ablation (e.g. knockout, silencing, etc.) should be termed loss.

      We have now removed the reference to MCL-1 loss in line 199.

      __*Reviewer #1 (Significance (Required)):

      The study advances in human cells the impacts of MCL-1 inhibition. They replicate many impacts previously observed in mouse systems and refine analyses to impacts on MICOS complex, lipid droplet storage, and neuronal differentiation. While these findings are important and would be well received by a wide audience, the study fails to provide almost any mechanistic insight into how these phenotypes are being induced. The only common theme is that blocking caspase activation in many assays fails to block the phenotype.

      *__

      __Reviewer #2_ (Evidence, reproducibility and clarity (Required)): _*

      Summary: This manuscript by Hanna et al. investigates non-apoptotic roles of MCL-1 in human neural stem cells and connects MCL-1 inhibition to mitochondrial cristae formation and beta-oxidation. Connecting these roles to brain development, the authors also show a reduction in the number of progenitor cells upon MCL-1 inhibition, independently of caspase activity. Throughout their work, the authors make use of an impressive array of imaging techniques. While the methods used offer sufficient evidence to connect MCL-1 inhibition to cristae architecture, the mechanistic underpinnings of this effect remain unexplored. *__

      We thank Reviewer 2 for the thoughtful and positive assessment of our manuscript. We appreciate the reviewer’s recognition that our study reveals non-apoptotic roles of MCL-1 in human neural stem cells. We are also grateful for the acknowledgment of the imaging approaches employed, which allowed us to connect MCL-1 function to cristae architecture with multiple complementary techniques. We acknowledge the reviewer’s point that the mechanistic basis by which MCL-1 influences cristae structure remains insufficiently defined. In the revised manuscript, we will clarify the limitations of the current data, expand our discussion of potential mechanisms, and incorporate additional analyses to identify downstream effectors that mediate these structural and metabolic changes.

      Major comments:

      - In Fig. 1B, the very same representative images are shown for both conditions (DMSO and S63845) at 48 hours.

      We deeply appreciate Reviewer 2 for catching this unintentional duplication that occurred during figure preparation. We have now corrected this issue.

      - For Western Blot analysis, it looks like the authors only quantified the band density of their proteins of interest without considering varying levels of control protein (Actin) levels. Normalizing the protein levels to actin would account for any differences in loaded protein amounts (although a Ponceau staining might be preferable still to exclude this). This is especially relevant for Fig. 4E, where actin levels visibly differ between the conditions.

      All WB quantifications were normalized to Actin (this detail is now added to the y-axis of all band density graphs and figure legends). In addition, we will transform the data to a logarithmic scale to “normalize” for gel-to-gel variability.

      - The authors offer evidence that MCL-1 inhibition impedes proteolytic cleavage of OPA1-L into the OPA-1-S isoforms, yet do not explore the mechanism behind this. Since OPA1 is cleaved by both OMA1 and YME1L, determination of the levels of these proteases could help shed some light on the mechanism leading to cristae reorganization.

      We will follow up on Reviewer 2's comment with a WB analysis of OMA1 and YMEL in cells treated with an MCL-1 inhibitor.

      - Generally speaking, while the authors show all those effects (cristae defects, FAO dysfunction) upon MCL-1 inhibition, it would be interesting to see whether any of those effects can be rescued by blocking FA import e.g. through carnitine palmitoyl- transferase 1a (CPT1a) inhibition with etomoxir to understand if they are downstream of altered Fa supply. This could affect cristae morphology through altered Cardiolipin biogenesis.

      This is an excellent point, which was also raised by reviewer 1. We have now included experiments using etomoxir and perhexiline to assess their effects on TBR2/PAX6 (Figure 2 for Reviewers). As mentioned above, the results indicate that inhibiting the FAO pathway does not fully mimic the effects of MCL-1i on TBR2. However, we show that MCL-1i displaces ACSL1 from the mitochondria, a step that is upstream of CPT1 and 2. We suggest a model in which the coordinated non-apoptotic function of MCL-1 at the outer mitochondrial membrane promotes ACSL1 activity and, in the inner mitochondrial membrane, regulates mitochondrial cristae morphology. While our data point to this model, we are limited by the tools to investigate it further, but it will be a great direction for future experiments. The suggestion of Reviewer 2 that the effects on FAO could impact cardiolipin biogenesis is a very exciting possibility. However, difficult to test with the tools available.

      - In line 262 the authors discuss that mitochondria lose metabolic function upon MCL-1 inhibition. This claim would require additional experiments. While the authors look at lipid droplet accumulation and FAO enzymes, there are many more aspects to mitochondrial metabolic function that should be investigated. While measuring the oxygen consumption rate via Seahorse might require additional resources (optional), measurements of ATP production, ROS generation or determination of the mitochondrial membrane potential should be feasible.

      We fully agree with Reviewer 2's comment, which was also raised by Reviewer 1. In our revision, we will include an assessment of the mitochondrial oxygen consumption rate of NPCs treated with MCL-1i, measured using the Seahorse analyzer (mitochondrial stress test). These data are presented as Figure 3 for reviewers. Interestingly, these data show a more nuanced cellular response. While MCL-1i does not globally collapse mitochondrial respiration at baseline, the specific deficits appear in spare respiratory capacity and maximal respiration, meaning cells can sustain routine mitochondrial function but lose the ability to respond to increased energetic demand. This suggests MCL-1 loss creates a mitochondrial reserve deficiency rather than a generalized bioenergetic failure. The results with caspase inhibitors show a near-zero OCR across both 24h and 48h timepoints, and significant reductions in maximal respiration, spare respiratory capacity, and non-mitochondrial OCR. These conditions are detrimental for TBR2-positive NPCs (Figure 6) , but not for newborn neurons (Figure 7).

      - While the authors "propose a model in which MCL-1 associates with MICOS", they do not offer direct scientific to support this hypothesis. Co-immunoprecipitation experiments or e.g. proximity ligation assays would better support the proposed model.

      We agree with this statement. Preliminary, we have performed proximity ligation assays and immunoprecipitation analyses to test for this interaction (see below and ____Figure 4 for reviewers), and the results indicate an interaction, albeit very weak. In the revised version of the manuscript, we will attempt to repeat these experiments with MCL-1i.

      - While Fig. 7 shows representative images, quantification e.g. for the truncation of neuronal processes is missing.

      We will provide quantification of BIII tubulin branching, which will be included alongside the images provided.

      - In lines 219f. the authors state that they "observed a significant downregulation of PAX6 and EOMES at 24 hours that was not rescued by QVD co-treatment". While there is still a trend towards a downregulation, there is no statistical significance anymore. In fact, PAX6 levels almost mirror those of SOX2 which is not described as "downregulated" by the authors. In order to be more consistent, I would suggest rephrasing this part, or at least reword it to be less absolute.

      In the new version, we will clarify that while QVD rescued TBR2 and PAX6 transcript levels at 24h, it did not rescue them at 48h. We will also mention the downregulation of SOX2 at 48h that persists with co-treatment.

      - Brinkmann et al. (2025) also investigated cristae structure upon MCL-1 deletion in vivo and found no effect when MCL-1 was replaced with other Bcl-2 family members. It would be interesting to combine MCL-1 inhibition with overexpression of MCL-1 versus BCL-XL to reconsolidate some of the discrepant findings.

      While this is a great suggestion for future studies, there are some complications. Specifically, it is likely that the inhibitor may also target the overexpressed MCL-1 and thus, a mutant form is needed.

      To address this, we generated a Flag-tagged MCL-1 construct with a mutated BH3 domain, previously described by Kotschy et al. Nature 2016. We validated the construct in HeLa cells, but unfortunately the mutant protein appears to be significantly less stable than the WT construct, complicating analysis of this experiment.

      Minor comments:

      - In Supp. Fig. 1C the MCL-1 protein is shown both to run above 37kDa (upper panel) and below 37 kDa (lower panel). Could the authors please comment on why this is the case?

      The observed variation is caused by drift in the gel during electrophoresis. In Fig 1C, the protein ladder is on the edge of the gel, whereas in Fig 1E, the protein ladder is in the middle of the gel, and the last sample is on the edge and also exhibits edge drift.

      - In line 64 of the introduction the authors mention clinical trials yet do not give a citation for these trials making it hard to judge whether the content of these trials is actually related to the brain.

      This information is anecdotal, based on an Amgen press release.

      - MCL-1 as well as ACSL-1 are sometimes written without the hyphen both in the text and figures.

      We will carefully check the manuscript before submission.

      - Lines 92-94 and 106-108 essentially highlight the same existing knowledge gap. Maybe the content of these two paragraphs could be combined in order to avoid repetition.

      We thank Reviewer 2 for this suggestion. We will do this in the new version of the manuscript.

      - In Fig. 1A, the authors provide a schematic for their experimental design. While the figure legend is very thorough, some of this information (like the days of collection) could also be included in the figure itself. The same is true for schematics in the following figures.

      We agree with this and will incorporate the suggestion in the new version.

      - Fig. 2A includes a typo (analyze) but would maybe also be more suitable for the supplement figures or could even be combined with Fig. 1A as not much new content is added.

      We already incorporated these changes in the new version of the manuscript.

      - Regarding statistical analysis, could the authors please comment on why they did not consider one-sample t-tests suitable for the cases where control values were set at 1 (e.g. Fig. 4B, C for the relative expression).

      This is a valid suggestion. We will rerun RT-qPCR data using a one-sample t-test.

      - In lines 247f. the authors state that "inhibition of MCL-1 leads to [...] and disassembly of the MICOS complex as well as OPA1". This sounds like OPA1 is still cleaved upon MCL-1, which is not at all what the authors showed and further discuss. Rewording of the sentence would help in avoiding any misunderstandings.

      We agree with this comment and have now reworded the paragraph: “Inhibition of MCL-1 leads to structural collapse of the cristae likely due to the possible disassembly of the MICOS complex, as suggested by decreased MIC10 levels, and interruption of OPA1 cleavage, as suggested by decreased short-form OPA1, two scaffolds required for cristae maintenance.”

      - In lines 210f. the authors state that "quantitative imaging increased the average and maximum volume of lipid droplets". While there is definitely a trend towards an increase for the maximum volume, the increase is in fact not statistically significant. This should be reflected in the wording.

      We have reworded this to “Quantitative imaging revealed a significant increase in average lipid droplet volume and a trending increase in maximum volume of lipid droplets.”

      - In Fig. 6 the overlap between TBR2 and PAX6 is hard to judge when printed out. Including a zoom-in may make it easier to judge.

      We agree with Reviewer 2. In the new version of the manuscript, we will include panels that zoom into the cell populations we quantified. The current panels will go to a new Supplemental figure. We will also add the TUBB3 to the qPCR panel in the new version.

      - In Fig. 7 the color-coding is listed in the figure legend but is missing from the figure itself. If the authors could include this, as they did for the other figures, it would further improve this figure.

      We agree. We have specified the channel color in the new figure.

      - Line 238 should reference Fig. 7A, as Fig 7B does not exist.

      Thanks for catching this. It is already corrected

      - In the figure legends the authors state that biological replicates were used. Were technical replicates also performed?

      Yes, technical replicates were performed for RT-qPCR.

      Reviewer #2 (Significance (Required)):____ Significance

      The authors make use of a wide array of imaging techniques to further elucidate non-apoptotic roles of MCL-1. The study has the potential to offer new insights into mitochondrial biology on the level of basic research rather than translational. While the methods used offer sufficient evidence to connect MCL-1 inhibition to cristae architecture, the mechanistic underpinnings of this effect remain unexplored. Nevertheless, the study offers additional knowledge on the role of MCL-1 in human neural stem cells, whereas previous research mostly focused on cardiomyocytes or cancer cells.

      Reviewer #3____ (Evidence, reproducibility and clarity (Required)):

      Summary: ____ In this study, Gama et al. describe a non-canonical role for the anti-apoptotic protein Myeloid Cell Leukemia-1 (MCL-1) in mitochondrial cristae organization and suggest a role of MCL-1 in regulating metabolism and neuronal differentiation. Using fluorescence microscopy imaging and electron microscopy, the authors show changes to mitochondrial morphology upon treatment with MCL-1 inhibitor S63845. MCL-1 inhibition results in altered protein and transcript levels of some key proteins involved in mitochondrial cristae organization and fatty acid metabolism. While some of the findings are interesting and indeed point towards a non-canonical role of MCL-1, several key conclusions of the authors are not sufficiently supported by the data shown in the manuscript.

      We thank Reviewer 3 for the careful evaluation of our manuscript. We appreciate the reviewer’s recognition that our study identifies a potential non-canonical role for MCL-1 in mitochondrial cristae organization, metabolism, and neuronal differentiation. As with Reviews 1 and 2, we are encouraged that the reviewer finds these observations interesting and suggestive of previously unappreciated functions for MCL-1. We agree that stronger evidence is required to firmly link MCL-1 inhibition to specific changes in MICOS organization and metabolic regulation. In the revised manuscript, we will (i) more clearly distinguish between observations and mechanistic inferences, (ii) temper conclusions where appropriate, and (iii) incorporate additional analyses and controls to better substantiate the proposed model.

      Major comments:

      1. The authors try to disentangle the apoptotic and non-apoptotic role of MCL-1 through addition of a caspase inhibitor. However, I am not convinced that phenotypes found under the addition of caspase inhibitor are necessarily caused by non-canonical functions independent of apoptosis. It could also be that the observed changes happen upstream of caspase activation. In addition, many of the described finding, such as CPT1 expression changes, only happen in the presence of the caspase inhibitor. If one follows the logic of the authors, changes associated by non-canonical MCL-1 functions should happen under MCL-1 inhibition and caspase inhibition, but not with MCL-1 inhibition only____. __ The reviewer is right that we expected non-canonical functions to happen under MCL-1 inhibition and caspase inhibition. Our data with QVD shows that the cell death function of MCL-1 (i.e., inhibiting cell death effectors from initiating the caspase cascade) is not the main trigger of the phenotypes we report (cristae dysregulation and fatty acid oxidation disruption), however, cells without a functional cristae and/or defects in FAO, may not be able to survive long-term. Thus, QVD treatment preserves these cells that may not survive the dismantling of such an essential structure. To confirm this, we performed immunofluorescence of cleaved caspase 3 (__Figure 5 for reviewers). These results show that, indeed, MCL-1 inhibition at the time points of our study doesn’t result in increased Caspase-3 activation. We reported similar results of MCL-1 inhibition in oligodendrocyte precursor cells (Gil and Hanna et al., Glia, 2025, PMID: 41420072).

      The authors show no data on the viability of the cells in response to the MCL-1 inhibitor. To exclude secondary effects of the inhibitor, at least some of the results should be validated with an MCL-1 knock down.

      We will include this experiment in our revised manuscript. To check the effects of MCL-1 knockdown on TBR2 positive cells, we tested 5 different ASOs for MCL-1. Knockdown efficiency with ASOs was very low (on average In Figure 1, the authors show immunofluorescence data of mitochondria and nucleus staining and conclude that MCL-1 inhibition alters mitochondrial morphology. Based on the images shown in Fig. 1, I do not think that individual mitochondria can be segmentd to measure their volume and length. In addition, some metrics such as mitochondrial content are not explained in the text or methods.

      We can achieve mitochondrial segmentation with a SoRa Spinning Disk Confocal Microscope, which has a lateral (XY) resolution of approximately 120 nm to 150 nm and an axial (Z) resolution of approximately 300 nm–320 nm. All images are first denoised prior to sharpening using the Richardson-Lucy deconvolution algorithm. Additionally, the FIB-SEM data are consistent with the IF data (both show increase in mitochondrial volume and surface area).

      We agree with Reviewer 3 that we need to explain some metrics in the revised version. We will specify the meaning of mitochondrial content (count of all mitochondria in FOV, not normalized to Hoechst).

      In Fig. 2 B-D, the authors show TEM and FIB-SEM imaging to demonstrate alterations in the cristae architecture upon treatment with MCL-1 inhibitor. However, based on the images shown, it looks that cristae area and density is reduced under S63845 treatment in TEM images, while the FIB-SEM data come to the opposite conclusion. In addition, the quantification of cristae volume quantified as cristae volume in percentage is unclear to me.

      We apologize for the confusion. No conclusions about the cristae area and density were made using the TEM data, because TEM data represent a single snapshot section of a mitochondrion without a discernible orientation. Cristae from TEM were described as “aberrant” and preliminarily revealed changes in cristae and were followed up with FIB-SEM, 3D reconstruction of intact mitochondria, and quantification of volume.

      In the new version of the manuscript, we will specify that the cristae volume is normalized to the volume of its respective mitochondria (i.e., how much of the mitochondrial volume is attributed to cristae).

      The change in CPT1/2 protein levels (Fig. 4) is interesting but does not directly proof that fatty acid oxidation is altered, as concluded by the authors. For this, the authors would need to directly measure fatty acid oxidation for example using Seahorse or metabolic tracing experiments. Also, to prove that the MCL-1 inhibition affects neural differentiation through fatty acid oxidation, a rescue experiment should be performed through CPT1 overexpression.

      We agreed that this is an important point. We have optimized the fatty acid oxidation test using Seahorse and will make sure to include it in the revised version of the manuscript.

      In Figure 6, the authors show decreased intermediate progenitor cells after MCL-1 inhibition by immunofluorescence staining. I am not convinced that this can be concluded from the data shown, since the concentration of intermediate progenitor cells is very close to the noise levels. Since the MCL-1 treated cells look much less sparse, I don't think the percentages can be compared (total counts are between 2-20). Although this data might give some indication that differentiation could be impaired, the measured effect could be very well due to lower viability of the cells. The authors need to control for this or come up with a different method for measuring differentiation.

      The number of TBR2 is low, but we disagree with the reviewer’s assessment of noise levels. We focused on cells expressing only TBR2 and rigorously examined this population of cells. The percentages are compared to account for the lower density of the MCL-1i-treated cultures, as the IPC counts are normalized to the Hoechst total cell count within the FOV. Moreover, the immunofluorescence images are complemented with RT-qPCR, which shows significant downregulation of EOMES (gene encoding TBR2).

      Figure 7 is missing quantification

      We will include this quantification in the revised version of the manuscript.

      Reviewer #3 (Significance (Required)):

      General assessment____: The manuscript reports an interesting finding, which suggest a non-canonical role of MCL-1 in mitochondrial remodeling, regulation of fatty acid oxidation and neuronal fate. While this finding would be highly interesting and relevant, the presented data do not sufficiently support this conclusion. Further experiments would have to be performed to proof causality. ____ Advance: Should the authors manage to proof their hypothesis by additional experiments, this would indeed advance the field on mitochondrial remodeling and its effect on neuronal differentiation by

      identifying a novel molecular player. ____ Audience: mitochondrial biology, cell biology, developmental neuroscience Own expertise: mitochondrial biology, cell biology, advanced imaging techniques

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, Gama et al. describe a non-canonical role for the anti-apoptotic protein Myeloid Cell Leukemia-1 (MCL1) in mitochondrial cristae organization and suggest a role of MCL1 in regulating metabolism and neuronal differentiation. Using fluorescence microscopy imaging and electron microscopy, the authors show changes to mitochondrial morphology upon treatment with MCL1 inhibitor S63845. MCL1 inhibition results in altered protein and transcript levels of some key proteins involved in mitochondrial cristae organization and fatty acid metabolism. While some of the findings are interesting and indeed point towards a non-canonical role of MCL1, several key conclusions of the authors are not sufficiently supported by the data shown in the manuscript.

      Major comments:

      1. The authors try to disentangle the apoptotic and non-apoptotic role of MCL1 through addition of a caspase inhibitor. However, I am not convinced that phenotypes found under the addition of caspase inhibitor are necessarily caused by non-canonical functions independent of apoptosis. It could also be that the observed changes happen upstream of caspase activation. In addition, many of the described finding, such as CPT1 expression changes, only happen in the presence of the caspase inhibitor. If one follows the logic of the authors, changes associated by non-canonical MCL1 functions should happen under MCL1 inhibition and caspase inhibition, but not with MCL1 inhibition only.
      2. The authors show no data on the viability of the cells in response to the MCL1 inhibitor. To exclude secondary effects of the inhibitor, at least some of the results should be validated with an MCL1 knock down.
      3. In Figure 1, the authors show immunofluorescence data of mitochondria and nucleus staining and conclude that MCL1 inhibition alters mitochondrial morphology. Based on the images shown in Fig. 1, I do not think that individual mitochondria can be segmentd to measure their volume and length. In addition, some metrics such as mitochondrial content are not explained in the text or methods.
      4. In Fig. 2 B-D, the authors show TEM and FIB-SEM imaging to demonstrate alterations in the cristae architecture upon treatment with MCL1 inhibitor. However, based on the images shown, it looks that cristae area and density is reduced under S63845 treatment in TEM images, while the FIB-SEM data come to the opposite conclusion. In addition, the quantification of cristae volume quantified as cristae volume in percentage is unclear to me.
      5. The change in CPT1/2 protein levels (Fig. 4) is interesting but does not directly proof that fatty acid oxidation is altered, as concluded by the authors. For this, the authors would need to directly measure fatty acid oxidation for example using Seahorse or metabolic tracing experiments. Also, to prove that the MCL1 inhibition affects neural differentiation through fatty acid oxidation, a rescue experiment should be performed through CPT1 overexpression.
      6. In Figure 6, the authors show decreased intermediate progenitor cells after MCL1 inhibition by immunofluorescence staining. I am not convinced that this can be concluded from the data shown, since the concentration of intermediate progenitor cells is very close to the noise levels. Since the MCL1 treated cells look much less sparse, I don't think the percentages can be compared (total counts are between 2-20). Although this data might give some indication that differentiation could be impaired, the measured effect could be very well due to lower viability of the cells. The authors need to control for this or come up with a different method for measuring differentiation.
      7. Figure 7 is missing quantification

      Significance

      General assessment: The manuscript reports an interesting finding, which suggest a non-canonical role of MCL1 in mitochondrial remodeling, regulation of fatty acid oxidation and neuronal fate. While this finding would be highly interesting and relevant, the presented data do not sufficiently support this conclusion. Further experiments would have to be performed to proof causality.

      Advance: Should the authors manage to proof their hypothesis by additional experiments, this would indeed advance the field on mitochondrial remodeling and its effect on neuronal differentiation by identifying a novel molecular player.

      Audience: mitochondrial biology, cell biology, developmental neuroscience

      Own expertise: mitochondrial biology, cell biology, advanced imaging techniques

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Hanna et al. investigates non-apoptotic roles of MCL-1 in human neural stem cells and connects MCL-1 inhibition to mitochondrial cristae formation and beta-oxidation. Connecting these roles to brain development, the authors also show a reduction in the number of progenitor cells upon MCL-1 inhibition, independently of caspase activity. Throughout their work, the authors make use of an impressive array of imaging techniques.While the methods used offer sufficient evidence to connect MCL-1 inhibition to cristae architecture, the mechanistic underpinnings of this effect remain unexplored.

      Major comments:

      • In Fig. 1B, the very same representative images are shown for both conditions (DMSO and S63845) at 48 hours.
      • For Western Blot analysis, it looks like the authors only quantified the band density of their proteins of interest without considering varying levels of control protein (Actin) levels. Normalizing the protein levels to actin would account for any differences in loaded protein amounts (although a Ponceau staining might be preferable still to exclude this). This is especially relevant for Fig. 4E, where actin levels visibly differ between the conditions.
      • The authors offer evidence that MCL-1 inhibition impedes proteolytic cleavage of OPA1-L into the OPA-1-S isoforms, yet do not explore the mechanism behind this. Since OPA1 is cleaved by both OMA1 and YME1L, determination of the levels of these proteases could help shed some light on the mechanism leading to cristae reorganization.
      • Generally speaking, while the authors show all those effects (cristae defects, FAO dysfunction) upon MCL-1 inhibition, it would be interesting to see whether any of those effects can be rescued by blocking FA import e.g. through carnitine palmitoyl- transferase 1a (CPT1a) inhibition with etomoxir to understand if they are downstream of altered Fa supply. This could affect cristae morphology through altered Cardiolipin biogenesis.
      • In line 262 the authors discuss that mitochondria lose metabolic function upon MCL-1 inhibition. This claim would require additional experiments. While the authors look at lipid droplet accumulation and FAO enzymes, there are many more aspects to mitochondrial metabolic function that should be investigated. While measuring the oxygen consumption rate via Seahorse might require additional resources (optional), measurements of ATP production, ROS generation or determination of the mitochondrial membrane potential should be feasible.
      • While the authors "propose a model in which MCL-1 associates with MICOS", they do not offer direct scientific to support this hypothesis. Co-immunoprecipitation experiments or e.g. proximity ligation assays would better support the proposed model.
      • While Fig. 7 shows representative images, quantification e.g. for the truncation of neuronal processes is missing.
      • In lines 219f. the authors state that they "observed a significant downregulation of PAX6 and EOMES at 24 hours that was not rescued by QVD co-treatment". While there is still a trend towards a downregulation, there is no statistical significance anymore. In fact, PAX6 levels almost mirror those of SOX2 which is not described as "downregulated" by the authors. In order to be more consistent, I would suggest rephrasing this part, or at least reword it to be less absolute.
      • Brinkmann et al. (2025) also investigated cristae structure upon MCL-1 deletion in vivo and found no effect when MCL-1 was replaced with other Bcl-2 family members. It would be interesting to combine MCL-1 inhibition with overexpression of MCL-1 versus BCL-XL to reconsolidate some of the discrepant findings.

      Minor comments:

      • In Supp. Fig. 1C the MCL-1 protein is shown both to run above 37kDa (upper panel) and below 37 kDa (lower panel). Could the authors please comment on why this is the case?
      • In line 64 of the introduction the authors mention clinical trials yet do not give a citation for these trials making it hard to judge whether the content of these trials is actually related to the brain.
      • MCL-1 as well as ACSL-1 are sometimes written without the hyphen both in the text and figures.
      • Lines 92-94 and 106-108 essentially highlight the same existing knowledge gap. Maybe the content of these two paragraphs could be combined in order to avoid repetition.
      • In Fig. 1A, the authors provide a schematic for their experimental design. While the figure legend is very thorough, some of this information (like the days of collection) could also be included in the figure itself. The same is true for schematics in the following figures.
      • Fig. 2A includes a typo (analyze) but would maybe also be more suitable for the supplement figures or could even be combined with Fig. 1A as not much new content is added.
      • Regarding statistical analysis, could the authors please comment on why they did not consider one-sample t-tests suitable for the cases where control values were set at 1 (e.g. Fig. 4B, C for the relative expression).
      • In lines 247f. the authors state that "inhibition of MCL-1 leads to [...] and disassembly of the MICOS complex as well as OPA1". This sounds like OPA1 is still cleaved upon MCL-1, which is not at all what the authors showed and further discuss. Rewording of the sentence would help in avoiding any misunderstandings.
      • In lines 210f. the authors state that "quantitative imaging increased the average and maximum volume of lipid droplets". While there is definitely a trend towards an increase for the maximum volume, the increase is in fact not statistically significant. This should be reflected in the wording.
      • In Fig. 6 the overlap between TBR2 and PAX6 is hard to judge when printed out. Including a zoom-in may make it easier to judge.
      • In Fig. 7 the color-coding is listed in the figure legend but is missing from the figure itself. If the authors could include this, as they did for the other figures, it would further improve this figure.
      • Line 238 should reference Fig. 7A, as Fig 7B does not exist.
      • In the figure legends the authors state that biological replicates were used. Were technical replicates also performed?

      Significance

      The authors make use of a wide array of imaging techniques to further elucidate non-apoptotic roles of MCL-1. The study has the potential to offer new insights into mitochondrial biology on the level of basic research rather than translational. While the methods used offer sufficient evidence to connect MCL-1 inhibition to cristae architecture, the mechanistic underpinnings of this effect remain unexplored. Nevertheless, the study offers additional knowledge on the role of MCL-1 in human neural stem cells, whereas previous research mostly focused on cardiomyocytes or cancer cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: This study claims that beyond its canonical anti-apoptotic function, MCL-1 has essential non-apoptotic roles in human neurodevelopment. Pharmacologic inhibition of MCL-1 in human neural stem cells disrupts mitochondrial inner membrane architecture by destabilizing cristae and the OPA1-MICOS complex, leading to swollen mitochondria with disorganized cristae. These structural defects impair fatty acid oxidation and lipid droplet homeostasis, linking cristae integrity to metabolic competence. Independently of apoptosis or proliferation, MCL-1 inhibition selectively depletes intermediate neural progenitors, indicating a direct role in lineage progression. Overall, the work positions MCL-1 as a key regulator of mitochondrial structure-metabolism coupling that instructs neural progenitor identity and human neurogenesis.

      Overall: The study does a good job of using (in most assays) caspase inhibition (e.g., QVD treatment) to block apoptotic responses induced by MCL-1 inhibition. As a result, many of the phenotypes caused by inhibition are likely to be independent of caspase activation. As a result, this manuscript would be of interest to researchers that study the topics of the BCL-2 family and cell death signaling, mitochondrial bioenergetics and dynamics, neurodevelopment, and cellular metabolism. However, as currently presented the manuscript is only descriptive and lacks mechanistic insight.

      Major Concerns:

      1) The authors only use a single MCL-1 inhibitor and never use other non-targeting BH3-mimetics (such as venetoclax) as negative controls. This seems like a missed opportunity to demonstrate that the phenotypes observed are MCL-1 dependent.

      2) There is no mechanism proposed in this study other than reliance upon QVD as not affecting the phenotypes. As submitted, the manuscript only can speculate that these phenotypes are due to non-apoptotic roles of MCL-1 inhibition. The authors have missed an opportunity to explore MCL-1's non-apoptotic functions directly.

      Other concerns exist that weaken the impact of the study.

      1. Figure 1 should include the fact that QVD inhibition (shown in Sup Fig 2) does not obviate the phenotype induced by pharmacological inhibition of MCL-1 on mitochondrial morphology.
      2. Figure 2 would benefit from evidence that caspase inhibition does not repress the phenotype on mitochondrial cristae morphology (volume and area). Furthermore, the FIB-SEM data are very hard to appreciate as the size precludes visualization of individual mitochondria.
      3. Figure 3 reports that MIC60 and OPA1 appear to be downregulated in response to MCL-1 inhibition, but these appear to be more significant only when QVD is added. Why would the phenotype be obscured in the non-QVD setting (Fig. 2B&C). How does MCL-1 inhibition lead to changes in MIC60/MICOS/OPA1? This seems quite preliminary at this point.
      4. The loss of MIC60 and OPA1 should repress electron transport chain function, are such impacts observed in the cultured cells? This could be shown by assessing oxygen consumption, etc. Such data would enhance the authors' conclusion that MCL-1 inhibition leads to defects in mitochondrial physiology.
      5. In Figure 4, the differences between transcripts (qPCR data) and protein (immunoblot) data are often confusing and not well explained. Why do the authors propose that mRNA expression is decreasing whereas the protein expression is increasing? Example CPT1. Furthermore, it is unclear what these data mean functionally? Is this reflective of enhanced lipid oxidation or simply a response to inhibition of fatty acid oxidation? Clarification of the impact of these findings is necessary.
      6. The increase in lipid droplet number induced by MCL-1 inhibition has been previously documented, but it is unclear whether this increase is related to an inability to oxidize lipid (defective fatty acid oxidation) that leads to increases in the cellular abundance or whether this indicates that MCL-1 inhibition leads to enhanced storage. Do other inhibitors of fatty acid oxidation lead to similar increases in lipid droplet size and abundance? Does QVD inhibition affect this phenotype?
      7. For Figure 6, while these data may be very meaningful, as presented they are very hard to appreciate. Insets that show the neuronal populations would help to convey the point that the differentiation is impacted. Also, are there other methods that could confirm these observations (qPCR to show changes in differentiation).
      8. Figure 7 is also very hard to appreciate. What is the reader to see? Can these be quantified? It seems that QVD may be rescuing in this figure, does this suggest that MCL-1 inhibition might be inducing death. All of this needs to be quantified.
      9. BCL-XL has also been implicated in affecting mitochondrial electron transport chain function (See PMID: 19255249, 21926988, 21987637). Can BCL-XL inhibitors affect any of the phenotypes associated here?
      10. Please be carefully avoid using the term "MCL-1 loss", when talking about pharmacological inhibition. Only genetic ablation (e.g. knockout, silencing, etc.) should be termed loss.

      Significance

      The study advances in human cells the impacts of MCL-1 inhibition. They replicate many impacts previously observed in mouse systems and refine analyses to impacts on MICOS complex, lipid droplet storage, and neuronal differentiation. While these findings are important and would be well received by a wide audience, the study fails to provide almost any mechanistic insight into how these phenotypes are being induced. The only common theme is that blocking caspase activation in many assays fails to block the phenotype.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      Point

      Summary

      Response

      1.1

      Overall, the study lacks well-controlled experiments comparing hypoxia induced by DMOG with hypoxia induced by 1% O₂ for assessing ERα occupancy throughout.

      To assess whether DMOG-induced changes in ERα occupancy reflect bona fide hypoxia, we measured ERα binding by ChIP-qPCR under 1% oxygen over 48 hours, compared to normoxic (21% oxygen) cells and input controls in matched cells at the GREB1 and TFF1 loci. Our findings demonstrate that 1% oxygen treatment recapitulates the ERα binding changes observed with DMOG, at the time points of our RNA-seq experiments.

      We have included these results in __Figure 1F __of the preliminary revision of the manuscript.

      1.2

      Lack of evidence for other co-transcription factors impact under hypoxia HIF's in Fig1.

      We thank the reviewer for this comment. We have clarified that motif enrichment analysis is included to characterise the sequence context of ERα binding sites and to confirm enrichment of known ER-associated motifs (e.g. EREs), rather than to infer functional involvement of additional transcription factors under hypoxia. Corresponding interpretative statements have been removed from the Results and restricted to the Discussion.

      1.3

      Lack of evidence for DMOG induce HIF protein expression in MCF7 cells.

      To confirm DMOG induces HIF-protein expression we have analysed HIF1α and HIF2α protein levels by western blot. We have included these in __Supplementary Figure S1A __within the preliminary revision to address this concern.

      1.4

      Figure 1: ATAC-seq was performed under 1% O₂, whereas ChIP-seq was conducted with DMOG treatment, making these conditions not directly comparable.

      We acknowledge that the ERα ChIP-seq (DMOG) and ATAC-seq datasets were generated under different conditions and are therefore not directly comparable. To address this, we have performed ChIP-qPCR under bona fide hypoxia (1% oxygen) at canonical ERα target loci (TFF1 and GREB1), demonstrating that the directionality of ERα binding changes observed with DMOG is recapitulated under physiological hypoxia. These data provide a direct comparison of ERα occupancy across conditions and support the use of DMOG as a proxy for hypoxia in our ChIP-seq experiments.

      If requested, we are willing to perform ATAC-seq at 16 h under 1% oxygen. However, because the original dataset was generated under 0.1% oxygen, and canonical ERα-bound sites show minimal accessibility changes under severe hypoxia, we anticipate limited additional insight from repeating this experiment.

      1.5a

      Figure S1: ERα ChIP lacks estradiol (E2) treatment in MCF7 cells with or without DMOG.

      The statement that the ERα ChIP samples lack estrogen treatment is incorrect. Estradiol was not an experimental variable and cells were intentionally maintained under estrogen-rich conditions to preserve tumour-relevant ERα activity.

      We have now clarified within the preliminary revision by stating that cells were routinely cultured in “estrogen-rich Dulbecco’s Modified Eagle Medium” in the methods section, and clarified the use of estrogen-rich conditions in the Figure S1 legend.

      1.5b

      The single-gene examples of DMOG effects shown in Fig. S1A are not significant.

      The peak illustrated in Figure S1A (now Figure S1D) __is intended to provide a visual confirmation of peak calling and enrichment patterns underlying the genome-wide redistribution observed in __Figure 1. The peak was called by the MACS2 pipeline (code available from https://doi.org/10.5281/zenodo.17221105) with a log10(q-value) = 268.5, which passes the MACS2 cut-off q

      1.6a

      Fig. S2 lacks 1% O₂ conditions,

      We wish to clarify that Figure S2 (now Figure S4) serves as quality control specifically for the DMOG-treated ChIP-seq dataset presented in Figure 1C. The purpose of the plot is to visualize unfiltered motif enrichment to confirm that the identified peaks represent bona fide ERα binding events within the DMOG condition. Motif enrichment under a 1% oxygen environment would not provide this validation. In all cases the ERE is the most significantly enriched motif.

      With respect to ERα binding under 1% oxygen, we have now assessed this via targeted ChIP-qPCR validation (Figure 1F).

      1.6b

      Fig. S3 lacks DMOG-induced HIF factor assessments.

      The DMOG-induced changes in HIF1α and HIF2α expression are shown in the__ Figure S1__ of this revision proposal and have been incorporated into the manuscript as part of the changes described in response 1.3.

      1.7a

      Figure S4: Estradiol (E2) treatment is missing from the controls, and the figure labeling is of poor quality.

      We have substantially improved the labelling of Figure S4, now__ Figure S6.__

      Additionally, we have clarified that all samples were cultured in estrogen-rich media and treated with either vehicle control or 100 nM fulvestrant; thus estrogen is present in all conditions including the controls.

      1.7b

      Hypoxic conditions for assessing ER status and appropriate controls are also lacking.

      We agree that monitoring ERα stability under hypoxic conditions is essential.

      We provided a western blot assessment of ERα protein levels at 0, 8 and 48 hours of treatment with 1% oxygen or DMOG, compared to normoxic controls, included as Supplementary Figures S1B, C in the preliminary revision.

      These demonstrate the cells remain positive for ERα protein expression at 0, 8 and 48h.

      1.8

      Figure S5: The description of fulvestrant treatments under hypoxic conditions is unclear.

      We thank the reviewer for this comment. To clarify the experimental design, we now signpost the reader in the figure legend of Figure S5 (now S7) to the schematic diagram provided in Figure 3B, and provide a summary stating the experiment employed a factorial design combining a 96-hour fulvestrant treatment with exposure to 1% oxygen for the final 48 hours.**

      1.9

      Supplemental legends: These require major revision; they are of poor quality and lack statistical details and references to biological replicates.

      We have extensively revised all supplementary figure legends to ensure clarity and precision.

      1.10

      Overall comparisons throughout the manuscript are weak; the figures appear sloppy and lack sufficient effort in presentation.

      Following this comment, we carefully reviewed the presentation of all figures throughout the manuscript. We improved the organisation and labelling of the Supplementary Figures to facilitate clearer comparison of the data. In particular, full western blots are now clearly annotated and supplementary legends have been expanded to provide sufficient context for each figure to be interpreted independently.

      1.11

      i) In general, the manuscript in its present form does not greatly contribute from published work as the ERα cistrone is well documented work studied for its role in regulating gene expression, particularly in ERα-positive breast cancer.

      ii) Additionally, a lack of a thorough comparison between DMOG and or 1 %oxygen induce hypoxia in the MCF7 ER+ model, diminished initial interest in the manuscript.

      iii) The lack of considering estradiol exposure under hypoxic conditions with either 1%oxygen and or DMOG also limits relevance to patients with ER+ BrCa.

      iv) The ERα epigenomic profile has been extensively studied including work under hypoxic conditions.

      i) We respectfully disagree that the manuscript does not extend prior work. Despite extensive characterisation of ERα, its role in shaping hypoxia-driven transcription in ER+ breast cancer has not been defined. Here, we identify an ERα-dependent hypoxic response (EDHR), demonstrating a reciprocal interaction between hypoxia and ERα activity.

      ii) In revision, we address concerns regarding DMOG by validating ERα binding under 1% oxygen using ChIP-qPCR thereby confirming our result in bona fide hypoxia. Additionally, all RNA-seq and functional assays, including ENaC targeting, were performed under 1% oxygen in the original manuscript.

      iii) All experiments were conducted under estrogen-complete conditions, now explicitly clarified, reflecting tumour-relevant ERα activity.

      iv) Together, these data establish a reciprocal interaction between ERα and hypoxia and uncover a targetable vulnerability in hypoxic ER+ breast cancer, linking transcriptional regulation to therapeutic opportunity.

      Reviewer 2

      No.

      Summary

      Response

      General Comments

      2.1

      ENAC is proposed as a therapeutic vulnerability based on amiloride sensitivity assays. Additional experiments are required, such as western blot validation of ENaC regulation under hypoxia and loss-of-function approaches to assess its contribution to the phenotype.

      We agree that further validation of ENaC involvement would strengthen this observation. We will assess ENaC protein levels under 1% hypoxia ± fulvestrant by western blot and perform siRNA-mediated depletion of ENaC subunits to test their contribution to the hypoxia-specific amiloride-sensitive phenotype by viability assay (see also response 3.3).

      2.2

      Fulvestrant is used to dissect ERa dependency. However, as a SERD, it may alter chromatin and transcription independently of a simple loss of ERα. Addition control would strengthen interpretation.

      The experimental design already controls for potential fulvestrant-specific transcriptional effects, as all four conditions (± hypoxia, ± fulvestrant) were included. EDHR genes were defined based on induction under hypoxia, loss of this induction following ERα degradation, and absence of residual hypoxic induction in the presence of fulvestrant. Consistent with this, SCNN1B and SCNN1G do not show significant fulvestrant-responsive changes under normoxia (Figure 5C,D).

      We also note that fulvestrant has been shown to induce minimal global chromatin remodelling (Guan et al., 2019), supporting its use to assess ERα dependency without broadly confounding chromatin accessibility; this reference is now included in the manuscript.

      2.3

      The molecular mechanism by which ERα modulates the hypoxic transcriptome, specifically how ERα and HIF pathways converge at ENAC loci should be more studied.

      We further examined the potential convergence of ERα and hypoxic signalling at the ENaC loci (included as __Figure 5E __in the revision proposal) showing genome browser views of the SCNN1G and SCNN1B loci, highlighting hypoxia-induced HIF1α binding and ERα association at these sites.

      To further support this, we will perform RT-qPCR validation of SCNN1G and SCNN1B expression following treatment ± IOX5 and ± fulvestrant. IOX5 is a selective PHD inhibitor that stabilises HIF proteins, enabling us to assess the contribution of HIF signalling independently of other oxygen-dependent effects associated with hypoxia.

      2.4

      In addition, to assess the relevance of this work for luminal breast cancer and ERα expression, specific validation in TNBC should be performed

      To assess the clinical relevance of SCNN1B and SCNN1G in ER-positive and ER-negative subgroups, we performed Cox proportional hazards analyses in TCGA and METABRIC cohorts individually, including ER status and stratifying by ER-positive and ER-negative cases (Figure 6C). These analyses support the association of SCNN1G with poorer relapse-free survival specifically in ER-positive patients.

      2.5

      The authors should provide RT-qPCR validation of the key EDHR genes, especially since this signature is later used for downstream analyses.

      We agree that independent validation would strengthen these findings. We will perform RT-qPCR validation of key EDHR genes (including SCNN1B and SCNN1G) under ± hypoxia and ± fulvestrant conditions to confirm ERα-dependent hypoxic induction.

      Limitations

      2.6

      Reprogramming of the ERα cistrome under cellular stress is well documented. The study extends these ideas but does not clearly demonstrate a new mechanistic paradigm, particularly because the EDHR is defined primarily through omics approaches without strong mechanistic validation. In addition, we have to keep in mind that the study uses DMOG to model hypoxia-driven chromatin changes, but DMOG inhibits many 2-oxoglutarate-dependent dioxygenases non-selectively.

      This makes it difficult to attribute ERα cistrome reprogramming specifically to hypoxia, rather than to broad off-target effects. The transcriptomic dataset is more convincing by need the validation suggested previously.

      While ERα cistrome reprogramming has been described, our study demonstrates a reciprocal interaction in which ERα not only responds to hypoxia but actively shapes hypoxia-driven transcription, defining an ERα-dependent hypoxic response (EDHR).

      We acknowledge the limitations of DMOG and have addressed this by validating key ERα binding events under bona fide hypoxia (1% oxygen) using ChIP–qPCR, confirming our findings under physiological conditions (response 1.1).

      To further strengthen mechanistic insight, we will assess the requirement for HIF stabilisation using the selective PHD inhibitor IOX5, combined with RT-qPCR analysis of SCNN1G and SCNN1B ± IOX5 ± fulvestrant (response 2.3 and 2.5). In addition, we will validate the functional relevance of ENaC through protein-level analysis and siRNA-mediated depletion, as described in__ response 2.1.__

      Together, these additions address concerns regarding DMOG specificity and provide further support for a functional interaction between ERα and hypoxic signalling.

      Audience

      2.7

      Given its reliance on omics datasets and preliminary functional assays, the paper will likely appeal to a specialized audience in transcriptional regulation, hypoxia signalling, and ER+ breast cancer biology. However, the limited mechanistic depth and uncertain translational relevance due to the lack of in vivo validation, may reduce its impact for broader oncology or therapeutic-development audiences. Without stronger validation, the findings may be perceived as niche and mainly of interest to researchers focused on ERα chromatin dynamics rather than to the wider cancer research community.

      The study incorporates multiple layers of human relevance, including spatial transcriptomic analyses demonstrating enrichment of EDHR within hypoxic tumour regions and survival analyses linking EDHR and ENaC expression to clinical outcome.

      In revision, we address the reviewer’s concerns through targeted validation (ChIP-qPCR in hypoxia, western blotting, and RT–qPCR). Together, these additions strengthen the mechanistic and translational relevance of the study.

      Reviewer 3

      No.

      Summary

      Response

      Major comments

      3.1

      The DMOG ChIP-seq provides a valuable first look at ERα redistribution. Since DMOG inhibits both HIF hydroxylases and oxygen-dependent demethylases, the driver of the observed changes remains ambiguous. It would help to include either ERα ChIP-seq under bona fide hypoxia or a selective PHD inhibitor condition (for example IOX5, as you discuss) to separate HIF stabilisation from broad demethylase inhibition. If ChIP-seq is not feasible, a brief ATAC validation at a small panel of gained and lost loci would still increase confidence.

      We acknowledge that mimetics of hypoxia can introduce off-target effects. To address this, we have validated our ERα ChIP-seq findings using ChIP-qPCR at representative loci (TFF1 and GREB1), demonstrating consistent changes in ERα binding under bona fide hypoxia (1% oxygen) (now included in Figure 1F).

      As acknowledged by the reviewer, ChIP-seq under these conditions is likely not feasible due to cell number constraints. We are willing to undertake ATAC-seq if required (as stated in response 1.1); however, we do not feel it would directly address ERα occupancy at these loci. We therefore consider our targeted ChIP-qPCR to be the most appropriate approach to validate ERα redistribution under hypoxia.

      3.2a

      The factorial RNA-seq is well designed and the attenuation analyses are clear. The EDHR selection is stringent and reproducible across two ER+ lines.

      To support the claim of ERα dependence mechanistically, a small number of targeted perturbations would go far. For example,

      i) confirm EDHR induction for SCNN1B and SCNN1G in hypoxia with and without fulvestrant by RT-qPCR

      We agree that targeted validation would strengthen the mechanistic support for ERα dependence. We will perform RT-qPCR validation of SCNN1B and SCNN1G under hypoxia ± fulvestrant to confirm ERα-dependent hypoxic induction (see also response 2.5).

      3.2b

      ii) test whether short-term ERα knockdown reproduces the effect.

      ERα dependency is already assessed through fulvestrant-mediated degradation within the factorial design, which provides a well-established and direct approach to evaluate ERα function. As EDHR genes are defined by loss of hypoxic induction following ERα degradation, this constitutes a robust assessment of ERα-dependent effects.

      We will therefore focus on orthogonal validation through RT-qPCR (response__ 2.5__), together with additional mechanistic and functional analyses using IOX5 and ENaC perturbation (responses 2.1 and 2.3), rather than introducing an ERα knockdown approach, although we would consider this if required.

      3.2c

      iii) A complementary test with a HIF-1α or HIF-2α knockdown at one time point would help position EDHR relative to HIF.

      This request aligns with point 2.3, which addresses the convergence of ERα and HIF signalling. While HIF knockdown under hypoxia would assess necessity, we will instead assess the contribution of HIF signalling using the selective PHD inhibitor IOX5, as this allows us to isolate HIF stabilisation from broader hypoxia-associated effects and avoids additional perturbation associated with transfection-based approaches. We will perform RT-qPCR analysis of SCNN1B and SCNN1G following treatment ± IOX5 ± fulvestrant to determine whether HIF stabilisation is sufficient to support ERα-dependent induction of EDHR genes.

      3.3

      The amiloride result is intriguing and consistent with a hypoxia-specific dependency. Because amiloride is pleiotropic, it would strengthen the conclusion to add one genetic and one pharmacological specificity control. A brief SCNN1B or SCNN1G knockdown in hypoxia should phenocopy the viability effect if ENaC contributes. In parallel, testing benzamil at sub-micromolar doses would provide a more ENaC-selective pharmacological readout. These can be performed in MCF7 and, resources permitting, in T47D.

      To address the reviewer’s concern regarding pleiotropic effects, we propose (aligning with our__ response to 2.1__) to apply siRNA-mediated knockdown of SCNN1B and SCNN1G under hypoxia to determine whether this reproduces our observed viability effect, thereby providing direct evidence for ENaC involvement.

      We agree that additional pharmacological validation could further support specificity, and would consider inclusion of a more ENaC-selective inhibitor if required.

      3.4

      The RFS associations for

      SCNN1B and SCNN1G are compelling. It would be helpful to report whether the associations persist in a multivariable model that at least includes ER status, grade and nodal status where available, or to state clearly when this is not possible across merged datasets. Even a sensitivity analysis in TCGA with ER+ cases only would contextualise the hazard ratios.

      We have analysed TCGA and METABRIC cohorts individually using Cox proportional hazards models, as this functionality is not available for merged datasets in KMplot. ER status was included in the models, and analyses were additionally stratified by ER-positive and ER-negative subgroups. The number of relapse events per subgroup is approximately 40; therefore, additional covariates such as grade and nodal status were not included given the limited number of events per model.

      Within ER-positive patients, high SCNN1G expression is associated with poorer relapse-free survival (TCGA HR 1.45, p = 0.0027), while SCNN1B shows a similar trend that does not reach statistical significance. These analyses are presented in Figure 6C and in the results section of the preliminary revision, and support the findings from the Kaplan–Meier analysis.

      3.5

      The spatial association of EDHR with EMT hotspots is a nice piece of the story. A short clarification of how spot-level cell type composition was handled will help readers interpret proximity results. If cell type deconvolution scores are available in the source dataset, adding a sentence on whether EDHR enrichment tracks tumour epithelial content would be useful.

      Spatial cell type composition and spot annotations were used as provided in the SpottedPy dataset, based on Cell2location-derived deconvolution scores and STARCH tumour annotations, without additional re-estimation.

      To address the reviewer’s suggestion, we examined the relationship between EDHR enrichment and epithelial content and observed no significant correlation at the neighbourhood level.

      These points have now been clarified in the manuscript.

      3.6

      Data processing for ChIP-seq and RNA-seq is documented and accessions are provided. The RNA-seq includes n=3 per condition, which is appropriate, and the correlation and LFC analyses are clearly presented. For the amiloride assay, the two-way ANOVA with interaction is appropriate; please add the exact n and whether experiments were independently repeated, and include the underlying values in a source table for transparency. These are small presentational edits rather than new experiments.

      In the preliminary revision we have added a statement to the amiloride assay figure (Figure 6D) clarifying that n = 3 independent biological replicates were performed per condition. In addition, we now provide the underlying numerical values for this assay in Table S11.

      3.7

      A small, hypothesis-driven mechanistic link from EDHR to ENaC function would substantially elevate impact without becoming a long project. For example, testing whether hypoxia increases amiloride-sensitive Na⁺ current in MCF7 and whether fulvestrant abrogates that increase would directly connect the transcriptional and functional observations. If available, patch-clamp or a simple SBFI-based Na⁺ imaging readout could suffice.

      We agree that directly linking EDHR to ENaC channel activity would further strengthen the mechanistic connection. We will prioritise genetic validation of ENaC function through siRNA-mediated depletion (response 2.1), which directly tests the requirement for ENaC in the hypoxia-specific viability phenotype.

      We are willing to explore the feasibility of measuring the amiloride-sensitive Na+ currents under normoxia and acute hypoxia (via perfusion of cells with bathing solution bubbled with nitrogen during recording) ± fulvestrant to further connect hypoxic regulation to channel activity.

      Minor Comments

      3.8

      Please show representative ERα ChIP-seq browser snapshots for at least one gained, one conserved and one lost locus alongside input for both conditions.

      We have now included representative ERα ChIP-seq browser snapshots for gained, conserved, and lost loci, together with input controls for both conditions, in Figure S3 of the revised manuscript.

      3.9

      In Figure 1D, the ATAC-seq comparison uses 0.1% O₂ for 48 h while the RNA-seq uses 1% O₂. Briefly justify the choice and discuss any expected differences.

      We thank the reviewer for this point. The ATAC-seq dataset was generated under 0.1% oxygen in the original study, whereas RNA-seq experiments in this work were performed at 1% oxygen to reflect tumour-relevant hypoxic conditions. The more severe hypoxia used for ATAC-seq would be expected to maximise detection of chromatin accessibility changes. Despite this, chromatin accessibility changes were limited, with ERα binding occurring predominantly at pre-accessible regions. This has now been clarified in the manuscript.

      3.10

      In the Methods for spatial analyses, specify the thresholds for hotspot calling and how the neighbourhood radius was chosen.

      The neighbourhood parameter was set to 8, corresponding to the immediate neighbouring spots in Visium data, consistent with package guidance. We have clarified this in the manuscript text.

      3.11

      For the EDHR heatmap, consider marking the 14 consensus genes and indicating which belong to the ENaC module to aid readability.

      We have marked the 14 EDHR consensus genes and indicated the ENaC module in the revised heatmap to aid readability.

      3.12

      Please report exact sample sizes and replicate numbers in all figure legends and provide a single table with all statistical tests, n, and p values.

      We have reported exact sample sizes and replicate numbers in all relevant figure legends and included Table S11 summarising all statistical tests, sample sizes (n), and p values.

      3.13

      A schematic summarising the experimental timelines for ChIP-seq, RNA-seq and viability would help orient readers.

      We have added timelines for these experiments as requested.

      3.14

      Minor copyedits: consistent formatting of O₂, gene symbols and reagent catalogue numbers.

      We have standardised oxygen notation throughout the manuscript to use “oxygen” in the main text and “O2” where appropriate (e.g. figures).

      Reagent catalogue numbers have now been standardised for consistency of presentation in the revised manuscript.

      Gene and protein nomenclature were already formatted according to accepted conventions and were verified for consistency.

      3.15

      The manuscript is well referenced. Where you contrast your findings with long-term CoCl₂ hypoxia, a sentence on why acute DMOG and short-term 1% O₂ may reveal different ERα behaviours would help position the novelty.

      We thank the reviewer for this suggestion. We have expanded the manuscript to clarify that acute hypoxia (1% oxygen) and DMOG treatment capture early, dynamic hypoxic responses, in contrast to chronic CoCl2 exposure, which reflects longer-term adaptation. This distinction is relevant to tumour biology, where hypoxia is often transient due to unstable vascularisation. The following statement has been added to the manuscript:

      “In addition to such chronic hypoxic adaptation, tumour hypoxia can also be dynamic, with cells experiencing acute or transient hypoxic exposure due to unstable vascularisation; an established contributor to tumour progression (Liu et al, 2022a; Koh & Powis, 2012). Thus, in contexts where both signalling pathways remain active, the dependence of the hypoxic response on ERα in ER+ cells has not been previously characterised.”

      Primary Limitations

      3.16

      DMOG vs hypoxia in the cistrome experiment,

      To address concerns regarding the use of DMOG, we have validated key ERα binding events from the ChIP-seq dataset by ChIP–qPCR at the TFF1 and GREB1 loci under bona fide hypoxia (1% oxygen) in biological triplicate__ (Figure 1F)__. These data demonstrate consistent changes in ERα binding under hypoxia, supporting that the DMOG-induced redistribution reflects hypoxia-driven changes.

      3.17

      the absence of direct HIF or cofactor perturbations

      We acknowledge the absence of direct HIF perturbation. To address this, we will assess the contribution of HIF signalling through stabilisation approaches, including RT-qPCR analysis of SCNN1B and SCNN1G ± IOX5 ± fulvestrant (response 3.2), to determine whether HIF activation is sufficient to support ERα-dependent induction.

      3.18

      and the pleiotropy of amiloride.

      To address the potential pleiotropy of amiloride, we will perform siRNA-mediated knockdown of SCNN1G and SCNN1B to provide independent validation of ENaC-dependent effects (response 3.3).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This study explores how hypoxia reshapes ERα signalling in ER-positive breast cancer and whether this cross-talk exposes targetable vulnerabilities. The authors first map ERα binding in MCF7 cells after dioxygenase inhibition with DMOG and observe a genome-wide redistribution with enrichment of ERE, FOXA1 and AP-1 motifs at gained sites while chromatin accessibility at these loci appears unchanged in public ATAC-seq after hypoxia. They then perform RNA-seq in MCF7 and T47D using a factorial design that combines fulvestrant-mediated ERα degradation with 1% O₂ to define an ERα-dependent hypoxia response (EDHR). A 14-gene consensus EDHR signature includes ENaC regulatory subunits SCNN1B and SCNN1G, whose higher expression is associated with poorer RFS in ER+ cohorts. Functionally, amiloride increases viability in normoxia but reduces viability under hypoxia in MCF7 across a dose range. Spatial transcriptomics from ER+ tumours shows EDHR expression enriched at the margins of hypoxia and estrogen-hallmark regions and adjacent to EMT hotspots. Raw data and code availability are stated for the central datasets and accessions are provided. Together the results argue that ERα helps organise a distinct hypoxic programme and suggest a context-specific sensitivity to ENaC inhibition.

      Major comments

      The paper addresses a timely question with a clear narrative arc and brings together ChIP-seq, RNA-seq, pharmacology, survival analysis and spatial transcriptomics. The EDHR concept is interesting and the ENaC angle is original. The work is already strong and with a few targeted additions and clarifications it can be made more persuasive without becoming a new project.

      1) The DMOG ChIP-seq provides a valuable first look at ERα redistribution. Since DMOG inhibits both HIF hydroxylases and oxygen-dependent demethylases, the driver of the observed changes remains ambiguous. It would help to include either ERα ChIP-seq under bona fide hypoxia or a selective PHD inhibitor condition (for example IOX5, as you discuss) to separate HIF stabilisation from broad demethylase inhibition. If ChIP-seq is not feasible, a brief ATAC validation at a small panel of gained and lost loci would still increase confidence. Estimated time: 6-8 weeks for a focused follow up with two conditions and biological duplicates/triplicates.

      2) The factorial RNA-seq is well designed and the attenuation analyses are clear. The EDHR selection is stringent and reproducible across two ER+ lines. To support the claim of ERα dependence mechanistically, a small number of targeted perturbations would go far. For example, confirm EDHR induction for SCNN1B and SCNN1G in hypoxia with and without fulvestrant by RT-qPCR and test whether short-term ERα knockdown reproduces the effect. A complementary test with a HIF-1α or HIF-2α knockdown at one time point would help position EDHR relative to HIF. Estimated time: 3-4 weeks for qPCR and siRNA validations.

      3) The amiloride result is intriguing and consistent with a hypoxia-specific dependency. Because amiloride is pleiotropic, it would strengthen the conclusion to add one genetic and one pharmacological specificity control. A brief SCNN1B or SCNN1G knockdown in hypoxia should phenocopy the viability effect if ENaC contributes. In parallel, testing benzamil at sub-micromolar doses would provide a more ENaC-selective pharmacological readout. These can be performed in MCF7 and, resources permitting, in T47D. Estimated time: 4-6 weeks.

      4) The RFS associations for SCNN1B and SCNN1G are compelling. It would be helpful to report whether the associations persist in a multivariable model that at least includes ER status, grade and nodal status where available, or to state clearly when this is not possible across merged datasets. Even a sensitivity analysis in TCGA with ER+ cases only would contextualise the hazard ratios. Estimated time: 1-2 weeks.

      5) The spatial association of EDHR with EMT hotspots is a nice piece of the story. A short clarification of how spot-level cell type composition was handled will help readers interpret proximity results. If cell type deconvolution scores are available in the source dataset, adding a sentence on whether EDHR enrichment tracks tumour epithelial content would be useful. Estimated time: 1 week.

      Reproducibility and statistics

      Data processing for ChIP-seq and RNA-seq is documented and accessions are provided. The RNA-seq includes n=3 per condition, which is appropriate, and the correlation and LFC analyses are clearly presented. For the amiloride assay, the two-way ANOVA with interaction is appropriate; please add the exact n and whether experiments were independently repeated, and include the underlying values in a source table for transparency. These are small presentational edits rather than new experiments.

      Optional

      A small, hypothesis-driven mechanistic link from EDHR to ENaC function would substantially elevate impact without becoming a long project. For example, testing whether hypoxia increases amiloride-sensitive Na⁺ current in MCF7 and whether fulvestrant abrogates that increase would directly connect the transcriptional and functional observations. If available, patch-clamp or a simple SBFI-based Na⁺ imaging readout could suffice. Estimated time: 6-8 weeks.

      Minor comments

      1. Please show representative ERα ChIP-seq browser snapshots for at least one gained, one conserved and one lost locus alongside input for both conditions.
      2. In Figure 1D, the ATAC-seq comparison uses 0.1% O₂ for 48 h while the RNA-seq uses 1% O₂. Briefly justify the choice and discuss any expected differences.
      3. In the Methods for spatial analyses, specify the thresholds for hotspot calling and how the neighbourhood radius was chosen.
      4. For the EDHR heatmap, consider marking the 14 consensus genes and indicating which belong to the ENaC module to aid readability.
      5. Please report exact sample sizes and replicate numbers in all figure legends and provide a single table with all statistical tests, n, and p values.
      6. A schematic summarising the experimental timelines for ChIP-seq, RNA-seq and viability would help orient readers.
      7. Minor copyedits: consistent formatting of O₂, gene symbols and reagent catalogue numbers.

      Prior studies

      The manuscript is well referenced. Where you contrast your findings with long-term CoCl₂ hypoxia, a sentence on why acute DMOG and short-term 1% O₂ may reveal different ERα behaviours would help position the novelty.

      Significance

      General assessment

      The strongest aspects are the carefully designed factorial RNA-seq that cleanly separates ERα and hypoxia effects, the discovery of a concise EDHR signature reproducible across two ER+ lines, and the integration with spatial transcriptomics that places EDHR near EMT-rich tumour regions. The ENaC connection is new and potentially actionable, and the context-dependent amiloride response is a practical lead. Limitations are primarily mechanistic: DMOG vs hypoxia in the cistrome experiment, the absence of direct HIF or cofactor perturbations, and the pleiotropy of amiloride.

      Advance

      To my knowledge, this is the first description of a distinct ERα-dependent hypoxic programme in ER+ breast cancer that includes ENaC regulatory subunits and links to an EMT-adjacent spatial niche. The conceptual advance is the positioning of ERα as a coordinator of a subset of hypoxia-induced genes rather than as a parallel pathway, together with an initial functional readout that suggests a therapeutic angle through ENaC modulation. With the targeted additions outlined above, the study would move from strong association to a more mechanistic and translationally relevant model.

      Audience

      The work will interest a specialised audience in nuclear receptor biology, hypoxia signalling, tumour microenvironment, and ion transport in cancer. It has potential relevance for basic researchers studying ERα cistrome dynamics, for groups using spatial transcriptomics to define micro-niches, and for translational researchers exploring metabolic and ionic vulnerabilities in ER+ disease.

      Expertise disclosure

      Keywords: nuclear receptors,, chromatin profiling, transcriptomics, spatial transcriptomics, breast cancer biology.

      I am not a domain expert in ion channel electrophysiology; my comments on ENaC pharmacology focus on specificity and study design rather than detailed channel biophysics.

      Tone

      I find the paper well conceived and already compelling. The suggested experiments are focused, realistic in scope, and primarily aim to turn several strong associations into concise mechanistic statements that would further increase confidence and impact.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      ERα drives most luminal breast cancers. However, how hypoxia reshapes ERα activity and how ERα itself might influence the hypoxic response remain unclear. Understanding this interaction is crucial, as hypoxia is strongly linked to endocrine resistance and poor outcomes. In this study, authors investigated how hypoxia modifies ERα signalling in ER+ breast cancer and whether ERα contributes to the transcriptional response to low oxygen. Using MCF7 and T47D cells, they combined genome-wide profiling of the ERα cistrome under DMOG, hypoxic transcriptomics with or without ERα degradation, and spatial transcriptomics in tumours. This revealed an ERα-dependent hypoxic response (EDHR), prominently involving regulation of epithelial sodium channel (ENaC) subunits, whose expression requires both hypoxia and active ERα signalling. Functionally, ENaC inhibition with amiloride reduced cell viability under hypoxia. Together, these findings uncover a previously unrecognised ERα-dependent layer of the hypoxic transcriptome and identify ENaC as a potential therapeutic vulnerability in hypoxic ER+ breast cancer. Although the study is interesting, the manuscript lacks several essential functional and experimental validations. ENAC is proposed as a therapeutic vulnerability based on amiloride sensitivity assays. Additional experiments are required, such as western blot validation of ENaC regulation under hypoxia and loss-of-function approaches to assess its contribution to the phenotype. Fulvestrant is used to dissect ERa dependency. However, as a SERD, it may alter chromatin and transcription independently of a simple loss of ERα. Addition control would strengthen interpretation. The molecular mechanism by which ERα modulates the hypoxic transcriptome, specifically how ERα and HIF pathways converge at ENAC loci should be more studied. In addition, to assess the relevance of this work for luminal breast cancer and ERα expression, specific validation in TNBC should be performed Finally, the authors should provide RT-qPCR validation of the key EDHR genes, especially since this signature is later used for downstream analyses.

      Significance

      General assessment strengths:

      This study uncovers a previously unrecognised ERα-dependent hypoxic response in breast cancer, revealing that ERα actively shapes the hypoxic transcriptome rather than functioning as an isolated pathway. To me, the main strength of this work is the identification of ENaC as a novel hypoxia-specific therapeutic vulnerability in ER+ breast cancer, suggesting that ion-channel regulation may play a broader and underappreciated role in endocrine resistance.

      Limitation:

      Reprogramming of the ERα cistrome under cellular stress is well documented. The study extends these ideas but does not clearly demonstrate a new mechanistic paradigm, particularly because the EDHR is defined primarily through omics approaches without strong mechanistic validation. In addition, we have to keep in mind that the study uses DMOG to model hypoxia-driven chromatin changes, but DMOG inhibits many 2-oxoglutarate-dependent dioxygenases non-selectively. This makes it difficult to attribute ERα cistrome reprogramming specifically to hypoxia, rather than to broad off-target effects. The transcriptomic dataset is more convincing by need the validation suggested previously.

      Audience:

      Given its reliance on omics datasets and preliminary functional assays, the paper will likely appeal to a specialized audience in transcriptional regulation, hypoxia signalling, and ER+ breast cancer biology. However, the limited mechanistic depth and uncertain translational relevance due to the lack of in vivo validation, may reduce its impact for broader oncology or therapeutic-development audiences. Without stronger validation, the findings may be perceived as niche and mainly of interest to researchers focused on ERα chromatin dynamics rather than to the wider cancer research community.

      Expertise:

      My evaluation is based on my background in breast cancer, ERα signaling and breast tumorigenesis. However, I have limited expertise in spacial transcriptomic analyses and advanced CHiP-seq bioinformatic analyses, which may affect my assessment of some computational analyses.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Malcom et al. present evidence that, under hypoxic conditions, hypoxia-inducible factors (HIFs) alter the estrogen receptor alpha (ERα) epigenomic landscape in a model of estrogen receptor-positive (ER+) breast cancer (BrCa). The response of ER+ BrCa to estradiol (E2) in MCF7 (ER+) cells, as well as ERα signaling in both primary and metastatic breast cancer, has been well studied, and the epigenomic landscape of ERα+ BrCa is well documented. The differentially expressed genes (DEGs) identified under treatment with the hypoxia mimetic dimethyloxalylglycine (DMOG) revealed a subset of ERα-dependent hypoxic response (EDHR) genes. The outcome was a reprogramming of the basal ERα cistrome, coinciding with sites enriched for estrogen response elements (EREs) and co-transcription factor binding motifs for ERα, including FOXA1 and AP-1. This was demonstrated by ERα ChIP-seq (i.e. DMOG) and ATAC-seq (i.e. 1% O2) performed under different hypoxic conditions. The transcripts identified following DMOG treatment were leveraged and compared to publicly available RNA-seq datasets from various breast cancer subtypes exposed to 1% hypoxic oxygen. Although the comparison methods varied, the results suggested that BrCa cell lines under 1% hypoxic oxygen conditions showed strong similarity to MCF7 cells treated with DMOG. Genes upregulated in response to DMOG correlated with poorer survival outcomes. To demonstrate the requirement for ERα in this model, MCF7 cells were treated with the selective estrogen receptor degrader (SERD) fulvestrant-the only FDA-approved SERD for ER+ BrCa-showing a dampening of the HIF response among EDHR genes. This suggests that ERα is necessary for the expression of DEGs under hypoxic conditions induced by DMOG. Finally, the sodium channel protein ENaC subunits (i.e., SCNN1B and SCNN1G) were further characterized as candidate EDHR genes. Analyses of publicly available datasets indicated that high mRNA expression levels of these subunits were associated with worse survival outcomes, supporting the clinical relevance of EDHR genes SCNN1B and SCNN1G. To further validate clinical relevance, utilize the Spatial Transcriptome in a small subset of ER+ BrCa.

      Major:

      1. Overall, the study lacks well-controlled experiments comparing hypoxia induced by DMOG with hypoxia induced by 1% O₂ for assessing ERα occupancy throughout.
      2. Lack of evidence for other co-transcription factors impact under hypoxia HIF's in Fig1.
      3. Lack of evidence for DMOG induce HIF protein expression in MCF7 cells.
      4. Figure 1: ATAC-seq was performed under 1% O₂, whereas ChIP-seq was conducted with DMOG treatment, making these conditions not directly comparable.
      5. Figure S1: ERα ChIP lacks estradiol (E2) treatment in MCF7 cells with or without DMOG. The single-gene examples of DMOG effects shown in Fig. S1A are not significant.
      6. Figures S2 and S3: Fig. S2 lacks 1% O₂ conditions, and Fig. S3 lacks DMOG-induced HIF factor assessments.
      7. Figure S4: Estradiol (E2) treatment is missing from the controls, and the figure labeling is of poor quality. Hypoxic conditions for assessing ER status and appropriate controls are also lacking.
      8. Figure S5: The description of fulvestrant treatments under hypoxic conditions is unclear.
      9. Supplemental legends: These require major revision; they are of poor quality and lack statistical details and references to biological replicates.

      Minor:

      1. Overall comparisons throughout the manuscript are weak; the figures appear sloppy and lack sufficient effort in presentation.

      Significance

      In general, the manuscript in its present form does not greatly contribute from published work as the ERα cistrone is well documented work studied for its role in regulating gene expression, particularly in ERα-positive breast cancer. Additionally, a lack of a through comparison between DMOG and or 1 %O2 induce hypoxia in the MCF7 ER+ model, diminished initial interest in the manuscript. The lack of considering estradiol exposure under hypoxic conditions with either 1%O2 and or DMOG also limits relevance to patients with ER+ BrCa. The ERα epigenomic profile has been extensively studied including work under hypoxic conditions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We sincerely appreciate the constructive and insightful comments on our manuscript.

      Both reviewers raised important concerns regarding our use of the term lysosome-related organelle. We fully acknowledge this criticism and will revise the terminology throughout the manuscript with greater care, referring to these structures as Rab32/Rab38-positive vacuoles where appropriate, and discussing their possible relationship to lysosome-related organelles in the Discussion.

      We believe that the remaining comments can be adequately addressed through additional experiments, including CLEM and three-dimensional reconstruction analyses. We therefore submit this revision plan and hope that it will be viewed favorably.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript employs overexpression and knockdown experiments in an immortalized hepatocyte cell line to probe roles for RAB32 and RAB38 in lipid metabolism by lysosomes or lysosome-related organelles (LROs). Using these approaches, the authors show that both RAB32 and RAB38 colocalize with LAMP1 on late endosomes/ lysosomes, that the appearance of enlarged, round lysosomal structures that they refer to as LROs scales with both RAB32 and RAB38 expression, and they provide some evidence to suggest that material from lipid droplets (LD) are taken up into these large rounded compartments in a manner that requires RAB32 or RAB38. Additional experiments are interpreted to suggest that macroautophagy is not required for this uptake but that PtdIns3-kinase, PtdIns5-kinase, and ESCRT complexes are required. Analyses of Rab32/Rab38 knockout mice shows an accumulation of white fat, and in liver an accumulation of what the author interpret to be lipofuscin. The authors conclude that lipid droplets are consumed by LROs in an autophagy-independent manner.

      Major Comments:

      While the topic of the paper is interesting, the conclusions of the paper are not supported by the data shown. No evidence is presented in this paper that the structures analyzed are actual LROs rather than lysosomes, other than their content of RAB32 and RAB38 - which are not limited in expression to LROs. The fact that lipid accumulates in the white fat and not the livers of double knockout mice and that hepatocytes express very little RAB32 and no RAB38 renders the model cell system studied here artifactual; the paper should start with the in vivo analysis and then progress with an appropriate cell type using a line that mimics the behaviour of the endogenous cells. Moreover, the only experiments documenting partial overlap of lipid droplet (LD) material - interpreted as uptake of LDs - into these structures is in cells that massively overexpress LAMP1-mRFP, RAB32, and/or RAB38; in untransduced cells, only a handful of LAMP1-containing structures are enlarged and there is no evidence that they overlap with LD material. Moreover, the only evidence that colocalization is independent of autophagy is that it is blocked by overexpression of a single dominant-negative autophagy component, ATG4B. Finally, the data quantification throughout the paper lacks sufficient power to support the conclusions. Thus, the none of the major conclusions from this paper are well supported, and the physiological significance of the observations for liver function is not at all clear. Altogether, the authors present an interesting idea for which the data are unconvincing.

      Below are detailed concerns throughout the paper.

      1. Abstract:

      i. Please explain why there was a reason to look at the involvement of Rab32/38 in hepatic lipid metabolism.

      ii. It seems rather unlikely that microautophagy can result in the engulfment of an entire lipid droplet in toto; is it more sensible to think of this as a means to transfer the contents of LDs, perhaps piece by piece, into lysosomes? 2. Introduction:

      i. There is a vast literature on the roles of Rab32 and/or Rab38 in the biogenesis of other LROs besides melanosomes, including platelet granules, lamellar bodies in lung epithelial type II cells, and various non-vertebrate structures that should be cited.

      ii. The authors fail to cite the first papers describing roles of Rab32 or Rab38 in bacterial killing by macrophages (Spano et al 2012, PMID: 23162001 and several additional papers from the Galan/ Spano groups), and papers ascribing roles for Rab32 in mitophagy and perhaps other mitochondrial functions, including ER:mitochondrial contacts, prior to the authors' 2025 paper (various papers).

      iii. There have been quite a few papers addressing Rab32/38 effectors in pigment cells (see papers from the Di Pietro group) and other cell types (see Rab32 in mitochondria papers).These facts and at least some of the papers should be cited in the Introduction to better reflect the depth of understanding - and some of the confusion - surrounding Rab32 and Rab38 function.

      iv. Reference to the definition of LROs should also be cited.

      Results: 3. In all experiments where quantification was done, the number of structures or cells analyzed is listed but not the number of experiments. Were these experiments repeated at least three times, and are the values and statistics calculated from the experiment to experiment variation? If not, the statistical values are inaccurate. In all, the number of structures or cells analyzed appears to be quite small. 4.Figure 1.

      i. How did the authors validate the specificity of the anti-Rab32 and anti-Rab38 antibodies used in Figure 1 and elsewhere? Data should be shown with individual knockdowns. Additionally, the overlap with LAMP1 seems too good to be true (it looks 100% and with similar labeling intensities in all cases) - were controls done to ensure lack of cross-reactivity of the secondary antibodies?

      ii. If anti-Rab32 and -Rab38 actually labeled all LAMP1-positive compartments, it seems likely that these are classical late endosomes/ lysosomes and not lysosome-related organelles. Rab32 is expressed by many cell types that do not harbor traditional LROs and may have more ubiquitous functions. The larger ring-like structures mentioned in the text only appear when Rab32 or Rab38 are overexpressed as GFP fusion proteins (compare Fig. 1A and B with 1C-F, and note that the scale bars are the same) and fail to overlap with smaller structures only when LAMP1-mRFP is overexpressed (compare Fig. 1A and B with S1A); these structures likely represent earlier endosomal intermediates illuminated by LAMP1 overexpression. The authors need to reconsider their interpretation of these data in light of these overexpression artifacts.

      iii. In Fig. 1C-F and Fig. S1, were cells transfected or infected with recombinant lentiviruses? This should be indicated in the figure legend. 5. Figure 2. In Fig. 2E-G, cells depleted of Rab32 and/or Rab38 should be compared to cells transduced with a control shRNA, such as a non-coding shRNA, and not to untransduced cells. The quantification of these data "per field" is quite concerning, given that a field could have very different numbers of cells. The data should be normalized to cell number or cell area. 6. Figure 3.

      i. It should be noted in the text that the Lipi- dyes fluoresce in high hydrophobic environments, and thus would indicate a cluster of lipid tails within a lysosome and not just an entire LD. Interpreting these spots as LD under lipase inhibitory conditions is a stretch.

      ii. The evidence that the Lipi-Blue labeled structures are actually inside of the lysosomal structures is not convincing. Three-D reconstructions would need to be done to be more convincing of this. 7. Suppl. Fig. S2. In panel A, there is no obvious difference in intensity of p62 under any of the conditions, and this reviewer does not see any LC3-II in the gel- only LC3-I with a very slight smear underneath that may or may not be specific. The interpretation that autophagy is increased at higher confluency is thus not well founded. In panel B, I see weak labeling of the interior of the giant Rab38-GFP-containing compartments for LC3-mRFP, as if the mRFP was in the process of degradation. How this correlates with the biochemistry in panel A is unclear. 8. Fig. 4 and Suppl. Fig. S3.

      i. All of the graphs in Fig. S3 require appropriate statistical analyses.

      ii. The interpretation of the size of the structures in the double DKD sample is complicated by their accumulation in the perinuclear area, which is very dense. If all samples look like the one in Fig. 3A, then it is not possible to measure their size by this technique and that sample should remain unanalyzed. It is misleading to refer to these as large when they appear to be clusters of small puncta.

      iii. The label on the image itself in Fig. 3C should indicate Lysotracker, not "LRO". This is misleading.

      iv. The same concern raised above that it is not clear whether the Lipi-Blue labeled structures are present within the lysosomal structures is true here. Indeed, in the unstransfected control, many of the LD structures appear to be present adjacent to (on one side of) the Lysotracker-labeled structures, as is also apparent in the shRab32 and shRab38 cells; those where they appear to be inside might simply be above them in these non-super-resolution images. This is a great example of how it is necessary to do 3D reconstructions to fully determine whether the Lipi-Blue structures are engulfed by or adjacent to lysosomes.

      v. Note, the LC3 flux experiment and identification of LC3-II and -I is correct in S4D, unlike the experiment in S2A. 9. Fig. 5. The data in Figure 5A are incorrectly interpreted. PtdIns3P or PtdIns(3,5)P2 are present only on the cytoplasmic leaflet of endosomes and lysosomes; if those membranes were to be internalized, the phosphate would be removed. Thus, the presence of signal on the inside of the lysosomal structures does not indicate the presence of PtdIns3P or PtdIns(3,5)P2; it represents likely free mCherry, or perhaps the full conjugate with 2XFYVE, that has been engulfed by the lysosome and is no longer bound to its ligand. The observation that the mCherry signal accumulates near the Lipi-Blue signal in orlistat-treated cells thus cannot be interpreted as an interaction of the phosphoinositide with the LD or its content phospholipids or acyl chains. The disappearance of a punctate 2XFYVE signal is expected upon treatment with a PI3kinase inhibitor since it eliminates the ligand, and the failure of Lipi-Blue to accumulate in lysosomes of inhibitor-treated cells could reflect just about any defect in endolysosomal maturation since PtdIns3P is required for the early to late endosome transition as well as for several aspects of late endosome and lysosome biology. All this experiment shows is that uptake of Lipi-Blue labeled structures into lysosomes requires endolysosomal maturation. The same goes for the shVps4 experiments in Fig. 5B, which are also less convincing of any phenotype, and Fig. S5.

      Significance

      Because the conclusions are not supported by the data shown and because the authors exploit an immortalized cell type that does not mimic the behavior of the endogenous cells, the significance of the work as presented is very low. If the conclusions were justified, the advance could potentially be conceptual in showing that RAB32 and RAB38 redundantly functionalize lysosomes in some cell types to metabolize lipids through a mechanism distinct from macroautophagy. Such an advance would be of broad interest to investigators interested in the functions of lysosomes and lysosome-related organelles, as well as membrane trafficking machinery. However, the authors are unfortunately a long way from such an advance.

      My expertise is in the biogenesis of LROs, and I am considered a leading expert in the field. In my opinion, the authors require a functional readout unique to LROs to define the compartments shown as LROs. Otherwise, they might consider altering their language, abandon the LRO designation, and focus on mechanisms of fatty acid uptake promoted by RAB32 and/or RAB38 in appropriate cell types. Unfortunately, their own data show that the cell type used here is not such an appropriate cell type.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study investigates the roles of Rab32 and Rab38 in hepatic lipid droplet metabolism. The authors propose that Rab32/38-positive lysosome-related organelles (LROs) mediate lipid droplet degradation through a mechanism independent of conventional macroautophagy. While the study addresses an interesting question, several conceptual and technical issues need to be addressed before the conclusions can be fully supported.

      Major Concerns

      1.The authors primarily define the Rab32/38-positive ring-like structures as "lysosome-related organelles (LROs)" based on their morphological characteristics and co-localization with LAMP1. However, this classification lacks biochemical validation. Would it be more appropriate to include a Lyso-IP assay to provide additional supporting evidence? 2.In hepatocytes, what is the operational definition of LROs? Beyond being "larger in size," how are these structures functionally distinguished from conventional lysosomes? If Rab32/38 defines LRO identity, why does GFP-Rab32/38 not co-localize with all LAMP1-positive structures (Figure S1A)? 3.In Figure 2A, the dextran pulse-chase experiment shows fluid-phase uptake into large vacuoles; however, dextran can enter any endocytic compartment after prolonged chase periods. What evidence supports that these structures are bona fide LROs rather than enlarged late endosomes or lysosomes resulting from long-term culture? What determines why only certain lysosomes become Rab32/38-positive? This heterogeneity is not explained. Does it imply that pre-existing lysosomes convert into LROs, or that LROs are newly formed under high-density stress? The developmental trajectory of these structures has not been explored. 4.The authors propose a microautophagy mechanism based on the "invagination-like" structures observed by light microscopy (Figure 3A). However, the resolution of light microscopy is insufficient to distinguish true membrane invaginations from lipid droplets that are closely apposed to, or partially wrapped by, the outer membrane of LROs in three-dimensional space. Would a CLEM experiment be necessary to confirm that lipid droplets are indeed located within the lumen of LROs, rather than in deep invaginations that remain connected to the cytosol? In addition, multilamellar membrane structures were observed after Bafilomycin A1 treatment (Figure 3A). Have these structures been validated by electron microscopy, or could they simply represent complex membrane infoldings within swollen lysosomes? The conclusions drawn from light microscopy alone appear somewhat insufficient. 5.The authors use ATG4B C74A overexpression to claim macroautophagy independence. However, while this mutant blocks LC3 lipidation, the study still lacks genetic evidence, such as ATG knockouts. In Figure S2B, the authors state that the "majority" of Rab38-positive LRO-associated lipid droplets are LC3-negative, but no quantitative data are provided. 6.The manuscript does not clearly distinguish the functions of Rab32 and Rab38. Although the authors describe these proteins as paralogs with overlapping roles, multiple data points indicate that they have differential effects on lipid droplet (LD) metabolism. Notably, Rab38-but not Rab32-significantly affects LD delivery to acidic compartments, exerts a stronger influence on LRO size, and responds more robustly to VPS4B perturbation. These observations suggest that Rab32 and Rab38 regulate distinct steps of LD metabolism rather than functioning redundantly. However, the manuscript does not clearly highlight these functional differences and lacks mechanistic validation. 7.Figure 5A shows that the PI3P probe (2×FYVE) forms ring-like structures inside or near the LRO membrane. However, LROs themselves are Rab5-negative (Figures 1C-E), and PI3P is typically generated by Vps34 on early endosomes. Where do these PI3P signals originate? Are they transported from other organelles, or is there a local PI3P-generating mechanism on the LRO membrane? If the latter, which kinase is responsible, and is Vps34 recruited to the LRO membrane? This issue is not discussed. If PI3P is indeed locally generated on LROs, it could represent a key feature distinguishing LROs from classical lysosomes.

      Minor Concerns

      1.The double-knockout mice exhibit obesity and fatty liver; however, Rab32 and Rab38 are expressed in multiple tissues. A whole-body knockout model cannot distinguish whether these effects are hepatocyte-autonomous or arise from contributions by adipose tissue or macrophages, emphasizing the need for liver-specific knockout animals or cell models. Serum TAG levels were unchanged, and the authors speculate that VLDL secretion may be impaired, but this was not directly tested. Furthermore, the authors do not address the observed sex-specific effects, which appear to be male-specific. 2.The concentration of Orlistat used is relatively high (50-200 μM) and may cause non-specific effects. Have dose-response experiments been performed, or have other LAL inhibitors (e.g., Lalistat) been tested? 3.LysoTracker reflects acidity rather than lysosome identity, and reduced acidification in DKD cells may affect co-localization analysis.

      Significance

      Assessment of Significance Overall Assessment

      Strengths:

      Conceptual novelty: Introduces lysosome-related organelles (LROs) into hepatic lipid metabolism, expanding the functional repertoire of Rab32/38 beyond pigment cells and macrophages.

      Mechanistic exploration: Links LD uptake to PI3P/PI(3,5)P2 signaling and VPS4B, providing molecular handles for future studies.

      In vivo validation: DKO mice show age-dependent obesity and HFD sensitivity, establishing physiological relevance.

      Weaknesses:

      Rab32 vs. Rab38 functions remain blurred: Data suggest differential roles (Rab38 in LD delivery, Rab32 in LD size regulation), but authors default to "redundancy" narrative.

      Microautophagy evidence incomplete: Relies on light microscopy; EM/CLEM needed to confirm true internalization.

      Model relevance unclear: High-confluence AML12 vacuoles lack clear physiological correlate in healthy liver.

      Audience

      Primary:

      Lysosome biologists

      Autophagy researchers

      Lipid metabolism researchers

      Secondary:

      Cell biologists

      Metabolic disease researchers

      Geneticists

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      1) Summary

      This study investigates the mechanochemistry of Arp2/3-mediated branched actin networks at the level of individual branch junctions under load. Using microfluidic single-filament/branch force assays (including constant-force flow and open-chamber imaging) the authors quantify debranching, re‑nucleation, and mother- vs daughter‑interface stability across nucleotide states of Arp2/3 (ADP-Pi, ADP, and an ADP-BeFx proxy for ADP-Pi). They further test effects by two branch regulators (GMF and cortactin). Key findings include: (i) ADP-Pi and ADP complexes share similar force dependence but differ markedly (~20×) in intrinsic dissociation rate; (ii) phosphate turnover on the Arp2/3 complex is rapid ii) affinity for Pi drops when Arp2/3 loses its daughter filament; (iii) quantification from model fits uncovers large stability differences between daughter and mother interfaces of the Arp2/3 complex; (iv) extraordinary high stability of ADP-Pi-like Arp2/3 on the mother filament; and (v) distinct effects of GMF and cortactin on force‑dependent stability. Overall, the work combines technically demanding measurements with mechanistic modeling to probe how nucleotide state and regulatory factors tune branch mechanics.

      2) Major comments:

      1. Low force kinetics and completeness of survival curves (Figure 1). "For all forces, the surviving curves exhibited a clear single exponential behavior...." While the data can be fitted to monoexponential decay curves, data at low forces is clearly incomplete. >90% of branches have not dissociated by the end of the experiment. For the particular data shown in 1C (F00nN, n=60 total branches) it means that the time information is coming from

      Essential; experiment might already be performed. Otherwise straightforward to do (weeks time).

      In figure 1B, we indeed show a Survival curve for ADP-Arp2/3 complex branch dissociation at 0 pN up to 900 seconds. As now shown in updated supp figure S2, the data was in fact acquired for at least 5000 seconds for ADP-Arp2/3 and ADP-Pi states (N=2 repeats for each condition, with n = 60 and 90 branches for ADP-Arp2/3 branches, and 90 and 132 branches for ADP-Pi-Arp2/3 branches). The debranching rates reported in the initial submission were already obtained by fitting the surviving curves over the whole duration of the experiments.

      1. Stability Analysis (Figure 4). I can follow much of the arguments presented in the stability analysis of the daughter vs mother interfaces, which is in principle extremely interesting! However, there are some concerns here:

      i) The authors emphasize the zero force ratio derived from fits (which is linked to the stability difference of the two interfaces in the absence of force) despite this being only weakly constrained by data. Intuitively in the model, the stability difference should grow to very large values as the re-nucleation ratio approaches 1 at low force. This combined with the noise in the data poses an issue in my opinion. Looking at the data and the error margin, I think that the authors cannot state with high confidence that there is a real difference between the relative stability of the daughter and mother interfaces between the two nucleotide states of the complex.

      Essential; analysis and textual revision only

      We thank the reviewer for this comment. The difference in stability between the two interfaces is strongly constrained by the shape of the branch renucleation ratio versus force curve, and its value at 0 pN. This is illustrated in the figure shown below (new Supp Fig. S8), showing the dissociation rates of the two interfaces (in 'dashed' and 'point-dashed' style) that contribute to the overall debranching rate in each nucleotide condition. Despite the limited force range at which we probed the debranching rate, the branch renucleation ratio curve informs us on which interface is the weakest, and how this evolves with force.

      We have assessed the confidence intervals of the parameters obtained from the fits, taking into account the error bars on our experimental datapoints. It seems to indicate that the simultaneous fits of the debranching rate and the branch renucleation ratio curves indeed constrain the parameters quite strongly. These confidence intervals are now reported in the main text and in the summarizing table.

      We have repeated branch renucleation experiments for ADP-BeFx- and ADP-Pi-Arp2/3 complex branches (see new figure 4C&D, and our response to the next point). We believe these new measurements allow a better assessment of the relative stability between the two interfaces for Arp2/3 complex branch junctions in the ADP-BeFx state.

      Still, we agree with the reviewer that the dispersion of the experimental data does not allow us to have a strong confidence on the crossover force and relative stability difference of the interfaces. Therefore, we have slightly toned down the way we present and discuss the differences in stability when comparing the two nucleotide states.

      ii) For ADP-Pi, the renucleation ratio essentially remains flat over the measured force range. Hence, the data can only provide little leverage to estimate both the zero force ratio and, more importantly, the differential distance to the transition state in the slip-bond model in my opinion, which will show in the crossover force. Consequently, the quoted ">100×" stability difference at F=0 and the crossover force >20pN are driven largely by extrapolation rather than direct constraint by data. Given the high number of free parameters in the model, I would anticipate that several crossover forces and differential distances might explain the data nearly equally well. Instead of loosely reporting exact number from fits, I would have hoped for some sort of sensitivity analysis, for instance relying on profile likelihoods. Also parameter values could be reported as bounds (e.g crossover force≫measured range) rather than precise point estimates. This issue re-occurs (albeit not as drastically) for the cortactin experiments (Figure 6).

      Essential; analysis and textual revision only

      As mentioned in our response to the previous point, we have repeated renucleation experiments for ADP-BeFx- (and also for Arp2/3 complex branches in the presence of 50 mM Pi) (see new figure 4C&D) to better characterize the differential distance between to the transition force. The crossover force for the ADP-BeFx state is now 13.5 pN and the ratio of the stability between the two interfaces is roughly 100 times.

      We agree with the reviewer that the dispersion of the experimental data does not allow us to have a strong confidence on the crossover force and relative stability difference of the interfaces. We have thus toned down the way we report these values. We do believe though that the difference we report between the ADP and ADP-BeFx state appears to be significant and needs to be acknowledged.

      As a side note, it has proven to be challenging to pull on branches at forces higher than 7 pN. To apply a large force on the branch junction, we need to have a high flow rate. In this case, it appeared that the height of the filaments (both mother and daughter filaments) above the surface seem to deviate from what we have established in our previous studies (Jegou et al, Nat. Comm. 2013 & Wioland et al, PNAS 2019). This may originate from the fact branched filaments have a more complex shape than an individual filament. Characterizing accurately the evolution of the branch height as a function of the flow rate and applied force would require quite extensive additional characterization, which, we believe, is beyond the current focus of this study on the stability of Arp2/3 complexes.

      iii) One important expectation from the "two slip bond" model is that branch dissociation rates should not necessarily scale mono-exponentially as they mostly do over the accessible force range of the paper. However, once the "minor" pathway of dissociation from the mother starts to dominate at high forces, rates become more force sensitive. This is nicely recaptured by the model fits in Figure S6 but deserves some explanation in the text. Otherwise, people will simply remember the "ADP-Pi is 20-fold more stable than ADP at all forces" message.

      Essential; textual revision only

      We now have rephrased the key sentences (in the Abstract and Results sections) to more clearly state that the debranching rate is not increasing mono-exponentially with force.

      In the Abstract: "Remarkably, we find that branch junctions are over 30-fold more stable when the Arp2/3 complex is in the ADP-Pi rather than ADP state, and that force accelerates debranching with similar exponential factors in both states."

      In the Results section: "The debranching rate seems to increase exponentially with the applied pulling force, in the range of 0 to 6 pN (Fig. 1F; see more refined analysis below). This behaviour is predicted by the Bell-Evans model for a slip bond."

      iv) One important prerequisite for the model is that isolated Arp2/3 complexes (without a daughter filament) should dissociate with equal rates from mother filaments at all flow rates. Since the Arp2/3 complex prefers mother filament curvature, forces experienced by the mother might change its off-rate. It would be good to refer to this assumption in the text and experimentally verify it. I could not find it in the paper nor in Ghasemi et al 2024.

      Essential; simple experiment (a weeks time).

      We thank the reviewer for this important comment.

      First, we investigated whether the viscous drag force, applied on the ADP-Arp2/3 complexes which remain bound to mother filaments could affect their stability. We have performed branch renucleation experiments at different flow rates but with the same pulling force on branch junctions (average force 3.9 pN) by adapting the length of the daughter filament. As shown in new supp. figure S11 (shown below), we did not observe any significant differences between 'low' and 'high' flow rates. If the off-rate of the surviving Arp2/3 was significantly affected by the flow, this would have led to a variation of the renucleation ratio with the flow rate.

      Second, we have investigated the impact of the tension experienced by the mother filament at the location of the branch junction for ADP-Arp2/3 complex branches, with the same pulling force on the branches (average 4.1 pN pulling force on branches). We have quantified the debranching rate from three groups of branches depending on their position along mother filaments. As shown in new supp. figure S12 (shown below), we can observe a small trend, where the debranching rate decreases with the tension on the mother filament at the branching point.

      Doubling the tension on the mother filament from 15 to 30 pN decreases the debranching rate by a third. Though, pairwise logrank tests performed between the survival fractions of the three binned groups do not report any statistical significant difference (all p values > 0.05). One possible explanation for this is the height of the mother filament in the microfluidics flow that increases linearly from the anchoring point to the free barbed end. As a consequence the pulling force on the branches will be higher, as branches experience faster flows.

      For these same groups, upon branch dissociation, all remaining-bound Arp2/3 complexes are exposed to the same flow rate; the branch renucleation ratios were similar. Thus branch renucleation ratio seems to not significantly depend on the tension experienced by the mother filament at the branching point.

      Similarly, Pandit et al PNAS 2020, Extended figure S1, also reported no detectable impact of the mother filament tension on the debranching rate in their assay.

      v) The force dependence of the branch re-nucleation rate (Fig 3D) has been measured previously by the same group (Ghasemi et al). While the data in the older paper has not been fitted by a model, the trend of the data in the previous paper looks conspicuously different. Are there any explanations for this? I speculate that it might be related to actin and ATP not being saturated (low-force re-nucleation rate rarely exceeds 80%) in Ghasemi et al., but it would be good to know what the authors think about this. Essential; textual revision only

      This is a good point. We have plotted the data of the renucleation ratio from ADP-Arp2/3 complex from figure 1F of Ghasemi et al, Sc. Adv. 2024 (performed at 0.3 and 1 µM actin), together with the data of the current study from figure 4D (performed at 1.5 µM actin). We feel this comparison could be of interest to the readers, and have thus integrated it in the manuscript as new supp. figure S13 (shown below).

      As expected, the branch renucleation ratio is lower with lower concentrations of actin. The experimental data points from Ghasemi et al are similarly well fitted by the branch renucleation function obtained for 1.5 µM multiplied by a scaling parameter, which reflects the fact that the branch renucleation ratio is actin concentration dependent (Fig. 6A in Ghasemi et al). This scaling parameter was the only free parameter of those fits.

      Since the branch renucleation ratio depends on the actin concentration as follows, 0.97.kon.([actin] - Cc)kon.([actin] - Cc)+koffATP-Arp2/3 , with kon = 3.4 µM-1.s-1 and koff ATP-Arp2/3 = 0.66 s-1 from (Ghasemi et al. 2024), the scaling parameter obtained by the fits give estimates of the actin concentration in these experiments, of 0.6({plus minus}0.05) and 0.9({plus minus}0.2) µM for the experiments performed at 0.3 and 1 µM respectively in (Ghasemi et al. 2024).

      1. Stability of the authentic ADP-Pi-Arp2/3 complex on the mother filament. The extraordinary stability of the isolated ADP-BeFx-Arp2/3 complex on mother filaments is surprising, especially considering that both ATP and ADP states are much more labile (Ghasemi et al 2024). I would recommend repeating this experiment in the authentic ADP-Pi state with labelled Arp2/3 complexes as a more direct readout, even if this would require working with very high phosphate concentrations.

      Essential; simple experiment (a weeks time).

      We have followed the recommendation of the reviewer and have performed new experiments using fluorescent Arp2/3 complexes for ADP, ADP-BeFx and ADP-Pi states, now displayed in new figure 5C (also shown below).

      For fluorescent Arp2/3 complexes remaining bound to the mother filament, the Arp2/3 complex - mother filament interface is ~ 100 times more stable in the ADP-BeFx state (0.0046 s-1) compared to the ADP state (0.56 s-1). We also assessed the dissociation of surviving ADP-BeFx-Arp2/3 complexes using unlabelled Arp2/3 complexes (previously in figure 4B, repeated experiment shown in new supp. figure S10), which also indicates a remarkable stability.

      The dissociation curve of surviving Arp2/3 complexes in the presence of 50 mM Pi and 200 µM ATP in solution reflects the mixture of Arp2/3 dissociating in the ADP/ATP state and ADP-Pi-Arp2/3 that can either dissociate in the ADP-Pi state or lose their Pi and dissociate in the ATP state. Despite the presence of 50 mM Pi, the rate at which ADP dissociates and ATP reloads rate is much faster than Pi binding. Fitting this survival curve with a function that accounts for the initial double populations and the evolution of the ADP-Pi population (see Methods) gives a good estimate of the Pi release rate.

      OPTIONAL: Further, but beyond the scope of the present paper, would be titrating phosphate in these experiments, which would even allow the authors to independently verify the reduced Pi affinity for Arp2/3 in the mother filament. Of note, this affinity difference is needed to satisfy detailed balance in the reaction scheme (Fig 4 D)!

      We thank the reviewer for this suggestion. High concentrations of phosphate in the buffer renders glass surfaces quite sticky in our assays. We've tried several different passivation strategies (BSA, PLL-PEG, K-casein, ...) but none gave satisfactory results. So titrating phosphate, by going beyond 50 mM phosphate, proved to be quite challenging.

      Detailed balance, considering the two possible routes connecting the ADP-Pi-Arp2/3 complex branch junction state and the surviving ADP-Arp2/3 complex state, can be written as KPi rel.branch junction . Kdebranching ADP-Arp2/3 = KdebranchingADP-Pi-Arp2/3 . KPi rel.surviving Arp2/3.. Some of these affinity constants are not known, because of the inability to determine reverse reactions rates such as the rebinding of a daughter filament to a surviving Arp2/3. It is thus hard to determine how the affinity of Pi for Arp2/3 complex changes between Arp2/3 complexes at branch junctions and surviving Arp2/3 complexes on mother filaments.

      While we cannot determine the affinity constant of Pi for a surviving Arp2.3 complex, our data indicates that the dissociation rate of Pi is higher from Arp2/3 complexes at branch junction (koff = 0.21 s-1) than from surviving Arp2/3 complexes (koff = 0.05 s-1). This unexpected finding indicates that surviving Arp2/3 complexes adopt a conformation where the nucleotides are readily exchanged, but where the 'back door' for Pi release is less open. We now discuss this point in our revised manuscript.

      1. Importance of "surviving" ADP-Pi-Arp2/3 complexes. The authors show a) rapid turnover of Pi on the ADP-Arp2/3 complex in both branch- or mother filament-bound state and b) the lowered Pi affinity of the latter. Nonetheless, they emphasize the importance of long-lived "surviving" ADP-Pi bound complexes on the mother (even stated in the abstract). I understand that this fraction shows under some experimental conditions (BeFx), but unless I am missing something, most complexes should rapidly lose their phosphate and either exchange nucleotide or dissociate from the mother under physiological conditions. Please clarify or tone done.

      Essential; textual revision only

      We thank the reviewer for their remark. We have tried to clarify this aspect in the manuscript.

      As shown now with the departure rate of fluorescent surviving Arp2/3 complexes together with branch renucleation data, we show that surviving ADP-Pi-Arp2/3 complexes are quite stable on mother filaments, because they detach and release their Pi slowly, such that branch regrowth will occur provided there is actin in solution. In the absence of actin monomers, as the reviewer correctly points out, the surviving ADP-Pi-Arp2/3 will predominantly release its Pi and thus become a surviving ADP-Arp2/3 complex. We have modified the text to avoid any confusion.

      1. GMF mechanism. The authors claim that GMF "...accelerates the departure of the surviving Arp2/3 complex from the mother...". I assume that they infer this from decrease in the re-nucleation ratio. However, alternatively GMF could simply dwell on the complex, inhibiting re-nucleation without promoting dissociation from the mother. The authors should either monitor Arp2/3 dwell times directly to discriminate between these possibilities or be more cautious in their conclusions.

      Essential; simple experiment (a weeks time) or textual revision.

      In Ghasemi et al. Sci. Adv. 2024, we examined the departure of Arp2/3 from the mother filament after GMF-induced debranching using fluorescent Arp2/3. Most of the fluorescent Arp2/3 dissociated from mother filaments within the same frame as the branch, i.e. within 0.5 seconds after the debranching event, and none were visible after another second . This could be due to Arp2/3 departing with the branch or an accelerated departure after branch dissociation. In any case, this rules out the possibility that GMF would dwell on the surviving complex for a substantial amount of time without promoting dissociation from the mother.

      In the present manuscript, we now show that increasing the ATP concentration 10-fold (from 0.2 to 2 mM) is sufficient to restore the branch renucleation ratio to its level without GMF. This shows that GMF does not cause Arp2/3 to leave with the branch, but rather that it (also) acts on the surviving Arp2/3 complex, in a way that is countered by high concentrations of ATP. More specifically, it suggests that GMF accelerates the departure of the surviving ADP-Arp2/3 complex, either directly and by hindering the reloading of ATP, and that GMF does not affect the surviving Arp2/3 complex once it has reloaded ATP.

      We now discuss these two non-mutually exclusive possibilities for the accelerated dissociation of the surviving ADP-Arp2/3 complex in the manuscript.

      6.Cortactin mechanism and the "leash model". I must say that the cortactin data are the most puzzling part of the paper and hard to reconcile with what we know from structure. I was hoping to find some of this resolved in the discussion. However, I do not understand the "leash model" in the discussion section for cortactin-mediated branch stabilization: "This would explain the observed increase in branch survival compared to the absence of cortactin. As the pulling force is increased, this rebinding mechanism becomes less efficient." According to my understanding of the data, this is opposite to what happens. Cortactin only stabilizes the labile interface at elevated forces! Some re-writing might help here.

      Essential; textual revision.

      We thank the reviewer for having us think more thoroughly about the model we initially proposed. We now believe that our 'leash' mechanism is not able to fully recapitulate our observations in a simple and satisfactory manner.

      We now propose a much simpler model, where the binding of cortactin to the Arp2/3 complex at the branch junction simply changes the energy landscape of the Arp2/3-daughter interface without the need to invoke a rebinding of the daughter filament upon branch departure. We have updated our interpretation of the data in the Discussion section accordingly.

      Overall, our results on the impact of cortactin on branch renucleation highlights a surprising behaviour that would require further investigation to fully decipher the underlying molecular mechanism.

      3) Minor comments

      Organization: - I do not want to impose on how to best tell the story, but I felt that Fig1 A-D and Fig 2 A-B belong to one logical unit (nucleotide dependence), whereas Fig 1 E-F and Fig 2 C belong to the other (Pi binding and exchange). Perhaps consider re-organizing to streamline presentation?

      We thank the reviewer for their suggestion. We agree that it flows more naturally as suggested, and have made the changes! Thank you.

      Semantics/Typos: - Abstract: „... ADP-Pi and ADP-Arp2/3 detach with the same exponential increase as a function of force...". Increase should refer to the dissociation rate, which should be added to the sentence.

      We have corrected this.

      Results page 8: "...and the majority of Arp2/3 complexes detach from the mother filament while remaining bound to the branch at the debranching time." "Branch" should likely be daughter here, as there is no branch after dissociation of either interface.

      We have corrected this, thank you.

      Results page 13: "Exposing ADP-BeFx-Arp2/3 complex branch junctions to a saturating amount of GMF...". It is strange to imply saturation, because GMF likely simply does not bind to the complex in this nucleotide state with appreciable affinity. Suggest to change to "high".

      We have made the changes accordingly.

      Discussion page 18: "Moreover, in mammalian Arp2/3, His80 in Arp3 (corresponding to His73 in mammalian actin) is not methylated, and corresponds to residue N77 in Arp3, which is also not modified." N77 likely belongs to Arp2?

      We have made the changes accordingly.

      Discussion page 19: "We showed that Pi affinity for Arp2/3 complexes at branch junctions is around 3.7 mM (Fig. 1), a value which lies within the reported 1-10 mM Pi concentration measured in the cytosol in different mammalian cell types". Notably, this is not too different from F-actin, which should be mentioned. By this measure alone, free inorganic phosphate could also directly regulate actin filament stability!

      We now mention this and discuss that intracellular Pi can also impact actin filament nucleotide state.

      Future interest (non essential): - It would be utterly exciting (but beyond current scope) to quantify how instantaneous debranching rates evolve for naturally aging branches starting from ATP-Arp2/3 complexes!

      We thank the reviewer for this remark. It is indeed quite beyond the scope of the current study, as this would require a way to probe ATP-Arp2/3 complex branches while daughter filaments are still quite short (so pulling on them is difficult). An interesting alternative could be to use ATP analogs, such as App-NHp (aka AMP-PNP), to stabilize this state. However, some studies have mentioned that App-NHp is not very stable.

      Significance

      General assessment:

      This is a compelling and carefully executed study that delivers a clear mechanistic framework for how Arp2/3 branch junctions fail and re‑form under load. The central strength is the tight integration of state‑of‑the‑art reconstitutions with careful and original kinetic analysis. The experimental design is elegant and experiments have been carried out to a masterful standard. The figures are clear, the statistics are appropriate with some exceptions as detailed above. There are very few labs in the world that could have achieved this feat!

      A few aspects could be further strengthened, most notably the explanation and application of the "two slip bond" model as well as slightly more restraint in speculating around specific regulatory mechanisms. However, these are minor refinements that do not detract from the important contributions of the paper.

      Overall, the clearly work merits publication with high priority after revision; most requested changes are textual/analytical with very few targeted experiments, which would substantially strengthen core claims.

      We thank the reviewer for their positive evaluation of our manuscript. We hope that our responses to the detailed points above, along with the corresponding revisions of the manuscript, will alleviate their concerns.

      Advance relative to prior literature: The major novel findings of the paper are already summarized above. There is some recent work done on the subject of branch mechanics by the authors (Ghasemi et al 2024, PMID: 38277459) and others (Pandit et al 2020 PMID: 32461373), but the focus of the present work is clearly unique and the there is plenty of novel insight.

      Audience and impact: Primary audience: specialists in cytoskeleton dynamics, in vitro reconstitution single molecule biophysics, and mechanobiochemistry. Secondary: researchers in cell motility, morphogenesis and mechanobiology, physicists working on active matter and modelers studying force producing and load-bearing biopolymer networks. The results and analysis framework should inform quantitative models of branched network turnover under load and the interpretation of regulatory factor action in vivo and in cells.

      Reviewer expertise: Actin dynamics; biochemical reconstitution; single molecule approaches; biophysics.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Xiao et al examine the molecular events occurring when Arp2/3 complex-mediated actin filament branches are removed from mother actin filaments. They do this using microfluidics assay with purified proteins combined with single filament TIRF imaging of branched actin filaments with distinct fluorescent labels. The contribution of different nucleotide states of Arp2/3 complex are tested in conjunction with the relationship force exerted on the branches and regulatory protein involvement from GMF and cortactin. The data seem comprehensive and highly quantified in response to concentration, force, fraction of branches and survival times and branching rates. They find that ADP-BeFx and high phosphate concentrations (leading to the ADP-Pi state) leads to a slower debranching rate at a given level of force applied. The ability to rapidly switch the buffer gives powerful information about response times of debranching compared with other actin remodelling events. They use renucleation experiments to determine that the previous debranching event most often occurs at the Arp2/3 complex/daughter interface, showing that filaments will be ready to re-branch in the stable ADP-Pi bound state. GMF addition allows debranching of the ADP state to occur at a lower force. Cortactin acts similarly to the ADP-Pi state to increase branch stability.

      Specific comments

      The pulling force on the branches seems to arise from different flow rates in the microfluidics. Viscous drag is mentioned and I can see there is methylcellulose in the buffer. It would be helpful to have the explanation of the conversion between flow and force, even if it has been standard in previous work.

      We apologize if this was unclear: in microfluidics experiments, the buffer does not contain methylcellulose. Methylcellulose is only used for 'open chamber' experiments, where no force is applied to Arp2/3 branches, to maintain them in the TIRF field of excitation (Figure S2).

      To better clarify the conversion between flow and force, we have rephrased and extended the Methods section to explain how the force on the branch junction is computed based on the local flow velocity and the length of the daughter filament.

      Pg 5 - what was the motivation to titrate phosphate? It seems a stretch that intracellular Pi levels are tuning branching inside cells more than protein-mediated control (GMF or cortactin) - can the authors evidence this at all?

      We are not claiming that the level of Pi plays a stronger regulatory role than proteins. We show that inorganic phosphate tunes the state of the Arp2/3 complex, which in turn modulates the action of regulatory proteins, such as GMF and cortactin.

      Nonetheless, we do show that the contribution of inorganic phosphate is quite central as it can (1) strongly stabilize branch junctions (~30-fold decrease in the dissociation rate), and (2) tune the activity of GMF and cortactin on Arp2/3 complexes at branch junctions as well as on the 'surviving' Arp2/3 complexes that remain bound to mother filaments.

      We thus titrated phosphate and found that its impact on Arp2/3 complex stability is significant in the range of Pi concentration that is explored in cells. For the sake of completeness, and following a comment from reviewer #1, we now also mention the affinity of Pi for actin subunits in filaments in the Discussion, and discuss the impact of intracellular Pi on actin itself.

      Minor comments

      • In the introduction, while the structural and mutagenesis evidence is clearly stated, in other cases a bit more detail would be helpful e.g. 'biochemical studies', which referred measurement of hydrolysis rates using radiolabelling

      We have made changes to more precisely define which biochemical assays were used in previous studies.

      • Page 3 Figures shouldn't be referenced in the introduction

      We have removed the references to the figures from the introduction.

      • Page 3 slip bond behaviour needs explanation

      We now explain the concept when first using this concept in the manuscript, as follows: "The debranching rate seems to increase exponentially with the applied pulling force, in the range of 0 to 6 pN (Fig. 1F; see more refined analysis below). This behaviour of accelerated debranching with the increase of the applied force is similar to the 'slip bond' concept, as predicted by the Bell-Evans model of the force-dependent lifetime of the interaction between two proteins".

      • Figure 1B seems to be a theoretical schematic which is superfluous

      We suppose that the reviewer is actually referring to figure 3B of the initial manuscript, describing the energy potential of a molecular interaction as a function of the reaction coordinate. We agree with the reviewer that it is not absolutely required and we have removed it.

      • Figure 4D is helpful, different weight lines might help even more to explain the dominant pathways

      We have made modifications to the biochemical reaction scheme in this figure (now figure 5F in the revised version). We hope we succeeded in improving its readability. Since the different paths depend on mechano-chemical parameters, there is no real dominant pathway per se.

      **Referee cross-commenting**

      Rev1 sounds like the specialist here. I can't comment on their requests. Some similar points arise between the reviewers which need addressing.

      Reviewer #2 (Significance (Required)):

      Significance

      Taking a look at references 16 and 19, I do not find it clear what is achieved differently in the current work compared to these papers and what agrees and what disagrees. If it's a species difference I might expect the two species would be analysed side-by-side in this paper.

      We thank the reviewer for this important comment. The goal of our study was not to compare the behaviour of mammalian and yeast Arp2/3 complexes.

      We now try to better explain that the motivation of the present work is to address how the nucleotide state of the Arp2/3 complex tunes actin branch mechanosensitive stability, and regulates interactions with well known Arp2/3 complex binding proteins. Most of the reactions are quantified here for the first time. Moreover, the experiments with branch junctions in different nucleotide states are done under controlled mechanical conditions, providing the first direct measurements of the force-dependence of the debranching reactions. Our detailed kinetic analysis of the full reaction scheme allows us to model the different binding interfaces of the Arp2/3 complex.

      In addition, it is worth noting that:

      1. Species matter and this is why ref 16 and 19 can give the impression to disagree on the ability to renucleate branches thanks to the stability of surviving Arp2/3 complexes on mother filaments.
      2. In ref 16 (Pandit et al, PNAS 2020) species are mixed (yeast Arp2/3 and mammalian alpha actin from skeletal muscle), likely leading to a different behaviour compared to the only mammalian protein situation we examine in our current work. In particular, with mixed species one misses the ability to renucleate, as shown in our previous study Ghasemi et al (ref 19). However, since mixing species does not correspond to anything physiological, we do not think it is worth repeating these conditions alongside our experiments.
      3. Further, the analysis carried out in ref 16 suffers from important limitations: the force was unknown (not calibrated) and the data was fitted by a model that compounded several reactions, providing only an indirect estimation of the rates, in particular at zero force. In contrast, we have worked with calibrated forces (including dedicated experiments at zero force) and we have carried out specific experiments to directly measure several rates.
      4. In ref 19 (our earlier work) we did not investigate the impact of the nucleotide state of the branch junction at all, and we did not systematically measure the dissociation rates as a function of force. Contrary to Pandit et al, we directly measure the difference in branch stability at zero force between ADP and ADP-Pi states and show that the ~ 30 fold difference holds true at all probed forces. Last, the force dependence of the branch renucleation success rate gives us crucial information on which of the two Arp2/3 complex interfaces ruptures first.

      I'm not understanding how the authors can distinguish effects of adding phosphate and BeFx on Arp 2 and 3 compared to effects on actin. Importantly, are possible accompanying changes in the actin filament a confounding factor?

      We have checked that the nucleotide state (ADP-BeFx and ADP-Pi versus ADP) of the mother and daughter filaments have no impact on branch stability:

      • In the experiments shown in figure 2F, where the buffer condition to which branches are exposed is quickly changed from phosphate buffer to buffer without phosphate, we observe a rapid change of branch stability. Actin subunits at the branch junction are in F-actin conformation according to recent cyroEM observations (ref. Chavani et al, Nat Comm. 2024; Liu et al, NSMB 2024). These actin subunits, initially in the ADP-Pi state, are expected to age and become ADP with a rate of ~ 0.007 s-1 (ie half-time of 100 s; ref. Jegou et al, PLoS Biology 2011, Ooosterhert et al, NSMB 2023), a much lower rate than the observed change of the debranching rate (0.21 s-1). This means that the debranching rate is independent of the nucleotide state of daughter and mother filaments.

      • In new supp. Figure S4, we show that the debranching rate is similar for ADP-Arp2/3 complex branch junctions initiated from ADP- or ADP-BeFx-actin mother filaments.

      • In new supp. Figure S9, we initially exposed branch junctions to a BeFx solution then monitored debranching and branch renucleation in our standard buffer (ie without BeFX or Pi). We observed multiple rounds of branch renucleation, the first with ADP-BeFx-actin daughter filaments, and the following with daughter filaments never exposed to BeFx. They all had the same debranching rates and renucleation success rates.

      The paper is quite specialist to read and the advance appears to be incremental. My expertise is in molecular pathways to actin regulation outside the main area of the paper.

      The results we present in this study are often unexpected, and some go counter long-standing assumptions. The regulation of Arp2/3-nucleated branches is of importance for the stability and the force-generating capabilities of many actin networks in cells. Last, most of the measurements that we present had never been done, mainly because experiments are difficult to achieve, and require specific tools to monitor several events while controlling the applied force.

      We believe our results are of broad interest as they go counter long-standing assumptions. We have rewritten the text in several instances to convey our message more clearly.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Please find enclosed the review of the manuscript "Inorganic phosphate in Arp2/3 complex acts as a rapid switch for the stability of actin filament branches" by Xiao et al.

      The authors provide a detailed investigation of how the nucleotide bound to the Arp2/3 complex affects branch stability under flow force. From a kinetic perspective, this is an elegant study with generally high-quality data, although some conclusions rest on assumptions rather than direct experimental evidence.

      We thank the reviewer for their positive feedback. We have improved our manuscript and performed important additional experiments to provide more direct experimental evidence of our conclusions.

      A key question concerns the physiological relevance of these findings. For instance, the concept of branch regrowth may not be applicable in cellular contexts, since forces by actin polymerization would displace existing branches away from sites where they generate this active forces. The authors should clarify the relevance of regrowth during active force generation by branched networks.

      We thank the reviewer for this comment. Our in vitro results indeed point to a previously unreported property of branched actin networks, i.e. the ability of Arp2/3 complexes to readily renucleate branches in the ADP-Pi state and that it does require reloading ATP within Arp2/3.

      Branched actin networks, especially the lamellipodia or endocytotic patches, do exert active force thanks to actin polymerization of the individual branches at the forefront. Though, the whole actin network is exposed to stress, and the architecture of the network (inter-branch distance, crosslink between branches, ...) presumably strongly impact its mechanical properties.

      In the case of other types of branched actin networks, such as the actin cortex, myosin motor put the whole network under tension. Such pulling forces on actin branches, depending on the amplitude of the pulling force, can lead to branch regrowth, and network self-repair.

      We have modified the text to make the physiological relevance clearer.

      Additionally, all experiments employ flow conditions that branches would probably not experience in cells-notably, the flow direction in the cellular context would be reversed. Altering the flow direction relative to the branches could affect not only the relationship between flow rate and branch stability, but potentially other system properties as well.

      We agree with the reviewer that in cells branches will not experience flow conditions similar to the ones we use in our in vitro assay. Nonetheless, in cells we expect mechanical stress on the branch junction to be applied in all directions. In lamellipodia, the compressive force applied at the leading edge is expected to result in diverse local orientations of the force on individual branch junctions within the network (as explained in Lappalainen et al. Nat Rev MBC 2022). Also, branch junctions are found in the cell cortex, where they are exposed to pulling forces resulting from the action of myosin motors and crosslinkers on mother and daughter filaments.

      This impact of the direction of the flow was addressed in our previous publication (Ghasemi et al, Sc. Adv. 2024, figure 2) and, to a lesser extent, by the lab of Enrique de la Cruz in Pandit et al, PNAS 2020 (ref. 16). We reported that flow direction has a minimal effect, if any, on branch dissociation rate and renucleation ratio.

      Reviewer #3 (Significance (Required)):

      Furthermore, the study appears not to account for the mother filament (particularly its nucleotide state) or the actin subunit bound to the Arp2/3 complex. The authors should discuss why their interpretation focuses exclusively on the Arp2/3 complex rather than on the actin filaments or Arp2/3-bound actin subunit.

      We have checked that the nucleotide state (ADP-BeFx and ADP-Pi versus ADP) of the mother and daughter filaments has no impact on branch stability :

      • In the experiments shown in figure 2F, where the buffer condition to which branches are exposed is quickly changed from phosphate buffer to buffer without phosphate, we observe a rapid change of branch stability. Actin subunits at the branch junction are in F-actin conformation according to recent cyroEM observations (ref. Chavani et al, Nat Comm. 2024; Liu et al, NSMB 2024). These actin subunits, initially in the ADP-Pi state, are expected to age and become ADP with a rate of ~ 0.007 s-1 (ie half-time of 100 s; ref. Jegou et al, PLoS Biology 2011, Ooosterhert et al, NSMB 2023), a rate much lower than the observed change of the debranching rate (0.21 s-1). This means that the debranching rate is independent of the nucleotide state of daughter and mother filaments.

      • In new supp. Figure S4, we show that the debranching rate is similar for ADP-Arp2/3 complex branch junctions initiated from ADP- or ADP-BeFx-actin mother filaments.

      • In new supp. Figure S9, we initially exposed branch junctions to a BeFx solution then monitored debranching and branch renucleation in a regular buffer. We observed multiple rounds of branch renucleation, the first with ADP-BeFx-actin daughter filaments, and the following with daughter filaments never exposed to BeFx. They all had the same debranching rates and renucleation success rates.

      An important concern involves the use of KPi (inorganic phosphate). Based our experience, KPi appears to have effects beyond simply impacting nucleotide state-actin filaments seem to assemble differently in the presence of KPi. The authors should exercise caution in their interpretation of KPi-based experiments.

      Concentration of KPi (up to 50 mM Pi) did not slow down barbed end elongation rate in our experiments.

      Overall, while the technical quality and kinetic analyses are state-of-the-art, relating this work to physiological contexts remains challenging, and some conclusions appear overstated.

      We have made changes in the discussion to try to more clearly relate our in vitro observations and conclusions with the cellular context where branch renucleation could have a strong impact on the architecture and mechanics of actin networks.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Please find enclosed the review of the manuscript "Inorganic phosphate in Arp2/3 complex acts as a rapid switch for the stability of actin filament branches" by Xiao et al.

      The authors provide a detailed investigation of how the nucleotide bound to the Arp2/3 complex affects branch stability under flow force. From a kinetic perspective, this is an elegant study with generally high-quality data, although some conclusions rest on assumptions rather than direct experimental evidence.

      A key question concerns the physiological relevance of these findings. For instance, the concept of branch regrowth may not be applicable in cellular contexts, since forces by actin polymerization would displace existing branches away from sites where they generate this active forces. The authors should clarify the relevance of regrowth during active force generation by branched networks.

      Additionally, all experiments employ flow conditions that branches would probably not experience in cells-notably, the flow direction in the cellular context would be reversed. Altering the flow direction relative to the branches could affect not only the relationship between flow rate and branch stability, but potentially other system properties as well.

      Significance

      Furthermore, the study appears not to account for the mother filament (particularly its nucleotide state) or the actin subunit bound to the Arp2/3 complex. The authors should discuss why their interpretation focuses exclusively on the Arp2/3 complex rather than on the actin filaments or Arp2/3-bound actin subunit.

      An important concern involves the use of KPi (inorganic phosphate). Based our experience, KPi appears to have effects beyond simply impacting nucleotide state-actin filaments seem to assemble differently in the presence of KPi. The authors should exercise caution in their interpretation of KPi-based experiments.

      Overall, while the technical quality and kinetic analyses are state-of-the-art, relating this work to physiological contexts remains challenging, and some conclusions appear overstated.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Xiao et al examine the molecular events occurring when Arp2/3 complex-mediated actin filament branches are removed from mother actin filaments. They do this using microfluidics assay with purified proteins combined with single filament TIRF imaging of branched actin filaments with distinct fluorescent labels. The contribution of different nucleotide states of Arp2/3 complex are tested in conjunction with the relationship force exerted on the branches and regulatory protein involvement from GMF and cortactin. The data seem comprehensive and highly quantified in response to concentration, force, fraction of branches and survival times and branching rates. They find that ADP-BeFx and high phosphate concentrations (leading to the ADP-Pi state) leads to a slower debranching rate at a given level of force applied. The ability to rapidly switch the buffer gives powerful information about response times of debranching compared with other actin remodelling events. They use renucleation experiments to determine that the previous debranching event most often occurs at the Arp2/3 complex/daughter interface, showing that filaments will be ready to re-branch in the stable ADP-Pi bound state. GMF addition allows debranching of the ADP state to occur at a lower force. Cortactin acts similarly to the ADP-Pi state to increase branch stability.

      Specific comments

      The pulling force on the branches seems to arise from different flow rates in the microfluidics. Viscous drag is mentioned and I can see there is methylcellulose in the buffer. It would be helpful to have the explanation of the conversion between flow and force, even if it has been standard in previous work.

      Pg 5 - what was the motivation to titrate phosphate? It seems a stretch that intracellular Pi levels are tuning branching inside cells more than protein-mediated control (GMF or cortactin) - can the authors evidence this at all?

      Minor comments

      • In the introduction, while the structural and mutagenesis evidence is clearly stated, in other cases a bit more detail would be helpful e.g. 'biochemical studies', which referred measurement of hydrolysis rates using radiolabelling
      • Page 3 Figures shouldn't be referenced in the introduction
      • Page 3 slip bond behaviour needs explanation
      • Figure 1B seems to be a theoretical schematic which is superfluous
      • Figure 4D is helpful, different weight lines might help even more to explain the dominant pathways

      Referee cross-commenting

      Rev1 sounds like the specialist here. I can't comment on their requests. Some similar points arise between the reviewers which need addressing.

      Significance

      Taking a look at references 16 and 19, I do not find it clear what is achieved differently in the current work compared to these papers and what agrees and what disagrees. If it's a species difference I might expect the two species would be analysed side-by-side in this paper.

      I'm not understanding how the authors can distinguish effects of adding phosphate and BeFx on Arp 2 and 3 compared to effects on actin. Importantly, are possible accompanying changes in the actin filament a confounding factor?

      The paper is quite specialist to read and the advance appears to be incremental. My expertise is in molecular pathways to actin regulation outside the main area of the paper.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study investigates the mechanochemistry of Arp2/3-mediated branched actin networks at the level of individual branch junctions under load. Using microfluidic single-filament/branch force assays (including constant-force flow and open-chamber imaging) the authors quantify debranching, re‑nucleation, and mother- vs daughter‑interface stability across nucleotide states of Arp2/3 (ADP-Pi, ADP, and an ADP-BeFx proxy for ADP-Pi). They further test effects by two branch regulators (GMF and cortactin). Key findings include: (i) ADP-Pi and ADP complexes share similar force dependence but differ markedly (~20×) in intrinsic dissociation rate; (ii) phosphate turnover on the Arp2/3 complex is rapid ii) affinity for Pi drops when Arp2/3 loses its daughter filament; (iii) quantification from model fits uncovers large stability differences between daughter and mother interfaces of the Arp2/3 complex; (iv) extraordinary high stability of ADP-Pi-like Arp2/3 on the mother filament; and (v) distinct effects of GMF and cortactin on force‑dependent stability. Overall, the work combines technically demanding measurements with mechanistic modeling to probe how nucleotide state and regulatory factors tune branch mechanics.

      Major comments:

      1. Low force kinetics and completeness of survival curves (Figure 1). "For all forces, the surviving curves exhibited a clear single exponential behavior...." While the data can be fitted to monoexponential decay curves, data at low forces is clearly incomplete. >90% of branches have not dissociated by the end of the experiment. For the particular data shown in 1C (F00nN, n=60 total branches) it means that the time information is coming from <6 observations, which is rather low for the single molecule field. I am slightly worried by this point, since the debranching rates under ADP-Pi conditions at zero force, are even by one magnitude slower. Yet, no raw data is shown. Given that the dissociation rate at low forces is a contentious point, the authors should show the raw data and the corresponding fits. At present, they only show an experimental scheme and images for these "open chamber" assay (Fig S2). Ideally, they would image for much longer than 900s with lower sampling time in those assays, to firmly establish that 20-fold difference also holds at 0 force.

      Essential; experiment might already be performed. Otherwise straightforward to do (weeks time).

      1. Stability Analysis (Figure 4). I can follow much of the arguments presented in the stability analysis of the daughter vs mother interfaces, which is in principle extremely interesting! However, there are some concerns here:

      i) The authors emphasize the zero force ratio derived from fits (which is linked to the stability difference of the two interfaces in the absence of force) despite this being only weakly constrained by data. Intuitively in the model, the stability difference should grow to very large values as the re-nucleation ratio approaches 1 at low force. This combined with the noise in the data poses an issue in my opinion. Looking at the data and the error margin, I think that the authors cannot state with high confidence that there is a real difference between the relative stability of the daughter and mother interfaces between the two nucleotide states of the complex.

      Essential; analysis and textual revision only

      ii) For ADP-Pi, the renucleation ratio essentially remains flat over the measured force range. Hence, the data can only provide little leverage to estimate both the zero force ratio and, more importantly, the differential distance to the transition state in the slip-bond model in my opinion, which will show in the crossover force. Consequently, the quoted ">100×" stability difference at F=0 and the crossover force >20pN are driven largely by extrapolation rather than direct constraint by data. Given the high number of free parameters in the model, I would anticipate that several crossover forces and differential distances might explain the data nearly equally well. Instead of loosely reporting exact number from fits, I would have hoped for some sort of sensitivity analysis, for instance relying on profile likelihoods. Also parameter values could be reported as bounds (e.g crossover force≫measured range) rather than precise point estimates. This issue re-occurs (albeit not as drastically) for the cortactin experiments (Figure 6).

      Essential; analysis and textual revision only

      iii) One important expectation from the "two slip bond" model is that branch dissociation rates should not necessarily scale mono-exponentially as they mostly do over the accessible force range of the paper. However, once the "minor" pathway of dissociation from the mother starts to dominate at high forces, rates become more force sensitive. This is nicely recaptured by the model fits in Figure S6 but deserves some explanation in the text. Otherwise, people will simply remember the "ADP-Pi is 20-fold more stable than ADP at all forces" message.

      Essential; textual revision only

      iv) One important prerequisite for the model is that isolated Arp2/3 complexes (without a daughter filament) should dissociate with equal rates from mother filaments at all flow rates. Since the Arp2/3 complex prefers mother filament curvature, forces experienced by the mother might change its off-rate. It would be good to refer to this assumption in the text and experimentally verify it. I could not find it in the paper nor in Ghasemi et al 2024.

      Essential; simple experiment (a weeks time).

      v) The force dependence of the branch re-nucleation rate (Fig 3D) has been measured previously by the same group (Ghasemi et al). While the data in the older paper has not been fitted by a model, the trend of the data in the previous paper looks conspicuously different. Are there any explanations for this? I speculate that it might be related to actin and ATP not being saturated (low-force re-nucleation rate rarely exceeds 80%) in Ghasemi et al., but it would be good to know what the authors think about this.

      Essential; textual revision only 3. Stability of the authentic ADP-Pi-Arp2/3 complex on the mother filament. The extraordinary stability of the isolated ADP-BeFx-Arp2/3 complex on mother filaments is surprising, especially considering that both ATP and ADP states are much more labile (Ghasemi et al 2024). I would recommend repeating this experiment in the authentic ADP-Pi state with labelled Arp2/3 complexes as a more direct readout, even if this would require working with very high phosphate concentrations.

      Essential; simple experiment (a weeks time).

      OPTIONAL: Further, but beyond the scope of the present paper, would be titrating phosphate in these experiments, which would even allow the authors to independently verify the reduced Pi affinity for Arp2/3 in the mother filament. Of note, this affinity difference is needed to satisfy detailed balance in the reaction scheme (Fig 4 D)! 4. Importance of "surviving" ADP-Pi-Arp2/3 complexes. The authors show a) rapid turnover of Pi on the ADP-Arp2/3 complex in both branch- or mother filament-bound state and b) the lowered Pi affinity of the latter. Nonetheless, they emphasize the importance of long-lived "surviving" ADP-Pi bound complexes on the mother (even stated in the abstract). I understand that this fraction shows under some experimental conditions (BeFx), but unless I am missing something, most complexes should rapidly lose their phosphate and either exchange nucleotide or dissociate from the mother under physiological conditions. Please clarify or tone done.

      Essential; textual revision only 5. GMF mechanism. The authors claim that GMF "...accelerates the departure of the surviving Arp2/3 complex from the mother...". I assume that they infer this from decrease in the re-nucleation ratio. However, alternatively GMF could simply dwell on the complex, inhibiting re-nucleation without promoting dissociation from the mother. The authors should either monitor Arp2/3 dwell times directly to discriminate between these possibilities or be more cautious in their conclusions.

      Essential; simple experiment (a weeks time) or textual revision. 6. Cortactin mechanism and the "leash model". I must say that the cortactin data are the most puzzling part of the paper and had to reconcile with what we know from structure. I was hoping to find some of this resolved in the discussion. However, I do not understand the "leash model" in the discussion section for cortactin-mediated branch stabilization: "This would explain the observed increase in branch survival compared to the absence of cortactin. As the pulling force is increased, this rebinding mechanism becomes less efficient." According to my understanding of the data, this is opposite to what happens. Cortactin only stabilizes the labile interface at elevated forces! Some re-writing might help here.

      Essential; textual revision.

      Minor comments

      Organization:

      • I do not want to impose on how to best tell the story, but I felt that Fig1 A-D and Fig 2 A-B belong to one logical unit (nucleotide dependence), whereas Fig 1 E-F and Fig 2 C belong to the other (Pi binding and exchange). Perhaps consider re-organizing to streamline presentation?

      Semantics/Typos:

      • Abstract: „... ADP-Pi and ADP-Arp2/3 detach with the same exponential increase as a function of force...". Increase should refer to the dissociation rate, which should be added to the sentence.
      • Results page 8: "...and the majority of Arp2/3 complexes detach from the mother filament while remaining bound to the branch at the debranching time." "Branch" should likely be daughter here, as there is no branch after dissociation of either interface.
      • Results page 13: "Exposing ADP-BeFx-Arp2/3 complex branch junctions to a saturating amount of GMF...". It is strange to imply saturation, because GMF likely simply does not bind to the complex in this nucleotide state with appreciable affinity. Suggest to change to "high".
      • Discussion page 18: "Moreover, in mammalian Arp2/3, His80 in Arp3 (corresponding to His73 in mammalian actin) is not methylated, and corresponds to residue N77 in Arp3, which is also not modified." N77 likely belongs to Arp2?
      • Discussion page 19: "We showed that Pi affinity for Arp2/3 complexes at branch junctions is around 3.7 mM (Fig. 1), a value which lies within the reported 1-10 mM Pi concentration measured in the cytosol in different mammalian cell types". Notably, this is not too different from F-actin, which should be mentioned. By this measure alone, free inorganic phosphate could also directly regulate actin filament stability!

      Future interest (non essential):

      • It would be utterly exciting (but beyond current scope) to quantify how instantaneous debranching rates evolve for naturally aging branches starting from ATP-Arp2/3 complexes!

      Significance

      General assessment:

      This is a compelling and carefully executed study that delivers a clear mechanistic framework for how Arp2/3 branch junctions fail and re‑form under load. The central strength is the tight integration of state‑of‑the‑art reconstitutions with careful and original kinetic analysis. The experimental design is elegant and experiments have been carried out to a masterful standard. The figures are clear, the statistics are appropriate with some exceptions as detailed above. There are very few labs in the world that could have achieved this feat!

      A few aspects could be further strengthened, most notably the explanation and application of the "two slip bond" model as well as slightly more restraint in speculating around specific regulatory mechanisms. However, these are minor refinements that do not detract from the important contributions of the paper.

      Overall, the clearly work merits publication with high priority after revision; most requested changes are textual/analytical with very few targeted experiments, which would substantially strengthen core claims.

      Advance relative to prior literature:

      The major novel findings of the paper are already summarized above. There is some recent work done on the subject of branch mechanics by the authors (Ghasemi et al 2024, PMID: 38277459) and others (Pandit et al 2020 PMID: 32461373), but the focus of the present work is clearly unique and the there is plenty of novel insight.

      Audience and impact:

      Primary audience: specialists in cytoskeleton dynamics, in vitro reconstitution single molecule biophysics, and mechanobiochemistry. Secondary: researchers in cell motility, morphogenesis and mechanobiology, physicists working on active matter and modelers studying force producing and load-bearing biopolymer networks. The results and analysis framework should inform quantitative models of branched network turnover under load and the interpretation of regulatory factor action in vivo and in cells.

      Reviewer expertise:

      Actin dynamics; biochemical reconstitution; single molecule approaches; biophysics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      • *

      Background and unknown in the field:

      This study investigates how fibroblast alignment influences the migration of intestinal epithelial cells, contributing to tissue integrity and repair. It is well established that intestinal fibroblasts are important regulators in the tissue through their ability to secrete essential paracrine factors for the epithelium. However, it is less well understood if they also play additional structural, tissue architecture instructing role and how the communication between the fibroblasts and the epithelia is regulated.

      Advance over state of the art:

      Here the authors have set-up an elegant three-component system to investigate this. They have gone beyond the recent advances of culturing intestinal and colonic organoids in 2D (in a manner that preserves- and villus-like organization) and bioengineered epithelial-stromal model comprising organoid-derived intestinal epithelial cells (IECs), primary intestinal fibroblasts, and a basement membrane matrix. Using this model, they have uncovered fibroblasts enhancing the directed and persistent migration of intestinal epithelial cells (IECs). They used scRNAseq to carefully analyse the stromal cell populations present in their co-cultures of primary mouse intestinal subepithelial fibroblasts and organoid-derived intestinal mouse epithelial cells. They observed that this reflected well the stromal cell-type composition as well as the paracrine activity previously reported for these cells in tissue. Using a clever system with Matrigel and an elastomeric barrier, the authors were able to induce non-epithelial gaps in different scenarios (IECs alone or with fibroblasts or with conditioned media) and observe the wound-closure as well as the presence of specific cell types. They observed that the epithelial monolayers showed significant gap closure when in direct contact with fibroblasts compared to controls. Interestingly, the enhanced efficiency of epithelial migration and gap closure, in the presence of fibroblasts, was independent of PGE-EP4 signaling and was not due to differences in cell proliferation. Instead, the imaging revealed that the fibroblasts were in direct contact with the epithelium. The authors observed that in the absence of fibroblasts the migration properties of cells in the villus and the crypt regions were dramatically different and the fibroblast presence was necessary to efficiently synchronize these to support gap closure. In addition, the presence of fibroblasts enhanced the directionality of the epithelial cell migration. Detailed imaging and image analyses revealed that gap closure involved activation of the fibroblasts and co-ordinated coalignment of IECs and fibroblasts. They also explored matrix deposition of the fibroblasts during the process and found that they deposited aligned ECM fibers that guide epithelial migration. Mere cell-derived matrix (devoid of live fibroblasts) was able to partially recapitulate the fibroblast-coordinated epithelial migration that the fibroblast generated matrix and its alignment are key contributors to the phenotype.

      Comments:

      This is overall a very interesting and well-written study. The imaging and the image analysis are state-of-the art and the bioengineered model is an exciting advancement over current methods developed by these researchers and others. This study meets all the criteria for a publication in the since that all the experiments seem to be carefully conducted, with appropriate controls and sufficient quantifications and statistics. The claims made by the authors are supported by the data. This is currently suitable to be published as a method/protocol and as a descriptive study uncovering interesting cross-talk and co-dependencies of epithelial and stromal cells during injury repair. There are of course aspects that could improve the study further like more mechanistic insight into the underpinnings of the direct epithelia-fibroblast interaction and its involvement in the directed IEC migration. However, these may be topics to investigate in a future study.

      • *

      Reviewer #1 (Significance (Required)):

      • *

      The strengths of the study are the highly in vivo relevant model system that is amendable to imaging and detailed image analysis of distinct cell populations. This may be adapted by others in in the field and has the potential to transform the way cell dynamics in the intestinal epithelium are visualized and investigated in vitro

      • *

      We thank the reviewer for their thoughtful and positive assessment of our work, and their recognition of the relevance of the bioengineered epithelial-stromal model and its potential for quantitative imaging and analysis of epithelial and fibroblast dynamics.

      We agree that further mechanistic insight into epithelial-fibroblast crosstalk would strengthen the study. While the current manuscript establishes this tractable system and identifies a role for fibroblast organization and matrix alignment in coordinating epithelial migration, we also aim to deepen the mechanistic understanding in the revision. As outlined in our response to Reviewer 2, we will perform additional experiments to further investigate the epithelial-fibroblast crosstalk and force-dependent interactions underlying this process.

      We believe that these additions will complement the current findings and strengthen the conceptual contribution of the study beyond its methodological advances.

      • *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • *

      Please find enclosed my review comments on the manuscript entitled "Fibroblast alignment coordinates epithelial migration and maintains intestinal tissue integrity" by Jordi Comelles et al.

      In this manuscript, the authors use a bioengineered epithelial-stromal system composed of organoid-derived intestinal epithelial cells, primary intestinal fibroblasts, and a basement membrane matrix to show that direct physical interactions between fibroblasts and epithelial cells drive a large-scale organization of the fibroblast network. This spatial reorganization, in turn, promotes persistent and oriented migration of epithelial cells, ultimately enabling restoration of the intestinal epithelium in an in vitro gap-closure assay. Overall, while the authors use an elegant in vitro model to study intestinal wound closure, and more specifically the role of fibroblasts in this context, I find this manuscript not suitable for publication in its present form. The data are overinterpreted, the novelty is limited, and the molecular mechanisms underlying WAE-fibroblast interactions are insufficiently addressed.

      • *

      We thank the reviewer for their contribution to the revision process with their valuable assessments. We will address their specific points below.

      • *

      Figure 1 - What are the units of the "fraction gap closure" shown in panels d and e? Is it expressed as a percentage?

      We thank the reviewer for pointing this out. The "fraction of gap closed" was calculated as (A(t = 0h)-A(t))/A(t = 0h), where A(t = 0h) corresponds to the initial gap area and A(t) is the area of the gap measured at the time point t. With this definition, the fraction of gap closed is dimensionless, it is 0 at the initial time point, will reach 1 if the gap is fully closed and will have negative values if the gap area increases beyond the initial size, as observed in some replicates of the control condition. To avoid misinterpretation, we will express this quantity as a percentage (i.e., multiplied by 100), as suggested by the reviewer. Moreover, we realized it was ill defined in the methods section. This will be corrected as well in the revised version.

      • *

      "Actually, epithelial monolayers achieved the most effective gap closure when cultured in direct physical contact with fibroblasts (Figure 1e and Movies 2 and 3)." From the data shown in panels c, d, and e, it appears that fibroblast-conditioned medium alone promotes efficient gap closure, comparable to the + fibroblast condition.

      We agree with the reviewer that the original closing sentence overstated the effect. While both fibroblast-conditioned medium and direct fibroblast contact promote efficient gap closure compared to control conditions, the data do not support a consistent difference between these two conditions. We will therefore remove this statement in the revised version to more accurately reflect the results.

      • *

      Figure 2 - The use of a cell proliferation inhibitor during the gap-closure assay would help determine the contribution of cell proliferation at the migration front.

      We agree with the reviewer that inhibiting proliferation would help assess the contribution of cell proliferation to gap closure. However, in the 2D gap-closure assay, our Ki67 immunostaining showed no significant differences in the proportion of proliferative cells between conditions, either within the monolayer or at the migration front. This suggests that differential proliferation is unlikely to account for the differences in gap closure observed between control and fibroblast-containing conditions.

      We note that, in a separate 3D organoid assay, fibroblast-derived signals induced a WAE-like transcriptional program associated with reduced Ki67 mRNA expression, indicating that fibroblasts can promote a more migratory epithelial state without increasing proliferation. Thus, while proliferation may contribute to epithelial homeostasis and repair, our data do not point it as the main determinant of the differences observed in the 2D gap-closure phenotypes.

      In addition, pharmacological inhibition of proliferation would likely perturb the homeostasis of the organoid-derived epithelial monolayers, in which proliferative crypt compartments are essential, and would be difficult to restrict to epithelial cells without also affecting fibroblasts in co-culture. For these reasons, although such experiments could inform the general contribution of proliferation to gap closure, we do not think they would directly clarify the differences observed between conditions in our system.

      • *

      Figure 2f and 2g - Has a dose-dependent effect of PGE2 been tested?

      We thank the reviewer for pointing this out. We did not perform a dose-response analysis of PGE2 in this study, as our aim was to assess the involvement of the PGE2-EP4 axis rather than to characterize its quantitative dynamics. We therefore selected a concentration based on previous work demonstrating dose-dependent induction of the WAE program in 3D organoid systems (Miyoshi et al., 2017). In that study, 1 µM PGE2 was sufficient to induce a significant increase in the WAE marker Cldn4, and we used this concentration as a biologically relevant reference condition. We will clarify this in the methods section.

      • *

      Figure 2i - The + fibroblast + EP4i condition (pink) is missing.

      We thank the reviewer for pointing this out. The + fibroblast + EP4i condition is present in the plot but not visually distinguishable because it overlaps with the + fibroblast condition and is therefore masked by it. As shown in Figure S4e, the + fibroblast + EP4i condition falls within the variability range of the + fibroblast condition. To improve clarity, we will revise the figure to ensure that this condition is visually identifiable.

      • *

      "This suggests a mechanical or contact-mediated role for fibroblasts in preserving epithelial integrity and promoting coordinated migration beyond their paracrine signaling." While PGE2-EP4 signaling does not appear to be involved in the fibroblast-mediated enhancement of gap-closure efficiency, the conclusion that physical interactions are more important than paracrine effects is overstated. For instance, an experimental condition in which fibroblast-conditioned medium is inactivated (boiling for 5 minutes) would strengthen this conclusion. In addition, inhibition of actomyosin contractility in fibroblasts would be informative.

      Figure 3 - The data presented here do not convincingly support the dismissal of conditioned medium as a contributing factor. The differences between the + fibroblast-conditioned medium and + fibroblast conditions are modest. In both cases, epithelial cells migrate and gaps close.

      We agree with the reviewer that inhibition of actomyosin contractility in fibroblasts would provide valuable insight into the role of force-dependent interactions in epithelial-stromal coupling. However, pharmacological inhibitors of the Rho-ROCK-myosin pathway (e.g., blebbistatin, ML-7, or the ROCK inhibitor Y-27632) would also affect epithelial contractility in our co-culture system, making it difficult to specifically attribute any observed effects to fibroblast mechanics.

      We also agree that paracrine signaling plays an important role in epithelial gap closure. Indeed, supplementation of control media with PGE improves gap closure compared to control conditions, although it does not reach the levels observed with fibroblast-conditioned medium, suggesting that additional soluble factors contribute beyond the PGE-EP4 axis. However, time-lapse imaging revealed direct and dynamic interactions between fibroblasts and epithelial cells (Movie 6; Figure S5a-d; Movie 7), which prompted us to further investigate the contribution of physical interactions, as addressed in Figure 3.

      In Figure 3, we analyzed migration at the single-cell level, in contrast to the tissue-level measurements used for gap closure quantification. In organoid-derived intestinal monolayers, two distinct compartments can be identified: crypt-like and villus-like regions. In vivo, these compartments exhibit different migration behaviors: cells in the crypt are primarily displaced due to crowding, whereas cells in the villus actively migrate, as suggested by the presence of cryptic lamellipodia (Krndija et al., 2019). Consistent with this, tracking individual cells revealed that crypt cells are largely static, while villus cells migrate toward the gap. This compartmentalized behavior was observed in both control and fibroblast-conditioned medium conditions. Strikingly, in the presence of fibroblasts, this differential behavior was reduced, resulting in coordinated migration of both crypt and villus regions.

      This mismatch between compartments in control conditions may contribute to the appearance of discontinuities ("holes") within the epithelial layer during migration. In control experiments, these defects failed to close, whereas in conditioned medium they closed slowly or incompletely. In contrast, in the presence of fibroblasts, these disruptions were rapidly and efficiently resolved, indicating improved tissue integrity.

      Additionally, analysis of individual trajectories near the migration front showed that cells exhibit significantly increased directional persistence (i.e., movement aligned with the direction of gap closure) in the presence of fibroblasts compared to conditioned medium alone.

      Taken together, while paracrine signaling from fibroblasts contributes to epithelial migration and gap closure, the physical presence of fibroblasts induces qualitative changes in epithelial behavior, including coordinated migration across compartments, improved hole closure, and enhanced directional persistence.

      • *

      Figure 4a - "Upon removal of the barrier (t = 0 h), fibroblasts at the epithelial front were small and evenly distributed, with no prominent α-SMA fibers present." Here, fibroblasts are α-SMA positive but not elongated. α-SMA may therefore not be the most appropriate marker. What are the levels of phosphorylated MLC2? These may increase during wound closure. Also, fibroblasts culture promotes aSMA expression, therefore, it may be possible that the fibroblasts used in this assay may not represent the healthy fibroblasts found in vivo.

      We agree with the reviewer that fibroblasts are α-SMA positive at early time points but are not yet elongated. In our system, we observe that α-SMA is already present at t = 0 h, while fibroblasts progressively elongate and reorganize α-SMA into prominent fiber structures over time. This suggests that changes in α-SMA organization, rather than its initial presence, are associated with fibroblast activation during gap closure.

      We note that baseline α-SMA expression may be influenced by in vitro culture conditions prior to the assay, which could differ from the state of fibroblasts in vivo. We will clarify this point in the Discussion to better contextualize our observations relative to native fibroblast populations.

      In addition, we agree that assessing phosphorylated myosin light chain 2 (pMLC2) levels would provide complementary information on contractile activity. We will therefore perform pMLC2 staining, as suggested, to further evaluate force generation by fibroblasts during the wound closure process.

      • *

      Figure 5 - Fibroblast alignment could also result from paracrine signals secreted by epithelial cells. This possibility should be tested.

      We thank the reviewer for this suggestion. To test whether fibroblast alignment could be driven by epithelial-derived paracrine signals, we will culture fibroblasts in conditioned medium collected from epithelial monolayers undergoing gap closure (control condition without fibroblasts) and quantify their alignment over time. This will be compared to fibroblasts maintained in standard fibroblast medium.

      This experiment will directly assess whether epithelial-derived soluble factors are sufficient to induce fibroblast alignment, or whether direct physical interactions are required.

      • *

      In summary, this manuscript demonstrates that epithelial cells migrate more efficiently on extracellular matrix proteins deposited and oriented by fibroblasts. This concept is not novel. Identifying the molecular mechanisms governing interactions between WAE and subepithelial fibroblasts would significantly enhance the novelty and impact of this study.

      • *

      Reviewer #2 (Significance (Required)):

      • *

      In this manuscript, the authors use a bioengineered epithelial-stromal system composed of organoid-derived intestinal epithelial cells, primary intestinal fibroblasts, and a basement membrane matrix to show that direct physical interactions between fibroblasts and epithelial cells drive a large-scale organization of the fibroblast network. This spatial reorganization, in turn, promotes persistent and oriented migration of epithelial cells, ultimately enabling restoration of the intestinal epithelium in an in vitro gap-closure assay. Overall, while the authors use an elegant in vitro model to study intestinal wound closure, and more specifically the role of fibroblasts in this context, I find this manuscript not suitable for publication in its present form. The data are overinterpreted, the novelty is limited, and the molecular mechanisms underlying WAE-fibroblast interactions are insufficiently addressed.

      *We thank the reviewer for this thorough and critical assessment. We have clarified the overstatements in the rebuttal and we will modify the text to address concerns regarding overinterpretation and clearly acknowledge the limitations of our approach. In particular, we will refine the framing of the study to better distinguish between the contributions of paracrine signaling and physical epithelial-stromal interactions. *

      *To address the reviewer's concerns regarding mechanism and novelty, we will perform additional experiments aimed at further characterizing epithelial-stromal cross-talk, and experiments to assess fibroblast contractility and its contribution to epithelial coordination. *

      We believe that these revisions and proposed experiments will strengthen the manuscript and clarify its conceptual contribution.

      • *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      • *

      Summary:

      - Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The study by Comelles et al. focuses on how primary intestinal fibroblasts contribute to organoid-derived intestinal epithelial migration in wound healing assays. Using fibroblast-epithelial co-cultures in a 2D in vitro gap closure system, the authors found that direct interaction with fibroblasts drives cohesive and directed migration of intestinal epithelia toward the gap. They further propose that long-range fibroblast alignment promotes the deposition of extracellular matrix (ECM) proteins in an oriented fashion, contributing to directed epithelial migration.

      Major comments:

      - Are the key conclusions convincing?

      Some of the key conclusions of this manuscript are not entirely convincing given the available data. The manuscript would benefit from additional evidence and/or clarifications to support their conclusions. See comments below.

      • *

      - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      (Fig 4a) The authors claim that fibroblasts become activated during gap closure as evidenced by the enhanced assembly of a-SMA fibers 24 hours following barrier removal. Yet, long a-SMA fibers are also observed when fibroblasts are cultured in the absence of epithelial cells or barrier removal (Fig. S1b). To support this conclusion, the authors should consider including additional controls to account for potential time-dependent assembly of a-SMA fibers (e.g., fibroblast-only control).

      We thank the reviewer for pointing this out. We agree that a fibroblast-only control would be important to account for potential time-dependent assembly of α-SMA fibers. We will therefore perform additional experiments monitoring α-SMA organization in fibroblasts cultured alone over time, which will allow us to better interpret the dynamics observed in the co-culture conditions.

      • *

      (Fig. 5a) The authors conclude that fibroblasts align parallel to the direction of epithelial migration during gap closure. While quantifications are convincing, again, a fibroblast-only control accounting for time-dependent spreading and elongation (as seen in Fig. S1) is missing. Including such a control would strengthen their claim that alignment is specific to the gap closure context rather than a time-dependent phenotype.

      We agree with the reviewer that, given the intrinsic ability of fibroblasts to form ordered domains with long-range alignment, this control would be highly informative. We will therefore quantify fibroblast alignment over time in fibroblast-only cultures, which will allow us to determine to what extent the long-range organization observed in co-culture is specific to the gap closure context.

      • *

      (Fig 6) The authors claim that fibroblast-derived aligned ECM drives directional epithelial migration. While fibronectin fibers appear scarce and weakly aligned with the direction of migration, laminin and type IV collagen fibers are barely detectable (Fig. 6f). This may reflect a defect in ECM deposition rather than fiber alignment, which contrasts with Fig. S1, where fibroblasts are shown to deposit and assemble laminin and type IV collagen fibers. One possible explanation is that primary fibroblasts were not cultured long enough to allow robust ECM deposition. Alternatively, the observed effect may be specific to fibronectin, which is consistent with fibroblasts being its major source. The authors should revise their interpretation or provide additional evidence to support their current claim.

      We thank the reviewer for this important point. We agree that differences in ECM signal within the gap may reflect not only fiber alignment but also differences in the amount of protein deposited. In the +fibroblast condition, fibroblasts in the gap have more time to secrete ECM compared to the "empty gap" condition, where fibroblasts remain confined beneath the epithelium.

      In addition, the presence of Matrigel likely masks the contribution of certain ECM components, making laminin or type IV collagen more apparent than fibronectin. We will therefore revise the interpretation of these results to explicitly acknowledge the contribution of ECM abundance in addition to alignment.

      • *

      (Fig 6i) The authors propose that the presence of ECM alone within the gap enhances epithelial gap closure compared to empty gap conditions, although gap closure remains less effective than in the presence of primary fibroblasts. From the figure legend and methods, it seems that the decellularized ECM condition is generated using NIH-3T3 fibroblasts cultured for 8 days, whereas the other conditions used primary fibroblasts cultured for 1 day (Fig. 6a-h). This comparison is confounded by differences in cell source and ECM deposition time. If I am misunderstanding this, please clarify, otherwise consider repeating the decellularized ECM condition using primary fibroblasts and matching culture times for a fair comparison. Along these lines, please include images showing that ECM fibers remain intact following decellularization.

      We thank the reviewer for this suggestion. We will include additional staining to confirm that ECM fibers remain intact after decellularization in the revised version.

      Regarding the use of NIH-3T3 fibroblasts for CDM generation, this choice was made to minimize potential residual paracrine signaling from primary intestinal fibroblasts after decellularization. We acknowledge that this introduces differences in cell source.

      Concerning culture time, we followed established protocols for CDM formation, which recommend extended culture periods ({greater than or equal to}8 days) to allow robust ECM deposition (Cukierman et al., 2001; Franco-Barraza et al., 2016; Godeau et al., 2020). We will clarify these points in the revised manuscript and discuss the limitations associated with these differences.

      • *

      - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Yes. The additional experiments outlined above would help support the current conclusions of the manuscript, rather than to explore new directions beyond its scope.

      • *

      - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes, the additional experiments primarily involve the inclusion of controls and additional immunofluorescence imaging to their existing experimental setups. They should be relatively straightforward to implement (~2-3 months).

      • *

      - Are the data and the methods presented in such a way that they can be reproduced?

      Yes.

      • *

      - Are the experiments adequately replicated and statistical analysis adequate?

      Overall, yes. But some plot legends should specify the number of replicates analyzed (e.g. Fig. 2b, Fig. 2d, Fig. 3h).

      We will review and correct these issues.

      • *

      Minor comments:

      - Specific experimental issues that are easily addressable.

      (Fig. 1c-e) The authors state that intestinal epithelial monolayers exhibit the most effective gap closure when in direct contact with fibroblasts. However, fibroblast-conditioned media and co-cultures show comparable gap closure efficiencies (Fig. 1e). The authors should consider revising this interpretation based on the provided data.

      We thank the reviewer for pointing this out, which was also raised by Reviewer 2. As discussed above, we agree that the original statement overstated the effect. Both fibroblast-conditioned medium and direct fibroblast contact promote efficient gap closure compared to control conditions, and we will revise the text accordingly to reflect that no consistent quantitative difference is observed between these two conditions.

      • *

      (Fig. 3b) The authors suggest that crypt-like epithelial cells undergo migration when grown on fibroblasts, but not in conditioned media alone. This is interesting, but it is not clear how they identify crypt-like cells for tracking. The authors should clarify if crypt-like cells are defined based on markers or inferred from their morphology.

      We thank the reviewer for this comment. In these tracking analyses, crypt-like cells were identified based on morphology. As shown in Figure S3 and in Larrañaga et al., 2025, crypt-like cells, defined by specific molecular markers, are significantly smaller than villus-like cells and form high-density regions. These features allow their identification based on morphology in fluorescently labeled monolayers. We will clarify this criterion in the Methods section of the revised manuscript.

      • *

      (Fig 3f-h) The authors conclude that fibroblasts promote directed epithelial cell motility based on cell trajectory analysis. Although they state that this analysis is performed on epithelial monolayers, their tdTomato epithelial population appears sparse in some conditions (control and conditioned media; Fig. S6a). Such variability in cell density may bias measurements of migration directionality at the cell-level, unless a mixed population is being used for tracking. The authors should clarify whether this analysis was indeed conducted on confluent monolayers.

      We thank the reviewer for this comment. For trajectory analysis, we used a mixed population of tdTomato-positive and non-fluorescent epithelial cells in some experiments to facilitate individual cell tracking. Importantly, epithelial monolayers were confluent in all conditions analyzed. We will clarify this in the Methods section.

      • *

      (Fig 6b) Their gap closure experimental setup indicates that fibroblasts are cultured on a Matrigel-coated surface, which should already contain abundant laminin and type IV collagen. Thus, it is unclear why type IV collagen is not detected underneath fibroblasts. The authors should explain why this is the case for clarity.

      We thank the reviewer for pointing out this observation. Indeed, fibroblasts are cultured on a Matrigel-coated surface which contains laminin and collagen type IV among many other components. We observed thick collagen-rich structures between the fibroblasts and the epithelia that we atributed, not only to fibroblasts' secreted collagen, but also a rearrengement of the collagen available in the coated surface. We will clarify this in the discussion of the revised version for clarity.

      • *

      - Are prior studies referenced appropriately?

      Yes

      • *

      - Are the text and figures clear and accurate?

      Mostly. Figures 6d and 6g seem to be duplicated by mistake.

      We thank the reviewer for noting this. We will correct this mistake.

      • *

      - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      There are some missing frames in Movie 2. If they are not available, it's okay to include black frames, so that the sequence remains consistent with the timestamps.

      The authors may consider using asterisks as significance indicators instead of reporting precise p-values directly on their plots. Having this format would facilitate visual comparison of statistical significance across conditions.

      Displaying single channels of experiments where co-cultures are used would help to better interpret their data.

      We thank the reviewer for pointing out these issues and for their valuable suggestions. We will correct the errors in the movie and improve the presentation as suggested where possible.

      • *

      Reviewer #3 (Significance (Required)):

      • *

      - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This study provides a valuable contribution to understanding how fibroblasts influence intestinal epithelial migration. The main advance lies in the use of a co-culture system combining organoid-derived intestinal epithelial cells that assemble into a crypt-villus organization with primary intestinal fibroblasts in a 2D gap closure system. This approach allows the authors to examine epithelial-fibroblast interactions in a more physiologically relevant context compared to prior work.

      We thank the reviewer for their positive assessment of the significance of our work.

      • *

      - Place the work in the context of the existing literature (provide references, where appropriate).

      Addressed above.

      • *

      - State what audience might be interested in and influenced by the reported findings.

      Cell and developmental biology, extracellular matrix biology, tissue regeneration.

      • *

      - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Tissue morphogenesis, cell motility, extracellular matrix dynamics.

      We thank the reviewer for their positive assessment and for their suggestions to improve the manuscript.

      • *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      • Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The study by Comelles et al. focuses on how primary intestinal fibroblasts contribute to organoid-derived intestinal epithelial migration in wound healing assays. Using fibroblast-epithelial co-cultures in a 2D in vitro gap closure system, the authors found that direct interaction with fibroblasts drives cohesive and directed migration of intestinal epithelia toward the gap. They further propose that long-range fibroblast alignment promotes the deposition of extracellular matrix (ECM) proteins in an oriented fashion, contributing to directed epithelial migration.

      Major comments:

      • Are the key conclusions convincing?

      Some of the key conclusions of this manuscript are not entirely convincing given the available data. The manuscript would benefit from additional evidence and/or clarifications to support their conclusions. See comments below. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      (Fig 4a) The authors claim that fibroblasts become activated during gap closure as evidenced by the enhanced assembly of a-SMA fibers 24 hours following barrier removal. Yet, long a-SMA fibers are also observed when fibroblasts are cultured in the absence of epithelial cells or barrier removal (Fig. S1b). To support this conclusion, the authors should consider including additional controls to account for potential time-dependent assembly of a-SMA fibers (e.g., fibroblast-only control). (Fig. 5a) The authors conclude that fibroblasts align parallel to the direction of epithelial migration during gap closure. While quantifications are convincing, again, a fibroblast-only control accounting for time-dependent spreading and elongation (as seen in Fig. S1) is missing. Including such a control would strengthen their claim that alignment is specific to the gap closure context rather than a time-dependent phenotype. (Fig 6) The authors claim that fibroblast-derived aligned ECM drives directional epithelial migration. While fibronectin fibers appear scarce and weakly aligned with the direction of migration, laminin and type IV collagen fibers are barely detectable (Fig. 6f). This may reflect a defect in ECM deposition rather than fiber alignment, which contrasts with Fig. S1, where fibroblasts are shown to deposit and assemble laminin and type IV collagen fibers. One possible explanation is that primary fibroblasts were not cultured long enough to allow robust ECM deposition. Alternatively, the observed effect may be specific to fibronectin, which is consistent with fibroblasts being its major source. The authors should revise their interpretation or provide additional evidence to support their current claim. (Fig 6i) The authors propose that the presence of ECM alone within the gap enhances epithelial gap closure compared to empty gap conditions, although gap closure remains less effective than in the presence of primary fibroblasts. From the figure legend and methods, it seems that the decellularized ECM condition is generated using NIH-3T3 fibroblasts cultured for 8 days, whereas the other conditions used primary fibroblasts cultured for 1 day (Fig. 6a-h). This comparison is confounded by differences in cell source and ECM deposition time. If I am misunderstanding this, please clarify, otherwise consider repeating the decellularized ECM condition using primary fibroblasts and matching culture times for a fair comparison. Along these lines, please include images showing that ECM fibers remain intact following decellularization. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Yes. The additional experiments outlined above would help support the current conclusions of the manuscript, rather than to explore new directions beyond its scope. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes, the additional experiments primarily involve the inclusion of controls and additional immunofluorescence imaging to their existing experimental setups. They should be relatively straightforward to implement (~2-3 months). - Are the data and the methods presented in such a way that they can be reproduced?

      Yes. - Are the experiments adequately replicated and statistical analysis adequate?

      Overall, yes. But some plot legends should specify the number of replicates analyzed (e.g. Fig. 2b, Fig. 2d, Fig. 3h).

      Minor comments:

      • Specific experimental issues that are easily addressable.

      (Fig. 1c-e) The authors state that intestinal epithelial monolayers exhibit the most effective gap closure when in direct contact with fibroblasts. However, fibroblast-conditioned media and co-cultures show comparable gap closure efficiencies (Fig. 1e). The authors should consider revising this interpretation based on the provided data. (Fig. 3b) The authors suggest that crypt-like epithelial cells undergo migration when grown on fibroblasts, but not in conditioned media alone. This is interesting, but it is not clear how they identify crypt-like cells for tracking. The authors should clarify if crypt-like cells are defined based on markers or inferred from their morphology. (Fig 3f-h) The authors conclude that fibroblasts promote directed epithelial cell motility based on cell trajectory analysis. Although they state that this analysis is performed on epithelial monolayers, their tdTomato epithelial population appears sparse in some conditions (control and conditioned media; Fig. S6a). Such variability in cell density may bias measurements of migration directionality at the cell-level, unless a mixed population is being used for tracking. The authors should clarify whether this analysis was indeed conducted on confluent monolayers. (Fig 6b) Their gap closure experimental setup indicates that fibroblasts are cultured on a Matrigel-coated surface, which should already contain abundant laminin and type IV collagen. Thus, it is unclear why type IV collagen is not detected underneath fibroblasts. The authors should explain why this is the case for clarity. - Are prior studies referenced appropriately?

      Yes - Are the text and figures clear and accurate?

      Mostly. Figures 6d and 6g seem to be duplicated by mistake. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      There are some missing frames in Movie 2. If they are not available, it's okay to include black frames, so that the sequence remains consistent with the timestamps. The authors may consider using asterisks as significance indicators instead of reporting precise p-values directly on their plots. Having this format would facilitate visual comparison of statistical significance across conditions. Displaying single channels of experiments where co-cultures are used would help to better interpret their data.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This study provides a valuable contribution to understanding how fibroblasts influence intestinal epithelial migration. The main advance lies in the use of a co-culture system combining organoid-derived intestinal epithelial cells that assemble into a crypt-villus organization with primary intestinal fibroblasts in a 2D gap closure system. This approach allows the authors to examine epithelial-fibroblast interactions in a more physiologically relevant context compared to prior work. - Place the work in the context of the existing literature (provide references, where appropriate). Addressed above.

      • State what audience might be interested in and influenced by the reported findings.

      Cell and developmental biology, extracellular matrix biology, tissue regeneration. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Tissue morphogenesis, cell motility, extracellular matrix dynamics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Please find enclosed my review comments on the manuscript entitled "Fibroblast alignment coordinates epithelial migration and maintains intestinal tissue integrity" by Jordi Comelles et al. In this manuscript, the authors use a bioengineered epithelial-stromal system composed of organoid-derived intestinal epithelial cells, primary intestinal fibroblasts, and a basement membrane matrix to show that direct physical interactions between fibroblasts and epithelial cells drive a large-scale organization of the fibroblast network. This spatial reorganization, in turn, promotes persistent and oriented migration of epithelial cells, ultimately enabling restoration of the intestinal epithelium in an in vitro gap-closure assay. Overall, while the authors use an elegant in vitro model to study intestinal wound closure, and more specifically the role of fibroblasts in this context, I find this manuscript not suitable for publication in its present form. The data are overinterpreted, the novelty is limited, and the molecular mechanisms underlying WAE-fibroblast interactions are insufficiently addressed.

      Figure 1 - What are the units of the "fraction gap closure" shown in panels d and e? Is it expressed as a percentage? "Actually, epithelial monolayers achieved the most effective gap closure when cultured in direct physical contact with fibroblasts (Figure 1e and Movies 2 and 3)." From the data shown in panels c, d, and e, it appears that fibroblast-conditioned medium alone promotes efficient gap closure, comparable to the + fibroblast condition. Figure 2 - The use of a cell proliferation inhibitor during the gap-closure assay would help determine the contribution of cell proliferation at the migration front. Figure 2f and 2g - Has a dose-dependent effect of PGE2 been tested? Figure 2i - The + fibroblast + EP4i condition (pink) is missing. "This suggests a mechanical or contact-mediated role for fibroblasts in preserving epithelial integrity and promoting coordinated migration beyond their paracrine signaling." While PGE2-EP4 signaling does not appear to be involved in the fibroblast-mediated enhancement of gap-closure efficiency, the conclusion that physical interactions are more important than paracrine effects is overstated. For instance, an experimental condition in which fibroblast-conditioned medium is inactivated (boiling for 5 minutes) would strengthen this conclusion. In addition, inhibition of actomyosin contractility in fibroblasts would be informative. Figure 3 - The data presented here do not convincingly support the dismissal of conditioned medium as a contributing factor. The differences between the + fibroblast-conditioned medium and + fibroblast conditions are modest. In both cases, epithelial cells migrate and gaps close. Figure 4a - "Upon removal of the barrier (t = 0 h), fibroblasts at the epithelial front were small and evenly distributed, with no prominent α-SMA fibers present." Here, fibroblasts are α-SMA positive but not elongated. α-SMA may therefore not be the most appropriate marker. What are the levels of phosphorylated MLC2? These may increase during wound closure. Also, fibroblasts culture promotes aSMA expression, therefore, it may be possible that the fibroblasts used in this assay may not represent the healthy fibroblasts found in vivo. Figure 5 - Fibroblast alignment could also result from paracrine signals secreted by epithelial cells. This possibility should be tested. In summary, this manuscript demonstrates that epithelial cells migrate more efficiently on extracellular matrix proteins deposited and oriented by fibroblasts. This concept is not novel. Identifying the molecular mechanisms governing interactions between WAE and subepithelial fibroblasts would significantly enhance the novelty and impact of this study.

      Significance

      In this manuscript, the authors use a bioengineered epithelial-stromal system composed of organoid-derived intestinal epithelial cells, primary intestinal fibroblasts, and a basement membrane matrix to show that direct physical interactions between fibroblasts and epithelial cells drive a large-scale organization of the fibroblast network. This spatial reorganization, in turn, promotes persistent and oriented migration of epithelial cells, ultimately enabling restoration of the intestinal epithelium in an in vitro gap-closure assay. Overall, while the authors use an elegant in vitro model to study intestinal wound closure, and more specifically the role of fibroblasts in this context, I find this manuscript not suitable for publication in its present form. The data are overinterpreted, the novelty is limited, and the molecular mechanisms underlying WAE-fibroblast interactions are insufficiently addressed.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Background and unknown in the field:

      This study investigates how fibroblast alignment influences the migration of intestinal epithelial cells, contributing to tissue integrity and repair. It is well established that intestinal fibroblasts are important regulators in the tissue through their ability to secrete essential paracrine factors for the epithelium. However, it is less well understood if they also play additional structural, tissue architecture instructing role and how the communication between the fibroblasts and the epithelia is regulated.

      Advance over state of the art:

      Here the authors have set-up an elegant three-component system to investigate this. They have gone beyond the recent advances of culturing intestinal and colonic organoids in 2D (in a manner that preserves- and villus-like organization) and bioengineered epithelial-stromal model comprising organoid-derived intestinal epithelial cells (IECs), primary intestinal fibroblasts, and a basement membrane matrix. Using this model, they have uncovered fibroblasts enhancing the directed and persistent migration of intestinal epithelial cells (IECs). They used scRNAseq to carefully analyse the stromal cell populations present in their co-cultures of primary mouse intestinal subepithelial fibroblasts and organoid-derived intestinal mouse epithelial cells. They observed that this reflected well the stromal cell-type composition as well as the paracrine activity previously reported for these cells in tissue. Using a clever system with Matrigel and an elastomeric barrier, the authors were able to induce non-epithelial gaps in different scenarios (IECs alone or with fibroblasts or with conditioned media) and observe the wound-closure as well as the presence of specific cell types. They observed that the epithelial monolayers showed significant gap closure when in direct contact with fibroblasts compared to controls. Interestingly, the enhanced efficiency of epithelial migration and gap closure, in the presence of fibroblasts, was independent of PGE₂-EP4 signaling and was not due to differences in cell proliferation. Instead, the imaging revealed that the fibroblasts were in direct contact with the epithelium. The authors observed that in the absence of fibroblasts the migration properties of cells in the villus and the crypt regions were dramatically different and the fibroblast presence was necessary to efficiently synchronize these to support gap closure. In addition, the presence of fibroblasts enhanced the directionality of the epithelial cell migration. Detailed imaging and image analyses revealed that gap closure involved activation of the fibroblasts and co-ordinated coalignment of IECs and fibroblasts. They also explored matrix deposition of the fibroblasts during the process and found that they deposited aligned ECM fibers that guide epithelial migration. Mere cell-derived matrix (devoid of live fibroblasts) was able to partially recapitulate the fibroblast-coordinated epithelial migration that the fibroblast generated matrix and its alignment are key contributors to the phenotype.

      Comments:

      This is overall a very interesting and well-written study. The imaging and the image analysis are state-of-the art and the bioengineered model is an exciting advancement over current methods developed by these researchers and others. This study meets all the criteria for a publication in the since that all the experiments seem to be carefully conducted, with appropriate controls and sufficient quantifications and statistics. The claims made by the authors are supported by the data. This is currently suitable to be published as a method/protocol and as a descriptive study uncovering interesting cross-talk and co-dependencies of epithelial and stromal cells during injury repair. There are of course aspects that could improve the study further like more mechanistic insight into the underpinnings of the direct epithelia-fibroblast interaction and its involvement in the directed IEC migration. However, these may be topics to investigate in a future study.

      Significance

      The strengths of the study are the highly in vivo relevant model system that is amendable to imaging and detailed image analysis of distinct cell populations. This may be adapted by others in in the field and has the potential to transform the way cell dynamics in the intestinal epithelium are visualized and investigated in vitro

  2. May 2026
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03319

      Corresponding author(s): Pedro Milanez-Almeida

      1. General Statements

      We thank the reviewers for their careful and constructive evaluation. We agree that the original manuscript did not communicate the conceptual scope, novelty, or limitations of the framework as clearly as it should have. In response, we substantially revised both the presentation and the supporting analyses.

      The central revision reframes tinydenseR around a common landmark-by-sample fuzzy density matrix derived from UMAP-based cell–landmark connection strengths. This representation supports four downstream analysis modes: (1) landmark-level differential density modeling, (2) supervised quantitative sample embedding via partial-effect principal component projection (pePC), (3) density-contrast-aligned feature exploration via graph-diffused partial least squares decomposition (plsD), and (4) connection-strength-weighted pseudobulk differential expression. A dedicated algorithm overview ("The tinydenseR Algorithm") and revised Figure 1 make this structure explicit, and the revised manuscript clarifies which components are novel and which remain conventional once subsets are defined.

      To strengthen empirical support, we expanded synthetic and permutation benchmarks, added a landmark- versus cell-level integration comparison, updated miloR benchmarking to use graph refinement, and included analysis of the publicly available COMBAT COVID-19 PBMC dataset. The Methods now correct the terminology for UMAP-derived weights (connection strengths, not probabilities), make the landmark allocation rule explicit, and clarify assumptions underlying the density matrix construction.

      We do not claim that the revision resolves every benchmarking or sensitivity question raised. The goal was to make the claims more precise, the framework more transparent, and the empirical evidence more directly aligned with the revised conceptual framing.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Overall, this manuscript presents a landmark-based strategy for scaling sample-level differential abundance and differential expression analyses to atlas-sized single-cell datasets by summarizing cells through fuzzy cell-landmark memberships and then applying standard sample-level modeling. The approach is promising for computational efficiency, but several key methodological claims and design choices would benefit from clearer justification and stronger empirical validation.

      We appreciate the reviewer's careful summary and recognition of the promise of landmark-based strategies for atlas-scale analysis. The specific concerns are addressed point by point below; the revision addresses them through expanded benchmarking, clearer methodological justification, and more precise scoping of the framework's contributions and limitations.

      The manuscript's main methodological novelty appears to be computational/scalable representation learning via landmarking and fuzzy cell-landmark densities. However, for differential expression within cell states, the inferential procedure is standard pseudo-bulk + limma/voom and still relies on discrete subset definitions (clusters/cell types, or landmark bins) and downstream annotation for biological interpretation. The authors should clarify that "clustering-independent" primarily applies to the DA representation/testing, and strengthen evidence that landmark-level subsets provide materially different biological resolution than simply clustering a subsample.

      Response:

      This comment raises a central question about the scope of "clustering independence" in the framework. We want to be direct: the revised manuscript does not include a dedicated head-to-head benchmark comparing tinydenseR landmark-level subsets with a "cluster a subsample" workflow. We recognize this as a limitation of the current revision.

      What the revision does instead is twofold. First, it sharpens the conceptual distinction. A subsampled clustering still imposes a hard partition and remains sensitive to the choice of clustering algorithm, graph construction, and resolution parameter. The landmark-density framework avoids that dependence: the same continuous representation supports differential density modeling, pePC, plsD, and pseudobulk DE without requiring a rigid partition at any stage. We added a sentence in the Introduction noting that structure-aware approaches can better preserve data geometry, rare populations, and local neighborhood structure than naive random subsampling.

      Second, the revision strengthens empirical support within the landmark-density framework itself. The synthetic benchmarks demonstrate recovery of simulated trajectory-associated density shifts, abundance perturbations, and activation-associated expression changes directly in the landmark-density space. The real-data applications now present findings through landmark-level density contrasts, pePC, and plsD, with results interpreted as consistent with prior biological knowledge and published reports.

      We also clarify in the revised manuscript that the pseudobulk aggregation itself now derives subsets from the fuzzy landmark topology rather than from a rigid partition, even though the formal gene-level testing remains conventional once a subset has been specified. Thus, clustering independence applies not only to density inference but also to effect-specific sample embedding and exploratory feature interpretation.

      We therefore frame the contribution as a conceptual and empirical argument for the clustering-free workflow, while acknowledging that a direct empirical comparison with a cluster-a-subsample alternative has not been performed in this revision.

      The method treats UMAP fuzzy graph weights as "connection probabilities" and uses their sums to estimate sample-level abundance around each landmark. Please clarify (i) how sensitive the density matrix is to the UMAP membership construction (e.g., choice of parameters, the per-cell membership mass constraint), and (ii) why this is an appropriate abundance surrogate in the context of sample-level inference.

      Response:

      This is an important comment about both the interpretation and parameter-dependence of the UMAP-derived fuzzy graph. We revised the manuscript on both fronts.

      First, we no longer refer to the UMAP fuzzy graph weights as "connection probabilities." The revised Methods (Step 4) now define the directed neighborhood memberships, the local connectivity and scaling terms, and the symmetrized UMAP fuzzy-union weight explicitly, describing the resulting quantities as fuzzy graph connection strengths / affinities rather than calibrated probabilities.

      Second, we now state explicitly that the cell–landmark weights depend on the choice of nearest-landmark count, the per-cell membership-mass constraint, and the local scaling parameters. These choices are held fixed within each analysis, and the resulting landmark-by-sample matrix should be interpreted as a relative measure of enrichment/depletion around landmark neighborhoods rather than as a physical cell count. We did not add a dedicated sensitivity benchmark across UMAP hyperparameters in this revision; given the scope of the other benchmarking additions, we judged this to be better addressed as a focused follow-up study and have noted the dependence on these construction choices explicitly in the Methods. We also note that the consistent behavior of the framework across the diverse datasets analyzed in this work—spanning synthetic, cytometry, and multiple scRNA-seq settings—provides empirical evidence of robustness to parameter choice, as stated in the revised Methods.

      Third, we clarify why summing connection strengths is an appropriate abundance surrogate for sample-level inference. In the revised Methods, the unnormalized quantity around each landmark is defined as the sum of cell–landmark fuzzy weights within a sample, followed by sample-size normalization and log-transformation. The text now explains this both operationally (a continuous measure of how much cell mass from a given sample concentrates near a landmark neighborhood) and conceptually (a soft-assignment abundance surrogate internally consistent with the manifold-based representation used throughout the framework).

      These revisions address the concern by making two points explicit: (i) the UMAP-derived quantities are fuzzy graph connection strengths rather than calibrated probabilities, and (ii) their sums serve as a continuous, size-normalized abundance surrogate aligned with the landmark-based representation on which downstream inference is performed.

      The landmark selection step caps landmarks at 10% of cells per sample (and an overall cap of 5,000 landmarks by default), but the rationale for these thresholds is not clearly justified. Additionally, interaction between the per-sample cap and the global 5,000 landmark cap: when the global cap is binding, how are landmarks redistributed across samples, and does this induce unequal representation across samples with different cell yields?

      Response:

      We agree that the rationale for the default landmark caps and the interaction between the per-sample and global limits should be stated more explicitly. We revised Methods Step 2 (Landmark Selection) accordingly.

      The revised manuscript states that the default caps balance approximate proportional representation across samples with bounded computation. The target number of landmarks is defined from the per-sample sampling proportion (p = 0.1 by default) together with the overall cap of 5,000 landmarks, so that landmark sampling remains approximately proportional to sample cell yield unless the global budget becomes limiting.

      We also clarify how the per-sample and global caps interact. When the global cap is binding, each sample is subject to the same per-sample upper bound given by an equal-share maximum of the global target, while smaller samples remain limited by their own proportional cap:

      where is the number of cells in sample , is the sampling proportion, , and is the number of samples. This means that no single high-yield sample can dominate the landmark set, while smaller samples contribute only up to their own proportional allocation.

      The revised manuscript also notes the consequence for unequal sample yields: because small samples remain limited by , the total number of selected landmarks can be less than the nominal global budget if many samples are small. We therefore frame the rule as a transparent compromise that limits over-representation of high-yield samples while preserving simple approximately proportional design. For additional transparency, is recorded in the metadata for inspection, users can adjust p to match replicate structure and heterogeneity, and the software warns when sample cell yields differ by more than 10-fold.

      Reviewer #1 (Significance (Required)):

      This manuscript introduces a practical landmark-based framework that makes sample-level differential abundance and expression analyses feasible at atlas scale by summarizing large cell collections through a reduced set of representative cells and fuzzy cell-landmark memberships, then leveraging well-established sample-level linear modeling. The main strengths are its emphasis on scalability, a clear engineering workflow that can be applied across large studies, and a unified representation that can support multiple downstream analyses. The main limitations are that key inferential components remain conventional once subsets are defined, biological interpretation still relies on clustering/label transfer and downstream annotation, and several core design choices (e.g., treating UMAP fuzzy weights as "probabilities" and the default landmark caps) would benefit from stronger justification and sensitivity analyses to support the paper's broader claims about cluster-independence and biological resolution.

      Response:

      We appreciate this balanced assessment. The revised manuscript addresses these concerns through the changes detailed in our preceding responses: the framework is now presented around the common landmark-density representation, with explicit scoping of which components are novel (the density matrix, pePC) and which remain conventional (pseudobulk DE after subset selection). The UMAP-derived weights are described as connection strengths rather than probabilities, the landmark caps are justified more explicitly, and the empirical support has been expanded through synthetic, permutation, integration, and real-data analyses.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors describe a new computational framework for the analysis of single-cell RNA-seq data in multiple domains that uses probabilistic cluster assignment for downstream differential expression and abundance analysis. They show that their method outperforms similar methods in certain tasks, and provide computational performance studies to show that this is tractable on large scale RNA-seq datasets.

      I congratulate the authors on a well written and interesting manuscript, and the development of what seems like quite a comprehensive analysis package. I particularly appreciate the effort to demonstrate computational performance on large datasets (and of course the great name!). Nevertheless, I have a few minor comments that I think would enhance the presentation and impact of the paper.

      We are grateful for Reviewer 2's generous assessment and constructive comments, which helped us improve the presentation, terminology, and readability of the manuscript. The responses below address each point in detail.

      From the introduction, it's a little hard to determine what exactly tinydenseR is doing, what the underlying philosophy is, and what the key innovations are. This is especially apparent as the paper then launches straight into the results. The methods section is very comprehensive, however without a 'guide' through this that bridges the gap between the very high level description and the implementation details, a reader could end up getting stuck. So my suggestion would be to add a couple of paragraphs to the introduction that provide this, such that the reader would only need to consult the methods for the real implementation details.

      Response:

      We agree that the original manuscript moved too quickly from high-level motivation into application results without providing enough conceptual guidance.

      We addressed this in two complementary ways. First, we sharpened the framing in the final paragraph of the Introduction, which now describes tinydenseR as a "fuzzy set-based, technology-agnostic framework for sample-level modeling and quantitative embedding" built around a shared landmark-based representation. Second, we added a dedicated Results subsection, "The tinydenseR Algorithm," that walks the reader through the core elements of the framework—the fuzzy density matrix, differential density analysis, pePC, plsD and pseudobulk DE—before the application sections. This overview was not present in the original version and now functions as the conceptual bridge between the high-level framing and the full Methods that the reviewer requested. We also revised Figure 1 so that the workflow schematic makes these components visually explicit, with the density matrix shown as the common representation from which all four downstream analysis modes are derived.

      We thus addressed the reviewer's underlying point somewhat differently from the exact suggestion of adding multiple new Introduction paragraphs: the revised manuscript combines a more explicit Introduction paragraph with a dedicated algorithm overview at the start of the Results and an expanded workflow schematic.

      It seems that the probabilities that you get from the fuzzy cluster assignment aren't necessarily well calibrated. After the Benjamini-Hochberg correction this might end up not being a issue, so it would be good to see that the FDR that you mention in line 431 is indeed roughly how many discoveries are made under the null hypothesis for some indicative configurations, and whether this deviates significantly from the expectation.

      Response:

      We agree that the UMAP-derived fuzzy weights should not be interpreted as calibrated probabilities for statistical inference. The revised Methods (Step 4) now describe these quantities as affinities / connection strengths used to construct a continuous abundance surrogate, not calibrated probabilities for statistical modeling.

      To address the concern empirically, we added permutation-based null-distribution analyses across the synthetic benchmark settings (Fig. S4). These analyses report the number of discoveries at q We note that the swfdr-based q-values are plug-in estimates of per-feature FDR, and do not provide a formal guarantee that the full rejection set at q ≤ α is globally FDR-controlled in the BH sense (this is now stated in Methods Step 5). The permutation results are therefore presented as evidence that, in the benchmark settings considered, the method does not produce substantial spurious discovery under arbitrary relabeling, rather than as a proof of exact calibration under all null configurations.

      The rest of my comments are minor and only related to the presentation:

      l71 (and elsewhere): A bit more detail about these datasets would be nice (what the trajectories are, etc, rather than just the configuration).

      The revised manuscript adds substantially more detail for the synthetic trajectory, DA, and DE datasets in both the Results and the dedicated Synthetic Data Methods subsections.

      l73: The language is a little different between this description and the figures, so it's a little hard to keep track of the correspondence.

      The revised manuscript aligns the synthetic sections more explicitly with the figures: trajectory is now Fig. 1, simulated DA is Fig. S1, and simulated DE is Fig. S2, with corresponding figure legends rewritten in the same language.

      Fig S2 (and others): I know that these are supplemental figures, but they're quite dense and hard to read (both conceptually and in terms of the font size). For something supporting a direct claim in the paper, it would be nice to have one or two plots in the main body of the paper that summarise all of the evidence in these denser supplementary plots.

      The revised manuscript adds a stronger main-text overview in Figure 1 and the new "The tinydenseR Algorithm" subsection. The DA/DE benchmark panels remain in the Supplementary Figures, but the revised main-text overview and workflow schematic now guide the reader more explicitly through the supporting evidence.

      l106: This is a little confusing to read (not clear whether these are single and double knock outs or just wild type and double knock out).

      The revised text now states explicitly: WT for HRAS and NRAS versus H/NRAS double knockout (KO).

      l162: In general it would be nice to link this back to some of the choices made in tinydenseR compare to other techniques.

      The revised Discussion now includes explicit comparison to MetaCell / SEACells, to unsupervised sample-representation methods such as PILOT, MrVI, and scPoli, and to the cell type and feature interpretation tools Augur and TRADE.

      l175: 'Fuzzy' density matrix is not really defined anywhere

      The new Results overview explicitly defines the landmark-by-sample density matrix and explains that entries reflect cell-landmark affinity rather than hard membership.

      l427: BH -> Benjamini-Hochberg

      Corrected; thank you.

      Eq 5: Missing a bracket I think

      Corrected; thank you.

      Fig 2b: A log x axis might be better here.

      Figure 2 has been substantially revised, whereby percentages of cells per cluster are not presented anymore. So, the original presentation concern is no longer directly applicable.

      Fig 2c,d,e: Some of the y axis ticks here in particular here look a little broken

      Corrected; thank you.

      Fig S5: DA -> differential abundance, DE -> Differential expression. A guide to what exactly to look at in this figure would also be nice.

      We revised Figure S4 (previously Fig. S5) and its legend to make the figure easier to interpret. DA is now defined as differential abundance and DE as differential expression, and we clarify that the key features to inspect are the number of discoveries at q

      Figure S7: A brief description of the cell types would be nice somewhere.

      Figure S10 (previously Figure S7) has been substantially revised, whereby cell types are not included in the comparison due to the clustering/cell type label-independence structure in the revised manuscript. So, the original presentation concern is no longer directly applicable.

      Reviewer #2 (Significance (Required)):

      Recently there has been a movement to try and go beyond simple cell type annotation, and here the authors present a consistent and scalable framework for this in R. This work develops new strategies to incorporate existing techniques into scRNA-seq analysis pipelines, whilst maintaining computational scalability.

      This work will be particularly interesting to those performing analyses dependent on cell type identification in the R ecosystem, and is demonstrated to be capable of handling the large datasets of modern scRNA-Seq studies.

      Although there not much conceptually novel, the implementation appears to be robust, and the scope of applicability is very broad. As such this work will be of broad interest, from those doing basic science on cell lines, to those analysing data from in vivo or human clinical trials.

      My background is in bioinformatics, physics, and statistics.

      Response:

      We appreciate this assessment and agree that the original version did not make the conceptual contributions sufficiently visible. The revised manuscript now distinguishes more clearly between the novel components—the common landmark-by-sample density representation and the contrast-specific supervised embedding (pePC)—and the conventional components (pseudobulk DE after subset selection). We hope the revision better conveys both the practical scalability and the specific methodological contributions, while being transparent about which downstream steps remain standard.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors present an R package for landmark-based analysis of multi-sample single-cell genomics datasets, demonstrating its utility through applications to both simulated and clinical data.

      Major:

      The current benchmarking does not sufficiently establish the advantages of the proposed method over existing alternatives. Most notably, comparisons with metacell-based approaches are absent, despite these representing the most conceptually similar class of methods. The presented benchmark on differential abundance analysis benchmark lacks a scenario with known ground truth, making it impossible to assess false discovery rate control. Additionally, the comparison with miloR should employ graph-based distance approximations, which substantially improve runtime and memory usage (https://www.biorxiv.org/content/10.1101/2023.11.08.566176v1.full). We recognize that exhaustive benchmarking across all possible scenarios is neither feasible nor necessary. However, the current comparisons should be expanded to clarify the specific strengths and limitations of this approach relative to existing methods. This would help readers assess when the proposed tool offers a meaningful advantage over established alternatives.

      Response:

      We appreciate this careful benchmarking critique. We want to address the metacell comparison directly: we did not add a head-to-head benchmark against metacell-based approaches in this revision. We recognize this as a limitation, and we explain below why we focused the empirical revision on other comparisons.

      Metacell methods (MetaCell, SEACells) aggregate similar cells into group-level representations that serve as compressed versions of the single-cell data. tinydenseR addresses a related but distinct modeling objective: it constructs a per-sample density distribution over a shared set of landmarks rather than a single compressed cell-by-gene representation. The downstream analysis modes (differential density, pePC, plsD) operate on this sample-level density matrix, not on aggregated expression profiles. A meaningful comparison would therefore require defining a shared evaluation task, and the most natural such tasks for tinydenseR—sample-level density modeling, contrast-specific embedding—are not standard outputs of metacell frameworks. We have added this conceptual positioning to the Discussion so that readers can assess the relationship between the approaches.

      For comparisons more directly aligned with the inferential claims of the framework, the revised manuscript includes synthetic DA and DE scenarios with known ground truth, permutation-based null analyses (Fig. S4), and expanded computational and methodological benchmarking against diffcyt, miloR, Seurat, and cell-level Harmony integration. The miloR benchmarking workflow was updated to use refinement_scheme = "graph," as stated in the Methods and Figure 3 legend and the Material and Methods. Even with graph refinement, miloR remained substantially slower than tinydenseR in our benchmarking setting.

      We agree that broader benchmarking across all neighboring method classes remains valuable future work, and we have stated this in the revised manuscript.

      It is unclear whether performing integration at the landmark level yields representations comparable to those obtained through cell-level integration. A systematic comparison between landmark-level and cell-level integration would help establish that the compression step does not introduce meaningful loss of biological signal.

      Response:

      In the revised manuscript, we added a dedicated landmark-level versus cell-level integration benchmark using the Luecken et al. Immune_ALL_human dataset (33,506 cells; 10 batches; 16 annotated cell types). We directly compared tinydenseR landmark-level integration (Harmony on landmarks followed by Symphony projection) against cell-level Harmony on all cells, evaluating integration quality using six metrics: three biological-conservation metrics (MCC for cell-type label transfer, cell-type ASW, graph connectivity) and three batch-mixing metrics (ASW by batch, kBET, batch entropy) (Fig. S5).

      Landmark-level integration matched or slightly exceeded cell-level Harmony on four of six metrics, including all three biological-conservation metrics. Cell-level Harmony performed better on the two neighborhood-based batch-mixing metrics, indicating that compression to landmarks did not measurably impair preservation of major biological structure in this dataset, although some reduction in local batch mixing remained.

      The landmark-based representation provides a per-sample summarized view that could lend itself to broader applications beyond cell state identification. For example, computing sample-level embeddings from the landmark matrix could enable systematic exploration of similarities and differences across large cohorts. Demonstrating such applications would strengthen the case for this framework's utility and help distinguish it from existing alternatives.

      Response:

      The revised manuscript demonstrates this broader utility across multiple datasets. The landmark-by-sample density matrix now supports both unsupervised sample embedding (PCA and diffusion-map trajectory) and supervised quantitative embedding via pePC, as described in the new "The tinydenseR Algorithm" subsection and Methods Step 6.

      Sample-level embeddings are shown in the synthetic DA/DE benchmarks (Figs. S1–S2), the xenograft treatment comparison (Fig. S6c), the COMBAT COVID-19 analysis (Fig. S7d), the longitudinal NIZ985 clinical-trial dataset (Fig. S10d), and multiple compartments and time points of the PHE885 study (Figs. S12c, S13c, Fig. 2c). Across these examples, the embeddings quantify cohort structure, separate effects of interest from nuisance variation, and summarize the fraction of total variance attributable to modeled contrasts. We agree that this broader utility helps distinguish the framework from alternatives, and we have revised the manuscript accordingly.

      Minor:

      The authors repeatedly describe their method as "agnostic to technology," but it is unclear what specific advantage this confers. In practice, most methods operating on embeddings and k-nearest neighbor graphs are similarly technology-agnostic. This claim would benefit from either clarification of the specific sense in which this property is distinctive, or a demonstration involving joint analysis of data generated across different technologies.

      Response:

      The reviewer is right that most methods operating on embeddings and kNN graphs are, in principle, technology-agnostic. Our use of the term is more specific and practical: the same end-to-end framework—from landmark construction through density modeling, embedding, and feature interpretation—is implemented out of the box in tinydenseR for both scRNA-seq and flow/mass/spectral cytometry, with modality-appropriate preprocessing but a shared downstream workflow. The revised Methods now list the supported input formats (Seurat, SingleCellExperiment, H5AnnData, BPCells, dgCMatrix, DelayedMatrix, cytoset) to make this concrete.

      We do not demonstrate joint analysis across different technologies in this manuscript, and we have therefore added a note in the Discussion that extending toward joint cross-technology analysis is a natural next step.

      The method description in the main text is insufficient to guide the reader through the subsequent applications. Starting with a more detailed overview of the key steps and assumptions underlying the approach would help readers interpret the results presented in later sections.

      We agree that the original version did not provide enough methodological guidance before the applications. The revised manuscript adds a dedicated Results subsection, "The tinydenseR Algorithm," that introduces the key steps and assumptions of the framework before the application sections. We also revised Figure 1 and expanded the relevant Methods sections so that the conceptual overview in the main text is clearly connected to the implementation details used in the later applications.

      Reviewer #3 (Significance (Required)):

      While the tool addresses a relevant need in the field, the underlying summarization approach is conceptually similar to established methods for multi-sample single-cell analysis, including those based on metacells (https://pubmed.ncbi.nlm.nih.gov/31604482/, https://pubmed.ncbi.nlm.nih.gov/35440087/, https://pubmed.ncbi.nlm.nih.gov/35963997/, https://pubmed.ncbi.nlm.nih.gov/36973557/) or data downsampling/sketching (https://pubmed.ncbi.nlm.nih.gov/31176620/). Given the limited conceptual novelty relative to these existing frameworks, the significance of this contribution is difficult to assess without more extensive benchmarking against comparable methods.

      Response:

      We appreciate this perspective. As discussed in our response to the benchmarking comment above, the revised manuscript positions tinydenseR explicitly as a sample-level landmark-density modeling framework rather than a compression or sketching strategy. The key distinction is that the downstream analyses operate on a per-sample density distribution over landmarks, not on aggregated expression profiles, which is why the empirical revision focused on benchmarks directly aligned with these inferential and computational claims. The Discussion now includes explicit positioning relative to metacell and sketching approaches, and we agree that broader cross-method benchmarking is valuable future work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors present an R package for landmark-based analysis of multi-sample single-cell genomics datasets, demonstrating its utility through applications to both simulated and clinical data.

      Major:

      The current benchmarking does not sufficiently establish the advantages of the proposed method over existing alternatives. Most notably, comparisons with metacell-based approaches are absent, despite these representing the most conceptually similar class of methods. The presented benchmark on differential abundance analysis benchmark lacks a scenario with known ground truth, making it impossible to assess false discovery rate control. Additionally, the comparison with miloR should employ graph-based distance approximations, which substantially improve runtime and memory usage (https://www.biorxiv.org/content/10.1101/2023.11.08.566176v1.full). We recognize that exhaustive benchmarking across all possible scenarios is neither feasible nor necessary. However, the current comparisons should be expanded to clarify the specific strengths and limitations of this approach relative to existing methods. This would help readers assess when the proposed tool offers a meaningful advantage over established alternatives. It is unclear whether performing integration at the landmark level yields representations comparable to those obtained through cell-level integration. A systematic comparison between landmark-level and cell-level integration would help establish that the compression step does not introduce meaningful loss of biological signal. The landmark-based representation provides a per-sample summarized view that could lend itself to broader applications beyond cell state identification. For example, computing sample-level embeddings from the landmark matrix could enable systematic exploration of similarities and differences across large cohorts. Demonstrating such applications would strengthen the case for this framework's utility and help distinguish it from existing alternatives.

      Minor:

      The authors repeatedly describe their method as "agnostic to technology," but it is unclear what specific advantage this confers. In practice, most methods operating on embeddings and k-nearest neighbor graphs are similarly technology-agnostic. This claim would benefit from either clarification of the specific sense in which this property is distinctive, or a demonstration involving joint analysis of data generated across different technologies. The method description in the main text is insufficient to guide the reader through the subsequent applications. Starting with a more detailed overview of the key steps and assumptions underlying the approach would help readers interpret the results presented in later sections.

      Significance

      While the tool addresses a relevant need in the field, the underlying summarization approach is conceptually similar to established methods for multi-sample single-cell analysis, including those based on metacells (https://pubmed.ncbi.nlm.nih.gov/31604482/, https://pubmed.ncbi.nlm.nih.gov/35440087/, https://pubmed.ncbi.nlm.nih.gov/35963997/, https://pubmed.ncbi.nlm.nih.gov/36973557/) or data downsampling/sketching (https://pubmed.ncbi.nlm.nih.gov/31176620/). Given the limited conceptual novelty relative to these existing frameworks, the significance of this contribution is difficult to assess without more extensive benchmarking against comparable methods.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors describe a new computational framework for the analysis of single-cell RNA-seq data in multiple domains that uses probabilistic cluster assignment for downstream differential expression and abundance analysis. They show that their method outperforms similar methods in certain tasks, and provide computational performance studies to show that this is tractable on large scale RNA-seq datasets.

      I congratulate the authors on a well written and interesting manuscript, and the development of what seems like quite a comprehensive analysis package. I particularly appreciate the effort to demonstrate computational performance on large datasets (and of course the great name!). Nevertheless, I have a few minor comments that I think would enhance the presentation and impact of the paper.

      From the introduction, it's a little hard to determine what exactly tinydenseR is doing, what the underlying philosophy is, and what the key innovations are. This is especially apparent as the paper then launches straight into the results. The methods section is very comprehensive, however without a 'guide' through this that bridges the gap between the very high level description and the implementation details, a reader could end up getting stuck. So my suggestion would be to add a couple of paragraphs to the introduction that provide this, such that the reader would only need to consult the methods for the real implementation details.

      It seems that the probabilities that you get from the fuzzy cluster assignment aren't necessarily well calibrated. After the Benjamini-Hochberg correction this might end up not being a issue, so it would be good to see that the FDR that you mention in line 431 is indeed roughly how many discoveries are made under the null hypothesis for some indicative configurations, and whether this deviates significantly from the expectation.

      The rest of my comments are minor and only related to the presentation:

      • l71 (and elsewhere): A bit more detail about these datasets would be nice (what the trajectories are, etc, rather than just the configuration).

      • l73: The language is a little different between this description and the figures, so it's a little hard to keep track of the correspondence.

      • Fig S2 (and others): I know that these are supplemental figures, but they're quite dense and hard to read (both conceptually and in terms of the font size). For something supporting a direct claim in the paper, it would be nice to have one or two plots in the main body of the paper that summarise all of the evidence in these denser supplementary plots.

      • l106: This is a little confusing to read (not clear whether these are single and double knock outs or just wild type and double knock out).

      • l162: In general it would be nice to link this back to some of the choices made in tinydenseR compare to other techniques.

      • l175: 'Fuzzy' density matrix is not really defined anywhere

      • l427: BH -> Benjamini-Hochberg

      • Eq 5: Missing a bracket I think

      • Fig 2b: A log x axis might be better here.

      • Fig 2c,d,e: Some of the y axis ticks here in particular here look a little broken

      • Fig S5: DA -> differential abundance, DE -> Differential expression. A guide to what exactly to look at in this figure would also be nice.

      • Figure S7: A brief description of the cell types would be nice somewhere.

      Significance

      Recently there has been a movement to try and go beyond simple cell type annotation, and here the authors present a consistent and scalable framework for this in R. This work develops new strategies to incorporate existing techniques into scRNA-seq analysis pipelines, whilst maintaining computational scalability.

      This work will be particularly interesting to those performing analyses dependent on cell type identification in the R ecosystem, and is demonstrated to be capable of handling the large datasets of modern scRNA-Seq studies.

      Although there not much conceptually novel, the implementation appears to be robust, and the scope of applicability is very broad. As such this work will be of broad interest, from those doing basic science on cell lines, to those analysing data from in vivo or human clinical trials.

      My background is in bioinformatics, physics, and statistics.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      1) Overall, this manuscript presents a landmark-based strategy for scaling sample-level differential abundance and differential expression analyses to atlas-sized single-cell datasets by summarizing cells through fuzzy cell-landmark memberships and then applying standard sample-level modeling. The approach is promising for computational efficiency, but several key methodological claims and design choices would benefit from clearer justification and stronger empirical validation.

      2) The manuscript's main methodological novelty appears to be computational/scalable representation learning via landmarking and fuzzy cell-landmark densities. However, for differential expression within cell states, the inferential procedure is standard pseudo-bulk + limma/voom and still relies on discrete subset definitions (clusters/cell types, or landmark bins) and downstream annotation for biological interpretation. The authors should clarify that "clustering-independent" primarily applies to the DA representation/testing, and strengthen evidence that landmark-level subsets provide materially different biological resolution than simply clustering a subsample.

      3) The method treats UMAP fuzzy graph weights as "connection probabilities" and uses their sums to estimate sample-level abundance around each landmark. Please clarify (i) how sensitive the density matrix is to the UMAP membership construction (e.g., choice of parameters, the per-cell membership mass constraint), and (ii) why this is an appropriate abundance surrogate in the context of sample-level inference.

      4) The landmark selection step caps landmarks at 10% of cells per sample (and an overall cap of 5,000 landmarks by default), but the rationale for these thresholds is not clearly justified. Additionally, interaction between the per-sample cap and the global 5,000 landmark cap: when the global cap is binding, how are landmarks redistributed across samples, and does this induce unequal representation across samples with different cell yields?

      Significance

      This manuscript introduces a practical landmark-based framework that makes sample-level differential abundance and expression analyses feasible at atlas scale by summarizing large cell collections through a reduced set of representative cells and fuzzy cell-landmark memberships, then leveraging well-established sample-level linear modeling. The main strengths are its emphasis on scalability, a clear engineering workflow that can be applied across large studies, and a unified representation that can support multiple downstream analyses. The main limitations are that key inferential components remain conventional once subsets are defined, biological interpretation still relies on clustering/label transfer and downstream annotation, and several core design choices (e.g., treating UMAP fuzzy weights as "probabilities" and the default landmark caps) would benefit from stronger justification and sensitivity analyses to support the paper's broader claims about cluster-independence and biological resolution.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Decker et al present an interesting study of the order of events in tau seeding in a biosensor HEK293 derived cell line. This a critical unresolved question in the field about subcellular compartment contributions to tau aggregation. This exploration of the nuclear tau aggregated deposition and seeding in HEK293T tau biosensor cells uses a variety of imaging-based methods. They show that nuclear aggregates only form in cells with cytosolic aggregates, nuclear aggregates cannot form in the absence of cytoplasmic tau aggregates. The original tau seeds do not persist. Also aggregates in the nucleus are dependent on VCP and SRRM2 for formation. The paper is limited in scope by use of only HEK239T cells and seem to overstate the generalizability of the findings to neuronal mechanisms of neurodegeneration. Please see to all tauopathies. In particular, the manuscript does not outline the overstatement of some of the conclusions.

      Key Points to address: 1. The manuscript does not detail limitations of the study in the discussion. Please address the concern that HEK293T biosensor cells are not neurons. Especially in the clear animations showing the transformation from cytoplasmic to nuclear aggregates appears to require cell division and nuclear breakdown.

              We agree with the reviewer that a limitation of this manuscript is we only used HEK293 cells.  We have added text to emphasize this point in a "Limitations of this study" section at the end of the discussion.  However, as a starting point we believe understanding the cell biology of protein aggregation even in non-neuronal cells can be of value.
      
              Moreover, we clearly see cases of nuclear tau aggregates forming without cell division and nuclear breakdown (Figure 1 and Movies).  We have added text to emphasize this point since it is relevant to the potential formation of nuclear aggregates in neurons and the reviewer must have missed this point.
      

      The introduction sets this up as Alzheimer's disease relevant but all studies are down with P301S tau which is a distinct and particularly aggressive form of tauopathy (FTLD-Tau). There is no amyloid beta component to any of these studies.

              This is a good point, and we have clarified our use of an FTLD model.  We do note that since seeds from post-mortem tissues in several different tauopathies can give nuclear tau aggregates (Sanders et al 2014), we anticipate that this process is general to multiple tauopathy contexts.
      

      The study does not address the peculiar structure of P301S aggregates, which while disease relevant are clearly distinct from AD or most forms of familial FTLD. The authors should limit the generalizability of the findings to their particular form of tauopathy unless they plan to use multiple tau fibril conformations in their studies.

              The reviewer points out that we have only used one model system, and presumably only one tau fibril structure and therefore we should be cautious about the generality of our results.  This is a valid point, and we now point out this limitation in the manuscript.
      

      The authors do not address the potential impact of fusing a natively unfolded protein like tau to a highly structured beta barrel like GFP. Please present this potential confound.

              We have added text pointing out that using GFP fusion proteins has the potential to alter tau function. We note this is an issue in the use of any fusion proteins, which have nevertheless proven useful tools.
      
      1. Inhibition of VCP can cause proteinopathies in the absence of other seeding. For instance, familial mutations in human VCP can cause either tau or TDP-43 proteinopathy depending on the specific human disease causing mutation. Thus, critical controls are missing from figure 3. For instance, the consequence of VCP inhibition on unseeded biosensor cells is a missing control. Second all panels should evaluate TDP-43 aggregation to ascertain whether or not the secondary nuclear seeding involves TDP-43.
            In this comment, the reviewer asks that we show the effects of VCP inhibition on unseeded cells.  We will add this control, and we observe no appreciable tau aggregation with tau seeding.
        
            We will also assess whether TDP-43 aggregates in the HEK293 biosensor cells with or without VCP inhibition and/or tau seeding.  However, we note that it is clear from many studies that tau aggregation can occur independently of TDP-43 aggregation.
        

      Minor concerns: A. Line 635 - In line 380, they discuss that aggregation of tau does not lead to perturbations in nuclear transport. In line 390, they discuss that aggregation of tau does not affect nuclear envelope integrity or nuclear import. However, in the discussion discusses that aggregation alters nuclear RNA export. These statements could use clarifying that protein export is not perturbed but RNA export and import may be.

      We have clarified this point.

      B. Line 564: "This observation suggests that tau aggregation in the cytoplasm may lead to increased expression of some RNAs." This could also be that cytoplasmic tau alters RNA export. These experiments don't differentiate between these options.

      This comment is related to other comments about the relative abundance of specific RNAs in the nucleus or cytoplasm. We will add new data to the manuscript where we examine the numbers of specific RNAs in cells with and without nuclear or cytoplasmic tau aggregates. This will allow us to determine if there is simply a retention of RNAs in the nucleus or if, in some cases, there is also an increase in RNA levels.

      1. In Figure 1, the authors show large aggregates overlapping the nucleus. It is unclear whether these aggregates have a portion both within and outside the nucleus or if they are deforming the nucleus and are wholly external to the nuclear compartment. Clarity on this issue is important. If the nucleus is deferment the observed aggregates seem reminiscent of aggresome formation. Please clarify. We assume the reviewer asks us to clarify why the large cytoplasmic tau aggregates are localized near the nucleus. Indeed, we suspect these are accumulating in aggresomes over time and have added this point to the text. Importantly, we do not observe a general defect in the integrity of the nucleus suggesting that even those these assemblies are close to the nucleus, they are not altering the nuclear envelope. We have added text to explain this issue.

      Reviewer #1 (Significance (Required)):

      Decker et al present an interesting study of the order of events in tau seeding in a biosensor HEK293 derived cell line. This a critical unresolved question in the field about subcellular compartment contributions to tau aggregation. This exploration of the nuclear tau aggregated deposition and seeding in HEK293T tau biosensor cells uses a variety of imaging-based methods. They show that nuclear aggregates only form in cells with cytosolic aggregates, nuclear aggregates cannot form in the absence of cytoplasmic tau aggregates.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Decker et al. examines the formation of nuclear tau aggregates and their functional consequences using a HEK293T tau biosensor system. The authors propose that nuclear tau aggregates arise through VCP dependent secondary seeding from cytoplasmic aggregates and that these nuclear aggregates impair RNA export. The study addresses an important and relatively unexplored aspect of tau biology. However, several conclusions extend beyond what the data directly supports, and several essential controls are missing. Major Comments - The introduction is generally clear and well organized. However, it would benefit from additional mechanistic context explaining how polyserine domains might promote tau aggregation and why this feature is biologically relevant.

              We have added text expanding what is known about how polyserine domains can increase tau fibrillization.
      
      • The live cell imaging convincingly demonstrates the temporal sequence of cytoplasmic followed by nuclear aggregation. However, the manuscript lacks controls assessing whether nuclear aggregation correlates with higher tau expression levels. Such controls are necessary to exclude expression driven artifacts.

            We will add an analysis of the relationship between tau expression levels and cells with nuclear tau aggregates.  We observed that tau aggregates were independent of the tau expression levels, ruling out that nuclear tau aggregates are solely an artifact of extremely high tau expression levels.
        
      • The authors conclude that nuclear envelope integrity is preserved, but only import assays were performed. To validate the sensitivity and specificity of the assay, export assays or positive controls for nuclear transport disruption are required.

            We had already shown that in cells with nuclear tau aggregates the nuclear export of mRNAs is perturbed. We will add additional analyses of whether nuclear export of proteins is altered.
        
      • The Cy3/Cy5 seed experiments support the claim that exogenous seeds do not enter the nucleus. However, the conclusion that VCP generates secondary seeds is overstated. For example, the manuscript states: "VCP is responsible for the formation of secondary seeds..." (lines 418-439), yet the data demonstrate correlation rather than direct evidence of seed generation.

            This is a valid point. We have rephrased the manuscript to note that VCP is required for nuclear tau aggregation, possibly through the formation of secondary tau seeds, which is consistent with earlier work suggesting VCP can generate new tau seeds (Saha et al., 2023, Nature Communications; Batra et al., 2025, Molecular Neurodegeneration).
        

      To substantiate this conclusion, the authors should: directly quantify seed abundance. The current interpretation assumes uniform cytoplasmic uptake of seeds but does not measure it; Include controls addressing VCP inhibitor specificity, as these compounds have pleiotropic effects (e.g., ER stress, proteostasis collapse). No data is provided on whether VCP inhibition alters tau ubiquitination, which could have major implications on tau aggregation.

                    This comment addresses the issue of whether VCP can generate new seeds from tau fibers.  This is a conclusion already reached by prior work (Saha et al., 2023, Nature Communications; Batra et al., 2025, Molecular Neurodegeneration).  The point of our manuscript that this comment addresses is whether the nuclear aggregates are forming from a secondary seeding event, for which we have already provided several lines of evidence.  First, we have shown that nuclear aggregates only form after the formation of a prior cytoplasmic tau aggregate (Figure 1). Second, we have shown that nuclear aggregates do not contain exogenous seeds, while all cytoplasmic tau aggregates do (Figure 2).  Finally, we have shown that nuclear tau aggregates are dependent on VCP, which is consistent with the prior work showing VCP can generate tau seeds.  It is beyond the scope of this manuscript to determine in more detail how VCP affect tau aggregates generally.  For this reason, and since we have robustly demonstrated our conclusion, we have chosen not to pursue these additional suggested experiments.
      
      • The authors observed increase in nuclear poly(A)+ RNA and specific transcripts. However, the current data do not distinguish between several possible mechanisms that may account for this increase, including impaired export, increased transcription, enhanced RNA stability, or nuclear retention due to speckle reorganization.

            To address this comment, we will quantify the levels of individual RNAs in the nucleus, cytoplasm and whole cell.  This will allow us to determine if there is an increase in RNA levels (possibly due to increased transcription or reduced decay), or if the increased nuclear RNA levels are due to block to mRNA export.  We will also assess transcription rate by measuring the intensity of the transcription loci, which will allow us to distinguish if any changes in mRNA levels are due to transcription or changes in RNA decay.
        
      • The discussion occasionally overinterprets the data. Several statements should be reframed as hypotheses rather than conclusions:

      • "VCP can generate tau seeds capable of additional seeding within a cell." (lines 572-594) This has not been directly demonstrated and should be softened accordingly.

      We have done so. 2. Active import via SRRM2 is proposed, but no supporting data are presented. This should be clearly framed as a speculative model.

      We have done so. 3. "Tau aggregates in the nucleus alter the function of nuclear speckles..." (lines 616-637). While plausible, this is not directly shown. Alternative explanations such as transcriptional upregulation or stress induced changes should be acknowledged.

      We have altered this text to be more accurate. 4. The statement "It is possible that such nuclear aggregates could alter nuclear RNA export and contribute to pathology." (lines 637-655) is reasonable, but the authors should emphasize that nuclear tau aggregates are not consistently observed across tauopathies and that the HEK293T biosensor system may not fully recapitulate neuronal biology.

      We agree with this point and have rephrased the text accordingly.

      Reviewer #2 (Significance (Required)):

      The manuscript by Decker et al. examines the formation of nuclear tau aggregates and their functional consequences using a HEK293T tau biosensor system. The authors propose that nuclear tau aggregates arise through VCP dependent secondary seeding from cytoplasmic aggregates and that these nuclear aggregates impair RNA export. The study addresses an important and relatively unexplored aspect of tau biology. However, several conclusions extend beyond what the data directly supports, and several essential controls are missing.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In their manuscript, Decker et al., investigate the relationship between cytoplasmic and nuclear tau aggregation using a HEK293T biosensor system and propose a multistep model in which cytoplasmic aggregates give rise to nuclear tau aggregates, potentially via VCP-dependent secondary seed generation and involvement of nuclear speckle components. The study further explores functional consequences of nuclear tau aggregation on RNA metabolism. Overall, the work is interesting and potentially impactful. The combination of live-cell imaging, seed-labeling strategies, genetic perturbations (SRRM2/PNN), and RNA imaging represents a thoughtful experimental approach. However, I have some minor concerns and feel the authors should address these - 1. Poly(A)+ FISH intensity is not a direct measure of export efficiency. The authors claim that tau aggregation within nuclear speckles interferes with nuclear export of RNA. It is highly possible that increased nuclear RNA levels observed could reflect altered transcription, stability, or stress responses rather than export defects alone. In the case of ATF3, a known stress responsive gene, increased nuclear signal could reflect transcriptional activation, not export defects. To prove that export is defective, the authors should at least measure total RNA levels (qPCR) in nuclear vs cytoplasmic fraction.

              To address this issue, we will quantify the levels of specific RNAs in the nucleus and cytoplasm by smFISH, which will allow us to clarify why there are more RNAs associated with nuclear speckles in the context of nuclear tau aggregates.
      

      Though the authors have shown the proposed role of VCP in generating secondary seeds by using inhibitors, the authors should show genetic validation by using dominant-negative VCP.

              This experiment essentially asks us to examine the role of VCP in nuclear tau aggregation by an additional method. We will add experiments examining how nuclear tau aggregates form when VCP is knocked down by siRNAs.  We have chosen not to use dominant negative VCP mutants since their phenotype will be complicated with the endogenous VCP possibly remaining functional.
      

      **Referees cross-commenting** *This session contains comments from different reviweers* Reviewer 3 I agree with the reviewers that additional controls and experiments would strengthen the VCP inhibition studies. However, I would like to clarify that the specific concern raised by Reviewer 1 (Key point number 4) regarding fusion of tau to GFP does not apply to this manuscript. In this study, the authors use tau conjugated to Cy3, a well established approach in the field that adds only approximately 1 kDa to the protein.

      Reviewer 1 Apologies reviewer 3, but I respectfully disagree. Please look again at the legends for figs 1 through fig 5. All clearly delineate the use of tau biosensor cells using a YFP rather than GFP fusion protein with tau. i do agree we should correct my review to state YFP rather than GFP, but structurally the concern remains the same. Cy3 labelling, I believe is used to track the relatively short lived exogenous seeds.

      Reviewer #3 (Significance (Required)):

      The integration of approaches presented here, especially in connecting tau aggregation with nuclear speckle biology and RNA processing, will be of broad interest and offers important new mechanistic insights into tau pathology. I am an expert in Alzheimer's disease and integrated stress response.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In their manuscript, Decker et al., investigate the relationship between cytoplasmic and nuclear tau aggregation using a HEK293T biosensor system and propose a multistep model in which cytoplasmic aggregates give rise to nuclear tau aggregates, potentially via VCP-dependent secondary seed generation and involvement of nuclear speckle components. The study further explores functional consequences of nuclear tau aggregation on RNA metabolism. Overall, the work is interesting and potentially impactful. The combination of live-cell imaging, seed-labeling strategies, genetic perturbations (SRRM2/PNN), and RNA imaging represents a thoughtful experimental approach. However, I have some minor concerns and feel the authors should address these -

      1. Poly(A)+ FISH intensity is not a direct measure of export efficiency. The authors claim that tau aggregation within nuclear speckles interferes with nuclear export of RNA. It is highly possible that increased nuclear RNA levels observed could reflect altered transcription, stability, or stress responses rather than export defects alone. In the case of ATF3, a known stress responsive gene, increased nuclear signal could reflect transcriptional activation, not export defects. To prove that export is defective, the authors should at least measure total RNA levels (qPCR) in nuclear vs cytoplasmic fraction.
      2. Though the authors have shown the proposed role of VCP in generating secondary seeds by using inhibitors, the authors should show genetic validation by using dominant-negative VCP.

      Referees cross-commenting

      This session contains comments from different reviewers

      Reviewer 3

      I agree with the reviewers that additional controls and experiments would strengthen the VCP inhibition studies. However, I would like to clarify that the specific concern raised by Reviewer 1 (Key point number 4) regarding fusion of tau to GFP does not apply to this manuscript. In this study, the authors use tau conjugated to Cy3, a well established approach in the field that adds only approximately 1 kDa to the protein.

      Reviewer 1

      Apologies reviewer 3, but I respectfully disagree. Please look again at the legends for figs 1 through fig 5. All clearly delineate the use of tau biosensor cells using a YFP rather than GFP fusion protein with tau. i do agree we should correct my review to state YFP rather than GFP, but structurally the concern remains the same. Cy3 labelling, I believe is used to track the relatively short lived exogenous seeds.

      Significance

      The integration of approaches presented here, especially in connecting tau aggregation with nuclear speckle biology and RNA processing, will be of broad interest and offers important new mechanistic insights into tau pathology. I am an expert in Alzheimer's disease and integrated stress response.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Decker et al. examines the formation of nuclear tau aggregates and their functional consequences using a HEK293T tau biosensor system. The authors propose that nuclear tau aggregates arise through VCP dependent secondary seeding from cytoplasmic aggregates and that these nuclear aggregates impair RNA export. The study addresses an important and relatively unexplored aspect of tau biology. However, several conclusions extend beyond what the data directly supports, and several essential controls are missing.

      Major Comments

      • The introduction is generally clear and well organized. However, it would benefit from additional mechanistic context explaining how polyserine domains might promote tau aggregation and why this feature is biologically relevant.
      • The live cell imaging convincingly demonstrates the temporal sequence of cytoplasmic followed by nuclear aggregation. However, the manuscript lacks controls assessing whether nuclear aggregation correlates with higher tau expression levels. Such controls are necessary to exclude expression driven artifacts.
      • The authors conclude that nuclear envelope integrity is preserved, but only import assays were performed. To validate the sensitivity and specificity of the assay, export assays or positive controls for nuclear transport disruption are required.
      • The Cy3/Cy5 seed experiments support the claim that exogenous seeds do not enter the nucleus. However, the conclusion that VCP generates secondary seeds is overstated. For example, the manuscript states: "VCP is responsible for the formation of secondary seeds..." (lines 418-439), yet the data demonstrate correlation rather than direct evidence of seed generation. To substantiate this conclusion, the authors should: directly quantify seed abundance. The current interpretation assumes uniform cytoplasmic uptake of seeds but does not measure it; Include controls addressing VCP inhibitor specificity, as these compounds have pleiotropic effects (e.g., ER stress, proteostasis collapse). No data is provided on whether VCP inhibition alters tau ubiquitination, which could have major implications on tau aggregation.
      • The authors observed increase in nuclear poly(A)+ RNA and specific transcripts. However, the current data do not distinguish between several possible mechanisms that may account for this increase, including impaired export, increased transcription, enhanced RNA stability, or nuclear retention due to speckle reorganization.
      • The discussion occasionally overinterprets the data. Several statements should be reframed as hypotheses rather than conclusions:

      • "VCP can generate tau seeds capable of additional seeding within a cell." (lines 572-594) This has not been directly demonstrated and should be softened accordingly.

      • Active import via SRRM2 is proposed, but no supporting data are presented. This should be clearly framed as a speculative model.
      • "Tau aggregates in the nucleus alter the function of nuclear speckles..." (lines 616-637). While plausible, this is not directly shown. Alternative explanations such as transcriptional upregulation or stress induced changes should be acknowledged.
      • The statement "It is possible that such nuclear aggregates could alter nuclear RNA export and contribute to pathology." (lines 637-655) is reasonable, but the authors should emphasize that nuclear tau aggregates are not consistently observed across tauopathies and that the HEK293T biosensor system may not fully recapitulate neuronal biology.

      Significance

      The manuscript by Decker et al. examines the formation of nuclear tau aggregates and their functional consequences using a HEK293T tau biosensor system. The authors propose that nuclear tau aggregates arise through VCP dependent secondary seeding from cytoplasmic aggregates and that these nuclear aggregates impair RNA export. The study addresses an important and relatively unexplored aspect of tau biology. However, several conclusions extend beyond what the data directly supports, and several essential controls are missing.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Decker et al present an interesting study of the order of events in tau seeding in a biosensor HEK293 derived cell line. This a critical unresolved question in the field about subcellular compartment contributions to tau aggregation. This exploration of the nuclear tau aggregated deposition and seeding in HEK293T tau biosensor cells uses a variety of imaging-based methods. They show that nuclear aggregates only form in cells with cytosolic aggregates, nuclear aggregates cannot form in the absence of cytoplasmic tau aggregates. The original tau seeds do not persist. Also aggregates in the nucleus are dependent on VCP and SRRM2 for formation. The paper is limited in scope by use of only HEK239T cells and seem to overstate the generalizability of the findings to neuronal mechanisms of neurodegeneration. Please see to all tauopathies. In particular, the manuscript does not outline the overstatement of some of the conclusions.

      Key Points to address:

      1. The manuscript does not detail limitations of the study in the discussion. Please address the concern that HEK293T biosensor cells are not neurons. Especially in the clear animations showing the transformation from cytoplasmic to nuclear aggregates appears to require cell division and nuclear breakdown.
      2. The introduction sets this up as Alzheimer's disease relevant but all studies are down with P301S tau which is a distinct and particularly aggressive form of tauopathy (FTLD-Tau). There is no amyloid beta component to any of these studies.
      3. The study does not address the peculiar structure of P301S aggregates, which while disease relevant are clearly distinct from AD or most forms of familial FTLD. The authors should limit the generalizability of the findings to their particular form of tauopathy unless they plan to use multiple tau fibril conformations in their studies.
      4. The authors do not address the potential impact of fusing a natively unfolded protein like tau to a highly structured beta barrel like GFP. Please present this potential confound.
      5. Inhibition of VCP can cause proteinopathies in the absence of other seeding. For instance, familial mutations in human VCP can cause either tau or TDP-43 proteinopathy depending on the specific human disease causing mutation. Thus, critical controls are missing from figure 3. For instance, the consequence of VCP inhibition on unseeded biosensor cells is a missing control. Second all panels should evaluate TDP-43 aggregation to ascertain whether or not the secondary nuclear seeding involves TDP-43.

      Minor concerns:

      A. Line 635 - In line 380, they discuss that aggregation of tau does not lead to perturbations in nuclear transport. In line 390, they discuss that aggregation of tau does not affect nuclear envelope integrity or nuclear import. However, in the discussion discusses that aggregation alters nuclear RNA export. These statements could use clarifying that protein export is not perturbed but RNA export and import may be.

      B. Line 564: "This observation suggests that tau aggregation in the cytoplasm may lead to increased expression of some RNAs." This could also be that cytoplasmic tau alters RNA export. These experiments don't differentiate between these options.

      C. In Figure 1, the authors show large aggregates overlapping the nucleus. It is unclear whether these aggregates have a portion both within and outside the nucleus or if they are deforming the nucleus and are wholly external to the nuclear compartment. Clarity on this issue is important. If the nucleus is deferment the observed aggregates seem reminiscent of aggresome formation. Please clarify.

      Significance

      Decker et al present an interesting study of the order of events in tau seeding in a biosensor HEK293 derived cell line. This a critical unresolved question in the field about subcellular compartment contributions to tau aggregation. This exploration of the nuclear tau aggregated deposition and seeding in HEK293T tau biosensor cells uses a variety of imaging-based methods. They show that nuclear aggregates only form in cells with cytosolic aggregates, nuclear aggregates cannot form in the absence of cytoplasmic tau aggregates.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers

      We would like to thank the reviewer for their constructive comments on our manuscript. We have addressed all comments made by the reviewers by additional experimental data, data analyses, and text edits. A detailed point-by-point response to the reviewers is documented below.

      Summary of new/amended data panels

      Fig 2C (Rev 2): Cell-by-cell quantification of the GFP fluorescence intensity as a surrogate measure of wild-type (WT) vs mutant Pfn1 rescue construct expression levels in B16F1 KO-rescue studies.

      Figs 1B, 2A, 3C, 4A, 4C (Rev 1, 3): Inclusion of zoomed images of PIP2 staining of select regions of interests.

      Figs 6B, 6D (Rev 2): Quantification of phospho-PKC substrate antibody immunoblots of MDA-231 and B16F1 cells with or without Pfn1 KO.

      Fig 3E (not requested by the reviewers): Time-lapse images of PIP2 biosensor and F-actin in HEK-293 cells.

      __Fig 3H (Rev 3): __Half-life comparison of LatB-induced PIP2 and F-actin responses

      Fig S1 (Rev 1): F-actin and PIP2 staining of MDA-231 cells with or without treatments of myosin inhibitor blebbistatin.

      Figs 6G-I (Rev 2, 3): Quantification of various parameters from Ca2+ imaging studies.

      Fig 6J-M (Rev 2): __Images and quantification of correlative PIP2 and DAG biosensor studies __in HEK-293 cells.

      Fig 7 (not requested by the reviewers)__: __A schematic model of how Pfn1 loss leads to PIP2 reduction in cells.

      Fig S2 (not requested by the reviewers): Effect of Pfn1 knockdown on PI4P in HEK-293 cells.

      Fig S3B (Rev 2): A list of top 100 (50 up, 50 down) differentially expressed genes in response to Pfn1 KO in MDA-231 cells.

      Point-by-Point response

      __REVIEWER 1: __

      1. "The quantifications of the PIP2 levels were apparently done simply by measuring the fluorescence intensities of wild-type and knockout cells stained with monoclonal actin-PIP2 antibody. However, the knockout cells appear more spread compared to the wild-type cells (Fig. 1B), and this can possibly affect the quantifications (e.g. there may be more plasma membrane ruffles/folds in the wild-type cells). Thus, I recommend that in all critical quantifications the authors would also use a general plasma membrane marker to confirm that the PIP2-density (and not just morphology of the plasma membrane) is indeed affected by Pfn1-depletion". Response: For PM PIP2 analysis, we specifically quantified the total rather than the average PM PIP2 staining intensity (as also previously done in other studies - Hammond et al. J. Cell Science 2006; Biochem. J 2009) for three reasons. First, PIP2 is non-uniformly distributed across the PM, and therefore the average intensity calculation collapses a lot of biologically meaningful spatial information. Second, the average intensity calculation is impacted by significant cell shape and area differences that exist between cells within a group as well as between groups. Third, the integrated PM intensity is a better metric of how much total PIP2 is available for metabolic turnover on a cell-by-cell basis. These justifications are now detailed in the revised manuscript.

      In our previous study (Ricci et al., J. Biol. Chem 2024, PMID 38141770), we utilized orthogonal techniques (immunostaining, lipid dot blot) in multiple cell lines to demonstrate that total PIP2 as well as PIP2 intensity at the plasma membrane (PM) (based on manual tracing of hundreds of cells in immunostaining experiments) are reduced by silencing Pfn1 expression, and conversely, elevated upon Pfn1 overexpression. We would like to clarify here that in our present study we used an automated pipeline in "cell profiler" to detect cell edges and quantify integrated PM intensity of PIP2 in control vs Pfn1 knockout (KO) cells, and our present findings in Pfn1 KO setting recapitulated our previous findings in transient knockdown setting. While our cell-profile pipeline accurately detects the cell edges, we address the reviewer's comment on confirmation of findings with a PM marker by providing new experimental data in HEK-293 cells transfected with fluorescence biosensors of PIP2 and DAG along with a PM marker (iRFP-Lyn11), which also shows reduction of PIP2 fluorescence staining at the Lyn11-positive PM regions in Pfn1 knockdown cells relative to control cells (see new data panels Figs 6J, L).

      "To get a better idea about which cellular actin filament structures are important for regulating the PIP2-levels at the plasma membrane, one could also use a larger repertoire of actin/myosin inhibitors (CK666, cytochalasin-B, blebbistatin). By using these compounds, one may e.g. uncover if the Arp2/3-nucleated branched actin networks and/or contractile actomyosin structures would specifically contribute to regulation of the plasma membrane PIP2 levels".

      Response: We thank the reviewer for this suggestion. We have now evaluated the effect of blebbistatin treatment on PIP2 in MDA-231 cells (now shown supplementary Fig S1). A previous study showed that the major effects of blebbistatin on actin cytoskeleton are disintegration of actin stress fibers, softening of cortical actin, and transformation of lamellipodial actin into loose network of accumulated amorphous actin structures that correspond to membrane ruffles (Shutova et al., 2012). These phenotypes were also recapitulated in our experimental settings. In general, blebbistatin-treated cells exhibited protrusive structures in random directions with PIP2 enrichment in peripheral F-actin-rich regions (consistent with the LatB experimental data) and a higher (p=0.09) overall cell edge PIP2 staining vs vehicle-treated cells further underscoring the impact of actin cytoskeletal perturbation on PM PIP2.

      "The effects of PLCb3 silencing on Pfn1-dependent changes in the PIP2 levels are interesting. To gain better insight into the underlying mechanism, one could also check if the levels of active (phosphorylated) PLCb3 are affected upon Pfn1-depletion".

      Response: We would like to point out that unlike PLCg, PLCb is not activated by phosphorylation. While literature has documented that certain site-specific phosphorylations of PLCb by PKC (in a feedback manner) and PKA, these phosphorylation events, if at all, have inhibitory effect on PLCb activity. Since our data supports the model that Pfn1 loss leads to an increase in PLC-mediated PIP2 hydrolysis and downstream PKC activation, we feel that probing for such inhibitory feedback phosphorylation events will not provide any mechanistic insights.

      "In the 'Discussion', the authors speculate that Pfn1 H119E mutant may have more frequent interactions with PIP2 as compared to wild-type Pfn1. This does not make much sense, because Pfn1 binding to PIP2 is very weak (e.g. ref. 28), and it is unlikely that introducing a negativelycharged glutamate would increase its affinity to negatively charged headgroup of PIP2. Thus, it seems unlikely that Pfn1 would affect the PIP2 content of plasma membrane through direct interactions with PIP2".

      Response: __We did not mean to imply that glutamate substitution of H119 residue would necessarily increase Pfn1's __intrinsic affinity to negatively charged PIP2. While PIP2 binding of WT vs H119E-Pfn1 has never been quantified in biochemical assays, we previously (Bae et al. PNAS 2010; PMID 21115820) showed that H119E substation does not affect the membrane fraction of ectopically overexpressed Pfn1 in cells. Along this line, Pascal-Goldschmit and colleagues (PMID: 7673143) also showed that analogous mutant H119D-Pfn1 inhibits PLCg-mediated PIP2 hydrolysis as efficiently as WT-Pfn1, further underscoring the fact that H119D/E-Pfn1 is not defective in membrane phosphoinositide binding. Our data largely supports a model that Pfn1-dependent PIP2 alteration is predominantly related to its actin-regulatory function. However, since Pfn1's binding to actin and PIP2 are mutually exclusive, we cannot absolutely rule out a minor (possibly insignificant) contribution of Pfn1's ability to block PIP2 hydrolysis by direct PM interaction. We therefore offered a hypothetical scenario where H119E-Pfn1 mutant may have more frequent interaction with PM PIP2 simply because it is not able to interact with actin. We have now better clarified this argument in the "Discussion" section of the revision.

      "The cell images in Fig. 2A are bit difficult to follow due to the large number of cells in the images. One could perhaps show higher resolution images with few knockout and rescue cells in the same field of view and indicate the rescued cells in these images e.g. with arrows".

      Response: As requested by the reviewer, we have now shown zoomed images in Fig 2A in the revision.

      "Please clearly describe in each figure legend what the error bars represent"

      Response: We have now clearly mentioned in the Statistics section of "Materials and Methods" that all error bars represent standard deviation unless explicitly mentioned otherwise.



      REVIEWER 2

      1. "The data show that actin binding-deficient mutants of Pfn1 do not rescue the knockdown. In these experiments, it is critical to quantitate the relative expression levels of the mutants. The model that Pfn1 regulation of PIP2 requires interactions with actin is not really clear - is it due to Pfn1 targeting by actin binding, or Pfn1 regulation of actin itself? Either possibility seems possible, and the experiments do not distinguish them". Response: We thank the reviewer for these comments. First, since GFP and Pfn1 rescue constructs are linked by an IRES, we analyzed GFP fluorescence intensity of cells selected for PIP2 analyses as a surrogate measure for comparing the relative expressions of Pfn1 rescue constructs across the various groups. As per these analyses (based on measurements of hundreds of cells from 3 different experiments), the average GFP expression of cells chosen for PIP2 analyses was found to be comparable between the various Pfn1 KO rescue groups (now shown in Fig 2C). Therefore, we argue that our observed phenotypic differences related to PIP2 are not confounded by the expressions of various Pfn1 rescue constructs.

      Second, it is known that Pfn1 loss leads to pronounced reduction in lamellipodial F-actin content (as shown in Figs 3A-B). Our LatB experimental data (Figs 3E-G) show that actin depolymerization leads to pronounced PM PIP2 reduction within minutes. Based on these findings, taken together additional evidence for increased basal PLC activity signature readouts in Pfn1-deficient cells (i.e. greater baseline PKC activity, greater PM DAG/PIP2 ratio from biosensor studies as recommended by the reviewer (new data - shown in Figs 6J-M)), we postulate (concurring with Reviewer 3) that disruption of cortical cytoskeleton (possibly also accompanied by removal of PIP2-binding adaptor proteins) may enhance PIP2's accessibility to hydrolytic enzymes. In fact, two previous studies (Cho et al., PNAS, 2005 and Andrade et al., Scientific Reports 2015) have demonstrated that actin filament disruption increases PM mobility of PIP2. There is also evidence for actin depolymerization-induced uncaging of PLC from the cortical actin network (Huang et al, Planta, 2009). Therefore, in principle, Pfn1 loss may cause more frequent PLC-PIP2 interaction and enhance baseline PIP2 hydrolysis by either increasing PM diffusion of PIP2 and/or uncaging of PLC. We have now included a schematic working model (Fig 7) to illustrate this concept and added these points in the discussion. However, a direct demonstration of increased PIP2 accessibility of PLC in Pfn1-deficient cells is beyond the scope of the present - this is something we will pursue in the future.

      "The knockdown data on PLCbeta is convincing with regard to its role in PIP2 reductions, but the papers does not explain how actin-Pfn1 interactions regulate PLCbeta".

      Response: Please see our detailed response to the previous comment that specifically addresses how we envision Pfn1 negatively regulates PLC-mediated PIP2 hydrolysis via modulating actin cytoskeleton.

      "The transcriptome data must be provided along with the data in Figure 5 - otherwise it is impossible for the reader to evaluate. The fact that the data is being used in another paper is not an adequate reason for its omission".

      Response: The transcriptomic data is now displayed in Supplementary Figure S3, where we have now listed top 100 (50 up, 50 down) differentially expressed genes in response to Pfn1 KO in MDA-231 cells (see panel B in Fig S2). We are in the process of submitting the FASTA file to GEO database.

      "The PKC substrate data is not convincing. The blots are messy, and there is no quantitation".

      Response: Since phospho-PKC substrate antibody is supposed to recognize all phosphorylated proteins by PKC, we expect to see multiple bands. The intensity of each lane in entirety is approximative of PKC activity by detecting proteins at multiple molecular weights phosphorylated at their serine residues. We have replaced the B16 generated data with a better-quality blot and added quantifications with statistical analysis (Figs 6B, D).

      "The calcium data should include statistical analysis of the differences".

      Response: We have now performed statistical analyses of the calcium data. Specifically, we compared the peak amplitude, integrated Ca2+ signal (area under the curve), and the post-stimulation resting value between control and Pfn1 knockdown groups. As per these analyses, we did not see any significant difference in either the peak amplitude or integrated Ca2+ signal between the control and Pfn1 knockdown groups, further underscoring the fact that Pfn1 loss does not necessarily confer cells an increased ability to respond to agonists (i.e. LPA-induced GPCR activation in this specific case). However, we noted that the post-stimulation resting Ca2+ signal was elevated in Pfn1-deficient cells relative to control cells (p2 hydrolysis and/or reduced re-uptake of cytosolic Ca2+ by endoplasmic reticulum and/or reduced efficiency of Ca2+ export. These analyses are now included in Figs 6G-I in the revision.

      "The discussion of DAG and PA levels is problematic. As the authors are aware, whole cell lipidomics can easily miss small changes in specific compartments. If the authors think that lipid sensor analysis of PM DAG and PA would strengthen the analysis, then this should be included. The large change in PC levels does seem to suggest an alternative source of PA. While the authors present arguments against a role for PLD, this could be directly tested. In any case, the finding of a nearly 100-fold greater change in PC than in PA raises question about what the whole cell PA measurements is really detecting".

      Response: We thank the reviewer for these comments and experimental suggestions__. First__, we completely agree with the reviewer that whole cell lipidomic analyses fail to detect small changes in specific compartment; we mention this point in the revision. In the revision, we have displayed our lipids of interest as individual line plots connecting control and Pfn1 KO group experiment-by-experiment to show the trend of lipid change in each experiment. As per these analyses, in 4 out 5 experiments, the total DAG increased in Pfn1 KO cells. However, the large experiment-to-experiment variability in the absolute content as well as Pfn1-dependent changes in DAG precluded us from achieving statistical significance between the two groups. The large variability in the measured DAG content in our experiments is not totally surprising since cellular DAG level is known to fluctuate with growth and/or impacted by unintended changes in the chemical parameters of culture condition. However, the largest pool of DAG is in ER/golgi, and since whole cell lipidomic measurements fail to reveal PM DAG due to PIP2 hydrolysis, as per reviewer's recommendation, we now include lipid biosensor experimental data (Fig 6J-M) of control vs Pfn1 knockdown HEK-293 cells to demonstrate that PM DAG-to-PIP2 ratio (an indicator of the basal PIP2 hydrolysis efficiency) is increased upon Pfn1 depletion. We believe that these new correlative PIP2/DAG biosensor data further strengthen our conclusion.

      Regarding the reviewer's comment on the orders of change in PC vs PA, we clearly mentioned in the original discussion that it is highly unlikely that PA increase in Pfn1-deficient cells is reflective of increased PLD-mediated conversion of PC for two reasons. First, we saw disproportionate orders of magnitude of changes in the content of PA (~3000 pmol/mg increase) vs PC (>200,000 pmol/mg decrease) in response to Pfn1 KO in MDA-231 cells. Second and more importantly, since monomeric actin directly binds to and inhibits the activity of PLD, the expected increased G-to-F-actin ratio in Pfn1-deficient cells, if at all, would likely result in diminished PLD activity reducing PLD-mediated conversion of PC to PA.

      In our opinion, since DAG is the direct hydrolysis product of PIP2 and we are now able to demonstrate elevated PM DAG-to-PIP2 ratio in Pfn1-deficient cells in biosensor experiments, PA biosensor studies are not necessary.

      REVIEWER #3

      1. "General: Scale bar labels are too small, please also provide time-stamps for time course measurements" Response: These concerns have been addressed in the revision.

      "As with every antibody stain, there is a remaining risk that a change in the cellular context affects an off-target of the antibody (e.g., a protein phosphorylation site). I think that this is not particularly likely, but I'd control for it, which can be done in a straightforward manner: The authors could do a strong-detergent treatment to rule out a potential off-target effect of the antibody (e.g., 0.1% Triton X-100, 1 h). This should remove all (non-amino-) lipids from the sample, including the phosphoinositides. Overall, binding of the antibody should be strongly reduced, fluorescence images should be much dimmer & the effect of the Pfn1 KO should mostly disappear."

      Response: The PIP2 antibody used in the present study is a well-vetted and widely used antibody in literature. Notably, two papers published by Dr. Hammond (one of the co-authors), an expert in phosphoinositide signaling, previously showed selectivity of this antibody by blocking with lipids, neomycin, and PH-domain of PIP2-binding proteins (Hammond et al, J. Cell Sci, 2006; Biochem J. 2009). We cite these papers in the revision.

      "Figure 1: Please show images in a larger zoom, cell details are barely visible (same for Figure 3). I also would not use "PM PIP2 levels" in the legend, as nuclei appear visibly lighter, indicating that some PIP2 is likely present in other membranes. The type of PIP2 staining should be specified in either the Figure itself or in the legend."

      Response: We would like to clarify here that we used an automated pipeline in "cell profiler" to detect cell edges and quantify integrated PM intensity of PIP2 in control vs Pfn1 knockout (KO) cells; so nuclear membrane PM is not accounted for in the analyses. We have zoomed PIP2 images in Figure 1 as the reviewer suggested. These changes are incorporated in the revision.

      "Figure 3: Same comment as for Figure 1, zoomed images would really help, especially for the PM/Cytosol distribution of the PIP2 biosensor"

      Response: Zoomed images of Fig 3 have been provided in the revision.

      "The lag time in the dissociation of the PIP2 sensor is interesting, as is the fact that the kinetic of PIP2 biosensor release is (visually) slower. I recommend to do a couple of simple fits to quantify these effects. If my impression holds, this would be a strong support of the author's interpretation that actin depolymerization actually leads to a loss of PM PIP2 - a simple binding/unbinding kinetic would be much closer to the actin depolymerization kinetic".

      Response: As suggested by the reviewer, we have done curve fitting of these data to calculate the half-life of F-actin and PIP2 (results shown in Fig 3H). As per these calculations, the mean half-life of PIP2 (~ 1min) is significantly longer than that of F-actin (~2.2 min) which further supports our interpretation that actin depolymerization leads to a loss of PM PIP2.

      "Figure 4: Same comment as for Figures 1 and 3, zoomed images would be most helpful."

      Response: Zoomed images have been provided in the revision.

      "Figure 5G: It looks like the two conditions were internally normalized. Given that we're looking at differential levels of PIP2/IP3/DAG, I think it is very possible that baseline Ca levels are also different. I'd either report in au or do a global normalization which would also capture any difference between the two conditions. This should also clarify whether there are differences in post-stimulus steady state Ca levels, as it currently looks like".

      Response: Since we used a transfectable Ca2+ biosensor (GCaMP), to account for cell-to-cell variation in the actual expression of the biosensor, we had to baseline-corrected GCaMP fluorescence by normalizing each kinetic datapoint readout to the average pre-stimulation value on a cell-by-cell basis. However, we have now performed additional analyses. Specifically, we calculated the peak amplitude, integrated Ca2+ signal (area under the curve), and the post-stimulation resting value for each of the two groups. As per these analyses, we did not see any significant difference in either the peak amplitude or integrated Ca2+ signal between the control and Pfn1 knockdown groups, further underscoring the fact that Pfn1 loss does not necessarily confer cells an increased ability to respond to agonists (i.e. LPA-induced GPCR activation in this specific case). However, we noted that the post-stimulation resting Ca2+ signal was elevated in Pfn1-deficient cells relative to control cells (p2 hydrolysis and/or reduced re-uptake of cytosolic Ca2+ by endoplasmic reticulum and/or reduced efficiency of Ca2+ export. These analyses are now included in Figs 6G-I in the revision.

      "Please increase the font size in Figure 6C, this is barely readable".

      Response: We have now replaced that panel with one with bigger font texts.


      "Do the authors think that most PIP2 is actually in lipid-protein complexes and actin depolymerization with the corresponding removal of PIP-binding adaptor proteins exposes previously shielded PIP2 molecules to enzymatic hydrolysis?"

      Response: Yes, we certainly think that is the most likely scenario. Please see our detailed response to Reviewer 2's comment #1. We have now clearly included this in the discussion and added a schematic mechanistic model to better illustrate our thinking (Figure 7).

      "The lipidomic changes are extremely interesting. This could indicate a change in overall cellular architecture which goes beyond PIPs. SM/Chol/PC all go down - I'd interpret that this as a relatively lower content of Plasma membrane and ER. It would be interesting to see if the surface to volume ratio of the cell changes - a comparison with total Cardiolipin as a proxy for mitochondrial membrane size could also be informative. It may very well be that the Pfn1 KO effects on structural membrane lipids are the more important finding - but elucidating that mechanism is beyond the scope of the current manuscript. I look forward to learning about it in the next story".

      Response: We thank the reviewer for this insightful comment. However, this is something we would consider as a scope of future studies.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Orenberg et al. is a well done, well-written paper that provides an in-depth look at the effects of Pfn-1 depletion on PIP2 levels, actin polymerisation and the broader lipidome. I enjoyed reading it, the main conclusions are sound and well-taken & the finding that PIP2 levels correlate with actin polymerization is intriguing as well as the fact that the global lipid to protein ratio changes. This is indicative of the identification of a major player in lipid flux pathways. I have just a few suggestions for control experiments, formulations & figure layout changes that I think will make the paper even better:

      • General: Scale bar labels are too small, please also provide time-stamps for time course measurements.
      • As with every antibody stain, there is a remaining risk that a change in the cellular context affects an off-target of the antibody (e.g., a protein phosphorylation site). I think that this is not particularly likely, but I'd control for it, which can be done in a straightforward manner: The authors could do a strong-detergent treatment to rule out a potential off-target effect of the antibody (e.g., 0.1% Triton X-100, 1 h). This should remove all (non-amino-) lipids from the sample, including the phosphoinositides. Overall, binding of the antibody should be strongly reduced, fluorescence images should be much dimmer & the effect of the Pfn1 KO should mostly disappear.
      • Figure 1: Please show images in a larger zoom, cell details are barely visible (same for Figure 3). I also would not use "PM PIP2 levels" in the legend, as nuclei appear visibly lighter, indicating that some PIP2 is likely present in other membranes. The type of PIP2 staining should be specified in either the Figure itself or in the legend.
      • Figure 3: Same comment as for Figure 1, zoomed images would really help, especially for the PM/Cytosol distribution of the PIP2 biosensor.
      • The lag time in the dissociation of the PIP2 sensor is interesting, as is the fact that the kinetic of PIP2 biosensor release is (visually) slower. I recommend to do a couple of simple fits to quantify these effects. If my impression holds, this would be a strong support of the author's interpretation that actin depolymerization actually leads to a loss of PM PIP2 - a simple binding/unbinding kinetic would be much closer to the actin depolymerization kinetic.
      • Figure 4: Same comment as for Figures 1 and 3, zoomed images would be most helpful
      • Figure 5G: It looks like the two conditions were internally normalized. Given that we're looking at differential levels of PIP2/IP3/DAG, I think it is very possible that baseline Ca levels are also different. I'd either report in au or do a global normalization which would also capture any difference between the two conditions. This should also clarify whether there are differences in post-stimulus steady state Ca levels, as it currently looks like.
      • Please increase the font size in Figure 6C, this is barely readable

      For the discussion:

      • Do the authors think that most PIP2 is actually in lipid-protein complexes and actin depolymerization with the corresponding removal of PIP-binding adaptor proteins exposes previously shielded PIP2 molecules to enzymatic hydrolysis?
      • The lipidomic changes are extremely interesting. This could indicate a change in overall cellular architecture which goes beyond PIPs. SM/Chol/PC all go down - I'd interpret that this as a relatively lower content of Plasma membrane and ER. It would be interesting to see if the surface to volume ratio of the cell changes - a comparison with total Cardiolipin as a proxy for mitochondrial membrane size could also be informative. It may very well be that the Pfn1 KO effects on structural membrane lipids are the more important finding - but elucidating that mechanism is beyond the scope of the current manuscript. I look forward to learning about it in the next story.

      André Nadler

      Significance

      The manuscript by Orenberg et al. is a well done, well-written paper that provides an in-depth look at the effects of Pfn-1 depletion on PIP2 levels, actin polymerisation and the broader lipidome. I enjoyed reading it, the main conclusions are sound and well-taken & the finding that PIP2 levels correlate with actin polymerization is intriguing as well as the fact that the global lipid to protein ratio changes. This is indicative of the identification of a major player in lipid flux pathways.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      1. The data show that actin binding-deficient mutants of Pfn1 do not rescue the knockdown. In these experiments, it is critical to quantitate the relative expression levels of the mutants. The model that Pfn1 regulation of PIP2 requires interactions with actin is not really clear - is it due to Pfn1 targeting by actin binding, or Pfn1 regulation of actin itself? Either possibility seems possible, and the experiments do not distinguish them.
      2. The knockdown data on PLCbeta is convincing with regard to its role in PIP2 reductions, but the papers does not explain how actin-Pfn1 interactions regulate PLCbeta.
      3. The transcriptome data must be provided along with the data in Figure 5 - otherwise it is impossible for the reader to evaluate. The fact that the data is being used in another paper is not an adequate reason for its omission.
      4. The PKC substrate data is not convincing. The blots are messy, and there is no quantitation.
      5. The calcium data should include statistical analysis of the differences.
      6. The discussion of DAG and PA levels is problematic. As the authors are aware, whole cell lipidomics can easily miss small changes in specific compartments. If the authors think that lipid sensor analysis of PM DAG and PA would strengthen the analysis, then this should be included. The large change in PC levels does seem to suggest an alternative source of PA. While the authors present arguments against a role for PLD, this could be directly tested. In any case, the finding of a nearly 100-fold greater change in PC than in PA raises question about what the whole cell PA measurements is really detecting.

      Significance

      The manuscript by Orenberg et al. is an extension of previous work showing a link between Pfn1 and PM PIP2. While the new data expand the observations, and the PIP2 biosensor data are clean, the proposed model is not really convincing or fully defined - a number of elements are suggestive but not definitive. Several of the data could have multiple explanations (some of which are acknowledged in the discussion). The overriding hypothesis is that Pfn1-actin coupling regulates PLCbeta, but it is not clear how this would happen. Finally, several of the data are not convincing (PKC substrates) or lack statistical analysis (calcium imaging).

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Earlier studies have shown that actin-binding protein, profilin, can inhibit the PLC-dependent hydrolysis of PIP2 in vitro and provided evidence that acute profilin-1 (Pfn1) knockdown results in diminished PIP2-levels at the plasma membrane. However, the underlying mechanism by which profilin regulates PIP2-levels in cells has remained elusive. Here, Orenberg at al., show that Pfn1-dependent changes in the plasma membrane PIP2 levels are not transient. Interestingly, they also provide evidence that Pfn1 controls plasma membrane PIP2 levels through its actin-regulating activity and not through directly interacting with PIP2. Finally, they show that loss of Pfn1 also affects the levels of many other lipids in cells.

      Majority of the data presented in the manuscript appear of good technical quality, but I have some suggestions to strengthen the manuscript.

      1. The quantifications of the PIP2 levels were apparently done simply by measuring the fluorescence intensities of wild-type and knockout cells stained with monoclonal actin-PIP2 antibody. However, the knockout cells appear more spread compared to the wild-type cells (Fig. 1B), and this can possibly affect the quantifications (e.g. there may be more plasma membrane ruffles/folds in the wild-type cells). Thus, I recommend that in all critical quantifications the authors would also use a general plasma membrane marker to confirm that the PIP2-density (and not just morphology of the plasma membrane) is indeed affected by Pfn1-depletion.
      2. To get a better idea about which cellular actin filament structures are important for regulating the PIP2-levels at the plasma membrane, one could also use a larger repertoire of actin/myosin inhibitors (CK666, cytochalasin-B, blebbistatin). By using these compounds, one may e.g. uncover if the Arp2/3-nucleated branched actin networks and/or contractile actomyosin structures would specifically contribute to regulation of the plasma membrane PIP2 levels.
      3. The effects of PLCb3 silencing on Pfn1-dependent changes in the PIP2 levels are interesting. To gain better insight into the underlying mechanism, one could also check if the levels of active (phosphorylated) PLCb3 are affected upon Pfn1-depletion.
      4. In the 'Discussion', the authors speculate that Pfn1 H119E mutant may have more frequent interactions with PIP2 as compared to wild-type Pfn1. This does not make much sense, because Pfn1 binding to PIP2 is very weak (e.g. ref. 28), and it is unlikely that introducing a negatively-charged glutamate would increase its affinity to negatively-charged headgroup of PIP2. Thus, it seems unlikely that Pfn1 would affect the PIP2 content of plasma membrane through direct interactions with PIP2.
      5. The cell images in Fig. 2A are bit difficult to follow due to the large number of cells in the images. One could perhaps show higher resolution images with few knockout and rescue cells in the same field of view and indicate the rescued cells in these images e.g. with arrows.
      6. Please clearly describe in each figure legend what the error bars represent.

      Significance

      Although this study does not determine the actual mechanism/pathway by which Pfn1 controls plasma membrane PIP2 levels, it nevertheless provides evidence that perturbation of the actin cytoskeleton by loss of actin-binding profilin or with an actin inhibitor latrunculin-B results in decrease in the plasma membrane PIP2, and that PLC-activity is critical for regulation of PIP2 levels downstream of Pfn1 in cells. Therefore, this study presents a valuable contribution to a specific field, and will be interesting to those studying the actin cytoskeleton - plasma membrane interplay.

      My expertise: Cytoskeleton research.

  3. Apr 2026
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      __Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      __Summary: Overall, this study adds a large amount of data for the scyphozoan Aurelia coerulea by producing several single-cell RNA sequencing libraries that cover the transition from polyp to medusa. The study provides a modern view of cell type diversity and cell-specific transcriptome changes during this period of extreme morphological change in this particular cnidarian lineage, which is understudied. Certain unique cell subtypes, including neural cell subtypes and muscle cell subtypes which are specific to different life stages are discussed in detail providing some new insights.

      My overall assessment is that the manuscript has good potential to be impactful, but in its current form it is somewhat clunky and overly complex to read, the figures were too crowded and difficult to comprehend, and the authors did not provide enough context regarding the current state of knowledge and what this study adds to it. In particular, Figure 1 and the section about striated and smooth muscles sharing partial transcriptomic profiles need the most work. The results were presented in the context of the anthozoan Nematostella but this should be broadened further to include other cnidarian single-cell studies, such as those from Hydra and Clytia which are both medusozoans like Aurelia. The writing throughout could be streamlined and simplified to better highlight the major findings as described in the abstract of the paper. Several figures were not well presented or clear and could be improved or decluttered to better communicate and support important results. In addition, some methods were totally missing, and I was unable to access the github repository associated with the paper which should detail all analyses described in the paper. In its current form, reproducibility of analyses would be quite limited. I did greatly appreciate the inclusion of the data on the UCSC Cell Browser, which allows anyone to access the single cell data matrix for visual exploration.

      Answer: We thank the reviewer for the overall positive assessment and have tried to address all of the comments that follow.

      Major comments: The Introduction section was very short - only three paragraphs. I feel that this section could be expanded to give more context about Aurelia as a research organism, and the current resources available. This includes genomic and transcriptomic resources particularly those focused on the transition between life cycle stages (polyp to medusa). Any other relevant background on cell type diversity or if there is anything known about the molecular profile of specific cell types found in different life stages should also be included here . Do marker genes already exist for some of the important cell types discussed in the manuscript? It would be better to present the current state of knowledge, and context for why this study was done, how it builds upon current knowledge, and what it adds to our current understanding so that the study is properly framed from the beginning.

      Answer: Introduction was expanded and also includes explanations to which extant medusa specific cell-types were investigated so far. This additional information is highlighted in blue typeface in the manuscript.

      In the Results section, I find the sentence on p. 4, "Further, ~70% of these gene models do not have readily identifiable orthologs and thus represent putative orphan genes" to be rather confusing. What analysis was performed to determine this percentage, and which set of organisms were compared? Doesn't this percentage seem rather high for a cnidarian? Or is this referring to orthologs outside of cnidaria? Please comment further on how this percentage was determined and possible explanations for it being this high. Right now, it just feels tacked on to this paragraph with no context or further explanation which leads to the confusion.

      __Answer: __This statement originally referred to a lack of any best-blast-hit nor any protein domain annotation found for the sequence. This number has dropped to only 47% with the most recent mapping tool, which is a value also fairly commonly found in other animal genomes. Nonetheless this statement has been removed from the manuscript.

      Figure 1. There are many issues with this figure that encompass how I felt generally about the figures of the paper. The figure should ideally take up the entire width of the page rather than squishing some text next to the figure.

      __Answer: __The figures are intended to be a full page, they are also included embedded into the text to facilitate review of the manuscript and the full-resolution figures are included for proper review. In the revised version we have kept this comment in mind to ensure the figures are legible.

      Figure 1A: The colors of the different developmental stages from which tissue was samples (e.g. polyp1, polyp2, polyp.clover) do not seem to match between legend and figure. For example, the "polyp.clover" stage is circled in blue in the schematic, but given a green dot in the legend. The "medusa.manubrium" is circled in orange in the schematic, but given a purple dot in the legend. Suggest making the colors match between legend and schematics.

      __Answer: __ The colors correspond to the grouped stages and colour palette used for the life cycle stage divisions. This has been considered in the revised figure

      Figure 1E: In Panel E, the labels showing that the top graph is "polyp" and the bottom graph is "medusa" are much too small. Increase the font size of the labels. The font size for the GO terms themselves are also too small.

      __Answer: __This figure has been removed in the revision; Attention has been paid to font sizes in the revised figures.

      Figure 1F: The bulk of this study centers around the single-cell RNA sequencing data and resulting analyses from these data. As such, I would expect the cellular atlas resulting from these data to be similarly highlighted. In Figure 1F, the annotated cell atlas as presented is much too small, making it impossible to even add the labels for the different clusters directly on the UMAP. Suggest increasing the size substantially to at least half of the page width, so that it is possible to do so.

      __Answer: __This has been removed in the revision; the full distribution of the identified clusters is now figure 2. We do not include all of the population sub-types on the UMAP in this figure as this is simply a visualization tool and the distribution of the sub-types on that map is not necessarily informative. Rather we include the relative proportions of the sub-types/states in the bar plot, and the relationships between these clusters in the tree.

      -There should also be a complimentary figure in the supplement that shows all of the individual clusters, each in different colors and clearly annotated with labels, rather than just showing multiple clusters that were combined into the major cell types. There is an example of this in the Clytia single cell paper (see Chari et al. 2021 Figure 2A vs Fig S9).

      __Answer: __A fully coloured UMAP with all cell states is available in the supplement figure S3

      -The graph on the right of this panel showing the "Distribution of cell types in time and space" is overly complicated with all of the colors and the meaning is quite lost as it is quite difficult to interpret at this very small size. Suggest removing and possibly showing as a supplemental figure so that it's meaning is easier to assess.

      __Answer: __This barplot is now larger and includes both the partitions (major cell populations, as seen in the UMAP) and proportion of individual cell clusters. We feel this is an intuitive way to illustrate the relative distributions of all cell type states across the dataset as a whole and so we keep this in the main figures of the manuscript.

      -In addition, striated muscles are marked on the overall UMAP; however, it is not noted until later that the smooth muscles are part of the "outer epidermis" cluster. Suggest altering the legend or the text of the figure itself to show where the smooth muscles are thought to be in the overall UMAP, especially since they are specifically discussed in depth later in the manuscript. Exactly which "part" of the outer epidermis cluster includes the smooth muscle cells?

      __Answer: __We have added the smooth muscle cluster in the main figure umap.

      Figure 1G: Panel G, for example, is not useful in conveying its point as the text labels are too tiny and the figure is overly complex to be squished into a panel of this figure. Suggest removing and making 1G a supplemental figure by itself or perhaps together with 1C (as they are linked) where it is more legible. The figure legend text for Fig 1G is also confusing as it refers to "scyphozoa" in (C) but there is no "scyphozoa" in 1C, only "medusa".

      __Answer: __This is now Figure 1D and E and is given increased space in the figure. We feel the message that the medusa-specific gene set is not restricted to medusa-specific cell types is an important one and so we have kept this in the main figure. We provide a table with all gene annotations in the supplement so that it is accessible to anyone with further interest (DS1.1a and DS1.1b).

      Text, p. 6: The explanation for how the clusters were annotated in Fig 1 and Fig 2 is much too vague. The text states, 'We identified 9 broadly defined cell populations, for which we assign identities by assessing up-regulated gene lists (Data S1.3)." What does this mean? How exactly were the up-regulated gene lists assessed? This needs to be clarified further. What genes were used to label these clusters or groups as particular cell types? How does the annotation relate to Supplemental Tables S1.3 and S1.3b? Does the previous literature need to be cited to support these annotations based on specific genes? Suggest doing a better job overall and providing more detail and context explaining how the single cell clusters were annotated.

      __Answer: __We have expanded our description of how we assigned identities to the nine principal cell type families as follows:

      (pg. 8) The inner epithelia, or gastrodermis, expresses several collagens that are a characteristic of the inner cell layer of anthozoans (39); the outer cell layer houses the ring musculature and is rich in contractile proteins. The striated muscle cluster is also rich in contractile protein and is the only principal cell population absent from the polyp-derived samples (Fig. 2C). The mucin gland expresses mucin-like-proteins, whereas the digestive gland expresses other digestive enzymes, and the neural cluster expresses synapsin and other conserved known neural regulators such as ashA. The cnidocytes express mini-collagens and are enriched in pathways targeting the endoplasmic reticulum (40).

      Text, starting on p14: "Striated and smooth muscles share partial transcriptomic profiles." This section is highly confusing and could do with some simplification in both text and figures. - The genes for which expression is shown in Fig. 5, 6 and 7 are not properly introduced or given nearly enough context in the text. For example, the text states, "To investigate the dynamics of muscle formation, we further compared phalloidin staining of muscle fields with in situ hybridization detection of specific cluster marker expression in polyps (Fig. 5), strobila (Fig. 6), and ephyra (Fig.7)." However, it is not until the legend of Figure 7 and also much later in the text (in the Discussion, p23) that it is noted what types of muscles each of the genes used in ISH actually mark ("While a small set of genes are shared across the two muscle phenotypes (e.g. stmyhc1 and mrlc2), others are more specific to either phenotype (eg. stmyhc5 in striated muscle; myophilin-like-2 in smooth muscle) (Fig.8A), which were verified by in situ hybridization (Figs.5,6,7)". This needs to be rewritten and improved for flow and clarity purposes.

      Answer: Figure 5,6 and 7 were re-assembled in a different structure according to reviewers suggestion. Specifically, we now present the muscle anatomy together first, followed by molecular validations from the atlas data. Marker genes used for in situ hybridization (ish) were introduced as suggested. Text was re-written according to changes in figures. In general, figures and text were simplified to gain more clarity on the muscle chapter.

      • Suggest that the authors show an overall UMAP of smooth and striated muscle (perhaps the smooth muscle subtypes are part of the large 'outer epidermis' cluster; see the comment for Figure 5B above), and then include featureplots that show the expression of each of the genes used in ISH in these clusters. This might make it clearer as to what type of muscle the genes should be highlighting within each developmental stage. It might look something similar to what is shown in Figure 7P (although it is unclear how the featureplots shown in this figure relate to the UMAP shown in Figure 5B). In addition, the featureplots in Figure 7P only show 3 out of the 4 genes used in ISH which is not helpful. Featureplots should be clearly shown for all genes discussed. This is essential to linking the pattern in the single-cell data to the expression data and is the minimum required to provide clear understanding.

      Answer: We took this suggestion under consideration when re-compiling the figures. Now the feature plots and the insitu’s are found in the same figure (Figure 6).

      • The text reads, "To investigate the dynamics of muscle formation, we further compared phalloidin staining of muscle fields with in situ hybridization detection of specific cluster marker expression in polyps (Fig. 5), strobila (Fig. 6), and ephyra (Fig.7)." However, Figure 6 also contains images of ephyra (Fig6. P-S). Suggest that those panels could be included in Figure 7.

      Answer: This text no longer appears in the manuscript. The relevant section now reads as follows (p15:17):

      “We assessed the anatomic location of the muscle fields by phalloidin staining in Aurelia polyps, strobilae and ephyrae (Fig.5). Polyps have three distinct smooth muscle fields (Fig. 5A,B-G): the radial muscles of the oral disc (Fig. 5D), the longitudinal tentacle muscles (Fig. 5E), and the longitudinal retractor muscles that run along the body column (Fig. 5F,G (35)). During strobilation, fragments of the polyp retractor muscles are retained in the early ephyra (Fig. 5J (35)). Striated muscles appear coronally around the oral disc, oriented radially along the lappets of early detached ephyra (Fig. 5L-N). At the tips of the lappets, the border of the coronal muscle, and at the base of the manubrium, fibres show a mixed organization of smooth and striated myofibrils (Fig. 5O,P). These findings corroborate previous studies that used light- (26) or electron microscopy (24,25).

      We next compared expression patterns expected from our single cell data with the phalloidin-based anatomy of smooth and striated muscles. As expected, several genes were shared between the smooth and striated muscle cluster (Fig.6E), while others were highly specific to either smooth (Fig.6C,D) or striated muscle cluster (Fig.6P; Data S1.11). Different calponin paralogs show distinct expression in the different muscle types (Fig. 7A). For example, calponin1 is specific to the smooth retractor muscle of the polyp and no other subpopulation of the smooth muscle type (Fig. 6A-C). At the strobila stage, expression of calponin1 is still visible in fragmented retractor muscles, consistent with the single cell expression profile (Fig. 6F). By comparison, mrlc2 expression marks the locations of all smooth muscle populations in polyps including tentacle muscles, radial muscles of oral disc and retractor muscles of the body column (Fig. 6D,E).”

      • There are parts of this section text where reference to the Figures is complicated and not easy for the reader to follow. I got particularly confused in trying to follow this part of the manuscript. For example, a sentence on p15 reads, "mrlc2 and stmyhc1 reads are detected in both muscle types (Fig. 7pFig. 5M, Fig 6C,E,G-P, Fig. 7J-L,N-P), and ISH indicates that the expression is localised to the fields of striated muscles in ephyrae (Fig.7J,K,N), as well as the smooth muscle populations in polyps including longitudinal tentacle muscles, radial muscles of oral disc and retractor muscles of the body column (Fig. 5M, Fig.6H,I,L,M), and the muscles of the manubrium in the meta-ephyra (Fig. 7L,O)." It is quite difficult to keep jumping between Figures and panels to look at this. A better organization of the Figures and much clearer text that doesn't jump around could go a long way to making it easier to follow.

      Answer: __ We thank reviewer 1 for the suggested changes. We feel that recombining the results from previous versions of the figures helped to improve the clarity in this section. Single cell data was updated to include an UMAP of the muscle subset and gene expression plots highlighting the differential expression in either smooth- striated or both muscle types corresponding to the in situ hybridization (ish) gene expression profile. The figure (__Fig. 6) is now arranged in a way that allows the reader to easily follow the results for the spatial validation of both muscle types since ish for all life stages is shown in one panel together with the muscle subset UMAP and gene expression plots. Additionally, the two muscle clusters are now labelled also in (Fig. 2A) to provide a better understanding for the reader where muscle clusters are located in the UMAP of the full object.

      The text reads now: (Fig. 6, figure caption): (Q) feature plots of all marker genes on the muscle specific subset (R) reference UMAP of whole dataset (left) subset (right) (S) Distribution plot of muscle types across the different Aurelia life stages (left) and medusa tissues (right).

      Discussion -The authors do try to put their results into context with the two Aurelia genome papers (Gold et al. 2018, and Khalturin et al. 2019) and two additional bulk transcriptome studies (Fuchs et al. 2014, Brekhman et al. 2015), but not until the first part of the Discussion. In principle, this would be fine. However, in practice, their discussion of these studies is somewhat vague and generalized and did not really provide a clear review or analysis of how adding in cell-type specific data is helping our understanding. The argument about how their results fit with previous findings was confusing and unclear. They start by discussing "genome usage" but then switch to talking about cell type diversity across life stages. The connections between "genome usage", "gene representation", and cell types was not easy to follow. Suggest rewriting this section to clearly discuss the findings in this manuscript in the context of previous studies with straightforward and precise language.

      -In the discussion about the neural subtypes, comparisons are only made to Nematostella where there are also two major neural classes. It would be even better to include discussion of single-cell data related to neurons in other cnidarians, such as Hydra, where there is detailed discussion of neuron subtypes in both a published manuscript (Siebert et al. 2019, Science) and a preprint (Primack et al. 2023, biorxiv) and Clytia (Chari et al. 2021, Science Advances). I do see that Clytia and Podocoryna are mentioned in the next section of the Discussion, specifically related to the Otx gene.

      Answer: We thank the reviewer for this oversight. We have incorporated comparative observations from the published Hydra dataset in this regard.

      Pg 21 “ This contrasts with the distribution of n1 and n2 class neurons in the freshwater hydozoan polyp Hydra vulgaris, of which only three of the fifteen sub-types are of the ins-positive n1 type (“ec2”, “en2”, and “en3”: Fig. S8D; (58)). Similarly in the Clytia medusa only one of the three neuron groups (neuron cells “A” (16) have INSM reads and thus could be considered type 1 neurons as defined here.”

      -The section about muscle subtypes in the Discussion would need to be rewritten in accordance to changes suggested above for the Results for this section.

      Answer: Discussion was rewritten according to the changes made in the results section like suggested by reviewer1.

      Materials and Methods -In the section "Comparison with Nematostella" the authors discuss running OMA to generate the set of identified 1:1 orthologs but never go on to mention how many orthologs were identified. Please report this number so it is clear whether this is a small or large subset of the total analyzed. In a recent study of the Hydra AEP strain (Cazet et al. 2023 Genome Research), a similar analysis was done between Hydra and Clytia and they found 5979 genes with 1:1 orthologs between the two species. There should also be a supplemental datasheet that provides a list of these orthologs (See Supplemental Data S17 provided in Cazet et al. 2023 as an example). I am curious to know how many 1:1 orthologs were found between Aurelia and Nematostella. I would expect there to be a smaller overall number than between Hydra and Clytia due to the larger phylogenetic distance between these two taxa. I also strongly suggest that the Cazet et al. 2023 paper should be referenced, as it was the first time an attempt to compare single-cell datasets between two cnidarian species was done. The current manuscript took an alternative approach to comparing Aurelia to Nematostella, so it would be good to acknowledge this and justify the methods used in this manuscript compared to those used in Cazet et al. 2023.

      Answer: We recognize our oversight in not properly referencing the previous study comparing two cnidarian species and have integrated this reference now, and include the requested information regarding our OMA analysis as follows:.

      In total 4311 1:1 gene orthologs between the two species were identified (Data S2.). A similar comparison using OrthoFinder (90) between Hydra and Clytia, both members of the Hydrozoa clade, found 5979 1:1 orthologs (66). OMA was preferred in this study over other available orthology databases because it outputs a high-confidence predicted 1:1 gene orthology list that can be used directly to combine multi-species data.

      -There are missing descriptions of methods throughout the paper. One example is in the section about Transcription Factor families that are over or underrepresented amongst upregulated genes compared to their distribution in the genome - I could not find any description of the methods used to identify these Transcription Factor families in the dataset of Aurelia upregulated genes. How were these families chosen? How were they identified in this dataset?

      Answer: Transcription factors were identified and classified using the Animal Transcription Factor Database version 4. (https://guolab.wchscu.cn/AnimalTFDB4/#/). This information has been added to the manuscript methods.

      -I noticed in the Data and materials availability statement and a few other places in the manuscript, a github repository was mentioned: https://github.com/technau/AureliaAtlas. I tried to access this repository to review what was included, but unfortunately it is not accessible. I found seven repositories within github.com/technau but the AureliaAtlas was not one of them. This repository should include all scripts to generate all figures and other analyses in the paper and should be made available to reviewers to better understand exactly how all analyses were completed. A good example of how this could be done is found in the repository related to Cazet et al. 2023 (https://github.com/cejuliano/brown_hydra_genomes), which is very comprehensive and easy to follow. -When I looked through a similar repository https://github.com/technau/CellReports2022/ from the Steger et al. 2022 Cell Reports Nematostella single-cell paper from this same group, I find it to be rather disappointing. They apparently included all code to generate all figures in a single R file that is not easy to follow and not well commented. If this is the same strategy used for this manuscript, I feel that a much stronger effort could be made to make the analyses of this Aurelia manuscript transparent by producing a github that is more like that of https://github.com/cejuliano/brown_hydra_genomes from the Cazet et al. 2023 paper which organizes each type of analysis in a different github subfolder and within each subfolder they include very detailed information and comments explaining each step of each analysis. Doing this would go a long way to making the analyses in this manuscript more transparent and easier to follow and would certainly put some of my concerns to rest.

      __Answer: __We thank the reviewer for pointing this out. We have ensured that the github page is publicly accessible. We have provided all of the necessary R scripts to generate the analysis and figures. The structure is improved over the Steger paper; separate scripts are provided for each step, including importing and processing the raw data for the Seurat workflow, data processing to assess the life cycle and first clustering, analyses of each subset, and finally calling results from the previous scripts to generate all figures contained in the manuscript.

      Minor comments:

      Figures: Figure 2A: In the legend it says "Colour code as in (B) and (C)" but it's really referencing the colors in Figure 1A, correct? It is confusing to have to look back to Figure 1A to understand the colors here.

      __Answer: __The original figures 1 and 2 have been modified and combined into a single figure in this version.

      Figure 2D: Typo in the word "proteins" in the title of this panel.

      __Answer: __This word no longer appears in the revised figures.

      Figure 3F: The placement of the tree and the two featureplots for myc3 in Nematostella and Aurelia is confusing. Suggest moving the featureplot for Aurelia myc3 so that it is beside Nematostella (to the right of the tree) or move the featureplot for Nematostella myc3 so that it is beside the Aurelia featureplot (to the left of the tree).

      __Answer: __We thank the reviewer for this suggestion and have edited this figure accordingly by moving the myc3 expression plots alongside all of the others.

      Figure 4B: The description of this panel reads, "Distribution-histogram across all samples, medusa-specific cell clusters are highlighted with black outline.", however as a reader, the black outline is not very clear. Suggest making it bolder. In addition, this black outline is a little confusing - it should mark the medusa-specific cell clusters; however, the black outline appears in cell clusters in strobila and ephyra?

      __Answer: __ The black outline is now increased in width for clarity. Medusa-specific cell types are defined by their absence from the polyp samples because already in the strobila stage medusa-specific tissues are being generated and thus these transcriptomic profiles begin to appear. We added a clause in the figure legend to clarify this, as well as within the main text when medusa-specific cell states are first defined.

      Pg.8: “ In total we find 12 cell type states that are not represented (<br /> Figure 5B: It is unclear from where this reference UMAP was derived. Does it come from the overall UMAP, showing the 'outer epidermis' cluster only, with the putative smooth muscle cells in red? Or is it the 'outer epidermis' cluster plus the striated muscle cluster? Suggest making this clearer (see below for larger edits to this section of the manuscript).

      Answer: This has been addressed. Figure 6R now includes both the full dataset inset, as well as the muscle-only subset and is consistent with the rest of the manuscript in this regard.

      Figure 5K/L/M: It is unclear which parts of the polyp in K is used for the images shown in L or M. Both come from the large red box, but it is unclear from which part L and M were made. In addition, the subtraction of the background from the image (to make it look white) is distracting and makes the image itself look artificial.

      Answer: New brightfield images were included to give a better understanding of the region of interest. The images in which the background was subtracted were replaced with the original pictures and contrast was enhanced to brighten the background.

      Figure 6C, G-S: - Not sure what the blue boxes around these panels are meant to highlight? - Also not sure what the image in the left of panel C is. Perhaps an oral view of the strobila? The legend or panel itself should mention this. - Again, subtraction of the background from the image (to make it look white) in panels C, D and E is distracting and makes the image itself look artificial.

      Answer: The figure was redone and the boxes are not present anymore.

      Figure 6J, M, N, O: - For someone not accustomed to looking at images of strobilating polyps, it is unclear what part and what orientation these images are taken of. Suggest including some of these details in the figure legend at least. Fig 6O actually looks like an ephyra, but is annotated as an "advanced strobila"?

      Answer: Figure was re-done (fig.6) with appropriate schematics next to the images.

      Figure 7H: - Not sure what the white lines in this panel are meant to indicate?

      __Answer: __The white lines were removed.

      Results: p5 - In this sentence, "Because these four pouches look like a cloverleaf from above, we call this stage the "clover-polyp", suggest changing "clover-polyp" to match the Figure 1A (where it is written as polyp.clover), or change the text in the Figure to match the text in the manuscript.

      __Answer: __ We made sure to match this in the revised figure.

      p8 - In this sentence, "the bZIP protein family are over-represented as terminal cell type markers, while the number of zinc-finger proteins of the N2C2 class are under-represented", the "N2C2" class the authors refer to is not clear. Is there a typo here? In the figure to which this sentence refers (Figure 2D), the proteins referenced are "zf-H2C2" or "zf-C2H2".

      __Answer: __This no longer appears in the current manuscript.

      p9 - Typo - should be "medusozoans" rather than "medusazoans".

      __Answer: __This has been corrected.

      p11+ - Section titled, "Aurelia neural complement reveals two neural classes with similarities to anthozoan neurons" - I found the classification of N1 and N2 to be confusing, since initially they are described as neural clusters, however N1 in particular is shown to consist of primarily secretory, non-neural cell types. For example, when looking at Figure 4A and B, it is evident that N1 contains only a relatively small number of neural cell-types (in shades of orange), while most of the cells are other secretory, but non-neural cell types (in shades of brown). Not sure if the authors should alter the title to reflect this? For example, instead of 'neural' classes, they could be called 'neuro-secretory' or 'mixed neural and secretory classes'?

      __Answer: __We appreciate the confusion and have adjusted the heading accordingly. However we choose to maintain the designation as N1 and N2 class to reflect the distinction between insulinoma-positive and pou4-positive major Cnidarian neuroglandular sub-types present as defined in our earlier Nematostella work (Steger et al., 2018). We also include a comment in the discussion regarding the support for this distinction in other published Cnidarian dataset as follows.

      ”This contrasts with the distribution of n1 and n2 class neurons in the freshwater hydozoan polyp Hydra vulgaris, of which only three of the fifteen sub-types are of the ins-positive n1 type (“ec2”, “en2”, and “en3”: Fig. S8D;(58)).”

      p11 - Text reads, "Class 1 neurons in the medusa are also most prevalent within the gastrodermis and manubrium, and includes one subtype that first appears in the strobila and is found in all medusa tissue samples ("n1.3.medusa"; lower black box Fig. 4F).", however there is no "lower black box" in Figure 4F apparent.

      __Answer: __Re-evaluation of the detectable cell states after updating the mapping tool, which addresses issues associated with an overabundance of isoforms, results in the dissolution of this putative medusa-specific cell state. This profile is also found within the polyp and so the second half of this sentence has been removed.

      p13 - The text reads, "We find that class 2 neurons all express elevated levels of specific alpha- and beta- tubulins (TBA1-like3 and TBB-like-1; Fig. 4D).". Make the capitalization of your gene names (TBA1-like3, etc) consistent between text and figure throughout (in Fig. 4D the gene names are lower case).

      __Answer: __We have taken care to be consistent throughout the manuscript.

      p14 - In the first paragraph of this page, Fig. 4C is referenced twice, however both times the referencing sentence does not match this panel (most likely the authors meant to reference 4E, F or G).

      __Answer: __This has been corrected.

      p14 - The final sentence of this upper paragraph, "Specific tubulin-paralog expression within the class n2 neurons suggest that this is the portion of the nervous system labelled by the β-Tubulin antibody." is confusing. Do you mean that the b-tubulin antibody is most likely labelling the product of the tbb-like-1 gene that is shown in the featureplot in Fig 4D? Suggest rewriting this sentence for clarity.

      __Answer: __This sentence has been re-written as follows: “Specific tubulin-paralog expression within the class n2 neurons suggests that these two genes are translated into proteins recognised by this commercial β-Tubulin antibody. Furthermore, this antibody labelling suggests that the MNN is composed of N2 class neurons.” pg 14

      p14 - on this page and others in the manuscript, there are instances of the word "Aurelia" not being italicized.

      __Answer: __This has been corrected.

      p14 - In this sentence, "In the sea anemone Nematostella, anemone-specific gene duplications of members of the PaTH (Paraxis, Twist Hand-related) bHLH family of protein coding genes was driving the diversification of muscle cell types (29)." the "was driving" part of the sentence is grammatically clunky. Suggest rewording slightly. (e.g. "...protein coding genes drive the diversification of muscle cell type").

      __Answer: __We changed this to ‘drove’.

      -Myophilin-like2 in the text of the manuscript is written as myofilin-like2 in the figure panels (e.g. Fig 5L, Fig. 6D). Make consistent between text and figures.

      Answer: We changed all references to myophilin to calponin, which is the better known name of the vertebrate ortholog.

      p15 - on this page and several instances thereafter, "in situ" is not italicized as it should be.

      __Answer: __This has been corrected

      p19 - In the line, "Taken all together these data suggest that the contractile apparatus in the Scyphozoa, using here Aurelia as a proxy, is similar to the bilaterian smooth muscle contractile complex (Fig. 8C)." this should really reference Fig. 8 B-C

      __Answer: __This has been corrected according to the newest figure.

      Reviewer #1 (Significance (Required)):

      General assessment:

      I believe this manuscript adds a significant amount of useful data and provides some novel insights into scyphozoan cell types across an important life history transition from polyp to medusa in Aurelia. Adding the dataset to the USCS Cell Browser is a strength. I think there is the potential to make this an impactful paper but in its current form, it is pretty messy, and not clearly presented, and lacks some transparency. The greatest weaknesses lie in not framing the work adequately or putting it into enough context with previous work and also not relating it to other medusozoans; in the Figures which are overly crowded, and confusing rather than being clear and supporting the results; and in the lack of explanation for some methods like how cell clusters were annotated, how transcription factor families were determined; and the lack of access to the github data repository, which raises questions of reproducibility. It will take a good amount of restructuring figures and reframing to make the study clear and impactful and the methods and analyses reproducible.

      Advance: If the weaknesses are addressed adequately, this study does contribute new insights in the area of further understanding changes across an important scyphozoan life cycle transition in terms of diversity of cell types and their cell-type transcriptomes, opening up further questions which can now be addressed.

      Audience: The broader cnidarian community will be interested in this study. People studying cell type evolution and cell type novelty across the tree of life will also be interested. Anyone looking for examples of how to use modern approaches to understanding life cycle changes in animals will be interested.

      My expertise is in cnidarian cellular and molecular biology and evolution including working with model cnidarian research organisms and employing techniques and approaches similar to those used in this study.

      We thank this reviewer for their detailed comments and suggestions, and feel the manuscript is much improved in its current form. We hope that we have satisfied all concerns raised here.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      __This paper is well-written and serves as a valuable resource not only for the cnidarian community but also for researchers studying more broadly cell type identity and evolution. A key cell type enabling the transition from polyp to free-swimming medusa is the cnidarian striated muscle, which has only been morphologically identified in medusozoan jellyfish. While this study does not include functional analyses, it lays the foundation for the Aurelia research community to leverage single-cell atlas data for future investigations.

      Key experiments supporting the paper's main conclusions are missing :

      •At the beginning of the Results section, the authors mention identifying a previously undescribed developmental stage, which they name "clover-polyp" However, they do not later discuss whether this newly identified stage has a distinct gene expression signature. This point should be addressed in the paper or removed.

      __Answer: __We do not find any specific transcriptomic signature specific to this stage. We keep this designation as a morphological indicator of a strobilation-competent polyp, but have re-worded our introduction of this term as follows:

      “The first external sign of strobilation is the expansion of the body column into four pouches that are filled with multiple folds of inner cell layer epithelia (Fig. 1A), and resembles a cloverleaf from above; we call this stage the “clover-polyp”.”

      •A key reference is missing in the following sentences :

      "The anthozoan Nematostella vectensis has two principal neural sub-families that have been described that correspond to those with insulinoma expression (n1) and those with pou4 expression (n2) (13,14)."

      "The class n1 family also includes putatively non-neural secretory cell types ("s"), which are enriched in genes associated with digestion and extracellular matrix production (Data S1.10). These data suggest a close relationship between neurons and gland cells, like what has been suggested in other cnidarians (13,27)."

      "Thus, similar to that described for the anthozoan Nematostella vectensis (13,14), Class 1 neurons and related secretory cells comprise the predominant type of neuroglandular cells in the polyp stage. Further, these are the primary neuroglandular cells within the gastrodermis of the medusa."

      The first functional analysis of NvInsm1+ expressing neurons and secretory cells in Nematostella vectensis was conducted in this study (Tournière, O. et al., 2022), making it essential to cite this work.

      __Answer: __We appreciate the reviewer for drawing this oversight to our attention. This has been corrected in the revised manuscript.

      • To validate the neuronal component of this single-cell data, it is essential to confirm the N1 and N2 populations and demonstrate that they do not overlap. I recommend performing in situ hybridization or antibody staining for Insm1+ and Pou4+ cells (or any other suitable markers for these populations) to show that they are expressed in distinct cells/region in Aurelia.

      __Answer: __We appreciate the reviewers comment, however, there are unfortunately no specific antibodies available for Insm1 or Pou4, or any other n1/n2 specific neuronal marker protein. Moreover, we find in situ hybridization in this system to be very challenging except for highly expressed structural genes. Neurons are particularly difficult, because they are very small cells embedded between many other cell types. We attempted to validate distribution of different neuron populations with colorimetric in situ hybridization, FISH as well as HCR (hybridization chain reaction). However, we were not successful in labelling individual neuron bodies and visualising their cytoplasmic RNA content to distinguish individual cells and therefore individual neuron types. Regardless, to validate at least neuronal cell types, we were able to correlate pan-neuronal tbb-like expression with b-Tubulin antibody staining and of RFamide antibody staining with specific neuronal subpopulations.

      •What is labelled in yellow in Figure 5C? The legend should be updated.

      Answer: Figure 5C does not exist in the current version of the manuscript.

      •Figure 5i, j, and k, are not clear, the paper would benefit with bright field pictures.

      __Answer: __Images were replaced and some bright field photos are incorporated into both new figures.

      •Each figure should connect specific gene expression at a given stage with the corresponding single-cell expression data in a dot plot. For instance, in Figure 6, myofillin-like 2, mhc1, and mhc2 should be accompanied by their respective single-cell expression data at this stage in a dot plot.

      Answer: done!

      • The authors repeatedly refer to the polyp as asexual and the medusa as sexual; however, they do not mention any gonadal cluster nor discuss its absence from their single-cell data.

      __Answer: __We have added the following sentence to the current manuscript to account for this: “Despite its larger size, this animal was still reproductively immature and so no gonadal tissues were collected.”

      •The authors include EdU experiments in Figure S2 but discuss them only briefly in the text. If these experiments provide new insights, they should be elaborated on; otherwise, they could be removed from the manuscript.

      __Answer: __We have removed these data from the manuscript.

      • As this paper is primarily a resource for the cnidarian community, ensuring easy access is crucial for enabling species comparisons. I recommend making the data openly available through a single-cell portal, as done in Juliano et al. (2019).

      __Answer: __We have already released these data on the UCSC cellbrowser platform, as was stated in the manuscript. These data have been updated to reflect the current status of the analyses and is publicly available at www.jellyfish-atlas.cells.ucsc.edu

      Reviewer #2 (Significance (Required)): This well-written paper is a valuable resource for the cnidarian community. A key cell type driving the transition from polyp to free-swimming medusa is the cnidarian striated muscle, which has only been morphologically identified in medusozoan jellyfish. While the study lacks functional analyses, further biological validations, such as in situ hybridizations, are needed to confirm the single-cell data. Nevertheless, it lays a strong foundation for the Aurelia research community to utilize single-cell atlas data in future studies. To maximize its impact, the authors should ensure the data is easily accessible to the broader scientific community.

      We thank this reviewer for their recognition of the importance of this work. We have ensured that the data are available for download through the UCSC cell browser, and all scripts used in the data analysis are available on our github page. We additionally included our new gene models that are associated with the single cell data on the companion UCSC genome browser website, which now hosts the NCBI genome assembly with our gene models.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Link and collaborators presents a well-executed and thorough analysis (statistically significant) of cell types and developmental trajectories in Aurelia coerulea, a cnidarian with a medusa stage. While previous cnidarian cell atlases have focused on embryo-to-polyp development, this study uniquely incorporates adult medusa-stage cells, providing novel insights into cnidarian biology.

      The authors successfully identify a broad range of cell types and precursors in both polyp and medusa stages. By comparing transcriptional profiles, they demonstrate the presence of new cell types, such as neurons, in the medusa. Notably, they provide compelling evidence for the coexistence of both striated and smooth muscle within cnidarians-a topic they have explored in previous work. Their morphological analysis further suggests that striated and smooth muscle forms can exist within single cells, which is particularly intriguing. Overall, the results are convincing.

      A major strength of this study is the extensive number of cells analyzed and the rigorous classification of cell identities based on transcriptional profiles. Unlike many single-cell studies, the authors complement their findings with morphological, immunochemical, and in situ data, strengthening their conclusions. Conducting such an analysis without a fully annotated genome presents a significant challenge, yet the authors navigate this limitation effectively.

      One relative limitation, common to many single-cell studies, is the lack of detailed spatial information on the identified subtypes. While the authors have made efforts in this direction, a higher-resolution atlas that pinpoints these subtypes within the body would enhance the impact of the study. The absence of transgenic tools with cell-type-specific enhancers makes this difficult, but it remains a valuable avenue for future research. Despite this, the study's novelty and quality-particularly its inclusion of medusa-stage data-make it a strong candidate for publication in any journal associated with Review Commons.

      Minor Comments: • The term "terminal cell type markers" may not be the most appropriate for transcription factors that regulate state or specification. A more precise term, such as "state or specification transcriptional regulators," might be preferable.

      __Answer: __This term does not appear in the revised manuscript.

      • The suggestion that cell-type specification is not governed by a random collection of TFs seems self-evident. If not TFs, what alternative regulatory mechanisms (e.g., post-transcriptional regulation, small RNAs) are being implied?

      __Answer: __In the revised manuscript we have removed focus on the TFs.

      • The rationale behind the observation that "'early' cells separate along three principal trajectories (cnido.1, cnido.2, and cnido.3m), then converge upon a second mature transcriptomic phenotype" could be more clearly explained.

      __Answer: __This is a phenomenon that is now well established for cnidarians from the perspective of single cell transcriptomics (Chari et al, 2021: Clytia; Steger et al, 2022, Cole et al 2024, Plessier and Marlow 2026: Nematostella; Cazet et al 2023: Hydra). This phenomena is also described here in terms of the sequence of transcription factors that are activated sequentially in both Aurelia and Nematostella. We have modified the introductory text to better place these observations in context as follows:

      Recently we reported that within the sea anemone Nematostella vectensis, specification of the distinct cnidocyte types is marked by a diverging transcriptomic profile corresponding to the formation of the different capsule types, which then undergo a molecular switch demarcated by up-regulation of GFI1B and converge upon a secondary neural-like expression profile (11). Notably, we find a similar forked trajectory within the cnidocyte population of Aurelia. (Fig. 3A). A cluster of SoxC expressing ‘early’ cells separate along two principal trajectories (cnido.1, cnido.2), which then converge upon a second mature transcriptomic phenotype upon activation of jun/fos (Fig. 3E).

      • The illustrations of the nervous system in the ephyra and rhopalia are intriguing but lack spatial context for different neuronal populations beyond the positioning of class 2 neurons ("alpha- and beta-tubulin cells").

      Answer: We added a better introduction to gain more understanding of the different neuron populations in contrast to various findings of related publications. The text now reads:

      This rhopalia nervous system develops during polyp-medusa metamorphosis and is composed of specialized light- (pigment cup) and gravity- sensing (lithocyte/statocyst) cells, segregated into individual compartments with different developmental origins (12). Rhopalia development involves the gene expression of otx1, pit1 and brn3 in the pigment-cup (10),.... p4/5

      Further, we used findings from previous studies to add a more elaborate description to our results and we finally discuss it, for example:

      The ins-negative populations in both species express pou4 orthologs, also called brn3 (10), that is expressed also within the cnidocyte lineages and thus further supports claims of a close relationship between cnidocytes and insulinoma-negative/pou4-positive n2 neurons (13,14,52). p22

      • Muscle characterization is well-supported by phalloidin staining and gene markers, but is there a specific marker for smooth muscle? Myophilin-like-2 is mentioned, but is it definitive?

      Answer: Yes, there are many, as tabulated in supplemental Data S1.11. For example myophilin-like-2 [calponin] is a specific marker for smooth muscle cells and this is demonstrated via in situ hybridization in fig.6.

      • The finding that ~40% of genes distinguishing smooth and striated muscle lack homologs in other animals is striking. It may be worth investigating their expression patterns via in situ hybridization, particularly for those that differentiate muscle types. The fact that these genes are of unknown affinity does not mean they are uninformative.

      __Answer: __There are a variety of reasons that lead to a lack of orthology information amongst the gene models, including fragmented gene models, inclusion of unidentified lncRNAs, amongst others. However, due to this ambiguity and the lack of identification of these rationals we have removed this observation from the current manuscript. In fact, with the updated mapping tool and current gene annotations this number has fallen to only ~28% of the identified muscle-specific gene models, from a total ~38.7% unannotated gene models in the entire transcriptome. This is similar to other cells types in the dataset (between ~20%-35%), and also similar to the number of unannotated genes in the sea anemone Nematostella vectensis (36.5% overall)

      • The incompleteness of Aurelia genomes is acknowledged as a limitation. However, since the San Diego strain genome appears to be the most complete, is there a reason it was not used in this study? Was it not possible to recover the same strain?

      __Answer: __We have a standing culture in the lab that was used for these collections. While we considered generating a genomic assembly for this laboratory strain, we have concluded that this is not an effective use of resources at this time. We have now updated the reference for mapping however, from a re-analysis of the available Aurelia coerulea isolate AC-2021 genome (NCBI: GCA_039566865.1) annotated with the Gnomon 9.0 automated annotation pipeline, and supplemented with our in-house transcriptome to recover ~5000 additional gene model coordinates on the genome. These are available now via the UCSC genome browser website.

      We further thank this reviewer for the overall positive assessment of our work, and hope that the revised version further strengthens the data analysis and contribution to the community as a whole.

      __ **Referees cross-commenting**__

      Referees, I generally agree with their assessments. Below, I outline my main concerns and suggestions for improvement.

      Figures and Data Presentation

      I concur with Referee 1 that the figures are overcrowded, making it difficult to interpret individual panels. The excessive number of panels within a single figure creates unnecessary complexity. Some of these could be moved to the supplementary materials to improve readability. It seems that the authors aim to present every possible data analysis, but this is not necessary within the main text. As Referee 1 also noted, the key findings should be clearly visible, allowing the reader to follow the story without getting lost in excessive detail.

      __Answer: __We have re-structured most of the figures with this in mind and hope that we have achieved better clarity. Many of the data analyses in the previous versions have been removed if not directly related to the observations highlighted in the current version.

      Additionally, the annotation of clusters remains unclear, a concern also raised by other referees. The manuscript would benefit from a more explicit description of how these clusters were assigned.

      __Answer: __We have expanded our description of how we assigned identities to the nine principal cell type families as follows:

      (pg. 8) The inner epithelia, or gastrodermis, expresses several collagens that is a characteristic of the inner cell layer of anthozoans (39); the outer cell layer houses the ring musculature and is rich in contractile proteins. The striated muscle cluster is also rich in contractile protein and is the only principal cell population absent from the polyp-derived samples (Fig. 2C). The mucin gland expresses mucins, whereas the digestive gland expresses other digestive enzymes, whereas the neural cluster expresses synapsin and other conserved known neural regulators such as ashA. The cnidocytes express mini-collagens and are enriched in pathways targeting the endoplasmic reticulum (40).

      Writing and Discussion

      While I do not have major concerns with the writing, I suggest expanding the discussion, particularly regarding the relationship between muscle cell types and the diversification of paralogs. If the figures are streamlined, the text can also be made more concise, avoiding exhaustive references to every individual data point.

      Clarifications on the Muscle Section

      Several aspects of the muscle analysis require clarification: • The differences between muscle cell types are based on a set of differentially expressed genes, 40% of which (in each set) are of unknown affinities. However, it is surprising that the regulatory genes shared between both muscle profiles are expressed in bilaterian smooth muscles. The manuscript does not address whether bilaterian striated muscles share regulatory genes with the Aurelia striated muscle set. This comparison would be valuable.

      Answer: __With the latest mapping tool the percentage of muscle-specific genes of unknown affinities has dropped to ~28% and we no longer highlight this observation in the manuscript. Regarding the regulatory genes shared with smooth muscles of bilaterians, we feel this may be a misunderstanding. In Fig. 7 we clarify that these are __structural proteins regulating the contraction of the muscle (e.g. Myosin light chain kinase and calponin). With respect to the developmental regulators, e.g. muscle cell type determining transcription factors, we list several in Data S1.3b, S1.4b. A broader phylogenetic and also functional analysis of these transcription factors in different jellyfish species is the focus of another collaborative study and therefore we do not include an in depth discussion of this topic in the current manuscript.__ __

      • The high proportion of unknown genes is concerning. Is this due to issues with the transcriptome assembly, or is it a consequence of insufficient comparative analyses? The statement that "Mapping to this final transcriptome increased confidently mapped genes to 60%" raises questions-does this mean that 40% of differentially expressed genes remain unmapped? This point should be clarified.

      __Answer: __With the latest mapping tool, we now recover a confident alignment for ~80% of the sequences (See supplementary data S2.1). With the previous tool this value was only 60%, which means that 40% of the sequence data could not be used at all to generate the expression matrix. This is a different feature of the data analysis than the identity of the gene models. However, the statement mentioned here no longer appears in the current version of the manuscript.

      • Given the large number of differentially expressed genes with unknown function, could the authors perform in situ hybridization assays on a subset of these genes? This could provide insights into their spatial expression patterns and potential functional relevance.

      Answer: This is an intriguing suggestion, however, given that in situ hybridization for medium and low expressed genes are extremely difficult in this organism, we feel that this is beyond the scope of this study.

      • Both muscle types appear to rely on a similar contractile apparatus but exhibit differential usage of paralogs. This finding is intriguing but is not sufficiently discussed. Are other cell types associated with the differential use of paralogs? Expanding this discussion would add depth to the manuscript.

      Answer: We thank the reviewer for this insightful comment. Indeed, there is circumstantial evidence that differential usage of paralogs is also found among other cell types, e.g. neurons. We indeed discuss the example of a few other genes, e.g. ATOH-like transcription factors and myc. However, the diversity of neuronal populations is very large, which makes the picture quite complex. We are currently working on a phylogenetic framework of cell type families and also between species to address this point, but this requires more theoretical and methodological work. In this paper, we therefore restricted the analyses to the structural proteins of the two types of muscles, which facilitates the assignment of paralogs to either muscle. We point out that this is reminiscent of the differential expression of paralogs in the fast and slow contracting muscle cell types in Nematostella, suggesting that such a subfunctionalization may generally drive also the physiological diversification of muscle cell types in cnidarians (and of animals in general). Future work is aiming to address this on a broader scale, as suggested by the reviewer.

      Neuronal Subtypes

      I reiterate my previous comment regarding neuronal types: • The enrichment of neural subtypes in the medusa stage is an interesting, albeit expected, finding. However, the manuscript lacks details regarding their specific spatial distribution within the body. Providing this information would enhance the biological relevance of the findings.

      Answer: in situ hybridization for neurons is a challenge in all cnidarians, because the small neurons with very thin neurites are embedded and intermingled between many other cell types. In Aurelia, this has proven to be particularly difficult. At the very best, one might see small cell bodies stained, however, it fails to visualize neurites. We also tried HCR (hybridization chain reaction) in combination with antibody staining (b-Tubulin) to get to single cell resolution. However, the results were not conclusive and we therefore refrain from showing them in the paper. As an alternative we connected the findings of previous studies (Nakanishi et al., 2009, 2010) in terms of certain types of neurons located in different compartments of the rhopalia and corresponding marker genes with our single cell data (introduction/discussion). We acknowledge that more work needs to be done, best by generating specific antibodies against neuronal antigens. However, this is beyond the scope of this paper.

      References

      I also agree with Referee 2 that some statements require further substantiation with appropriate references. Strengthening these points with supporting literature would improve the rigor of the manuscript.

      Answer: We added appropriate references at all places indicated, as detailed above.

      Final Remarks

      Overall, while the study presents interesting findings, the manuscript would benefit from a clearer organization of figures, a more explicit explanation of muscle and neural subtype findings, and a deeper discussion on the significance of unknown genes and paralog usage. Addressing these concerns will enhance the clarity and impact of the paper.

      Reviewer #3 (Significance (Required)):

      Overall, this is a significant and well-supported study that advances our understanding of cnidarian cell diversity and muscle evolution. By examining how cell types change across the polyp and medusa stages, this study provides valuable insights not only into cnidarian development but also into broader evolutionary questions regarding the emergence of new body plans and tissue types. As a developmental biologist specializing in invertebrates, I find the results of this work particularly remarkable. It provides valuable insights into the developmental processes occurring in pre-bilaterian animals, shedding light on how cell types emerge and diversify in early-diverging metazoans

      Answer: We thank reviewer 3 for this positive evaluation.

      __Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      __Link et al. have studied cell type diversity in the scyphozoan Aurelia coerulea. More specifically, they compared several stages in the animal's life cycle using single-cell RNA-seq. Many members of the cnidarian clade Medusuzoa (scyphozoans included) have a metagenetic lifecycle that includes a sessile, clonally reproducing polyp and a free swimming, sexually reproducing medusa (jellyfish). The two phases are fundamentally different in their functional morphology, but the cellular basis of this difference has been unknown. The authors generated single cell RNA-seq libraries from eight life-cycle stages of the animal to include polyps, and medusae. Their main finding is that different cell types underlie polyp-medusa transition in this animal. Although expected intuitively, this finding has never been demonstrated experimentally. Moreover, a recent study on a colonial hydrozoan (Salamanca-Diaz et al. 2025) has shown that colony parts, as opposed to different life stages, use largely the same cellular components. Therefore, the current study is of broad interest to developmental and evolutionary biologists. Overall, the experiments and data analyses have been performed to a high standard, the figures are of good quality, and the manuscript is well written. Below are a few minor points to be addressed.

      The Aurelia strain used in the study is somewhat ambiguous (suggested to be A. coerulea). The authors' statements on pp. 24, 25 are somewhat confusing--they first say they got over 90% alignment to the San Diego strain genome assembly but then state (in the 'Transcriptome mapping' section) that they got only 40% of their reads aligned, forcing them to use Trinity de novo transcriptome assembly. Please clarify.

      __Answer: __Alignment to the genome is different from assignment of the alignment to a gene model. Ambiguous alignment cannot be assigned, and missing gene models would not have an assignment. However, we have switched the mapping tool used for this dataset for one that fits both genome sequence alignment AND gene model assignment better than the previously available choices. We now have ~80% of all sequences unambiguously aligned to the genome.

      1. 7--the authors state that some transcription factor families are over/underrepresented as terminal type marker. How do they know which cells are terminally differentiated.

      __Answer: __We have removed our focus on transcription factor families in this work and recognize that the definition of a terminally differentiated cell state from single cell transcriptomics has not been clearly defined.

      The homeobox gene Tlx has been reported to be associated with medusa development, being absent in taxa without medusae (Travert et al. 2023). Is it expressed in the Aurelia medusa (I couldn't find it in the data), and if so, where?

      __Answer: __This is indeed a good point that we were also interested in. However, Tlx is detected ONLY in the ephyra libraries and at very low levels which is why we chose to avoid discussing it as the low detection prevents accurate reporting of the expression and could reflect rather a mapping problem for this gene (mis-annotated 3’ end). As information for this reviewer, the gene model shows some spurious reads specifically in a few neuron subtypes, and outside the ephyra is lowly detected ONLY in the medusa library for medusa neuron n.7 (n2.7m).

      I do not quite understand the authors' arguments for independent striated muscle evolution in cnidarians and bilaterians. Key striated muscle genes (e.g., titin) are present in hydrozoan and anthozoan genomes; furthermore, the expression patterns of Otx is not indicative because its function in medusozoans is unknown. What are the arguments against an alternative scenario in which striated muscles evolved before the cnidarian-bilaterian split, but lost in anthozoans?

      Answer: This is indeed a complex question, which requires a more thorough and targeted comparative analysis. We note that a BLAST hit for Titin can be misleading due to the many domain repeats of this Titin, which are also found in other proteins. To be more prudent, we removed this part from the manuscript. This will be subject of a future, thorough study.

      1. 27, the link https://github.com/technau/AureliaAtlas is broken.

      __Answer: __We appreciate this comment and have ensured that the github archive is publicly available with all relevant scripts associated with all versions of the BioRxiV record.

      p. 24 (limitations of the study section), the authors refer to "cosmopolitan species"; they probably mean "genus".

      __Answer: __We changed to “taxon” and dropped cosmopolitan.

      p. 24-25 on two occasions in the M&M sections, the authors put the abbreviation first and the initials in brackets (ASW and BSA).

      __Answer: __This has been corrected.

      "Metagenic" should be "metagenetic"

      __Answer: __This has been corrected.

      Reviewer #4 (Significance (Required)):

      The study is of broad interest to developmental and evolutionary biologists. It addresses an important question, not dealt with directly in previous studies.

      Answer: We thank reviewer 4 for this positive and encouraging assessment.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Link et al. have studied cell type diversity in the scyphozoan Aurelia coerulea. More specifically, they compared several stages in the animal's life cycle using single-cell RNA-seq. Many members of the cnidarian clade Medusuzoa (scyphozoans included) have a metagenetic lifecycle that includes a sessile, clonally reproducing polyp and a free swimming, sexually reproducing medusa (jellyfish). The two phases are fundamentally different in their functional morphology, but the cellular basis of this difference has been unknown. The authors generated single cell RNA-seq libraries from eight life-cycle stages of the animal to include polyps, and medusae. Their main finding is that different cell types underlie polyp-medusa transition in this animal. Although expected intuitively, this finding has never been demonstrated experimentally. Moreover, a recent study on a colonial hydrozoan (Salamanca-Diaz et al. 2025) has shown that colony parts, as opposed to different life stages, use largely the same cellular components. Therefore, the current study is of broad interest to developmental and evolutionary biologists. Overall, the experiments and data analyses have been performed to a high standard, the figures are of good quality, and the manuscript is well written. Below are a few minor points to be addressed.

      The Aurelia strain used in the study is somewhat ambiguous (suggested to be A. coerulea). The authors' statements on pp. 24, 25 are somewhat confusing--they first say they got over 90% alignment to the San Diego strain genome assembly but then state (in the 'Transcriptome mapping' section) that they got only 40% of their reads aligned, forcing them to use Trinity de novo transcriptome assembly. Please clarify.

      p. 7--the authors state that some transcription factor families are over/underrepresented as terminal type marker. How do they know which cells are terminally differentiated.

      The homeobox gene Tlx has been reported to be associated with medusa development, being absent in taxa without medusae (Travert et al. 2023). Is it expressed in the Aurelia medusa (I couldn't find it in the data), and if so, where?

      I do not quite understand the authors' arguments for independent striated muscle evolution in cnidarians and bilaterians. Key striated muscle genes (e.g., titin) are present in hydrozoan and anthozoan genomes; furthermore, the expression patterns of Otx is not indicative because its function in medusozoans is unknown. What are the arguments against an alternative scenario in which striated muscles evolved before the cnidarian-bilaterian split, but lost in anthozoans?

      p. 27, the link https://github.com/technau/AureliaAtlas is broken.

      p. 24 (limitations of the study section), the authors refer to "cosmopolitan species"; they probably mean "genus".

      p. 24-25 on two occasions in the M&M sections, the authors put the abbreviation first and the initials in brackets (ASW and BSA).

      "Metagenic" should be "metagenetic"

      Significance

      The study is of broad interest to developmental and evolutionary biologists. It addresses an important quastion, not dealt with directly in previous studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Link and collaborators presents a well-executed and thorough analysis (statistically significant) of cell types and developmental trajectories in Aurelia coerulea, a cnidarian with a medusa stage. While previous cnidarian cell atlases have focused on embryo-to-polyp development, this study uniquely incorporates adult medusa-stage cells, providing novel insights into cnidarian biology.

      The authors successfully identify a broad range of cell types and precursors in both polyp and medusa stages. By comparing transcriptional profiles, they demonstrate the presence of new cell types, such as neurons, in the medusa. Notably, they provide compelling evidence for the coexistence of both striated and smooth muscle within cnidarians-a topic they have explored in previous work. Their morphological analysis further suggests that striated and smooth muscle forms can exist within single cells, which is particularly intriguing. Overall, the results are convincing.

      A major strength of this study is the extensive number of cells analyzed and the rigorous classification of cell identities based on transcriptional profiles. Unlike many single-cell studies, the authors complement their findings with morphological, immunochemical, and in situ data, strengthening their conclusions. Conducting such an analysis without a fully annotated genome presents a significant challenge, yet the authors navigate this limitation effectively.

      One relative limitation, common to many single-cell studies, is the lack of detailed spatial information on the identified subtypes. While the authors have made efforts in this direction, a higher-resolution atlas that pinpoints these subtypes within the body would enhance the impact of the study. The absence of transgenic tools with cell-type-specific enhancers makes this difficult, but it remains a valuable avenue for future research.

      Despite this, the study's novelty and quality-particularly its inclusion of medusa-stage data-make it a strong candidate for publication in any journal associated with Review Commons.

      Minor Comments:

      • The term "terminal cell type markers" may not be the most appropriate for transcription factors that regulate state or specification. A more precise term, such as "state or specification transcriptional regulators," might be preferable.
      • The suggestion that cell-type specification is not governed by a random collection of TFs seems self-evident. If not TFs, what alternative regulatory mechanisms (e.g., post-transcriptional regulation, small RNAs) are being implied?
      • The rationale behind the observation that "'early' cells separate along three principal trajectories (cnido.1, cnido.2, and cnido.3m), then converge upon a second mature transcriptomic phenotype" could be more clearly explained.
      • The illustrations of the nervous system in the ephyra and rhopalia are intriguing but lack spatial context for different neuronal populations beyond the positioning of class 2 neurons ("alpha- and beta-tubulin cells").
      • Muscle characterization is well-supported by phalloidin staining and gene markers, but is there a specific marker for smooth muscle? Myophilin-like-2 is mentioned, but is it definitive?
      • The finding that ~40% of genes distinguishing smooth and striated muscle lack homologs in other animals is striking. It may be worth investigating their expression patterns via in situ hybridization, particularly for those that differentiate muscle types. The fact that these genes are of unknown affinity does not mean they are uninformative.
      • The incompleteness of Aurelia genomes is acknowledged as a limitation. However, since the San Diego strain genome appears to be the most complete, is there a reason it was not used in this study? Was it not possible to recover the same strain?

      Referees cross-commenting

      Referees, I generally agree with their assessments. Below, I outline my main concerns and suggestions for improvement.

      Figures and Data Presentation

      I concur with Referee 1 that the figures are overcrowded, making it difficult to interpret individual panels. The excessive number of panels within a single figure creates unnecessary complexity. Some of these could be moved to the supplementary materials to improve readability. It seems that the authors aim to present every possible data analysis, but this is not necessary within the main text. As Referee 1 also noted, the key findings should be clearly visible, allowing the reader to follow the story without getting lost in excessive detail.

      Additionally, the annotation of clusters remains unclear, a concern also raised by other referees. The manuscript would benefit from a more explicit description of how these clusters were assigned.

      Writing and Discussion

      While I do not have major concerns with the writing, I suggest expanding the discussion, particularly regarding the relationship between muscle cell types and the diversification of paralogs. If the figures are streamlined, the text can also be made more concise, avoiding exhaustive references to every individual data point.

      Clarifications on the Muscle Section

      Several aspects of the muscle analysis require clarification:

      • The differences between muscle cell types are based on a set of differentially expressed genes, 40% of which (in each set) are of unknown affinities. However, it is surprising that the regulatory genes shared between both muscle profiles are expressed in bilaterian smooth muscles. The manuscript does not address whether bilaterian striated muscles share regulatory genes with the Aurelia striated muscle set. This comparison would be valuable.
      • The high proportion of unknown genes is concerning. Is this due to issues with the transcriptome assembly, or is it a consequence of insufficient comparative analyses? The statement that "Mapping to this final transcriptome increased confidently mapped genes to 60%" raises questions-does this mean that 40% of differentially expressed genes remain unmapped? This point should be clarified.
      • Given the large number of differentially expressed genes with unknown function, could the authors perform in situ hybridization assays on a subset of these genes? This could provide insights into their spatial expression patterns and potential functional relevance.
      • Both muscle types appear to rely on a similar contractile apparatus but exhibit differential usage of paralogs. This finding is intriguing but is not sufficiently discussed. Are other cell types associated with the differential use of paralogs? Expanding this discussion would add depth to the manuscript.

      Neuronal Subtypes

      I reiterate my previous comment regarding neuronal types:

      • The enrichment of neural subtypes in the medusa stage is an interesting, albeit expected, finding. However, the manuscript lacks details regarding their specific spatial distribution within the body. Providing this information would enhance the biological relevance of the findings.

      References

      I also agree with Referee 2 that some statements require further substantiation with appropriate references. Strengthening these points with supporting literature would improve the rigor of the manuscript.

      Final Remarks

      Overall, while the study presents interesting findings, the manuscript would benefit from a clearer organization of figures, a more explicit explanation of muscle and neural subtype findings, and a deeper discussion on the significance of unknown genes and paralog usage. Addressing these concerns will enhance the clarity and impact of the paper.

      Significance

      Overall, this is a significant and well-supported study that advances our understanding of cnidarian cell diversity and muscle evolution.

      By examining how cell types change across the polyp and medusa stages, this study provides valuable insights not only into cnidarian development but also into broader evolutionary questions regarding the emergence of new body plans and tissue types.

      As a developmental biologist specializing in invertebrates, I find the results of this work particularly remarkable. It provides valuable insights into the developmental processes occurring in pre-bilaterian animals, shedding light on how cell types emerge and diversify in early-diverging metazoans

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This paper is well-written and serves as a valuable resource not only for the cnidarian community but also for researchers studying more broadly cell type identity and evolution. A key cell type enabling the transition from polyp to free-swimming medusa is the cnidarian striated muscle, which has only been morphologically identified in medusozoan jellyfish. While this study does not include functional analyses, it lays the foundation for the Aurelia research community to leverage single-cell atlas data for future investigations.

      Key experiments supporting the paper's main conclusions are missing :

      • At the beginning of the Results section, the authors mention identifying a previously undescribed developmental stage, which they name "clover-polyp" However, they do not later discuss whether this newly identified stage has a distinct gene expression signature. This point should be addressed in the paper or removed.
      • A key reference is missing in the following sentences :

      "The anthozoan Nematostella vectensis has two principal neural sub-families that have been described that correspond to those with insulinoma expression (n1) and those with pou4 expression (n2) (13,14)."

      "The class n1 family also includes putatively non-neural secretory cell types ("s"), which are enriched in genes associated with digestion and extracellular matrix production (Data S1.10). These data suggest a close relationship between neurons and gland cells, like what has been suggested in other cnidarians (13,27)."

      "Thus, similar to that described for the anthozoan Nematostella vectensis (13,14), Class 1 neurons and related secretory cells comprise the predominant type of neuroglandular cells in the polyp stage. Further, these are the primary neuroglandular cells within the gastrodermis of the medusa."

      The first functional analysis of NvInsm1+ expressing neurons and secretory cells in Nematostella vectensis was conducted in this study (Tournière, O. et al., 2022), making it essential to cite this work. - To validate the neuronal component of this single-cell data, it is essential to confirm the N1 and N2 populations and demonstrate that they do not overlap. I recommend performing in situ hybridization or antibody staining for Insm1+ and Pou4+ cells (or any other suitable markers for these populations) to show that they are expressed in distinct cells/region in Aurelia. - What is labelled in yellow in Figure 5C? The legend should be updated. - Figure 5i, j, and k, are not clear, the paper would benefit with bright field pictures. - Each figure should connect specific gene expression at a given stage with the corresponding single-cell expression data in a dot plot. For instance, in Figure 6, myofillin-like 2, mhc1, and mhc2 should be accompanied by their respective single-cell expression data at this stage in a dot plot. - The authors repeatedly refer to the polyp as asexual and the medusa as sexual; however, they do not mention any gonadal cluster nor discuss its absence from their single-cell data. - The authors include EdU experiments in Figure S2 but discuss them only briefly in the text. If these experiments provide new insights, they should be elaborated on; otherwise, they could be removed from the manuscript. - As this paper is primarily a resource for the cnidarian community, ensuring easy access is crucial for enabling species comparisons. I recommend making the data openly available through a single-cell portal, as done in Juliano et al. (2019).

      Significance

      This well-written paper is a valuable resource for the cnidarian community. A key cell type driving the transition from polyp to free-swimming medusa is the cnidarian striated muscle, which has only been morphologically identified in medusozoan jellyfish. While the study lacks functional analyses, further biological validations, such as in situ hybridizations, are needed to confirm the single-cell data. Nevertheless, it lays a strong foundation for the Aurelia research community to utilize single-cell atlas data in future studies. To maximize its impact, the authors should ensure the data is easily accessible to the broader scientific community.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: Overall, this study adds a large amount of data for the scyphozoan Aurelia coerulea by producing several single-cell RNA sequencing libraries that cover the transition from polyp to medusa. The study provides a modern view of cell type diversity and cell-specific transcriptome changes during this period of extreme morphological change in this particular cnidarian lineage, which is understudied. Certain unique cell subtypes, including neural cell subtypes and muscle cell subtypes which are specific to different life stages are discussed in detail providing some new insights.

      My overall assessment is that the manuscript has good potential to be impactful, but in its current form it is somewhat clunky and overly complex to read, the figures were too crowded and difficult to comprehend, and the authors did not provide enough context regarding the current state of knowledge and what this study adds to it. In particular, Figure 1 and the section about striated and smooth muscles sharing partial transcriptomic profiles need the most work. The results were presented in the context of the anthozoan Nematostella but this should be broadened further to include other cnidarian single-cell studies, such as those from Hydra and Clytia which are both medusozoans like Aurelia. The writing throughout could be streamlined and simplified to better highlight the major findings as described in the abstract of the paper. Several figures were not well presented or clear and could be improved or decluttered to better communicate and support important results. In addition, some methods were totally missing, and I was unable to access the github repository associated with the paper which should detail all analyses described in the paper. In its current form, reproducibility of analyses would be quite limited. I did greatly appreciate the inclusion of the data on the UCSC Cell Browser, which allows anyone to access the single cell data matrix for visual exploration.

      Major comments:

      The Introduction section was very short - only three paragraphs. I feel that this section could be expanded to give more context about Aurelia as a research organism, and the current resources available. This includes genomic and transcriptomic resources particularly those focused on the transition between life cycle stages (polyp to medusa). Any other relevant background on cell type diversity or if there is anything known about the molecular profile of specific cell types found in different life stages should also be included here. Do marker genes already exist for some of the important cell types discussed in the manuscript? It would be better to present the current state of knowledge, and context for why this study was done, how it builds upon current knowledge, and what it adds to our current understanding so that the study is properly framed from the beginning.

      In the Results section, I find the sentence on p. 4, "Further, ~70% of these gene models do not have readily identifiable orthologs and thus represent putative orphan genes" to be rather confusing. What analysis was performed to determine this percentage, and which set of organisms were compared? Doesn't this percentage seem rather high for a cnidarian? Or is this referring to orthologs outside of cnidaria? Please comment further on how this percentage was determined and possible explanations for it being this high. Right now, it just feels tacked on to this paragraph with no context or further explanation which leads to the confusion.

      Figure 1. There are many issues with this figure that encompass how I felt generally about the figures of the paper. The figure should ideally take up the entire width of the page rather than squishing some text next to the figure.

      Figure 1A: The colors of the different developmental stages from which tissue was samples (e.g. polyp1, polyp2, polyp.clover) do not seem to match between legend and figure. For example, the "polyp.clover" stage is circled in blue in the schematic, but given a green dot in the legend. The "medusa.manubrium" is circled in orange in the schematic, but given a purple dot in the legend. Suggest making the colors match between legend and schematics.

      Figure 1E: In Panel E, the labels showing that the top graph is "polyp" and the bottom graph is "medusa" are much too small. Increase the font size of the labels. The font size for the GO terms themselves are also too small.

      Figure 1F: The bulk of this study centers around the single-cell RNA sequencing data and resulting analyses from these data. As such, I would expect the cellular atlas resulting from these data to be similarly highlighted. In Figure 1F, the annotated cell atlas as presented is much too small, making it impossible to even add the labels for the different clusters directly on the UMAP. Suggest increasing the size substantially to at least half of the page width, so that it is possible to do so.

      • There should also be a complimentary figure in the supplement that shows all of the individual clusters, each in different colors and clearly annotated with labels, rather than just showing multiple clusters that were combined into the major cell types. There is an example of this in the Clytia single cell paper (see Chari et al. 2021 Figure 2A vs Fig S9).
      • The graph on the right of this panel showing the "Distribution of cell types in time and space" is overly complicated with all of the colors and the meaning is quite lost as it is quite difficult to interpret at this very small size. Suggest removing and possibly showing as a supplemental figure so that it's meaning is easier to assess.
      • In addition, striated muscles are marked on the overall UMAP; however, it is not noted until later that the smooth muscles are part of the "outer epidermis" cluster. Suggest altering the legend or the text of the figure itself to show where the smooth muscles are thought to be in the overall UMAP, especially since they are specifically discussed in depth later in the manuscript. Exactly which "part" of the outer epidermis cluster includes the smooth muscle cells?

      Figure 1G: Panel G, for example, is not useful in conveying its point as the text labels are too tiny and the figure is overly complex to be squished into a panel of this figure. Suggest removing and making 1G a supplemental figure by itself or perhaps together with 1C (as they are linked) where it is more legible. The figure legend text for Fig 1G is also confusing as it refers to "scyphozoa" in (C) but there is no "scyphozoa" in 1C, only "medusa".

      Text, p. 6: The explanation for how the clusters were annotated in Fig 1 and Fig 2 is much too vague. The text states, 'We identified 9 broadly defined cell populations, for which we assign identities by assessing up-regulated gene lists (Data S1.3)." What does this mean? How exactly were the up-regulated gene lists assessed? This needs to be clarified further. What genes were used to label these clusters or groups as particular cell types? How does the annotation relate to Supplemental Tables S1.3 and S1.3b? Does the previous literature need to be cited to support these annotations based on specific genes? Suggest doing a better job overall and providing more detail and context explaining how the single cell clusters were annotated.

      Text, starting on p14: "Striated and smooth muscles share partial transcriptomic profiles." This section is highly confusing and could do with some simplification in both text and figures.

      • The genes for which expression is shown in Fig. 5, 6 and 7 are not properly introduced or given nearly enough context in the text. For example, the text states, "To investigate the dynamics of muscle formation, we further compared phalloidin staining of muscle fields with in situ hybridization detection of specific cluster marker expression in polyps (Fig. 5), strobila (Fig. 6), and ephyra (Fig.7)." However, it is not until the legend of Figure 7 and also much later in the text (in the Discussion, p23) that it is noted what types of muscles each of the genes used in ISH actually mark ("While a small set of genes are shared across the two muscle phenotypes (e.g. stmyhc1 and mrlc2), others are more specific to either phenotype (eg. stmyhc5 in striated muscle; myophilin-like-2 in smooth muscle) (Fig.8A), which were verified by in situ hybridization (Figs.5,6,7)". This needs to be rewritten and improved for flow and clarity purposes.
      • Suggest that the authors show an overall UMAP of smooth and striated muscle (perhaps the smooth muscle subtypes are part of the large 'outer epidermis' cluster; see the comment for Figure 5B above), and then include featureplots that show the expression of each of the genes used in ISH in these clusters. This might make it clearer as to what type of muscle the genes should be highlighting within each developmental stage. It might look something similar to what is shown in Figure 7P (although it is unclear how the featureplots shown in this figure relate to the UMAP shown in Figure 5B). In addition, the featureplots in Figure 7P only show 3 out of the 4 genes used in ISH which is not helpful. Featureplots should be clearly shown for all genes discussed. This is essential to linking the pattern in the single-cell data to the expression data and is the minimum required to provide clear understanding.
      • The text reads, "To investigate the dynamics of muscle formation, we further compared phalloidin staining of muscle fields with in situ hybridization detection of specific cluster marker expression in polyps (Fig. 5), strobila (Fig. 6), and ephyra (Fig.7)." However, Figure 6 also contains images of ephyra (Fig6. P-S). Suggest that those panels could be included in Figure 7.
      • There are parts of this section text where reference to the Figures is complicated and not easy for the reader to follow. I got particularly confused in trying to follow this part of the manuscript. For example, a sentence on p15 reads, "mrlc2 and stmyhc1 reads are detected in both muscle types (Fig. 7pFig. 5M, Fig 6C,E,G-P, Fig. 7J-L,N-P), and ISH indicates that the expression is localised to the fields of striated muscles in ephyrae (Fig.7J,K,N), as well as the smooth muscle populations in polyps including longitudinal tentacle muscles, radial muscles of oral disc and retractor muscles of the body column (Fig. 5M, Fig.6H,I,L,M), and the muscles of the manubrium in the meta-ephyra (Fig. 7L,O)." It is quite difficult to keep jumping between Figures and panels to look at this. A better organization of the Figures and much clearer text that doesn't jump around could go a long way to making it easier to follow.

      Discussion

      • The authors do try to put their results into context with the two Aurelia genome papers (Gold et al. 2018, and Khalturin et al. 2019) and two additional bulk transcriptome studies (Fuchs et al. 2014, Brekhman et al. 2015), but not until the first part of the Discussion. In principle, this would be fine. However, in practice, their discussion of these studies is somewhat vague and generalized and did not really provide a clear review or analysis of how adding in cell-type specific data is helping our understanding. The argument about how their results fit with previous findings was confusing and unclear. They start by discussing "genome usage" but then switch to talking about cell type diversity across life stages. The connections between "genome usage", "gene representation", and cell types was not easy to follow. Suggest rewriting this section to clearly discuss the findings in this manuscript in the context of previous studies with straightforward and precise language.
      • In the discussion about the neural subtypes, comparisons are only made to Nematostella where there are also two major neural classes. It would be even better to include discussion of single-cell data related to neurons in other cnidarians, such as Hydra, where there is detailed discussion of neuron subtypes in both a published manuscript (Siebert et al. 2019, Science) and a preprint (Primack et al. 2023, biorxiv) and Clytia (Chari et al. 2021, Science Advances). I do see that Clytia and Podocoryna are mentioned in the next section of the Discussion, specifically related to the Otx gene.
      • The section about muscle subtypes in the Discussion would need to be rewritten in accordance to changes suggested above for the Results for this section.

      Materials and Methods

      • In the section "Comparison with Nematostella" the authors discuss running OMA to generate the set of identified 1:1 orthologs but never go on to mention how many orthologs were identified. Please report this number so it is clear whether this is a small or large subset of the total analyzed. In a recent study of the Hydra AEP strain (Cazet et al. 2023 Genome Research), a similar analysis was done between Hydra and Clytia and they found 5979 genes with 1:1 orthologs between the two species. There should also be a supplemental datasheet that provides a list of these orthologs (See Supplemental Data S17 provided in Cazet et al. 2023 as an example). I am curious to know how many 1:1 orthologs were found between Aurelia and Nematostella. I would expect there to be a smaller overall number than between Hydra and Clytia due to the larger phylogenetic distance between these two taxa. I also strongly suggest that the Cazet et al. 2023 paper should be referenced, as it was the first time an attempt to compare single-cell datasets between two cnidarian species was done. The current manuscript took an alternative approach to comparing Aurelia to Nematostella, so it would be good to acknowledge this and justify the methods used in this manuscript compared to those used in Cazet et al. 2023.
      • There are missing descriptions of methods throughout the paper. One example is in the section about Transcription Factor families that are over or underrepresented amongst upregulated genes compared to their distribution in the genome - I could not find any description of the methods used to identify these Transcription Factor families in the dataset of Aurelia upregulated genes. How were these families chosen? How were they identified in this dataset?
      • I noticed in the Data and materials availability statement and a few other places in the manuscript, a github repository was mentioned: https://github.com/technau/AureliaAtlas. I tried to access this repository to review what was included, but unfortunately it is not accessible. I found seven repositories within github.com/technau but the AureliaAtlas was not one of them. This repository should include all scripts to generate all figures and other analyses in the paper and should be made available to reviewers to better understand exactly how all analyses were completed. A good example of how this could be done is found in the repository related to Cazet et al. 2023 (https://github.com/cejuliano/brown_hydra_genomes), which is very comprehensive and easy to follow.
      • When I looked through a similar repository https://github.com/technau/CellReports2022/ from the Steger et al. 2022 Cell Reports Nematostella single-cell paper from this same group, I find it to be rather disappointing. They apparently included all code to generate all figures in a single R file that is not easy to follow and not well commented. If this is the same strategy used for this manuscript, I feel that a much stronger effort could be made to make the analyses of this Aurelia manuscript transparent by producing a github that is more like that of https://github.com/cejuliano/brown_hydra_genomes from the Cazet et al. 2023 paper which organizes each type of analysis in a different github subfolder and within each subfolder they include very detailed information and comments explaining each step of each analysis. Doing this would go a long way to making the analyses in this manuscript more transparent and easier to follow and would certainly put some of my concerns to rest.

      Minor comments:

      Figures:

      Figure 2A: In the legend it says "Colour code as in (B) and (C)" but it's really referencing the colors in Figure 1A, correct? It is confusing to have to look back to Figure 1A to understand the colors here.

      Figure 2D: Typo in the word "proteins" in the title of this panel.

      Figure 3F: The placement of the tree and the two featureplots for myc3 in Nematostella and Aurelia is confusing. Suggest moving the featureplot for Aurelia myc3 so that it is beside Nematostella (to the right of the tree) or move the featureplot for Nematostella myc3 so that it is beside the Aurelia featureplot (to the left of the tree).

      Figure 4B: The description of this panel reads, "Distribution-histogram across all samples, medusa-specific cell clusters are highlighted with black outline.", however as a reader, the black outline is not very clear. Suggest making it bolder. In addition, this black outline is a little confusing - it should mark the medusa-specific cell clusters; however, the black outline appears in cell clusters in strobila and ephyra?

      Figure 5B: It is unclear from where this reference UMAP was derived. Does it come from the overall UMAP, showing the 'outer epidermis' cluster only, with the putative smooth muscle cells in red? Or is it the 'outer epidermis' cluster plus the striated muscle cluster? Suggest making this clearer (see below for larger edits to this section of the manuscript).

      Figure 5K/L/M: It is unclear which parts of the polyp in K is used for the images shown in L or M. Both come from the large red box, but it is unclear from which part L and M were made. In addition, the subtraction of the background from the image (to make it look white) is distracting and makes the image itself look artificial.

      Figure 6C, G-S:

      • Not sure what the blue boxes around these panels are meant to highlight?
      • Also not sure what the image in the left of panel C is. Perhaps an oral view of the strobila? The legend or panel itself should mention this.
      • Again, subtraction of the background from the image (to make it look white) in panels C, D and E is distracting and makes the image itself look artificial.

      Figure 6J, M, N, O:

      • For someone not accustomed to looking at images of strobilating polyps, it is unclear what part and what orientation these images are taken of. Suggest including some of these details in the figure legend at least. Fig 6O actually looks like an ephyra, but is annotated as an "advanced strobila"?

      Figure 7H:

      • Not sure what the white lines in this panel are meant to indicate?

      Results:

      p5 - In this sentence, "Because these four pouches look like a cloverleaf from above, we call this stage the "clover-polyp", suggest changing "clover-polyp" to match the Figure 1A (where it is written as polyp.clover), or change the text in the Figure to match the text in the manuscript.

      p8 - In this sentence, "the bZIP protein family are over-represented as terminal cell type markers, while the number of zinc-finger proteins of the N2C2 class are under-represented", the "N2C2" class the authors refer to is not clear. Is there a typo here? In the figure to which this sentence refers (Figure 2D), the proteins referenced are "zf-H2C2" or "zf-C2H2".

      p9 - Typo - should be "medusozoans" rather than "medusazoans".

      p11+ - Section titled, "Aurelia neural complement reveals two neural classes with similarities to anthozoan neurons"

      • I found the classification of N1 and N2 to be confusing, since initially they are described as neural clusters, however N1 in particular is shown to consist of primarily secretory, non-neural cell types. For example, when looking at Figure 4A and B, it is evident that N1 contains only a relatively small number of neural cell-types (in shades of orange), while most of the cells are other secretory, but non-neural cell types (in shades of brown). Not sure if the authors should alter the title to reflect this? For example, instead of 'neural' classes, they could be called 'neuro-secretory' or 'mixed neural and secretory classes'?

      p11 - Text reads, "Class 1 neurons in the medusa are also most prevalent within the gastrodermis and manubrium, and includes one subtype that first appears in the strobila and is found in all medusa tissue samples ("n1.3.medusa"; lower black box Fig. 4F).", however there is no "lower black box" in Figure 4F apparent.

      p13 - The text reads, "We find that class 2 neurons all express elevated levels of specific alpha- and beta- tubulins (TBA1-like3 and TBB-like-1; Fig. 4D).". Make the capitalization of your gene names (TBA1-like3, etc) consistent between text and figure throughout (in Fig. 4D the gene names are lower case).

      p14 - In the first paragraph of this page, Fig. 4C is referenced twice, however both times the referencing sentence does not match this panel (most likely the authors meant to reference 4E, F or G).

      p14 - The final sentence of this upper paragraph, "Specific tubulin-paralog expression within the class n2 neurons suggest that this is the portion of the nervous system labelled by the β-Tubulin antibody." is confusing. Do you mean that the b-tubulin antibody is most likely labelling the product of the tbb-like-1 gene that is shown in the featureplot in Fig 4D? Suggest rewriting this sentence for clarity.

      p14 - on this page and others in the manuscript, there are instances of the word "Aurelia" not being italicized.

      p14 - In this sentence, "In the sea anemone Nematostella, anemone-specific gene duplications of members of the PaTH (Paraxis, Twist Hand-related) bHLH family of protein coding genes was driving the diversification of muscle cell types (29)." the "was driving" part of the sentence is grammatically clunky. Suggest rewording slightly. (e.g. "...protein coding genes drive the diversification of muscle cell type").

      -Myophilin-like2 in the text of the manuscript is written as myofilin-like2 in the figure panels (e.g. Fig 5L, Fig. 6D). Make consistent between text and figures.

      p15 - on this page and several instances thereafter, "in situ" is not italicized as it should be.

      p19 - In the line, "Taken all together these data suggest that the contractile apparatus in the Scyphozoa, using here Aurelia as a proxy, is similar to the bilaterian smooth muscle contractile complex (Fig. 8C)." this should really reference Fig. 8 B-C

      Significance

      General assessment:

      I believe this manuscript adds a significant amount of useful data and provides some novel insights into scyphozoan cell types across an important life history transition from polyp to medusa in Aurelia. Adding the dataset to the USCS Cell Browser is a strength. I think there is the potential to make this an impactful paper but in its current form, it is pretty messy, and not clearly presented, and lacks some transparency. The greatest weaknesses lie in not framing the work adequately or putting it into enough context with previous work and also not relating it to other medusozoans; in the Figures which are overly crowded, and confusing rather than being clear and supporting the results; and in the lack of explanation for some methods like how cell clusters were annotated, how transcription factor families were determined; and the lack of access to the github data repository, which raises questions of reproducibility. It will take a good amount of restructuring figures and reframing to make the study clear and impactful and the methods and analyses reproducible.

      Advance: If the weaknesses are addressed adequately, this study does contribute new insights in the area of further understanding changes across an important scyphozoan life cycle transition in terms of diversity of cell types and their cell-type transcriptomes, opening up further questions which can now be addressed.

      Audience: The broader cnidarian community will be interested in this study. People studying cell type evolution and cell type novelty across the tree of life will also be interested. Anyone looking for examples of how to use modern approaches to understanding life cycle changes in animals will be interested.

      My expertise is in cnidarian cellular and molecular biology and evolution including working with model cnidarian research organisms and employing techniques and approaches similar to those used in this study.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to the Reviewers

      We thank three anonymous Reviewers for their careful examination of our manuscript. Below, we provide a point-by-point response.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      1. EVIDENCE, REPRODUCIBILITY AND CLARITY Summary

      Hubbert and colleagues describe ExTaSy, a CRISPR-Cas9-based platform for the endogenous tagging of proteins in Drosophila melanogaster. The system combines several established molecular tools into a single-vector framework: homology-directed repair (HDR) for the insertion of a 3XHA tag at the endogenous locus, piggyBac transposase-mediated near-scarless removal of a transgenic selection marker, and φC31 integrase-mediated recombination-mediated cassette exchange (RMCE) for subsequent tag swapping. The authors demonstrate the system across a set of 65 genomic loci and provide a bioinformatic pipeline to automate guide RNA and homology arm design.

      Major Comments

      1. Validation of knock-in lines is inadequate and does not reflect current standards in the field. The authors state that correct insertions were confirmed using "two PCRs per inserted fragment done with primers binding to the 5' and 3' ends of the inserted DNA and corresponding gene-specific validation primers." This strategy is well known to produce false positives, as it cannot distinguish correctly targeted single-copy integrants from concatemeric insertions at the target locus (e.g. Skryabin et al., 2020). The current standard for validating CRISPR-mediated knock-ins requires PCR amplification using primers that anneal outside the homology arms and span the entire inserted cassette. These reactions must be performed under conditions that minimise the formation of PCR chimeras, specifically low cycle numbers and use of a high-processivity polymerase. The authors should either provide data from such experiments for their characterised lines, or clearly acknowledge this limitation and qualify their efficiency estimates accordingly (see related point 2 below).

      __Response: __We originally opted for using primers that span a fragment from the inserted DNA into the genomic locus for ease of amplification, which is currently standard in the field (e.g., Kanca et al. 2022). We usually run these PCRs in a heterozygous background (before homozygous stocks are established or because tagged lines remain balanced), and the unmodified locus preferentially amplifies in a whole-fragment PCR. However, we have recently started running whole-fragment PCRs and plan to repeat them for all loci and will report the results in a revised version of the manuscript. We are also revising the manuscript to reflect the necessity (or at least preference) to perform insert-spanning PCRs.

      Reported efficiency metrics do not adequately distinguish correctly targeted integrants from marker-positive flies.

      A related concern is that many of the efficiency parameters reported in the manuscript appear to be based solely on the detection of the marker cassette. The 63.1% overall success rate, for example, seemingly reflects the recovery of DsRed-positive flies rather than of sequence validated, single-copy, on-target integrants. These are fundamentally different quantities, with only the latter being of practical value for the users of the described technique. The authors should either provide data that properly accounts for correct integration, or more carefully define what each reported metric represents and explicitly acknowledge the limitations of using marker presence as a proxy for successful knock-in.

      __Response: __The reviewer is correct that the numbers we report are DsRed-positive flies. However, most have been confirmed with end-of-fragment/locus spanning PCRs, so are on-target (although not necessarily single-copy; see comment #1). While we cannot categorically exclude off-target insertions, we have not observed any cases where the DsRed segregates independently of the targeted chromosome, which at least makes off-target insertions on other chromosomes highly unlikely. We will clarify in the text that the 63.1 % success rate relates to DsRed marker expression and insertion site-spanning PCR and acknowledge the limitations as suggested by the reviewer.

      The characterisation of tag exchange requires expansion or more careful framing of its scope.

      The possibility of exchanging tags through fly crosses rather than repeated microinjections is, in the view of this reviewer, the most practically useful feature of ExTaSy and the aspect most likely to drive community adoption. It is therefore important that this feature is characterised with sufficient rigour to allow prospective users to assess its reliability. In the current manuscript, tag exchange has been demonstrated at only five loci using a single replacement tag (sfGFP). The dataset includes one outright failure (the Met C-terminus) and one instance of an unexpected 9 bp insertion at the recombination site, leaving the success rates and failure modes across a broader range of loci and tags uncharacterised. The authors should either expand the tag exchange experiments to cover a more representative set of conditions, or frame the current data explicitly as a proof of concept and limit their conclusions about the practical utility of tag exchange accordingly. In either case, the value of this work to the community would be substantially increased if a collection of donor lines carrying the most commonly used tags for different applications, as the authors themselves enumerate in the Discussion, were generated and deposited at a public stock centre such as the VDRC concurrent with publication. On this note, it is also worth flagging that at present the plasmids described in this study have not yet been deposited at Addgene or the European Plasmid Repository, and that fly lines are available only on request. For a methods paper aimed at community adoption, deposition of reagents in publicly accessible repositories at the time of publication is the expected standard.

      __Response: __We are in the process of increasing the number of fly stocks for which tags have been exchanged and will be able to provide a more rigorous characterization with an updated version of the manuscript. We are also working on additional swap lines (for example T2A-GAL4). Regarding submission of the materials to relevant databases, we are in the process of depositing the plasmids on Addgene. We plan to deposit the swap lines and other toolkit stocks (new hs-Flp, vas-int lines as well as pBac transposase lines) at the VDRC or BDSC. To make the tagged fly lines viable for distribution via the VDRC, we are working to increase their numbers, and we plan to publish them separately as a resource, where we also plan to characterize the expression of more transcription factors and their isoforms in greater detail.

      The Introduction should better reflect the current state of the field, including explicit comparison with MiMIC and CRIMIC.

      The introduction would benefit from a clearer distinction between transgene-based approaches that introduce additional gene copies and true CRISPR-mediated knock-ins at the endogenous locus. As it stands, the discussion of prior methods does not sufficiently acknowledge that CRISPR-based knock-in is already the standard approach in Drosophila, and that the individual techniques employed in ExTaSy are well established. Notably, the MiMIC and CRIMIC systems (Nagarkar-Jaiswal et al., 2015; Li-Kroeger et al., 2018), which also support RMCE-based tag exchange at endogenous loci and for which large collections of lines are already publicly available, are not adequately discussed. These are arguably the closest comparators to ExTaSy, and the authors should explicitly address how their approach differs from and offers advantages over this existing framework, particularly given that MiMIC/CRIMIC insertions can also tag internal sites and thus avoid some of the terminus-specific complications described here.

      __Response: __We will expand the introduction and the discussion to give more reference to other resources for endogenously and exogenously tagged genes in Drosophila and compare ExTaSy in greater detail with other methods, highlighting advantages and disadvantages of each and making clear that RMCE-based tag exchange and marker removal are not novel inventions.

      • *

      Minor Comment

      The labelling of sgRNA target sites in Figure 1 is inaccurate and should be corrected.

      In Figure 1, the sgRNA target sites are annotated with triangles labelled "PAM synth." The presence of a PAM is necessary but not sufficient to define a target site; the label should therefore be changed to "target site" or an equivalent term. Additionally, the Methods section incorrectly expands PAM as "primary adjacent motif"; the correct expansion is "protospacer adjacent motif."

      __Response: __The labelling in Figure 1 will be changed and the PAM abbreviation corrected.

      Could the fly crossing scheme in Figure S3 be simplified?

      In the scheme in Fig. S3 the second step seems to be intended to introduce the hs-Flp and vase-Int transgenes. Would it not be possible to already incorporate the Integrase into the swap fly line when it is made and the hs-Flp into the ExTaSy line, thereby saving one generation?

      __Response: __This would in principle be possible; however, we prefer to keep the lines “clean” in case a tag exchange is not desired, and so this would require an initial crossing step. We therefore prefer the crossing scheme as it is.

      Figure 1F has no call out in the main text.

      __Response: __This will be corrected.

      Line 155: What was the reason for the low survival rate? Is this likely to be indicative of a problem during marker removal, or a stochastic event as not all fly crosses are always productive (bad food, early death of flies, etc.)?

      __Response: __This was a stochastic event. The fly line we used for expression of piggyBac transposase (BDSC_8285) is generally not growing well, and we could only use one eighth of all offspring to ensure correct segregation. We will make this clear in the text.

      Line 160: What is the N number of "all cases"?

      __Response: __This will be changed to “We performed Sanger sequencing for one established line for each of the 17 loci and confirmed clean excision of the piggyBac sites in all cases.”

      Scale bars are missing in Fig. 3g,h.

      __Response: __These will be included.

      • *

      Line 219: The labeling of the panels got mixed up. Panel F does not show an immunostaining.

      __Response: __The labeling will be corrected.

      Line 226 and Fig. 3h: It is unclear what area is shown in the inlay. The overview image highlights three POIs, but none seem to fit the inlay.

      __Response: __The images were indeed misleading as the inlay did not show a magnification of the same focal plane. We will show the inlay together with the overview of the corresponding focal plane as part of Supplementary Figure 5 and will amend the text accordingly.

      Line 233: Why was the transgenic marker not removed? The authors want to highlight the easy and advantage of marker removal, so leaving in the marker is an odd choice.

      __Response: __In this case, we observed that flies become homozygous even with the marker, so we assumed that a marker removal would not be necessary. We are currently performing additional experiments to remove the marker and repeat the staining, which we will submit with a revised version of the manuscript.

      Line 250: Why was only one isoform of hth tagged? Without a rational this seems to be an odd choice, in particular since the authors seem to suggest in the introduction (Line 38) that a disadvantage of previous technologies is the tagging of only selected isoforms.

      __Response: __While expanding the introduction (see comment #4), we will also rephrase it to highlight that current CRISPR-based methods (MiMIC and CRIMIC) are designed to tag all isoforms simultaneously or select isoforms, whereas overexpression constructs are limited to one isoform. In contrast, ExTaSy allows tagging of all isoforms that share a terminus. We will emphasize advantages and disadvantages in the discussion. In the case of hth, three different C-termini are annotated, and we are currently performing experiments to also tag the other termini and co-stain them with Ubx. We will submit the results in a revised version of the manuscript.


      Reviewer #1 (Significance (Required)):

      SIGNIFICANCE

      ExTaSy assembles a set of well-established tools, namely CRISPR-mediated HDR, piggyBac-based marker excision, and φC31-mediated RMCE, into a unified, single-vector framework for endogenous protein tagging in Drosophila. The individual components have all been described and are in routine use in the field; the conceptual advance is therefore limited. Nevertheless, the integration of these features into a streamlined platform with accompanying automated design software represents a practical contribution that is likely to be of genuine utility to the Drosophila community, particularly for laboratories without specialist transgenesis infrastructure.

      The possibility of tag exchange by fly crossing is the most distinctive feature of the system. However, as discussed above, this is currently demonstrated at only five loci with a single replacement tag, which limits the conclusions that can be drawn about its generality. More broadly, ExTaSy employs well-proven strategies throughout, which is a source of reliability but also means that the study does not incorporate more recent developments in the field. For example, approaches based on single-strand annealing, such as the recently described Seed/Harvest system (Aguilar et al., 2024), can achieve entirely scarless marker removal and thus circumvent the TTAA scar left by piggyBac excision, a limitation the authors themselves acknowledge may reduce expression at modified N-terminal loci. Similarly, the current system is restricted to N- and C-terminal tagging. Given that the goal of endogenous tagging is to minimally perturb protein function, and given the now widespread availability of high-quality protein structure predictions for the Drosophila proteome, a modern tagging platform might be expected to use structural modelling to identify optimal insertion sites irrespective of their location. These are not oversights that diminish the practical value of the current work, but highlight that this study does not always operate at the cutting edge of method development in this area. A brief discussion of these more recent developments in the context of ExTaSy's design choices would usefully situate the work within the broader landscape and help readers understand both what the system offers today and where improvements are likely to come from.

      __Responses: __

      • As stated above, we are currently performing experiments to further validate the tag exchange.
      • Regarding the SEED/Harvest system, we have considered this; however, this would leave both flanking attP/attB sites at the genomic locus rather than only the site between the tag and the CDS. Both sites would have to be incorporated into the CDS or they would leave an even bigger scar. Additionally, since SEED/Harvest relies on micro-homology between two tag halves, it would require removal of the transgenesis marker before tagged lines become usable. Our system is advantageous in that C-terminally tagged lines can usually be used immediately. However, we will refer to the paper by Aguilar et al. and discuss how a similar system could be incorporated into ExTaSy.
      • Regarding structure-function predictions, these could be incorporated into the bioinformatic pipeline. It would then be possible to modify ExTaSy to introduce tags internally together with a SEED/Harvest-like modification. We will include this in the discussion.

        Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      Hubbert et al. describes ExTaSy (Exchangeable Tagging System), a method for endogenous protein tagging in fruitflies. The technique attempts to address some limitations of current tagging strategies, such as non-physiological expression from transgenes, disruption of the target gene, and limited usefulness of a single tag type. The basic approach is not novel, rather it effectively incorporates ideas from several previously published methods:

      • Crispr-based release of the HDR donor from the backbone in vivo (Kanca et al., 2019 and 2021).
      • PBac scarless tagging (flycrisprdesign)
      • In vivo RMCE to swap out tags (Nagarkar-Jaiswal et al., 2015) Although not novel, the authors show the completeness and effectiveness of the approach. They were able to tag genes across multiple chromosomes, with knock-in rates comparable to other approaches, and demonstrate tag swapping through RMCE. Overall, this work introduces a versatile and modular platform that combines several previous innovations into a single effective package.

      Major comments

      1.The manuscript would benefit from a more upfront discussion of how ExTaSy relates to existing methods. As currently written, the implies a higher degree of novelty than is warranted, since ExTaSy combine several previously established approaches, including, as already noted. While this is valuable, the authors should more clearly acknowledge in the abstract and introduction that the primary advance is the unification and streamlining of these existing technologies into a single platform, rather than the introduction of fundamentally new components.

      __Response: __While we did cite most of the publications mentioned by the reviewer, we will make clearer that our system combines several previously established Drosophila systems and is not per se a novel invention. We will expand the introduction and discussion to reflect this and cite additional publications.

      • *

      2.Comparison to prior systems. The manuscript should include a direct comparison to existing tagging pipelines. For example: What practical steps are eliminated relative to prior approaches? Does ExTaSy reduce the number of injections or constructs required? How does the workflow differ in terms of time, cost, or technical expertise? This is vaguely addressed in the discussion, but more specific and clear comparisons would improve things for the reader who is trying to decide which method to use. For example, how does this strategy directly compare with the protein trap alleles described in Kanca et al., 2022? This could be done as a supplemental table.

      __Response: __A similar concern has been raised by reviewer #1 (comment #4). We will expand the introduction and the discussion to compare ExTaSy in more detail with other methods, highlighting advantages and disadvantages of each.

      3.Only 4 successful RMCE swaps are presented. This is too few to make a confident conclusion about the efficiency. The authors should do at least 4 more and include negative data.

      __Response: __A similar point has been made by reviewer #1 (comment #3). We are in the process of expanding the number of fly stocks for which tags have been exchanged and will be able to provide a more rigorous characterization with an updated version of the manuscript.

      4.Some discussion of the potential limitations of the linker from the residual att sites is needed.

      __Response: __We will include this in the discussion.

      Minor comments

      1.It would be helpful to include a workflow overview figure summarizing the full pipeline.

      __Response: __We will include such a figure in the supplement.

      2.Line 124: Most genes we tagged at the C-terminus were homozygous viable, indicating limited detrimental effects. Need to include the numbers? What is "most genes."

      __Response: __We will include these numbers in the text.

      3.Briefly explain how the tested genes were selected (e.g., random, representative, biased toward certain classes), as this could affect interpretation of generalizability. If most of the genes are essential for viability, this makes the viability of tagged lines more impressive.

      __Response: __This is an excellent suggestion, and we thank the reviewer for pointing this out. We have mainly tagged genes that are relevant for work in our labs and for collaborators, focusing almost entirely on transcription factor-encoding genes that are largely essential for normal development. We will include a brief discussion of this.

      Reviewer #2 (Significance (Required)):

      Significance

      1.General assessment: This study presents ExTaSy, a practical and well-executed platform for endogenous protein tagging in Drosophila. Its main strength is the integration of multiple existing technologies into a streamlined workflow that enables tagging, marker removal, and tag swapping. The system is clearly functional and broadly applicable. However, the conceptual novelty is limited, and the manuscript should more explicitly frame the work as an engineering advance. Tagging and RMCE efficiencies are moderate.

      2.Advance: ExTaSy represents a technical advance that combines CRISPR HDR tagging, piggyBac scarless editing, and RMCE into a single platform. The biggest improvement is the ability to tag once and flexibly swap tags via crosses, reducing the need for repeated genome engineering. This extends existing methods by improving experimental flexibility.

      3.Audience: This work will primarily interest a specialized audience in Drosophila genetics, CRISPR technologies, and functional genomics, with broader relevance to researchers developing tagging systems in other model organisms.

      4.Field of expertise: CRISPR screening, Drosophila genetics, functional genomics. No limitations on my ability to evaluate.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This methods paper is targeting the long-standing ambition of how to most efficiently tag proteins at the endogenous gene locus in Drosophila. Since the invention of CRISPR-Cas9 many genes have been successfully modified in Drosophila, but the community is still lacking a large collection of tagged proteins under endogenous control made with the same method.

      This manuscript is using a small tag, 3xHA, which supposedly is easier to integrate, and the design allows to then swap the tag with larger fluorescent tags, solely by fly crossing. Then, the dsRed or white markers, allowing identification, can be removed with a biggybac recombinase leaving only a small scar. However, attP/B/R scars do remain. Design and cloning appear straightforward. Overall, this is an interesting strategy.

      However, the manuscript falls short in really describing the resource, apart from the cloning design. A more rigorous analysis of a number of lines should be presented to better judge if the strategy practically works. It is quite disappointing to see that only 2 or 3 genes/proteins were analysed here in a bit more detail. This does not sound like a very straightforward resource that aims to go large scale.

      Major comments:

      1. The important novelty here is not only the design that allows high-throughput cloning but more importantly that the tagged lines are actually correct and functional. To present this better, I suggest to rearrange Figure 1 to show the flow: 65 constructs cloned, 41 "successfully" inserted. Of how many the dsRed marker was removed, of how many expression or function was tested? Hence the reader knows about the current state of the resource. These numbers would be informative to have in the abstract, too.

      __Response: __We will include these numbers in the abstract. Reviewer 2 asked for an overview figure of the workflow, which we will include as a supplementary figure, where we can also include numbers as suggested by this reviewer.

      The 41 tagged gene insertions need at least some basic characterisation to verify that they are at the correct place or make a functional protein. Which genes were chosen? I do not see 41 genes tagged in the table provided. I supposed the N-terminal tags should initially be loss of function. Are the N-term lines lethal when inserted in an essential gene? Again, this could be shown in an overview, instead by a non-quantitative statement in the text.

      __Response: __We have verified the insertion site of the lines with genotyping PCR. We will include a table to show in more detail which genes were tagged at which terminus, and which protein isoforms are captured by the respective tag.

      • *

      How many of the 41 tagged proteins are functional? The authors only provide information on Ubx-3xHA (functional) and Mef2-3xHA (non-functional), which I find weak.

      __Response: __We will include this information in the table mentioned in the above comment.

      Stainings are only shown for 2 proteins, Ubx-GFP and Exd-3xHA. How about the others?

      __Response: __We are currently in the process of using ExTaSy to establish a library of tagged fly lines, which we intend to characterize in more detail and publish separately. For the current manuscript, we prefer to focus on the methodology of the tagging system itself.

      I am not sure about how to calculate the transgenesis rates, but strictly speaking to ones that did not result in an insertion should also be counted for the statistics, I guess.

      __Response: __There is indeed no commonly agreed upon way to calculate these rates, and it is done differently in different publications. We felt that metrics that discriminate between the overall success rate (i.e., all those injections that lead to transgenics) and the success rate within successful injections would be most useful. We will try to make clear in the text where we refer to all attempts and where we exclusively refer to the successful ones.

      Minor comments:

      1. The introduction states that ExTaSy would tag all isoforms of genes. However, I find this an overstatement, as for complex genes tagging at the one place cannot always label all isoforms, see the Hth line generated here (Iso E).

      __Response: __This was indeed badly phrased and we will correct the wording also in response to reviewer #1 comment #14 to reflect that overexpression constructs are limited to a specific isoform, whereas ExTaSy enables simultaneous tagging of all isoforms that share a terminus.

      Why does it matter on which chromosome the target gene is? This can be moved to supplement. I would rather like to know what the genes are.

      __Response: __We presume that the reviewer refers to Figure 1, where we show the success rates for individual chromosomes. We felt that the lower success rate for injections targeting gene on chr3 (which is, as we describe, due to lower survival of the injection line) warranted this separation by chromosome. As stated above, we will include a list of tagged genes as a table.

      **Referees cross-commenting**

      I agree with the 2 other reviewer's points. In particular that the knock-in lines need better verifications. This was also my major point.

      __Response: __As also stated for reviewer #1 comment #1, we have now begun to run whole-fragment PCRs for all loci to investigate this further and will report the results in a revised version of the manuscript.

      Reviewer #3 (Significance (Required)):

      The methodology presented here is per se not really new. The 3xP3-dsRed eye marker is standard, its removal by biggbac transposase has been done before and RMCE to change the tagging cassettes with attP/B is done since many years. The latter has the disadvantage to not be seamless, as one attR site remains, which is translated, the other attR site remains in the 5'- or 3'-UTR, which can have an effect. U6-driven sgRNA expression is also standard.

      __Response: __We will make clearer that our system combines several previously established Drosophila systems and is not per se a novel invention. We will expand the introduction and discussion to reflect this and cite additional publications.

      The design includes the sgRNA and the HDR template cassette in a single vector, which is smart and makes cloning straight forward. Again, the paper would be stronger if the list of all cloned clones would be listed (are 65 all that were clones or all that were injected?

      __Response: __We will include this as a table.

      As the authors do not rigorously test the function of the tagged genes, it is hard to judge how valuable the pipeline is. This can be easily solved by providing more data that support the easy, high-throughput exchange tagging pipeline that produces tagged Drosophila lines that are useful to the community.

      __Response: __As stated above, we plan to publish a more detailed analysis of tagged lines as a separate resource paper. We will state in the manuscript which lines were homozygous viable before and after marker removal, which gives at least an indication of whether the tagged protein is functional.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This methods paper is targeting the long-standing ambition of how to most efficiently tag proteins at the endogenous gene locus in Drosophila. Since the invention of CRISPR-Cas9 many genes have been successfully modified in Drosophila, but the community is still lacking a large collection of tagged proteins under endogenous control made with the same method. This manuscript is using a small tag, 3xHA, which supposedly is easier to integrate, and the design allows to then swap the tag with larger fluorescent tags, solely by fly crossing. Then, the dsRed or white markers, allowing identification, can be removed with a biggybac recombinase leaving only a small scar. However, attP/B/R scars do remain. Design and cloning appear straightforward. Overall, this is an interesting strategy. However, the manuscript falls short in really describing the resource, apart from the cloning design. A more rigorous analysis of a number of lines should be presented to better judge if the strategy practically works. It is quite disappointing to see that only 2 or 3 genes/proteins were analysed here in a bit more detail. This does not sound like a very straightforward resource that aims to go large scale.

      Major comments:

      1. The important novelty here is not only the design that allows high-throughput cloning but more importantly that the tagged lines are actually correct and functional. To present this better, I suggest to rearrange Figure 1 to show the flow: 65 constructs cloned, 41 "successfully" inserted. Of how many the dsRed marker was removed, of how many expression or function was tested? Hence the reader knows about the current state of the resource. These numbers would be informative to have in the abstract, too.
      2. The 41 tagged gene insertions need at least some basic characterisation to verify that they are at the correct place or make a functional protein. Which genes were chosen? I do not see 41 genes tagged in the table provided. I supposed the N-terminal tags should initially be loss of function. Are the N-term lines lethal when inserted in an essential gene? Again, this could be shown in an overview, instead by a non-quantitative statement in the text.
      3. How many of the 41 tagged proteins are functional? The authors only provide information on Ubx-3xHA (functional) and Mef2-3xHA (non-functional), which I find weak.
      4. Stainings are only shown for 2 proteins, Ubx-GFP and Exd-3xHA. How about the others?
      5. I am not sure about how to calculate the transgenesis rates, but strictly speaking to ones that did not result in an insertion should also be counted for the statistics, I guess.

      Minor comments:

      1. The introduction states that ExTaSy would tag all isoforms of genes. However, I find this an overstatement, as for complex genes tagging at the one place cannot always label all isoforms, see the Hth line generated here (Iso E).
      2. Why does it matter on which chromosome the target gene is? This can be moved to supplement. I would rather like to know what the genes are.

      Referees cross-commenting

      I agree with the 2 other reviewer's points. In particular that the knock-in lines need better verifications. This was also my major point.

      Significance

      The methodology presented here is per se not really new. The 3xP3-dsRed eye marker is standard, its removal by biggbac transposase has been done before and RMCE to change the tagging cassettes with attP/B is done since many years. The latter has the disadvantage to not be seamless, as one attR site remains, which is translated, the other attR site remains in the 5'- or 3'-UTR, which can have an effect. U6-driven sgRNA expression is also standard. The design includes the sgRNA and the HDR template cassette in a single vector, which is smart and makes cloning straight forward. Again, the paper would be stronger if the list of all cloned clones would be listed (are 65 all that were clones or all that were injected?

      As the authors do not rigorously test the function of the tagged genes, it is hard to judge how valuable the pipeline is. This can be easily solved by providing more data that support the easy, high-throughput exchange tagging pipeline that produces tagged Drosophila lines that are useful to the community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Hubbert et al. describes ExTaSy (Exchangeable Tagging System), a method for endogenous protein tagging in fruitflies. The technique attempts to address some limitations of current tagging strategies, such as non-physiological expression from transgenes, disruption of the target gene, and limited usefulness of a single tag type. The basic approach is not novel, rather it effectively incorporates ideas from several previously published methods:

      • Crispr-based release of the HDR donor from the backbone in vivo (Kanca et al., 2019 and 2021).
      • PBac scarless tagging (flycrisprdesign)
      • In vivo RMCE to swap out tags (Nagarkar-Jaiswal et al., 2015) Although not novel, the authors show the completeness and effectiveness of the approach. They were able to tag genes across multiple chromosomes, with knock-in rates comparable to other approaches, and demonstrate tag swapping through RMCE. Overall, this work introduces a versatile and modular platform that combines several previous innovations into a single effective package.

      Major comments

      1.The manuscript would benefit from a more upfront discussion of how ExTaSy relates to existing methods. As currently written, the implies a higher degree of novelty than is warranted, since ExTaSy combine several previously established approaches, including, as already noted. While this is valuable, the authors should more clearly acknowledge in the abstract and introduction that the primary advance is the unification and streamlining of these existing technologies into a single platform, rather than the introduction of fundamentally new components. 2.Comparison to prior systems. The manuscript should include a direct comparison to existing tagging pipelines. For example: What practical steps are eliminated relative to prior approaches? Does ExTaSy reduce the number of injections or constructs required? How does the workflow differ in terms of time, cost, or technical expertise? This is vaguely addressed in the discussion, but more specific and clear comparisons would improve things for the reader who is trying to decide which method to use. For example, how does this strategy directly compare with the protein trap alleles described in Kanca et al., 2022? This could be done as a supplemental table. 3.Only 4 successful RMCE swaps are presented. This is too few to make a confident conclusion about the efficiency. The authors should do at least 4 more and include negative data. 4.Some discussion of the potential limitations of the linker from the residual att sites is needed.

      Minor comments

      1.It would be helpful to include a workflow overview figure summarizing the full pipeline. 2.Line 124: Most genes we tagged at the C-terminus were homozygous viable, indicating limited detrimental effects. Need to include the numbers? What is "most genes." 3.Briefly explain how the tested genes were selected (e.g., random, representative, biased toward certain classes), as this could affect interpretation of generalizability. If most of the genes are essential for viability, this makes the viability of tagged lines more impressive.

      Significance

      1.General assessment: This study presents ExTaSy, a practical and well-executed platform for endogenous protein tagging in Drosophila. Its main strength is the integration of multiple existing technologies into a streamlined workflow that enables tagging, marker removal, and tag swapping. The system is clearly functional and broadly applicable. However, the conceptual novelty is limited, and the manuscript should more explicitly frame the work as an engineering advance. Tagging and RMCE efficiencies are moderate. 2.Advance: ExTaSy represents a technical advance that combines CRISPR HDR tagging, piggyBac scarless editing, and RMCE into a single platform. The biggest improvement is the ability to tag once and flexibly swap tags via crosses, reducing the need for repeated genome engineering. This extends existing methods by improving experimental flexibility. 3.Audience: This work will primarily interest a specialized audience in Drosophila genetics, CRISPR technologies, and functional genomics, with broader relevance to researchers developing tagging systems in other model organisms. 4.Field of expertise: CRISPR screening, Drosophila genetics, functional genomics. No limitations on my ability to evaluate.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Hubbert and colleagues describe ExTaSy, a CRISPR-Cas9-based platform for the endogenous tagging of proteins in Drosophila melanogaster. The system combines several established molecular tools into a single-vector framework: homology-directed repair (HDR) for the insertion of a 3XHA tag at the endogenous locus, piggyBac transposase-mediated near-scarless removal of a transgenic selection marker, and φC31 integrase-mediated recombination-mediated cassette exchange (RMCE) for subsequent tag swapping. The authors demonstrate the system across a set of 65 genomic loci and provide a bioinformatic pipeline to automate guide RNA and homology arm design.

      Major Comments

      1. Validation of knock-in lines is inadequate and does not reflect current standards in the field.

      The authors state that correct insertions were confirmed using "two PCRs per inserted fragment done with primers binding to the 5' and 3' ends of the inserted DNA and corresponding gene-specific validation primers." This strategy is well known to produce false positives, as it cannot distinguish correctly targeted single-copy integrants from concatemeric insertions at the target locus (e.g. Skryabin et al., 2020). The current standard for validating CRISPR-mediated knock-ins requires PCR amplification using primers that anneal outside the homology arms and span the entire inserted cassette. These reactions must be performed under conditions that minimise the formation of PCR chimeras, specifically low cycle numbers and use of a high-processivity polymerase. The authors should either provide data from such experiments for their characterised lines, or clearly acknowledge this limitation and qualify their efficiency estimates accordingly (see related point 2 below). 2. Reported efficiency metrics do not adequately distinguish correctly targeted integrants from marker-positive flies.

      A related concern is that many of the efficiency parameters reported in the manuscript appear to be based solely on the detection of the marker cassette. The 63.1% overall success rate, for example, seemingly reflects the recovery of DsRed-positive flies rather than of sequence validated, single-copy, on-target integrants. These are fundamentally different quantities, with only the latter being of practical value for the users of the described technique. The authors should either provide data that properly accounts for correct integration, or more carefully define what each reported metric represents and explicitly acknowledge the limitations of using marker presence as a proxy for successful knock-in. 3. The characterisation of tag exchange requires expansion or more careful framing of its scope.

      The possibility of exchanging tags through fly crosses rather than repeated microinjections is, in the view of this reviewer, the most practically useful feature of ExTaSy and the aspect most likely to drive community adoption. It is therefore important that this feature is characterised with sufficient rigour to allow prospective users to assess its reliability. In the current manuscript, tag exchange has been demonstrated at only five loci using a single replacement tag (sfGFP). The dataset includes one outright failure (the Met C-terminus) and one instance of an unexpected 9 bp insertion at the recombination site, leaving the success rates and failure modes across a broader range of loci and tags uncharacterised. The authors should either expand the tag exchange experiments to cover a more representative set of conditions, or frame the current data explicitly as a proof of concept and limit their conclusions about the practical utility of tag exchange accordingly. In either case, the value of this work to the community would be substantially increased if a collection of donor lines carrying the most commonly used tags for different applications, as the authors themselves enumerate in the Discussion, were generated and deposited at a public stock centre such as the VDRC concurrent with publication. On this note, it is also worth flagging that at present the plasmids described in this study have not yet been deposited at Addgene or the European Plasmid Repository, and that fly lines are available only on request. For a methods paper aimed at community adoption, deposition of reagents in publicly accessible repositories at the time of publication is the expected standard. 4. The Introduction should better reflect the current state of the field, including explicit comparison with MiMIC and CRIMIC.

      The introduction would benefit from a clearer distinction between transgene-based approaches that introduce additional gene copies and true CRISPR-mediated knock-ins at the endogenous locus. As it stands, the discussion of prior methods does not sufficiently acknowledge that CRISPR-based knock-in is already the standard approach in Drosophila, and that the individual techniques employed in ExTaSy are well established. Notably, the MiMIC and CRIMIC systems (Nagarkar-Jaiswal et al., 2015; Li-Kroeger et al., 2018), which also support RMCE-based tag exchange at endogenous loci and for which large collections of lines are already publicly available, are not adequately discussed. These are arguably the closest comparators to ExTaSy, and the authors should explicitly address how their approach differs from and offers advantages over this existing framework, particularly given that MiMIC/CRIMIC insertions can also tag internal sites and thus avoid some of the terminus-specific complications described here.

      Minor Comment

      1. The labelling of sgRNA target sites in Figure 1 is inaccurate and should be corrected.

      In Figure 1, the sgRNA target sites are annotated with triangles labelled "PAM synth." The presence of a PAM is necessary but not sufficient to define a target site; the label should therefore be changed to "target site" or an equivalent term. Additionally, the Methods section incorrectly expands PAM as "primary adjacent motif"; the correct expansion is "protospacer adjacent motif." 6. Could the fly crossing scheme in Figure S3 be simplified?

      In the scheme in Fig. S3 the second step seems to be intended to introduce the hs-Flp and vase-Int transgenes. Would it not be possible to already incorporate the Integrase into the swap fly line when it is made and the hs-Flp into the ExTaSy line, thereby saving one generation? 7. Figure 1F has no call out in the main text. 8. Line 155: What was the reason for the low survival rate? Is this likely to be indicative of a problem during marker removal, or a stochastic event as not all fly crosses are always productive (bad food, early death of flies, etc.)? 9. Line 160: What is the N number of "all cases"? 10. Scale bars are missing in Fig. 3g,h. 11. Line 219: The labeling of the panels got mixed up. Panel F does not show an immunostaining. 12. Line 226 and Fig. 3h: It is unclear what area is shown in the inlay. The overview image highlights three POIs, but none seem to fit the inlay. 13. Line 233: Why was the transgenic marker not removed? The authors want to highlight the easy and advantage of marker removal, so leaving in the marker is an odd choice. 14. Line 250: Why was only one isoform of hth tagged? Without a rational this seems to be an odd choice, in particular since the authors seem to suggest in the introduction (Line 38) that a disadvantage of previous technologies is the tagging of only selected isoforms.


      Significance

      ExTaSy assembles a set of well-established tools, namely CRISPR-mediated HDR, piggyBac-based marker excision, and φC31-mediated RMCE, into a unified, single-vector framework for endogenous protein tagging in Drosophila. The individual components have all been described and are in routine use in the field; the conceptual advance is therefore limited. Nevertheless, the integration of these features into a streamlined platform with accompanying automated design software represents a practical contribution that is likely to be of genuine utility to the Drosophila community, particularly for laboratories without specialist transgenesis infrastructure.

      The possibility of tag exchange by fly crossing is the most distinctive feature of the system. However, as discussed above, this is currently demonstrated at only five loci with a single replacement tag, which limits the conclusions that can be drawn about its generality. More broadly, ExTaSy employs well-proven strategies throughout, which is a source of reliability but also means that the study does not incorporate more recent developments in the field. For example, approaches based on single-strand annealing, such as the recently described Seed/Harvest system (Aguilar et al., 2024), can achieve entirely scarless marker removal and thus circumvent the TTAA scar left by piggyBac excision, a limitation the authors themselves acknowledge may reduce expression at modified N-terminal loci. Similarly, the current system is restricted to N- and C-terminal tagging. Given that the goal of endogenous tagging is to minimally perturb protein function, and given the now widespread availability of high-quality protein structure predictions for the Drosophila proteome, a modern tagging platform might be expected to use structural modelling to identify optimal insertion sites irrespective of their location. These are not oversights that diminish the practical value of the current work, but highlight that this study does not always operate at the cutting edge of method development in this area. A brief discussion of these more recent developments in the context of ExTaSy's design choices would usefully situate the work within the broader landscape and help readers understand both what the system offers today and where improvements are likely to come from.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This paper describes the localisation of DNA repair proteins, which carry out their DNA repair function in the nucleus, to the cytoplasmic Golgi apparatus. Using the Human Protein Atlas to identify candidates, the authors use antibody localisation to show that a significant number of DNA repair proteins also localise at the Golgi. It appears that proteins involved in common DNA repair pathways localise to common regions of the Golgi. The Golgi-nucleus distribution of the DNA repairs proteins changes upon DNA damage, indicating a dynamic relationship. The authors focus on the DNA repair protein RAD51C and show that its loss from the Golgi and translocation to the nucleus upon DNA damage is mediated by the ATM kinase. Anchoring at the Golgi is shown to be mediated by the golgin giantin. A functional role for giantin in DNA repair is shown in knockdown studies, supporting a mechanism whereby Golgi anchoring of RAD51C, and possibly other DNA repair proteins, by giantin, is required to maintain proper control of DNA repair. The data are clear and support the authors' conclusions. The data are carefully quantified throughout. I found the text easy to read.

      • Major points:*

      • 1.) To validate the Golgi localisation, KD using siRNA was used. It was deemed that a signal reduction of 25% was enough to indicate specific antibody labelling. This seems like a low number, and not very stringent. For some of the hits, expressing tagged versions of the proteins would greatly strengthen the Golgi assignment. This may not be possible for all, but for RAD51C would seem an important experiment. *

      Response: We thank the reviewer for raising the important issue of antibody validation stringency. We agree that for a single-candidate study, a larger reduction after knockdown would generally be preferable. In our case, the 25% cutoff was used only in the primary high-content screening step as part of an intentionally inclusive two-stage workflow, for the following reasons:

      First, because this dataset is generated in a screening format across hundreds of targets, knockdown-efficiency, protein turnover, and the relative size of the Golgi associated pool are unknown and highly variable between genes. For many proteins the Golgi pool represents a small fraction of total cellular signal, and a modest change in total abundance can translate into a smaller absolute change in the Golgi ROI after segmentation, background subtraction, and imaging noise. We therefore selected a permissive cutoff to reduce false negatives and ensure we did not systematically miss candidates with slower turnover, partial knockdown, or small Golgi pools. This strategy is consistent with large scale subcellular mapping efforts, including the Human Protein Atlas, where genetic depletion by siRNA is used as a key validation pillar for immunofluorescence localization and is combined with additional validation strategies when deeper confidence is required (Stadler et al, 2012). Furthermore, it is important to note that this validation was performed in a high-content screening format in which fixation, permeabilisation, antibody concentration, and blocking conditions were kept uniform across all candidates rather than optimised for each individual antibody. In standard single-target immunofluorescence experiments, these parameters would be titrated to maximise signal-to-noise for the specific antibody and antigen in question. Under non-optimised screening conditions, the absolute magnitude of signal change upon knockdown is inherently attenuated compared to what would be expected from a purpose-optimised assay. We therefore consider a 25% reduction threshold under these uniform, non-optimised screening conditions to be a meaningful and appropriately calibrated criterion.

      Second, we wish to clarify that the primary intent of our screen was not to validate the Golgi-nuclear localisation of any single protein in isolation, but rather to identify whether entire functional pathways are represented at the two organelles. This is precisely why the bioinformatic network analysis was performed as an integral part of the workflow, and not as an afterthought. The finding that the validated hit list is significantly enriched for coherent functional clusters, most notably a network spanning multiple core DNA repair pathways (HR, MMR, BER, MMEJ) serves as an in silico validation of the dataset as a whole. The emergence of pathway-level organisation, with proteins from the same repair pathways co-associating, localising to the same Golgi sub-compartments, and redistributing in the same direction upon genotoxic stimuli, provides biological coherence that goes beyond what individual antibody validation can offer, and substantially reduces the likelihood that the Golgi signal represents a collection of unrelated false positives.

      Third, our mechanistic conclusions do not rely on the 25% screening threshold. For RAD51C, we used multiple orthogonal validation approaches, including independent antibodies recognizing distinct RAD51C epitopes and genetic depletion, supported by biochemical evidence.

      In response to this comment, we have provided the full screening validation dataset as source data (Supplementary____Table S1), including intensity changes for the candidates, so that readers can inspect the distributions and apply their own thresholds. We have also clarified in the Results section the rationale behind our screening strategy (lines 128-139) and the role of the bioinformatic network analysis as an integral validation step (lines 141-156).

      Turning to the specific suggestion of tagged RAD51C, we fully agree that tagged proteins can provide valuable orthogonal validation. We attempted endogenous tagging using CRISPR-mediated homologous recombination but were unable to obtain viable colonies following editing, consistent with the essential role of RAD51C in homologous recombination. We also attempted ectopic expression of tagged RAD51C but were unable to obtain constructs that preserved physiological expression levels, maintained robust cell viability or produced interpretable localization. This difficulty is not unique to our laboratory: colleagues working on RAD51 paralog complexes have reported that tagging or overexpression of RAD51C perturbs both its localisation and its ability to form functional paralog complexes (Greenhough et al, 2023; Rawal et al, 2023; Somyajit et al, 2015; Berti et al, 2020) all use purified complexes or untagged proteins for functional assays. We discussed these challenges extensively with experts in the DNA damage repair field at several international meetings (EMBO Sounio, Keystone Symposia, German DNA Repair Society). For these reasons, we relied on orthogonal approaches that do not require tagging (genetic depletion plus independent antibodies, and biochemical fractionation) to support the Golgi localization claim. We agree with the reviewer that this represents a limitation of this study, and we addressed these concerns in the discussion of our revised manuscript (lines 630-641).

      *2.) The total signal should be quantified for each DNA repair protein upon genotoxic stress, in addition to the Golgi to nucleus ratio. For many of the proteins it looks like the total signal goes down, which could influence interpretation. *

      Response: __We thank the reviewer for this important point. We wish to clarify that our imaging pipeline uses marker-based segmentation throughout, the Golgi compartment is segmented using GM130 and the nucleus using Hoechst, as unsegmented whole-cell masks without organelle markers yield unreliable intensity measurements in this experimental setup. True total cellular signal is therefore not directly accessible in this dataset. In the revised manuscript we provide the absolute fluorescence intensities for both the Golgi and nuclear compartments separately. In addition, we now include total (Golgi + nuclear) intensity measurements for each protein (__Supplementary Figures 3D, 4D, __and 5E__) as the most reliable proxy for overall protein distribution. These data are presented alongside the redistribution ratio to enable comprehensive interpretation.

      As the reviewer correctly notes, a subset of proteins shows a reduction in total signal after treatment, particularly with doxorubicin. This is consistent with known effects of doxorubicin-induced DNA damage on cellular proteostasis, including widespread ubiquitination and suppression of protein translation (Halim et al, 2018). Several DDR regulators are subject to ubiquitin-dependent turnover following genotoxic stress, such as CHK1 (Zhang et al, 2005). More broadly, ubiquitin and proteasome mediated regulation is an integral component of the DNA damage response and can affect the abundance and detectability of DDR factors (Brinkmann et al, 2015). Changes in abundance are therefore an expected biological feature of the response. For this reason, we used the Golgi-to-nucleus ratio as the primary redistribution readout, as it captures relative compartmental partitioning independently of changes in total protein levels.

      *3.) The study would benefit from live imaging of the Golgi to nucleus translocation of RAD51C. This would give a better indication of dynamics. *

      __Response: __We agree that live imaging would directly visualize the dynamics of RAD51C redistribution between the Golgi and the nucleus. This was indeed one of our initial goals following the identification of the Golgi-associated RAD51C pool. However, as described above in our response to Major Comment 1, live imaging requires a fluorescently tagged RAD51C construct, and all tagging strategies we attempted, both endogenous CRISPR-mediated tagging and ectopic expression, failed to yield cell lines with robust signal while preserving physiological behaviour. This appears to be a broader challenge for highly conserved and functionally constrained DNA repair proteins, and is not unique to our laboratory.

      Given these constraints, we focused on tag-independent approaches: multiple independent RAD51C antibodies combined with genetic depletion controls, quantitative fixed-cell time courses, and biochemical fractionation. These orthogonal datasets together support compartment-specific changes over time in a manner consistent with redistribution. We have clarified this limitation explicitly in the manuscript and avoided any wording that could be interpreted as implying direct single-molecule tracking in live cells. We present this as an important avenue for future work, contingent on the development of viable RAD51C-expressing cell lines (lines 630-641).

      *4.) The double depletion experiments suggest a functional relationship between giantin and RAD51C. But they do not formally show it. Experiments to more directly address the functional role of the interaction between these two proteins would strengthen the study. *

      Response: We agree with the reviewer that double depletion alone cannot formally prove that the physical Giantin-RAD51C interaction is the sole determinant of the observed DDR phenotypes. However, we would like to highlight the breadth of evidence we have assembled in support of this functional relationship:

      • Physical interaction between endogenous Giantin and RAD51C demonstrated by colocalisation (Figure 4F-G) and co-immunoprecipitation (Figure 4H-I).
      • Damage-induced dissociation of the Giantin-RAD51C complex that is prevented by ATM inhibition or Importazole treatment, directly linking the interaction to the DDR signalling axis (Figure 3K-P)
      • Premature nuclear accumulation of RAD51C upon Giantin depletion, producing aberrant nuclear foci lacking canonical HR markers and impaired ATM signalling (Figure 4B-E & J-M)
      • DR-GFP reporter assay confirming that Giantin depletion reduces HR efficiency to approximately 60% of control, consistent with the reduction previously reported in the genome-wide HR screen (Adamson et al. 2012) and validating the functional significance of Giantin in HR (Figure 5L).
      • Partial rescue of ATM phosphorylation, genomic instability and proliferation phenotypes by RAD51C co-depletion, arguing for RAD51C as a functionally relevant conduit of the Giantin-dependent phenotype (Figures 5M-5P). These observations are further supported by the established literature on RAD51C function, its roles in CHK2 phosphorylation, replication fork stabilisation, and RAD51 filament formation (Badie et al, 2009; Somyajit et al, 2015; Prakash et al, 2022) providing a mechanistically coherent framework in which mislocalisation of RAD51C, whether directly or indirectly through Giantin, leads to dysregulation of DDR signalling and repair capacity, as we directly demonstrate with the HR efficiency assay.

      Nonetheless, we fully agree that the most direct proof of the functional relevance of the physical Giantin-RAD51C interaction would come from separation-of-function experiments, ideally using an interaction-deficient Giantin mutant or an RAD51C variant unable to bind Giantin. We wish to be transparent that both approaches face substantial technical barriers in this system. RAD51C tagging consistently compromised cell viability and protein function, precluding the generation of interaction-deficient variants at physiological expression levels. Engineering an interaction-deficient Giantin mutant presents an independent challenge: Giantin is one of the largest Golgi matrix proteins (~376 kDa), composed almost entirely of extended coiled-coil domains that are resistant to structural prediction, and identifying a discrete RAD51C interaction interface without disrupting broader scaffolding function would require a dedicated structural and biochemical programme. We have framed these explicitly as the most important future priorities in the Discussion (lines 555-564), rather than over-interpreting the current data.

      *5.) The Kaplan-Meier plots in Fig S9 seems to be quite selective in that only breast cancer is shown. Does giantin reduction correlate with poor prognosis in other cancers? *

      __Response: __We thank the reviewer for this suggestion. We initially focused on breast cancer because RAD51C is a clinically established hereditary breast and ovarian cancer susceptibility gene (Meindl et al, 2010; Ghannoum et al, 2023), providing direct clinical context for a study centred on RAD51C dynamics and genome stability. We agree however that restricting the survival analysis to a single cancer type can appear selective.

      To address this directly, we expanded the in-silico survival analysis of Giantin (GOLGB1) using GEPIA2 (Tang et al, 2019) across all available TCGA cohorts (overall survival, median cutoff, FDR correction). In the pooled pan-cancer analysis, higher GOLGB1 expression is significantly associated with improved overall survival (HR(high) = 0.75, p = 6.6 × 10⁻¹⁵). When stratified by tumour type, the majority of individual associations do not reach statistical significance. The two most robust statistically significant associations are kidney renal clear cell carcinoma (KIRC; HR(high) = 0.57, p = 3.4 × 10⁻⁴), where high GOLGB1 expression is associated with improved survival, and lower-grade glioma (LGG; HR(high) = 1.5, p = 0.036), where the association is in the opposite direction. A significant association is also observed in thymoma (THYM; HR(high) = 7.3, p = 0.031), though this should be interpreted with caution given the small cohort size (n = 59). Notably, the breast cancer association observed in the KM Plotter analysis (HR = 0.71, p = 1.8 × 10⁻¹¹; n = 4,929) does not reach significance in the TCGA BRCA cohort (HR = 1.1, p = 0.68; n = 1,070), most likely reflecting the substantially smaller sample size of the TCGA cohort, which is approximately 4.6-fold smaller and therefore underpowered to detect a modest effect. These context-dependent associations are consistent with the tumour-type-specific roles of Golgi scaffolding proteins and are discussed accordingly in the revised manuscript.

      In the revised manuscript we have retained the original breast cancer Kaplan-Meier plots and supplemented them with a pan-cancer survival map across all TCGA cohorts (lines 611-625; Figure S9G) and a summary table (Supplementary Table 3) reporting hazard ratios, sample sizes, and p-values for each tumour type, allowing readers to assess the clinical relevance of GOLGB1 expression.

      *Minor points: There are a few grammatical errors here and there. The figures do not appear in the correct order in the text, which makes the early parts of the paper a bit difficult to follow. Some of the figures don't seem to clearly match the text. For example, it is mentioned that RAD51C labelling was done with 3 different antibodies. I could not find this data. *

      Response: __We thank the reviewer for these helpful observations. In the revised manuscript we have (i) carefully proofread the text and corrected grammatical errors throughout; (ii) revised the Results section to ensure that figures and supplementary figures are cited in sequential order and that each panel is explicitly introduced before being discussed, improving readability in the early sections. and (iii) corrected figure callouts to ensure they match the text. In particular, the statement that RAD51C labeling was performed with three different antibodies has been linked to the corresponding figure panels in the Results section. Antibody identifiers, sources, and dilutions are clearly reported in the Methods and in the table in __Supplementary Table S1.

      __ Reviewer #1 (Significance (Required)):__

      *This paper is novel and should be of significant interest to the field. It has important implications for how we think about the Golgi apparatus, and for how DNA repair pathways may be controlled. The pattern is clearly complex, with many DNA repair proteins localising to the Golgi, and some showing opposite dynamics. However, by focussing on RAD51C and giantin, the paper nicely demonstrates a novel mechanism for controlling DNA repair by these proteins. *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Background - Eukaryotic cells rely on tightly regulated DNA repair pathways to preserve genome stability under the constant threat of both endogenous and exogenous genotoxic stress. While the nucleus, and to a lesser extent the mitochondria, is the primary site where DNA damage is detected and repaired, accumulating evidence indicates that extranuclear organelles, particularly the Golgi apparatus, play a surprisingly important role in modulating stress signaling, proteostasis, and the trafficking/activation of key DNA repair factors.

      • Emerging evidence has shown that genotoxic stress can result in a major remodeling of the Golgi apparatus; however, the crosstalk between the Golgi and the nucleus, and its contribution to the DNA damage response, remains poorly defined. The present study offers timely insight by examining the spatiotemporal behavior of DNA repair proteins that shuttle between the Golgi and the nucleus, and how this trafficking contributes to the maintenance of genomic stability.*

      Main findings - The authors employed the Human Protein Atlas (HPA) project to shortlist proteins that might link Golgi-nuclear function and validated each candidate using an siRNA-mediated antibody-validation pipeline, thereby identifying 163 proteins that localize to both the Golgi and the nucleus. Bioinformatic analysis of these candidates revealed a significant enrichment for DNA damage response (DDR) regulators, including multiple factors from core DNA repair pathways, suggesting that a portion of the DDR machinery may reside in the Golgi at steady state. Interestingly, the authors observed that dual-localizing DDR proteins undergo lesion-specific redistribution between the Golgi and the nucleus in response to specific types of DNA injuries. For instance, BER and MMEJ proteins shifted from nucleus to Golgi in response to doxorubicin, whereas MMR and HR proteins redistributed from Golgi to nucleus. This trend was reversed with H2O2 or KBrO3 treatments.

      • To gain further insight into the link between the DDR and Golgi-nuclear communication, the authors focused on the HR factor RAD51C, which also plays a key role during the replicative stress response. The authors noticed that RAD51 is significantly associated with the Golgi, in addition to its known nuclear pool. Interestingly, they demonstrated that doxorubicin triggers the ATM-dependent release of this Golgi-tethered RAD51C pool and its Importin-β-mediated import into the nucleus, where it forms repair-associated foci. They further identified Giantin as the Golgi scaffold that anchors RAD51C at steady state in this subcellular compartment and showed that its depletion leads to premature nuclear accumulation of RAD51C, formation of aberrant RAD51C foci lacking canonical HR markers, reduced ATM activation, elevated genomic instability, and increased cell proliferation. *

      Together, this study revealed an underappreciated and functionally meaningful spatiotemporal level of regulation within the DDR, suggesting that the Golgi, rather than functioning solely as a trafficking organelle, acts as a platform that anchors, releases, and temporally controls the availability of key DNA repair factors in response to genotoxic stress. In particular, the authors demonstrated that the timely and regulated release of RAD51C from the Golgi is essential for maintaining genome stability and is dependent on canonical DDR signaling pathways, including ATM activation and Importin-β-mediated nuclear import.

      • Overall Critique - This manuscript offers a novel and compelling perspective on the regulation of the DDR by positioning the Golgi as an active participant in the spatiotemporal control of DNA repair factors. By integrating multiple experimental layers, including a systematic localization screening, a sub-Golgi mapping, several dynamic redistribution assays, and functional perturbation read-outs, the authors built a strong and coherent case for a biologically meaningful Golgi-nucleus communication axis during the DDR. Therefore, the study is timely and highly relevant for the DNA repair field, with broader implications for our understanding of how subcellular organelles coordinate genome maintenance and cellular homeostasis.

      While the manuscript is clearly written and the figures are coherent and supportive of the main findings of the study, several issues should be addressed to ensure full interpretability and reproducibility.

      Major Comments*

      *1. Limited use of agents causing genotoxic stress - The authors report intriguing lesion-specific shifts in Golgi-nuclear redistribution, yet much of the mechanistic work relies heavily on doxorubicin, a pleiotropic drug that induces diverse forms of DNA damage beyond DSBs. Expanding the core analysis of the study to include a broader panel of mechanistically defined genotoxins (e.g., etoposide, camptothecin, neocarzinostatin, or ionizing radiation) would substantially strengthen the conclusion that the trafficking patterns reflect damage-type specificity rather than drug-specific off-target effects. Such broader analysis would also clarify whether Golgi-nucleus communication responds differentially to replication-associated breaks, Topo II-dependent lesions, oxidative stress, or crosslinks. *

      __Response: __We thank the reviewer for this important point. We would first note that while doxorubicin is indeed pleiotropic, its primary and best-established mechanism of action is the poisoning of Topoisomerase II, leading to DNA double-strand breaks, a mechanism it shares with etoposide (van der Zanden et al, 2021; Thorn et al, 2011). The additional effects of doxorubicin, including reactive oxygen species generation and chromatin remodelling, are well-documented but secondary to this DSB-inducing activity, as we note in the revised manuscript. Nonetheless the goal of this study was not to comprehensively map lesion-specific trafficking for every DDR protein, but rather to establish the existence of a dynamic Golgi-nucleus redistribution axis and then focus mechanistically on the validated targets, in this case RAD51C. The lesion-dependent redistribution patterns are therefore presented as an initial, hypothesis-generating observation emerging from our screening and characterisation framework. A systematic, lesion-by-lesion dissection of redistribution kinetics across the broader DDR network would represent a substantial additional study and is beyond the scope of the present work.

      Importantly, our key mechanistic observations for RAD51C are not restricted to doxorubicin. We tested a panel of genotoxic agents covering mechanistically distinct lesion classes: camptothecin (CPT; Topoisomerase I-associated replication breaks), etoposide (ETO; Topoisomerase II-dependent DSBs), and mitomycin C (MMC; interstrand crosslinks) (Figures S8A-S8I). Across all DSB-inducing agents, RAD51C consistently redistributed from the Golgi to the nucleus, demonstrating that this response is not a doxorubicin-specific off-target effect. Notably, RAD51C did not redistribute in response to oxidative lesions induced by hydrogen peroxide or potassium bromate, consistent with its established role in homologous recombination and DSB repair rather than oxidative damage pathways, as discussed in the manuscript. This lesion-type selectivity provides additional evidence that the Golgi-nuclear redistribution we observe is a biologically specific response rather than a non-selective stress effect.

      *2. Functional implications of RAD51C redistribution for HR efficiency - Although the study convincingly demonstrates a release of RAD51C from the Golgi and its subsequent nuclear foci formation, it remains unclear how this redistribution influences HR efficiency. Incorporating a functional HR assay (e.g., DR-GFP reporter, RAD51 filament assembly, or fork protection assays) would help determine whether Golgi-anchored RAD51C release is directly required for HR or instead primarily modulates upstream DDR signaling. *

      Response: __We thank the reviewer for this important suggestion. We have performed DR-GFP reporter assays to directly assess HR efficiency following Giantin and RAD51C depletion. Depletion of Giantin reduced HR efficiency to approximately 60% of control levels, and RAD51C depletion to approximately 40%, consistent with the HR reduction previously reported in the genome-wide HR screen (Adamson et al, 2012). Co-depletion of Giantin and RAD51C reduced HR to levels comparable to RAD51C depletion alone, suggesting that the effect of Giantin on HR is mediated primarily through RAD51C, consistent with RAD51C being the key effector of the Giantin-dependent spatial regulatory mechanism we describe. These data are included in the revised manuscript (__lines 455-465; Figure 5L).

      *In addition, the manuscript does not fully reconcile how Golgi-tethering of RAD51C fits with its well-established nuclear roles during replication stress, where timely availability of RAD51C is essential for fork stabilization and restart. *

      Response: __We agree that the nuclear function of RAD51C during replication stress is well established and important to reconcile with our findings. Our imaging data consistently show a detectable nuclear RAD51C population at steady state across all cell lines examined, and we do not propose that RAD51C is exclusively Golgi-localised. We suggest that the two pools serve distinct functional purposes: the constitutive nuclear pool supports ongoing replication fork stabilisation and restart, processes that require RAD51C availability independently of acute DNA damage, while the Golgi-tethered fraction represents a damage-responsive reserve that is released acutely upon DSB induction in an ATM-dependent manner. We wish to be transparent that this two-pool model is speculative at present, formally distinguishing the contributions of each pool would require direct labelling of the Golgi-anchored fraction, which was not technically feasible in this system as discussed above. Nonetheless, this model is consistent with established principles of signal-responsive protein sequestration in cell biology, and is directly supported by our Giantin depletion data: premature release of the Golgi pool leads to aberrant nuclear RAD51C foci lacking canonical HR markers and impaired ATM signalling, demonstrating that unscheduled nuclear accumulation is actively detrimental rather than simply redundant. We have added a paragraph to the revised Discussion explicitly framing the two-pool distinction as a working model and identifying direct pool-identity tracking as an important future direction (__lines 566-587).

      *3. Specificity of Giantin-related phenotypes - The phenotypes observed upon Giantin depletion (e.g., increased micronuclei, comet tail moments, impaired ATM signaling, and elevated proliferation) could partially reflect a global dysfunction of the Golgi rather than RAD51C-specific tethering defects. Although co-depletion of RAD51C provides partial rescue, additional controls examining Golgi integrity, trafficking competence, or rescue with siRNA-resistant Giantin would help confirm specificity and distinguish direct from indirect effects. *

      __Response: __We thank the reviewer for raising this important concern, which was a central consideration throughout our investigation. We address it through three complementary lines of evidence.

      First, regarding Golgi structural integrity and trafficking competence: as previously reported, Giantin depletion has not been associated with strong Golgi fragmentation or major morphological alterations (Koreishi et al, 2013; Bergen et al, 2017; Stevenson et al, 2021), and we observed no significant Golgi fragmentation upon Giantin knockdown in our system. Consistent with the literature, Giantin has been implicated in specific cargo trafficking, most notably collagen secretion, rather than general secretory pathway function (Stevenson et al, 2021). To directly confirm that general Golgi trafficking competence was preserved in our experimental system, we performed the VSV-G-YFP trafficking assay (Presley et al, 1997), a well-established functional readout of general secretory trafficking. Giantin depletion did not result in a significant change in trafficking efficiency compared to control siRNA (Rebuttal Figure 1), consistent with the literature and arguing against a general collapse of Golgi function as the basis for the phenotypes observed.

      Rebuttal ____Figure 1. VSV-G-YFP trafficking assay.

      (A) Representative images of cells treated with control siRNA or giantin siRNA. Nuclei are stained with Hoechst. Total VSV-G-YFP (YFP-tsO45G) signal is shown together with antibody staining against VSV-G in non-permeabilized cells to assess cell surface levels. Scale bars, 10 μm.

      (B) Quantification of VSV-G trafficking from two independent biological replicates.

      Second, the phenotypes are RAD51C-dependent and not a generic Golgi dysfunction: the genomic instability and DDR signalling defects we observe upon Giantin depletion are not phenocopied by GMAP210 depletion, another Golgin family member, indicating that the phenotypes are not a generic consequence of Golgin loss. Critically, we now directly demonstrate using the DR-GFP reporter assay that Giantin depletion reduces HR efficiency to approximately 60% of control, and that co-depletion of RAD51C produces no further reduction beyond RAD51C depletion alone, consistent with RAD51C epistasis over Giantin for HR capacity (Figure 5L). This functional epistasis, together with the physical interaction between Giantin and RAD51C by co-immunoprecipitation, their co-localisation within the same Golgi sub-compartment, and the partial rescue of ATM phosphorylation, micronuclei formation and proliferation phenotypes upon RAD51C co-depletion, provides a coherent mechanistic chain linking Giantin specifically to RAD51C-dependent DDR outcomes. While we cannot formally exclude indirect contributions from other Giantin-associated factors, none of our observations are consistent with the phenotype arising from non-specific Golgi perturbation.

      Third, Giantin may play a broader role in connecting DDR signalling to cytoplasmic and Golgi-resident processes, beyond RAD51C tethering alone: we consider this a feature of the biology rather than a confound. Golgins are well established as multi-cargo scaffolding platforms, and Giantin in particular occupies a strategic position where several processes converge: the tethering of DDR factors, the regulation of damage-induced signalling cascades, and the directional trafficking of repair factors between compartments. This would explain why Giantin depletion produces a phenotype that extends beyond what RAD51C co-depletion alone can fully rescue, and is consistent with the pathway-level coherence we observe across our screen. Understanding the full complement of Giantin-associated DDR interactions represents one of the most compelling directions emerging from this work.

      In response to this comment, we have expanded the Discussion (lines 545-565) to explicitly propose that Giantin functions as a broader organisational node coordinating multiple DDR factors, while our data specifically and consistently implicate RAD51C as a primary conduit.

      *4. Positioning of ATM in the Golgi-nuclear signaling - While ATM inhibition prevents RAD51C release, its spatial and mechanistic basis of this regulation remains obscure. It is not clear whether ATM acts locally at the Golgi, through cytoplasmic pools, or indirectly via nuclear feedback signaling. Clarifying or discussing this point in more depth would improve the mechanistic coherence of the proposed model. *

      __Response: __We thank the reviewer for raising this important mechanistic question. The spatial basis of ATM action at the Golgi is indeed an emerging and exciting area of cell biology. A growing body of evidence demonstrates that ATM associates with the Golgi membrane through binding to phosphatidylinositol-4-phosphate (PI4P), and that this Golgi-resident pool modulates the magnitude and kinetics of the nuclear DDR (Ovejero et al, 2023). Importantly, the most recent work in this area demonstrates that Golgi-associated ATM is not merely a passive reservoir but is enzymatically active and capable of phosphorylating Golgi-resident substrates (Soulet et al, 2026), providing a compelling mechanistic basis for how damage-induced ATM signalling could reach the Golgi to license RAD51C release.

      To directly examine whether ATM localises to the Golgi in our system and whether its activation state changes upon DNA damage, we performed a biochemical Golgi enrichment assay using the Minute{trade mark, serif} Golgi Apparatus EnrichmentKit (Cat #: GO-037) to examine ATM distribution across cis- and trans-Golgi fractions. Fraction purity was validated using GM130 (cis-Golgi), TGN46 (trans-Golgi), and HSP60 (membrane fraction) (Rebuttal Figure 2A). This analysis revealed that ATM is detectable in the total membrane fraction and enriched in the cis-Golgi fraction under basal conditions (Rebuttal Figure 2A). Under normal physiological conditions, activated ATM (pATM) was absent from Golgi-enriched fractions (Rebuttal Figure 2B), but was detectable in the cis-Golgi fraction following doxorubicin-induced genotoxic stress (Rebuttal Figure 2C). While these observations are preliminary and require further validation, they are consistent with the emerging literature and raise the intriguing possibility that ATM is recruited to and activated at the Golgi in a damage-dependent manner, where it could act locally to license RAD51C release.

      Rebuttal Figure 2. Biochemical Golgi fractionation confirms ATM enrichment in cis-Golgi compartments.

      *Western blot of HeLa-K fractions enriched for cis- and trans-Golgi membranes, probing for (A) ATM under basal conditions, and (B and C) pATM under basal conditions and (B) pATM (C) after treatment with DOX (40 μM) (markers: GM130 for cis-Golgi, TGN46 for trans-Golgi, HSP60 for membrane fraction (MEM). *

      We consider the precise spatial and mechanistic dissection of ATM signalling at the Golgi and its relationship to nuclear feedback, one of the most exciting directions to emerge from this work, and one that we hope our study has helped to open. We have expanded the Discussion (lines 525-543) accordingly to place our findings in the context of the emerging Golgi-ATM literature and to frame this as an important unresolved question for future investigation.

      *5. RAD51C is examined in silo, without consideration for the BCDX2 complex - RAD51C is exclusively analyzed in isolation, despite its well-established function as part of the BCDX2 paralog complex (RAD51B-RAD51C-RAD51D-XRCC2). Because RAD51C does not normally operate as a standalone factor, it is unclear why only RAD51C, among all paralogs, would be subjected to Golgi tethering, ATM-dependent release, and Importin-β-driven nuclear import. This raises important mechanistic questions: Are other BCDX2 members also Golgi-associated? Do they undergo similar trafficking dynamics? Does Golgi tethering selectively regulate RAD51C, or does the complex translocate together? Addressing these points would greatly strengthen the biological plausibility and mechanistic coherence of the proposed model. *

      Response: We thank the reviewer for raising this important point. We fully agree that RAD51C functions as a core component of the BCDX2 (RAD51B-RAD51C-RAD51D-XRCC2) and CX3 (RAD51C-XRCC3) paralog complexes, and that its canonical roles in HR and replication fork protection occur within these assemblies. Our decision to focus on RAD51C was driven by the screening data: of the DDR proteins identified, RAD51C displayed the most robust Golgi-associated pool, the clearest damage-induced redistribution dynamics, and a tractable anchoring interaction with Giantin that could be interrogated biochemically.

      We would also note that extending this analysis to other RAD51 paralogs is not straightforward with current tools. The available commercial antibodies against RAD51B, RAD51D and XRCC2 perform poorly in immunofluorescence applications, and most localisation studies for these proteins have relied on overexpression of tagged constructs, a strategy that, as discussed above, risks perturbing both localisation and complex assembly. The lack of reliable antibodies for endogenous paralog detection at the resolution required for Golgi localisation analysis represents a genuine technical barrier that we encountered directly during this study.

      Whether Golgi association and ATM-dependent release involve RAD51C alone or extend to other BCDX2 or CX3 members is therefore a genuinely open and important question. We note that our co-immunoprecipitation data were performed on total cell lysate and cannot distinguish whether the Golgi-associated RAD51C is complexed with other paralogs or represents a monomeric subpopulation. Golgins are well established as multi-cargo scaffolding platforms, and it is entirely plausible that Giantin organises a broader paralog module rather than tethering RAD51C as an isolated subunit. A systematic analysis of RAD51 paralogs for Golgi localisation and lesion-dependent trafficking enabled by improved reagents such as proximity labelling or endogenous tagging approaches compatible with essential proteins would determine whether the BCDX2 complex translocates as a unit or whether individual subunits are differentially regulated, with potentially distinct consequences for HR fidelity. We have revised the manuscript accordingly and identify this as an explicit priority for future work in the revised Discussion (lines 583-602).

      Minor Comments

      1. Pathway-specific sub-Golgi localization patterns - The finding that DDR proteins map to distinct cis/trans Golgi subdomains is an interesting and potentially important observation. However, the dataset is limited to 15 proteins, making the proposed pathway-level trends (e.g., HR factors enriched in cis-Golgi; BER/MMEJ factors enriched in trans-Golgi) preliminary. Strengthening this conclusion by increasing the number of DDR proteins analyzed would help determine whether sub-Golgi compartmentalization contributes meaningfully to DNA repair pathway regulation.

      Response: We thank the reviewer for this constructive suggestion. We agree that extending sub-Golgi mapping to a larger number of DDR proteins would be valuable, and we present the current dataset explicitly as a first, hypothesis-generating map rather than a definitive pathway atlas.

      We would like to highlight, however, that the value of this observation lies not simply in the number of proteins mapped, but in the biological coherence of the patterns that emerge. The finding that proteins from the same repair pathway tend to occupy the same Golgi sub-compartment: BER and MMEJ factors enriching in the trans-Golgi, HR factors in the medial/cis-Golgi, and that this sub-compartmental positioning correlates with the direction of their redistribution upon genotoxic stress, is a pattern that would be unlikely to arise by chance across 15 independently validated proteins. This internal consistency argues that the sub-Golgi organisation reflects genuine pathway-level biology rather than noise, even if the dataset is not yet exhaustive. Together with the bioinformatic network analysis, which independently supports pathway-level clustering across the broader validated hit list, these observations reinforce each other as complementary layers of evidence.

      2. Is the Golgi-released RAD51C indeed the pool that enters the nucleus? The major assumption of the study is that the RAD51C population released from the Golgi upon DNA damage is the same pool that subsequently accumulates in the nucleus to form repair foci. While the imaging and fractionation data are consistent with this model, the study does not directly track or distinguish Golgi-derived RAD51C from cytoplasmic or pre-existing nuclear pools. Without a method to specifically label, pulse-chase, or track the Golgi-anchored fraction, it remains formally possible that nuclear RAD51C originates from other subcellular reservoirs.

      __Response: __We thank the reviewer for highlighting this important mechanistic point, which we agree cannot be fully resolved with the current dataset. Several independent lines of evidence are nonetheless consistent with a model in which the Golgi-associated pool contributes directly to damage-induced nuclear accumulation.

      • Our time-resolved imaging demonstrates a reciprocal decrease at the Golgi and a concurrent increase in the nucleus following genotoxic stress, consistent with redistribution rather than independent compartment-specific changes (Figures 3E-3I).
      • Biochemical fractionation provides an orthogonal readout of the same reciprocal shift under identical conditions (Figures 3J and S6D).
      • ATM inhibition simultaneously prevents Golgi loss and blunts nuclear accumulation, while Importin-β perturbation blocks nuclear entry, together supporting an active and regulated translocation route (Figures 3K-3P).
      • Giantin depletion, which releases the Golgi-tethered RAD51C pool prematurely, leads to aberrant nuclear RAD51C foci lacking canonical HR markers and impaired ATM signalling, strongly supporting that the Golgi-tethered fraction has functional consequences in the nucleus consistent with it being the relevant pool (Figures 4B-4E and 4J-4M).
      • In the revised manuscript we have included cytoplasmic RAD51C signal quantification across the doxorubicin time course (Figure 3H). The cytoplasmic signal shows only a moderate and gradual reduction that is kinetically distinct from the sharp Golgi decrease and does not precede the nuclear increase. This pattern is inconsistent with a large pre-existing cytoplasmic reservoir driving the nuclear accumulation; if the cytoplasmic pool were the primary source, one would expect a rapid and prominent cytoplasmic decrease coinciding with or preceding nuclear accumulation, which we do not observe. Instead, the data are more consistent with rapid transit of Golgi-released RAD51C through the cytoplasm rather than stable cytoplasmic accumulation prior to nuclear entry. We acknowledge that definitive pool-identity tracking would require spatially restricted labelling approaches such as Giantin-proximal TurboID or photoactivatable tagging strategies, which are precluded by the technical constraints on RAD51C tagging described above. We have revised the manuscript to avoid overstatement on this point and identify these approaches as important future directions (lines 297-305 & lines 715-719).

      Reviewer #2 (Significance (Required)):

      General assessment - This study presents a novel and conceptually compelling view of the DNA damage response (DDR) by positioning the Golgi apparatus as an active regulator of the spatiotemporal availability of DNA repair factors. The strongest aspects of the work include its integration of a systematic immune-localization screening, a sub-Golgi compartment mapping, dynamic redistribution assays, and functional perturbations to build a coherent model of Golgi-nucleus communication during genotoxic stress. The mechanistic focus on RAD51C provides a clear case study linking organelle-level regulation to genome stability.

      • Advance - To my knowledge, this is the first comprehensive demonstration that the Golgi can serve as a spatiotemporal coordination node for DDR proteins, including those involved in HR. The identification of a substantial pool of RAD51C, and reportedly other DDR factors, anchored within specific Golgi subdomains represents a significant conceptual advance. The demonstration that Golgi-tethered RAD51C is released in an ATM-dependent manner and subsequently participates in nuclear foci formation suggests a previously unrecognized organelle-level regulatory checkpoint in genome maintenance. This work therefore extends current models of the DDR by revealing a layer of intracellular coordination that bridges classical nuclear pathways with cytoplasmic organelle function.*

      • Audience - This study will be of strong interest to a specialized audience in the fields of DNA repair, genome stability, and cell biology, particularly those studying the spatial organization of repair pathways and intracellular stress signaling. It will also appeal to researchers investigating organelle biology, intracellular trafficking, and the broader coordination of cytoplasmic and nuclear responses to stress. Beyond these communities, the work may be relevant to cancer, as it suggests new mechanisms by which organelle perturbations or Golgi-associated scaffolding proteins could influence therapeutic responses or genomic instability.

      Reviewer expertise - Field of expertise: DNA repair, genome stability, organelle biology, cancer cell biology.*

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *This study investigates the communication between the Golgi complex and the nucleus of the cell, which remains a largely unexplored field. The authors used publicly available siRNA and antibody data from the Human Protein Atlas as a basis for finding overlap between the proteomes of the two cellular compartments. In validating the data from the HPA, the study finds a novel cluster of DNA repair proteins present in the Golgi, which they validate and resolve to sub-compartmental localization. To do so they use immunofluorescence (IF) localization on ¬cis- and trans-Golgi cisternae marked by GM130 and TGN46, respectively. The authors find that many of the fully validated proteins present in both the nucleus and Golgi redistribute between the Golgi and the nucleus dependent on the protein and the type of DNA lesion. They focused on RAD51C, a recombination factor. They show that RAD51C resides in both the ¬cis- and trans- subsections prior to damage and responds to DNA damage in an ATM-dependent manner via release of a Golgi-based pool bound to Giantin, which is then imported into the nucleus via Importin-β. Knockdown experiments showed that Giantin regulates RAD51C spatially and temporally. The work reveals a dynamic interchange of proteins between the Golgi and nucleus that controls cell functions beyond the classic secretory, membrane trafficking, and PTM roles of the Golgi. The authors build on prior work on Golgi impacts on DDR, offering an alternative cellular compartment for storage of DDR factors prior to damage. Overall, the data is timely and relevant, as it finds new roles for the Golgi in DNA damage response (DDR) regulation. The data is largely convincing and well controlled. The IF data is presented in black and white single channels and merged in color, which allows good comparison of the different protein stains. The scope of the initial screen of HPA antibodies and Golgi/Nuclear dual proteomes is impressive, and the overlap of DDR proteins is characterized for fifteen different proteins at a sub-compartmental level. The focus on RAD51C as a member of the HR pathway was a strong choice, and the study presents interesting information on its regulation by Golgi complex members, as well as a feedback look with pATM. The possibility of the Golgi storing specific DDR factors in specific compartments is well-supported and intriguing. There are a few major and minor points that should strengthen the paper and improve clarity prior to publication. *

      Major Comments:

      *1. Much of the strength of the IF data is lost in the choice of scale for presentation of the data. In almost all cases, enlarged sections should be shown of the areas currently indicated by arrow, in all channels. This is done well in Figure 3A, where an area of the Golgi is enlarged and the overlap of RAD51C in the GM130-marked Golgi is clearly visible in the merged channel, even when printed out. I would highly recommend including the white box and enlarged in all images and channels, while keeping the representative fields as is (e.g. if the image is 40mm, draw a 7mm box around representative cells/Golgi, and enlarge to 15mm in the bottom left). This change should be made to F1E, F2F, F3E, F3J, and F3M, as well as having enlarged figures in the corners in all supplementary data IF figures. Where possible, a fully enlarged image of the bounding box could also be included. Some of the IF data would be strengthened by using the nuclei stain to draw a masking outline to include in the black and white channels, to clearly delaminate what is Golgi-localized and what is nuclear. *

      Response: We thank the reviewer for this helpful suggestion and fully agree that enlarged insets substantially improve the visibility of Golgi-localised signal, particularly when figures are printed. We share the reviewer's view that alternative display formats with larger insets would be preferable, and we have implemented enlarged boxed regions wherever space constraints permitted.

      Specifically, we have added boxed regions with enlarged insets to Figure 1E, all panels of Figure 3. For Figure 2, the number of conditions and proteins displayed simultaneously within the constraints of standard journal figure dimensions made it impractical to include enlarged insets for all panels without reducing the overall field size to the point of losing contextual information. We have nonetheless improved the visibility of the Golgi signal in Figure 2 as much as possible within these constraints, and note that the final figure layout will be further optimised in line with the journal's specific formatting guidelines. In addition, all figures have been provided as high-resolution image files to allow electronic magnification, enabling readers to inspect the Golgi-localised signal in detail beyond what is visible in the printed version.

      Regarding the use of nuclear outline masks in single-channel images, we tested this approach but found that given the number of structures present within each field, including Golgi stacks, nuclear foci, and cytoplasmic signal, overlaying nuclear outlines on individual channels added visual complexity that made the images harder rather than easier to interpret. As an alternative, we have included a full-colour merged panel, when possible, which we consider a cleaner way to delineate nuclear versus Golgi-localised signal and allows the reader to directly compare compartment-specific distributions across channels.

        1. *There is a lack of consistency in the representative images shown by IF. For example, Figure 1 gives the impression of very little RAD51C in the nucleus but this is rightly shown to not be the case in Supp. Fig 2A. The same is true of the various images of LIG1. The authors should use representative data that better reflects the distribution of the proteins being studied and maintain consistency across images. If there is a lot of variation in staining patterns, the authors should show images and percentages corresponding to the variations especially for the key gene studied, RAD51C.

      Response: We agree and have replaced the representative IF panels for RAD51C and LIG1 with images that better reflect the quantified distributions across biological replicates. The revised panels were selected to match the quantified compartment intensities shown in the accompanying graphs rather than representing outlier cells. We would also note that the apparent discrepancy between Figure 1E and Supplementary Figure S2A partly reflects a difference in imaging conditions: Supplementary Figure S2A __and __Figure 2F were acquired directly from the high-content screening pipeline under uniform, non-optimised antibody and fixation conditions at widefield resolution, whereas Figure 1E shows representative single optical section confocal images acquired after candidate identification with antibody conditions optimised for each individual protein. The improved signal-to-noise in the optimised confocal images more faithfully captures the dual Golgi and nuclear localisation of RAD51C, and the apparent difference between the two image sets is therefore expected rather than inconsistent. We have updated the figure legends to clarify the imaging modality and conditions for each panel. Furthermore, the quantified distribution of RAD51C across Golgi, nuclear and cytoplasmic compartments across multiple cell lines is shown in Figure 3B and 3D, providing a population-level representation of the dual localisation that complements the representative images shown in Figure 1E.

        1. *The initial screening by siRNA-mediated knockdown pipeline that validated and confirmed dual Golgi and nuclear localization of 163 of the 329 dual-localization HPA proteins does not have any data included. This seems like a very large amount of data to gloss over and not include even as supplementary data. This should be included as source data, and discussion of the in-text information should be strengthened. The data included with the networking of these validated proteins is strong, but the process of elimination and validation has not been shown. In addition, the antibody information included in the supplementary data does not include dilution factors or blocking factors is not included, which would be beneficial to future studies to include.

      Response: We agree and have addressed this in full. We note that the HPA antibody validation data, including immunofluorescence images and siRNA knockdown results, are publicly available for inspection on the Human Protein Atlas website (www.proteinatlas.org) for the majority of candidates, providing an independent layer of verification. In the revised submission, we additionally provide the complete siRNA-mediated validation dataset generated in our laboratory as source data (Table S1; lines 1025-1041), including for each candidate the HPA antibody identifier, gene symbol, Ensembl ID, antibody staining pattern, siRNA identifier, cell number per replicate, and normalised Golgi and nuclear signal ratios for both experimental replicates. This allows readers to inspect the validation metrics directly and apply alternative thresholds if desired. We have also expanded the antibody information to include diluent conditions (4% FBS in 0.1% Triton-X100 for all HPA antibodies used at 2 μg/ml in the screening pipeline), enabling reproducibility and reuse of the dataset by the community.

        1. *The authors should expand upon the paragraph lines 155-162 to include more discussion on Figure S2A and S2B. The expanse of this data is some of the strongest in the paper, and it should be further discussed in-text. Also, the rationale behind the choice in the specific proteins that are included in these analysis / figures is not always clear in -text, and more attention should be spent on the narrowing down of the analysis to the final proteins. This is also especially important as many of the DDR proteins chosen are not the most common DDR proteins. Also note in text that the Golgi marker GM130 (presumably) was used for the screening, which means that some proteins which are only localizing to the TGN46 trans Golgi might have been lost in the validation step (or, explain why this is not the case).

      Response: __We expanded the Results text (__lines 141-163) to discuss Figures S2A and S2B in more depth and clarified the rationale for selecting the final set of DDR proteins taken forward, including considerations of pathway representation, bioinformatic annotations, literature-described roles in DNA repair. We would also note that the identity of the DDR proteins identified in this screen was determined by the HPA dataset and the unbiased validation pipeline rather than by prior assumptions about which repair factors would be present at the Golgi. The presence of less commonly studied DDR factors is therefore a direct reflection of the screen output, and we consider this one of the strengths of the approach.

      We would also like to address the reviewer's concern about potential GM130-based bias directly: at the widefield or confocal resolution used in the high-content screening pipeline, the Golgi apparatus appears as a single perinuclear structure and cis- and trans-Golgi subdomains cannot be resolved. GM130 was therefore used purely as a segmentation marker to define the Golgi compartment as a whole rather than to selectively label the cis-Golgi cisternae. The resulting Golgi mask captures signals from the entire Golgi ribbon, including trans-Golgi regions, meaning that proteins with exclusively trans-Golgi localisation would not have been systematically excluded at the screening stage. Sub-compartmental resolution of cis versus trans localisation was only possible in subsequent analyses using nocodazole-dispersed mini-stacks imaged by confocal microscopy with co-staining for both GM130 and TGN46.

      *5. The relationship between Giantin loss, increased cell proliferation, and elevated endogenous DNA damage as it relates to RAD51C remains insufficiently resolved and requires further clarification. Several of the proliferation assays used are not optimal for addressing changes in cell growth. For example, Figure 5O appears to quantify cell numbers by counting fields from IF images, which is an unconventional approach. This should be done by growth curves, luminescent viability or colony formation assays. In addition, this point will be greatly strengthened by performing rescue experiments for Giantin directly (instead of co-depletion as a means of rescue) and/or using a mutant of RAD51C that does not bind to Giantin. If these additional experiments are beyond the current scope, the conclusions should be softened in the discussion. *

      Response: We thank the reviewer for raising these important points, which we address in turn:

      Giantin-RAD51C relationship and mechanistic interpretation. __We acknowledge that establishing the full causal chain between Giantin loss, RAD51C mislocalisation, elevated endogenous DNA damage and increased cell proliferation is challenging within the scope of a single study, and we discuss this openly in the Discussion (__lines 555-564). Our evidence collectively includes: physical interaction between endogenous Giantin and RAD51C by co-immunoprecipitation (Figures 4H and 4I), premature nuclear accumulation of RAD51C upon Giantin depletion (Figures 4B-4E and 4J-4M), new additional experiment showing direct reduction of HR efficiency in the DR-GFP assay (Figure 5L), impaired ATM signalling (Figures 5J and 5M), elevated genomic instability (Figures 5A-5E), and epistatic rescue by RAD51C co-depletion (Figures 5M-5P). These observations are further contextualised by the established literature on RAD51C function: RAD51C is known to regulate CHK2 phosphorylation and cell cycle checkpoint signalling (Badie et al, 2009), stabilise replication forks (Somyajit et al, 2015), and promote RAD51 filament formation required for DSB repair (Prakash et al, 2015). Dysregulation of these functions through Giantin-dependent mislocalisation provides a mechanistically coherent explanation for the elevated genomic instability and altered proliferation we observe, and is entirely consistent with our model. Together, the experimental evidence and the published biology of RAD51C support a model in which Giantin spatially regulates RAD51C to maintain proper DDR signalling and HR capacity.

      We agree that separation-of-function tools would further strengthen this model and identify these as important future priorities. We wish to note however that both approaches face substantial technical barriers in this system. As described in our response to Reviewer 1 Major Comment 1, RAD51C tagging, whether by CRISPR-mediated endogenous editing or ectopic expression, consistently compromised cell viability and protein function, precluding the generation of interaction-deficient variants at physiological expression levels. Engineering an interaction-deficient Giantin mutant presents an independent and considerable challenge: Giantin is one of the largest Golgi matrix proteins (~376 kDa), composed almost entirely of extended coiled-coil domains that are intrinsically difficult to model structurally, and identifying a discrete interaction interface with RAD51C without disrupting the broader scaffolding function of the protein would require a dedicated structural and biochemical programme. We therefore consider these important but substantial future directions rather than straightforward experimental additions to the current study.

      Proliferation assays. Colony formation assays provide a rigorous readout of long-term proliferative capacity, and these data are presented for single knockdown conditions in Figures 5F-5I. The cell number quantification in Figure 5P was specifically included to assess the double knockdown of Giantin and RAD51C simultaneously, a condition not covered by the colony formation assay. We respectfully note that automated fluorescence microscopy-based nuclear counting is a well-established approach for measuring cell proliferation in siRNA screening contexts. Nuclear counting from high-content imaging has been used as a direct readout of cell growth and proliferation in RNAi screens (Boutros et al, 2004; Martin et al, 2014; Garvey et al, 2016; Mikheeva et al, 2024), and has been shown to produce results comparable to or superior to conventional viability assays including MTT and flow cytometry-based methods (Mikheeva et al, 2024). We have nonetheless clarified in the revised figure legend that Figure 5P reports relative cell number quantified by automated nuclear counting from high-content imaging fields as a secondary concordant measure alongside the colony formation data, rather than a standalone proliferation assay.

      *6. It is unclear from the discussion and from presented data whether proteins are directly transported between the Golgi and the nucleus, or whether they go into the cytoplasm for a transient period, presumably when they could interact with Importin β. There is also some data where cytoplasm signal could be quantified to address this (Figure 3E-I). *

      Response: We thank the reviewer for this mechanistic point. In the revised manuscript we have included cytoplasmic RAD51C signal quantification alongside Golgi and nuclear measurements for the doxorubicin time course (lines 297-305; Figure 3H). The cytoplasmic signal shows a moderate and gradual reduction distinct in both magnitude and kinetics from the sharp Golgi decrease, consistent with a transient cytoplasmic intermediate rather than a stable pool. Regarding the identity of the translocating pool, two observations directly support a Golgi origin. First, Importazole treatment prevents RAD51C release from the Golgi following genotoxic stress and simultaneously reduces nuclear RAD51C foci formation, demonstrating that Importin-β-mediated import is required both for Golgi clearance and for productive nuclear accumulation. Second, Giantin depletion which prematurely releases the Golgi-tethered pool, leads to aberrant nuclear RAD51C foci, directly linking the Golgi-anchored fraction to nuclear accumulation. Together these data support a model in which Golgi-resident RAD51C transits through the cytoplasm for Importin-β-mediated nuclear import. We acknowledge that without direct labelling of the Golgi-anchored fraction, the precise contribution of each subcellular pool to the nuclear accumulation cannot be fully resolved with the current dataset. We discuss the development of appropriate tagging strategies as an important future direction to dissect the dynamics of this process in further detail.

      *7. Statistical analysis on experiments with more than two samples need to be performed with ANOVA and a follow up post-hoc test, not with two-tailed unpaired Student's t-test, which only compares the control and each individual sample. This type of analysis inflates the Type 1 error rates (false positives) in your datasets. For example, the two-tailed unpaired Student's t-test is appropriate in Figure 2F-H, but not in Figure 3 when the samples are timepoints. In this case, a One-way ANOVA with Tukey's post-hoc test (if you want to show all coparisons), or Bonferroni/Sidak if you only need to compare several samples). *

      Response: We agree with the reviewer and thank them for highlighting this important statistical issue. We have revised the statistical analysis for all experiments involving more than two groups to avoid inflation of Type I error rates caused by multiple pairwise Student's t tests. Specifically, for Figures 3F-I, 4C-E, and Figure 5, the data were reanalysed using one way ANOVA followed by the appropriate multiple comparisons post hoc test. The Methods section and corresponding figure legends have been updated to clearly state the statistical tests used for each dataset.

      Minor Comments: General 1. Throughout the text, the reference to many figures and supplementary figures in the same sentence, with little discussion of the data therein makes it hard to follow. In-text referencing is particularly confusing in the section "Dual-localising DDR proteins dynamically redistribute between the Golgi and nucleus in response to specific types of DNA injuries," where the reader is switching between multiple figures and supplementary figures.

      __Response: __We thank the reviewer for this helpful comment. In the revised manuscript, we have improved the readability of the text and revised the figure references to make them clearer. We hope these revisions make the manuscript easier to follow and allow readers to better inspect the figures.

      1. In figures that display technical replicates as individual data points, consider distinguishing each replicate by using different marker shapes (e.g., repeat 1 = upright triangle; repeat 2 = inverted triangle; repeat 3 = diamond). This would provide additional clarity regarding the consistency and repeatability of each technical repeat.

      __Response: __We thank the reviewer for this suggestion. We have updated the data presentation to distinguish biological replicates using different marker shapes in datasets where replicate tracking is of particular relevance to the interpretation. For datasets where individual replicate values are already clearly separable, we have maintained the existing presentation to avoid unnecessary visual complexity.

      1. Make sure all western blot data includes the marker size (F3C and F5L has none, F4H/I have size of proteins not size of markers).

      __Response: __We added missing marker sizes to our western blot data in the revised manuscript.

      1. Be consistent with use of capitalization in figure legends and graph/figure labels.

      __Response: __We made sure that the capitalisation is consistent in figure legends, graph and figure legends in the revised manuscript.

      Figure 2

      In Figure 2A, please include in the figure itself that GM130 is the cis Golgi, and TGN46 is the trans Golgi (Figures should not be dependent on the text for full understanding).

      __Response: __We revised Figure 2A and 2C to label GM130 as cis-Golgi and TGN46 as trans-Golgi within the figure, making it self-explanatory.

      1. Why are LRIG2 and LRRIQ3 not included in the 2E cis vs trans Golgi data, when all other proteins from F1D are included? Include, or comment on in-text.

      __Response: __Both LRIG2 and LRRIQ3 are included in 2E in both the original and revised manuscript.

      1. Be sure to include scale bar data in each figure legend (F2A-E is currently missing it), and include updated scales included in the enlarged data.

      __Response: __Scale bar data is now included in each figure legend in the revised manuscript.

      1. In Figure 2F, make sure that the merged green channel is presented at the same intensity as it is in the single black and white channel, as the green looks very overexposed in several of the merged (CCAR1 DMSO merged is the most noticeable).

      __Response: __We agree and thank you for pointing this out. We have now revised the images and corrected the issue by updating all image panels in the figure.

      1. In Figure 2G, include the grey label in the figure legend.

      __Response: __We thank the reviewer for this comment. The grey label has now been included in the figure legend in the revised manuscript.

      1. In Figure 2G-H, the method of data presentation in the graphs coupled with the statistical analysis is confusing and should be expanded upon in the legend.

      __Response: __We agree that the amount of data presented may appear overwhelming. In the revised figure, we have adjusted the placement of the statistical annotations to improve clarity. Also, we improved the figure legend, to make the figure easier to read and interpret.

      Figure 3

      Figure E/F/G: Is there cytoplasmic quantification as well? Your rationale is that the Golgi RAD51C goes into the nucleus, but via the cytoplasm (due to Importin β import); do you see the cytoplasmic levels increase? Or is it too dilute to notice a difference? At least, this omission needs to be mentioned in-text.

      Figure H/I also include the quantification of the cytoplasmic fraction. It is mentioned in-text on line 272, but not quantified. This comes up as a big question: Do the proteins go directly between the Golgi and nucleus, or do they go through the cytoplasm?

      __Response: __We thank the reviewer for both of these related points. As described in our response to Major Comment 6 above, we have added cytoplasmic RAD51C signal quantification to the doxorubicin time course in the revised manuscript (Figure 3H) and discuss the implications for the proposed translocation route.

      Figure 3A, 3E, and if the data is present for 3J and 3M, could all benefit from using the nuclei staining as a mask to draw an outline around the nucleus in the other channels, and then show a merge in full color instead of a nuclei-only channel. Also note from the major comments, that this data especially is so small to see without enlarged images.

      __Response: __We thank the reviewer for this suggestion. Regarding nuclear outline masks, we tested this approach but found that the number of structures present in each field, including Golgi stacks, nuclear foci and cytoplasmic signal, made overlaid outlines visually confusing rather than clarifying. We have instead included a full-colour merged panel in Figure 3E, which we consider a cleaner way to distinguish nuclear from Golgi-localised signal while preserving the spatial context of the data.

      Regarding image size, we have added enlarged insets to Figures 3E, 3J and 3M in the revised manuscript. We have chosen to display multiple cells per panel rather than a single enlarged cell in order to capture the heterogeneity of the cell population, which we consider important for an accurate representation of the data. All figures have been provided as high-resolution image files to allow electronic magnification, enabling detailed inspection of the signal beyond what is visible in the printed version. We acknowledge that the constraints of standard journal figure dimensions limit how large individual panels can be, and the final layout will be optimised in line with the journal's formatting guidelines.

      *In-text discussion of the results from Figure 3 has an in-depth discussion of the NLS and NES in RAD51C, but this is not followed up on with site-directed mutagenesis or any data; perhaps move this to the discussion instead of results section. *

      __Response: __We have removed the discussion of the NLS and NES from the Results section.

      Figure 4

      Comments from earlier figures hold, with size of enlarged events and using the nuclei as an outline in the single channels. E.g. Figure 4F arrows appear to point to nothing at the chosen scale. The zoom in 4G is insufficient, as the chosen feature is so small it is not even visible in full fields.

      __Response: __We thank the reviewer for this comment. The arrows in Figure 4F indicate individual nocodazole-dispersed Golgi mini-stacks, which are displayed at higher magnification in Figure 4G. The full field in Figure 4F is intentionally shown to illustrate the degree of Golgi dispersion achieved by nocodazole treatment, a context that may be unfamiliar to readers outside the Golgi field, before zooming into a single representative mini-stack in Figure 4G for the cisternal localisation analysis.

      • Figure 4H and 4I need to show the size of the markers *

      __Response: __The size of the markers are now included in the revised manuscript.

      *The representative image in 4L for siGiantin pATM has no pATM foci, while the quantification in 4M has a reduction from ~50% to ~25%, so this image is not representative of this data, or the data quantification is not as strong as the actual data. *

      __Response: __We thank the reviewer for this observation. We wish to clarify that the quantification in Figure 4M reports the mean percentage of RAD51C foci co-localising with pATM across the entire cell population from three independent biological replicates. A reduction from ~50% to ~25% therefore reflects a population-level shift in co-localisation frequency, not that every individual cell shows exactly 25% co-localisation. Given the inherent cell-to-cell variability in foci number and co-localisation, individual cells will span a range of values around this mean, and the representative image shown in Figure 4L reflects one such cell.

      Figure 5

      *Figure 5A has overexposure of the nuclei stain in order to visualize micronuclei. Readjust the levels, and enlarge the images for better visualization. (is this DAPI-stained? Please label). *

      __Response: __The display levels of the nuclear stain in Figure 5A are intentionally set to allow visualisation of micronuclei, which are significantly dimmer than the main nucleus and would not be detectable at display settings optimised for the primary nuclear signal. This is standard practice in micronuclei quantification studies and is necessary to accurately identify and score these structures. The nuclear stain is Hoechst 33342, and this has been explicitly labelled in the revised figure legend.

      *Figure 5A-C: Figure 5A does not show siRAD51, but it is included in the DMSO only graph. Please either show RAD51 data in 5A and 5C, or do not include in 5B. If the DMSO and ETO experiments were performed separately and that accounts for this discrepancy, then show separately. *

      __Response: __We thank the reviewer for this observation. The siRAD51C condition is included in Figure 5B as an internal positive control, consistent with its well-established role in genome stability. RAD51C depletion combined with etoposide treatment resulted in severe cellular toxicity and insufficient cell numbers for reliable quantification, and this condition was therefore excluded from Figure 5C. This has been clarified in the revised figure legend.

      *Figure 5M the white label is difficult to see in the green box. *

      __Response: __We have updated the label colour in Figure 5M to improve visibility against the green background in the revised manuscript.

      * Supplementary Figures*

      Consider reordering/ subdividing supplementary figures for ease of reference during reading.

      Response: We thank the reviewer for this suggestion. The current supplementary figure structure was intentionally designed to minimise the total number of supplementary figures and maintain a logical correspondence with the main figures, avoiding a situation where readers need to navigate an extensive supplementary section, a concern the reviewer raised regarding figure presentation. We believe the current organisation achieves a reasonable balance between completeness and accessibility.

      SF1 and SF2A: Include enlarged boxes or full images so that data is visible.

      __Response: __As described in our response to Major Comment 1, all figures have been provided as high-resolution image files to allow electronic magnification. Space constraints within standard journal figure dimensions preclude the addition of enlarged insets to all supplementary panels without substantially reducing the contextual field of view.

      *SF3A, SF4A, and SF5A: Include enlarged images, include nuclei marker if possible (otherwise, the nuclear intensity is not proven nuclear). *

      Response: We appreciate the suggestion, but adding enlarged insets and nuclei markers to all panels in Figures S3A, S4A and S5A would disproportionately increase the length and complexity of the supplementary section, making it harder rather than easier to navigate. The nuclear intensity measurements are derived from automated segmentation of the Hoechst channel using CellProfiler, which reliably defines nuclear boundaries independently of the antibody channel, and are therefore not dependent on visual confirmation of nuclear localisation in each representative image.

      *SF3B-C, SF4B-C, and SF5 B-D: Change the data presentation in the same method as changed for F2G-H. *

      Response: We have updated the figure legends for Figures S3B-C, S4B-C and S5B-D to improve readability.

      SF3D: List proteins in the same order as in B and C.

      Response: The proteins in Figure S3D are listed in the same order as in Figures S3B and S3C.

      SF6D: Label M N and C more clearly. Include size labels.

      Response: We have added clearer labels for the membrane (M), nuclear (N) and cytoplasmic (C) fractions and included molecular weight size markers in the revised Figure S6D.

      *SF7A-B: Include enlarged. *

      Response: We respectfully note that the purpose of Figures S7A-B is to display the overall cellular response to inhibitor treatments across the cell population, rather than to highlight specific subcellular structures. Enlarged insets would reduce the number of cells visible per panel and would not add scientific value in this context. The Golgi and nuclear signals are clearly visible at the chosen magnification.

      *SF8: Include arrows as in previous experiments, include enlarge. *

      Response: Arrows have been added to Figure S8 to indicate Golgi and nuclear RAD51C signal, consistent with the annotation style used in the main figures. The images already show two representative cells per condition to maximise the visible detail at the chosen scale.

      *SF9G: G is labelled, but not included. *

      Response: Figure S9G has been added in the revised manuscript, showing the pan-cancer overall survival map for GOLGB1 expression across all TCGA cohorts generated using GEPIA2. The figure legend has been updated accordingly.

      *Reviewer #3 (Significance (Required)): *

      * The work finds new roles for the Golgi in regulation of DNA damage responses and the screen could be an important dataset (but results need to be made available) for the DNA repair community. The scope of the initial screen of HPA antibodies and Golgi/Nuclear dual proteomes is impressive, and the overlap of DDR proteins is characterized for fifteen different proteins at a sub-compartmental level. The work provides important insights into RAD51C regulation, however, there are key mechanistic insights and control experiments missing from the studies involving RAD51C and Giantin, dampening its impact. The idea of an alternative cellular compartment for storage of DDR factors prior to damage is interesting, and suggests the spatial regulation of specific lesion responses are stored in specific sub-compartments of the Golgi, which could contribute to repair regulation.*

      References:

      Adamson B, Smogorzewska A, Sigoillot FD, King RW & Elledge SJ (2012) A genome-wide homologous recombination screen identifies the RNA-binding protein RBMX as a component of the DNA-damage response. Nat Cell Biol 14: 318-328

      Badie S, Liao C, Thanasoula M, Barber P, Hill MA & Tarsounas M (2009) RAD51C facilitates checkpoint signaling by promoting CHK2 phosphorylation. J Cell Biol 185: 587-600

      Bergen DJM, Stevenson NL, Skinner REH, Stephens DJ & Hammond CL (2017) The Golgi matrix protein giantin is required for normal cilia function in zebrafish. Biol Open 6: 1180-1189

      Berti M, Teloni F, Mijic S, Ursich S, Fuchs J, Palumbieri MD, Krietsch J, Schmid JA, Garcin EB, Gon S, et al (2020) Sequential role of RAD51 paralog complexes in replication fork remodeling and restart. Nat Commun 11: 3531

      Boutros M, Kiger AA, Armknecht S, Kerr K, Hild M, Koch B, Haas SA, Paro R, Perrimon N & Heidelberg Fly Array Consortium (2004) Genome-wide RNAi analysis of growth and viability in Drosophila cells. Science 303: 832-835

      Brinkmann K, Schell M, Hoppe T & Kashkar H (2015) Regulation of the DNA damage response by ubiquitin conjugation. Front Genet 6: 98

      Garvey CM, Spiller E, Lindsay D, Chiang C-T, Choi NC, Agus DB, Mallick P, Foo J & Mumenthaler SM (2016) A high-content image-based method for quantitatively studying context-dependent cell population dynamics. Sci Rep 6: 29752

      Ghannoum S, Fantini D, Zahoor M, Reiterer V, Phuyal S, Leoncio Netto W, Sørensen Ø, Iyer A, Sengupta D, Prasmickaite L, et al (2023) A combined experimental-computational approach uncovers a role for the Golgi matrix protein Giantin in breast cancer progression. PLoS Comput Biol 19: e1010995

      Greenhough LA, Liang C-C, Belan O, Kunzelmann S, Maslen S, Rodrigo-Brenni MC, Anand R, Skehel M, Boulton SJ & West SC (2023) Structure and function of the RAD51B-RAD51C-RAD51D-XRCC2 tumour suppressor. Nature619: 650-657

      Halim VA, García-Santisteban I, Warmerdam DO, van den Broek B, Heck AJR, Mohammed S & Medema RH (2018) Doxorubicin-induced DNA damage causes extensive ubiquitination of ribosomal proteins associated with a decrease in protein translation. Mol Cell Proteomics 17: 2297-2308

      Koreishi M, Gniadek TJ, Yu S, Masuda J, Honjo Y & Satoh A (2013) The golgin tether giantin regulates the secretory pathway by controlling stack organization within Golgi apparatus. PLoS One 8: e59821

      Martin HL, Adams M, Higgins J, Bond J, Morrison EE, Bell SM, Warriner S, Nelson A & Tomlinson DC (2014) High-content, high-throughput screening for the identification of cytotoxic compounds based on cell morphology and cell proliferation markers. PLoS One 9: e88338

      Meindl A, Hellebrand H, Wiek C, Erven V, Wappenschmidt B, Niederacher D, Freund M, Lichtner P, Hartmann L, Schaal H, et al (2010) Germline mutations in breast and ovarian cancer pedigrees establish RAD51C as a human cancer susceptibility gene. Nat Genet 42: 410-414

      Mikheeva AM, Bogomolov MA, Gasca VA, Sementsov MV, Spirin PV, Prassolov VS & Lebedev TD (2024) Improving the power of drug toxicity measurements by quantitative nuclei imaging. Cell Death Discov 10: 181

      Ovejero S, Kumanski S, Soulet C, Azarli J, Pardo B, Santt O, Constantinou A, Pasero P & Moriel-Carretero M (2023) A sterol-PI(4)P exchanger modulates the Tel1/ATM axis of the DNA damage response. EMBO J 42: e112684

      Prakash R, Rawal Y, Sullivan MR, Grundy MK, Bret H, Mihalevic MJ, Rein HL, Baird JM, Darrah K, Zhang F, et al(2022) Homologous recombination-deficient mutation cluster in tumor suppressor RAD51C identified by comprehensive analysis of cancer variants. Proc Natl Acad Sci U S A 119: e2202727119

      Prakash R, Zhang Y, Feng W & Jasin M (2015) Homologous recombination and human health: the roles of BRCA1, BRCA2, and associated proteins. Cold Spring Harb Perspect Biol 7: a016600

      Presley JF, Cole NB, Schroer TA, Hirschberg K, Zaal KJM & Lippincott-Schwartz J (1997) ER-to-Golgi transport visualized in living cells. Nature 389: 81-85

      Rawal Y, Jia L, Meir A, Zhou S, Kaur H, Ruben EA, Kwon Y, Bernstein KA, Jasin M, Taylor AB, et al (2023) Structural insights into BCDX2 complex function in homologous recombination. Nature 619: 640-649

      Somyajit K, Saxena S, Babu S, Mishra A & Nagaraju G (2015) Mammalian RAD51 paralogs protect nascent DNA at stalled forks and mediate replication restart. Nucleic Acids Res 43: 9835-9855

      Soulet C, Catalan J & Moriel-Carretero M (2026) The DNA Damage Response kinase ATM restricts Golgi extension. bioRxiv

      Stadler C, Hjelmare M, Neumann B, Jonasson K, Pepperkok R, Uhlén M & Lundberg E (2012) Systematic validation of antibody binding and protein subcellular localization using siRNA and confocal microscopy. J Proteomics 75: 2236-2251

      Stevenson NL, Bergen DJM, Lu Y, Prada-Sanchez ME, Kadler KE, Hammond CL & Stephens DJ (2021) Correction: Giantin is required for intracellular N-terminal processing of type I procollagen. J Cell Biol 220

      Tang Z, Kang B, Li C, Chen T & Zhang Z (2019) GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res 47: W556-W560

      Thorn CF, Oshiro C, Marsh S, Hernandez-Boussard T, McLeod H, Klein TE & Altman RB (2011) Doxorubicin pathways: pharmacodynamics and adverse effects. Pharmacogenet Genomics 21: 440-446

      van der Zanden SY, Qiao X & Neefjes J (2021) New insights into the activities and toxicities of the old anticancer drug doxorubicin. FEBS J 288: 6095-6111

      Zhang Y-W, Otterness DM, Chiang GG, Xie W, Liu Y-C, Mercurio F & Abraham RT (2005) Genotoxic stress targets human Chk1 for degradation by the ubiquitin-proteasome pathway. Mol Cell 19: 607-618

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study investigates the communication between the Golgi complex and the nucleus of the cell, which remains a largely unexplored field. The authors used publicly available siRNA and antibody data from the Human Protein Atlas as a basis for finding overlap between the proteomes of the two cellular compartments. In validating the data from the HPA, the study finds a novel cluster of DNA repair proteins present in the Golgi, which they validate and resolve to sub-compartmental localization. To do so they use immunofluorescence (IF) localization on ¬cis- and trans-Golgi cisternae marked by GM130 and TGN46, respectively. The authors find that many of the fully validated proteins present in both the nucleus and Golgi redistribute between the Golgi and the nucleus dependent on the protein and the type of DNA lesion. They focused on RAD51C, a recombination factor. They show that RAD51C resides in both the ¬cis- and trans- subsections prior to damage and responds to DNA damage in an ATM-dependent manner via release of a Golgi-based pool bound to Giantin, which is then imported into the nucleus via Importin-β. Knockdown experiments showed that Giantin regulates RAD51C spatially and temporally. The work reveals a dynamic interchange of proteins between the Golgi and nucleus that controls cell functions beyond the classic secretory, membrane trafficking, and PTM roles of the Golgi. The authors build on prior work on Golgi impacts on DDR, offering an alternative cellular compartment for storage of DDR factors prior to damage. Overall, the data is timely and relevant, as it finds new roles for the Golgi in DNA damage response (DDR) regulation. The data is largely convincing and well controlled. The IF data is presented in black and white single channels and merged in color, which allows good comparison of the different protein stains. The scope of the initial screen of HPA antibodies and Golgi/Nuclear dual proteomes is impressive, and the overlap of DDR proteins is characterized for fifteen different proteins at a sub-compartmental level. The focus on RAD51C as a member of the HR pathway was a strong choice, and the study presents interesting information on its regulation by Golgi complex members, as well as a feedback look with pATM. The possibility of the Golgi storing specific DDR factors in specific compartments is well-supported and intriguing. There are a few major and minor points that should strengthen the paper and improve clarity prior to publication.

      Major Comments:

      1. Much of the strength of the IF data is lost in the choice of scale for presentation of the data. In almost all cases, enlarged sections should be shown of the areas currently indicated by arrow, in all channels. This is done well in Figure 3A, where an area of the Golgi is enlarged and the overlap of RAD51C in the GM130-marked Golgi is clearly visible in the merged channel, even when printed out. I would highly recommend including the white box and enlarged in all images and channels, while keeping the representative fields as is (e.g. if the image is 40mm, draw a 7mm box around representative cells/Golgi, and enlarge to 15mm in the bottom left). This change should be made to F1E, F2F, F3E, F3J, and F3M, as well as having enlarged figures in the corners in all supplementary data IF figures. Where possible, a fully enlarged image of the bounding box could also be included. Some of the IF data would be strengthened by using the nuclei stain to draw a masking outline to include in the black and white channels, to clearly delaminate what is Golgi-localized and what is nuclear.
      2. There is a lack of consistency in the representative images shown by IF. For example, Figure 1 gives the impression of very little RAD51C in the nucleus but this is rightly shown to not be the case in Supp. Fig 2A. The same is true of the various images of LIG1. The authors should use representative data that better reflects the distribution of the proteins being studied and maintain consistency across images. If there is a lot of variation in staining patterns, the authors should show images and percentages corresponding to the variations especially for the key gene studied, RAD51C.
      3. The initial screening by siRNA-mediated knockdown pipeline that validated and confirmed dual Golgi and nuclear localization of 163 of the 329 dual-localization HPA proteins does not have any data included. This seems like a very large amount of data to gloss over and not include even as supplementary data. This should be included as source data, and discussion of the in-text information should be strengthened. The data included with the networking of these validated proteins is strong, but the process of elimination and validation has not been shown. In addition, the antibody information included in the supplementary data does not include dilution factors or blocking factors is not included, which would be beneficial to future studies to include.
      4. The authors should expand upon the paragraph lines 155-162 to include more discussion on Figure S2A and S2B. The expanse of this data is some of the strongest in the paper, and it should be further discussed in-text. Also, the rationale behind the choice in the specific proteins that are included in these analysis / figures is not always clear in -text, and more attention should be spent on the narrowing down of the analysis to the final proteins. This is also especially important as many of the DDR proteins chosen are not the most common DDR proteins. Also note in text that the Golgi marker GM130 (presumably) was used for the screening, which means that some proteins which are only localizing to the TGN46 trans Golgi might have been lost in the validation step (or, explain why this is not the case).
      5. The relationship between Giantin loss, increased cell proliferation, and elevated endogenous DNA damage as it relates to RAD51C remains insufficiently resolved and requires further clarification. Several of the proliferation assays used are not optimal for addressing changes in cell growth. For example, Figure 5O appears to quantify cell numbers by counting fields from IF images, which is an unconventional approach. This should be done by growth curves, luminescent viability or colony formation assays. In addition, this point will be greatly strengthened by performing rescue experiments for Giantin directly (instead of co-depletion as a means of rescue) and/or using a mutant of RAD51C that does not bind to Giantin. If these additional experiments are beyond the current scope, the conclusions should be softened in the discussion.
      6. It is unclear from the discussion and from presented data whether proteins are directly transported between the Golgi and the nucleus, or whether they go into the cytoplasm for a transient period, presumably when they could interact with Importin β. There is also some data where cytoplasm signal could be quantified to address this (Figure 3E-I).
      7. Statistical analysis on experiments with more than two samples need to be performed with ANOVA and a follow up post-hoc test, not with two-tailed unpaired Student's t-test, which only compares the control and each individual sample. This type of analysis inflates the Type 1 error rates (false positives) in your datasets. For example, the two-tailed unpaired Student's t-test is appropriate in Figure 2F-H, but not in Figure 3 when the samples are timepoints. In this case, a One-way ANOVA with Tukey's post-hoc test (if you want to show all coparisons), or Bonferroni/Sidak if you only need to compare several samples).

      Minor Comments:

      General

      1. Throughout the text, the reference to many figures and supplementary figures in the same sentence, with little discussion of the data therein makes it hard to follow. In-text referencing is particularly confusing in the section "Dual-localising DDR proteins dynamically redistribute between the Golgi and nucleus in response to specific types of DNA injuries," where the reader is switching between multiple figures and supplementary figures.
      2. In figures that display technical replicates as individual data points, consider distinguishing each replicate by using different marker shapes (e.g., repeat 1 = upright triangle; repeat 2 = inverted triangle; repeat 3 = diamond). This would provide additional clarity regarding the consistency and repeatability of each technical repeat.
      3. Make sure all western blot data includes the marker size (F3C and F5L has none, F4H/I have size of proteins not size of markers).
      4. Be consistent with use of capitalization in figure legends and graph/figure labels.

      Figure 2

      1. In Figure 2A, please include in the figure itself that GM130 is the cis Golgi, and TGN46 is the trans Golgi (Figures should not be dependent on the text for full understanding).
      2. Why are LRIG2 and LRRIQ3 not included in the 2E cis vs trans Golgi data, when all other proteins from F1D are included? Include, or comment on in-text.
      3. Be sure to include scale bar data in each figure legend (F2A-E is currently missing it), and include updated scales included in the enlarged data.
      4. In Figure 2F, make sure that the merged green channel is presented at the same intensity as it is in the single black and white channel, as the green looks very overexposed in several of the merged (CCAR1 DMSO merged is the most noticeable).
      5. In Figure 2G, include the grey label in the figure legend.
      6. In Figure 2G-H, the method of data presentation in the graphs coupled with the statistical analysis is confusing and should be expanded upon in the legend.

      Figure 3

      1. Figure E/F/G: Is there cytoplasmic quantification as well? Your rationale is that the Golgi RAD51C goes into the nucleus, but via the cytoplasm (due to Importin β import); do you see the cytoplasmic levels increase? Or is it too dilute to notice a difference? At least, this omission needs to be mentioned in-text.
      2. Figure H/I also include the quantification of the cytoplasmic fraction. It is mentioned in-text on line 272, but not quantified. This comes up as a big question: Do the proteins go directly between the Golgi and nucleus, or do they go through the cytoplasm?
      3. Figure 3A, 3E, and if the data is present for 3J and 3M, could all benefit from using the nuclei staining as a mask to draw an outline around the nucleus in the other channels, and then show a merge in full color instead of a nuclei-only channel. Also note from the major comments, that this data especially is so small to see without enlarged images.
      4. In-text discussion of the results from Figure 3 has an in-depth discussion of the NLS and NES in RAD51C, but this is not followed up on with site-directed mutagenesis or any data; perhaps move this to the discussion instead of results section.

      Figure 4

      1. Comments from earlier figures hold, with size of enlarged events and using the nuclei as an outline in the single channels. E.g. Figure 4F arrows appear to point to nothing at the chosen scale. The zoom in 4G is insufficient, as the chosen feature is so small it is not even visible in full fields.
      2. Figure 4H and 4I need to show the size of the markers
      3. The representative image in 4L for siGiantin pATM has no pATM foci, while the quantification in 4M has a reduction from ~50% to ~25%, so this image is not representative of this data, or the data quantification is not as strong as the actual data.

      Figure 5

      1. Figure 5A has overexposure of the nuclei stain in order to visualize micronuclei. Readjust the levels, and enlarge the images for better visualization. (is this DAPI-stained? Please label).
      2. Figure 5A-C: Figure 5A does not show siRAD51, but it is included in the DMSO only graph. Please either show RAD51 data in 5A and 5C, or do not include in 5B. If the DMSO and ETO experiments were performed separately and that accounts for this discrepancy, then show separately.
      3. Figure 5M the white label is difficult to see in the green box.

      Supplementary Figures

      1. Consider reordering/ subdividing supplementary figures for ease of reference during reading.
      2. SF1 and SF2A: Include enlarged boxes or full images so that data is visible.
      3. SF3A, SF4A, and SF5A: Include enlarged images, include nuclei marker if possible (otherwise, the nuclear intensity is not proven nuclear).
      4. SF3B-C, SF4B-C, and SF5 B-D: Change the data presentation in the same method as changed for F2G-H.
      5. SF3D: List proteins in the same order as in B and C.
      6. SF6D: Label M N and C more clearly. Include size labels.
      7. SF7A-B: Include enlarged.
      8. SF8: Include arrows as in previous experiments, include enlarge.
      9. SF9G: G is labelled, but not included.

      Significance

      The work finds new roles for the Golgi in regulation of DNA damage responses and the screen could be an important dataset (but results need to be made available) for the DNA repair community. The scope of the initial screen of HPA antibodies and Golgi/Nuclear dual proteomes is impressive, and the overlap of DDR proteins is characterized for fifteen different proteins at a sub-compartmental level. The work provides important insights into RAD51C regulation, however, there are key mechanistic insights and control experiments missing from the studies involving RAD51C and Giantin, dampening its impact. The idea of an alternative cellular compartment for storage of DDR factors prior to damage is interesting, and suggests the spatial regulation of specific lesion responses are stored in specific sub-compartments of the Golgi, which could contribute to repair regulation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Background - Eukaryotic cells rely on tightly regulated DNA repair pathways to preserve genome stability under the constant threat of both endogenous and exogenous genotoxic stress. While the nucleus, and to a lesser extent the mitochondria, is the primary site where DNA damage is detected and repaired, accumulating evidence indicates that extranuclear organelles, particularly the Golgi apparatus, play a surprisingly important role in modulating stress signaling, proteostasis, and the trafficking/activation of key DNA repair factors.

      Emerging evidence has shown that genotoxic stress can result in a major remodeling of the Golgi apparatus; however, the crosstalk between the Golgi and the nucleus, and its contribution to the DNA damage response, remains poorly defined. The present study offers timely insight by examining the spatiotemporal behavior of DNA repair proteins that shuttle between the Golgi and the nucleus, and how this trafficking contributes to the maintenance of genomic stability.

      Main findings - The authors employed the Human Protein Atlas (HPA) project to shortlist proteins that might link Golgi-nuclear function and validated each candidate using an siRNA-mediated antibody-validation pipeline, thereby identifying 163 proteins that localize to both the Golgi and the nucleus. Bioinformatic analysis of these candidates revealed a significant enrichment for DNA damage response (DDR) regulators, including multiple factors from core DNA repair pathways, suggesting that a portion of the DDR machinery may reside in the Golgi at steady state. Interestingly, the authors observed that dual-localizing DDR proteins undergo lesion-specific redistribution between the Golgi and the nucleus in response to specific types of DNA injuries. For instance, BER and MMEJ proteins shifted from nucleus to Golgi in response to doxorubicin, whereas MMR and HR proteins redistributed from Golgi to nucleus. This trend was reversed with H2O2 or KBrO3 treatments.

      To gain further insight into the link between the DDR and Golgi-nuclear communication, the authors focused on the HR factor RAD51C, which also plays a key role during the replicative stress response. The authors noticed that RAD51 is significantly associated with the Golgi, in addition to its known nuclear pool. Interestingly, they demonstrated that doxorubicin triggers the ATM-dependent release of this Golgi-tethered RAD51C pool and its Importin-β-mediated import into the nucleus, where it forms repair-associated foci. They further identified Giantin as the Golgi scaffold that anchors RAD51C at steady state in this subcellular compartment and showed that its depletion leads to premature nuclear accumulation of RAD51C, formation of aberrant RAD51C foci lacking canonical HR markers, reduced ATM activation, elevated genomic instability, and increased cell proliferation.

      Together, this study revealed an underappreciated and functionally meaningful spatiotemporal level of regulation within the DDR, suggesting that the Golgi, rather than functioning solely as a trafficking organelle, acts as a platform that anchors, releases, and temporally controls the availability of key DNA repair factors in response to genotoxic stress. In particular, the authors demonstrated that the timely and regulated release of RAD51C from the Golgi is essential for maintaining genome stability and is dependent on canonical DDR signaling pathways, including ATM activation and Importin-β-mediated nuclear import.

      Overall Critique - This manuscript offers a novel and compelling perspective on the regulation of the DDR by positioning the Golgi as an active participant in the spatiotemporal control of DNA repair factors. By integrating multiple experimental layers, including a systematic localization screening, a sub-Golgi mapping, several dynamic redistribution assays, and functional perturbation read-outs, the authors built a strong and coherent case for a biologically meaningful Golgi-nucleus communication axis during the DDR. Therefore, the study is timely and highly relevant for the DNA repair field, with broader implications for our understanding of how subcellular organelles coordinate genome maintenance and cellular homeostasis.

      While the manuscript is clearly written and the figures are coherent and supportive of the main findings of the study, several issues should be addressed to ensure full interpretability and reproducibility.

      Major Comments

      1. Limited use of agents causing genotoxic stress - The authors report intriguing lesion-specific shifts in Golgi-nuclear redistribution, yet much of the mechanistic work relies heavily on doxorubicin, a pleiotropic drug that induces diverse forms of DNA damage beyond DSBs. Expanding the core analysis of the study to include a broader panel of mechanistically defined genotoxins (e.g., etoposide, camptothecin, neocarzinostatin, or ionizing radiation) would substantially strengthen the conclusion that the trafficking patterns reflect damage-type specificity rather than drug-specific off-target effects. Such broader analysis would also clarify whether Golgi-nucleus communication responds differentially to replication-associated breaks, Topo II-dependent lesions, oxidative stress, or crosslinks.
      2. Functional implications of RAD51C redistribution for HR efficiency - Although the study convincingly demonstrates a release of RAD51C from the Golgi and its subsequent nuclear foci formation, it remains unclear how this redistribution influences HR efficiency. Incorporating a functional HR assay (e.g., DR-GFP reporter, RAD51 filament assembly, or fork protection assays) would help determine whether Golgi-anchored RAD51C release is directly required for HR or instead primarily modulates upstream DDR signaling.

      In addition, the manuscript does not fully reconcile how Golgi-tethering of RAD51C fits with its well-established nuclear roles during replication stress, where timely availability of RAD51C is essential for fork stabilization and restart. 3. Specificity of Giantin-related phenotypes - The phenotypes observed upon Giantin depletion (e.g., increased micronuclei, comet tail moments, impaired ATM signaling, and elevated proliferation) could partially reflect a global dysfunction of the Golgi rather than RAD51C-specific tethering defects. Although co-depletion of RAD51C provides partial rescue, additional controls examining Golgi integrity, trafficking competence, or rescue with siRNA-resistant Giantin would help confirm specificity and distinguish direct from indirect effects. 4. Positioning of ATM in the Golgi-nuclear signaling - While ATM inhibition prevents RAD51C release, its spatial and mechanistic basis of this regulation remains obscure. It is not clear whether ATM acts locally at the Golgi, through cytoplasmic pools, or indirectly via nuclear feedback signaling. Clarifying or discussing this point in more depth would improve the mechanistic coherence of the proposed model. 5. RAD51C is examined in silo, without consideration for the BCDX2 complex - RAD51C is exclusively analyzed in isolation, despite its well-established function as part of the BCDX2 paralog complex (RAD51B-RAD51C-RAD51D-XRCC2). Because RAD51C does not normally operate as a standalone factor, it is unclear why only RAD51C, among all paralogs, would be subjected to Golgi tethering, ATM-dependent release, and Importin-β-driven nuclear import. This raises important mechanistic questions: Are other BCDX2 members also Golgi-associated? Do they undergo similar trafficking dynamics? Does Golgi tethering selectively regulate RAD51C, or does the complex translocate together? Addressing these points would greatly strengthen the biological plausibility and mechanistic coherence of the proposed model.

      Minor Comments

      1. Pathway-specific sub-Golgi localization patterns - The finding that DDR proteins map to distinct cis/trans Golgi subdomains is an interesting and potentially important observation. However, the dataset is limited to 15 proteins, making the proposed pathway-level trends (e.g., HR factors enriched in cis-Golgi; BER/MMEJ factors enriched in trans-Golgi) preliminary. Strengthening this conclusion by increasing the number of DDR proteins analyzed would help determine whether sub-Golgi compartmentalization contributes meaningfully to DNA repair pathway regulation.
      2. Is the Golgi-released RAD51C indeed the pool that enters the nucleus? The major assumption of the study is that the RAD51C population released from the Golgi upon DNA damage is the same pool that subsequently accumulates in the nucleus to form repair foci. While the imaging and fractionation data are consistent with this model, the study does not directly track or distinguish Golgi-derived RAD51C from cytoplasmic or pre-existing nuclear pools. Without a method to specifically label, pulse-chase, or track the Golgi-anchored fraction, it remains formally possible that nuclear RAD51C originates from other subcellular reservoirs.

      Significance

      General assessment - This study presents a novel and conceptually compelling view of the DNA damage response (DDR) by positioning the Golgi apparatus as an active regulator of the spatiotemporal availability of DNA repair factors. The strongest aspects of the work include its integration of a systematic immune-localization screening, a sub-Golgi compartment mapping, dynamic redistribution assays, and functional perturbations to build a coherent model of Golgi-nucleus communication during genotoxic stress. The mechanistic focus on RAD51C provides a clear case study linking organelle-level regulation to genome stability.

      Advance - To my knowledge, this is the first comprehensive demonstration that the Golgi can serve as a spatiotemporal coordination node for DDR proteins, including those involved in HR. The identification of a substantial pool of RAD51C, and reportedly other DDR factors, anchored within specific Golgi subdomains represents a significant conceptual advance. The demonstration that Golgi-tethered RAD51C is released in an ATM-dependent manner and subsequently participates in nuclear foci formation suggests a previously unrecognized organelle-level regulatory checkpoint in genome maintenance. This work therefore extends current models of the DDR by revealing a layer of intracellular coordination that bridges classical nuclear pathways with cytoplasmic organelle function.

      Audience - This study will be of strong interest to a specialized audience in the fields of DNA repair, genome stability, and cell biology, particularly those studying the spatial organization of repair pathways and intracellular stress signaling. It will also appeal to researchers investigating organelle biology, intracellular trafficking, and the broader coordination of cytoplasmic and nuclear responses to stress. Beyond these communities, the work may be relevant to cancer, as it suggests new mechanisms by which organelle perturbations or Golgi-associated scaffolding proteins could influence therapeutic responses or genomic instability.

      Reviewer expertise - Field of expertise: DNA repair, genome stability, organelle biology, cancer cell biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper describes the localisation of DNA repair proteins, which carry out their DNA repair function in the nucleus, to the cytoplasmic Golgi apparatus. Using the Human Protein Atlas to identify candidates, the authors use antibody localisation to show that a significant number of DNA repair proteins also localise at the Golgi. It appears that proteins involved in common DNA repair pathways localise to common regions of the Golgi. The Golgi-nucleus distribution of the DNA repairs proteins changes upon DNA damage, indicating a dynamic relationship. The authors focus on the DNA repair protein RAD51C and show that its loss from the Golgi and translocation to the nucleus upon DNA damage is mediated by the ATM kinase. Anchoring at the Golgi is shown to be mediated by the golgin giantin. A functional role for giantin in DNA repair is shown in knockdown studies, supporting a mechanism whereby Golgi anchoring of RAD51C, and possibly other DNA repair proteins, by giantin, is required to maintain proper control of DNA repair.

      The data are clear and support the authors' conclusions. The data are carefully quantified throughout. I found the text easy to read.

      Major points:

      1. To validate the Golgi localisation, KD using siRNA was used. It was deemed that a signal reduction of 25% was enough to indicate specific antibody labelling. This seems like a low number, and not very stringent. For some of the hits, expressing tagged versions of the proteins would greatly strengthen the Golgi assignment. This may not be possible for all, but for RAD51C would seem an important experiment.
      2. The total signal should be quantified for each DNA repair protein upon genotoxic stress, in addition to the Golgi to nucleus ratio. For many of the proteins it looks like the total signal goes down, which could influence interpretation.
      3. The study would benefit from live imaging of the Golgi to nucleus translocation of RAD51C. This would give a better indication of dynamics.
      4. The double depletion experiments suggest a functional relationship between giantin and RAD51C. But they do not formally show it. Experiments to more directly address the functional role of the interaction between these two proteins would strengthen the study.
      5. The Kaplan-Meier plots in Fig S9 seems to be quite selective in that only breast cancer is shown. Does giantin reduction correlate with poor prognosis in other cancers?

      Minor points: There are a few grammatical errors here and there. The figures do not appear in the correct order in the text, which makes the early parts of the paper a bit difficult to follow. Some of the figures don't seem to clearly match the text. For example, it is mentioned that RAD51C labelling was done with 3 different antibodies. I could not find this data.

      Significance

      This paper is novel and should be of significant interest to the field. It has important implications for how we think about the Golgi apparatus, and for how DNA repair pathways may be controlled. The pattern is clearly complex, with many DNA repair proteins localising to the Golgi, and some showing opposite dynamics. However, by focussing on RAD51C and giantin, the paper nicely demonstrates a novel mechanism for controlling DNA repair by these proteins.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02954

      Corresponding author(s): Ana-Maria Lennon-Duménil and Sandra Iden

      1. General Statements [optional]

      We thank the three reviewers for the time and caution taken to assess our manuscript, and for their constructive feedback that will help improve the study. We herewith provide a revised manuscript that addressed the key points raised by the reviewers.

      2. Point-by-point description of the revisions

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Summary: The manuscript by Delgado et al. reports the role of the actin remodeling Arp2/3 complex in the biology of Langerhans cells, which are specialized innate immune cells of the epidermis. The study is based on a conditional KO mouse model (CD11cCre;Arpc4fl/fl), in which the deletion of the Arp2/3 subunit ArpC4 is under the control of the myeloid cell specific CD11c promoter.

      In this model, the assembly of LC networks in the epidermis of ear and tail skin is preserved when examining animals immediately after birth (up to 1 week). Subsequently however LCs from ArpC4-deleted mice start displaying morphological aberrations (reduced elongation and number of branches at 4 weeks of age). Additionally, a profound decline in LC numbers is reported in the skin of both the ear and tail of young adult mice (8-10 weeks).

      To explore the cause of such decline, the authors then opt for the complementary in vitro study of bone-marrow derived DCs, given the lack of a model to study LCs in vitro. They report that ArpC4 deletion is associated with aberrantly shaped nuclei, decreased expression of the nucleoskeleton proteins Lamin A/C and B1, nuclear envelop ruptures and increased DNA damage as shown by γH2Ax staining. Importantly, they provide evidence that the defects evoked by ArpC4 deletion also occur in the LCs in situ (immunofluorescence of the skin in 4-week old mice).

      Increased DNA damage is further documented by staining differentiating DCs from ArpC4-deleted mice with the 53BP1 marker. In parallel, nuclear levels of DNA repair kinase ATR and recruitment of RPA70 (which recruits ATR to replicative forks) are reduced in the ArpC4-deleted condition. In vitro treatment of DCs with the topoisomerase II inhibitor etoposide and the Arp2/3 inhibitor CK666 induce comparable DNA damage, as well as multilobulated nuclei and DNA bridges. The authors conclude that the ArpC4-KO phenotype might stem, at least in part, from a defective ability to repair DNA damages occurring during cell division.

      The study in enriched by an RNA-seq analysis that points to an increased expression of genes linked to IFN signaling, which the authors hypothetically relate to overt activation of innate nucleic acid sensing pathways.

      The study ends by an examination of myeloid cell populations in ArpC4-KO mice beyond LCs. Skin cDC2 and cDC2 subsets display skin emigration defects (like LCs), but not numerical defects in the skin (unlike LCs). Myeloid cell subsets of the colon are also present in normal numbers. In the lungs, interstitial and alveolar macrophages are reduced, but not lung DC subsets. Collectively, these observations suggest that ArpC4 is essential for the maintenance of myeloid cell subsets that rely on cell division to colonize or to self-maintain within their tissue of residency (including LCs).

      MAJOR COMMENTS

      1. ArpC4 and Arp2/3 expression The authors argue that LCs from Arpc4KO mice should delete the Arpc4 gene in precursors that colonize the skin around birth. It would be important to show it to rule out the possibility that the lack of phenotype (initial seeding, initial proliferative burst) in young animals (first week) could be related to an incomplete deletion of ArpC4 expression. Also important would be to show what is happening to the Arp2/3 complex in LCs from Arpc4KO mice.

      __Response: __We thank this reviewer for the careful assessment of our manuscript. Regarding this specific comment, we would like to clarify that we do not expect ArpC4 to be deleted in LC precursors, as CD11c is only expressed once the cells have entered the epidermis. Instead, we expect the deletion to take place after birth around day 2-4 (Chorro et al., 2009). For this reason, we performed a deletion PCR of epidermal cells at postnatal day 7 (P7), a time at which the proliferative burst occurs. This analysis revealed CD11c-Cre-driven recombination in the ArpC4 locus (Fig. S2C). This experiment indicates that ArpC4 deletion does not alter LC proliferation and postnatal network formation.

      We apologize if this was not clear enough and have (1) revised the manuscript text to clearly explain the time at which ArpC4 will be deleted early during development when using the CD11c-Cre transgene, and (2) better emphasized the rationale for the deletion PCR (page 4).

      In the in vitro studies with DCs, the level of ArpC4 and Arp2/3 deletion at the protein level is also not documented.

      __Response: __We have previously analyzed the expression of ArpC4 in BMDCs in a recent study, confirming its loss in CD11c-Cre;ArpC4fl/fl cells at the protein level: Rivera et al. Immunity 2022; doi: 10.1016/j.immuni.2021.11.008. PMID: 34910930 (Fig. S2D). Therefore, in the current manuscript we only refer to that paper (Results, first paragraph).

      The authors explain that surface expression of the CD11c marker, which drives Arpc4 deletion, gradually increased during differentiation of DCs: from 50% to 90% of the cells. Does that mean that loss of ArpC4 expression is only effective in a fraction of the cells examined before day 10 of differentiation (e.g. in the RNA-seq analysis)?

      __Response: __The reviewer is correct, there is heterogeneity in CD11c expression, which is inherent of this DC culture model, implying that Arpc4 gene deletion will be partial. However, despite this, we were able to detect significant differences between the transcriptome of control and CD11c-Cre;ArpC4fl/fl DCs in early phases during differentiation, emphasizing that the phenotype of ArpC4 loss is robust.

      We have included a notion on this heterogeneity in the revised manuscript text (page 5).

      Intra-nuclear versus extra-nuclear activities of Arp2/3

      The authors favor a model whereby intra-nuclear ArpC4 helps maintaining nuclear integrity during proliferation of DCs (and possibly LCs). However, multiple pools of Arp2/3 have been described and accordingly, multiple mechanisms may account for the observed phenotype: i) cytoplasmic pool to drive the protrusions sustaining the assembly of the LC network and its connectivity with keratinocytes ; ii) peri-nuclear pool to protect the nucleus ; iii) Intra-nuclear pool to facilite DNA repair mechanisms e.g. by stabilizing replicative forks (the scenario favored by the authors).

      __Response: __The referee is correct, and this is discussed in our manuscript (page 11, upper paragraph): we cannot exclude that several pools of branched actin are influencing the phenotype we here describe.

      Unfortunately, we have previously tested several antibodies against ArpC4, but in our hands, and despite comprehensive optimization, they did not yield specific signals that would enable us to assess changes in subcellular localization in murine cells. Upon this reviewer's comment, we have now reassessed the available tools. We have tested an antibody against ArpC2 (Millipore, Anti-p34-Arc/ARPC2, 07-227-I-100UG), which however did not produce any specific signals either. Instead, we found an ArpC5 antibody that yielded a filamentous staining in the cytoplasm plus nuclear staining in distinct foci of control bone marrow-derived DCs, indicating that Arp2/3 components may in principle act in the nucleus in these cells (see revised Figure S3F,G).

      It is recommended that the authors try to gather more supportive data to sustain the intra-nuclear role. Documenting ArpC4 presence in the nucleus would help support the claim. It could be combined with treatments aiming at blocking proliferation in order to reinforce the possibility that a main function of ArpC4 is to protect proliferating cells by favoring DNA repair inside the nucleus.

      __Response: __We thank this reviewer for this very helpful comment. As outlined in the previous response, we have aimed at obtaining subcellular localization data for Arp2/3 complex components, and along with that study a potential intranuclear localization. Beyond that, in comparison to commonly cultured cell types, however, we face two hurdles addressing the nuclear Arp2/3 role in full: 1) Due to poor transduction rates and epigenetic silencing, we cannot sufficiently express exogenous constructs such as ArpC4-NLS in DCs to assess the subcellular localization of Arp2/3 complex components. 2) We have performed preliminary tests to block proliferation in DCs, using the cyclin D kinase 1 inhibitor RO3306 at different concentrations and incubation times during DC differentiation. Unfortunately, most cells were found dead after treatment. Further lowering the inhibitor concentrations (below 3.5uM) will likely not block the cell cycle, rendering this approach unsuited.

      As mentioned above, we have tested the suitability of additional antibodies directed against Arp2/3 complex components to assess their subcellular localization, with the aim to discriminate peripheral cytoplasmic vs. perinuclear vs. intranuclear localization. These new data that report nuclear and cytoplasmic ArpC5 in control DCs are now presented in revised figure S3F,G. In addition, we toned down our current phrasing in the discussion, also emphasizing the possibility that cytoplasmic or perinuclear pools of the complex may indirectly help maintain the integrity of the genome in LCs (page 12).

      Nuclear envelop ruptures

      The nuclear envelop ruptures are not sufficiently documented (how many cells were imaged? quantification?). The authors employ STED microscopy to examine Lamin B1 distribution. The image shown in Figure 4A does not really highlight the nuclear envelop, but rather the entire content. Whether it is representative is questionable. We would expect Lamin B1 staining intensity to be drastically reduced given the quantification shown in Figure 3D. In addition, although the authors have stressed in the previous figure that Arpc4-KO is associated with nucleus shape aberrations, the example shown in Figure 4A is that of a nucleus with a normal ovoid shape.

      It is recommended to quantify the ruptures with Lap2b antibodies (or another staining that would better delineate the envelop) in order to avoid the possible bias due to the reduced staining intensity of Lamin B1.

      __Response: __NE ruptures are quantified by imaging NLS-GFP-expressing DCs in microchannels to visualize leakage of their nuclear content (Fig. 4B,C). The STED image mentioned by the referee (Fig. 4A,D) was only shown to further illustrate examples of NE ruptures, here using Lamin B1 as an immunofluorescence marker for the NE. We do agree with the reviewer that it was not chosen optimally to represent the ArpC4KO phenotype regarding nuclear shape and Lamin B1.

      We have now provided representative examples of nuclear illustrations of the ArpC4KO phenotype vs. control cells. In addition, we performed STED microscopy of Lap2b immunostained DCs as suggested by the referee (revised Fig. 4A,B).

      A missing analysis is that of nuclear envelop ruptures as a function of nucleus deformations.

      __Response: __As stated in the manuscript (page 5, third paragraph), the morphology of DCs is quite heterogeneous. As mentioned above, nuclear rupture events were quantified by live-imaging of NLS-GFP expressing DCs, enabling the tracing of rupture events. Live imaging is the only robust manner to measure nuclear membrane rupture events as they are transient due to rapid membrane repair (Raab et al. Science 2016). The NLS-GFP label itself, however, is not accurate enough to also quantify nuclear deformations. The latter therefore was quantified after cell fixation, using DAPI and/or immunostaining for NE envelope markers (Figures 3 and S3).

      As suggested by the referee, we have now quantified nuclear deformations using Lap2b staining of the nuclear envelope (revised Fig. 4A,B), demonstrating reduced circularity and increased elongation of ArpC4KO nuclei.

      Fig 4B-C: same frequency of Arpc4-KO and WT cells displaying nuclear envelop ruptures in the 4-µm channels; however image show a rupture for the Arpc4-KO and no rupture for the WT cells (this is somehow misleading). Are ruptures similar in Arpc4-KO and WT cells in this condition?

      __Response: __We apologize for choosing an image that does not represent well our quantification, our mistake. The revised manuscript now contains an image that better reflects our quantification (revised Fig. 4C).

      Fig 4D-E: is their a direct link between nuclear envelop ruptures and ƴH2A.X?

      __Response: __At present, we can only correlate the findings of increased gH2Ax and elevated events of nuclear envelope ruptures in ArpC4KO DCs. Rescue experiments are very difficult to impossible in DCs (e.g. restoring Lamin A/C and B1 levels in the KOs and subsequently assessing the amount of DNA damage). While we are afraid that we cannot address a potential link between NE ruptures and DNA damage by experiments in a manner feasible within this manuscript's revision, we have discussed this interesting aspect based on observations in immortalized cell culture systems (page 10). However, we would like to note that this was indeed shown for different cell types in Nader et al. Cell 2021. This effect results from access of cytosolic nuclease Trex1 to nuclear DNA. We have added this point in our revised manuscript (page 11).

      Interesting (but optional) would be to understand what is happening to DNA, histones? Is their evidence for leakage in the cytoplasm?

      __Response: __This is an interesting question. To assess this, we have now performed immunostainings for double-stranded DNA in the cytoplasm, following published protocols (Spada et al., 2019; PMID 31727239). This analysis revealed significantly increased cytoplasmic dsDNA in ArpC4KO DCs (revised Fig. 4G,H), indeed suggesting leakage into the cytoplasm following ArpC4 loss.

      RNA seq analysis

      The RNA-seq analysis suffers from a lack of direct connection with the rest of the study. The extracted molecular information is not validated nor further explored. It remains very descriptive. The PCA analysis suggests a « more pronounced transcriptomic heterogeneity in differentiating Arpc4KO DCs ». However it seems difficult to make such a claim from the comparison of 3 mice per group. In addition, such heterogeneity is not seen in the more detailed analysis (Fig 5F). The authors claim that « day 10 control and Arpc4KO DCs showed no to very little differences in gene expression, in contrast to cells at days 7-9 of differentiation ». This is not obvious from the data displayed in the corresponding figure. In addition, it is not expected that cells that may take a divergent differentiation path at days 7-9 may would return to a similar transcriptional activity at day 10.

      A point that is not discussed is that before day 10 of DC differentiation, Arpc4 KO is expected to only occur in about 50% of the cell population. This is expected to impact the RNA-seq analysis.

      Not all clusters have been exploited (e.g. cluster 3 elevated, cluster 6 partly reduced). I suggest the authors reconsider their analysis and analysis of the RNA-seq analysis (or eventually invest in complementary analysis).

      __Response: __Despite a comprehensive analysis of the different transcriptomes of control and ArpC4 mutant cells during DC differentiation, we decided to focus the presentation and discussion of our RNAseq results on the most notable findings. Of these, the elevated innate immune responses in ArpC4KO DCs (Fig. 5E,H) caught our particular attention, as this seemed highly meaningful in light of DC and LC functions.

      As suggested by the referee, in the revised manuscript, we better connected the RNAseq data to the other cellular and molecular analyses shown, complementing these results by investigating the potential involvement of innate immune responses in the ArpC4KO phenotype (page 7).

      What causes the profound numerical drop of LC in the epidermis?

      A major open question is what causes the massive drop of LCs. Although differentiating Arpc4KO DCs start accumulating DNA damage upon proliferation, they succeed in progressing through the cell cycle. There is even a slightly elevated expression of cell cycle genes at day 7 of differentiation in the DC model.

      Only a trend for increased apoptosis is observed in ear and tail skin. It would be important to provide complementary data documenting increased death (or aberrant emigration?) of LCs in the 4-8 week time window.

      __Response: __We agree with the reviewer that this is an important question. We exclude that elevated emigration causes the decline of LCs in ArpC4KO epidermis, as ArpC4-mutant LCs are significantly reduced (and not increased) in number in skin-draining lymph nodes (Fig. 7E). To assess whether increased cell death contributed to LC loss, we have tried to identify LCs that are just about to die. As the reviewer noted, we could only observe a trend of apoptosis-positive LCs in mutant epidermis. We assume that this is because of a quick elimination of compromised LCs following DNA damage, with only a short time passing until LCs with impaired genome integrity will be cleared from the system, making it very difficult to detect gH2Ax-positive cells that are positive for markers of cell death.

      Despite these limitations to detect DNA-damage-positive but viable LCs in vivo, we have now collected 6-week-old mice to analyze LC numbers and apoptosis (cleaved Caspase-3), complementing our data derived from 7-day and 4-week-old mice (Figures S2A,B,E,F). While we did observe the expected trends for reduced LC numbers and increased DNA damage of ArpC4KO LCs as seen in adolescent mice, we were unable to detect a significant increase of apoptotic LCs in ArpC4KO animals at 6 weeks of age (revised Suppl. Fig. 4A-D). We assume that this is due to the outlined short-lived stages of apoptotic cells. Alternatively, it seems possible that ArpC4KO LCs were lost via cell death pathways other than apoptosis, a matter which we feel is beyond the scope of this manuscript. Accordingly, we revised our discussion to include this possibility (page 11-12).

      Functional consequences

      Although the study reports novel aspects of LC biology, the consequence of ArpC4 deletion for skin barrier function and immunosurveillance are not investigated. It would seem very relevant to test how this model copes with radiation, chemical and/or microorganism challenges.

      __Response: __We fully agree with this reviewer that this is a very interesting point. Therefore, next to assessing the steady-state circulation of LCs and DCs, we also addressed the consequence of ArpC4 loss for LC function in chemically challenged skin: we performed skin painting experiments using the contact sensitizer fluorescein isothiocyanate (FITC), diluted in the sensitizing agent dibutyl phthalate (DBP), to detect cutaneous-derived phagocytes within draining lymph nodes. These experiments revealed that migration of ArpC4KO LCs (as well as of ArpC4KO DCs) to skin-draining lymph nodes was impaired (Fig. 7C-E), confirming an in vivo role of ArpC4 for immune cell migration to lymphatic organs following a chemical challenge. The revised manuscript contains a more detailed note to properly explain the FITC painting experiment and highlight its importance (page 9).

      MINOR COMMENTS:

      1- Figure 1D

      Gating strategy: twice the same empty plots. The content seems to be missing... Does this need to be shown in the main figure?

      __Response: __We apologize for this problem that might be due to file conversion of PDF reader software. In our PDF versions (including the published bioRxiv preprint) we do see the data points; however, we have earlier experienced incomplete FACS plots during manuscript preparation.

      For the revised manuscript, we double-checked the results after converting the figures into PDFs. Here is a screenshot:

      2- Figure 2

      Best would be to keep same scale to compare P1 and P7 (tail skin, figure 2A)

      Response: We have replaced the examples with micrographs of comparable scale (revised Fig. 2A).

      Overlay of Ki67 and MHC-II does not allow to easily visualize the double-positive cells (Fig 2C)

      Response: We now provided single-channel image next to the merged view and improved the visualization of double-positive cells (revised Fig. 2C).

      Quality of Ki67 staining different for Arpc4-KO (less intense, less focused to the nuclei): a technical issue or could that reflect something?

      Response: We thank the reviewer for spotting this. We have re-assessed all Ki67 micrographs and noted that the originally chosen examples indeed were not fully representative. We have selected more representative examples of Ki67-positive cells in control and mutant tissues, reflecting no difference in the principal nature of Ki67 staining (revised Fig. 2C).

      Fig 2C: Panels mounted differently for ear and tail skin (different order to present the individual stainings, Dapi for tail skin only).

      Response: We agree and have harmonized the sequence of panels in figure 2 accordingly (revised Fig. 2C).

      3- LC branch analysis (Fig 1 and 2)

      While Fig 1 indicates that ear skin LCs form in average twice as few branches as tail skin LCs (3-4 versus 8-9 branches per cell), Fig 2 shows the opposite (10-12 versus 6-7 branches per cell).

      Is this due to a very distinct pattern between the 2 considered ages (4 weeks versus 8-10 weeks)? Could the author double-check that there is no methodological bias in their analysis?

      Response: We thank the reviewer for hinting to this apparent inconsistency. Indeed, our initial analysis suffered from a bias in detecting LC dendrites, as the tissue cellularity and overall morphology significantly differs between 4-week-old and adult animals: In adult animals, the immunostainings showed a higher baseline background signal for the skin epithelium compared to P28. We had noted this beforehand and had adjusted the imaging pipeline accordingly, with a more stringent thresholding to eliminate background signals in the case of adult tissues. While we were able to detect the described ArpC4 phenotype, this strategy resulted in a reduced ability to detect dendrites (both in control and mutant tissues), explaining the seemingly reduced number of dendrites in adult vs. 4-week-old tissues.

      We have double-checked both the micrographs and the corresponding quantifications and did not identify errors. Instead, our assumption -that a too high stringency for background reduction in adults caused the discrepancy- turned out correct. We now performed detailed analyses of LC morphology at 4-week and adult stages by confocal microscopy, using a 63x objective rather than a 40x objective as done previously. The new results confirm that with this approach the number of LC dendrites across these ages are largely comparable, while the phenotypes of ArpC4 loss are retained. The revised manuscript now contains a completely new analysis based on image acquisition with a 63x objective (revised Fig. 1E-G).

      4- Fig 3 E-G

      How many animals were examined (n=5)? Reproducible accros animals? Why was it done with 4-week animals (phenotype not complete? Event occurring before loss in numbers...)

      Response: As mentioned in the figure legend for Fig. 3F we have analysed N = 4 control and N= 5 KO mice. We chose the 4-week time-point as this was the stage when the loss of LCs first became apparent (even though non-significant at this age). We aimed to learn whether changes in nuclear morphology and nuclear envelope markers represented early molecular and cellular events following ArpC4 loss. Compared to later stages, this strategy poses a reduced risk to detect indirect effects of ArpC4 loss. We added a notion in the revised manuscript text to clarify this (page 5).

      Staining Lamin A/C globally more intense in the Arpc4-KO epidermis (also seems to apply to the masks corresponding to the LCs). Surprising to see that the quantification indicates a major drop of Lamin A/C intensity in the LCs.

      Response: We again thank the reviewer for this careful assessment. As with many tissue stainings, there is inter-sample variability. We have now revisited the micrographs and did not find a significant global reduction of Lamin A/C in the entire epidermis (including keratinocytes/KCs). The drop of Lamin A/C intensity is restricted to ArpC4KO LCs -and not KCs- and in line with the reduced Lamin A/C expression data in DCs (Fig. 3C,D). The revised manuscript now shows more representative examples (revised Fig. 3E).

      Legend Fig 4D replace confocal microscopy by STED microscopy

      Response: We replaced "confocal microscopy" by "STED microscopy".

      6- Figure 4F

      Intensity/background of γH2Ax staining very distinct between the 2 micrographs shown for WT and Arpc4-KO epidermis.

      Response: We revisited the micrographs and now selected more representative examples (revised Fig. 4I).

      7- Figure 7C, F, H

      Gating strategies: would be better to harmonize the style of the plots (dot plots and 2 types of contour plots have been used...)

      Response: We agree and provided a harmonized plot illustration in the revised manuscript (revised Fig. 7).

      8- Figure 7H

      Legend of lower gating strategy seems to be wrong (KO and not WT).

      Response: We thank the reviewer for pointing out this mistake. The revised Figure 7H shows a corrected figure display.

      Reviewer #1 (Significance (Required)):

      Strengths: the general quality of the manuscript is high. It is very clearly written and it contains a very detailed method section that would allow reproducing the reported experiments. This work entails a clear novelty in that it represents the first investigation of the role of ArpC4 in LCs. It opens an interesting perspective about specific mechanisms sustaining the maintenance of myeloid cell subsets in peripheral tissues. This work is therefore expected to be of interest for a large audience of cellular immunologists and beyond. Challenging skin function with an external trigger would lift the relevance for a even wider audience (see main point 6).

      __Response: __see main point 6.

      Limitations: in its current version the manuscript suffers from a lack of solidity around a few analysis (see main points on ArpC4 and Arp2/3 protein expression, nuclear envelop rupture analysis,...). It also tends to formulate a narrative centered on the ArpC4 intra-nuclear function that is not definitely proven.

      The field of expertise of this reviewer is: cellular immunology and actin remodeling.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      SUMMARY This is a study in experimental mice employing both in vitro and, importantly, in vivo approaches. EPIDERMAL LANGERHANS CELLS serve as a paradigm for the maintenance of homeostasis of myeloid cells in a tissue, epidermis in this case. In addition to well known functions of the ACTIN NETWORK in cell migration, chemotaxis, cell adherence and phagocytosis the authors reveal a critical function of actin networks in the survival of cells in their home tissue.

      Actin-related proteins (Arp), specifically here the Arp2/3 complex, are necessary to form the filamentous actin networks. The authors use conditional knock-out mice where Arpc4 (an essential component of the Arp2/3 complex) is deleted under the control of CD11c, the most prominent dendritic cell marker which is also expressed on Langerhans cells. In normal mice, epidermal Langerhans cells reside in the epidermis virtually life-long. They initially settle the epidermis around and few days after birth an establish a dense network by a burst of proliferation and then they "linger on" by low level maintenance proliferation. In the epidermis of Arpc4 knock-out mice Langerhans cells also start off with this proliferative burst but, strikingly, they do not stay but are massively reduced by the age of 8-12 weeks.

      The analyses of this decline revealed that

      -- the shape (number of nuclear lobes) and integrity of cell nuclei was compromised; they were fragile and ruptured to some degree when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- DNA damage, as detected by staining for gamma-H2Ax or 53BP1 accumulated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- recruitment of DNA repair molecules was inhibited when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- gene signatures of interferon signaling and response were increased when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- in vivo migration of dendritic cells and Langerhans cells from the skin to the draining lymph nodes in an inflammatory setting (FITC painting of the skin) was impaired when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- the persistence of the typical dense network of Langerhans cells in the epidermis, created by proliferation shortly after birth, is abrogated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing. Importantly, this was not the case for myeloid cell populations that settle a tissue without needing that initial burst of proliferation. For instance, numbers of colonic macrophages were not affected when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing.

      Thus, the authors conclude that the Arp2/3 complex is essential by its formation of actin networks to maintain the integrity of nuclei and ensure DNA repair thereby ascertaining the maintenance proliferation of Langerhans cells and, as the consequence, the persistence of the dense epidermal netowrk of Langerhans cells.

      Up-to-date methodology from the fields of cell biology and cellular immunology (cell isolation from tissues, immunofluorescence, multiparameter flow cytometry, FISH, "good old" - but important - transmission electron microscopy, etc.) was used at high quality (e.g., immunofluorescence pictures!). Quantitative and qualitative analytical methods were timely and appropriate (e.g., Voronoi diagrams, cell shape profiling tools, Cre-lox gene-deletion technology, etc.). Importantly, the authors used a clever method, that they had developed several years ago, namely the analysis of dendritic cell migration in microchannels of defined widths. Molecular biology methods such as RNAseq were also employed and analysed by appropriate bioinformatic tools.

      MAJOR COMMENTS:

      • ARE THE KEY CONCLUSIONS CONVINCING? Yes, they are.

      • SHOULD THE AUTHORS QUALIFY SOME OF THEIR CLAIMS AS PRELIMINARY OR SPECULATIVE, OR REMOVE THEM ALTOGETHER? No, I think it is ok as it stands. The authors are wording their claims and conclusions not apodictically but cautiously, as it should be. They point out explicitely which lines of investigations they did not follow up here.

      • WOULD ADDITIONAL EXPERIMENTS BE ESSENTIAL TO SUPPORT THE CLAIMS OF THE PAPER? REQUEST ADDITIONAL EXPERIMENTS ONLY WHERE NECESSARY FOR THE PAPER AS IT IS, AND DO NOT ASK AUTHORS TO OPEN NEW LINES OF EXPERIMENTATION. I think that the here presented experimental evidence suffices to support the conclusions drawn. No additional experiments are necessary.

      • ARE THE SUGGESTED EXPERIMENTS REALISTIC IN TERMS OF TIME AND RESOURCES? IT WOULD HELP IF YOU COULD ADD AN ESTIMATED COST AND TIME INVESTMENT FOR SUBSTANTIAL EXPERIMENTS. Not applicable.

      • ARE THE DATA AND THE METHODS PRESENTED IN SUCH A WAY THAT THEY CAN BE REPRODUCED? Yes, they are.

      • ARE THE EXPERIMENTS ADEQUATELY REPLICATED AND STATISTICAL ANALYSIS ADEQUATE? Yes.

      __Response: __We thank the reviewer very much for assessing our work, for providing constructive suggestions, and for acknowledging the strength of the study.

      MINOR COMMENTS:

      • SPECIFIC EXPERIMENTAL ISSUES THAT ARE EASILY ADDRESSABLE. None

      • ARE PRIOR STUDIES REFERENCED APPROPRIATELY? Essentially yes. Regarding the reduction / loss of the adult epidermal Langerhans cell network, it may be of some interest to also refer to / discuss to another one of the few examples of this phenomenon. There, the initial burst of proliferation is followed by reduced proliferation and increased apoptosis when a critical member of the mTOR signaling cascade is conditionally knocked out (Blood 123:217, 2014).

      Response: We thank the reviewer for pointing out this important work. We now included the paper into the revised manuscript (page 12).

      • ARE THE TEXT AND FIGURES CLEAR AND ACCURATE? Yes they are. Figures are well arranged for easy comprehension.

      • DO YOU HAVE SUGGESTIONS THAT WOULD HELP THE AUTHORS IMPROVE THE PRESENTATION OF THEIR DATA AND CONCLUSIONS?

      1. Materials & Methods. The authors write, regarding flow cytometry of epidermal cells: "Briefly, 1cm2 of back skin from 8-14 weeks old female wild-type and knockout littermates was dissociated in 0.25 mg/mL Liberase (Sigma, cat. #5401020001) and 0.5 mg/mL DNase (Sigma, cat.#10104159001) in 1 mL of RPMI (Sigma) and mechanically disaggregated in Eppendorf tubes, FOLLOWED BY INCUBATED for 2 h at 37 {degree sign}C." Followed by what?

      __Response: __We apologize for this mistake. The text should read: "... followed by incubation for 2 h at 37 {degree sign}C and filtration using a 100µm cell strainer. Thereafter, blocking was performed in PBS supplemented with 0.5% bovine serum albumin and 2 mM EDTA at 4 {degree sign}C, followed by antibody labeling of cells in single cell suspension". The text has been corrected in the revised manuscript (page 17).

      Materials & Methods. BMDC electronmicroscopy. What is "IF". Please specify.

      __Response: __We also regret this mistake in the method text. It should read: "... For electron microscopy analysis, after PDMS removal, cells were fixed using 2.5% glutaraldehyde ...". The text has been corrected in the revised manuscript (page 21).

      RESULTS in gene expression analyses. The authors observe some increase in apoptosis (as detected by cleaved-Caspase-3 staining). Is this observation in immunofluorescence also evident in the RNAseq data (where the IFN changes were seen), i.e., in Figure 5.

      __Response: __We have checked our RNAseq data regarding any changes in apoptosis-related genes, however, apart from a few transcripts that are upregulated in ArpC4KO cells, we do not find a pronounced enrichment of apoptosis-related genes. We included volcano plot data in revised Suppl. Fig. 5H to share these DEGs.

      Figure 7 F and G. Perhaps the authors may want to swap upper and lower panels in F or G, so that macrophage FACS plots and bar graphs are in the same row - ob, obiously, DC plots and bars likewise.

      __Response: __We agree and have harmonized the panel sequence in the revised manuscript (revised Fig. 7F, G; panels swapped in G, display harmonized).

      Figure 7H. "Gating strategy in ArpC4WT Lung (previously gated in Live CD45+ cells)" - The lower row is knock-out, not WT. This is indicated correctly in the legand, but in the figure both rows are labeled as WT.

      __Response: __Indeed, the legend information is correct, but the corresponding figure panel is incorrect. We now provide a corrected version (revised Fig. 7H).

      The reference by Park et al. 2021 is missing in the list.

      __Response: __We have added the reference to the revised bibliography.

      Figure 1D. Sure, the bar graphs are meant to say "CD11c"? The FACS plots show "CD11b".

      __Response: __We have checked the panels and corrected where necessary (revised fig. 1D).

      As to cDC1. In Figure 1D the FACS plot shows an absence of CD103+ cDC1 cells. In contrast, In Figure 7A-left side panel, there is not difference in cDC1 cells between WT and KO mice. Is therefore the flow cytometry plot in Figure 1D not representative regarding cDC1 cells? Correct?

      __Response: __The reviewer is correct about this apparent discrepancy. We have not observed differences in the control vs. ArpC4KO cDC1 population, hence Figure 7 represents our findings. For figure 1, we have by mistake chosen a non-representative plot, with the aim of illustrating the gating strategy. We apologize for this mistake and now provide a corrected and representative FACS plot figure in the revised manuscript (revised Fig. 1D).

      Reviewer #2 (Significance (Required)):

      • DESCRIBE THE NATURE AND SIGNIFICANCE OF THE ADVANCE (E.G. CONCEPTUAL, TECHNICAL, CLINICAL) FOR THE FIELD. This is a conceptual advance. It adds a big step to our understanding of how immune cells in tissues (which all come from the bone marrow or are seeded before birth from embryonal hematopoietic organs such as yolk sac and fetal liver) can remain resident in these tissues. For cell types such as Langerhans cells, which establish their final population density within their tissues of residence, the presented finding convincingly buttress the role of proliferation and thereby the role for the actin-related protein complex 2/3 (Arp2/3).

      • PLACE THE WORK IN THE CONTEXT OF THE EXISTING LITERATURE (PROVIDE REFERENCES, WHERE APPROPRIATE). While we know much about actin-related proteins (Arp), as correctly cited by the authors, this knowledge is derived mostly from in vitro studies. The submitted study translates the findings to an in vivo setting for the first time.

      • STATE WHAT AUDIENCE MIGHT BE INTERESTED IN AND INFLUENCED BY THE REPORTED FINDINGS. Skin immunologists foremost, but these findings are of interest to the entire community of immunologists, but also cell biologists.

      • DEFINE YOUR FIELD OF EXPERTISE. My expertise is in skin immunology, in particular skin dendritic cells including Langerhans cells.

      We acknowledge the referee for their positive assessment of our manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The manuscript identifies a role of the Arp2/3 complex, the major regulator of actin branching in cells, for controlling the homeostasis of murine Langerhans cells (LCs), a specialized subset of dendritic cells in the skin epidermis. The findings of the study are based on the analysis of CD11c-Cre Arpc4-flox mice, a conditional knockout mouse model, which interferes with Arp2/3 function in Langerhans cells and other CD11c-expressing myeloid cells, e.g. dendritic cell or macrophage subsets. By using immunofluorescence and flow cytometry analysis of epidermis and skin tissues, the authors provide a detailed analysis of LC numbers at different developmental stages (postnatal day 1, 7, 28, and adult mice) and demonstrate that Arpc4-deficiency does not interfere with the establishment of LC networks until postnatal day 28. However, LCs in ear and tail skin are substantially reduced in Arpc4-deficient mice at 8-12 weeks of age. In parallel to their in vivo model, the authors analyze cultures of bone marrow-derived dendritic cells (BMDCs) from control and CD11c-Cre Arpc4-flox mice. Arpc4-deficiency in BMDCs, which develop over 8-10 days in culture, results in nuclear shape and lamina abnormalities, as well as signs of increased DNA damage. Aspects of this phenotype are also detected in Langerhans cells in epidermal preparations. Transcriptomic analysis of BMDCs highlights a gene signature of increased expression of the interferon response pathway and alterations in cell cycle regulation. Arpc4-deficient BMDCs show increased expression of DNA damage markers and reduced expression of certain DNA repair factors. Based on these correlative findings from the BMDC model, the authors conclude that the decline in LC numbers might develop from the accumulation of DNA damage over time, which the authors phrease "pre-mature aging of Langerhans cells". Lastly, the authors show a heterogenous picture how Arp2/3 depletion affects distinct DC populations in CD11c-Cre Arpc4-flox mice. While some tissue-resident DC subsets appear normal in numbers, others are declined in numbers in the tissue. This may be related to their proliferation potential in tissues.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      1) The authors claim that Arpc4 deficiency selectively compromises myeloid cell populations that rely on proliferation for tissue colonization (Figure 7). The presented data might give hints for such a general hypothesis, but solid experimental proof to prove this is lacking. When comparing myeloid cell subsets from foru different irgans, the authors refer to published data that some dendritic cell subsets are more proliferative in tissues than others and that CD11cCre Arpc4-flox mice appear to have reduced cell numbers in these populations. However, the presented data are purely correlative and no functional connection to cell proliferation has been made to the phenotypes. While some dendritic cell subsets (Langerhans cells, alveolar DCs) show reduced cell numbers in CD11cCre Arpc4-flox mice, other myeloid cell cells subsets are unaffected (e.g. dermal cDC1 and 2, colon macrophages).There could be plenty of other reasons that might underly the observed discrepancies between these cell subsets, e.g. Arp2/3 knockout efficiency and myeloid cell turnover in the tissue are just two examples, which have not been taken into consideration. Direct measurement of cell proliferation, e.g. BrdU labeling, and the observed phenotype would be missing to make such claims. The data could either be removed. Experimentally addressing these points could take 3-6 months.

      Response and revisions: We thank the referee for bringing this point. We agree that these results give hints that support our conclusion but that do not address this question directly. However, we would like to emphasize that our conclusion is based on studies from others showing that alveolar macrophages self-maintain themselves through proliferation (Bain et al. Mucosal Immunology 2022). In contrast, it has been reported that a large fraction of colonic macrophages are derived from monocytes that are being recruited to the gut through life (Bain et al. Mucosal Immunity 2023). We now added these points in our revised manuscript. Moreover, during revision we confirmed deletion of the ArpC4 allele by genotyping PCR of FACsorted colon macrophages (revised Suppl. Fig. 7C and revised methods). In addition, we stress that we do not exclude that different intracellular Arpc4-dependent processes might contribute to the phenotypes observed (beyond maintenance of DNA integrity) (page 11). This will help mitigate our conclusions and leave open the potential implication of alternative mechanisms.

      2) The authors claim that DC subsets (e.g. dermal cDCs), which develop from pre-DCs, are not affected by Arp2/3 depletion (Figure 7, although the FACS plot in Fig. 1D would suggest a different picture for cDC1). This is surprising in light of the data with bone marrow-derived DCs (BMDCs), the major in vitro model of this study, which develop from CDPs that again develop from pre-DCs. BMDCs did show aberrant nuclei and signs of DNA damage. How would the authors then explain the discrepancies of the BMDC model with DC subsets, where the authors feel that the pre-DC origin explains the phenotypic difference? This is a general concern of the data interpretation and conclusions.

      __Response: __We thank the referee for bringing this point that indeed requires clarification. Two non-exclusive hypotheses could explain this apparent discrepancy:

      • The ontogeny of bone-marrow-derived DCs: Depending on the protocol used, there might be variations in the precursors DCs develop from. We use one of the first protocols, which was pioneered by Paola Ricciardi-Castagnoli lab (Winzler et al. Exp.Med. 1997). It relies on a supernatant from J558 cells transfected with GMCSF, which contains additional cytokines and mainly generate DC2-like DCs. Langerhans cells are closer to DC2s, which resemble more macrophages than DC1s. We thus chose this protocol rather than the protocols that use Flt3-L, which produce both DC1s and DC2s developed from common dendritic-cell precursors (CDPs). It is thus possible that our BM-derived DCs develop from other precursor cells closer to monocyte precursors.
      • As shown in Figure 5C, kinetics of acquisition of CD11c expression, and thus deletion of the Arpc4 gene, might be distinct in vivo and in vitro. In vivo, as stated in our manuscript, DCs acquire CD11c as preDCs and undergo few rounds of divisions after. In vitro, as shown by our cycling experiments, BM-derived DCs continuously cycle, so they will keep dividing after having acquired CD11c (around day 7) and deleting the Arpc4 gene. We now mentioned these hypotheses in the discussion of our revised manuscript to explain the apparent contradiction raised by the referee (pages 10 and 12).

      3) In line with point 2, the authors never show that BMDCs show reduced proliferation, reduced cell numbers or increased cell death in Arpc4-deficient cell cultures, as a consequence of the detected DNA damage and impaired DNA repair. In fact, Figure 5C even shows that cell growth rates between control and KO are equal. This is a major mismatch in the current study. Since the authors use the BMDC model to explain the declining cell numbers in Langerhans cells (which derive from fetal liver cells), this phenotype is not mirrored by the BMDC culture and it remains open whether the observed changes in nuclear DNA damage and repair are indeed directly linked to the observed phenotype of declining cell numbers in the tissue. These aspects require argumentation why cell growth is unchanged in KO cells. Additional experiments addressing these points with sufficient biological replicates (cultures from different mice) could take 2-3 months, including preparation time.

      __Response____: __We thank the referee for bringing this point, which was probably not properly discussed in the first version of our manuscript. Indeed, Arpc4KO BM-derived DCs do not show the premature cell death phenotype observed in LCs in vivo, as stated by the referee. There are at least two putative non-exclusive explanations for this. First, unlike LCs, which are long-lived cells, BM-derived DCs can be kept in culture for only 10-12 days. As DNA damage-induced cell death takes time (LCs only start to die about 3-4 weeks after network establishment), the lifespan of BM-DCs could simply not be long enough to observe this phenotype. Second, in the epidermis, LCs are physically constrained and continuously exposed to diverse signals that might increase their sensitivity to DNA damage and thereby induction of subsequent cell death.

      We have attempted to clarify this point in our revised manuscript by providing putative explanations for the death phenotype of Arpc4-deficient LCs not being observed in BM-derived DCs. We further explained that this does not invalidate this cellular model as it was used to raise hypotheses on the putative role played by ArpC4 in myeloid cells, i.e. maintenance of DNA integrity, which was then confirmed in vivo (ArpC4KO LCs do indeed display DNA damage in the epidermis) (page 12). Without this "imperfect cellular model", we would have probably not been able to uncover this novel function of Arp2/3 in immune cells.

      4) The authors refer to a "pre-mature aging" phenotype of Arpc4-deficient BMDCs and LCs, based on reductions in Lamin B, Lamin A and increases in gH2AX and 53BP1. I find this term and overstatement of the current data and suggest that other markers for cell senescence, such as p53, Rb, p21 and b-Galactosidase are then also used to make such strong claim on "aging" and cell senescence. Experimentally addressing this point with sufficient biological replicates could take 2-3 months, including preparation time.

      __Response: __We now assessed senescence signatures in our RNAseq analysis of Arpc4WT and Arpc4KO-derived DCs, as suggested by the referee. These results revealed several senescence-related DEGs upregulated in ArpC4KO DCs, such as serpinB2 (revised Suppl. Fig. 5G, volcano plots) as well as a general enrichment of a senescence-related signature when using the senescence gene set (Aging Atlas Consortium, 2021; revised Fig. 5I). These data support our notion of a premature aging phenotype following ArpC4 loss in BMDCs.

      5) The study does not provide a mechanism how the Arp2/3 complex would mediate the observed effects on DNA damage and repairs has not been addressed in the cell model, and only potential scenarios from other non-myeloid cell lines are discussed. It remains unclear whether the observed phenotypes in Arpc4-depleted myleoid cells relate to the direct nuclear function of Arp2/3 or the cytosolic function of Arp2/3, including its roles in cytoskeletal regulation that may have secondary effects on the nuclear alterations. This is a general concern of the presented data, data on mechanism might require more than 6 months.

      __Response____: __The referee is correct: Our manuscript shows that Arp2/3 deficiency in specific myeloid cells impacts on their survival in vivo and proposes that this could result at least in part from impaired maintenance of DNA integrity in these cells. We do not know whether this also applies to non-myeloid cells, which, although very interesting, is beyond the scope of the present study. In addition, we do not have any experimental tool to distinguish whether the DNA damage phenotype of Arpc4KO cells involves the nuclear or cortical pool of F-actin, this is why we have left this question open in the discussion of our manuscript.

      6) OPTIONAL: The authors make a strong case arguing that the increased interferon expression signature (based on the transcriptomics data) reflects the nuclear ruptures in Arpc4-deficient cells and adds to the observed phenotype. If this is so, what happens then in STING knockout cells in the presence of CK666 inhibitor?

      __Response____: __During revision, we now tested the putative role of STING in the ArpC4-KO phenotype. We found that abrogation of STING function in ArpC4KO mice did not rescue LC survival, excluding the possibility that aberrant STING activation triggers LC loss in these animals (revised Fig. S5E,F). Therefore, we tempered our conclusion (page 7).

      • Are the data and the methods presented in such a way that they can be reproduced?

      1) The analyses include quite a number of intensity calculations of immunofluorescence signals (Fig. 3D, E; Fig. 4E, Fig. 5B and 6B)? The background stainings are often variable or very high. In some cases it is even unclear whether stainings are really detecting protein and go beyond background staining (Fig. 6A, Fig. 5F). How were immunofluorescence data acquired and dealt with different background staining intensities?

      __Response____: __We extended our description of the microscopes used for image acquisition as well as the downstream analyses for each experiment, which indeed vary depending on the signals observed with distinct antibodies or constructs.

      2) It remained unclear to me on which basis the nuclear deformations in Fig. 3G, H were calculated?

      __Response____: __We also extended the mentioning of methods used to quantify nuclear deformations.

      3) The detailed phenotype of control mice is a bit unclear. It appears as if these were Cre-negative animals. Did the authors have some proof-of-principle experiments showing that CD11cCre Arpc4 +/+ animals have comparable phenotypes to Cre-negative animals?

      __Response____: __We have never observed any decline in LC numbers in other mouse lines/genotypes (for example in cPLA2flox/flox;CD11c-Cre mice shown in the manuscript, Fig. S6B), excluding a putative role for the Cre in LC death. To nevertheless thoroughly check this aspect, we now quantified gH2Ax immunostaining of LCs of both Cre-positive and Cre-negative animals. These analyses revealed no Cre-mediated effect on DNA damage in LCs (revised Suppl. Fig. 4E,F).

      • Are the experiments adequately replicated and statistical analysis adequate?

      For most experiments, the number of biological replicates (mice, or BMDC cultures from different mice) and individual values (n, cells) are indicated. Statistical analysis appears adequate.

      Minor comments:

      • Prior published studies on Arp2/3 function in immune cells are referenced accordingly. A number of additional pre-print manuscripts on this topic have not been cited and could be considered referencing.

      __Response: __We now cited additional, relevant preprints and peer-reviewed work (page 12).

      • The text is very clearly and very well written. Figures are clear and accurate for most cases. There are some open questions:

      • Fig. 1B: The number of dots betwenn graph and legend do not match. The dots are not n=12 for both genotypes. Additionally: What do the symbols in the circles in the graph stand for? This is also in another later figure unclear.

      • Fig. 2C: The current IF presentation (overlay MHCII with Ki67) is not very helpful. An additional image that shows only the Ki67 signal in the MHCII mask would be very helpful.

      • Fig. 4B: BMDCs of which culture day were used for these experiments?

      • Fig. 4A and D shows the same representative cells for two biological messages, which is only moderately convincing regarding a "general" phenotype.

      • Fig. 5, B: Scale bars are missing.

      __Response: __We have fixed all these points (revised Fig. 1B, 2C, 4B, 4A&D, 5B).

      Reviewer #3 (Significance (Required)):

      Strengths and Advance:

      The study provides strong data and a very detailed analysis of how the Arp2/3 complex regulates stages of Langerhans cell development and homeostasis. The role of the Arp2/3 complex as regulator of actin branching, which is involved in many cellular functions, has previously not been reported for this cell type. Previous research in immune cells have already studied the Arp2/3 complex, but studies were focussed on its role in migration and the majority of published phenotypes related to cell migration. While there are already a number of in vitro studies showing that the Arp2/3 complex can regulate aspects of cell cycle control or cell death in non-immune cells, most of these studies were performed with immortalized, non-immune cell lines, which can be more easily manipulated to dissect mechanistic aspects of the cellular phenotype, but are limited in their physiological interpretation. Hence, it is a major strength of this study to investigate the effects of Arp2/3 in a primary immune cell type, directly in the native and physiological environment. This is important because in vitro data from other cell types cannot be easily extrapolated to any other cell type and it is critical for our understanding to collect physiological data from tissues, where the biology really happens. The finding that the Arp2/3 complex regulates the tissue-residency of Langerhans cell through processes that are unrelated to migration are partially unexpected, shifting the view of this protein complex's physiological role to other cell biological processes, e.g. regulation of cell proliferation.

      Limitations: The limitations of the study are detailed in the five major points listed above. The study accumulates many experiments that characterize the phenotype of Arpc4-depleted cells, showing signs of DNA damage in Langerhans cells and cultures of BMDCs. How the Arp2/3 complex would mechanistically mediate the observed effects on DNA damage and repairs have not been addressed. It also remains open whether this is due to the effects of the Arp2/3 complex in the nucleus or the cytosol, which would be biologically extremely important to understand. Above that, there are some discrepancies regarding the phenotype of the BMDC model, which does neither entirely match the Langerhans cell phenotype in the tissue (reduced proliferation, LC derive from different progenitors), nor other endogenous DC populations, which should also derive from similar progenitors.

      Audience and reviewer background:

      In its current form, the manuscript will already be of interest for several research fields: Langerhans cell and dendritic cell homeostasis, immune cell trafficking, actin and cytoskeleton regulation in immune cells, physiological role of actin-regulating proteins. My own field of expertise is immune cell trafficking in mouse models, leukocyte migration and cytoskeletal regulation. I cannot judge the analysis and clustering of the bulk RNA sequencing data.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript identifies a role of the Arp2/3 complex, the major regulator of actin branching in cells, for controlling the homeostasis of murine Langerhans cells (LCs), a specialized subset of dendritic cells in the skin epidermis. The findings of the study are based on the analysis of CD11c-Cre Arpc4-flox mice, a conditional knockout mouse model, which interferes with Arp2/3 function in Langerhans cells and other CD11c-expressing myeloid cells, e.g. dendritic cell or macrophage subsets. By using immunofluorescence and flow cytometry analysis of epidermis and skin tissues, the authors provide a detailed analysis of LC numbers at different developmental stages (postnatal day 1, 7, 28, and adult mice) and demonstrate that Arpc4-deficiency does not interfere with the establishment of LC networks until postnatal day 28. However, LCs in ear and tail skin are substantially reduced in Arpc4-deficient mice at 8-12 weeks of age. In parallel to their in vivo model, the authors analyze cultures of bone marrow-derived dendritic cells (BMDCs) from control and CD11c-Cre Arpc4-flox mice. Arpc4-deficiency in BMDCs, which develop over 8-10 days in culture, results in nuclear shape and lamina abnormalities, as well as signs of increased DNA damage. Aspects of this phenotype are also detected in Langerhans cells in epidermal preparations. Transcriptomic analysis of BMDCs highlights a gene signature of increased expression of the interferon response pathway and alterations in cell cycle regulation. Arpc4-deficient BMDCs show increased expression of DNA damage markers and reduced expression of certain DNA repair factors. Based on these correlative findings from the BMDC model, the authors conclude that the decline in LC numbers might develop from the accumulation of DNA damage over time, which the authors phrease "pre-mature aging of Langerhans cells". Lastly, the authors show a heterogenous picture how Arp2/3 depletion affects distinct DC populations in CD11c-Cre Arpc4-flox mice. While some tissue-resident DC subsets appear normal in numbers, others are declined in numbers in the tissue. This may be related to their proliferation potential in tissues.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      1) The authors claim that Arpc4 deficiency selectively compromises myeloid cell populations that rely on proliferation for tissue colonization (Figure 7). The presented data might give hints for such a general hypothesis, but solid experimental proof to prove this is lacking. When comparing myeloid cell subsets from foru different irgans, the authors refer to published data that some dendritic cell subsets are more proliferative in tissues than others and that CD11cCre Arpc4-flox mice appear to have reduced cell numbers in these populations. However, the presented data are purely correlative and no functional connection to cell proliferation has been made to the phenotypes. While some dendritic cell subsets (Langerhans cells, alveolar DCs) show reduced cell numbers in CD11cCre Arpc4-flox mice, other myeloid cell cells subsets are unaffected (e.g. dermal cDC1 and 2, colon macrophages).There could be plenty of other reasons that might underly the observed discrepancies between these cell subsets, e.g. Arp2/3 knockout efficiency and myeloid cell turnover in the tissue are just two examples, which have not been taken into consideration. Direct measurement of cell proliferation, e.g. BrdU labeling, and the observed phenotype would be missing to make such claims. The data could either be removed. Experimentally addressing these points could take 3-6 months.

      2) The authors claim that DC subsets (e.g. dermal cDCs), which develop from pre-DCs, are not affected by Arp2/3 depletion (Figure 7, although the FACS plot in Fig. 1D would suggest a different picture for cDC1). This is surprising in light of the data with bone marrow-derived DCs (BMDCs), the major in vitro model of this study, which develop from CDPs that again develop from pre-DCs. BMDCs did show aberrant nuclei and signs of DNA damage. How would the authors then explain the discrepancies of the BMDC model with DC subsets, where the authors feel that the pre-DC origin explains the phenotypic difference? This is a general concern of the data interpretation and conclusions.

      3) In line with point 2, the authors never show that BMDCs show reduced proliferation, reduced cell numbers or increased cell death in Arpc4-deficient cell cultures, as a consequence of the detected DNA damage and impaired DNA repair. In fact, Figure 5C even shows that cell growth rates between control and KO are equal. This is a major mismatch in the current study. Since the authors use the BMDC model to explain the declining cell numbers in Langerhans cells (which derive from fetal liver cells), this phenotype is not mirrored by the BMDC culture and it remains open whether the observed changes in nuclear DNA damage and repair are indeed directly linked to the observed phenotype of declining cell numbers in the tissue. These aspects require argumentation why cell growth is unchanged in KO cells. Additional experiments addressing these points with sufficient biological replicates (cultures from different mice) could take 2-3 months, including preparation time.

      4) The authors refer to a "pre-mature aging" phenotype of Arpc4-deficient BMDCs and LCs, based on reductions in Lamin B, Lamin A and increases in gH2AX and 53BP1. I find this term and overstatement of the current data and suggest that other markers for cell senescence, such as p53, Rb, p21 and b-Galactosidase are then also used to make such strong claim on "aging" and cell senescence. Experimentally addressing this point with sufficient biological replicates could take 2-3 months, including preparation time.

      5) The study does not provide a mechanism how the Arp2/3 complex would mediate the observed effects on DNA damage and repairs has not been addressed in the cell model, and only potential scenarios from other non-myeloid cell lines are discussed. It remains unclear whether the observed phenotypes in Arpc4-depleted myleoid cells relate to the direct nuclear function of Arp2/3 or the cytosolic function of Arp2/3, including its roles in cytoskeletal regulation that may have secondary effects on the nuclear alterations. This is a general concern of the presented data, data on mechanism might require more than 6 months.

      6) OPTIONAL: The authors make a strong case arguing that the increased interferon expression signature (based on the transcriptomics data) reflects the nuclear ruptures in Arpc4-deficient cells and adds to the observed phenotype. If this is so, what happens then in STING knockout cells in the presence of CK666 inhibitor?

      • Are the data and the methods presented in such a way that they can be reproduced?

      1) The analyses include quite a number of intensity calculations of immunofluorescence signals (Fig. 3D, E; Fig. 4E, Fig. 5B and 6B)? The background stainings are often variable or very high. In some cases it is even unclear whether stainings are really detecting protein and go beyond background staining (Fig. 6A, Fig. 5F). How were immunofluorescence data acquired and dealt with different background staining intensities?

      2) It remained unclear to me on which basis the nuclear deformations in Fig. 3G, H were calculated?

      3) The detailed phenotype of control mice is a bit unclear. It appears as if these were Cre-negative animals. Did the authors have some proof-of-principle experiments showing that CD11cCre Arpc4 +/+ animals have comparable phenotypes to Cre-negative animals?

      • Are the experiments adequately replicated and statistical analysis adequate?

      For most experiments, the number of biological replicates (mice, or BMDC cultures from different mice) and individual values (n, cells) are indicated. Statistical analysis appears adequate.

      Minor comments:

      • Prior published studies on Arp2/3 function in immune cells are referenced accordingly. A number of additional pre-print manuscripts on this topic have not been cited and could be considered referencing.

      • The text is very clearly and very well written. Figures are clear and accurate for most cases. There are some open questions:

      1) Fig. 1B: The number of dots betwenn graph and legend do not match. The dots are not n=12 for both genotypes. Additionally: What do the symbols in the circles in the graph stand for? This is also in another later figure unclear.

      2) Fig. 2C: The current IF presentation (overlay MHCII with Ki67) is not very helpful. An additional image that shows only the Ki67 signal in the MHCII mask would be very helpful.

      3) Fig. 4B: BMDCs of which culture day were used for these experiments?

      4) Fig. 4A and D shows the same representative cells for two biological messages, which is only moderately convincing regarding a "general" phenotype.

      5) Fig. 5, B: Scale bars are missing.

      Significance

      Strengths and Advance:

      The study provides strong data and a very detailed analysis of how the Arp2/3 complex regulates stages of Langerhans cell development and homeostasis. The role of the Arp2/3 complex as regulator of actin branching, which is involved in many cellular functions, has previously not been reported for this cell type. Previous research in immune cells have already studied the Arp2/3 complex, but studies were focussed on its role in migration and the majority of published phenotypes related to cell migration. While there are already a number of in vitro studies showing that the Arp2/3 complex can regulate aspects of cell cycle control or cell death in non-immune cells, most of these studies were performed with immortalized, non-immune cell lines, which can be more easily manipulated to dissect mechanistic aspects of the cellular phenotype, but are limited in their physiological interpretation. Hence, it is a major strength of this study to investigate the effects of Arp2/3 in a primary immune cell type, directly in the native and physiological environment. This is important because in vitro data from other cell types cannot be easily extrapolated to any other cell type and it is critical for our understanding to collect physiological data from tissues, where the biology really happens. The finding that the Arp2/3 complex regulates the tissue-residency of Langerhans cell through processes that are unrelated to migration are partially unexpected, shifting the view of this protein complex's physiological role to other cell biological processes, e.g. regulation of cell proliferation.

      Limitations:

      The limitations of the study are detailed in the five major points listed above. The study accumulates many experiments that characterize the phenotype of Arpc4-depleted cells, showing signs of DNA damage in Langerhans cells and cultures of BMDCs. How the Arp2/3 complex would mechanistically mediate the observed effects on DNA damage and repairs have not been addressed. It also remains open whether this is due to the effects of the Arp2/3 complex in the nucleus or the cytosol, which would be biologically extremely important to understand. Above that, there are some discrepancies regarding the phenotype of the BMDC model, which does neither entirely match the Langerhans cell phenotype in the tissue (reduced proliferation, LC derive from different progenitors), nor other endogenous DC populations, which should also derive from similar progenitors.

      Audience and reviewer background:

      In its current form, the manuscript will already be of interest for several research fields: Langerhans cell and dendritic cell homeostasis, immune cell trafficking, actin and cytoskeleton regulation in immune cells, physiological role of actin-regulating proteins. My own field of expertise is immune cell trafficking in mouse models, leukocyte migration and cytoskeletal regulation. I cannot judge the analysis and clustering of the bulk RNA sequencing data.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      • This is a study in experimental mice employing both in vitro and, importantly, in vivo approaches. EPIDERMAL LANGERHANS CELLS serve as a paradigm for the maintenance of homeostasis of myeloid cells in a tissue, epidermis in this case. In addition to well known functions of the ACTIN NETWORK in cell migration, chemotaxis, cell adherence and phagocytosis the authors reveal a critical function of actin networks in the survival of cells in their home tissue.

      • Actin-related proteins (Arp), specifically here the Arp2/3 complex, are necessary to form the filamentous actin networks. The authors use conditional knock-out mice where Arpc4 (an essential component of the Arp2/3 complex) is deleted under the control of CD11c, the most prominent dendritic cell marker which is also expressed on Langerhans cells. In normal mice, epidermal Langerhans cells reside in the epidermis virtually life-long. They initially settle the epidermis around and few days after birth an establish a dense network by a burst of proliferation and then they "linger on" by low level maintenance proliferation. In the epidermis of Arpc4 knock-out mice Langerhans cells also start off with this proliferative burst but, strikingly, they do not stay but are massively reduced by the age of 8-12 weeks.

      • The analyses of this decline revealed that

      a) the shape (number of nuclear lobes) and integrity of cell nuclei was compromised; they were fragile and ruptured to some degree when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      b) DNA damage, as detected by staining for gamma-H2Ax or 53BP1 accumulated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      c) recruitment of DNA repair molecules was inhibited when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      d) gene signatures of interferon signaling and response were increased when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      e) in vivo migration of dendritic cells and Langerhans cells from the skin to the draining lymph nodes in an inflammatory setting (FITC painting of the skin) was impaired when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      f) the persistence of the typical dense network of Langerhans cells in the epidermis, created by proliferation shortly after birth, is abrogated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing. Importantly, this was not the case for myeloid cell populations that settle a tissue without needing that initial burst of proliferation. For instance, numbers of colonic macrophages were not affected when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing.

      • Thus, the authors conclude that the Arp2/3 complex is essential by its formation of actin networks to maintain the integrity of nuclei and ensure DNA repair thereby ascertaining the maintenance proliferation of Langerhans cells and, as the consequence, the persistence of the dense epidermal netowrk of Langerhans cells.

      • Up-to-date methodology from the fields of cell biology and cellular immunology (cell isolation from tissues, immunofluorescence, multiparameter flow cytometry, FISH, "good old" - but important - transmission electronmicroscopy, etc.) was used at high quality (e.g., immunofluorescence pictures!). Quantitative and qualitative analytical methods were timely and appropriate (e.g., Voronoi diagrams, cell shape profiling tools, Cre-lox gene-deletion technology, etc.). Importantly, the authors used a clever method, that they had developed several years ago, namely the analysis of dendritic cell migration in microchannels of defined widths. Molecular biology methods such as RNAseq were also employed and analysed by appropriate bioinformatic tools.

      Major comments:

      • ARE THE KEY CONCLUSIONS CONVINCING? Yes, they are.

      • SHOULD THE AUTHORS QUALIFY SOME OF THEIR CLAIMS AS PRELIMINARY OR SPECULATIVE, OR REMOVE THEM ALTOGETHER? No, I think it is ok as it stands. The authors are wording their claims and conclusions not apodictically but cautiously, as it should be. They point out explicitely which lines of investigations they did not follow up here.

      • WOULD ADDITIONAL EXPERIMENTS BE ESSENTIAL TO SUPPORT THE CLAIMS OF THE PAPER? REQUEST ADDITIONAL EXPERIMENTS ONLY WHERE NECESSARY FOR THE PAPER AS IT IS, AND DO NOT ASK AUTHORS TO OPEN NEW LINES OF EXPERIMENTATION. I think that the here presented experimental evidence suffices to support the conclusions drawn. No additional experiments are necessary.

      • ARE THE SUGGESTED EXPERIMENTS REALISTIC IN TERMS OF TIME AND RESOURCES? IT WOULD HELP IF YOU COULD ADD AN ESTIMATED COST AND TIME INVESTMENT FOR SUBSTANTIAL EXPERIMENTS. Not applicable.

      • ARE THE DATA AND THE METHODS PRESENTED IN SUCH A WAY THAT THEY CAN BE REPRODUCED? Yes, they are.

      • ARE THE EXPERIMENTS ADEQUATELY REPLICATED AND STATISTICAL ANALYSIS ADEQUATE? Yes.

      Minor comments:

      • SPECIFIC EXPERIMENTAL ISSUES THAT ARE EASILY ADDRESSABLE. None

      • ARE PRIOR STUDIES REFERENCED APPROPRIATELY? Essentially yes. Regarding the reduction / loss of the adult epidermal Langerhans cell network, it may be of some interest to also refer to / discuss to another one of the few examples of this phenomenon. There, the initial burst of proliferation is followed by reduced proliferation and increased apoptosis when a critical member of the mTOR signaling cascade is conditionally knocked out (Blood 123:217, 2014).

      • ARE THE TEXT AND FIGURES CLEAR AND ACCURATE? Yes they are. Figures are well arranged for easy comprehension.

      • DO YOU HAVE SUGGESTIONS THAT WOULD HELP THE AUTHORS IMPROVE THE PRESENTATION OF THEIR DATA AND CONCLUSIONS?

      • Materials & Methods. The authors write, regarding flow cytometry of epidermal cells: "Briefly, 1cm2 of back skin from 8-14 weeks old female wild-type and knockout littermates was dissociated in 0.25 mg/mL Liberase (Sigma, cat. #5401020001) and 0.5 mg/mL DNase (Sigma, cat.#10104159001) in 1 mL of RPMI (Sigma) and mechanically disaggregated in Eppendorf tubes, FOLLOWED BY INCUBATED for 2 h at 37 {degree sign}C." Followed by what?

      • Materials & Methods. BMDC electronmicroscopy. What is "IF". Please specify.

      • RESULTS in gene expression analyses. The authors observe some increase in apoptosis (as detected by cleaved-Caspase-3 staining). Is this observation in immunofluorescence also evident in the RNAseq data (where the IFN changes were seen), i.e., in Figure 5.

      • Figure 7 F and G. Perhaps the authors may want to swap upper and lower panels in F or G, so that macrophage FACS plots and bar graphs are in the same row - ob, obiously, DC plots and bars likewise.

      • Figure 7H. "Gating strategy in ArpC4WT Lung (previously gated in Live CD45+ cells)" - The lower row is knock-out, not WT. This is indicated correctly in the legand, but in the figure both rows are labeled as WT.

      • The reference by Park et al. 2021 is missing in the list.

      • Figure 1D. Sure, the bar graphs are meant to say "CD11c"? The FACS plots show "CD11b".

      • As to cDC1. In Figure 1D the FACS plot shows an absence of CD103+ cDC1 cells. In contrast, In Figure 7A-left side panel, there is not difference in cDC1 cells between WT and KO mice. Is therefore the flow cytometry plot in Figure 1D not representative regarding cDC1 cells? Correct?

      Significance

      • DESCRIBE THE NATURE AND SIGNIFICANCE OF THE ADVANCE (E.G. CONCEPTUAL, TECHNICAL, CLINICAL) FOR THE FIELD. This is a conceptual advance. It adds a big step to our understanding of how immune cells in tissues (which all come from the bone marrow or are seeded before birth from embryonal hematopoietic organs such as yolk sac and fetal liver) can remain resident in these tissues. For cell types such as Langerhans cells, which establish their final population density within their tissues of residence, the presented finding convincingly buttress the role of proliferation and thereby the role for the actin-related protein complex 2/3 (Arp2/3).

      • PLACE THE WORK IN THE CONTEXT OF THE EXISTING LITERATURE (PROVIDE REFERENCES, WHERE APPROPRIATE). While we know much about actin-related proteins (Arp), as correctly cited by the authors, this knowledge is derived mostly from in vitro studies. The submitted study translates the findings to an in vivo setting for the first time.

      • STATE WHAT AUDIENCE MIGHT BE INTERESTED IN AND INFLUENCED BY THE REPORTED FINDINGS. Skin immunologists foremost, but these findings are of interest to the entire community of immunologists, but also cell biologists.

      • DEFINE YOUR FIELD OF EXPERTISE. My expertise is in skin immunology, in particular skin dendritic cells including Langerhans cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      • The manuscript by Delgado et al. reports the role of the actin remodeling Arp2/3 complex in the biology of Langerhans cells, which are specialized innate immune cells of the epidermis. The study is based on a conditional KO mouse model (CD11cCre;Arpc4fl/fl), in which the deletion of the Arp2/3 subunit ArpC4 is under the control of the myeloid cell specific CD11c promoter.

      • In this model, the assembly of LC networks in the epidermis of ear and tail skin is preserved when examining animals immediately after birth (up to 1 week). Subsequently however LCs from ArpC4-deleted mice start displaying morphological aberrations (reduced elongation and number of branches at 4 weeks of age). Additionally, a profound decline in LC numbers is reported in the skin of both the ear and tail of young adult mice (8-10 weeks).

      • To explore the cause of such decline, the authors then opt for the complementary in vitro study of bone-marrow derived DCs, given the lack of a model to study LCs in vitro. They report that ArpC4 deletion is associated with aberrantly shaped nuclei, decreased expression of the nucleoskeleton proteins Lamin A/C and B1, nuclear envelop ruptures and increased DNA damage as shown by γH2Ax staining. Importantly, they provide evidence that the defects evoked by ArpC4 deletion also occur in the LCs in situ (immunofluorescence of the skin in 4-week old mice).

      • Increased DNA damage is further documented by staining differentiating DCs from ArpC4-deleted mice with the 53BP1 marker. In parallel, nuclear levels of DNA repair kinase ATR and recruitment of RPA70 (which recruits ATR to replicative forks) are reduced in the ArpC4-deleted condition. In vitro treatment of DCs with the topoisomerase II inhibitor etoposide and the Arp2/3 inhibitor CK666 induce comparable DNA damage, as well as multilobulated nuclei and DNA bridges. The authors conclude that the ArpC4-KO phenotype might stem, at least in part, from a defective ability to repair DNA damages occurring during cell division.

      • The study in enriched by an RNA-seq analysis that points to an increased expression of genes linked to IFN signaling, which the authors hypothetically relate to overt activation of innate nucleic acid sensing pathways.

      • The study ends by an examination of myeloid cell populations in ArpC4-KO mice beyond LCs. Skin cDC2 and cDC2 subsets display skin emigration defects (like LCs), but not numerical defects in the skin (unlike LCs). Myeloid cell subsets of the colon are also present in normal numbers. In the lungs, interstitial and alveolar macrophages are reduced, but not lung DC subsets. Collectively, these observations suggest that ArpC4 is essential for the maintenance of myeloid cell subsets that rely on cell division to colonize or to self-maintain within their tissue of residency (including LCs).

      Major comments:

      1. ArpC4 and Arp2/3 expression

      The authors argue that LCs from Arpc4KO mice should delete the Arpc4 gene in precursors that colonize the skin around birth. It would be important to show it to rule out the possibility that the lack of phenotype (initial seeding, initial proliferative burst) in young animals (first week) could be related to an incomplete deletion of ArpC4 expression. Also important would be to show what is happening to the Arp2/3 complex in LCs from Arpc4KO mice. In the in vitro studies with DCs, the level of ArpC4 and Arp2/3 deletion at the protein level is also not documented. The authors explain that surface expression of the CD11c marker, which drives Arpc4 deletion, gradually increased during differentiation of DCs: from 50% to 90% of the cells. Does that mean that loss of ArpC4 expression is only effective in a fraction of the cells examined before day 10 of differentiation (e.g. in the RNA-seq analysis)?

      1. Intra-nuclear versus extra-nuclear activities of Arp2/3

      The authors favor a model whereby intra-nuclear ArpC4 helps maintaining nuclear integrity during proliferation of DCs (and possibly LCs). However, multiple pools of Arp2/3 have been described and accordingly, multiple mechanisms may account for the observed phenotype: i) cytoplasmic pool to drive the protrusions sustaining the assembly of the LC network and its connectivity with keratinocytes ; ii) peri-nuclear pool to protect the nucleus ; iii) Intra-nuclear pool to facilite DNA repair mechanisms e.g. by stabilizing replicative forks (the scenario favored by the authors).

      It is recommended that the authors try to gather more supportive data to sustain the intra-nuclear role. Documenting ArpC4 presence in the nucleus would help support the claim. It could be combined with treatments aiming at blocking proliferation in order to reinforce the possibility that a main function of ArpC4 is to protect proliferating cells by favoring DNA repair inside the nucleus.

      1. Nuclear envelop ruptures

      The nuclear envelop ruptures are not sufficiently documented (how many cells were imaged? quantification?). The authors employ STED microscopy to examine Lamin B1 distribution. The image shown in Figure 4A does not really highlight the nuclear envelop, but rather the entire content. Whether it is representative is questionable. We would expect Lamin B1 staining intensity to be drastically reduced given the quantification shown in Figure 3D. In addition, although the authors have stressed in the previous figure that Arpc4-KO is associated with nucleus shape aberrations, the example shown in Figure 4A is that of a nucleus with a normal ovoid shape.

      It is recommended to quantify the ruptures with Lap2b antibodies (or another staining that would better delineate the envelop) in order to avoid the possible bias due to the reduced staining intensity of Lamin B1.

      A missing analysis is that of nuclear envelop ruptures as a function of nucleus deformations.

      Fig 4B-C: same frequency of Arpc4-KO and WT cells displaying nuclear envelop ruptures in the 4-µm channels; however image show a rupture for the Arpc4-KO and no rupture for the WT cells (this is somehow misleading). Are ruptures similar in Arpc4-KO and WT cells in this condition?

      Fig 4D-E: is their a direct link between nuclear envelop ruptures and ƴH2A.X?

      Interesting (but optional) would be to understand what is happening to DNA, histones? Is their evidence for leakage in the cytoplasm?

      1. RNA seq analysis

      The RNA-seq analysis suffers from a lack of direct connection with the rest of the study. The extracted molecular information is not validated nor further explored. It remains very descriptive. The PCA analysis suggests a « more pronounced transcriptomic heterogeneity in differentiating Arpc4KO DCs ». However it seems difficult to make such a claim from the comparison of 3 mice per group. In addition, such heterogeneity is not seen in the more detailed analysis (Fig 5F). The authors claim that « day 10 control and Arpc4KO DCs showed no to very little differences in gene expression, in contrast to cells at days 7-9 of differentiation ». This is not obvious from the data displayed in the corresponding figure. In addition, it is not expected that cells that may take a divergent differentiation path at days 7-9 may would return to a similar transcriptional activity at day 10. A point that is not discussed is that before day 10 of DC differentiation, Arpc4 KO is expected to only occur in about 50% of the cell population. This is expected to impact the RNA-seq analysis. Not all clusters have been exploited (e.g. cluster 3 elevated, cluster 6 partly reduced). I suggest the authors reconsider their analysis and analysis of the RNA-seq analysis (or eventually invest in complementary analysis).

      1. What causes the profound numerical drop of LC in the epidermis?

      A major open question is what causes the massive drop of LCs. Although differentiating Arpc4KO DCs start accumulating DNA damage upon proliferation, they succeed in progressing through the cell cycle. There is even a slightly elevated expression of cell cycle genes at day 7 of differentiation in the DC model. Only a trend for increased apoptosis is observed in ear and tail skin. It would be important to provide complementary data documenting increased death (or aberrant emigration?) of LCs in the 4-8 week time window.

      1. Functional consequences

      Although the study reports novel aspects of LC biology, the consequence of ArpC4 deletion for skin barrier function and immunosurveillance are not investigated. It would seem very relevant to test how this model copes with radiation, chemical and/or microorganism challenges.

      Minor comments:

      1. Figure 1D

      Gating strategy: twice the same empty plots. The content seems to be missing... Does this need to be shown in the main figure?

      1. Figure 2

      Best would be to keep same scale to compare P1 and P7 (tail skin, figure 2A)

      Overlay of Ki67 and MHC-II does not allow to easily visualize the double-positive cells (Fig 2C)

      Quality of Ki67 staining different for Arpc4-KO (less intense, less focused to the nuclei): a technical issue or could that reflect something?

      Fig 2C: Panels mounted differently for ear and tail skin (different order to present the individual stainings, Dapi for tail skin only).

      1. LC branch analysis (Fig 1 and 2)

      While Fig 1 indicates that ear skin LCs form in average twice as few branches as tail skin LCs (3-4 versus 8-9 branches per cell), Fig 2 shows the opposite (10-12 versus 6-7 branches per cell). Is this due to a very distinct pattern between the 2 considered ages (4 weeks versus 8-10 weeks)? Could the author double-check that there is no methodological bias in their analysis?

      1. Fig 3 E-G

      How many animals were examined (n=5)? Reproducible accros animals? Why was it done with 4-week animals (phenotype not complete? Event occurring before loss in numbers...)

      Staining Lamin A/C globally more intense in the Arpc4-KO epidermis (also seems to apply to the masks corresponding to the LCs). Surprising to see that the quantification indicates a major drop of Lamin A/C intensity in the LCs.

      1. Legend Fig 4D replace confocal microscopy by STED microscopy

      2. Figure 4F

      Intensity/background of γH2Ax staining very distinct between the 2 micrographs shown for WT and Arpc4-KO epidermis.

      1. Figure 7C, F, H

      Gating strategies: would be better to harmonize the style of the plots (dot plots and 2 types of contour plots have been used...)

      1. Figure 7H

      Legend of lower gating strategy seems to be wrong (KO and not WT).

      Significance

      Strengths: the general quality of the manuscript is high. It is very clearly written and it contains a very detailed method section that would allow reproducing the reported experiments. This work entails a clear novelty in that it represents the first investigation of the role of ArpC4 in LCs. It opens an interesting perspective about specific mechanisms sustaining the maintenance of myeloid cell subsets in peripheral tissues. This work is therefore expected to be of interest for a large audience of cellular immunologists and beyond. Challenging skin function with an external trigger would lift the relevance for a even wider audience (see main point 6).

      Limitations: in its current version the manuscript suffers from a lack of solidity around a few analysis (see main points on ArpC4 and Arp2/3 protein expression, nuclear envelop rupture analysis,...). It also tends to formulate a narrative centered on the ArpC4 intra-nuclear function that is not definitely proven.

      The field of expertise of this reviewer is: cellular immunology and actin remodeling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We thank the reviewers for their careful evaluation of our manuscript and for their constructive comments. The reviews recognise the relevance of the topic and the value of the multi-omics approach used to investigate host responses to SARS-CoV-2 variants in a physiologically relevant primary nasal epithelial cell model.

      In response to the reviewers' comments, we revised the manuscript to improve clarity of presentation, strengthened the contextualisation of the experimental design, and moderated the interpretation of the results. We also incorporated additional analyses based on existing datasets to better characterise infection burden and host responses across variants.

      Importantly, the MOI reported in the original manuscript (0.01) was a typographical error; all infections were performed at MOI 0.1 as documented in the GEO dataset (GSE271378). This was corrected throughout the manuscript.

      Overall, the revisions were intended to clarify the experimental framework, strengthen the integration of the multi-omics datasets, and ensure that the conclusions accurately reflect the scope of the study as a comparative systems-level analysis of variant-associated host-response signatures rather than a mechanistic dissection of individual pathways.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      • *

      Reviewer summary: In the submitted work, authors seek to understand the transcriptional and metabolic changes induced by different variants of SARS-CoV-2 infection. They employ a model of pooled, primary nasal epithelial cells (NEC) differentiated within an air-liquid-interface. Subsequently, cultures are infected with isolates representing key variants of SAR-CoV-2 from initial D614G, Alpha, Beta, Delta, and Omicron. Following initial characterization, authors compare transcriptional changes at 24 and 72 hours post infection. Analysis focuses on differentially expressed genes, upregulated Interferon Stimulated genes, and transcripts within known metabolic pathways. Subsequently, infected cultures are also analyzed by phosphoproteomic analysis to identify changes in cell signaling and measured for amino acid content. Throughout, changes in each profile are correlated with different variants of SARS-CoV-2, with Delta and Omicron revealing particular diametrically opposed changes. There are reasonable controls applied, including the use of IFNalpha treatment to "benchmark" ISG production. Overall, authors create a picture that Omicron infections do not suppress IFN signaling as efficiently as Delta variants and further exhibit limited hallmarks of cell stress and metabolic dysregulation. This is a remarkable study that attempts to cross-correlate multiple -omics analyses of cell responses to characterize differences in infection. It is very well written and the data is exemplary. I do have some concerns related to the placement and emphasis of interpretation in the results section that need to be revised. Beyond my stylistic concern, the interpretation of the experimental changes between variants are compromised by the failure to analyze the extent of infection within the NEC model. Using an MOI of 0.01 will produce a dramatically heterogeneous extent of infection at both 24 and 72 hours post infection that will also depend on the extent of viral transmission within the culture. The limited analysis of secreted E-gene detection is insufficient to overcome the inherent unequal comparison of cell responses between variants. There are ways to assuage, but not eliminate, this problem when it comes to comparing and interpreting experimental results. My concerns and suggestions are detailed in the concerns below.

      Response: We thank Reviewer #1 for the very positive assessment of the study, which supports a decision to publish, and for the constructive suggestions. We agree that the interpretation of comparative bulk multi-omics data in differentiated NEC cultures should be carefully framed in light of variant-specific infection dynamics. In response, we revised the manuscript substantially. Importantly, the MOI reported in the manuscript (0.01) was corrected, as all infections in this study were performed at MOI 0.1, as correctly described in our GEO submission (GSE271378). We also quantified SARS-CoV-2 reads directly from the RNA-seq libraries to estimate infection burden in each sequenced sample, added infectious virus titre measurements (PFU/ml), expanded the analysis of IFNα-treated samples to include DEG and pathway-level comparisons, improved figure clarity and legends, and substantially tempered the interpretation throughout the Results and Discussion. We believe these revisions address the reviewer's concerns and strengthen the manuscript.

      1. Heterogeneous extent of infection. The MOI of 0.01 used to initiate infection is extraordinarily low for the types of analysis that is employed with the NEC culture. The interpretation of the data does not take into account that there will be infected and uninfected cells, of varying extents, making up the changes observed. Further, the variants likely have differing abilities to spread through the NEC culture, complicating both interpretation of changes and comparison between variants. At a minimum, authors need to evaluate the extent of SARS-CoV-2 infection through either flow cytometry or immunofluorescence analysis against viral protein(s). It is possible that Omicron, while secreted well, has more limited transmission allowing for more cells to mount an IFN response. Delta is a prolifically spreading virus that likely has more extensive infection at 72 hpi than the other variants. These statements are conjecture and highlight how such differences could alter the interpretation of the subsequent experiments. Response: We thank the reviewer for raising this important point. We would first like to clarify that the MOI reported in the manuscript (0.01) was a typographical error. All infections in this study were performed at MOI 0.1, as correctly documented in the RNA-seq dataset deposited in GEO (GSE271378). The manuscript text, Methods, and figure legends was corrected accordingly. MOI values in this range are commonly used for infections of differentiated airway epithelial cultures and allow productive infection while preserving epithelial integrity.

      We agree that infection heterogeneity is an important consideration when interpreting bulk transcriptomic, phosphoproteomic and metabolic measurements in differentiated epithelial cultures. We argue that such heterogeneity is expected in air-liquid interface nasal epithelial models, where SARS-CoV-2 infection occurs within a structured epithelium composed of multiple cell types and infected cells coexist with neighbouring bystander cells responding to paracrine interferon signalling. Bulk multi-omics measurements therefore capture the integrated epithelial response to infection rather than purely cell-intrinsic responses.

      To better contextualise infection burden within the sequenced samples, we included an additional analysis quantifying SARS-CoV-2 reads directly from each RNA-seq library and infectious virus titres (Figure 1). In the revised manuscript, these data are presented together in a new Supplementary Figure 1, which distinguishes intracellular viral RNA abundance from infectious virus production. The viral read analysis shows that intracellular viral RNA increases between 24 and 72 hpi across all variants and becomes broadly similar across lineages by 72 hpi, whereas plaque assays show that BA.1 has the highest early infectious output and Delta reaches the highest infectious titres at 48-72 hpi. We used these data to revise the Results and Discussion so that host-response differences are interpreted in the context of infection burden, while also making clear that intracellular viral RNA abundance, extracellular viral RNA output and infectious virus production are related but distinct measures of variant biology.

      Figure 1: Intracellular viral RNA reads (RNA-seq) (A) and infectious virus titres (PFU mL⁻¹; B) across SARS-CoV-2 variants.

      Further evaluation of IFNalpha treated cells. The paper emphasizes the ISG analysis, but the IFN treated cells should be included in the DEG and metabolic pathway analysis. IFN treatment is known to alter metabolic changes in cells, and it would be valuable to see those changes reflected in your analysis. Consider the evidence presented in the following: Fritsch SD, Weichhart T. Effects of Interferons and Viruses on Metabolism. Front Immunol. 2016 Dec 21;7:630. Heer CD, Sanderson DJ, Voth LS, Alhammad YMO, Schmidt MS, Trammell SAJ, Perlman S, Cohen MS, Fehr AR, Brenner C. Coronavirus infection and PARP expression dysregulate the NAD metabolome: An actionable component of innate immunity. J Biol Chem. Elsevier BV; 2020 Dec 25;295(52):17986-17996. Palmer CS. Innate metabolic responses against viral infections. Nat Metab. 2022 Oct;4(10):1245-1259 Further, It is possible that the changes attributed to Omicron are quite similar to the effects of the IFN treatment, given the extensive ISG detection. The same is true for the phosphor-proteomic analysis and amino acid content. I also have concerns that using a treatment of IFNalpha that impacts all cells as a benchmark for heterogeneous infection is not truly comparable. How was the concentration of IFN chosen? What was the extent of IFN activation in the culture?

      Response: In response to this suggestion, we performed pathway enrichment analysis of IFNα-treated samples to evaluate whether interferon stimulation alone induces the metabolic pathway signatures observed during viral infection. IFNα treatment produced the expected transcriptional interferon-stimulated gene programme but did not result in significant enrichment of the metabolic pathways highlighted in the infection comparisons (Figure 2). Specifically, pathways related to glycolysis/gluconeogenesis, glutathione metabolism, fatty acid metabolism, mitochondrial pathways, and oxidative phosphorylation showed only limited or modest negative enrichment scores and did not approach the magnitude of enrichment observed in virus-infected cultures. These results indicate that interferon signalling alone does not reproduce the metabolic pathway signatures associated with variant infection. The IFNα pathway analysis was included in the revised manuscript as supplementary data and referenced in the Results section.

      We agree that IFNα treatment of all cells is not directly equivalent to heterogeneous viral infection within differentiated NEC cultures. The IFNα concentration used was selected based on previous optimisation experiments showing robust induction of canonical ISGs in differentiated airway epithelial cultures. In the revised manuscript we clarified that the IFNα condition is used as a reference for interferon-responsive transcription rather than as a direct surrogate for infected cultures. We provided additional methodological clarification regarding how the IFNα concentration was selected and how interferon activation was benchmarked in NEC cultures.

      Further correlation of transcriptional changes with metabolic changes - While many published works emphasize transcriptional changes as a proxy for metabolic changes, there are robust methods that can be applied to directly analyze metabolite content and changes in the context of viral infection. In particular these studies should be assessed and compared for the interpretation of the presented results: Kramaric, T., Thein, O.S., Parekh, D. et al. SARS-CoV2 variants differentially impact on the plasma metabolome. Metabolomics 21, 50 (2025). Loveday EK, Welhaven H, Erdogan AE, Hain KS, Domanico LF, Chang CB, June RK, Taylor MP. Starve a cold or feed a fever? Identifying cellular metabolic changes following infection and exposure to SARS-CoV-2. PLoS One 2025 Feb 12;20(2):e0305065. Irún P, Gracia R, Piazuelo E, Pardo J, Morte E, Paño JR, Boza J, Carrera-Lasfuentes P, Higuera GA, Lanas A. Serum lipid mediator profiles in COVID-19 patients and lung disease severity: a pilot study. Sci Rep. 2023 Apr 20;13(1):6497. Luke Whiley, Nathan G. Lawler, Annie Xu Zeng, Alex Lee, Sung-Tong Chin, Maider Bizkarguenaga, Chiara Bruzzone, Nieves Embade, Julien Wist, Elaine Holmes, Oscar Millet, Jeremy K. Nicholson, and Nicola Gray, "Cross-Validation of Metabolic Phenotypes in SARS-CoV-2 Infected Subpopulations Using Targeted Liquid Chromatography-Mass Spectrometry (LC-MS)", Journal of Proteome Research 2024 23 (4), 1313-1327

      Response: We thank the reviewer for this important comment and agree that transcriptional pathway enrichment alone cannot establish metabolic flux or enzyme activity. Our intention in this study was to integrate transcriptomic signatures with complementary data layers, including phospho-signalling profiles and targeted intracellular amino acid quantification, to provide a comparative systems-level view of host responses to SARS-CoV-2 variants in nasal epithelial cells.

      We acknowledge that transcriptional enrichment does not necessarily reflect pathway activity and that our amino acid measurements represent a targeted metabolite readout rather than a comprehensive metabolomic or flux-based analysis. In the revised manuscript, we have therefore moderated the language used when describing metabolic changes and refered to pathway enrichment more cautiously as indicative of potential metabolic engagement rather than direct metabolic regulation.

      We have also expanded the Discussion to contextualise our findings with the metabolomic studies suggested by the reviewer and related work examining metabolic responses to SARS-CoV-2 infection.

      Editing to limit interpretation within experimental results. I appreciate that this is a stylistic concern and it is an issue in the paper. Statements in the results are often over-reaching. Some examples include: Line 156 -"suggesting attenuated or delayed early sensing" - The Low MOI and time leaves these results open to various explanations. Better to just state and move on. Line 157 "Delta drove the most extensive" - drove has a lot of assumption. "produced" "resulted in " or something more passive is more appropriate Line 179 "pointing to sustained suppression of interferon responses." - sustained is a leading interpretation. Effective? Comprehensive? again, the MOI is complicating interpretations of global transcript changes. Line 186 "suggesting a weaker activation of interferon signaling" Too much leading interpretation here. You detect fewer ISGs that are differentially regulated. Could be for many reasons.

      Response: We appreciate this comment and agree. We have revised the Results section throughout to make the language more descriptive and less interpretive. The specific examples highlighted by the reviewer were changed accordingly, and similar phrasing elsewhere in the Results was also softened. Mechanistic interpretation was reduced and moved to the Discussion where appropriate.

      Line 72 "has evolved unique strategies" Unique can be easily misconstrued to mean different mechanisms. More likely, it is a subtle balance between promotion of viral replication and suppression of IFN responses.

      Response: We agree and have revised this wording to avoid overstating mechanistic distinctiveness.

      Line 126 - 128 "NECs were derived from three commercially available donor pools". The following text doesn't make it clear that they are the same produce from different lots. The methods clarify somewhat, but should be clarified for transparency.

      Response: We thank the reviewer for noting this lack of clarity. We revised the relevant text in the Results and Methods to make clear that the NECs were derived from the same commercial product obtained across different lots/batches.

      Line 129 "Viral replication kinetics" Need to highlight that this is detection of secreted viral genomes. which is a proxy measure for replication and dissemination in the culture. Direct measurement of the extent of infection is not being made nor can be interpreted.

      Response: We agree and have revised the text and figure legend to clarify that the RT-qPCR measurements represent extracellular viral genome copies released into the apical supernatant and therefore provide a proxy measure of viral RNA output and dissemination within the culture rather than a direct measurement of infection extent. To better contextualise infection dynamics, we have complemented the RT-qPCR data with two additional measures of viral burden. First, we quantified SARS-CoV-2 reads directly from the RNA-seq libraries to estimate intracellular viral RNA abundance in the sequenced samples. Second, we measured infectious virus titres (PFU ml⁻¹) by plaque assay. These complementary analyses were presented in Supplementary Figure 1 and allow us to distinguish extracellular viral RNA release, intracellular viral RNA abundance, and infectious virus production. The revised manuscript explicitly acknowledges that bulk multi-omics measurements reflect mixtures of infected and bystander epithelial cells and therefore capture the integrated epithelial response to infection rather than the exact proportion of infected cells.

      Line 149 "Differentially expressed genes (DEGs)" What is the comparison group? The figure legend/design suggests that IFNa treatment. Is there a matched uninfected control for each timepoint as well? Later experiments specify the comparison group. Text should be clarified here for transparency.

      Response: We thank the reviewer for highlighting that the comparison group was not clearly described in the Results section. Differential expression analysis was performed by comparing each variant-infected condition with mock-infected control samples collected at 24 h. The same mock reference was used for comparisons at both 24 and 72 hpi. IFNα-treated samples were analysed separately and were not used as the reference condition for DEG identification. We have clarified this explicitly in the revised Results and Methods sections.

      Line 224 and Figure 4B - I don't see the value of the "merged NES" values given these are only aggregate of the Pre-Omicron and Omicron species. If you had compared multiple D614G and Delta variants, then there would be utility.

      Response: We agree that the merged NES values provide only a broad visual summary and that the most informative comparisons are at the individual variant level. In the revised manuscript we reduced the emphasis on the merged analysis and clarify in both the Results text and the figure legend that interpretation is primarily based on the variant-specific enrichment profiles, with lineage grouping shown only as a visual summary.

      Line 261 "quantified at 24 hpi" Why this timepoint? Changes were minor and not representative to extensive infection.

      Response: We thank the reviewer for this comment. The amino acid measurements were performed at 24 hpi to capture early metabolic responses to infection, in parallel with the phosphoproteomic analysis performed at the same time point. We agree that at this stage of infection the NEC cultures likely contain mixtures of infected and bystander epithelial cells, and therefore the amino acid measurements reflect the integrated metabolic state of the culture rather than infected cells alone. We clarified this rationale and limitation in the revised Results and Discussion sections.

      Line 268 "rather than variation in cell number." I appreciate the rigor and control of experimentation. And how many of those cells are infected? That is not controlled.

      Response: We thank the reviewer for this important point. We agree that normalisation to viable cell number does not control for infection heterogeneity within the cultures. In the revised manuscript, we revised this sentence to clarify that the amino acid measurements were normalised for cell number, but that, because they were obtained from bulk cultures at 24 hpi, they reflect the integrated metabolic state of infected and bystander cells rather than infected cells alone.

      Line 428-429 "direction of regulation" This seems like an over-interpretation of the data. You have performed pathway analysis based on the quantity of RNA transcription detected in sequencing then imputing an interpretation of regulation. Without pulse labeling of metabolic standards or kinetic analysis of metabolite quantity, it is difficult to assert regulatory direction.

      Response: We agree with the reviewer and have revised this wording accordingly. In the revised manuscript, we avoided describing pathway-level RNA-seq enrichment as direct regulation in a mechanistic sense. Instead, we refered more cautiously to positive or negative pathway enrichment based on transcript abundance patterns, which more accurately reflects the information provided by the enrichment analysis.

      Referee cross-commenting I am in agreement with the comments and suggestions of Reviewer #2 and #3. In particular, the comment of Reviewer #3 to estimate viral replication from the RNASeq data is quite valuable to begin addressing some of the concerns about the extent of viral replication. It does not negate the need to further assess productive viral titer (PFU/mL) or the extent of viral infection (immunofluorescence or flow cytometry). I also agree with Reviewer #3 regarding the extent of mechanistic interpretation that can be drawn from the current study. This concern can largely be addressed through revision of the text and a tempering of the interpretations that are drawn. I also agree and appreciate the detailed analysis of reviewer #2 regarding the inconsistencies between the text and the figures. It is critically important to be consistent in the data and presentation of these complex experiments. Resolving these issues will only strengthen the work.

      Response: We thank the reviewer for these additional comments and for highlighting the useful points raised by Reviewers #2 and #3. In line with these suggestions, we quantified viral reads directly from the RNA-seq libraries to provide an estimate of infection burden in the sequenced samples and included infectious virus titre measurements (PFU/ml) to complement the existing replication analyses. We agree that the current dataset supports a comparative systems-level analysis rather than strong mechanistic conclusions, and we therefore tempered the interpretation throughout the manuscript. Finally, we carefully reviewed and revised the figures, legends, and associated text to ensure consistency and clarity in the presentation of the data.

      Reviewer #1 (Significance (Required)):

      The work detailed in this manuscript is takes a very broad approach to identify differences in the effects of SARS-CoV-2 variant infections. Elements of this work have been published, including transcriptomics, metabolomics, and phosphoproteomics. This work is significant in that multiple variants are evaluated with comparable methods in the very relevant human nasal epithelial cell model. The use of this model, and the direct integration of multiple -omics, sets this work apart from previously published studies. This cross-omic analysis, with the IFN-treated controls, provides a robust foundation of data that can be used to detail the differences in the response to the SARS-CoV-2 variant infections. That said, a significant limitation to the study was the low MOI used to initiate infection and the lack of detailed analysis infection progression of the different variants. Further, there is limited comparison of the IFN-treatment condition in relation to the transcriptional changes, and no inclusion of IFN-controls in the other methods. Both of these limitations undercut the potential significance of the paper and its findings. Audience: This work will have be important to bench researchers interested in further characterizing and comparing the effects of SARS-CoV-2 infection. Potentially, clinicians involved in diagnostics will find utility in the study of changes for potential biomarker analysis for severe COVID19 disease. My expertise is the field of virology, having studying multiple RNA and DNA viruses, including SARS-CoV-2, to understand virus-cell interactions. My focus includes primary cell culture models of infection, proteomic and metabolic analysis of infection induced changes, and monitoring the spread of viral infection through direct and indirect measurements.

      Response: We thank the reviewer for the positive assessment of the significance of the study and for recognising the value of the integrated multi-omics analysis in a physiologically relevant human nasal epithelial cell model. We also appreciate the reviewer's constructive comments regarding infection burden and the interpretation of the IFNα reference condition. As noted above, the reported MOI of 0.01 was a typographical error and was corrected to 0.1 throughout the manuscript. To further address the reviewer's concerns regarding infection extent, we quantified viral reads directly from the RNA-seq libraries and include infectious virus titre measurements (PFU/ml) as an additional measure of productive infection. We also expanded the analysis of IFNα-treated samples to include differential expression and pathway-level comparisons, allowing more direct contextualisation of virus-induced transcriptional responses relative to a canonical interferon-stimulated programme. We believe that these revisions strengthen the interpretability and overall significance of the study.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors of the manuscript entitled "Evolutionary rewiring of host metabolism and interferon signalling by SARS-CoV-2 variants" investigated the diversity of different SARS-CoV-2 isolates regarding gene expression, kinase activity and amino acid profiles in infected primary human nasal epithelial cells. Somji et al. found certain distinct alterations of measured factors after infections compared to mock and differences in cells infected with the mentioned different SARS-CoV-2 isolates. The topic of the manuscript as such is of high importance since understanding virus host interactions in general and virus host coevolution particularly on the level of cellular metabolism and beyond comes with great potential in deeper understanding the infection biology of viral invaders. Nevertheless, the study needs to be enlarged and further defined, the experimental set up has to be improved and the drawn conclusions have to be proven by experiments. The presentation of the obtained data needs to be improved, checked and carefully chosen to allow the reader to follow the article in a much more guided way. At this stage of experimental data depth, presentation and interpretation, there is room for certain overinterpretations of the biological meanings of the presented data.

      Response: We thank Reviewer #2 for the careful evaluation of the manuscript and for recognising the relevance of the topic. We agree that the original submission required clearer presentation, stronger contextualisation of the experimental design, and more cautious interpretation. In response, we revised the figures, legends, and linked text; clarify the number of biological and technical replicates for each experiment; added viral RNA read quantification from the RNA-seq libraries and infectious virus titres (PFU/ml); expanded the IFNα-related analyses; and moderated the conclusions throughout. We believe these revisions directly address the reviewer's core concerns.

      The authors state about virus growth kinetics in Fig.1. To be able to do so in full extend, virus particle counts (PFU/ml) need to be measured and included in this data set.

      Response: We agree and added the infectious virus titre measurements (PFU/ml) to complement the RT-qPCR genome measurements (Supplementary Figure 1).

      From Fig.2 on, the presentation and introduction of the data set is often very hard to follow. Certain panel labeling is not correct e.g. in Figure 2, Figure 2A is not introduced, 72h data are linked to Figure 2C but 2C is a Venn diagram of 24h gene expression downregulation. The Venn diagrams are not mentioned in the text at all. This problem is occurring at different occasion, which makes it hard to impossible to follow the experimental flow of the study. Therefore, a complete revision of the data presentation within the figures and the linked text is needed. Further example, lines 213 and 224, Figure 4B two times mentioned with different data supposed to be shown in Fig. 4B.

      Response: We thank the reviewer for identifying these issues. We comprehensively revised the figure panel labelling, figure legends, and linked text to ensure consistency and readability throughout the manuscript.

      The authors are inconsistent with including statistics in their figures. Please include all statistics in your figures to allow the reader to get this information. Please declare how often and how each experimental set has been done and clarify e.g. in the figure legends. In addition, please improve the figure quality for better allowance of cross comparability of data sets. As example, used the same x-axis scale for all graphs in Fig 4.

      Response: We agree and have revised the figures and legends accordingly. Statistical annotations have been added in the results section, and full values associated with the pathway enrichment analysis are now reported in Supplementary Table S2. For the amino acid measurements, individual biological replicate values are now displayed in the figure panels rather than only summary statistics. Replicate numbers and experimental design (biological replicates, technical replicates, and donor batches where relevant) are now explicitly stated in the figure legends and Methods section.

      To improve comparability across datasets, figure formatting was standardised throughout the manuscript. In particular, the x-axis scales in Figure 4 (below) were harmonised across panels to allow direct comparison of normalised enrichment scores between variants and time points. Additional adjustments were made to improve figure clarity, including consistent axis labelling, colour scales, and panel annotations.

      The authors create claims about metabolic profiles without measuring deeper metabolic circumstances. Why are only amino acids measured and not metabolite concentrations in general. Metabolic gene expressions as measurement of metabolic pathway activities can be strongly misleading since gene expression per definition does not necessarily mean enzyme activity, which of course is finally important for pathway activity as well.

      Response: We agree that amino acid profiling represents a targeted metabolic readout rather than a comprehensive metabolomic analysis, and that transcript abundance does not directly equal enzyme activity or flux. We have revised the manuscript throughout to reflect this limitation more clearly and to expand the Discussion to place our targeted amino acid data and pathway enrichment analyses in the appropriate context.

      The authors need to carefully crosslink the obtained data sets. As an easy example, how much of the found differences in gene expression, pathway activities etc. is due to viral growth differences. With other words, are there regulatory differences or are the differences seen due to different growth kinetics. Are ISG expression level linked to virus growth? These type of questions not be asked and correlations need to done by the authors to guide the reader through all those assays conducted in this study.

      Response: We agree that infection burden is an important variable when interpreting bulk multi-omics datasets obtained from infected epithelial cultures. To address this, we incorporated two additional measures of viral burden into the revised manuscript. First, SARS-CoV-2 reads were quantified directly from the RNA-seq libraries to estimate intracellular viral RNA abundance in the sequenced samples. Second, infectious virus titres (PFU ml⁻¹) were measured by plaque assay. These complementary datasets are presented in Supplementary Figure 1.

      In the revised manuscript, these measures are used to contextualise the transcriptomic and pathway analyses. Intracellular viral RNA reads increased across variants between 24 and 72 hpi and reached broadly comparable levels by 72 hpi, whereas infectious virus production differed between variants, with Delta producing the highest titres at later time points. We therefore revised the Results and Discussion to explicitly acknowledge that bulk transcriptomic, signalling and metabolic signatures may reflect both infection burden and variant-specific regulatory differences. For example, ISG induction at 72 hpi is discussed in the context of similar intracellular viral RNA levels across variants, indicating that differences in interferon-responsive transcription are not explained solely by viral RNA abundance.

      More broadly, we now emphasise that NEC cultures contain mixtures of infected and bystander epithelial cells and that the multi-omics datasets capture integrated epithelial responses rather than cell-intrinsic responses alone. These revisions strengthen the crosslinking between infection dynamics and host-response datasets while avoiding overinterpretation of variant-specific regulatory mechanisms.

      .

      Referee cross-commenting I do fully agree with reviewer 1 and 3 in terms of the importance of much more comprehensive data on virus growth. Measurement of real virus progeny (PFU/ml) and viral protein and RNA expression is needed to state about the importance of altering viral dynamics for interpreting the findings. I do fully agree with reviewer 1 and 3 that data analysis, presentation and interpretation has to be improved. Information such as how often has each experiment been done and how has the experimental set up been constructed has to be clarify e.g. in the figure legends. As reviewer 1 mentioned, direct analysis of metabolite concentrations is needed to be able to judge about metabolic changes driven by the different SARS CoV-2 variants. In line with both, reviewer 1 and 3, conclusions drawn by the authors should be toned down. More data and improved data analysis and presentation are needed to foster the conclusions drawn .

      Response: We thank the reviewer for these additional comments. In response, we added PFU/ml and RNA-seq-derived viral read data, improved experimental detail in the Methods and figure legends, clarified the scope and limitations of the amino acid measurements, and substantially moderated the interpretation throughout the manuscript.

      Reviewer #2 (Significance (Required)):

      While the topic as such is interesting and hoighly relevant, the manuscript has several major flaws, both with regard to paper organisation and content. In the current state it is hard to judge, whether the data are of significance.

      Response: We appreciate this assessment and hope that the extensive revisions in response to the reviewers' comments make the organisation, data presentation, and significance of the study much clearer.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript entitled "Evolutionary rewiring of host metabolism and interferon signalling by SARS-CoV-2 variants" by Somji and colleagues sets out to understand SARS-COV-2 variant biology in primary nasal epithelial cells. Understanding this and differences in variant-specific host-virus interactions is essential to understand the molecular basis of replication advantages and enhanced transmission that ultimately lead to variant dominance. The authors employ global transcriptomic, phosphor-proteomic and amino acid metabolism assays with the aim to identify variant-specific changes to cell metabolism and innate immune activation in a comparative systems-level approach. Importantly, this work is performed in primary nasal epithelial cells. It is essential to understand variant biology in the context of relevant primary cell infection models and NECs are a great choice to address the proposed research question. The work is conceptually interesting, but largely descriptive. While this can still be useful for the field, it requires appropriate framing of the interpretations of the data. I agree with the authors that there will be virus- specific signatures that will contribute to variant fitness, but this dataset makes it hard to draw strong conclusions. The main problem with the manuscript and the interpretation are dramatic differences in viral replication. While some of the conclusions are tantalising and would warrant further investigation, I would expect to see some experimental validation to substantiate the interpretation. In the absence of experimental validations and mechanism, the conclusions should be stated as such and contextualised more with previously published work.

      Response: We thank Reviewer #3 for the thoughtful and constructive assessment. We agree that the study is primarily comparative and systems-level in nature and that the original submission overreached in parts of the interpretation. In response, we moderated the conclusions and reframed the manuscript as a comparative analysis of lineage-associated host-response signatures that generates mechanistic hypotheses for future work, rather than claiming definitive causal mechanisms. We also added additional data on viral burden, revised the analysis description, improved figure presentation, and expanded the contextualisation with previously published work.

      A major concern that I have is the analysis of the RNASeq data. Experimental design, analysis and presented data require some clarification: Too little experimental detail for the RNASeq data is given. How many replicates were sequenced/analysed? The figure legend state three independent experiments - but how many individual replicate transwells per condition (and NEC batch) were used? This information needs to be included in the manuscript. Generally, clarification on how many replicates were used per experiment needs to be included in the figure legends for all data panels.

      Response: We thank the reviewer for highlighting the need for clearer reporting of RNA-seq experimental design and replicate structure. We have revised the manuscript to explicitly clarify replicate numbers, experimental batches, and sequencing quality control. RNA-seq experiments were performed using three independent batches of donor-pooled nasal epithelial cultures (MucilAir{trade mark, serif}). For most infection conditions and time points, two to three biological replicate transwells were sequenced per condition derived from independent NEC culture batches. A small number of libraries did not pass sequencing quality control thresholds (e.g. insufficient sequencing depth or technical library failure) and were therefore excluded from downstream analysis, resulting in minor variation in replicate numbers across conditions. To improve transparency, sequencing depth and library quality metrics for all RNA-seq libraries are now provided in Supplementary Table S6. In addition, principal component analysis (PCA) of the RNA-seq dataset has been included as Supplementary Figure S5 to illustrate sample clustering and replicate consistency across conditions. All raw sequencing data, sample metadata, and replicate information are available in the GEO repository under accession number GSE271378. We have also revised the Methods and figure legends throughout the manuscript to explicitly state replicate numbers for each dataset.

      .

      The reported log2 fold changes are beyond what is biologically reasonable. A log2 fold change of 120 or even 30 (Fig.3D, suppl table) indicates issues with the data analysis. It is worth revisiting the analysis and additionally inclusion of some QC data would be helpful (e.g. PCA of the data). Furthermore, viral genome data should be extracted from the RNASeq data to give an indication of infection levels in the relevant samples rather than just relying on a representative graph (Fig.1B).

      Response: Extremely large log₂ fold-change values can arise in RNA-seq analyses when strongly inducible genes are compared to control samples with very low baseline expression. This is common for interferon-stimulated genes (ISGs), which are often undetectable or expressed at near-background levels in mock samples but become highly induced following interferon signalling or viral infection. Similar magnitudes of induction have been reported in transcriptomic studies of interferon responses and SARS-CoV-2 infection of NEC (e.g. Hatton et al., 2021 (PMID: 34876592); Ziegler et al; 2021 (PMID: 34352228); Sharif-Askari et al., (PMID: 36415751) and other.

      To improve clarity, we have revisited the analysis and revised the visualisation of the RNA-seq data. Plotting scales and figure annotations have been adjusted to avoid misleading representation of extreme fold changes. In addition, we have included additional quality-control information for the RNA-seq dataset. Principal component analysis (PCA) of the RNA-seq samples has been added as Supplementary Figure S5 to illustrate sample clustering and replicate consistency, and sequencing depth and quality metrics for all libraries are provided in Supplementary Table S6.

      As suggested by the reviewer, we also quantified viral genome reads directly from the RNA-seq libraries to estimate intracellular viral RNA abundance in the sequenced samples. These data are now presented in Supplementary Figure 1 and discussed in the Results to contextualise infection levels across conditions alongside the extracellular viral RNA measurements shown in Figure 1B.

      Please include virus replication data for all experiments. Only one replication graph is shown (Fig. 1B), but infection level/virus release should be reported for every assay as responses will of course be dependent on how much virus/how many infected cells are present. A difficulty in understanding variant specific host responses in comparative approaches is differences in infection levels. In line with other published work, Fig.1B shows dramatic differences in variant replication. The differences measured at 1hpi indicate issues with input normalisation, this will have a knock-on effect for later replication and ultimately will further increase differences in infected cell counts. L340-342 "These transcriptional shifts occurred despite broadly comparable viral loads across lineages at 24-72 hpi, suggesting that replication level alone does not account for the observed metabolic divergence." - I strongly disagree with this interpretation. The viral loads are clearly not comparable. A 2 log10 difference in virus release is a large difference that will affect the comparison of host response. These replication difference are to be expected and have been previously reported by others. Ancestral variants infect fewer cells compared to Omicron variants. This needs to be acknowledged. In a bulk RNASeq/phopshoproeomic/metabolic measurement the number of infected and uninfected bystander cells across variants will inevitably result in the identification of at least some host responses that correlate with infection levels rather than with specific biology exploited by a variant. The authors must acknowledge this and discuss the contribution of infected vs bystander cells.

      Response: We thank the reviewer for this important point. All downstream analyses in this study (RNA-seq, phosphoproteomics and amino acid profiling) were performed on matched cultures from the same infection experiment; therefore, the replication kinetics shown in Figure 1B represent the infection conditions for all assays. We clarified this explicitly in the Methods and figure legends.

      We agree that differences in viral replication across variants are important when interpreting host responses. The statement suggesting broadly comparable viral loads was removed. We also included PFU/ml measurements and quantify viral reads extracted from the RNA-seq libraries to provide additional estimates of infection burden. Finally, we expanded the Discussion to acknowledge that bulk omics measurements reflect a mixture of infected and bystander cells and that some observed host responses may partly correlate with differences in infection levels across variants.

      Include individual data points to show the spread of the data overall (Fig 6A). Just showing the mean without an indication of how many measurements were taken and the variation in the data makes it hard for the reader to interpret the data.

      Response: We agree that the variability across replicates should be indicated. We added individual data points and error bars to Figure 6A (below) and clarified the replicate structure in the figure legend and Methods. Amino acid measurements were performed using four biological replicates per condition, each processed in duplicate technical measurements that were averaged prior to statistical analysis.

      The choice of 24h for the amino acid abundance analysis needs some further justification. At 24h, some variants will only have infected very few cells. What would this mean for a bulk measurement? Do the authors suggest that there were changes to aa-metabolism in uninfected bystander cells? Would true differences in aa-metabolism in the infected cells be masked by the surrounding uninfected cells?

      Response: We thank the reviewer for this important point. We selected 24 hpi to capture early metabolic responses, in parallel with the phosphoproteomic analyses, before the later-stage transcriptional divergence became dominant. We agree, however, that at this timepoint the amino acid measurements likely reflect the integrated state of both infected and bystander cells within the cultures. We clarified this explicitly in the revised manuscript and discussed this as an important limitation of the bulk metabolite measurements.

      The framing of Alpha and Beta as pre-Omicron is confusing. IC19 and Delta are both equally pre-Omicron variants. Please consider rewording.

      Response: We agree that this terminology is potentially confusing. In the revised manuscript, we used more precise lineage descriptions throughout, distinguishing IC19 as the reference/early strain, Alpha and Beta as earlier VOCs, Delta as a separate later pre-Omicron VOC, and BA.1/BA.5 as Omicron subvariants.

      The Venn diagram in Fig. 2B/C is hard to interpret. How were the percentages calculated? From the total number of DEG across all variants? If so, this would inflate the proportion attributed to the conditions that showed the largest number of DEG genes and shrink the proportion for the conditions with less signal. An UpSet plot might be a better choice to represent the data.

      Response: We thank the reviewer for this helpful suggestion. The overlap values in Fig. 2B-C were generated using __InteractiVenn____, __which calculates set intersections and reports them as percentages relative to the total union of differentially expressed genes across all variants at the respective time point. We clarified this explicitly in the figure legend and Methods. We agree that multi-set Venn diagrams can be difficult to interpret when DEG set sizes differ substantially, and we revised the figure legends and associated text to improve clarity of presentation.

      The interpretation of the data as presented requires more mechanistic validation. As it stands, activation of metabolic pathways, or the contribution of the observed phospho changes to variant biology, is not functionally linked to infection outcome. In the absence of more experimental data, the conclusions should be toned down. (For example L330-332 "These patterns suggest that Omicron can replicate despite ongoing cytokine signalling, whereas Delta infection favours stress- and growth-linked pathways to sustain replication.")

      Response: We agree and substantially toned down these statements. The revised manuscript presents these data as comparative host-response signatures rather than mechanistically validated pathways driving infection outcome.

      L440-442 "Similarly, replicate-level variability and confidence intervals for NES values were not plotted, as the scores reflect ranked enrichment rather than absolute expression magnitude." - What do the authors mean by replicate-level variability? I assume the NES was calculated based on fold change which are not replicate-level?

      Response: We thank the reviewer for pointing out this lack of clarity. The previous wording referring to "replicate-level variability" was removed. We now clarify that NES values were calculated from ranked differential expression outputs, with nominal p-values estimated by permutation and FDR-(adjusted p-values) reported in Supplementary Table S2, together with leading-edge genes for each pathway, variant and time point.

      Differences in Oxphos have been reported by others (https://www.sciencedirect.com/science/article/pii/S2589004224012343 ). This study and others should be included in the discussion.

      Response: We thank the reviewer for highlighting this study. We have now included this in the Discussion to place our findings in the context of previous studies of SARS-CoV-2 infection in nasal epithelial cultures.

      Can the authors speculate whether the innate immune response observed links to the metabolic changes reported?

      Response: While our study does not directly establish a causal link between innate immune activation and metabolic rewiring, interferon signalling is known to influence cellular metabolism during viral infection. In our dataset, IFNα-treated cultures showed strong ISG induction but minimal enrichment of the metabolic pathways analysed here (new Supplementary Figure 4), suggesting that interferon signalling alone does not fully account for the metabolic signatures observed during SARS-CoV-2 infection. These observations support the idea that the metabolic changes detected likely reflect a combination of viral replication demands and host antiviral signalling rather than interferon activation alone. We have added a brief clarification in the Discussion to acknowledge this relationship.

      Overall, in the discussion the data should be contextualised with results from other studies. Particularly work focussing on primary airway epithelial cells and variant infections.

      Response: We agree and expanded the Discussion to better contextualise our results within the existing literature, particularly studies in primary airway epithelial models.

      Please provide more detail on how the merged NES was calculated for Alpha/Beta and BA.1/BA.5. For Fig. 4, either the merged or the unmerged NES data would be sufficient, rather than including both analyses. Enrichment of pathways would benefit from indicating which genes associated have been detected and how they functionally might contribute.

      Response: We thank the reviewer for this helpful suggestion. In the revised manuscript, we simplified the presentation of the pathway enrichment analysis by focusing primarily on the variant-level NES profiles (Figure 4A), while retaining the grouped lineage visualisation (Figure 4B) only as a simplified overview. The merged NES values were calculated by averaging the normalised enrichment scores of the corresponding variants within each lineage group (Alpha/Beta for pre-Omicron and BA.1/BA.5 for Omicron). To improve interpretability, we now report the leading-edge genes contributing to each enriched pathway in Supplementary Table S2.

      Please include how ISGs were defined for the analysis of Fig. 3.

      Response: In our analysis, interferon-stimulated genes (ISGs) were defined based on the transcriptional response of nasal epithelial cells to IFN-α stimulation, which served as a benchmark condition for interferon-responsive gene expression. Genes significantly up-regulated in IFN-α-treated samples relative to mock controls were used to define the ISG set analysed in Fig. 3. We clarified this definition and the selection criteria in the Methods and figure legend.

      Please clarify for each experiment how many replicates/measurements were taken. This information should be included in the figure legend. If data/or measurements were excluded, this should also be highlighted. From the supplementary data (amino acid data; qPCR vs RNASeq) there seems to be variation in the amount of reported measurements (aa-metabolism: 7 vs 8 measurements; RNASeqvsqPCR: 37 vs 39 measurements).

      Response: We agree and have revised the Methods, figure legends, and supplementary information to clarify replicate numbers and any exclusions.

      • 3D: The colour scaling used for log2FC is unbalanced. Consider using different gradings.*

      Response: We thank the reviewer for this suggestion. The colour scale in Fig. 3D was intentionally asymmetric because several IFNα-responsive genes show extremely large log₂fold changes due to very low baseline expression in mock samples. Using a symmetric colour scale would compress the dynamic range of the virus-infected conditions and obscure biologically meaningful differences between variants. To avoid confusion, we clarified the rationale for the colour scaling in the figure legend and ensure that the scale is clearly labelled.

      In the NES analysis, I would expect an indication of the leading edge in the figure or in the supplementary data.

      Response: We agree. Relevant leading-edge information has been added to the supplementary table S2 and is referenced in the revised Methods and Results.

      Several figures would benefit from inclusion of p-values/indication of significance (Fig. 3D, 5B, 6A/C).

      Response: We agree and we have added statistical information where appropriate including the supplementary material.

      Fig .6D requires some more explanation as to what it is shown in the figure. Statistics should be included to confirm that there are no overall differences between conditions.

      Response: We agree and expanded the explanation of Figure 6D. Amino acid levels were normalised to the total intracellular amino acid pool within each condition to evaluate proportional composition independent of total abundance. We also included statistical analysis of the normalised amino acid proportions using a Friedman test, which detected modest but significant differences across conditions (χ² = 15.33, p = 0.004). These differences reflect small shifts in a limited number of amino acids rather than major changes in overall amino acid composition. The statistical analysis and clarification have been added to the Results, Figure 6 legend, and Supplementary Table S4.

      L266-269 "All amino acid measurements were expressed as nmol per 10⁶ viable, counted cells, and viability at 24 hpi was comparable across conditions, indicating that differences in abundance reflect infection-driven metabolic changes rather than variation in cell number." - Data should be included.

      Response: We thank the reviewer for this comment. In differentiated air-liquid interface nasal epithelial cultures, cells form a structured epithelium attached to the transwell membrane and cannot be routinely counted without dissociation of the insert, which would disrupt the culture and preclude subsequent metabolic analysis. For this reason, individual experimental inserts used for amino acid measurements were not dissociated. Instead, representative inserts were dissociated to verify epithelial cell numbers. Dissociation of one mock control and one IC19-infected insert yielded comparable counts of 0.9-1.10 × 10⁶ epithelial cells per insert, confirming that each transwell contains approximately 10⁶ epithelial cells. Amino acid measurements were therefore normalised to epithelial input and reported as nmol per 10⁶ cell equivalents. The manuscript text was revised to clarify this normalisation and avoid implying routine viable cell counting of each insert.

      L270-273: "The variant-amino acid interaction network (Figure 6B) visualises these differences by linking each variant to its most strongly altered amino acids. Edge width reflects the absolute log fold change, and colour indicates direction (red for increases, blue for decreases relative to mock)." The network figure does not add any additional information that is not already contained in Fig. 6C. Consider removing this panel.

      Response: We thank the reviewer for this suggestion. While the quantitative differences in amino acid abundance are shown in Fig. 6C, the network representation in Fig. 6B was included to highlight variant-metabolite relationships and to visualise which amino acids show the strongest associations with individual viral lineages. This representation facilitates comparison of the pattern of metabolic alterations across variants rather than only their magnitude. To avoid redundancy, we clarified this purpose in the figure legend and streamline the figure presentation.

      For Fig. 6C: The colour scale for the legend is imbalanced starting at -1 with a mid point at 0 and the max at 0.35.

      Response: We thank the reviewer for noting this point. The colour scale reflects the observed range of log₂ fold changes in the dataset, where decreases in amino acid abundance were larger in magnitude than increases. As a result, the scale is asymmetric. To avoid confusion, we clarified the colour scale in the figure legend and ensure that it is clearly labelled to reflect the underlying data distribution.

      L428-41 "Dot colour indicated the direction of regulation (red, up-regulated; blue, down-regulated), and dot size was proportional to the absolute NES value. Vertical reference lines at NES = 0 were included to indicate neutral enrichment." This does not describe the data that is presented in the figure.

      Response: We thank the reviewer for pointing out that the previous description did not accurately reflect the graphical representation. Figure 4 has been revised to clarify how pathway enrichment is displayed. Dot position now represents the normalised enrichment score (NES) on a common scale across all panels, and dot colour indicates the direction of enrichment (red = positive enrichment, blue = negative enrichment). A shaded central region highlights limited enrichment around NES = 0, and the scale at the bottom indicates thresholds used to categorise moderate and strong enrichment. The figure legend and Methods description have been updated accordingly.

      Reviewer #3 (Significance (Required)):

      My criticisms are in part outlined above. While the central question of the study is important and timely, the data reported is largely incremental and lacks mechanistic insight.

      Response: We thank the reviewer for this candid assessment. We agree that the present study is not a mechanistic dissection of individual pathways, but rather a comparative systems-level analysis of lineage-associated host-response patterns in a physiologically relevant NEC model. We have revised the manuscript to better reflect this scope and to avoid overstatement of mechanistic inference.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript entitled "Evolutionary rewiring of host metabolism and interferon signalling by SARS-CoV-2 variants" by Somji and colleagues sets out to understand SARS-COV-2 variant biology in primary nasal epithelial cells. Understanding this and differences in variant-specific host-virus interactions is essential to understand the molecular basis of replication advantages and enhanced transmission that ultimately lead to variant dominance. The authors employ global transcriptomic, phosphor-proteomic and amino acid metabolism assays with the aim to identify variant-specific changes to cell metabolism and innate immune activation in a comparative systems-level approach. Importantly, this work is performed in primary nasal epithelial cells. It is essential to understand variant biology in the context of relevant primary cell infection models and NECs are a great choice to address the proposed research question.

      The work is conceptually interesting, but largely descriptive. While this can still be useful for the field, it requires appropriate framing of the interpretations of the data. I agree with the authors that there will be virus- specific signatures that will contribute to variant fitness, but this dataset makes it hard to draw strong conclusions. The main problem with the manuscript and the interpretation are dramatic differences in viral replication. While some of the conclusions are tantalising and would warrant further investigation, I would expect to see some experimental validation to substantiate the interpretation. In the absence of experimental validations and mechanism, the conclusions should be stated as such and contextualised more with previously published work.

      Major:

      1. A major concern that I have is the analysis of the RNASeq data. Experimental design, analysis and presented data require some clarification: Too little experimental detail for the RNASeq data is given. How many replicates were sequenced/analysed? The figure legend state three independent experiments - but how many individual replicate transwells per condition (and NEC batch) were used? This information needs to be included in the manuscript. Generally, clarification on how many replicates were used per experiment needs to be included in the figure legends for all data panels.

      2. The reported log2 fold changes are beyond what is biologically reasonable. A log2 fold change of 120 or even 30 (Fig.3D, suppl table) indicates issues with the data analysis. It is worth revisiting the analysis and additionally inclusion of some QC data would be helpful (e.g. PCA of the data). Furthermore, viral genome data should be extracted from the RNASeq data to give an indication of infection levels in the relevant samples rather than just relying on a representative graph (Fig.1B).

      3. Please include virus replication data for all experiments. Only one replication graph is shown (Fig. 1B), but infection level/virus release should be reported for every assay as responses will of course be dependent on how much virus/how many infected cells are present. A difficulty in understanding variant specific host responses in comparative approaches is differences in infection levels. In line with other published work, Fig.1B shows dramatic differences in variant replication. The differences measured at 1hpi indicate issues with input normalisation, this will have a knock-on effect for later replication and ultimately will further increase differences in infected cell counts. L340-342 "These transcriptional shifts occurred despite broadly comparable viral loads across lineages at 24-72 hpi, suggesting that replication level alone does not account for the observed metabolic divergence." - I strongly disagree with this interpretation. The viral loads are clearly not comparable. A 2 log10 difference in virus release is a large difference that will affect the comparison of host response. These replication difference are to be expected and have been previously reported by others. Ancestral variants infect fewer cells compared to Omicron variants. This needs to be acknowledged. In a bulk RNASeq/phopshoproeomic/metabolic measurement the number of infected and uninfected bystander cells across variants will inevitably result in the identification of at least some host responses that correlate with infection levels rather than with specific biology exploited by a variant. The authors must acknowledge this and discuss the contribution of infected vs bystander cells.

      4. Include individual data points to show the spread of the data overall (Fig 6A). Just showing the mean without an indication of how many measurements were taken and the variation in the data makes it hard for the reader to interpret the data.

      5. The choice of 24h for the amino acid abundance analysis needs some further justification. At 24h, some variants will only have infected very few cells. What would this mean for a bulk measurement? Do the authors suggest that there were changes to aa-metabolism in uninfected bystander cells? Would true differences in aa-metabolism in the infected cells be masked by the surrounding uninfected cells?

      6. The framing of Alpha and Beta as pre-Omicron is confusing. IC19 and Delta are both equally pre-Omicron variants. Please consider rewording.

      7. The Venn diagram in Fig. 2B/C is hard to interpret. How were the percentages calculated? From the total number of DEG across all variants? If so, this would inflate the proportion attributed to the conditions that showed the largest number of DEG genes and shrink the proportion for the conditions with less signal. An UpSet plot might be a better choice to represent the data.

      8. The interpretation of the data as presented requires more mechanistic validation. As it stands, activation of metabolic pathways, or the contribution of the observed phospho changes to variant biology, is not functionally linked to infection outcome. In the absence of more experimental data, the conclusions should be toned down. (For example L330-332 "These patterns suggest that Omicron can replicate despite ongoing cytokine signalling, whereas Delta infection favours stress- and growth-linked pathways to sustain replication.")

      9. L440-442 "Similarly, replicate-level variability and confidence intervals for NES values were not plotted, as the scores reflect ranked enrichment rather than absolute expression magnitude." - What do the authors mean by replicate-level variability? I assume the NES was calculated based on fold change which are not replicate-level?

      10. Differences in Oxphos have been reported by others (https://www.sciencedirect.com/science/article/pii/S2589004224012343 ). This study and others should be included in the discussion.

      11. Can the authors speculate whether the innate immune response observed links to the metabolic changes reported?

      Minor:

      1. Overall, in the discussion the data should be contextualised with results from other studies. Particularly work focussing on primary airway epithelial cells and variant infections.

      2. Please provide more detail on how the merged NES was calculated for Alpha/Beta and BA.1/BA.5. For Fig. 4, either the merged or the unmerged NES data would be sufficient, rather than including both analyses. Enrichment of pathways would benefit from indicating which genes associated have been detected and how they functionally might contribute.

      3. Please include how ISGs were defined for the analysis of Fig. 3.

      4. Please clarify for each experiment how many replicates/measurements were taken. This information should be included in the figure legend. If data/or measurements were excluded, this should also be highlighted. From the supplementary data (amino acid data; qPCR vs RNASeq) there seems to be variation in the amount of reported measurements (aa-metabolism: 7 vs 8 measurements; RNASeqvsqPCR: 37 vs 39 measurements).

      5. Fig. 3D: The colour scaling used for log2FC is unbalanced. Consider using different gradings.

      6. In the NES analysis, I would expect an indication of the leading edge in the figure or in the supplementary data.

      7. Several figures would benefit from inclusion of p-values/indication of significance (Fig. 3D, 5B, 6A/C).

      8. Fig .6D requires some more explanation as to what it is shown in the figure. Statistics should be included to confirm that there are no overall differences between conditions.

      9. L266-269 "All amino acid measurements were expressed as nmol per 10⁶ viable, counted cells, and viability at 24 hpi was comparable across conditions, indicating that differences in abundance reflect infection-driven metabolic changes rather than variation in cell number." - Data should be included.

      10. L270-273: "The variant-amino acid interaction network (Figure 6B) visualises these differences by linking each variant to its most strongly altered amino acids. Edge width reflects the absolute log₂ fold change, and colour indicates direction (red for increases, blue for decreases relative to mock)." The network figure does not add any additional information that is not already contained in Fig. 6C. Consider removing this panel.

      11. For Fig. 6C: The colour scale for the legend is imbalanced starting at -1 with a mid point at 0 and the max at 0.35.

      12. L428-41 "Dot colour indicated the direction of regulation (red, up-regulated; blue, down-regulated), and dot size was proportional to the absolute NES value. Vertical reference lines at NES = 0 were included to indicate neutral enrichment." This does not describe the data that is presented in the figure.

      Significance

      My criticisms are in part outlined above. While the central question of the study is important and timely, the data reported is largely incremental and lacks mechanistic insight.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors of the manuscript entitled "Evolutionary rewiring of host metabolism and interferon signalling by SARS-CoV-2 variants" investigated the diversity of different SARS-CoV-2 isolates regarding gene expression, kinase activity and amino acid profiles in infected primary human nasal epithelial cells. Somji et al. found certain distinct alterations of measured factors after infections compared to mock and differences in cells infected with the mentioned different SARS-CoV-2 isolates.

      The topic of the manuscript as such is of high importance since understanding virus host interactions in general and virus host coevolution particularly on the level of cellular metabolism and beyond comes with great potential in deeper understanding the infection biology of viral invaders.

      Nevertheless, the study needs to be enlarged and further defined, the experimental set up has to be improved and the drawn conclusions have to be proven by experiments. The presentation of the obtained data needs to be improved, checked and carefully chosen to allow the reader to follow the article in a much more guided way. At this stage of experimental data depth, presentation and interpretation, there is room for certain overinterpretations of the biological meanings of the presented data.

      Please find a detailed list of comments for the consideration of the authors below.

      1. The authors state about virus growth kinetics in Fig.1. To be able to do so in full extend, virus particle counts (PFU/ml) need to be measured and included in this data set.

      2. From Fig.2 on, the presentation and introduction of the data set is often very hard to follow. Certain panel labeling is not correct e.g. in Figure 2, Figure 2A is not introduced, 72h data are linked to Figure 2C but 2C is a Venn diagram of 24h gene expression downregulation. The Venn diagrams are not mentioned in the text at all. This problem is occurring at different occasion, which makes it hard to impossible to follow the experimental flow of the study. Therefore, a complete revision of the data presentation within the figures and the linked text is needed. Further example, lines 213 and 224, Figure 4B two times mentioned with different data supposed to be shown in Fig. 4B.

      3. The authors are inconsistent with including statistics in their figures. Please include all statistics in your figures to allow the reader to get this information. Please declare how often and how each experimental set has been done and clarify e.g. in the figure legends. In addition, please improve the figure quality for better allowance of cross comparability of data sets. As example, used the same x-axis scale for all graphs in Fig 4.

      4. The authors create claims about metabolic profiles without measuring deeper metabolic circumstances. Why are only amino acids measured and not metabolite concentrations in general. Metabolic gene expressions as measurement of metabolic pathway activities can be strongly misleading since gene expression per definition does not necessarily mean enzyme activity, which of course is finally important for pathway activity as well.

      5. The authors need to carefully crosslink the obtained data sets. As an easy example, how much of the found differences in gene expression, pathway activities etc. is due to viral growth differences. With other words, are there regulatory differences or are the differences seen due to different growth kinetics. Are ISG expression level linked to virus growth? These type of questions not be asked and correlations need to done by the authors to guide the reader through all those assays conducted in this study.

      Referee cross-commenting

      I do fully agree with reviewer 1 and 3 in terms of the importance of much more comprehensive data on virus growth. Measurement of real virus progeny (PFU/ml) and viral protein and RNA expression is needed to state about the importance of altering viral dynamics for interpreting the findings.

      I do fully agree with reviewer 1 and 3 that data analysis, presentation and interpretation has to be improved. Information such as how often has each experiment been done and how has the experimental set up been constructed has to be clarify e.g. in the figure legends.

      As reviewer 1 mentioned, direct analysis of metabolite concentrations is needed to be able to judge about metabolic changes driven by the different SARS CoV-2 variants.

      In line with both, reviewer 1 and 3, conclusions drawn by the authors should be toned down. More data and improved data analysis and presentation are needed to foster the conclusions drawn .

      Significance

      While the topic as such is interesting and hoighly relevant, the manuscript has several major flaws, both with regard to paper organisation and content. In the current state it is hard to judge, whether the data are of significance.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In the submitted work, authors seek to understand the transcriptional and metabolic changes induced by different variants of SARS-CoV-2 infection. They employ a model of pooled, primary nasal epithelial cells (NEC) differentiated within an air-liquid-interface. Subsequently, cultures are infected with isolates representing key variants of SAR-CoV-2 from initial D614G, Alpha, Beta, Delta, and Omicron. Following initial characterization, authors compare transcriptional changes at 24 and 72 hours post infection. Analysis focuses on differentially expressed genes, upregulated Interferon Stimulated genes, and transcripts within known metabolic pathways. Subsequently, infected cultures are also analyzed by phosphoproteomic analysis to identify changes in cell signaling and measured for amino acid content. Throughout, changes in each profile are correlated with different variants of SARS-CoV-2, with Delta and Omicron revealing particular diametrically opposed changes. There are reasonable controls applied, including the use of IFNalpha treatment to "benchmark" ISG production. Overall, authors create a picture that Omicron infections do not suppress IFN signaling as efficiently as Delta variants and further exhibit limited hallmarks of cell stress and metabolic dysregulation.

      This is a remarkable study that attempts to cross-correlate multiple -omics analyses of cell responses to characterize differences in infection. It is very well written and the data is exemplary. I do have some concerns related to the placement and emphasis of interpretation in the results section that need to be revised. Beyond my stylistic concern, the interpretation of the experimental changes between variants are compromised by the failure to analyze the extent of infection within the NEC model. Using an MOI of 0.01 will produce a dramatically heterogeneous extent of infection at both 24 and 72 hours post infection that will also depend on the extent of viral transmission within the culture. The limited analysis of secreted E-gene detection is insufficient to overcome the inherent unequal comparison of cell responses between variants. There are ways to assuage, but not eliminate, this problem when it comes to comparing and interpreting experimental results. My concerns and suggestions are detailed in the concerns below.

      Major Concerns:

      1) Heterogeneous extent of infection. The MOI of 0.01 used to initiate infection is extraordinarily low for the types of analysis that is employed with the NEC culture. The interpretation of the data does not take into account that there will be infected and uninfected cells, of varying extents, making up the changes observed. Further, the variants likely have differing abilities to spread through the NEC culture, complicating both interpretation of changes and comparison between variants. At a minimum, authors need to evaluate the extent of SARS-CoV-2 infection through either flow cytometry or immunofluorescence analysis against viral protein(s). It is possible that Omicron, while secreted well, has more limited transmission allowing for more cells to mount an IFN response. Delta is a prolifically spreading virus that likely has more extensive infection at 72 hpi than the other variants. These statements are conjecture and highlight how such differences could alter the interpretation of the subsequent experiments.

      2) Further evaluation of IFNalpha treated cells. The paper emphasizes the ISG analysis, but the IFN treated cells should be included in the DEG and metabolic pathway analysis. IFN treatment is known to alter metabolic changes in cells, and it would be valuable to see those changes reflected in your analysis. Consider the evidence presented in the following:

      Fritsch SD, Weichhart T. Effects of Interferons and Viruses on Metabolism. Front Immunol. 2016 Dec 21;7:630.

      Heer CD, Sanderson DJ, Voth LS, Alhammad YMO, Schmidt MS, Trammell SAJ, Perlman S, Cohen MS, Fehr AR, Brenner C. Coronavirus infection and PARP expression dysregulate the NAD metabolome: An actionable component of innate immunity. J Biol Chem. Elsevier BV; 2020 Dec 25;295(52):17986-17996.

      Palmer CS. Innate metabolic responses against viral infections. Nat Metab. 2022 Oct;4(10):1245-1259

      Further, It is possible that the changes attributed to Omicron are quite similar to the effects of the IFN treatment, given the extensive ISG detection. The same is true for the phosphor-proteomic analysis and amino acid content. I also have concerns that using a treatment of IFNalpha that impacts all cells as a benchmark for heterogeneous infection is not truly comparable. How was the concentration of IFN chosen? What was the extent of IFN activation in the culture?

      3) Further correlation of transcriptional changes with metabolic changes - While many published works emphasize transcriptional changes as a proxy for metabolic changes, there are robust methods that can be applied to directly analyze metabolite content and changes in the context of viral infection. In particular these studies should be assessed and compared for the interpretation of the presented results:

      Kramaric, T., Thein, O.S., Parekh, D. et al. SARS-CoV2 variants differentially impact on the plasma metabolome. Metabolomics 21, 50 (2025).

      Loveday EK, Welhaven H, Erdogan AE, Hain KS, Domanico LF, Chang CB, June RK, Taylor MP. Starve a cold or feed a fever? Identifying cellular metabolic changes following infection and exposure to SARS-CoV-2. PLoS One 2025 Feb 12;20(2):e0305065.

      Irún P, Gracia R, Piazuelo E, Pardo J, Morte E, Paño JR, Boza J, Carrera-Lasfuentes P, Higuera GA, Lanas A. Serum lipid mediator profiles in COVID-19 patients and lung disease severity: a pilot study. Sci Rep. 2023 Apr 20;13(1):6497.

      Luke Whiley, Nathan G. Lawler, Annie Xu Zeng, Alex Lee, Sung-Tong Chin, Maider Bizkarguenaga, Chiara Bruzzone, Nieves Embade, Julien Wist, Elaine Holmes, Oscar Millet, Jeremy K. Nicholson, and Nicola Gray, "Cross-Validation of Metabolic Phenotypes in SARS-CoV-2 Infected Subpopulations Using Targeted Liquid Chromatography-Mass Spectrometry (LC-MS)", Journal of Proteome Research 2024 23 (4), 1313-1327

      4) Editing to limit interpretation within experimental results. I appreciate that this is a stylistic concern and it is an issue in the paper. Statements in the results are often over-reaching. Some examples include: Line 156 -"suggesting attenuated or delayed early sensing" - The Low MOI and time leaves these results open to various explanations. Better to just state and move on.

      Line 157 "Delta drove the most extensive" - drove has a lot of assumption. "produced" "resulted in " or something more passive is more appropriate

      Line 179 "pointing to sustained suppression of interferon responses." - sustained is a leading interpretation. Effective? Comprehensive? again, the MOI is complicating interpretations of global transcript changes.

      Line 186 "suggesting a weaker activation of interferon signaling" Too much leading interpretation here. You detect fewer ISGs that are differentially regulated. Could be for many reasons.

      Minor Concerns:

      1) Line 72 "has evolved unique strategies" Unique can be easily misconstrued to mean different mechanisms. More likely, it is a subtle balance between promotion of viral replication and suppression of IFN responses.

      2) Line 126 - 128 "NECs were derived from three commercially available donor pools". The following text doesn't make it clear that they are the same produce from different lots. The methods clarify somewhat, but should be clarified for transparency.

      3) Line 129 "Viral replication kinetics" Need to highlight that this is detection of secreted viral genomes. which is a proxy measure for replication and dissemination in the culture. Direct measurement of the extent of infection is not being made nor can be interpreted.

      4) Line 149 "Differentially expressed genes (DEGs)" What is the comparison group? The figure legend/design suggests that IFNa treatment. Is there a matched uninfected control for each timepoint as well? Later experiments specify the comparison group. Text should be clarified here for transparency.

      5) Line 224 and Figure 4B - I don't see the value of the "merged NES" values given these are only aggregate of the Pre-Omicron and Omicron species. If you had compared multiple D614G and Delta variants, then there would be utility.

      6) Line 261 "quantified at 24 hpi" Why this timepoint? Changes were minor and not representative to extensive infection.

      7) Line 268 "rather than variation in cell number." I appreciate the rigor and control of experimentation. And how many of those cells are infected? That is not controlled.

      8) Line 428-429 "direction of regulation" This seems like an over-interpretation of the data. You have performed pathway analysis based on the quantity of RNA transcription detected in sequencing then imputing an interpretation of regulation. Without pulse labeling of metabolic standards or kinetic analysis of metabolite quantity, it is difficult to assert regulatory direction.

      Referee cross-commenting

      I am in agreement with the comments and suggestions of Reviewer #2 and #3. In particular, the comment of Reviewer #3 to estimate viral replication from the RNASeq data is quite valuable to begin addressing some of the concerns about the extent of viral replication. It does not negate the need to further assess productive viral titer (PFU/mL) or the extent of viral infection (immunofluorescence or flow cytometry).

      I also agree with Reviewer #3 regarding the extent of mechanistic interpretation that can be drawn from the current study. This concern can largely be addressed through revision of the text and a tempering of the interpretations that are drawn.

      I also agree and appreciate the detailed analysis of reviewer #2 regarding the inconsistencies between the text and the figures. It is critically important to be consistent in the data and presentation of these complex experiments. Resolving these issues will only strengthen the work.

      Significance

      • The work detailed in this manuscript is takes a very broad approach to identify differences in the effects of SARS-CoV-2 variant infections. Elements of this work have been published, including transcriptomics, metabolomics, and phosphoproteomics. This work is significant in that multiple variants are evaluated with comparable methods in the very relevant human nasal epithelial cell model. The use of this model, and the direct integration of multiple -omics, sets this work apart from previously published studies. This cross-omic analysis, with the IFN-treated controls, provides a robust foundation of data that can be used to detail the differences in the response to the SARS-CoV-2 variant infections.

      • That said, a significant limitation to the study was the low MOI used to initiate infection and the lack of detailed analysis infection progression of the different variants. Further, there is limited comparison of the IFN-treatment condition in relation to the transcriptional changes, and no inclusion of IFN-controls in the other methods. Both of these limitations undercut the potential significance of the paper and its findings.

      • Audience: This work will have be important to bench researchers interested in further characterizing and comparing the effects of SARS-CoV-2 infection. Potentially, clinicians involved in diagnostics will find utility in the study of changes for potential biomarker analysis for severe COVID19 disease.

      • My expertise is the field of virology, having studying multiple RNA and DNA viruses, including SARS-CoV-2, to understand virus-cell interactions. My focus includes primary cell culture models of infection, proteomic and metabolic analysis of infection induced changes, and monitoring the spread of viral infection through direct and indirect measurements.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the Referees for their valuable input and critical review of our manuscript. Please find below our responses to the provided feedback.

      Referee # 1

      1.This is a well-done study. However, I ask to the Authors to include also figures showing the macroscopic in ovo evidence

      Response: Two photographs, each depicting an in situ xenograft tumour of the BxPC-3 and PANC-1 cell line have been added as a new figure (new Figure 1).

      and to discuss the role of the CAM assay in the study of xenografts.

      Response: In the introduction, we briefly explained in more detail the rationale behind the use of the CAM model for xenografting and pointed out an important limitation of the use of nucleoside labeling within this model.

      Referee # 2

      Major comments

      1.The manuscript's clinical relevance is limited, and methodological flaws prevent proper statistical validation of the in vitro findings.

      Response: We would like to clarify that the aim of this paper is to share some important preclinical scientific findings highly relevant for future translational studies. First, we demonstrate that cell cycle labelling can be performed in CAM model xenografts via the application of nucleoside analogues. Additionally, we provide evidence of a novel biological phenomenon that on itself could entail an additional important limitation to the use of nucleoside labelling in the CAM model. Additionally, we have only provided two images concerning in vitro culture of cell lines which serve as an illustration for the similarity to the in ovo growth pattern and thus are unsure what statistical validation is required for this aspect of the manuscript.

      2.The entire article is based on the assertion that "... " However, even here, the authors do not provide any evidence that these are erythrocytes and not some other cells.

      Response:

      We completely agree with the reviewer that not all chicken (embryonic) cells produce IgY. However, we would like to point out that the majority (if not all) IgY that is present within the developing embryo at the time of the CAM assay is derived originally from the hen and passed through to the embryo via the egg yolk in order to provide passive immunity. In fact, IgY production by embryo/chick cells is commonly observed only several days post hatching.1-3 Thus, IgY detection in chicken embryonic tissues remains species-specific. Though, it appears that large quantities of IgY are sequestered in the embryonic connective tissues. As such, this helps in segregation of 'chicken/stromal' pixels versus 'human/xenograft' pixels. In the revised manuscript, Figure 6 now includes quantitative data regarding the cellular origins with the different strategies described to substantiate our claims.

      Since application of this principle is of course not exclusively linked to FITC as a fluorophore, we performed additional immunofluorescent labelling on xenograft sections using Cy3-labelled Donkey anti-chicken IgY-antibodies as antibody staining to demonstrate the principle and reproducibility of using anti-IgY immunofluorescence. Additionally, using an anti-goat IgG-directed secondary antibody we demonstrate that, in the latter case, a completely different and non-specific staining pattern is obtained compared to that which is observed following anti chicken IgY labelling. Lastly, we added an extra image demonstrating the differences in staining pattern and intensity between xenograft cells and embryonic tissues and embryonic CAM epithelial cells. These images are provided as supplementary Figure S1.

      We have further nuanced the use of the secondary anti-chicken IgY labelling to the identification of chicken stroma rather than generalising the detection method to include specific labelling of all chicken cells. Though, in our experience, human cancer cells do not exhibit substantial levels of autofluorescence in the orange to far red spectrum within the xenografts. In contrast, some autofluorescence in the blue-green spectrum is observed for chicken tissues and some cells in non-immunolabelled sections of non-grafted embryos). Therefore, we reserve the spectral regions with the least amount of autofluorescence (i.e., orange-far red) for subsequent (IF)-labelling for xenograft cells in order to maintain sufficient specificity. In other words, the FITC-staining aids in determining whether a pixel is chicken in identity rather than human, in our tissue preparations.

      1. Dias da Silva, W. & Tambourgi, D. V. IgY: A promising antibody for use in immunodiagnostic and in immunotherapy. Vet. Immunol. Immunopathol. 135, 173-180 (2010).
      2. Ulmer-Franco, A. M. Transfer of Chicken Immunoglobulin Y (IgY) from the Hen to the Chick. Avian Biol. Res. 5, 81-87 (2012).
      3. Carlander, D., Wilhelmson, M. & Larsson, A. Immunoglobulin Y Levels in Egg Yolk From Three Chicken Genotypes. Food Agric. Immunol. 15, 35-40 (2003). Since application of this principle is of course not exclusively linked to FITC as a fluorophore, we performed additional immunofluorescent labelling on xenograft sections using Cy3-labelled Donkey anti-chicken IgY-antibodies as antibody staining to demonstrate the principle and reproducibility of using anti-IgY immunofluorescence. Additionally, using an anti-goat IgG-directed secondary antibody we demonstrate that, in the latter case, a completely different and non-specific staining pattern is obtained compared to that which is observed following anti chicken IgY labelling. Lastly, we added an extra image demonstrating the differences in staining pattern and intensity between xenograft cells and embryonic tissues and embryonic CAM epithelial cells. These images are provided as supplementary figure S1.

      We have further nuanced the use of the secondary anti-chicken IgY labelling to the identification of chicken stroma rather than generalising the detection method to include specific labelling of all chicken cells. Though, in our experience, human cancer cells do not exhibit substantial levels of autofluorescence in the orange to far red spectrum within the xenografts. In contrast, some autofluorescence in the blue-green spectrum is observed for chicken tissues and some cells in non-immunolabelled sections of non-grafted embryos). Therefore, we reserve the spectral regions with the least amount of autofluorescence (i.e., orange-far red) for subsequent (IF)-labelling for xenograft cells in order to maintain sufficient specificity. In other words, the FITC-staining aids in determining whether a pixel is chicken in identity rather than human, in our tissue preparations.

      3.The latter is actually confirmed by the authors in Fig. 7 in the form of the following sentence: "Highly autofluorescent (nucleated) embryonic erythrocytes can be observed throughout the tissue (arrowheads)." However, even here, the authors do not provide any evidence that these are erythrocytes and not some other cells.

      Response: We agree with the reviewer that the use of the green emission spectra in fluorescence should be used with caution, especially when evaluating tissue samples. In fact, this is the reason why we reserved the higher wavelength channels for anti-human fluorescent detection or the (click-based) detection of nucleosides.

      Indeed, we did not present full identification of these cells, but their avian origin is undisputed as these cells are also observed in the same quantity in tissues on non-grafted embryos. However, we now adapted the wording in the manuscript to the more general term 'chicken blood cells'. Throughout the manuscript and discussion section, we now elaborate on the possibility that nucleated cells, observed within blood vessels of the CAM can be erythrocytes, leukocytes or thrombocytes (which also are nucleated in avians).

      4. Tissue-specific and species-specific monoclonal antibodies to avian red cell nuclear proteins. Proceedings of the National Academy of Sciences of the United States of America, 79(20), 6265-6269. https://doi.org/10.1073/pnas.79.20.6265], and I recommend the authors use these instead of the fluorescein-labeled donkey anti-chicken IgY antibodies, which were misused. On the same matter, the article doesn't clarify if the antibodies for Ki67 and cyclin B1 can differentiate between human and chicken antigens.

      Response: Other groups have also applied this anti Ki-67 primary antibody in CAM-model studies without clear evidence of cross-reactivity to chicken embryonic nuclei.4,5 We have not observed ourselves any substantial species cross-reactivity for both the applied primary anti-Ki-67 antibody nor the primary anti-cyclin B1 which is also illustrated by the images provided within the manuscript.

      Jarrosson, L. et al. An avian embryo patient-derived xenograft model for preclinical studies of human breast cancers. iScience 24, 103423 (2021). Javed, S., Soukhtehzari, S., Fernandes, N. & Williams, K. C. Longitudinal bioluminescence imaging to monitor breast tumor growth and treatment response using the chick chorioallantoic membrane model. Sci. Rep. 12, 17192 (2022).

      5. Why is there no staining with antibodies to human antigens for tumor cell identification in Figures 1-2?

      Response: The aim of this figure was to demonstrate the highly pleiotropic nuclear morphology of the PANC-1 cell line compared to BxPC-3 cells while growing in vitro monocultures in turn comparing the growth pattern to that observed in.

      6. What specific markers in Fig. 4 should lead the reader to conclude that the nuclei pointed out by the arrows represent embryonic epithelial nuclei (EE), nuclei of chicken embryonic blood cells (EB), and human AsPC-1 tumor xenograft nuclei (T)?

      Response: The identification of different nuclei within these images was performed on the basis of their morphological aspects (i.e. irregular shape, larger size...) in combination with the relative localisation of these nuclei within the tissue section and the typical nucleolar staining pattern. The latter is not observed in any of the human xenografts we routinely perform. Nevertheless, we agree that accurate discrimination between embryonic epithelial nuclei and embryonic blood cells cannot be guaranteed with absolute certainty in the absence of the use of additional markers. However, the morphological aspects combined with the anti-Ki-67 staining, strongly suggest the human identity of the annotated nuclei. We have modified Figure 4 (now Figure 3) to include Anti-Ki67 staining. Additionally, an annotation that was erroneously pointing out a mitotic figure was omitted. For the remaining mitotic nuclei, clear perichromosomal localisation of Ki-67 is observed which supports the claim that these are mitotic nuclei.

      7. The nuclei shown in this same figure, which supposedly display mitotic figures of dividing tumor nuclei and can also be clearly distinguished (M), actually more closely resemble giant multinucleated cells.

      Response:

      We have added the immunofluorescent staining for Ki-67 of these images (new Figure 3); these nuclei present with different patterns for the marker, which would be highly unlikely in a single multinucleated cell. Further, we have altered the description of embryonic epithelial cells to the more general term embryonic cells (E). Additionally, annotation of embryonic cells was omitted from the lower panels in order to draw the focus on the polymorphic nature of the human tumour nuclei rather than the embryonic surrounding cells.

      8. The statement "After immunolabeling for human Ki67, we confirmed that all EdU+ xenograft nuclei were also Ki67+, confirming the specificity and compatibility of both labeling strategies in CAM xenograft tumor cells (Figure 5)" is not supported by the image, which shows that the majority of Ki67-positive nuclei are EdU-negative.

      Response:

      During cell cycle progression, nuclear levels of Ki-67 gradually increase during S-phase and peak during mitosis. As a consequence, all S-phase cells (EdU+ nuclei) will present detectable nuclear levels of Ki-67. The reverse is not necessarily true: the nuclei of cells that are within G2 and mitosis during the nucleoside labelling will present with high levels of nuclear Ki-67 but will not show nuclear incorporation for EdU as these have already completed replication during the S-phase. Figure 4 now shows quantification of both markers across 7 different tumour BxPC-3 xenografts. With the applied classification strategy, 94% of the detected EdU+ nuclei were also Ki-67+. Irrespective of Ki-67 positivity, 44% of BxPC-3 nuclei were calculated to be EdU+ for the labelling duration of 1 hour.

      9. The caption for Figure 6 needs to be revised, particularly the statement "Combined, these markers allow in ovo segregation of proliferating tumor cells into early S-phase (ES, EdU+), late S or early G2 (LS, EdU+CB1+), and G2 (EdU-CB1+)".

      Response: We have altered the phrasing to stress the cytoplasmic presence of cyclin B1 as indicative for late S-phase and G2 phase.

      1. What do the pink cell nuclei in Figures 8 and 9 represent?

      Response: The orange immunofluorescent staining in these images demonstrates anti-Ki-67 labelling. Due to overlap with the nuclear (blue) staining the appearance may have a pink undertone but this does not alter the interpretability of the images.

      1. In Fig. 10, it is impossible to see the yellow line, which, according to the authors' statement ". yellow lines indicate the area classified as tumor", should indicate the tumor origin of the cells!

      Response: In the submitted PDF version of the manuscript, a yellow line is clearly visible in panels C and F. Perhaps due to compression-related quality loss, its presence is more difficult to see. We now submitted a higher resolution image.

      1. There is a distinct lack of evidence in Figures 11A-C suggesting that the S-phase nuclei are attributed to both embryonic liver epithelial cells (ES) and BxPC-3 cells (BS). Furthermore, there is no evidence that the apparent cytoplasmic EdU inclusions (arrowheads) belong specifically to chicken embryonic cells.

      Response:

      Concerning the identity of the cells in Figure 11 A-B (Now Figure 7). We would like to clarify that these images are taken from embryos which had not been subjected to tumour grafting. Therefore, the presence of any human tumour cells within these liver sections can 100% be excluded. This has now also been stressed in the text and the image caption. With respect to Figure 11C (now 7C), we have noticed that this image contained a tissue processing artefact which may lead some readers to question its authenticity. Therefore, we have replaced this panel (7C, now 7C) with another image taken from the same tissue section and included the anti-IgY staining channel to allow identification of CAM tissue.

      We have addressed the confusion regarding the apparent different magnifications in the figure legend: all images presented in Figure 7 were acquired using a 40x magnification objective. In order to focus on some select regions, different digital zoom levels are present for each panel. To account for this, each panel is now annotated with its own scale bar.

      1. As no macroscopic images depicting tumor nodules from the implantation of AsPC-1 and PANC-1 tumor cells into the CAM were provided.

      Response: A new figure (now Figure 1) has been added with macroscopic images of PANC-1 and BxPC-3 tumours in situ at ED14. Successful tumour grafting of AsPC-1 cells was demonstrated via the histological images provided throughout the manuscript.

      1. A newly identified biological phenomenon: non-nuclear EdU accumulation in chicken embryonic cells," is not well supported unless compelling evidence is presented to establish that these cells indeed belong to chicken embryos

      Response: We have clarified in the body of the text as well as in various figure legends that the apparent nucleoside presence within the cytoplasm of cells has been consistently observed also in liver sections taken from embryos that have not received tumour grafts.

      15. Lacking any demonstration of concurrent EdU accumulation alongside cytoplasmic and/or membrane staining within the same cells. The latter is quite feasible by staining the cells with a suitable agent (or fluorophore), such as...

      Response: We agree with the reviewer that we have not performed dedicated staining for cytoplasmic or membrane components in order to demonstrate the colocalisation of the EdU signal with the . Though we believe that the simultaneously acquired brightfield images are sufficiently convincing that these signals localise to the cytoplasm rather than to the extracellular space. Additional experiments were performed with F-ara-EdU and BrdU/IdU labelling in non-grafted embryos which demonstrated the robustness of these findings and further point towards cytoplasmic rather than extracellular signal. Multiple new figures and paragraphs were added into the revised manuscript which support our claim.

      1. Third, regarding "Extranuclear EdU staining": Extranuclear EdU staining is not a standard or typical...

      Response: We agree that this is not the intended use of nucleoside incorporation assays (including halogenated analogues). With this manuscript we aimed to illustrate the unexpected findings that could lead to misinterpretation of experimental data as a consequence of a new biological phenomenon.

      16.1 Cellular damage or death: Cells that are dying or have been damaged may release their DNA, causing it to be detected outside the nucleus.

      Response: The cells with apparent extranuclear (F-ara-)EdU, and BrdU and IdU) show no signs of nuclear pyknosis nor DNA fragmentation. In addition, we also describe this phenomenon in healthy developing embryos at various stages of development, in livers that show no macroscopic signs of tissue damage. Moreover, if the extranuclear EdU detection does signify degraded DNA (with the incorporated analogue), the remaining nuclear DNA would also show signs of (replication-dependent) EdU incorporation.

      16.2 Apoptosis: During apoptosis, cells undergo DNA degradation, and some DNA fragments may be found in the extranuclear space

      Response: We would like to refer the reviewer to the reply above. It seems unlikely that from ED14 onwards such massive apoptotic evens would be taking place in healthy embryos. Moreover, nucleoside incorporation and (caspase-dependent) DNA fragment generation would need to take place within one hour (= a typical nucleoside labelling period).

      16.3 Technical artifacts: Errors during cutting of formalin-fixed tissues, cell fixation, permeabilization, or the staining procedure itself can lead to mislocalization of the fluorescent signal.

      Response: We agree that several technical reasons can lead to altered subcellular localisation during the detection of some (protein) markers. Iduring detection.6,7 In our experiments, tissues were fixed in ice-cold 4% formalin for at least 12 hours. Therefore, it is highly unlikely that insufficient fixation would have caused this phenomenon.

      Regarding the possibility that the staining procedure itself can lead to mislocalisation, it is important to note that the copper-catalysed click detection of EdU is not susceptible to many of the possible causes for altered epitope detection through standard immuno-labelling. We can also not conceive how the click-reaction, performed on formalin-fixed tissues, could induce a complete shift in EdU from the nuclear to the extranuclear compartment of these select cells only without any other alterations in the nuclear or cellular morphology. We have performed several control staining procedures, such as performing the click reaction on tissue sections of embryos that had not received any alkyne-containing nucleoside labelling. This did not result in the detection of any signal. These results are referred to in the manuscript.

      Lastly, the simultaneous presence of correctly localised (nuclear) (F-ara-)EdU and 'incorrectly' localised (F-ara-)EdU within the same tissue section, and across different tissue types (liver versus CAM xenograft) demonstrates that it is unlikely that the observed phenomenon is the result of a technical artifact.

      Yoshida, S. R., Maity, B. K. & Chong, S. Visualizing Protein Localizations in Fixed Cells: Caveats and the Underlying Mechanisms. J. Phys. Chem. B 127, 4165-4173 (2023). Stadler, C., Skogs, M., Brismar, H., Uhlén, M. & Lundberg, E. A single fixation protocol for proteome-wide immunofluorescence localization studies. J. Proteomics 73, 1067-1078 (2010).

      16.4 Unique cell types: In some specific cell types, such as megakaryocytes, DNA or other nuclear components may be localized extranuclearly, leading to extranuclear staining with techniques like EdU.

      Response: Referring to the paper of Lan et al. (2019; doi: 10.1111/acel.12901) to which the reviewer alluded by citing the following statement "Nicked DNA was strongly visible in old cells, prominently in the cytosol, but undetectable in young cells, and was more intense in old cells upon" , we would like to stress that these authors investigated the presence of dsDNA in various human cell lines in vitro through OTHER assays than nucleoside labelling. Additionally, in the cited paper, also no specific staining for cytoplasm or cell membranes was performed to accurately segment the nucleus versus cytoplasm. Lastly, the intense focal-like nucleoside signal we observe within our tissue sections does not resemble the ambiguous and apparently random signal localisation within the cytoplasm of the cells presented in the cited paper. Lastly, the reviewer seems not to be aware of the fact that megakaryocytes dot NOT occur in avian species.

      16.5 Extranuclear accumulation of histones and nucleosomes is an early event of apoptosis in human lymphoblasts (https://doi.org/10.1136/ard.2003.011452)

      Response: The referred paper investigated cellular processes during cell death (apoptosis). Unfortunately, it does not present microscopic images to compare our data to; we therefore cannot assess its relevance to our findings.

      16.6 In some specific cell types, such as megakaryocytes, DNA or other nuclear components may be localized extranuclearly, leading to extranuclear staining with techniques like EdU [Frydman, G.H., Tessier, S.N., Wong, K.H.K. et al. Megakaryocytes

      Response: As already mentioned above, to our knowledge, megakaryocytes have NOT been described or identified in avian species so far. Therefore, we disagree that the cited phenomenon/reference would be highly relevant for nucleoside accumulation in the cytoplasm of non-megakaryocyte cells in in ovo model systems. Furthermore, the paper by Frydman et al., does not describe the use of nucleoside labelling in these cells.

      16.7 However, since the authors do not present any absolute markers proving that EdU-cyto+ cells are EdU+ chicken granulocytes, the authors' statement that "Given the high number of EdU+ granulocytes observed, it is more likely that these are neutrophils rather than eosinophils" appears highly speculative.

      Response: We agree with the reviewer that our claim that EdU-cyto+ cells represent granulocytes is still speculative. However, concerning the exact wording; brightfield images did show colocalisation of the nucleoside signal within the cytoplasm of granule-containing chicken cells. We have emphasized more clearly within the discussion section that the extranuclear nucleoside signal is highly unlikely to be nucleoside-containing DNA. Additional experiments were conducted to investigate whether these cells represent chicken thrombocytes, known to possess phagocytotic functionality. IF staining for CD41/CD61 as a marker for thrombocytes revealed that CD41/CD61-positive cells do not exhibit the alluded phenomenon. A new figure, Figure 10 illustrates this finding.

      Methodological ambiguities

      1. In the section "Nucleoside labeling of dividing cells in the CAM model": It is not clear when BrdU and EdU were given after tumor cell implantation, or how to standardize the distance from the tumor as suggested by "...as far away from the visible tumor as possible...". I question whether such a "precise" description of the application site would contribute "...to ensure low variability in labeling duration...", especially when it was performed by two independent researchers "...in tandem, ensuring that an experimental group of 16 embryos could be labeled in less than five minutes."

      Response:

      In this manuscript, we have described nucleoside labelling in grafted embryos as well as in non-grafted embryos. Alongside depictions of nucleoside labelling, the embryonic day at which the assay was performed is mentioned in the figure legend of all figures demonstrating nucleoside labelling. For example, BrdU labelling of the BxPC-3 xenograft presented in Figure 2 (revised manuscript) was performed at ED13 (which is 5 days following the grafting procedure at ED7). In the revised manuscript, we have ensured that for each labelling depicted, its timing in the developmental period is explicitly mentioned. Nucleoside labelling in non-grafted embryos was performed throughout various stages of development; the timing of the assay is now also mentioned in the corresponding figure legends.

      We fully agree that there is still room for improvement concerning the standardisation of the application of the labelling solution onto the CAM. The main aim of the present manuscript, however, is to demonstrate the feasibility of this labeling protocol. In addition, we report the unexpected but important finding of cytoplasmic accumulation due to a possible biological cause. In this respect we rephrased the text and pointed to a potential pitfall of the application in the CAM model so that future studies can anticipate to misinterpretations.

      1. In the section "Immunofluorescence labeling of cryosections: "...a blocking and permeabilizing solution in a humidified atmosphere."" ... : Absence of detergent is used during incubation with primary and secondary antibodies (even the widely recommended Tween20!). I wonder whether it might be the source of non-specific tissue labelling by donkey anti-chicken IgY that was considered as an evidence of chicken origin.

      Response: We interpreted this as that the reviewer concluded that we applied our primary and secondary antibody staining procedures in the presence of Triton X-100. We can assure that following the initial blocking incubation step and three rinses with PBS, all subsequent antibody labelling procedures (primary or secondary) are performed in the absence of detergent within the staining buffer. This was mentioned in the second paragraph under the heading "__Immunofluorescence __labelling of cryosections" __in the Materials & Methods section.

      Referee #3

      Major comments

      1. The current manuscript lacked figure panels to show the quantification results of the labeling to convincedly demonstrate the positive correlation between the nucleoside labeling versus Ki67. Bar charts with statistical analysis should be included. The study utilized 2 human cancer cell lines as an example. T

      Response: We have added quantification data of EdU and Ki-67 across several BxPC-3 xenografts and added statistical analysis in Figure 4E.

      1. The study would be improved if the authors could include cancer cell lines that are known to be highly proliferative vs slowly proliferative to demonstrate the robustness of this method.

      Response: We do not expect failure of nucleoside labelling due to tumour cell-line dependent characteristics for several reasons. For instance, when performing EdU labelling in vitro, initial dose-and duration titration are recommended to establish satisfactory labelling efficiency for the research hypothesis in question. In our research, we are predominantly focusing on relatively short-lived effects of treatments on S-phase progression and therefore prefer to apply labelling (in vitro and in ovo) for durations of less than 60 minutes. In contrast, extended labelling durations of several hours will allow one to capture replication even in very slowly proliferating (cancer) cells at the cost of temporal resolution. Importantly, the growth rates of individual cell lines within the CAM model may differ from their behaviour in vitro. Additionally, long duration nucleoside (EdU) labelling may induce additional toxicity. Alternative analogues such as F-ara-EdU with improved safety profiles for long labelling durations should then be considered. In the revised manuscript, labelling with BrdU/IdU as well as F-ara-EdU of non-grafted chicken tissues were added which demonstrate the robustness of the application of the technique in the CAM model. Longer-term labelling of lowly proliferative cells can be achieved with F-ara-EdU to minimise embryo (and tumour cell) toxicity which will extend the application potential of the described .

      In summary, labelling parameters should be optimised for each specific research question, but nucleoside incorporation remains a gold standard for identifying S-phase progression across diverse cell types. Given that we successfully established its feasibility in the CAM model, there is no biological reason to suggest it would not be equally valid for any proliferating cancer cell line.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study explored the utilization of nucleoside labelling in the human cancer cell xenografts grown in the chick CAM. The authors tried 2 different reagents (BrdU and EdU) which would label S-phase proliferative cells in a species non-specific way. The authors concluded that EdU labeling reliably detected Ki67 positive cells compared to BrdU labeling which would detect non-Ki67 positive cells as well (might be due to dsDNA denaturing). In order to distinguish the human versus the chick cells, the authors further utilized chicken-specific antibody anti-IgY and the segmentation algorithm in QPath to distinguish cells of the two species. This allows the authors to develop (supervised) automated or manual annotation for individual cell detection, in this case the proliferating Ki67+ human cancer cells. The study also showed an unexpected finding of cytoplasmic positive EdU cells in the embryonic chicken liver, which the authors speculated to be non-antigen presenting granulocytes.

      Major comments:

      This study provided a good methodology path to analyze and to quantify proliferating human cancer cells inside the CAM xenograft. The current manuscript lacked figure panels to show the quantification results of the labeling to convincedly demonstrate the positive correlation between the nucleoside labeling versus Ki67. Bar charts with statistical analysis should be included. The study utilized 2 human cancer cell lines as an example. The study would be improved if the authors could include cancer cell lines that are known to be highly proliferative vs slowly proliferative to demonstrate the robustness of this method.

      Minor comments:

      The authors used parentheses for the subheadings inside the discussion section. This is not an usual practice unless it is required by the journal formatting requirement.

      Significance

      General assessment: The strength of this study is to develop a straight forward solution to detect human-specific proliferative cancer cells inside the chicken CAM xenograft. The data presented in the IF staining and segmentation results were clear. The limitation is that the study only tested 2 human cancer cell lines and it is unknown whether the current method is robust enough in a pan-cancer setting.

      Advance: This study showed good advances in the methodology for the CAM xenograft field.

      Audience: The audience of this study would include researchers in cancer biology, imaging processing, and 3R.

      My expertise: cancer biology, CAM, systems and spatial biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      An intriguing article, "Application of Nucleoside Analogue Labelling to Study the Cell Cycle of Xenografted PDAC Cell Lines in the Chorioallantoic Membrane Model," explores the overlooked potential of the chorioallantoic membrane (CAM) model as an innovative alternative in the realm of cell cycle research related to tumor xenografts. The authors were resolute in their pursuit of evidence to bolster their fascinating hypothesis, which posits that exposure to nucleosides enhances the labelling of nucleosides with 5-ethynyl-2'-deoxyuridine (EdU). This improvement allows effective multiplexing with cell-cycle markers like cyclin B1 and Ki67, especially when combined with advanced digital image analysis. They claimed that accurately separating human and chicken cells allowed them to identify a specific type of chicken embryonic cell with a high level of extranuclear EdU accumulation. Researchers propose that cells without chicken MHC II are likely to be non-proliferating granulocytes found in the embryonic liver of both grafted and non-grafted embryos, including xenograft tissues. According to the authors, the CAM xenograft model effectively helps in studying tumor cell cycles in live conditions. Behind, authors claim a novel biological phenomenon, namely that of extranuclear nucleoside accumulation in certain chicken embryonic cells. If the authors successfully prove their hypothesis, it will significantly confirm CAM as a unique in vivo tool for better immuno-/histological exams and more precise cell cycle assessments compared to standard rodent models. The research encourages greater use of this alternative animal model in cancer studies to help us understand the regulation of the cancer cell cycle. This, in turn, may improve the implementation of existing treatment methods or uncover potential vulnerabilities in the cancer cell cycle.

      Unfortunately, in this article, the authors do not provide sufficiently convincing evidence of the hypothesis postulated by them. The manuscript's clinical relevance is limited, and methodological flaws prevent proper statistical validation of the in vitro findings. My questions and remarks below are designed to find undeniable proof or opposing views that back the authors' key perspective.

      Major - Conceptual:

      Point 1 - In Result section: The entire article is based on the assertion that "... distinguishing between both species can already be achieved to a large extent in a relatively simple manner through the use of anti-chicken IgY fluorophore-conjugated antibodies, which are routinely used only as secondary antibodies. In doing so, cellular as well as acellular chicken embryonic components are stained (Figure 7)." This assumption seems off, as anti-chicken IgY staining is employed in research to detect chicken antibodies (IgY). This secondary antibody is designed to specifically bind to the IgY immunoglobulin in chickens, targeting both heavy and light chains. A donkey anti-chicken IgY antibody labels chicken tissues to identify cells that express chicken IgY immunoglobulins. It's clear that not every cell in a chicken embryo tissues produces IgY immunoglobulins. Hence, cells stained by donkey anti-chicken IgY FITC-conjugated antibody cannot represent all chicken tissue cells. Lastly, the use of the green fluorescence channel is often unfounded, considering that there are always cells with non-specific and considerably high fluorescence levels in this area of the spectrum. The latter is actually confirmed by the authors in Fig. 7 in the form of the following sentence: "Highly autofluorescent (nucleated) embryonic erythrocytes can be observed throughout the tissue (arrowheads)." However, even here, the authors do not provide any evidence that these are erythrocytes and not some other cells.

      Point 2 - In their article, the authors consistently emphasize how their method excels in differentiating human tumor cells from the abundant chicken embryonic cells that encase the tumor. Nonetheless, the authors fail to present direct evidence regarding the identity of the nuclei or cells in question, nor do they substantiate the validity of the algorithm selected for processing digital images of fluorescently labeled cells, which would be essential for discerning their origin, whether chicken or human. Given the importance of the authors' subsequent conclusions, such evidence should be provided, at least for initial validation. Antibodies for chicken nuclear antigens are well established [e.g., Kane, C. M., Cheng, P. F., Burch, J. B., & Weintraub, H. (1982). Tissue-specific and species-specific monoclonal antibodies to avian red cell nuclear proteins. Proceedings of the National Academy of Sciences of the United States of America, 79(20), 6265-6269. https://doi.org/10.1073/pnas.79.20.6265], and I recommend the authors use these instead of the fluorescein-labeled donkey anti-chicken IgY antibodies, which were misused. On the same matter, the article doesn't clarify if the antibodies for Ki67 and cyclin B1 can differentiate between human and chicken antigens. Why is there no staining with antibodies to human antigens for tumor cell identification in Figures 1-2? What specific markers in Fig. 4 should lead the reader to conclude that the nuclei pointed out by the arrows represent embryonic epithelial nuclei (EE), nuclei of chicken embryonic blood cells (EB), and human AsPC-1 tumor xenograft nuclei (T)? The nuclei shown in this same figure, which supposedly display mitotic figures of dividing tumor nuclei and can also be clearly distinguished (M), actually more closely resemble giant multinucleated cells. However, their affiliation with either chicken or human tumor cells is neither obvious nor proven. The statement "After immunolabeling for human Ki67, we confirmed that all EdU+ xenograft nuclei were also Ki67+, confirming the specificity and compatibility of both labeling strategies in CAM xenograft tumor cells (Figure 5)" is not supported by the image, which shows that the majority of Ki67-positive nuclei are EdU-negative. The caption for Figure 6 needs to be revised, particularly the statement "Combined, these markers allow in ovo segregation of proliferating tumor cells into early S-phase (ES, EdU+), late S or early G2 (LS, EdU+CB1+), and G2 (EdU-CB1+)". The authors' method for labeling cells should consider that the location of cyclin B1 is key to determining a cell's stage in the cell cycle: o Cytoplasmic: Associated with G2/M arrest. o Nuclear: Associated with the transition into mitosis and the G2 to M phase. The assertion that "Species-distinction can be further facilitated by combining anti-chicken IgY IF (Figure 8C) with additional anti-human IF, such as anti-human Ki67 (Figure 8D)" lacks clarity, as the images in Figure 8 do not support the authors' claim. What do the pink cell nuclei in Figures 8 and 9 represent? In Fig. 10, it is impossible to see the yellow line, which, according to the authors' statement "... yellow lines indicate the area classified as tumor", should indicate the tumor origin of the cells! There is a distinct lack of evidence in Figures 11A-C suggesting that the S-phase nuclei are attributed to both embryonic liver epithelial cells (ES) and BxPC-3 cells (BS). Furthermore, there is no evidence that the apparent cytoplasmic EdU inclusions (arrowheads) belong specifically to chicken embryonic cells. Furthermore, in Fig. 11 B and C, the equal magnification of 40x is apparently incorrectly written.

      Discussion section:

      The initial assertion made by the authors, "In this paper, we have demonstrated that our previously published protocol concerning the xenografting of the BxPC-3 cell line10 can be applied to the AsPC-1 and PANC-1 cell lines," seems inadequately supported, as no macroscopic images depicting tumor nodules from the implantation of AsPC-1 and PANC-1 tumor cells into the CAM were provided.

      The authors' second assertion, "A newly identified biological phenomenon: non-nuclear EdU accumulation in chicken embryonic cells," is not well supported unless compelling evidence is presented to establish that these cells indeed belong to chicken embryos. Moreover, the assertion regarding non-nuclear EdU accumulation seems to be speculative, lacking any demonstration of concurrent EdU accumulation alongside cytoplasmic and/or membrane staining within the same cells. The latter is quite feasible by staining the cells with a suitable agent (or fluorophore), such as fluorophore-conjugated Phalloidin for the cytoskeleton or PKH26 (https://www.sigmaaldrich.com/RU/en/product/sigma/pkh26gl?srsltid=AfmBOorEbKBTeYSCKZY6qs-pWjCZCg4lhOvNqE0YYByS2A545f-POa24) for membrane structures.

      Third, regarding "Extranuclear EdU staining": Extranuclear EdU staining is not a standard or typical application for the EdU (5-ethynyl-2′-deoxyuridine) assay, which is designed to label and detect newly synthesized DNA in the nucleus during the S-phase of the cell cycle. Extranuclear EdU staining likely refers to a misinterpretation or unusual experimental result where EdU or its detection reaction product is found outside the nucleus. This could be due to cell damage, processing issues, tissue cutting artefacts or a cell type with unique DNA localization, as seen in some contexts of apoptosis or extranuclear DNA accumulation. Potential reasons for extranuclear EdU staining may include, but not limited to:

      • Cellular damage or death: Cells that are dying or have been damaged may release their DNA, causing it to be detected outside the nucleus. • Apoptosis: During apoptosis, cells undergo DNA degradation, and some DNA fragments may be found in the extranuclear space.
      • Technical artifacts: Errors during cutting of formalin-fixed tissues, cell fixation, permeabilization, or the staining procedure itself can lead to mislocalization of the fluorescent signal.
      • Unique cell types: In some specific cell types, such as megakaryocytes, DNA or other nuclear components may be localized extranuclearly, leading to extranuclear staining with techniques like EdU. Several examples:
      • Nicked DNA was strongly visible in old cells, prominently in the cytosol, but undetectable in young cells, and was more intense in old cells upon induction of DNA damage by the DNA damaging agent cytarabine/Ara‐C which causes DSBs [Lan YY, Heather JM, Eisenhaure T, Garris CS, Lieb D, Raychowdhury R, Hacohen N. Extranuclear DNA accumulates in aged cells and contributes to senescence and inflammation. Aging Cell. 2019 Apr;18(2):e12901. doi: 10.1111/acel.12901].
      • Extranuclear accumulation of histones and nucleosomes is an early event of apoptosis in human lymphoblasts. [Gabler, C., Blank, N., Hieronymus, T., Schiller, M., Berden, J. H., Kalden, J. R., & Lorenz, H. M. (2004). Extranuclear detection of histones and nucleosomes in activated human lymphoblasts as an early event in apoptosis. Annals of the rheumatic diseases, 63(9), 1135-1144. https://doi.org/10.1136/ard.2003.011452]
      • In some specific cell types, such as megakaryocytes, DNA or other nuclear components may be localized extranuclearly, leading to extranuclear staining with techniques like EdU [Frydman, G.H., Tessier, S.N., Wong, K.H.K. et al. Megakaryocytes contain extranuclear histones and may be a source of platelet-associated histones during sepsis. Sci Rep 10, 4621 (2020). https://doi.org/10.1038/s41598-020-61309-3].

      I would, if I could, strongly concur with the statements made by the authors: "It is also evident that the presence of EdU-cyto+ cells is not confined to the liver, as they were also found within the CAM. These cells likely represent a ubiquitously distributed cell population present in both healthy and xenografted chicken embryos," and "...since EdU-cyto+ cells are consistently present in non-grafted embryos as well, the phenomenon is not triggered by xenografting or the presence of PDAC cells." However, since the authors do not present any absolute markers proving that EdU-cyto+ cells are EdU+ chicken granulocytes, the authors' statement that "Given the high number of EdU+ granulocytes observed, it is more likely that these are neutrophils rather than eosinophils" appears highly speculative.

      Fourth, regarding Ki67 staining: Careful consideration and robust validation are essential when drawing conclusions and interpretations about Ki67's role in cell proliferation and the cell cycle. Research shows that Ki-67, an important cell cycle marker, has two main splice variants, α and β, which are regulated differently in normal and cancer cells at mRNA and protein levels. Moreover, Ki-67 undergoes constant regulation and degradation through the proteasome system in both cell types, indicating a dynamic control mechanism for this protein. It was also observed a putative extranuclear elimination pathway of Ki-67, where it is transported to the Golgi apparatus. Furthermore, the unforeseen extranuclear removal of Ki-67 strongly indicates the necessity to examine this protein beyond the confines of the "nuclear box," a perspective that has been overlooked until now [see e.g., Chierico L, Rizzello L, Guan L, Joseph AS, Lewis A, Battaglia G (2017) The role of the two splice variants and extranuclear pathway on Ki-67 regulation in non-cancer and cancer cells. PLoS ONE 12(2): e0171815. https://doi.org/10.1371/journal.pone.0171815].

      Last but not least, regarding the authors' conclusion: "We report a novel phenomenon: the apparent cytoplasmic accumulation of EdU in nondividing chicken granulocytes." The phenomenon of DNA replication taking place in the cytoplasm is not a novel observation. In instances like replication stress, the cytosol can initiate a response pathway that encompasses the detection of cytosolic DNA and subsequent signaling processes focused on genome protection. Cytosolic DNA generated after replication stress activates a Ca2+-dependent pathway to protect stalled replication forks [Li, S., Lu, H. T., & You, Z. (2025). Cytosolic DNA and intracellular Ca2+: Maintaining genome stability during replication stress. DNA repair, 152, 103877. https://doi.org/10.1016/j.dnarep.2025.103877]. Damage to the DNA template caused by environmental pollutants, like radiation and genotoxic chemicals, can impede the replication process.Physiological stressors also affect fork dynamics, including metabolic byproducts like reactive oxygen species, conflicts in replication and transcription, repetitive DNA elements such as telomeres, sequences that form secondary structures, DNA-RNA hybrids, misincorporated ribonucleotides, and low availability of DNA precursors. In response to these challenges, cells have developed an intricate network of surveillance and repair mechanisms. They identify replication stress, support stalled forks, fix problems, and allow replication to proceed. These pathways are crucial for maintaining genome stability and ensuring proper cellular function. Several excellent reviews on the topic included [M.R. Higgs BOD1L is required to suppress deleterious resection of stressed replication forks Mol. Cell (2015) R. Kumar et al. RIF1: a novel regulatory factor for DNA replication and DNA damage response signaling DNA Repair(2014) W. Leung ATR protects ongoing and newly assembled DNA replication forks through distinct mechanisms Cell Rep.(2023) M.B. Adolph et al. Mechanisms and regulation of replication fork reversal DNA Repair (Amst. )(2024) Z. You et al. The role of single-stranded DNA and polymerase alpha in establishing the ATR, Hus1 DNA replication checkpoint J. Biol. Chem.(2002) A. Kumagai TopBP1 activates the ATR-ATRIP complex Cell(2006) J. Lee et al. The Rad9-Hus1-Rad1 checkpoint clamp regulates interaction of TopBP1 with ATR J. Biol. Chem.(2007)]. To validate the authors' conclusions regarding the presence or absence of the declared phenomenon, I would recommend modulating TREX1 expression. Overexpression of TREX1, a nuclease that degrades cytosolic DNA, suppresses TRPV2-mediated Ca2+ release under replication stress. TREX1 depletion, however, leads to cytosolic DNA accumulation.

      Methodological ambiguities: In the section "Nucleoside labeling of dividing cells in the CAM model": It is not clear when BrdU and EdU were given after tumor cell implantation, or how to standardize the distance from the tumor as suggested by "...as far away from the visible tumor as possible...". I question whether such a "precise" description of the application site would contribute "...to ensure low variability in labeling duration...", especially when it was performed by two independent researchers "...in tandem, ensuring that an experimental group of 16 embryos could be labeled in less than five minutes."

      In the section "Immunofluorescence labeling of cryosections": "...a blocking and permeabilizing solution in a humidified atmosphere." This solution is comprised of final concentrations of 0.1% Triton-X-100, 0.02% sodium azide, 5% horse serum (Sigma-Aldrich, Cat#H1270), 0.01% thimerosal, and 0.3% bovine serum albumin (Sigma-Aldrich, Cat#A7284) in PBS at pH 7.4." Absence of detergent is used during incubation with primary and secondary antibodies (even the widely recommended Tween20!). I wonder whether it might be the source of non-specific tissue labelling by donkey anti-chicken IgY that was considered as an evidence of chicken origin.

      Significance

      General asssesment:

      The research has technical significance, and encourages greater use of this alternative animal model in cancer studies to help us understand the regulation of the cancer cell cycle. This, in turn, may improve the implementation of existing treatment methods or uncover potential vulnerabilities in the cancer cell cycle. No clinical significance so far. Certan groups conducting cancer biology studies and modelling might be interested in implementation of the described method, although it will be challenging without macroscopic evidence of the expected tumor nodules.The data and the methods presented in such a way that reproducing them might be challenging.The experiments are not adequately replicated and statistical analysis inadequate.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      By using the CAM assay, the Authors of this studyshow that for the BxPC-3 and AsPC-1 cell lines, nucleoside labelling with 5-ethynyl-2'-deoxyuridine (EdU) can be multiplexed successfully with other cell-cycle markers such as cyclin B1 and Ki67, especially when combined with digital image analysis techniques. Starting from ED14, they observe the presence of a chicken embryonic cell type that appears to possess a high-quantity of extranuclear accumulation of EdU. Initial assessment of these cells showed that they are likely granulocytes which can be found in the embryonic liver of grafted and non-grafted embryos, as well as in xenograft sections. These cells do not express chicken MHC II, in turn making it less likely that they represent professional antigen presenting cells.

      Significance

      Remarks. This is a well-done study. However, I ask to the Authors to include also figures showing the macroscopic in ovo evidence and to discuss the role of the CAM assay in the study of xenografts.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for their positive and constructive feedback.

      We apologise for the delay in coming back. The first author has moved to the LMB, and the Trost lab has been relocating to the University of Manchester, which delayed our ability to respond quickly.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Reviewer Comments

      The manuscript by Chatterjee et al. describes a novel ultra-sensitive isolation and deep proteomics workflow to investigate phagosome dynamics of bacterium-containing phagosomes. The method enables dual proteome coverage of both host and pathogen, and the authors report quantitative changes in the host and bacterial proteomes using Salmonella isogenic mutants defective in intracellular survival. They further leverage these datasets to assess the relevance of selected Salmonella genes in intracellular fitness.

      Overall, the manuscript presents a powerful and technically impressive approach that will be of significant interest to the infection biology community. The study is well conceived and addresses an important gap in the field. However, several clarifications and additions would strengthen the work and improve interpretability of the results.

      Specific Comments

      Line 76: The authors should consider including the following relevant citations: PMID: 30079117 and PMID: 31009521.

      We thank the reviewer for pointing this out. We have now included the suggested references


      Line 104: Please define the abbreviation BFP clearly upon first use.

      We thank the reviewer; we have defined the abbreviation upon first instance.

      Figure 1A, Step 2: From the schematic, it is unclear whether the pellet or the supernatant is used for the subsequent step in which the CellVue dye is added. Please clarify.

      We thank the reviewer for bringing this to our attention. We have now modified Figure 1A.

      Figure 1B: It would be informative to report the percentage of S. Typhimurium that are double positive, especially in the BFP + Claret condition. A small bar plot for each condition would help visualize and compare the proportion of Claret-labelled bacteria.

      We have now included a figure for the percentage of BFP + Claret for STM in S1H.

      Figure 1C: The distinction between the upper and lower images is unclear. Do they represent different particles or different fields of view of the same sample? Please clarify.

      They both are from different fields of view.

      Line 122: The statement is not entirely accurate. Cells that lyse via pyroptosis will leave behind cellular remnants, including nuclei, that may still co-sediment with intact cells in such preparations.

      We have modified the sentence accordingly.

      Line 128: CellVue and Claret appear to be used interchangeably-are they the same reagent? Please clarify and use consistent terminology throughout.

      We have rectified this inconsistency in our revised manuscript.

      Line 136: Please explain the basis for the stated estimates. If this is common knowledge within the field, additional explanation would still be helpful for non-experts.

      We have clarified this further in the manuscript. Obviously, these numbers are estimates but give the reader an idea with how little material we are working.

      Lines 143 & 145: Please define "protein IDs" and indicate how many correspond to host proteins versus Salmonella proteins.

      We have defined this in our revised manuscript. Also, to avoid any confusion, these proteomics methods were optimised using a commercially available HeLa protein digest, and hence no Salmonella proteins are detected here.

      Figure 2D: Please specify the number and type of replicates used. Also indicate the plot type (e.g., violin plot) and the statistical test used to determine significance.

      We have updated figure legend for 2D and 2E stating the number of biological replicates, i.e. n=4 and n=3.

      Line 244: Please consider citing PMID: 32514074 and PMID: 23162002.

      *We have included these references. *

      Line 253: Have the authors considered how their observations regarding MHC relate to prior findings (PMID: 27832589)?

      *Thank you for suggesting this paper and we enjoyed reading it. However, since the paper suggested by the reviewer focusses on cell surface MHC molecules and we are looking at the phagolysosomal compartment, we feel it may be difficult to make connections. *

      Line 265: Clarify which "cell" is being referred to-the host cell or the bacterial cell.

      We have modified the sentence to reduce confusion.

      Line 278: Have the authors considered how their observations on glycolytic proteins relate to earlier work (PMID: 19380470 and PMID: 37594988)?

      *Thank you for pointing out these papers. We have cited both of these and added another sentence that intracellular STM utilises host metabolites. *

      Line 285: The claim that "PhoP-dependent effectors actively remodel..." requires clarification. If the authors are referring to all PhoP-regulated genes as "effectors," this terminology may cause confusion, as "effectors" in the Salmonella field typically denotes T3SS-secreted proteins. While some T3SS effectors are PhoP-regulated, PhoP controls many additional genes, and the observed phenotypes may reflect broader defects in intracellular survival rather than absence of secreted effectors specifically. Rewording is recommended.

      Thank you for your suggestion, we have modified the same in text.

      Line 313: Have the authors examined later time points (e.g., 8 hpi), when the SCV is more established and SPI-2 effector expression is higher?

      We did not test the 8 hpi timepoint because our primary aim was to identify the induction of SPI-2 effectors at earlier stages. Testing later timepoints would be problematic, as PhoP mutants show poor survival at these times, which would confound comparisons between STM WT and PhoP mutants.

      Line 317: Were secreted SPI-2 effectors detectable using PhagoCyt, and if so, how did they behave?

      We detected some of the secreted effectors as well, and they are in accordance with the literature. As expected, most of them were detected only in WT at 4 hpi.

      For example, PipB2, SseL and SctB1 are significantly decreased in the PhoP mutant compared to the STM WT at 4 hpi.

      Line 319: Have the candidate Salmonella mutants been evaluated at later time points (6-8 hpi)? Stronger phenotypic differences may emerge when intracellular replication relies more heavily on SPI-2 function.

      We acknowledge that there may be larger differences at later time points; However, we wanted to be comparable with the data within the manuscript, i.e. within the 4 hour time-point that we have kept throughput. Moreover, at later timepoint we see increase macrophage cell death and therefore refrain from doing timepoints much longer after the 4 hour mark.

      Figure 5B: For all mutant strains, please also report in vitro growth to determine whether the phenotypes reflect general growth defects or are specific to the intracellular environment.

      We have performed the growth curve for the PhoP mutant, which is in the supplemental figure 1.

      Line 336: As above, please reconsider the use of the term "effectors." Unless evidence is provided that these are bona fide secreted SPI-2 effectors, an alternative term would avoid confusion.

      We have modified the sentence to reduce confusion.

      Supplementary Figure 5: The volcano plots appear pixelated. Please provide higher-resolution versions.

      Thank you for pointing this out. We have rectified this.

      Reviewer #1 (Significance (Required)):

      General assessment:

      This study introduces a highly sensitive dual host-pathogen proteomics workflow for profiling bacterium-containing phagosomes. Its key strengths are the technical innovation and the mechanistic insight gained using Salmonella mutants. The main areas needing improvement are clarification of methodological details and tighter interpretation of some biological claims.

      Advance:

      To my knowledge, this is the first study to achieve such deep, simultaneous proteomic coverage of both host and intracellular bacteria within purified phagosomes. This represents a notable technical advance and provides new mechanistic insight into intracellular adaptation and immune regulation.

      Audience:

      The work will interest a specialized audience in infection biology, host-pathogen interactions, and proteomics, with broader relevance for researchers studying organelle isolation or intracellular pathogens. The workflow and datasets will be useful as a resource for future studies.

      Reviewer expertise:

      Expertise in host-pathogen interactions, bacterial intracellular survival, macrophage biology, and functional proteomics. Limited expertise in MS instrumentation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this work, Chatterjee, Rubio and colleagues use a novel flow cytometry-based method to isolate phagosomes from Salmonella infected macrophages. This method is applied both to wild-type and to a mutant (deletion of phoP) that does not express virulence genes, prior to the proteome characterization of these phagosomes and the bacteria that they contain. The experiments were done at an early point of infection (30 min) and a later time point (4 h). The authors first identified mitochondrial proteins in their analysis, which had previously been considered contaminants from the preparation of phagosomes. However, some Salmonella effector proteins are known to affect mitochondria, and the authors demonstrate that inhibition of Complex I showed decreased Salmonella intracellular viability. Comparing WT and the phoP mutant also highlighted two Salmonella proteins that enhance intracellular survival. In addition, the authors show that their method recapitulates previously known proteins involved in Salmonella infection. The study is well designed and clearly written.

      I have only some minor comments that I hope will strengthen the work:

      It would be interesting to compare the results with a whole cell proteome analysis, and to other approaches that involve subcellular fractionation (both in the context of Salmonella infection) to: a) highlight proteins that are specifically changing in abundance in the phagosomes (but not necessarily in the cell), and b) to show that this approach is able to capture previously unknown phenomena. To avoid the performing additional experiments, the authors can compare their dataset to previous proteomic datasets of Salmonella infection. We have compared this with the ultracentrifugation methods STM WT 4h vs STM WT uptake (Figure 6A).

      A color scale for the heatmap in Fig 2C is needed. I assume that this heatmap shows intensity and not fold-changes, and thus suggest that the authors use a single-color gradient for easier visualization.

      *This has now been included. *

      Best regards,

      André Mateus

      Reviewer #2 (Significance (Required)):

      General assessment: This study provides a novel approach to study intracellular pathogenic bacteria. The method is applied to Salmonella, but can potentially be used for any bacteria, including non-genetically tractable organisms. A strength of the approach is that it captures the bacterial proteome, which is mostly undetectable when studying infected cells. Further, by enriching phagosomes, it allows measuring the spatial distribution of proteins to these organelles. The study could be improved by distinguishing proteome changes that are caused by trafficking of proteins to phagosomes vs general changes in protein abundance.

      Advance: Apart from a new methodology, the authors use the approach to identify novel aspects of Salmonella infection biology, e.g., the importance of mitochondrial proteins in host defense or novel Salmonella proteins that are involved in intracellular survival. Audience: The audience for this study is mostly those in the field of infection biology, particularly Salmonella. The dataset generated can be used to identify novel aspects of Salmonella infection, and the described method could be applied to other pathogens.

      My field of expertise: Proteomics, microbiology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the manuscript "Flow cytometry-based isolation of Salmonella-containing phagosomes combined with ultra-sensitive proteomics reveals novel insights into host-pathogen interactions", the authors describe a new method for analysis of composition pathogen-containing phagosomes and the pathogens within. Combination of FACS-based single phagosome analysis and sorting combined with optimised highly sensitive proteomic analysis of sorted vesicles has potential for identification of so far overlooked host-pathogen interactions. Although this is well described in the manuscript, some controls are missing.

      Major comments:

      1) The sorting of labelled bacteria is a crucial bottleneck in the whole procedure. The gating strategy presented in the Fig. 1B suggest that the initial "bacterial phagosome size" is limited from the bottom based on the noise signal but not from top. Therefore any not broken THP-1 cell remaining in the sample would be also included in the analysis. In respect to very high sensitivity of the mass spectrometry procedure and high abundance of housekeeping genes in host cells, this contamination could well explain the appearance of mitochondria, ribosome, and nuclear envelope proteins identified in Fig 2B and undermine the following results. Therefore, the gating strategy should be more stringent and data from this more stringent gating shall be compared with the current data sets. Since the authors use BFP+ Salmonella and do not analyse the claret+BFP- events, a BFP vs FSC gating step could help to distinguish free bacteria, bacteria in vesicles, and not or only partially broken host cells.

      We use a series of centrifugations to ensure that we do not have intact cells in the prepared samples. We have also visualised the final samples under the microscope and did not observe any intact cells. Because of the side/forward scatter gating, intact cells are not within the field of sorting. In Figure 1B we show that free bacteria are not within the gating strategy that we used. Finally, we visually inspected >100 pictures of sorted phagosomes by imaging flow cytometry and did not see any intact cells or free bacteria.

      2) Since the authors present data previously well accepted as contaminations from other fractions, these shall be carefully validated by other methods. For example the contact of mitochondria with SCV could be validated using a FRET- or split FP- based assays. Change of abundance of surface proteins on SCV in individual timepoints shall be validated using antibody-based flow cytometry on isolated SCVs. Most relevant antibodies are already described in the manuscript or available commercially (IL4R, IFNgR, integrins, TLRs). Microscopy-based quantification could help with the soluble proteins present within SCVs.

      We agree with the reviewer that this would be very interesting. However, we feel that this is outside of the scope of this paper and will be very laborious and time consuming, practically a whole project in itself.

      3) Since the authors describe an alternative method to methods used previously, they shall discuss the differences in results obtained by the formerly used methods.

      We have now provided a dataset that is with SCVs isolated using ultracentrifugation as a comparatively analysis to our method (Figure S6A and Table S8). __The data show that the ultracentrifugation-isolated phagosomes have many more proteins from any organelle (__Figure S6B), suggesting that they are less pure than the phagosomes isolated by the PhagoCyt approach.

      4) Only 15 Salmonella proteins downregulated between 0.5 and 4 h timepoints were identified. However, at least genes from SPI-1 and flagella would be expected to be downregulated at 4 h p.i. How do the authors explain this discrepancy? In contrast, are the SPI-2 genes among those identified as upregulated?

      In our supplementary table 6 (comparison between WT 4h vs WT uptake), we see that there are 458 Salmonella proteins that are only present in uptake samples, these were not included in limma analysis since they are completely absent in the WT 4h. We decided to report these as “unique” proteins rather than perform imputation. In Figure 5B, we specifically highlight STM proteins down-regulated, which include flagellar proteins and SPI-1 proteins.

      To answer your second question, yes, several SPI-2 genes (effectors and other regulatory proteins) are upregulated at 4 hpi. 131 Salmonella proteins are significantly upregulated, and 55 proteins are exclusively present in the WT 4hpi samples. Some selected examples are in Figure 5A.

      Minor comments:

      1) Fig 1, the figure caption seems to remain parts of an older version, mentioning blue bars not present in the current version?

      The figure caption appears to be correct for us; the “blue” is in the unstained BFP Salmonella, which is hidden behind the purple, which is the BFP Salmonella + CellVue Claret.

      2) Fig 1A point 1, how were the dead cells removed? Normal centrifugation is not able to discriminate dead and living cells well enough as percoll gradient centrifugation for example would be. Such gradient centrifugation is not mentioned in the Methods section though.

      We have not used Percoll-based centrifugation to remove dead cells; instead, we have washed the adherent macrophages in dishes 3-4 times with ice-cold PBS to remove dead, floating cells, and then washed the pellet several times with PBS to ensure we are not taking any dead cells into the sample preparation.

      3) Fig 1A point 2, did the authors check for the composition of the pellet fraction in each centrifugation step? What are the losses and cross contaminations of the other fraction?

      No, we have not checked the composition of each fraction using mass spec; however, we did run some western blots to correctly identify the major organelle contribution in each fraction.

      4) Suppl. Fig 1, caption for panels F and G are missing. The axis in the panel G is misleading - the bacteria obtained in "output" contain proliferating intracellular bacteria that originate only from a fraction of the "input" bacteria. Since the figure clearly show increase in the number of intracellular bacteria and all the extracellular bacteria should be killed by gentamicin, all bacteria in the "output" probably proliferate intracellularly and, therefore, originate from the same fraction of the "input" throughout the whole assay. Showing these results as CFU per well/plate/surface area or cell count would be more exact, in this case the "input" data shall be shown as a separate data point.

      We thank the reviewer for this observation. We have now modified the figure legends. These are normalised per cell, and we think they provide accurate results.

      5) Fig 1B, could the authors show the percentages in individual quadrants for the green "Sample with BFP Salmonella + claret"?

      Yes, there is the plot that depicts the percentage in Supplementary Figure 1H, this varies between WT and PhoP mutant, and hence, we decided to not show this in one figure.


      6) All proteins identified as significantly up or down represented shall be listed in a supplementary file.

      They are listed in the supplemental tables.

      7) Fig 2C suggests that some mitochondrial proteins are similarly present at the SCV containing WT Salmonella at 4h as ∆phoP mutant at 0.5 h p.i. Could the authors speculate how is that? The scale of blue/orange transition shall be shown in Fig 2C.

      We speculate that Salmonella WT alters the maturation of the SCVs is heavily arrested by the pathogen and hence resemble the early SCV of a mutant that is unable to arrest the SCV degradation stages.

      8) In the Fig 2D, the authors show decrease of CFU obtained from THP-1 cells treated with Rotenone. However, rotenone is known to induce host cell apoptosis. Were the presented data normalized to amount of living host cells in the sample? For example measurement of protein concentration in the sample lysate after washing away the dying host cells should enable this.

      Yes, we have normalised the data to the account for the percentage of live cells using live dead staining. However, in the timepoints used, we did not observe significant cell death.

      9) Microscopy-based observation of mitochondria relocation to SCVs in time shall strengthen the claim that mitochondria-derived ROS are involved in anti-Salmonella host defense.

      There are multiple literature PMID: 38356294, PMID: 41444067, PMID: 15866946, PMID: 41198672 that support our data in this regard.

      10) The Salmonella proteins identified in the Fig 5 shall be validated using qPCR.

      We think that data from qPCR would not be accurate to validate Salmonella proteins, as it has been shown that Salmonella mRNAs can have sub-minute half-lives (PMID: 38527194). We used rather conservative proteomics analysis settings, that have shown in a recent pre-print of our lab to have 0% false discoveries and 0.4% false quantitative rate ( https://doi.org/10.1101/2025.09.22.677725). We acknowledge that another reviewer did not find this experiment to be essential.

      Reviewer #3 (Significance (Required)):

      The manuscript was reviewed mainly from the Salmonella and flow cytometry/FACS expertise point of view. The main interest in the study lies within its methodological advances - combination of single vesicle analysis using flow cytometry/FACS with highly sensitive mass spectrometry analysis. In comparison to other similar studies in the field, this combination significantly expands the possibilities of sorting of distinct subpopulations of vesicles from the same cells. This will make the article of interest to scientists in the broad field of host-pathogen interactions and immunology.

      **Referee cross-commenting**

      Reviewer 3 - @Reviewer #1: I see your point and leave it at the editors to judge how important this comment is. My reasoning was this: Fig 5

      serves as a proof of concept that PhagoCyt has the power to make new discoveries in Salmonella biology. While behavior of some of the proteins

      shown if Fig 5 is well described (e.g. flagella or SPI-1 T3SS components and effectors), some are novel and to prove the functionality of the

      method, these results should be confirmed by some other well accepted mean. Given the great sensitivity of PhagoCyt, other proteomic

      approaches are unlikely to help in this case (e.g. flagella or SPI-1 T3SS components and effectors are not detectable by western blot at 4 h p.i.).

      Therefore, I suggest qPCR (but would accept any other method as well) as a very sensitive and well accepted approach, but leave at the authors

      to chose what proteins they want to use for the validation.

      Reviewer 1- I agree with comments raised by the other two reviewers, except the following point from Reviewer 3 '10) The Salmonella proteins

      identified in the Fig 5 shall be validated using qPCR.' It is not clear which proteins are being referred to and it is unclear to this reviewer how this

      experiment(s) would improve the manuscript in its current form.

      Reviewer 3- I agree with all comments raised.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript "Flow cytometry-based isolation of Salmonella-containing phagosomes combined with ultra-sensitive proteomics reveals novel insights into host-pathogen interactions", the authors describe a new method for analysis of composition pathogen-containing phagosomes and the pathogens within. Combination of FACS-based single phagosome analysis and sorting combined with optimised highly sensitive proteomic analysis of sorted vesicles has potential for identification of so far overlooked host-pathogen interactions. Although this is well described in the manuscript, some controls are missing.

      Major comments:

      1. The sorting of labelled bacteria is a crucial bottleneck in the whole procedure. The gating strategy presented in the Fig. 1B suggest that the initial "bacterial phagosome size" is limited from the bottom based on the noise signal but not from top. Therefore any not broken THP-1 cell remaining in the sample would be also included in the analysis. In respect to very high sensitivity of the mass spectrometry procedure and high abundance of housekeeping genes in host cells, this contamination could well explain the appearance of mitochondria, ribosome, and nuclear envelope proteins identified in Fig 2B and undermine the following results. Therefore, the gating strategy should be more stringent and data from this more stringent gating shall be compared with the current data sets. Since the authors use BFP+ Salmonella and do not analyse the claret+BFP- events, a BFP vs FSC gating step could help to distinguish free bacteria, bacteria in vesicles, and not or only partially broken host cells.
      2. Since the authors present data previously well accepted as contaminations from other fractions, these shall be carefully validated by other methods. For example the contact of mitochondria with SCV could be validated using a FRET- or split FP- based assays. Change of abundance of surface proteins on SCV in individual timepoints shall be validated using antibody-based flow cytometry on isolated SCVs. Most relevant antibodies are already described in the manuscript or available commercially (IL4R, IFNgR, integrins, TLRs). Microscopy-based quantification could help with the soluble proteins present within SCVs.
      3. Since the authors describe an alternative method to methods used previously, they shall discuss the differences in results obtained by the formerly used methods.
      4. Only 15 Salmonella proteins downregulated between 0.5 and 4 h timepoints were identified. However, at least genes from SPI-1 and flagella would be expected to be downregulated at 4 h p.i. How do the authors explain this discrepancy? In contrast, are the SPI-2 genes among those identified as upregulated?

      Minor comments:

      1. Fig 1, the figure caption seems to remain parts of an older version, mentioning blue bars not present in the current version?
      2. Fig 1A point 1, how were the dead cells removed? Normal centrifugation is not able to discriminate dead and living cells well enough as percoll gradient centrifugation for example would be. Such gradient centrifugation is not mentioned in the Methods section though.
      3. Fig 1A point 2, did the authors check for the composition of the pellet fraction in each centrifugation step? What are the losses and cross contaminations of the other fraction?
      4. Suppl. Fig 1, caption for panels F and G are missing. The axis in the panel G is misleading - the bacteria obtained in "output" contain proliferating intracellular bacteria that originate only from a fraction of the "input" bacteria. Since the figure clearly show increase in the number of intracellular bacteria and all the extracellular bacteria should be killed by gentamicin, all bacteria in the "output" probably proliferate intracellularly and, therefore, originate from the same fraction of the "input" throughout the whole assay. Showing these results as CFU per well/plate/surface area or cell count would be more exact, in this case the "input" data shall be shown as a separate data point.
      5. Fig 1B, could the authors show the percentages in individual quadrants for the green "Sample with BFP Salmonella + claret"?
      6. All proteins identified as significantly up or down represented shall be listed in a supplementary file.
      7. Fig 2C suggests that some mitochondrial proteins are similarly present at the SCV containing WT Salmonella at 4h as ∆phoP mutant at 0.5 h p.i. Could the authors speculate how is that? The scale of blue/orange transition shall be shown in Fig 2C.
      8. In the Fig 2D, the authors show decrease of CFU obtained from THP-1 cells treated with Rotenone. However, rotenone is known to induce host cell apoptosis. Were the presented data normalized to amount of living host cells in the sample? For example measurement of protein concentration in the sample lysate after washing away the dying host cells should enable this.
      9. Microscopy-based observation of mitochondria relocation to SCVs in time shall strengthen the claim that mitochondria-derived ROS are involved in anti-Salmonella host defense.
      10. The Salmonella proteins identified in the Fig 5 shall be validated using qPCR.

      Referee cross-commenting

      Reviewer 3 - @Reviewer #1: I see your point and leave it at the editors to judge how important this comment is. My reasoning was this: Fig 5 serves as a proof of concept that PhagoCyt has the power to make new discoveries in Salmonella biology. While behavior of some of the proteins shown if Fig 5 is well described (e.g. flagella or SPI-1 T3SS components and effectors), some are novel and to prove the functionality of the method, these results should be confirmed by some other well accepted mean. Given the great sensitivity of PhagoCyt, other proteomic approaches are unlikely to help in this case (e.g. flagella or SPI-1 T3SS components and effectors are not detectable by western blot at 4 h p.i.). Therefore, I suggest qPCR (but would accept any other method as well) as a very sensitive and well accepted approach, but leave at the authors to chose what proteins they want to use for the validation.

      Reviewer 1- I agree with comments raised by the other two reviewers, except the following point from Reviewer 3 '10) The Salmonella proteins identified in the Fig 5 shall be validated using qPCR.' It is not clear which proteins are being referred to and it is unclear to this reviewer how this experiment(s) would improve the manuscript in its current form.

      Reviewer 3- I agree with all comments raised.

      Reviewer 2- I agree with the other reviewer's comments/suggestions.

      Significance

      The manuscript was reviewed mainly from the Salmonella and flow cytometry/FACS expertise point of view. The main interest in the study lies within its methodological advances - combination of single vesicle analysis using flow cytometry/FACS with highly sensitive mass spectrometry analysis. In comparison to other similar studies in the field, this combination significantly expands the possibilities of sorting of distinct subpopulations of vesicles from the same cells. This will make the article of interest to scientists in the broad field of host-pathogen interactions and immunology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, Chatterjee, Rubio and colleagues use a novel flow cytometry-based method to isolate phagosomes from Salmonella infected macrophages. This method is applied both to wild-type and to a mutant (deletion of phoP) that does not express virulence genes, prior to the proteome characterization of these phagosomes and the bacteria that they contain. The experiments were done at an early point of infection (30 min) and a later time point (4 h). The authors first identified mitochondrial proteins in their analysis, which had previously been considered contaminants from the preparation of phagosomes. However, some Salmonella effector proteins are known to affect mitochondria, and the authors demonstrate that inhibition of Complex I showed decreased Salmonella intracellular viability. Comparing WT and the phoP mutant also highlighted two Salmonella proteins that enhance intracellular survival. In addition, the authors show that their method recapitulates previously known proteins involved in Salmonella infection. The study is well designed and clearly written.

      I have only some minor comments that I hope will strengthen the work:

      1. It would be interesting to compare the results with a whole cell proteome analysis, and to other approaches that involve subcellular fractionation (both in the context of Salmonella infection) to: a) highlight proteins that are specifically changing in abundance in the phagosomes (but not necessarily in the cell), and b) to show that this approach is able to capture previously unknown phenomena. To avoid the performing additional experiments, the authors can compare their dataset to previous proteomic datasets of Salmonella infection.
      2. A color scale for the heatmap in Fig 2C is needed. I assume that this heatmap shows intensity and not fold-changes, and thus suggest that the authors use a single-color gradient for easier visualization.

      Best regards, André Mateus

      Referee cross-commenting

      Reviewer 3 - @Reviewer #1: I see your point and leave it at the editors to judge how important this comment is. My reasoning was this: Fig 5 serves as a proof of concept that PhagoCyt has the power to make new discoveries in Salmonella biology. While behavior of some of the proteins shown if Fig 5 is well described (e.g. flagella or SPI-1 T3SS components and effectors), some are novel and to prove the functionality of the method, these results should be confirmed by some other well accepted mean. Given the great sensitivity of PhagoCyt, other proteomic approaches are unlikely to help in this case (e.g. flagella or SPI-1 T3SS components and effectors are not detectable by western blot at 4 h p.i.). Therefore, I suggest qPCR (but would accept any other method as well) as a very sensitive and well accepted approach, but leave at the authors to chose what proteins they want to use for the validation.

      Reviewer 1- I agree with comments raised by the other two reviewers, except the following point from Reviewer 3 '10) The Salmonella proteins identified in the Fig 5 shall be validated using qPCR.' It is not clear which proteins are being referred to and it is unclear to this reviewer how this experiment(s) would improve the manuscript in its current form.

      Reviewer 3- I agree with all comments raised.

      Reviewer 2- I agree with the other reviewer's comments/suggestions.

      Significance

      General assessment: This study provides a novel approach to study intracellular pathogenic bacteria. The method is applied to Salmonella, but can potentially be used for any bacteria, including non-genetically tractable organisms. A strength of the approach is that it captures the bacterial proteome, which is mostly undetectable when studying infected cells. Further, by enriching phagosomes, it allows measuring the spatial distribution of proteins to these organelles. The study could be improved by distinguishing proteome changes that are caused by trafficking of proteins to phagosomes vs general changes in protein abundance.

      Advance: Apart from a new methodology, the authors use the approach to identify novel aspects of Salmonella infection biology, e.g., the importance of mitochondrial proteins in host defense or novel Salmonella proteins that are involved in intracellular survival.

      Audience: The audience for this study is mostly those in the field of infection biology, particularly Salmonella. The dataset generated can be used to identify novel aspects of Salmonella infection, and the described method could be applied to other pathogens.

      My field of expertise: Proteomics, microbiology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Chatterjee et al. describes a novel ultra-sensitive isolation and deep proteomics workflow to investigate phagosome dynamics of bacterium-containing phagosomes. The method enables dual proteome coverage of both host and pathogen, and the authors report quantitative changes in the host and bacterial proteomes using Salmonella isogenic mutants defective in intracellular survival. They further leverage these datasets to assess the relevance of selected Salmonella genes in intracellular fitness. Overall, the manuscript presents a powerful and technically impressive approach that will be of significant interest to the infection biology community. The study is well conceived and addresses an important gap in the field. However, several clarifications and additions would strengthen the work and improve interpretability of the results.

      Specific Comments

      Line 76: The authors should consider including the following relevant citations: PMID: 30079117 and PMID: 31009521. Line 104: Please define the abbreviation BFP clearly upon first use. Figure 1A, Step 2: From the schematic, it is unclear whether the pellet or the supernatant is used for the subsequent step in which the CellVue dye is added. Please clarify. Figure 1B: It would be informative to report the percentage of S. Typhimurium that are double positive, especially in the BFP + Claret condition. A small bar plot for each condition would help visualize and compare the proportion of Claret-labelled bacteria. Figure 1C: The distinction between the upper and lower images is unclear. Do they represent different particles or different fields of view of the same sample? Please clarify. Line 122: The statement is not entirely accurate. Cells that lyse via pyroptosis will leave behind cellular remnants, including nuclei, that may still co-sediment with intact cells in such preparations. Line 128: CellVue and Claret appear to be used interchangeably-are they the same reagent? Please clarify and use consistent terminology throughout. Line 136: Please explain the basis for the stated estimates. If this is common knowledge within the field, additional explanation would still be helpful for non-experts. Lines 143 & 145: Please define "protein IDs" and indicate how many correspond to host proteins versus Salmonella proteins. Figure 2D: Please specify the number and type of replicates used. Also indicate the plot type (e.g., violin plot) and the statistical test used to determine significance. Line 244: Please consider citing PMID: 32514074 and PMID: 23162002. Line 253: Have the authors considered how their observations regarding MHC relate to prior findings (PMID: 27832589)? Line 265: Clarify which "cell" is being referred to-the host cell or the bacterial cell. Line 278: Have the authors considered how their observations on glycolytic proteins relate to earlier work (PMID: 19380470 and PMID: 37594988)? Line 285: The claim that "PhoP-dependent effectors actively remodel..." requires clarification. If the authors are referring to all PhoP-regulated genes as "effectors," this terminology may cause confusion, as "effectors" in the Salmonella field typically denotes T3SS-secreted proteins. While some T3SS effectors are PhoP-regulated, PhoP controls many additional genes, and the observed phenotypes may reflect broader defects in intracellular survival rather than absence of secreted effectors specifically. Rewording is recommended. Line 313: Have the authors examined later time points (e.g., 8 hpi), when the SCV is more established and SPI-2 effector expression is higher? Line 317: Were secreted SPI-2 effectors detectable using PhagoCyt, and if so, how did they behave? Line 319: Have the candidate Salmonella mutants been evaluated at later time points (6-8 hpi)? Stronger phenotypic differences may emerge when intracellular replication relies more heavily on SPI-2 function. Figure 5B: For all mutant strains, please also report in vitro growth to determine whether the phenotypes reflect general growth defects or are specific to the intracellular environment. Line 336: As above, please reconsider the use of the term "effectors." Unless evidence is provided that these are bona fide secreted SPI-2 effectors, an alternative term would avoid confusion. Supplementary Figure 5: The volcano plots appear pixelated. Please provide higher-resolution versions.

      Referee cross-commenting

      Reviewer 3 - @Reviewer #1: I see your point and leave it at the editors to judge how important this comment is. My reasoning was this: Fig 5 serves as a proof of concept that PhagoCyt has the power to make new discoveries in Salmonella biology. While behavior of some of the proteins shown if Fig 5 is well described (e.g. flagella or SPI-1 T3SS components and effectors), some are novel and to prove the functionality of the method, these results should be confirmed by some other well accepted mean. Given the great sensitivity of PhagoCyt, other proteomic approaches are unlikely to help in this case (e.g. flagella or SPI-1 T3SS components and effectors are not detectable by western blot at 4 h p.i.). Therefore, I suggest qPCR (but would accept any other method as well) as a very sensitive and well accepted approach, but leave at the authors to chose what proteins they want to use for the validation.

      Reviewer 1- I agree with comments raised by the other two reviewers, except the following point from Reviewer 3 '10) The Salmonella proteins identified in the Fig 5 shall be validated using qPCR.' It is not clear which proteins are being referred to and it is unclear to this reviewer how this experiment(s) would improve the manuscript in its current form.

      Reviewer 3- I agree with all comments raised.

      Reviewer 2- I agree with the other reviewer's comments/suggestions.

      Significance

      General assessment:

      This study introduces a highly sensitive dual host-pathogen proteomics workflow for profiling bacterium-containing phagosomes. Its key strengths are the technical innovation and the mechanistic insight gained using Salmonella mutants. The main areas needing improvement are clarification of methodological details and tighter interpretation of some biological claims.

      Advance:

      To my knowledge, this is the first study to achieve such deep, simultaneous proteomic coverage of both host and intracellular bacteria within purified phagosomes. This represents a notable technical advance and provides new mechanistic insight into intracellular adaptation and immune regulation.

      Audience:

      The work will interest a specialized audience in infection biology, host-pathogen interactions, and proteomics, with broader relevance for researchers studying organelle isolation or intracellular pathogens. The workflow and datasets will be useful as a resource for future studies.

      Reviewer expertise:

      Expertise in host-pathogen interactions, bacterial intracellular survival, macrophage biology, and functional proteomics. Limited expertise in MS instrumentation.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all three reviewers for their careful and constructive engagement with our manuscript. We are encouraged by their overall positive assessment of the work. Reviewer 1 described this as "an important study" that addresses a significant gap in understanding systemic, inter-organ responses to hypoxia, and noted the potential relevance of our findings to mammalian IL-6 biology. Reviewer 2 highlighted the study as being of "high significance" and described it as "a foundation study that will be the motivation for numerous high-impact papers in the future", noting its broad relevance to understanding hypoxia in both health and disease. In the revised manuscript, we have addressed all of the reviewers' comments and critiques. This includes performing several new experiments, expanding our Discussion, and making a number of clarifications to the text, figures, and methods as detailed below.

      Reviewer #1


      __(Evidence, reproducibility and clarity (Required)): __The authors describe a role of Unpaired 3 (Upd3) in tissue communication in responses to hypoxia in Drosophila adult flies. Upd3 mRNA is strongly upregulated in hypoxia, along with well-characterized JAK/STAT downstream target genes, in both adult fly males and females, as well as in larvae. Interestingly, adult females but not males require Upd3 for 15 to 24 h survival in hypoxia, as Upd3 mutant females but not males die to a much larger proportion in these conditions. Adult females they display strong hypoxic upregulation of Upd3 in the gut, assessed by RT-PCR or through a Gal4 transcriptional reporter, mainly in epithelial enterocytes. Enterocyte-specific RNAi-mediated KD indicated that this enterocyte expression of Upd3 represents about 40% of Upd3 expression in the whole body. Enterocyte-specific KD of Upd3 in adult females significantly reduced survival in hypoxia, suggesting that this expression is critical for hypoxic adaptation. Tissue-specific analysis of the expression of the STAT target genes, SOCS36E, TotA and TotM revealed that stimulation of the JAK/STAT pathway in hypoxia is widespread, although more pronounced in abdominal tissues. Indeed, overexpression of Upd3 in enterocytes provokes upregulation o both target genes TotA and TotM. Consistent with this RNAi-dependent inhibition of the JAK/STAT pathway in the fat body and oenocytes significantly reduced survival of female flies in hypoxia. Nitric oxide synthase (NOS) is strongly upregulated in adult female abdomens upon hypoxic exposure, and KD of NOS in fat body and oenocytes reduced hypoxic survival. Surprisingly, the found that ubiquitous KD of HIFa/Sima led to mitigation of Upd3 hypoxic induction and, more clearly, to JAK/STAT target gene induction. HIF KD flies displayed increased lethality in hypoxia, and this lethality was slightly mitigated in Upd3 heterozygous flies. The authors conclude that increased lethality of HIF-minus flies in hypoxia stems at least in part from excessive levels of Upd3. The authors then find that HIF/Sima-dependent inhibition of Upd3 expression is non-cell autonomous, since KD of Sima specifically in the gut does not affect expression of Upd3 in this organ. Instead, Sima KD at the fat body led to significant increase of Upd expression in the gut, suggesting that a Sima-born signal communicates these two organs, leading to restriction of Upd3 intestinal expression. ROS does not seem to be the signal that communicates the fat body with the gut, as expression of catalase in the fat body did not affect expression of Upd3 in the gut.

      (Significance (Required)): This is an important study, because most previous studies have focused on cell-autonomous responses to hypoxia, but much less is known about systemic responses to low oxygen conditions, particularly in relation to inter-organ communication during this responses. This work defines the cytokine unpaired 3, homolog of human interleukin 6, as a major regulator of systemic responses to hypoxia. Future studies will determine if interleukin 6 plays similar roles in mammals. This work might be of interest for a broad audience interested in responses to hypoxia, as well as general physiology.

      We thank Reviewer 1 for their careful reading and comments on the manuscript. We are pleased that they found this to be "an important study" that addresses a gap in understanding systemic, inter-organ responses to hypoxia. We have addressed each of their concerns in the revised manuscript as outlined below.

      __MAJOR CONCERNS __ 1) Figure 1 lacks statistical analysis. It is important to determine if the apparent differences in gene expression are statistically significant.

      We have now added the statistical analyses to the revised version of the figures.

      2) Is NOS expression in fat body/oenocytes JAK/STAT-dependent? Block the pathway in hypoxia specifically in this cells and check.

      To address this, we blocked JAK/STAT signaling specifically in fat body/oenocytes under hypoxia and examined the expression of Nos, as well as bnl and Hipk - two additional genes we find are regulated by gut-derived Upd3 and required for hypoxia tolerance.

      Interestingly, fat body/oenocyte-specific knockdown of STAT92E suppressed hypoxia-induced Hipk expression but did not affect Nos or bnl expression in these tissues. These results suggest that gut-derived Upd3 can control fat body/oenocyte expression of hypoxia regulators through both direct and indirect (relay) mechanism There is precedent for indirect, relay in the context of other Upd3/Upd2-mediated inter-organ responses. For example, in response to CO2, neuronal Upd3 controls blood cell differentiation in the lymph gland; however, this effect is not direct - Upd3 first signals to the fat body to induce Dilp6 expression, and Dilp6 then signals to the lymph gland to regulate hematopoiesis. A second example involves gut-derived Upd2: upon infection, Upd2 controls olfactory behavior, but does so via a relay in which Upd2 signals to glial cells, which in turn alter apolipoproteins expression, and these then modify olfactory neuron function.

      We have incorporated the new tissue-specific data into the manuscript and expanded the Discussion to address both direct and indirect modes of Upd3 action. (Fig 5 and lines 427-441)

      3) The authors relate the HIF-dependent limitation of Upd3 induction in hypoxia to regulation of cytokine-dependent immune responses in mammals; specifically they propose a parallel with a cytokine storm. This relationship is unclear to this reviewer, as in the Drosophila response Upd3 fulfils a signalling function (rather than immunological). I suggest they consider modifying this assumption.

      We appreciate this comment. Our intent in drawing a comparison to mammalian cytokine storm response was to illustrate the concept of fine-tuning cytokine responses, where too little or too much signaling can be deleterious, as we observe when comparing upd3 mutants to upd3-overexpressing animals. We have revised the Discussion to retain this concept while tempering the suggestion that our findings directly mirror cytokine storm pathologies in human (lines 511-536).

      4) Mitigation of lethality of HIF KD flies in Upd3 heterozygotes is very modest. Thus, the conclusion that one of the mechanisms by which HIF mediates adaptation to hypoxia is through inhibition of Upd3 expression is not sufficiently supported by the data. It seems like an over-interpretation of the results.

      We agree that the rescue is modest, and we would argue this may be expected given HIF-1's role as a master regulator that coordinates many gene expression changes required for hypoxia tolerance. Loss of HIF-1 therefore likely disrupts multiple essential processes simultaneously - including metabolic reprogramming and tracheal remodeling - that may not be restored by reducing upd3 dosage. We take the reviewer's point that this should not be framed as a primary mechanism. The partial reversal of lethality in upd3 heterozygotes nonetheless implicates excessive Upd3 signaling as one small component of what HIF-1 does to promote hypoxia adaptation, and we have revised the manuscript language to reflect this more measured interpretation (lines 529-536).

      5) HIF expression is well-known to reduce ROS levels in hypoxia by controlling mitochondrial activity through a wide array of mechanisms. Thus, this reviewer feels that the experiments utilized to rule out a role of ROS in fat body-to-gut communication are insufficient. Catalase reduces hydrogen peroxide levels, but not necessarily other reactive oxygen species. The authors might try to express other ROS scavengers such as superoxide dismutase. In addition, expression of scavengers should be carried out both at the fat body and gut.

      We thank the reviewer for this important point. We have now addressed it by overexpressing CatA, SOD1, or SOD2 individually in either fat body or enterocytes and measuring hypoxia-induced upd3 expression in each case. In all six conditions, hypoxia-induced upd3 expression was unaffected (Figs. S6B–G). Together, these experiments scavenge both hydrogen peroxide and superoxide in both tissues and collectively argue against a role for ROS in mediating upd3 induction

      __MINOR CONCERNS __ 6) The authors state that hypoxic upregulation of Upd3 in the gut occurs mostly in "large epithelial enterocytes". In Figure 3B, it is evident that GFP does not express in all cells; please utilize cell-type specific markers to identify which cells do express the cytokine.

      We appreciate this suggestion. Despite multiple requests to different laboratories, we were unable to obtain antibodies suitable for marking enterocyte subtypes in this context. To address the question of cell identity genetically, we used drivers specific for enterocytes (mex-GAL4) or progenitor cells (stem cells and enteroblasts; esg-GAL4) to drive RNAi-mediated knockdown of upd3 and then measured the effect on hypoxia-induced upd3 expression in whole guts. These experiments indicate that hypoxia-induced upd3 expression occurs mostly in enterocytes, with a smaller contribution from progenitor cells. This mirrors previous findings showing that infection-induced upd3 induction occurs in both enterocytes and enteroblasts, and supports our conclusion that enterocytes are the predominant source of hypoxia-induced Upd3. We have incorporated these results into the revised manuscript (Fig 3C and Fig S2C).

      7) The title of Fig 4 caption reads "Gut-derived upd3 controls adipose expression of hypoxia regulators." Only one hypoxia regulator has been analysed: Nitric Oxide Synthase. Please change the title to "Gut-derived upd3 controls adipose expression of Nitric Oxide Synthase."

      In the revised manuscript we now show that gut-derived Upd3 controls the expression of Nos, bnl, and Hipk in fat body and oenocytes, and that all three genes are required for hypoxia tolerance. We have therefore revised the figure title, to better reflect the findings presented in this version.

      8) Supplementary Figures 1 A and B lack statistical analysis.

      We have now included the statistical analyses in the revised manuscript figures.

      Reviewer 2


      __(Evidence, reproducibility and clarity (Required)): __This study by Ding and colleagues identifies a novel role for the cytokine Unpaired-3 (upd3) and the JAK/STAT signaling pathway coordinate a whole-body response to systemic hypoxia in Drosophila. The authors describe how low-oxygen conditions rapidly induce upd3 expression in both larvae and adults. Interestingly, this pathway's importance is sex-specific, as female flies require upd3 for survival in hypoxia, while males do not.

      Intriguingly, the authors identify the intestine as a crucial source of the hypoxia-induced upd3. This gut-derived upd3 then signals to the fat body and oenocytes, promoting the expression of nitric oxide synthase, which is essential for hypoxia tolerance. Furthermore, the study reveals an unexpected role for the transcription factor HIF-1α/sima as a molecular brake. Instead of simply promoting the hypoxia response, sima prevents the overproduction of upd3, demonstrating that a precise dosage of this cytokine is necessary for survival. The findings define a novel gut-to-fat/oenocyte signaling axis that coordinates systemic hypoxia adaptation and highlights the fly as an ideal system for studying interorgan communication during bouts of hypoxia. Overall, I find this manuscript an important step forward in understanding the link between hypoxia signaling and inflammation.

      __ (Significance (Required)): __This study is of high significance, as it not only demonstrates that a clear role for cytokine signaling in the Drosophila hypoxia response, but also demonstrates this response requires interorgan communication between adipose tissue and the intestine. Moreover, the study reveals a clear role for Hif1alpha in modulating upd3 expression, suggesting that this highly conserved transcription factor play a key role in fine tuning the inflammatory response.

      I think these findings are of broad interest and are potentially relevant to two aspects of public health. First, I believe the findings should be of particular interest to anyone studying hypoxic injuries, such as stroke and ischemia-reperfusion. Secondly, the observations could be relevant to a previous study that revealed an important role for hypoxia signaling in the mosquito larval intestine. Thus, this study could be important for revealing new mechanisms for inhibiting mosquito development, which would be of broad public health interest.

      Finally, I would highlight how this study raises a number of important question. Why are there sex-specific differences for upd3 in the hypoxia response? What is the signal from the fat body to the intestine? How does sima modulate upd3 signaling. Thus, I think this manuscript represents a foundation study that will be the motivation for numerous high-impact papers in the future.__ ____ __ We thank Reviewer 1 for their careful reading and comments on the manuscript. We are pleased that they found this to be "a study of high significance” that will be importance for our understanding of hypoxia and health. We have addressed each of their concerns in the revised manuscript as outlined below.

      __Major Concerns and Suggestions: __ I have no real for the manuscript as written - the experiments are well designed and control, the results, as presented, support the major conclusions. While there are clearly open questions, including what it the basis of the sex-specific effects, how does sima modulate upd3 expression, and what is the signal communicating fat body sima activity with intestinal upd3 expression, these open questions do NOT diminish the importance of the study.

      My only major concern is that the current draft lacks a discussion of previous studies in the mosquito Aedes aegypti, where hypoxia signaling plays a key role in larval development (https://doi.org/10.1073/pnas.1719063115). This body of literature should be incorporated into the discussion, as it hints at a conserved molecular mechanism.

      We thank the reviewer for pointing us to this important study. Valzania et al. demonstrate that gut hypoxia acts as a systemic signal in Aedes aegypti larvae, activating HIF to coordinate fat body metabolism and whole-body growth. We agree this is relevant context for our findings, as both studies support the idea that the gut can function as a hypoxia sensor that controls whole-body physiology through effects on the fat body. We have incorporated this into our Discussion (lines 488-492).

      Minor comments:

      Please include a list of fly stocks used in the methods with complete genotypes. Whenever possible, include the RRID number for the stock - these can be found on the BDSC page for the stock.

      We have now added the list of fly stocks as well as a supplemental table with full genotypes.

      Line 477-479 - provide citations that sima regulates glycolysis in the fly.

      We have now added these citations

      Lines 501-505 - please state if gasses were premixed or mixed in lab. Also, were flies contained in standard food vials during the exposure?

      We have now provided more detail on these points – the gases were premixed and flies were on standard food vials during the exposure.

      Lines 507-513 - how long after the hypoxia exposure were the flies assayed?

      We have now provided more detail on this point in the methods (lines 592-596) – the flies were assessed 24hrs after hypoxia exposure.

      In figures that display qRT-PCR data, please note that data were normalized to reference genes listed in Table S2.

      We have now added this methodological point.

      Please reference Flybase in either the acknowledgements or methods and include citations to the latest Flybase papers published in Genetics.

      We have now acknowledged Flybase and referenced the relevant papers

      Genetics nomenclature is inconsistent throughout the study, a few examples included: Figure legend 1 - italicize gene names Figure 2 legend - italicize upd3-null Line 259 - Capitalize gal4 Figure 4 legend - NOS is written in all capital, but in line 270, written as Nos. Please be consistent. Line 297 - gal4 is lower case, in contrast with elsewhere.

      We have now made these corrections

      Additional suggestions:

      While not required for publication, it would be interesting to examine intestinal upd3 expression when sima is inappropriately stabilized in the fat body of animals under normoxic conditions. This could be achieved by driving a fatiga-RNAi construct within the fat body.

      We did carry out this experiment but didn’t see any effect of fat body fatiga RNAi on gut upd3 levels.

      Reviewer 3


      Evidence, reproducibility and clarity (Required)): __Summary: While local cellular and organ adaptations to hypoxia are well-documented, organism-wide responses to systemic hypoxia are still not well understood. In this paper, the writers were interested in investigating how organisms adapt to systemic hypoxia. From their investigations, they were able to show that gut-derived upd3 is crucial to animals' tolerance to hypoxia. They also show that the master hypoxia regulator Sima is required to keep the upd3 level in check to avoid the deleterious effect of excess upd3. They also showed that the fatbody Sima is important in the regulation of gut-upd3 level, showing an inter-organ communication network in the adaptation to systemic hypoxia. One of their findings shows sex dimorphism in hypoxia tolerance; however, they did not show the mechanism behind this. I think the major weakness is not knowing how the animal actually fail to survive. What causes reduced survival should be explored. Generally, the studies show how animals adapt to systemic hypoxia, this knowledge is important in systemic hypoxia pathology.

      __

      __Significance (Required)): __This paper explores how the organism copes with hypoxia, and explored how Upd from the gut plays a role in mediating this response in the fat body and the oenocytes

      We thank Reviewer 1 for their careful reading and comments on the manuscript. We have addressed each of their concerns in the revised manuscript as outlined below.

      __Major comment: __

      Figure 1: The authors clearly showed that Upd3 level was up in the hypoxia condition and is important for animal tolerance to hypoxia. Apart from Upd3, are there other members of the unpaired family increasing and involved in hypoxia tolerance?

      We thank the reviewer for this question. We examined expression of all three unpaired family members and found that both upd2 and upd3 are induced by hypoxia, while upd1 is not. We also have preliminary evidence that upd2 mutants show reduced hypoxia survival, and that this effect is not additive with loss of upd3. While these early results are intriguing, this paper is focused on defining the role of upd3 in hypoxia tolerance, and exploring upd2, both alone and in combination with upd3, across different aspects of hypoxia biology we see as the basis of future investigations.

      Notably, co-induction of upd2 and upd3 by the same stress is a recurring theme in Drosophila biology, yet their respective contributions to organismal physiology are complex - sometimes overlapping, sometimes distinct - and in many studies only one family member has been characterized in detail. Indeed, our current understanding of how upd2 and upd3 each contribute to responses to infection, high-fat diet, and other stresses has emerged from the collective findings of multiple independent studies rather than from any single paper addressing both cytokines simultaneously. For example, during infection both Upd2 and Upd3 are induced in the gut to promote stem cell-mediated repair, yet only Upd2 has been shown to additionally signal to the brain to control olfactory behavior. Similarly, on a high-fat diet both cytokines are upregulated, but with distinct effects on different aspects of organismal biology: enterocyte-derived Upd3 promotes intestinal stem cell divisions, hemocyte-derived Upd3 controls fat body lipid levels, and fat body-derived Upd2 alters nephrocyte function. We see the current study as a foundation for broader investigations into unpaired cytokine biology in hypoxia. Indeed, Reviewer 2 noted that this manuscript "represents a foundation study that will be the motivation for numerous high-impact papers in the future", and we anticipate that the effects of Upd2 and Upd3 in hypoxia will prove similarly pleiotropic and resolving their respective contributions to different aspects of organismal biology in low oxygen will require dedicated future investigation.

      Figure 2: From the method, female and male flies were subjected to different durations of hypoxia, 24-28 hours for females and 16-18 hours for males. What happens when subjecting different sexes to similar periods of hypoxia?

      We thank the reviewer for this question. Males and females show inherently different sensitivities to hypoxia, as they do for other environmental stresses such as starvation. To reliably detect genetic effects on hypoxia tolerance, it is important to use exposure conditions that produce partial lethality in controls (50-80% survival), ensuring experiments are conducted within the appropriate range of hypoxic sensitivity for each sex. Because males and females differ in their sensitivity, no single timepoint satisfies this criterion for both sexes. When males are exposed for the same duration used in female experiments (24-28h), all animals - controls and experimental genotypes alike - die, precluding any meaningful comparison. Conversely, exposing females to the shorter timepoint used for males (16-18h) produces no detectable lethality, making it equally uninformative. The sex-specific exposure durations we use are therefore an experimental design choice that allows us to assess hypoxia tolerance appropriately in each sex.

      Upon concluding that gut derived upd affects fat and oenocytes, it is a bit strange that the qPCR is done in the abdomen, which is presumably where the gut is. Should the gut be excluded in these assays?

      We thank the reviewer for raising this point. For abdominal qRT-PCR experiments examining fat body and oenocyte gene expression, we dissected and removed the gut and ovaries prior to RNA extraction, leaving an abdominal sample enriched in fat body and oenocytes. We have clarified this in the Methods and Results section of the revised manuscript (Lines 245-246 and 626-627).

      It is important to establish how the animals die under hypoxia.

      We thank the reviewer for raising this important question. Our results show that gut-derived Upd3 is required for hypoxia tolerance in part through its control of Nos, bnl, and Hipk expression in fat body and oenocytes, and that knockdown of each of these genes individually reduces hypoxia survival. However, precisely why animals die when upd3 or these downstream effectors are lost remains an open question, and we discuss much of what we outline below in the revised manuscript Discussion (lines 443-466).

      All three effectors are signaling molecules, and we speculate that they likely coordinate further downstream processes required for hypoxia tolerance, either within fat body and oenocytes or by acting on other tissues. In particular, both bnl, an FGF ligand, and nitric oxide, produced downstream of Nos, have established roles in tracheal development and remodeling, raising the possibility that Upd3-dependent regulation of tracheal responses to hypoxia contributes to survival. Nitric oxide can also regulate nitrosylation and has been shown to affect the unfolded protein response, a conserved pathway induced by hypoxia. bnl, in addition to its role in tracheal remodeling, has been shown to regulate metabolic changes in target tissues. Hipk is a kinase with likely many downstream targets and has been shown in flies to control metabolism and mitochondrial function. Together, these observations suggest that Upd3 engages a broad downstream signaling network, the full scope of which remains to be defined.

      We think this situation is analogous to other environmental stresses such as starvation, where survival requires the coordinated regulation of a spectrum of physiological processes across multiple tissues, and where even well-characterized regulators are known to engage many downstream targets and pathways. We see the current paper as establishing the gut-to-fat body Upd3 requirement for hypoxia tolerance, and we suggest this lays a foundation for future exploration of the full spectrum of Upd3 targets and investigation of how they coordinate adaptive responses to low oxygen.

      Figure 3-6: Controls for RNAi experiments - is there any reason for not using RNAi-specific control, such as mcherry-RNAi, lacZ-RNAi, etc, rather than a wildtype control in all the RNAi-mediated knockdowns? Please address this. Don't necessarily have to repeat all the experiments using RNAi-specific control, but repeating just a few to show that both wild-type and UAS-RNAi-specific controls show similar results would be important.

      We thank the reviewer for raising this point. To address potential non-specific effects of RNAi expression on hypoxia tolerance, we expressed control GFP RNAi or mCherry RNAi transgenes using the main Gal4 drivers employed in this study: mex-Gal4 (gut) and desat;r4-Gal4 (fat body and oenocytes), and found no effect on hypoxia survival compared to wild-type controls (Fig S2E and S4B). These results indicate that RNAi expression per se does not adversely affect hypoxia tolerance, and that the survival effects we observe reflect specific knockdown of the genes of interest.

      Although gut-derived upd3 contributes largely (40%) to hypoxia tolerance, what other tissues' upd3 is important for hypoxia tolerance?

      We thank the reviewer for this important question. We find that upd3 is induced in multiple tissues during hypoxia, including the head, thorax, and abdomen. However, when we knocked down upd3 using drivers targeting the major cell types in these tissues, including muscle, neurons, and fat body/oenocytes, we observed no significant effect on hypoxia survival, in contrast to the robust effect seen with gut-specific knockdown. These new data, included in the revised manuscript, suggest that gut-derived Upd3 is a primary contributor to hypoxia tolerance (Fig S3).

      That said, we do not conclude that the gut is the only relevant source. Other tissues we have not yet examined, including hemocytes, glia, and tracheal cells, may also contribute, and it is possible that Upd3 produced from multiple tissues acts redundantly, such that knockdown in any single tissue other than the gut is insufficient to cause a survival defect. By analogy with other stress contexts such as nutrient deprivation and infection, where upd cytokines are produced from multiple tissues and exert distinct effects on different aspects of physiology, we anticipate that Upd3 from tissues other than the gut may well contribute to hypoxia tolerance. However, fully defining these contributions will require detailed tissue-specific experiments that are beyond the scope of the current paper and will be the focus of future investigations. We have expanded on this point in the Discussion of the revised manuscript (lines 420-425).

      Can you use a hypoxia readout to experimentally show that the gut is the main sensor of hypoxia compared to other tissues? Looking at the data, the fatbody could also be major sensors of hypoxia. Therefore, investigating hypoxia readout in these and other tissues would further strengthen the direction of communication.

      We thank the reviewer for this suggestion, however, we wish to clarify that we are not claiming the gut is the main or primary sensor of hypoxia. All tissues are likely capable of sensing low oxygen and mounting cell-autonomous responses, and in some cases perhaps also non-autonomous signals to other tissues. Our findings specifically show that one consequence of gut hypoxia sensing is upregulation of Upd3, which then acts as an inter-organ signal to coordinate responses in target tissues such as the fat body and oenocytes. The fat body itself also senses hypoxia and mounts its own responses, as we and others have shown, including HIF-dependent regulation of gut Upd3 expression described in this paper. An analogous situation exists during nutrient starvation, where all cells autonomously sense and respond to nutrient deprivation, but on top of these cell-autonomous responses, specific tissues also mediate inter-organ signaling to coordinate whole-body physiological adaptations. We propose that hypoxia responses are organized similarly, and that the gut-to-fat body Upd3 signaling axis we describe here represents one such inter-organ communication pathway. We have clarified this point in the revised manuscript (lines 468-492).

      __Minor comment:

      __

      Should check the alignment of the confocal image in Figure 3b, especially the top panel.

      We have now fixed the images to better align them

      Figure 6: "gut-specific sima knockdown (mex>sima-RNAi) did not significantly alter intestinal upd3 mRNA levels compared to controls (mex>+) under hypoxic conditions (Figure 6C)." This statement refers to Figure 6B, not Figure 6C

      We have now corrected this

      Since the fat body Sima non-autonomously control the gut upd3 level, can you also show this functionally important by investigating the animal's survival or other functional studies?

      We thank the reviewer for this suggestion. Ideally, we would manipulate sima and upd3 independently and in parallel, knocking down sima specifically in the fat body while simultaneously reducing upd3 in the gut, to directly test the functional importance of this inter-organ axis for survival. In principle this could be achieved using orthogonal binary expression systems such as the GAL4/UAS and QF/QUAS systems in combination, but this would require the development of new genetic tools. An additional challenge is that based on our results, such experiments would require fine-tuned reduction of gut upd3, sufficient to suppress the elevated levels caused by fat body sima knockdown, but not so low as to itself compromise survival, as we have shown that loss of upd3 is detrimental. For these reasons, while we agree these would be, in principle, interesting experiments, they would technically be challenging to carry out.

      Strangely, all the statistically significant data/results from both supplementary and main figures had a one-star significance even in graphs with very obvious differences and less sample variation.

      We thank the reviewer for this observation. In all figures, a single asterisk is used to denote statistical significance at p < 0.05, regardless of whether the actual p value is substantially lower. This is a presentation convention we adopted consistently across all figures rather than a reflection of the strength of the underlying differences.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: While local cellular and organ adaptations to hypoxia are well-documented, organism-wide responses to systemic hypoxia are still not well understood. In this paper, the writers were interested in investigating how organisms adapt to systemic hypoxia. From their investigations, they were able to show that gut-derived upd3 is crucial to animals' tolerance to hypoxia. They also show that the master hypoxia regulator Sima is required to keep the upd3 level in check to avoid the deleterious effect of excess upd3. They also showed that the fatbody Sima is important in the regulation of gut-upd3 level, showing an inter-organ communication network in the adaptation to systemic hypoxia. One of their findings shows sex dimorphism in hypoxia tolerance; however, they did not show the mechanism behind this. I think the major weakness is not knowing how the animal actually fail to survive. What causes reduced survival should be explored. Generally, the studies show how animals adapt to systemic hypoxia, this knowledge is important in systemic hypoxia pathology.

      Major comment:

      • Figure 1: The authors clearly showed that Upd3 level was up in the hypoxia condition and is important for animal tolerance to hypoxia. Apart from Upd3, are there other members of the unpaired family increasing and involved in hypoxia tolerance?
      • Figure 2: From the method, female and male flies were subjected to different durations of hypoxia, 24-28 hours for females and 16-18 hours for males. What happens when subjecting different sexes to similar periods of hypoxia?
      • Upon concluding that gut derived upd affects fat and oenocytes, it is a bit strange that the qPCR is done in the abdomen, which is presumably where the gut is. Should the gut be excluded in these assays?
      • It is important to establish how the animals die under hypoxia.
      • Figure 3-6: Controls for RNAi experiments - is there any reason for not using RNAi-specific control, such as mcherry-RNAi, lacZ-RNAi, etc, rather than a wildtype control in all the RNAi-mediated knockdowns? Please address this. Don't necessarily have to repeat all the experiments using RNAi-specific control, but repeating just a few to show that both wild-type and UAS-RNAi-specific controls show similar results would be important.
      • Although gut-derived upd3 contributes largely (40%) to hypoxia tolerance, what other tissues' upd3 is important for hypoxia tolerance?
      • Can you use a hypoxia readout to experimentally show that the gut is the main sensor of hypoxia compared to other tissues? Looking at the data, the fatbody could also be major sensors of hypoxia. Therefore, investigating hypoxia readout in these and other tissues would further strengthen the direction of communication.

      Minor comment:

      • Should check the alignment of the confocal image in Figure 3b, especially the top panel.
      • Figure 6: "gut-specific sima knockdown (mex>sima-RNAi) did not significantly alter intestinal upd3 mRNA levels compared to controls (mex>+) under hypoxic conditions (Figure 6C)." This statement refers to Figure 6B, not Figure 6C
      • Since the fat body Sima non-autonomously control the gut upd3 level, can you also show this functionally important by investigating the animal's survival or other functional studies?
      • Strangely, all the statistically significant data/results from both supplementary and main figures had a one-star significance even in graphs with very obvious differences and less sample variation.

      Significance

      This paper explores how the organism copes with hypoxia, and explored how Upd from the gut plays a role in mediating this response in the fat body and the oenocytes

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study by Ding and colleagues identifies a novel role for the cytokine Unpaired-3 (upd3) and the JAK/STAT signaling pathway coordinate a whole-body response to systemic hypoxia in Drosophila. The authors describe how low-oxygen conditions rapidly induce upd3 expression in both larvae and adults. Interestingly, this pathway's importance is sex-specific, as female flies require upd3 for survival in hypoxia, while males do not.

      Intriguingly, the authors identify the intestine as a crucial source of the hypoxia-induced upd3. This gut-derived upd3 then signals to the fat body and oenocytes, promoting the expression of nitric oxide synthase, which is essential for hypoxia tolerance. Furthermore, the study reveals an unexpected role for the transcription factor HIF-1α/sima as a molecular brake. Instead of simply promoting the hypoxia response, sima prevents the overproduction of upd3, demonstrating that a precise dosage of this cytokine is necessary for survival. The findings define a novel gut-to-fat/oenocyte signaling axis that coordinates systemic hypoxia adaptation and highlights the fly as an ideal system for studying interorgan communication during bouts of hypoxia. Overall, I find this manuscript an important step forward in understanding the link between hypoxia signaling and inflammation.

      Major Concerns and Suggestions:

      I have no real for the manuscript as written - the experiments are well designed and control, the results, as presented, support the major conclusions. While there are clearly open questions, including what it the basis of the sex-specific effects, how does sima modulate upd3 expression, and what is the signal communicating fat body sima activity with intestinal upd3 expression, these open questions do NOT diminish the importance of the study.

      My only major concern is that the current draft lacks a discussion of previous studies in the mosquito Aedes aegypti, where hypoxia signaling plays a key role in larval development (https://doi.org/10.1073/pnas.1719063115). This body of literature should be incorporated into the discussion, as it hints at a conserved molecular mechanism.

      Minor comments:

      Please include a list of fly stocks used in the methods with complete genotypes. Whenever possible, include the RRID number for the stock - these can be found on the BDSC page for the stock.

      Line 477-479 - provide citations that sima regulates glycolysis in the fly.

      Lines 501-505 - please state if gasses were premixed or mixed in lab. Also, were flies contained in standard food vials during the exposure?

      Lines 507-513 - how long after the hypoxia exposure were the flies assayed?

      In figures that display qRT-PCR data, please note that data were normalized to reference genes listed in Table S2.

      Please reference Flybase in either the acknowledgements or methods and include citations to the latest Flybase papers published in Genetics.

      Genetics nomenclature is inconsistent throughout the study, a few examples included:

      Figure legend 1 - italicize gene names

      Figure 2 legend - italicize upd3-null

      Line 259 - Capitalize gal4

      Figure 4 legend - NOS is written in all capital, but in line 270, written as Nos. Please be consistent.

      Line 297 - gal4 is lower case, in contrast with elsewhere.

      Additional suggestions:

      While not required for publication, it would be interesting to examine intestinal upd3 expression when sima is inappropriately stabilized in the fat body of animals under normoxic conditions. This could be achieved by driving a fatiga-RNAi construct within the fat body.

      Significance

      This study is of high significance, as it not only demonstrates that a clear role for cytokine signaling in the Drosophila hypoxia response, but also demonstrates this response requires interorgan communication between adipose tissue and the intestine. Moreover, the study reveals a clear role for Hif1alpha in modulating upd3 expression, suggesting that this highly conserved transcription factor play a key role in fine tuning the inflammatory response.

      I think these findings are of broad interest and are potentially relevant to two aspects of public health. First, I believe the findings should be of particular interest to anyone studying hypoxic injuries, such as stroke and ischemia-reperfusion. Secondly, the observations could be relevant to a previous study that revealed an important role for hypoxia signaling in the mosquito larval intestine. Thus, this study could be important for revealing new mechanisms for inhibiting mosquito development, which would be of broad public health interest.

      Finally, I would highlight how this study raises a number of important question. Why are there sex-specific differences for upd3 in the hypoxia response? What is the signal from the fat body to the intestine? How does sima modulate upd3 signaling. Thus, I think this manuscript represents a foundation study that will be the motivation for numerous high-impact papers in the future.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors describe a role of Unpaired 3 (Upd3) in tissue communication in responses to hypoxia in Drosophila adult flies. Upd3 mRNA is strongly upregulated in hypoxia, along with well-characterized JAK/STAT downstream target genes, in both adult fly males and females, as well as in larvae. Interestingly, adult females but not males require Upd3 for 15 to 24 h survival in hypoxia, as Upd3 mutant females but not males die to a much larger proportion in these conditions. Adult females they display strong hypoxic upregulation of Upd3 in the gut, assessed by RT-PCR or through a Gal4 transcriptional reporter, mainly in epithelial enterocytes. Enterocyte-specific RNAi-mediated KD indicated that this enterocyte expression of Upd3 represents about 40% of Upd3 expression in the whole body. Enterocyte-specific KD of Upd3 in adult females significantly reduced survival in hypoxia, suggesting that this expression is critical for hypoxic adaptation. Tissue-specific analysis of the expression of the STAT target genes, SOCS36E, TotA and TotM revealed that stimulation of the JAK/STAT pathway in hypoxia is widespread, although more pronounced in abdominal tissues. Indeed, overexpression of Upd3 in enterocytes provokes upregulation o both target genes TotA and TotM. Consistent with this RNAi-dependent inhibition of the JAK/STAT pathway in the fat body and oenocytes significantly reduced survival of female flies in hypoxia. Nitric oxide synthase (NOS) is strongly upregulated in adult female abdomens upon hypoxic exposure, and KD of NOS in fat body and oenocytes reduced hypoxic survival. Surprisingly, the found that ubiquitous KD of HIFa/Sima led to mitigation of Upd3 hypoxic induction and, more clearly, to JAK/STAT target gene induction. HIF KD flies displayed increased lethality in hypoxia, and this lethality was slightly mitigated in Upd3 heterozygous flies. The authors conclude that increased lethality of HIF-minus flies in hypoxia stems at least in part from excessive levels of Upd3. The authors then find that HIF/Sima-dependent inhibition of Upd3 expression is non-cell autonomous, since KD of Sima specifically in the gut does not affect expression of Upd3 in this organ. Instead, Sima KD at the fat body led to significant increase of Upd expression in the gut, suggesting that a Sima-born signal communicates these two organs, leading to restriction of Upd3 intestinal expression. ROS does not seem to be the signal that communicates the fat body with the gut, as expression of catalase in the fat body did not affect expression of Upd3 in the gut.

      Major concerns

      1) Figure 1 lacks statistical analysis. It is important to determine if the apparent differences in gene expression are statistically significant.

      2) Is NOS expression in fat body/oenocytes JAK/STAT-dependent? Block the pathway in hypoxia specifically in this cells and check.

      3) The authors relate the HIF-dependent limitation of Upd3 induction in hypoxia to regulation of cytokine-dependent immune responses in mammals; specifically they propose a parallel with a cytokine storm. This relationship is unclear to this reviewer, as in the Drosophila response Upd3 fulfils a signalling function (rather than immunological). I suggest they consider modifying this assumption.

      4) Mitigation of lethality of HIF KD flies in Upd3 heterozygotes is very modest. Thus, the conclusion that one of the mechanisms by which HIF mediates adaptation to hypoxia is through inhibition of Upd3 expression is not sufficiently supported by the data. It seems like an over-interpretation of the results.

      5) HIF expression is well-known to reduce ROS levels in hypoxia by controlling mitochondrial activity through a wide array of mechanisms. Thus, this reviewer feels that the experiments utilized to rule out a role of ROS in fat body-to-gut communication are insufficient. Catalase reduces hydrogen peroxide levels, but not necessarily other reactive oxygen species. The authors might try to express other ROS scavengers such as superoxide dismutase. In addition, expression of scavengers should be carried out both at the fat body and gut.

      Minor concerns

      6) The authors state that hypoxic upregulation of Upd3 in the gut occurs mostly in "large epithelial enterocytes". In Figure 3B, it is evident that GFP does not express in all cells; please utilize cell-type specific markers to identify which cells do express the cytokine.

      7) The title of Fig 4 caption reads "Gut-derived upd3 controls adipose expression of hypoxia regulators." Only one hypoxia regulator has been analysed: Nitric Oxide Synthase. Please change the title to "Gut-derived upd3 controls adipose expression of Nitric Oxide Synthase."

      8) Supplementary Figures 1 A and B lack statistical analysis.

      Significance

      This is an important study, because most previous studies have focused on cell-autonomous responses to hypoxia, but much less is known about systemic responses to low oxygen conditions, particularly in relation to inter-organ communication during this responses. This work defines the cytokine unpaired 3, homolog of human interleukin 6, as a major regulator of systemic responses to hypoxia. Future studies will determine if interleukin 6 plays similar roles in mammals. This work might be of interest for a broad audience interested in responses to hypoxia, as well as general physiology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Goldman et al., Response to Review April 16, 2026

      Referee 1 (R1): The revised manuscript by Goldman et al has changed significantly since its first submission. Upon further analysis of a putatively truncated mutant, the authors have removed the interpretation that G2019S LRRK2 alters iron handling through a kinase-independent mechanism. They should be commended for the transparency and rigor applied in this revision and the resulting data are conceptually easier to digest.

      Heidi McBride (HM): We thank the reviewer for their understanding. We would point out that our phenotypes remain resistant to the commonly used LRRK2 kinase inhibitor MLi2, yet are sensitive to the type II kinase inhibitors which we believe is novel.

      R1: However, now that the proposed pathway involves over-active LRRK2 kinase activity and a putative role for Rab8a phosphorylation, the manuscript is rather descriptive in nature. The mechanisms underlying dramatic changes in NCOA4, FTL, and FTH protein levels upon iron exposure are not explored. In addition, the reliance on these effects, or the precise involvement of LRRK2 substrates is not established.

      HM: We agree there remains much to be uncovered mechanistically. However, the fact that phosphorylated Rab8 is active at the plasma membrane upon iron overload is novel, suggesting more broad roles beyond the Golgi/lysosomal localization of LRRK2 substrates. We would also argue that the majority of studies on LRRK2 follow the phosphorylation of Rab substrates by western blot, but the functional roles of these Rabs, their GTPase cycles and effectors, in membrane traffic is not at all clear within the field. It is difficult for us to determine the mechanisms by which NCOA4, FTL and FTH are altered, but we did expand our work in the revision to include full proteomics analysis of control and iron treated cells with wild type or LRRK2G2019S to get a better picture of the changes that occur. This included major changes in lysosomal hydrolyases, induction of oxidative stress pathways linked to ferroptotic cell death, and cytoskeletal changes that may link to the plasma membrane blebbing we observe. Our data indicate that the hyperactivity of LRRK2 during iron overload is having significant impact on the capacity of the lysosome to handle the iron, which we observe also at the morphological level in the loss of microautophagy of NCOA4. Taken together, the impact of our work lies in setting a new framework by which to examine LRRK2 function in iron homeostasis.

      R1: The limitation of a single LRRK2 mutant (in the context of a more canonical increased kinase activity mechanism) is also more evident. G2019S is a relatively weak Rab kinase when compared to Y1699C or R1441C - is there a possibility for G2019S In addition, the impact of this iron defect is limited to oxidative cell injury and implications for ferroptosis in macrophages (not neurons, where these outcomes might be of greater relevance) whereas how these phenotypes affect the immune-like functions of the chosen cell type (e.g. cytokine release, chemotaxis, phagocytosis) is not addressed.

      HM: We agree that the examination of additional LRRK2 mutations in our model would be important, but the G2019S is the most common patient mutation and has a very strong impact on the cellular response to iron. As for the impact on cytokine release, chemotaxis and phagocytosis was not requested in the first round of review and we believe is beyond the scope of our current study.

      R1: While the proteomics and other analyses provide strong rigor for the data that are presented, without more mechanistic interrogation or consideration of cell type specific behaviors, the manuscript does not advance very far beyond confirming that iron related proteins altered in parallel with iron itself and highly dependent on iron loading to evoke phenotypes.

      HM: We thank the reviewer for recognizing the rigour of our revisions. We did interrogate the function of the oxidative stress signatures and performed experiments to demonstrate the oxidation of lipids in the lysosome along with assays for ferroptotic cell death. We highlighted key signatures that were changed, including the selective loss of lysosomal enzymes, and changes in cellular cytoskeletal elements. We agree that there is more to do, but our data has provided many important new observations on what exactly is changed in LRRK2G2019S cells in response to iron, from lysosomal morphology changes, to critical analysis of NCOA4 and ferritin dynamics, the novel observation of phosphorylated Rab8 at the cell surface, and a very unexpected insensitivity to MLi2 yet selective inhibition by new type II LRRK2 kinase inhibitors. With this we believe our work does provide a great deal of novelty and impact on LRRK2 functions in iron homeostasis.

      A final point on the concept that the phenotypes are only induced upon iron overload. We would point out that we observed changes in ferritin and NCOA4 levels in LRRK2G2019S cells even without iron overload, so there are indeed impact on lysosomal and iron homeostasis in steady state. It is also perhaps important to note that LRRK2 is established to be activated in response to cellular/lysosomal stressors, so it is not unusual to observe the impact of the mutations only in certain conditions. This would be consistent with the mutations being linked to an age-related disease with relatively low penetrance among heterozygous carriers. We would argue that it is important to understand exactly which types of cellular stressors may lead to pathology, and that iron dysregulation has been a long-standing candidate stressor in PD.

      **************************************************************************

      Referee 2 (R2): The authors have done all I have requested before.

      HM: We thank the reviewer for acknowledging our efforts in addressing each of the reviewers concerns.

      R2: I am concerned about the substantial changes to the original manuscript including the realisation that some clones were not the knock-outs expected. However, removing these, is the right thing.

      HM: We agree that the removal of the truncated clone, along with our identification of a class of kinase inhibitors that reversed the p-Rab8 upon iron overload led to changes in a key conclusion of the first submission; namely that the effects were kinase-independent. In our opinion, this does not change the fundamental observations we have made on the impact of LRRK2 on iron homeostasis, many of which are made here for the first time.

      R2: I still have major concerns that the phenotype could not be reproduced in BMDMs. The RAW cells are a good tool, but far from physiological. The authors' arguments that others have also seen a phenotype in iron homeostasis is however reassuring.

      HM: We tried to make it very clear throughout our manuscript that there is a cell specificity to the iron phenotypes observed in LRRK2G2019 backgrounds. As stated in our response to review, we did observe changes in NCOA4 within BMDM but not the levels of FTL and FTH. We will continue to explore the impact of LRRK2 mutations in primary lines in future work. However, this should not negate the phenotypes we observe in the RAW cells that are consistent with other observations from ssRNAseq, and the work of others looking at microglia upon LPS treatment in vivo, as an example.

      R2: I leave it to the editor if they feel that a somewhat artificial cell line is the right model.

      HM: The RAW macrophages are not “artificial”. Widely used in Parkinson’s research, cancer and immunity, the ATCC site describes “RAW 264.7 are an adherent cell line isolated from a mouse tumor that was induced by Abelson murine leukemia virus. This cell line, with macrophage differentiation, can be used in oxidative stress, inflammatory, and antibacterial activity studies.” We understand that the standard of cell culture is rapidly moving entirely to human iPSC derived lines and organoids. We argue that there is still a role for fundamental cell biological studies to be done in established cultured cell lines where we can more rapidly generate critical hypothesis to be tested in other model systems in future studies. There has been no published work linking LRRK2 to ferritinophagy and NCOA4 microautophagy before this, and we humbly submit that we have made some critical new observations in our study that will direct future work in a highly targeted manner.

      *********************************************************************

      Referee 3 (R3): We thank the authors for addressing our concerns. As a result, the work looks very different, and the main message has changed.

      HM: We thank the reviewer for acknowledging our efforts in the revision. We agree that one point of our study, that the effects of the LRRK2G2019S were kinase independent, has changed. Our initial observation that the effects were resistant to MLi2 remains, it was the use of a second class of kinase inhibitors that now confirmed a dependence on kinase activity. This is a very important point to be made about the different activities of these two classes of kinase inhibitors. In addition, the rest of our work stands and was extended in the revision with many new experiments highlighting the impact of LRRK2G2019S on iron homeostasis. Therefore the fact that LRRK2 linked iron phenotypes are not blocked by the common kinase inhibitor remains unchanged.

      R3: The main issue with this new manuscript relative to the previous one is that a key part of the novelty has been lost. In the original draft the authors claimed to have found a LRRK2 kinase-independent phosphorylation of Rab8. This was a major new finding potentially hinting that LRRK2 is not the only kinase for Rab8 which would have had major ramifications for the field. All data from this truncation mutant has been withdrawn from the manuscript and the message is now significantly different. It is unfortunate that the armadillo domain data from the original manuscript has been withdrawn due to the results of the proteomics experiment revealing 7 peptides C-terminal to their stop codon. I suggest that, independently of the fate of the current manuscript, the authors update the biorxiv preprint as in the current manuscript this is not addressed (as this part has been removed) and the community needs to be aware of the changes that are potentially misleading.

      HM: While the new data show that p-Rab8 can be inhibited by type II kinase inhibitors upon iron overload, we present a great deal of novelty showing that this activated Rab8 is, unexpectedly, at the plasma membrane upon iron loading in G2019S cells. We also performed experiments examining ferritinophagy and NCOA4 microautophagy in these conditions, which we argue are the first instance of these pathways being impacted by LRRK2. Our full proteomics datasets in Figure 4 provide an unbiased view of all changes driven by LRRK2G2019S that led us to additional functional analysis of lipid oxidation and ferroptosis pathways being impacted. Again, we agree that the concept that our phenotypes are no longer kinase independent, however this should not negate the many important observations we have made in our study.

      We apologize for not updating the manscript on Biorxiv. I had linked our response to reviews to the bioRxiv during submission to EMBO J and thought the manuscript would automatically be updated. So it was an oversight on my part, which has now been remedied, including all links to raw data files and protocols linked to Zenodo.

      R3: Despite the changes and modifications, the main concern remains. The authors are claiming in the title to have discovered that the LRRK2 G2019S mutation leads to more ferroptotic cell death. However, the only cell death data that is shown is a Sytox experiment and a C11-Bodipy stain. Although this is informative about whether cell death is occurring, the authors need more evidence that this is being caused by ferroptosis. For example, they could have probed for GPX4 levels in their lysates as the levels would decrease in ferroptosis. Including inducers and inhibitors of ferroptosis (erastin and ferrostatin-1) would confirm if the FAS-induced LRRK2 G2019S cell death phenotype is mediated by ferroptosis.

      HM: We appreciate that the impact of LRRK2G2019S on lipid oxidation and cell death were new to the revision and could have been explored further. However, as we were under a time constraint for the revisions we didn’t employ erastin and ferrostatin-1 in our study. The decision to look at ferroptosis came from the new proteomics analysis we did on wild type and G2019S cells in absence or presence of iron overload. This led us to uncover a major oxidation signature consistent with a sensitivity to ferroptosis. Therefore we used two common assays to test this, that was included in our new Figure 4. Ferroptosis is driven by oxidized lipids, which we observe in lysosomes using the C-11 Bodipy stains, and a rupture of the plasma membrane that can be monitored by the uptake of SYTOX green. SYTOX green uptake also occurs in pyroptotic and necrotic cell death, but the combination of oxidized lipids with SYTOX green is very commonly used to conclude the death is ferroptotic. In our case, we are also adding very high levels of iron to the cells (instead of using erastin), which would be further suggestive of ferroptosis.

      Our proteomics analysis does not show any changes in GPX4, but again, we are in conditions of high iron overload, which is very different from experiments with erastin that inhibit this channel to drive ferroptosis. The accumulation of iron in the late endosome, where we see the accumulation of oxidized C-11 Bodipy is consistent with several recent studies all demonstrating critical roles for lysosomal iron in the regulation of ferroptosis (see new review in Trends in Cell Biology, March 30 2026 DOI: 10.1016/j.tcb.2026.03.007). The bioRxiv paper looking at LRRK2G2019S and other mutants also showed increased oxidized lipids and suggested a susceptibility to ferroptosis (the paper R3 wanted us to cite during first review). Therefore, we agree that there will be much more to learn about how exactly LRRK2 may govern the lysosomal/cellular oxidation pathways, we consider our use of proteomics, lipid probes and cell death assays to be sufficient to conclude that LRRK2G2019 renders our macrophages sensitive to ferroptotic cell death upon iron overload.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this paper, the authors report an interesting phenotype of the LRRK2 G2019S mutation on iron homeostasis in RAW264.7 macrophages. The phenotype is well characterised through proteomic and western blot approaches investigating transferrin and ferritin trafficking. The study is well conducted and data of high quality. The authors also appear to have discovered a cellular context where Rab8 is phosphorylated independently of LRRK2. This is a major finding which can potentially have an important impact in the LRRK2 field. What is missing in the study is the physiological relevance of these findings, mainly whether this effect actually results in higher cell death during iron overload. Since iron overload is known to result in ferroptosis, it is surprising that the authors have not checked whether the LRRK2 G2019S and ARM cells undergo more ferroptosis relative to LRRK2 WT cells. Moreover, their conclusion of the findings as "resistant to LRRK2 kinase inhibitors" is not convincing, since in most of the studies, they have removed the kinase domain, and this description implies the use of pharmacological kinase inhibition which has not been done in this paper.

      Significance

      Major comments

      In Figure 1:

      • There is lower LRRK2 expression in LRRK2 G2019S cells, have the authors checked Rab phosphorylation to validate the mutation?
      • The authors should specify if their cells are heterozygous or homozygous since they are discussing a dominant interfering mutant.
      • The transferrin phenotype validated through proteomics and western blot is solid.
      • Quantification in figure 1F-G is problematic, not clear what they mean by "diffuse and lysosomal". Puncta is either colocalising with lysosomes or not colocalising. This needs to be clarified and re-analysed.
      • Text in the first results part called "LRRK2G2019S RAW macrophages have altered iron homeostasis" is very long. It could be divided into more sections to improve readability.

      In Figure 2:

      • If the effect is armadillo-dependent, where does LRRK2 G2019S is implicated since there is no kinase domain in these cells?
      • The authors do not show any controls (PCR, sequencing) confirming knockout or truncation.
      • The data is interesting and the image quality with the insets is very high.

      In Figure 3:

      • Mutant not clearly described in text, did the authors remove just the kinase and ROC-COR domains or all the domains downstream of the Armadillo domain? This is not clear.
      • The authors cannot conclude that their phenotype is due to the independence of the kinase domain specifically as they are also interfering with the GTPase activity by removing the ROC-COR domains.
      • In Figure 3E, is the difference between the "ARM CTRL" and the "ARM FAS" conditions significant? A trend appears to be there, but the p-value is not shown.

      In Figure 4:

      • In figure 4A, it would have been important to check if Rab8 phosphorylation is also observed in LRRK2 KO cells after administration of FAS to further evaluate the mechanism through which this Rab8 phosphorylation is occurring.
      • The vinculin bands in figure 4A are misaligned with the rest of the bands.
      • The authors do not have any controls to validate the pRab8 staining in IF. This is an important caveat and needs to be addressed.
      • The authors should have checked if FAS administration in the LRRK2 G2019S and the ARM cells is leading to ferroptotic cell death (or cell death in general). This is key to validate the link between the altered iron homeostasis in LRRK2 G2019S cells and increased cytotoxicity observed during neurodegeneration. Regarding the literature, the authors are missing some important papers that are preprinted and these studies need to be discussed. This includes a report with opposite findings https://www.biorxiv.org/content/10.1101/2025.09.26.678370v1.full and a report showing kinase independent cell death in macrophages https://www.biorxiv.org/content/10.1101/2023.09.27.559807v1.abstract
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript the authors describe an interesting connection between the Parkinson's kinase LRRK2 and iron trafficking in RAW macrophages. Expression of the LRRK2 G2029S mutation affects the abundance of ferritin heavy and light chains and therefore the uptake and storage of iron. Interestingly, the loss of the kinase domain still had a strong phenotype, suggesting that this is independent of the kinase function.

      The paper is well written and excellently cited. The data is convincing and of good quality.

      I have only one request and else very minor comments:

      Major: Please confirm that the observed phenotype is conserved within bone marrow-derived macrophages of LRRK2 G2019S mice. These mice are widely available within the community and frozen bone marrow could be sent to the labs.

      The main reason for this experiment is that CRISPR macrophage cell lines do sometimes acquire weird phenotypes (at least in our lab they sometimes do!) and it would strengthen the validity of the observations.

      Minor comments:

      Supplementary Fig 1: I don't think one should normalize all controls to 1 and then do a statistical test as obviously the standard deviation of control is 0. I would normalize to the average of the control, which will provide an error for the control.

      The raw data needs to be submitted to PRIDE or similar. This has not happened yet.

      Some of the western blots could be improved. If these are the best shown, I am a little concerned about the reproducibility. How often has they been done?

      Significance

      Considering the importance of LRRK2 biology in Parkinson's and the new biology shown, this paper will be of great interest to the community and wider research fields.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Goldman et al describe some novel findings with respect to LRRK and iron handling in a series of RAW macrophage cell lines. This cell background is chosen for its recognized high levels of endogenous LRRK2 protein expression, its somewhat broad use in the field, and the investigators add its relevance due to phagocytosis of red blood cells, thus requiring iron robust metabolic processes. Proteomic analyses of WT and G2019S RAW cells revealed multiple iron-related proteins affected by LRRK2 mutation. A deeper candidate-based analysis revealed complex changes in ferritin heavy and light chain and changes in ferric and ferrous iron. Notably, reliable changes in the levels and/or solubility of NCOA4 result from this pathogenic LRRK2 mutation. Unexpectedly, however, these changes were not sensitive to LRRK2 kinase inhibitor treatment. The investigators suggest a dominant effect rather than loss-of-function as subsequent experiments revealed that these effects could be replicated with a LRRK2 variant lacking the kinase domain (LRRK2-ARM) and were not replicated by LRRK2 KO. The data are internally consistent throughout and could certainly shed new important light onto unique and unexpected effects of this LRRK2 mutation.

      There are two major concerns with the data in their present form. In brief, first, the G2019S cells express much less LRRK2 and more Rab8 that the WT cells and this severely affects interpretability. Second, the investigators used CRISPR to truncate the endogenous LRRK2 locus to produce a hypothetical truncated LRRK2-ARM polypeptide. This appears to have robust effects on NCOA4, in particular, which drives the overall interpretation of the data. However, the expression of this novel LRRK2 specie is not confirmed nor compared to WT or G2019S in these cells (although admittedly the investigators did seek to address this with subsequent KO in the ARM cells). It would be premature to account for the changes reported without evidence of protein expression. This latter issue may be more easily addressed and could provide very strong support for a novel function/finding, see more detailed comments below, most seeking clarifications beyond the above.

      • Need to make clear in the results whether the G2019S CRISPR mutant is heterozygous or homozygous (presumably homozygous, same for ARM)
      • The text of the results implies that MLi2 was used in both WT and G2019S Raw cells, but it's only shown for G2019S. Given the premise for the use of RAW cells, it's important to show that there is basal LRRK2 kinase activity in WT cells to go along with its high protein expression. This is particularly important as the G2019S blot suggests minor LRRK2-independent phosphorylation of Rab8a (and other detected pRabs). One would imagine that pRab8 levels in both WT and G2019S would reduce to the same base line or ratio of total Rab in the presence of MLi2, but WT untreated is similar to G2019S with MLi2. This suggests no basal LRRK2 activity in the Raw cells, but I don't think that is the case.
      • Also, in terms of these cells, the levels of LRRK2 are surprisingly unmatched (Fig 1A, 1D, 1H, S1D, etc.) as are total levels of Rab8 (but in opposite directions) between the WT and G2019S. This is not mentioned in the Results text and is clearly reproducible and significant. Why do the investigators think this is? If Rab8 plays a role in iron, how do these differences affect the interpretation of the G2019S cells (especially given that MLi2 does not rescue)? Are other LRRK2-related Rabs affected at the protein (not phosphorylation level)? Could reduced levels of LRRK2 or increase Rab 8 alone or together account for some of these differences? Substantial further characterization is required as this seriously affects the interpretability of the data. Since pRab8 is not normalized to total Rab8, this G2019S model may not reflect a total increase in LRRK2 kinase activity, and could in fact have both less LRRK2 protein and less cellular kinase activity than WT (in this case).
      • Presumably, the blots in 1H are whole cell lysates and account for the pooled soluble and insoluble NCOA4 (increased in G2019S), as there is no difference in soluble NCOA4 (Fig 2H). I suspect the prior difference is nicely reflected in the insoluble fraction (Fig 2H). This should be better explained in the Results text. This is a very interesting finding and I wonder what the investigators believe is driving this phenotype? Is the NCOA4 partitioning into a detergent-inaccessible compartment? Does this replicate with other detergents, those perhaps better at solubilizing lipid rafts? Is this a phenotype reversible with MLi2? Very interesting data.
      • Figure 2 describes the increased NCOA4-positive iron structures after iron load, but does not emphasize that the G2019S cells begin preloaded with more NCOA4. How do the investigators account for differential NCOA4 in this interpretation? Is this simply a reflection of more NCOA4 available in G2019S cells? This seems reasonable.
      • These are very long exposures to iron, some as high as 48 hr which will then take into account novel transcriptomic and protein changes. Did the investigators evaluate cell death? Iron uptake would be trackable much quicker.
      • The legend for 2F is awkward (BSADQRED)
      • Why are WT cells not included in Fig 2G?
      • The biochemical characterization of NCOA4 in the LRRK2-arm cells is a great experiment and strength of the paper. The field would benefit by a bit further interrogation, other detergents, etc.
      • Have the investigators looked for aberrant Rab trafficking to lysosomes in the LRRK2-arm cells? Is pRab8 mislocalized compared to WT? Other pRabs?
      • The expression levels and therefore stability of the ARM fragment is not shown. This is necessary for interpretation. While very intriguing, the data in Aim 3 rely on the assumption that the ARM fragment is expressed, and at comparable levels to G2019S to account for phenotypes. The generation of second clone is admirable, but the expression of the protein must be characterized. This is especially true because of the different LRRK2 levels between WT and G2019S. One could easily conceive of exogenous expression of a tagged-ARM fragment into LRRK2 KO cells, for example, as another proof-of-concept experiment. If it is truly dominant, does this effect require or benefit from some FL LRRK2? It seems easy enough to express the LRRK2-ARM in at least WT and KO RAW cells.
      • Does iron overload induce Rab8a phosphorylation in a LRRK2 KO cell? This would be a solid extension on the ARM data and support the important finding that an additional kinase(s) can phosphorylate Rab8a under these conditions, and while not unexpected, this may not have been demonstrated by others as clearly. It also addresses whether the ARM domain is important to this other putative kinase(s), which may add value to the authors' model.

      Minor concern - the abstract but not the introduction emphasizes a hypothesis that loss of neuromelanin may promote cell loss in PD (through loss of iron chelation), while post mortem studies are by definition only correlative, early works suggested that the higher melanized DA neurons were preferentially lost when compared to poorly melanized neurons in PD. This speculation in the abstract is not necessary to the novel findings of the paper.

      Significance

      This study could shed light on a both novel and unexpected behavior of the LRRK2 protein, and open new insights into how pathogenic mutations may affect the cell. While studied in one cell line known for unusually high LRRK2 expression levels, data in this cell type have been broadly applicable elsewhere. Give the link to Parkinson's disease, Rab-dependent trafficking, and iron homeostasis, the findings could have import and relevance to a rather broad audience.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 Major comments: 1. Data demonstrated the statistical differences in MuSC behaviors between CD90+ve and CD90-ve cells. However, the difference is small. For example, it is unclear whether the minimal difference in CALCR expression level between CD90+ve and CD90-ve cells gives rise to any biological difference.

      We thank the reviewer for this thoughtful comment and agree that it is important to distinguish statistical significance from biological relevance. However, describing the differences as “small” is somewhat subjective, as it is not clear whether this refers to per-cell expression differences or the magnitude of downstream functional consequences. In the case of CALCR, even modest shifts in receptor abundance can be biologically meaningful in threshold-dependent ligand–receptor systems, and in our study, the CALCR difference is supported by concordant orthogonal readouts (flow cytometry/immunofluorescence and transcript levels, see revised Fig. 6 F-I) together with functional evidence showing differential sensitivity of CD90+ve versus CD90−ve MuSCs to Col6-mediated restraint (see Fig. 6 D-E). In addition, to directly address the Reviewer’s question, we now include in this revised version of the manuscript further experimental data showing a differential response between CD90+ve and CD90−ve MuSCs to pharmacological inhibition of the CALCR signaling pathway (see revised Fig. S9G, H, I). Taken together, these complementary molecular and functional findings indicate that, although the quantitative differences may not appear large in absolute terms, they are sufficient to generate consistent, measurable, and biologically meaningful outcomes.

      1. Negative controls of FACS analyses are required because different sizes of cells might exert different background intensities. (Figure 2I, 2L, and 6F).

      We have now added negative controls to Fig. 2L and 6F, demonstrating the specificity and size independence of the measured signal. Panel 2I, which highlighted size differences between CD90+ve and CD90−ve MuSCs in the previous version, is now relocated to Fig. S3J.

      1. If CD90+ve MuSCs express Col6 higher than CD90-ve MuSCs, they should also highly express the primary target of Notch target genes, Hes1, Hey1, and HeyL. The authors should examine the expression levels of these genes.

      We have now expanded Suppl Table 1 to include all genes upregulated in CD90+ve MuSCs with p-values

      1. As described above, the quantifications of many results, including MyoD, were based on the fluorescent intensity. I know the difficulty of preparing enough cells for experiments, but the authors need to present data supporting these results.

      The evaluation of fluorescent intensity as a reliable and sensitive readout of protein content has been used in multiple publications in the myogenesis field by independent authors and by us (de Morree et al. 2019; Florio et al. 2023; Vetter and Lawlor 2026; Zanotti et al. 2022). The availability of sufficient amounts of materials is an important limitation for proteomic studies, and we thank the Reviewer for acknowledging this. To demonstrate the reliability of our pixel quantification-based assay, we have confirmed selected datasets using an alternative quantification approach based on visual discrimination of MyoD+ve and MyoD-ve cells (see Fig. S3E and associated Fig. S3D in the revised version). The results we obtained confirm those from pixel-quantification. Moreover, to further corroborate the trustworthiness of our analytical approach based on pixel quantification, we performed in parallel western blot analysis and the evaluation of fluorescent intensity on proliferating and differentiating myoblasts using antibodies recognizing MyoD, one of the markers we used throughout the manuscript. The data derived from this parallel evaluation (see Additional Fig. 1 below) clearly demonstrate robust parallelism between the two quantification methods.

      1. Figure 7G-H; More quantitative analyses should be included. In addition, the sample number was different between Fig7E and H. There is no significant difference in the CD90 expression in Fig7G. The authors need to confirm the reproducibility.

      We thank the reviewer for the opportunity to clarify these points. We apologize for not having clearly explained the design of this experiment. Figure 7E refers to a different time-point than Fig. 7F-I (i.e., 1.5 vs. 4.5 days after injury; see Fig. 7D for a visualization of the experimental design). The analysis at 1.5 days post injury was performed in 3 independent biological replicates. The difference in sample numbers between Fig. 7F and Fig. 7H is due to the fact that the absolute cell count was not performed in one biological replicate of CD90-depleted muscle and one biological replicate of control muscle. As a result, those samples were included in the qualitative and morphometric analyses but could not be incorporated into the cell number quantification. Importantly, this discrepancy does not affect the overall outcome or statistical interpretation of the results. We have clarified this in the revised figure legends. Regarding Fig. 7G, we would like to specify that CD90 staining is not shown in this panel. We really apologize for the confusion. The images display laminin and embryonic MyHC (eMyHC) staining to highlight the size and regeneration status of newly formed myofibers. Therefore, CD90 expression in this panel is not relevant to the analysis presented. We have revised the legend to explicitly clarify this point and avoid potential confusion. Overall, we confirm the reproducibility of the findings and have improved the clarity of presentation in the revised manuscript.

      Minor comments: 1. Figure S4. The authors need to show evidence that these cells are proliferating. Without the evidence, CD90 expression my just be retained in non-dividing cells. If it is difficult, the results should be removed.

      We thank the reviewer for highlighting this possibility, and we have now removed panels D and E from Figure S4.

      1. Heterogeneity in cell cycle progression in MuSCs is well documented as fast and slow dividing cells. This reviewer recommends discussing the relevance of CD90 expression to these reports. PMID: 22349695 PMID: 8608871

      In the revised version of the manuscript, we have included the indicated reports in a paragraph of the discussion centered on a possible “division of labor” between CD90+ve and CD90-ve subsets of MuSCs during muscle regeneration. See also the response to the first major point raised by Reviewer #2.

      Reviewer #2

      Major Comments: 1. It is perplexing that the CD90+ fraction is implicated in activation, proliferation, and differentiation (Mgn+ data) while simultaneously contributing to the CD90-ve population (Fig. 3E). However, the reverse does not seem to occur, with CD90-ve cells not replenishing the CD90+ fraction. If the CD90+ subpopulation indeed accounts for the majority of myogenesis, this provokes the question: what is the functional role of the CD90− fraction? Notably, CD90-ve MuSCs appear to divide effectively during regeneration (Fig. 2E-G), further emphasizing the need to clarify their contribution to the overall regenerative process. The presence of a substantial number of CD90-ve MuSCs across conditions suggests they cannot simply be dismissed as irrelevant and understanding their role will help clearly establish the +/- subpopulations as functionally different.

      We thank the reviewer for raising this important point. We would like to clarify that we do not suggest at any stage that the CD90−ve MuSC population is insignificant or dispensable for regeneration. On the contrary, our data consistently show that CD90−ve MuSCs are numerically substantial across homeostatic and regenerative conditions and retain clear proliferative capacity during muscle repair (Fig. 2E-G). We fully agree that their persistence strongly argues for a functional role. Our study was designed primarily to identify and characterize functional differences between CD90+ve and CD90−ve MuSCs in terms of activation dynamics and quiescence control, rather than to comprehensively define the lineage hierarchy or long-term fate of each subpopulation. In this context, the observation that CD90+ve MuSCs can give rise to CD90−ve cells in vitro (Fig. 3E) suggests a degree of plasticity and may indicate that CD90 expression marks an activation-prone state rather than a rigid lineage boundary. The lack of reciprocal conversion under the tested conditions does not imply that CD90−ve cells are functionally inert, but rather that the two fractions may occupy distinct positions along a continuum of activation states. Importantly, our in vivo data demonstrate that CD90−ve MuSCs do enter the cell cycle during regeneration, albeit with slower kinetics compared to CD90+ve cells. This supports a model in which CD90+ve cells are primed for rapid early activation, while CD90−ve cells may represent a more dormant or reserve-like fraction that contributes to regeneration with delayed kinetics or plays a stabilizing role during regeneration. Such division of labor would be consistent with emerging concepts of functional heterogeneity within stem cell compartments. Similar “division-of-labor” models have already been proposed in other stem cell systems, including muscle, where subsets differ in proliferation kinetics, differentiation, or self-renewal behavior, as well as in other tissues. We have included a dedicated paragraph in the discussion to highlight this aspect in light of classical and recent literature. A detailed dissection of the long-term lineage contribution and self-renewal capacity of the CD90−ve fraction would be highly informative; however, addressing this question would require dedicated clonal tracing and transplantation experiments beyond the scope of the present study. We have now clarified this point in the revised Discussion, explicitly stating that our goal is to highlight differential activation modalities between the two subpopulations rather than to assign exclusive regenerative responsibility to one fraction. Taken together, our findings support the view that CD90+ve and CD90−ve MuSCs represent functionally distinguishable, yet complementary, subpopulations within the muscle stem cell pool, rather than hierarchically “major” versus “minor” or “relevant” versus “irrelevant” fractions.

      1. The depletion of CD90+ cells (Fig. 7D-I) is the correct experimental approach to assess the function of these cells in vivo. However, the method employed, using IP injections of a CD90 antibody, can lack specificity. Even with optimal specificity, CD90 is expressed on numerous cell types across the body. This raises the possibility that observed effects may result from targeting other CD90+ cells in skeletal muscle or other tissues, both locally and systemically. To mitigate these confounding factors, the authors should attempt strategies to reduce off-target effects. While the technical challenges are acknowledged by this reviewer and may be prohibitory, addressing these limitations would substantially enhance the impact of this work. Additionally, the embryonic myosin heavy chain (eMHC) images (Fig. 7G, H) should be more representative of the quantification data to ensure consistency.

      We thank the reviewer for this constructive comment and agree that antibody-mediated depletion strategies may raise concerns regarding specificity. As correctly pointed out, CD90 is expressed in additional cellular compartments beyond MuSCs, a limitation that we have explicitly acknowledged in the revised manuscript. Importantly, the anti-CD90 antibody used in this study is highly specific, as validated by flow cytometry and immunofluorescence analyses (see Fig. S1 and Fig. 1H). Moreover, the same clone (30-H12) has been previously employed by other groups for in vivo depletion approaches with comparable experimental aims, supporting its reliability for targeting CD90+ve cells (Powell et al. 2012; Zhou et al. 2022). While we cannot completely exclude effects on other CD90-expressing cells, our depletion strategy was performed in the context of acute muscle damage, with local intramuscular administration at the time of injury following systemic priming, which may partially limit potential broader systemic confounding effects. The timing of the phenotype - restricted to early regenerative stages - argues in favor of a local MuSCs-related contribution. We agree that genetic or lineage-restricted strategies would provide a more selective approach; however, such models are currently unavailable for selectively targeting CD90+ve MuSCs without affecting other CD90-expressing populations. Finally, as requested, we have replaced the representative eMHC images in Fig. 7G and 7H to better reflect the quantification data and ensure improved consistency between images and measured outcomes.

      1. Similar concerns about off-target effects noted in point #2, apply to the use of the Col6 KO mouse model, which appears to be a full body KO, meaning Col6 is absent not only in MuSCs but also in other cell types that typically express Col6. This deficiency would have been present throughout development, complicating the interpretation of the observed effects. The authors do acknowledge Col6 expression by non-MuSC cell types, but the in vivo impact remains challenging to interpret, particularly due to the potential developmental and systemic effects of removing Col6. Also, the observation that the CD90-ve subpopulation still expresses Calcr raises further questions about Col6 acting only on the CD90+ fraction and expression by MuSCs being consequential in vivo. The trend observed in Fig. 6M for CD90-ve cells suggests that this mechanism might not be exclusive to CD90+ cells, warranting further investigation or explanation since an outlier in the Col6KO CD90-ve group may have influenced interpretation.

      We thank the reviewer for this thoughtful comment and agree that the use of a constitutive Col6a1−/− model introduces interpretative limitations. As correctly noted, Col6 is expressed by multiple cell types, and its absence throughout development may potentially result in niche-level and systemic effects that complicate attribution of the phenotype exclusively to MuSC-derived Col6. We have clarified this point in the revised Discussion and tempered our conclusions accordingly. Importantly, we do not propose that the Collagen VI - CALCR axis operates exclusively in the CD90+ve fraction. CALCR is also expressed, although at lower levels, in CD90−ve MuSCs, and the in vivo data in Col6a1−/− mice indicate that both subpopulations are affected. Indeed, parameters such as MyoD modulation shift in the same direction in both fractions (see Fig. 6L), supporting the idea that this signaling axis is functionally active across the MuSC pool. However, several observations indicate that components of the Col6–CALCR axis are more pronounced in CD90+ve MuSCs and that certain responses are more robust in this fraction. These include higher Col6 and CALCR expression levels (Fig. 6A-C and 6F-I), a more pronounced increase in cell size upon Col6 ablation (Fig. 6K), and a clearer modulation of activation-associated readouts (levels of pAMPK and MyoD, and EdU incorporation) upon inhibition of the Col6-CALCR axis in vitro (see the newly introduced experiments with PKA inhibitor in Fig. S9 G-H of the revised version). Thus, rather than invoking strict differential sensitivity, our data support a quantitative model in which the pathway operates in both subpopulations but with greater amplitude or prominence in CD90+ve MuSCs. Regarding the trend observed in Fig. 6M for CD90−ve cells, we acknowledge that variability might have played a role and have revised the text to avoid overstatement. As mentioned above, we expanded characterization of the Col6-CALCR axis in the two subpopulations by performing additional ex vivo experiments to investigate the activation of the signaling pathway downstream of CALCR and the impact of its pharmacological inhibition on the two subpopulations of MuSCs. Overall, we conclude that the Collagen VI–CALCR axis is not exclusive to CD90+ve MuSCs, but that its components and functional consequences appear particularly evident in this fraction.

      1. The siCD90 experiment in Fig. 5 demonstrates effective KD at both the transcript and protein levels, but the observed impact on the proliferation of CD90+ cells (Fig. 5G), while statistically significant, appears to be less than expected. This result is also confusing given the substantial reduction in pAMPK levels observed in Fig. 5L, leading to the expectation of a more pronounced effect on proliferation if the proposed CD90-pAMPK mechanism is a driving pathway. Additionally, Fig. 5N suggests that pAMPK supports proliferation in both CD90+ and CD90− subpopulations. While the AICAR treatment in CD90− cells does not achieve significance, the data exhibit a bimodal distribution among replicates, with an apparent outlier in the control group potentially skewing the analysis. This variability necessitates further clarification for the relationship between CD90, pAMPK, and MuSC proliferation.

      We thank the reviewer for this careful evaluation of the siRNA and AICAR experiments. First, to improve clarity and better reflect the relationship between CD90 knockdown efficiency and biological outcome, we have now re-expressed the proliferation data in Fig. 5 F, G, L as percentage reduction relative to the CD90+ve scrambled control condition. Presenting the data in this normalized manner highlights a strong correlation between the extent of CD90 ablation (Fig. 5E) and the magnitude of the physiological effect. While the absolute change in proliferation may appear moderate, it scales consistently with the degree of CD90 protein reduction, supporting a dose-effect relationship rather than an all-or-none response. Our data support the interpretation that the CD90-AMPK axis contributes to, but does not solely determine, proliferative behavior. MuSC proliferation is likely governed by multiple converging pathways, and CD90-dependent modulation of AMPK represents an important component of this regulatory network. To further strengthen the mechanistic link, we have complemented the primary MuSCs data with gain-of-function experiments in C2C12 reserve cells (see Fig. S6 in this revised version). Overexpression of CD90 in this model enhances activation-associated features, including increased Myod upregulation, pAMPK levels, and augmented entry into the cell cycle upon stimulation, compared to control untransduced cells. These findings provide independent evidence that CD90 expression is sufficient to potentiate AMPK signaling and bias cells toward a more activation-prone state, supporting the causal nature of the CD90–AMPK axis beyond observations in primary MuSCs. Concerning the AICAR experiments (Fig. 5N), we acknowledge that in vivo pharmacological activation can be influenced by biological variability. It is indeed possible that AICAR exerts effects on both CD90+ve and CD90−ve populations, as AMPK is present in both fractions (Additional Fig. 2).

      1. The CD90 related findings in human samples appear less robust compared to those in mice. While the sorting successfully identifies sizable CD90+ and CD90-ve populations (Fig. 4A), the sequencing data show only small regions of high CD90 expression, as highlighted in red by the authors (Fig. 4C, D). Have the authors considered replicating the sequencing experiments within their own laboratory? While it is acknowledged that sourcing human tissue may be a limitation, it may strengthen the translational impact if possible.

      We thank the reviewer for this thoughtful comment. We agree that the CD90-related signal in the human scRNA-seq dataset appears less striking than in the murine Cy-TOF data; however, we believe that the most parsimonious explanation lies in the well-documented technical limitations of single-cell RNA sequencing, particularly its limited sensitivity for low-abundance transcripts. It is widely recognized that in scRNAseq experiments, the number of genes detected depends on the sequencing depth, and therefore scRNA-seq suffers from “dropout” effects and reduced detection efficiency, especially for transcripts expressed at moderate-to-low levels, such as those that do not encode abundant structural proteins (Kharchenko et al. 2014; Svensson et al. 2017). CD90 falls within this category, as it is not an abundant structural protein and may therefore be underrepresented at the mRNA level despite robust protein detection with FACS. Indeed, discrepancies between protein-level heterogeneity and scRNA-seq signal intensity are commonly reported, particularly for surface markers (Linderman et al. 2022; Stuart and Satija 2019). Importantly, in our study, the presence of substantial CD90+ve and CD90−ve human MuSC populations is robustly demonstrated by flow cytometry-based sorting (Fig. 4A-B), which directly measures protein abundance and shows a clear bimodal distribution. The scRNA-seq data were used as supportive, orthogonal evidence and are consistent with enrichment of CD90-expressing clusters, even if the signal is spatially restricted. Furthermore, functional assays (ex vivo EdU incorporation and activation parameters following injury) independently validate that CD90 marks a functionally distinct fraction in human muscle. While we agree that performing scRNA-seq in-house would be valuable, access to freshly isolated human MuSCs in sufficient numbers for high-depth single-cell sequencing remains technically and ethically challenging. Given that our conclusions rely primarily on protein-level stratification and functional validation, we believe that the translational relevance of the findings is adequately supported, and we hope the Reviewer will agree.

      Minor Comments: 1. Fig. 1D - the MuSC population has an uncharacteristically low representation amongst cells of uninjured muscle. Can the authors comment on this in text?

      We thank the reviewer for raising this point. We have re-examined our calculations on a per-sample basis, and the proportion of MuSCs among total mononuclear cells isolated from uninjured muscle ranges between approximately 0.8% and 3%. This frequency is within the lower end of the range typically reported for MuSCs isolated by FACS from adult uninjured murine muscle, which is commonly described to fall around ~1-4% depending on digestion protocol, gating strategy, and muscle type (Liu et al. 2015; Machado et al. 2017; Montarras et al. 2005). Importantly, we intentionally applied a conservative gating strategy to minimize contamination from non-myogenic populations and to ensure that CD90 detection was strictly restricted to bona fide MuSCs. While this approach may reduce the apparent overall frequency of MuSCs, it increases confidence in the purity of the analyzed population and in the interpretation of CD90-based subfractionation. To improve clarity and transparency, we have now included an additional figure (Additional Fig. 3) below detailing the full gating scheme, along with absolute numbers and percentages of MuSCs across samples. We have also added explanatory text in the legend of Figure 1D of the revised manuscript to explicitly address this point.

      1. Fig. 2 - the g-alert phenotype corresponding with CD90 expression is interesting. Can the authors add a molecular marker to confirm this phenotype?

      We have now added to our size, rosa locus activity, and mitochondrial content analysis the quantification of phosphorylated S6, a marker of cells in the alert state according to preexisting literature (Rodgers et al. 2014) (see Fig. 2J of the revised version). The more pronounced presence of phospho-S6 in CD90+ve MuSCs under “alerting” conditions supports our conclusion that CD90+ve MuSCs present a more rapid tendency to enter the G0alert state compared to their CD90-ve counterpart.

      1. The authors mention "significantly higher fraction of CD90+ve MuSCs incorporated EdU in vivo at 1.5, 2.75, and 6 days after injury..." in 2G, but it only seems 1.5 and 2.75 are different. The text should be corrected.

      We apologize for the mistake and thank the Reviewer for noticing this typo, which we have corrected in the current version of the manuscript.

      1. 3B, C - 7dpi seems late for the analyses of Myogenin at a single-cell level since most differentiating MuSCs are fused at this point. Can the authors comment in the text why 7 dpi was chosen?

      In adult murine skeletal muscle, most of the events associated to the regenerative process following acute injury typically unfolds over approximately 14 days, with early activation and proliferation occurring within the first few days, followed by differentiation and fusion, and progressive maturation of newly formed fibers until ~30 days post-injury. Thus, 7 days post-injury represents an intermediate stage of the regenerative timeline, during which differentiation is still ongoing and newly formed fibers are not yet fully matured. At this time point, Myogenin-positive cells can still be detected, reflecting ongoing differentiation of MuSC progeny before complete fusion and maturation. Told that, the major goal of this experiment was to begin exploring potential mechanisms restoring the initial 1:1 ratio between CD90+ve and CD90-ve MuSCs after injury resolution. Our observations reported in Fig. 3A suggest that ~6 days post-injury, the reequilibration process has started. The choice of 7 days post-injury for the myogenin analysis is a consequence of this observation. Moreover, around this time-point, CD90+ve and CD90-ve MuSCs showed similar EdU incorporation rates (see Fig. 2G). We have clarified this rationale in the revised manuscript to better contextualize the timing of the analysis.

      1. Note - 7L is very interesting, but emphasizes the major Q#1 above. Mdx mice are known to lose MuSC capacity due to continuous rounds of proliferation. So, the CD90+ loss is supportive, but there are still many CD90-ve cells present. In other figures, the authors demonstrated the negative fractions still harbor decent proliferative potential, so why is there no rescue?

      We thank the reviewer for highlighting this important aspect of our findings and for recognizing the relevance of the observation in Fig. 7L. We agree that in mdx mice, where MuSCs undergo repeated cycles of activation and proliferation, the preferential loss of the CD90+ve subset is consistent with the idea that this subpopulation may be particularly vulnerable to chronic regenerative stress. At the same time, we acknowledge that a substantial number of CD90−ve MuSCs remain present. As the Reviewer correctly notes, our data indicate that CD90−ve cells retain proliferative potential under acute regenerative conditions. However, our study was designed primarily to define differences in activation dynamics and quiescence control between CD90+ve and CD90−ve MuSCs, rather than to establish their relative capacity to sustain long-term regeneration in pathological contexts. Understanding why the remaining CD90−ve population does not compensate for the loss of CD90+ve cells in mdx muscle would require dedicated lineage-tracing, transplantation, or long-term functional assays, which go beyond the scope of the present work. One possible interpretation, consistent with our data, is that CD90−ve MuSCs exhibit slower activation kinetics and may not efficiently support the rapid or repeated regenerative demands characteristic of dystrophic muscle. Thus, they may be insufficient to fully rescue regeneration under chronic pathological stress. Future studies specifically addressing the regenerative potential of purified CD90−ve MuSCs in transplantation or chronic injury models will be required to resolve this question in detail. We have clarified this point in the revised Discussion and tempered our interpretation accordingly.

      1. The authors should clearly state the number of cells and number of replicates for their single cell distribution graphs in all legends.

      We have added this information to all relevant legends, where it was not already present.

      1. In their stats section of methods, ns= p{greater than or equal to}0.15, please clarify.

      We thank the reviewer for pointing this out and apologize for the lack of clarity. The threshold “ns ≥ 0.15” was introduced as an arbitrary and conservative criterion to avoid labeling as “non-significant” comparisons with p values only marginally above the conventional 0.05 threshold. Our intention was to distinguish likely truly non-significant results (p ≥ 0.15) from those showing a statistical trend (0.05

      Reviewer #3

      Major comments 1. While the manuscript provides valuable insights into the functional heterogeneity of MuSCs, there are some critical aspects that remain unclear. Specifically, the mechanism by which the quiescence of CD90+ MuSCs, maintained through the COL6-CALCR pathway, confers an advantage for their rapid activation is not sufficiently addressed. Understanding why this pathway enables such responsiveness would significantly strengthen the authors' conclusions. Additionally, the manuscript does not elucidate how the quiescence of CD90-negative MuSCs is maintained, leaving a gap in the characterization of MuSC heterogeneity. Without this information, the functional significance of this heterogeneity, particularly in the context of muscle regeneration, remains incomplete.It would be interesting to explore why CD90-negative cells appear less responsive to injury or why CD90+ cells are more readily activated. Addressing these questions would provide a more comprehensive understanding of the biological implications of MuSC heterogeneity and enhance the impact of the study.

      We thank the reviewer for this thoughtful and conceptually important comment. We agree that clarifying the mechanistic basis underlying the differential activation propensity of CD90+ve and CD90−ve MuSCs strengthens the interpretation of functional heterogeneity. To address this point, we have expanded the mechanistic component of the study in two directions. First, to better understand why CD90+ve MuSCs display a more pronounced activation profile, we performed gain-of-function experiments in C2C12 reserve cells by overexpressing CD90. These experiments, reported in Figure S6 of this revised version, demonstrate that CD90 overexpression enhances activation-associated features, supporting a causal link between CD90 expression and activation propensity. This complements the loss-of-function data in primary MuSCs and reinforces the concept of a CD90-AMPK axis contributing to a primed metabolic state. Second, regarding quiescence, we have further dissected the COL6-CALCR pathway by analyzing downstream signaling components and comparing its mechanistic features with the previously described COL5-CALCR axis (Baghdadi et al. 2018). Our new data show that COL6 engagement leads to modulation of downstream effectors (e.g., YAP localization and PKA-dependent signaling), consistent with a CALCR-mediated quiescence program that shares similarities with COL5-driven regulation (see Fig.S9 of the revised version). We have expanded the Discussion to more clearly articulate this mechanistic convergence. Importantly, we do not propose that COL6-mediated quiescence directly “confers” activation capacity in a deterministic sense. Rather, our model suggests that CD90+ve MuSCs exist in a poised state: they exhibit an intrinsically primed activation program (via CD90-AMPK), while concurrently maintaining quiescence through a COL6-CALCR-dependent restraint. This dual regulatory architecture may allow rapid transition upon injury without premature exhaustion, thereby providing a kinetic advantage. In this context, the destruction of muscle extracellular architecture associated with injury would release the “break” imposed by Collagen 6 on the activation of CD90+ve cells. Regarding CD90−ve MuSCs, we acknowledge that the precise mechanisms maintaining their quiescence remain incompletely defined in this study. However, our data suggest that they are less reliant on the COL6–CALCR axis. A full dissection of these pathways would require dedicated transcriptional and signaling analyses beyond the scope of the present work. We have clarified this aspect in the revised Discussion. Finally, we have further elaborated in the Discussion that CD90+ve and CD90−ve MuSCs may represent functionally complementary subpopulations: CD90+ve cells being primed for rapid early activation, and CD90−ve cells potentially contributing under different temporal or regenerative contexts. We believe that these additions provide a more comprehensive framework for understanding the biological implications of MuSC heterogeneity while maintaining appropriate caution regarding unresolved mechanistic aspects.

      1. I am particularly concerned about Figure 7. As the authors mentioned, CD90 is not specific to MuSCs. Therefore, the conclusion that CD90+ MuSCs are important for muscle regeneration based on the current experiment is not fully convincing. I suggest incorporating additional approaches to confirm this point. For example, transplantation of CD90+ or CD90- MuSCs into injured muscles would provide stronger support for their findings.

      We thank the reviewer for raising this important concern, which was also emphasized by Reviewer #2. We fully agree that the lack of absolute specificity of CD90 for MuSCs represents a limitation when interpreting in vivo depletion experiments. A number of observations somehow mitigate the concern (see answer to point #2 of Reviewer 2). At present, however, there are no available genetic tools that would allow selective targeting of the CD90+ve MuSC subpopulation. Addressing this question definitively would likely require the generation of compound mouse models combining at least two independent genetic modifications (e.g., a MuSC-specific driver together with a CD90-dependent conditional ablation system), which are currently not available. We have explicitly clarified this limitation in the revised manuscript. Regarding the suggestion of transplantation experiments, we agree that this approach is often used to assess regenerative potential. However, in the specific context of our study, transplantation may not directly resolve the key mechanistic question we are addressing. Indeed, isolation and transplantation procedures inevitably activate MuSCs due to enzymatic digestion and removal from their niche, thereby erasing differences related to activation kinetics and quiescence maintenance. Since the central focus of our work is the differential propensity for activation and the regulation of quiescence between CD90+ve and CD90−ve subpopulations, transplantation of already activated cells may obscure precisely the phenotypic differences we aim to characterize. Importantly, our conclusions do not rely solely on the depletion experiment in Fig. 7. The functional relevance of CD90+ve MuSCs is supported by multiple complementary lines of evidence, including differences in activation kinetics, AMPK signaling, response to Collagen VI, and their preferential depletion in dystrophic muscle. The in vivo antibody-mediated depletion, therefore, serves as supportive, rather than exclusive, functional validation. We have revised the Discussion to explicitly acknowledge these technical constraints (and, therefore, temper our conclusions), while emphasizing that the available evidence supports a functional contribution of the CD90+ve fraction to early regenerative dynamics.

      1. A previous study has demonstrated that the COL5-CALCR pathway is essential for maintaining MuSC quiescence. In this manuscript, the authors propose the COL6-CALCR pathway; however, the current study lacks specific experiments to clarify the differences and similarities between these pathways. Additionally, the discussion section does not adequately address these points, leaving the interpretation incomplete. A more thorough discussion comparing these pathways would significantly improve the manuscript.

      We thank the reviewer for highlighting this important conceptual point. We agree that a clearer comparison between the previously described Col5-CALCR pathway and the Col6-CALCR axis proposed in our study strengthens the interpretation of our findings. To directly address this issue, we have now included two additional sets of experiments in the revised manuscript. First, we performed a comparative analysis of Col5, Col6, and Col4 as substrates, evaluating their ability to activate downstream CALCR signaling, using the reduction of nuclear YAP accumulation as a functional readout (Zhang et al. 2019). Although only the effect induced by Col6 was statistically different from those induced by Col 4, these experiments suggest that both collagen 5 and 6 can activate this pathway (see Fig. S9E-F of the revised manuscript). Second, we assessed activation and proliferation parameters in freshly isolated CD90+ve MuSCs plated on Col5, Col6, or Col4 substrates. This allowed us to directly compare the functional consequences of these different ECM components on activation kinetics and proliferative behavior within the same experimental framework. The results indicate overlapping effects of Col6 and Col5, distinct from those induced by Col4. These observations support the idea that Col6 contributes to quiescence regulation in a manner that is at least partially convergent with the previously described Col5 pathway (see Fig S9 A-B of the revised manuscript). Importantly, the commune effect induced by Col6 and Col5 appears to be specific, as Col1 behaves similarly to Col4 under similar testing conditions (see Fig S9 C-D of the revised manuscript). In addition to incorporating these new data, we have substantially expanded the Discussion to more thoroughly compare the COL5-CALCR and COL6-CALCR axes, emphasizing both shared mechanisms (CALCR engagement and quiescence modulation) and potential differences in expression patterns, subpopulation bias, and magnitude of response. We believe that these additions significantly clarify the relationship between the two pathways and strengthen the overall mechanistic framework of the manuscript.

      Minor comments 1. It would be interesting to see the spatial localization of CD90+ and CD90- MuSCs in skeletal muscle tissue.

      We would like to point out that the spatial localization of CD90+ve and CD90−ve MuSCs within skeletal muscle tissue is already shown in Fig. 1H and Fig.6B, where immunofluorescence analysis of muscle cryosections demonstrates the presence of both subpopulations in their native niche. In these images, CD90 staining is visualized in combination with MuSC markers, allowing identification of CD90+ve and CD90−ve MuSCs in situ. We have added to this response to Reviewers additional examples, in which Col6 staining is also highlighted (Additional Fig. 4). To improve clarity, we have revised the legend for Fig. 1H to more explicitly highlight this aspect and guide the reader to the relevant panel.

      1. I suggest conducting a more thorough investigation to characterize the quiescent CD90+ and CD90- MuSCs, particularly focusing on aspects such as protein translation machinery.

      We thank the reviewer for this insightful suggestion. We agree that a deeper characterization of quiescent CD90+ve and CD90−ve MuSCs, including analysis of protein translation machinery and related metabolic features, would provide valuable additional insight into the molecular basis of their functional differences. However, such an in-depth investigation would require dedicated molecular investigations, such as proteomic and/or ribosome profiling approaches, and goes beyond the scope of the present study, which is focused primarily on differential activation dynamics and quiescence regulation between the two subpopulations. We believe this represents an important and promising direction for future work.

      1. The authors should include a discussion of previously identified markers of MuSC heterogeneity, such as CD34, to provide better context for their findings.

      We thank the reviewer for this helpful suggestion. We agree that placing our findings in the context of previously described markers of MuSC heterogeneity is important. In the manuscript, we have explicitly evaluated the relationship between CD90+ve and CD90−ve MuSCs and previously reported heterogeneity markers, including CD34. These analyses are presented in Fig. S2, where we show that CD90-based stratification does not simply recapitulate previously defined subsets. For clarity, we have also added a new summary table (Additional Table 1 below) highlighting the limited overlap between CD90-defined fractions and other reported markers of MuSC heterogeneity. Furthermore, we have expanded the Discussion to note that CD90 does not align with markers such as CD34 and others described in the literature, emphasizing that CD90 identifies a functionally distinct layer of heterogeneity, primarily related to activation kinetics and quiescence regulation, rather than directly overlapping with previously characterized subpopulations.

      1. In Figure 6, the authors used COL4 as a negative control; however, this is insufficient to conclusively demonstrate the importance of COL6 in maintaining CD90+ MuSC quiescence. Including additional substrates beyond collagen, such as fibronectin or laminin, along with COL5, would strengthen the conclusions drawn from these experiments.

      We thank the reviewer for this valuable suggestion. We agree that expanding the range of substrates strengthens the interpretation of the role of COL6 in regulating MuSC quiescence. In the revised manuscript, we have now included additional comparative experiments using Collagen V and Collagen I as alternative substrates. Collagen V, consistent with previous reports implicating the COL5–CALCR axis in MuSC quiescence, produced effects qualitatively similar to those observed with Collagen VI, supporting a partially convergent mechanism at the level of CALCR signaling (see also the response to major comment #3 above) (Fig. S9A-B of the revised version). In contrast, Collagen I was less effective at promoting quiescence-associated features in CD90+ve MuSCs, yielding results similar to Col4 in terms of EdU incorporation and expression of pAMPK (see Fig. S9C-D of the revised version). These findings reinforce the idea that Collagen VI (and Collagen V) are not merely generic ECM components, but exert specific regulatory effects on MuSC activation dynamics. We have incorporated these new data into Figure S9, where we have also created a graphical scheme to summarize and better contextualize similarities and differences among ECM substrates in shaping MuSC behavior (Fig. S9J).

      ADDITIONAL REFERENCES Almada AE, Horwitz N, Price FD, Gonzalez AE, Ko M, Bolukbasi OV, Messemer KA, Chen S, Sinha M, Rubin LL, et al. 2021. FOS licenses early events in stem cell activation driving skeletal muscle regeneration. Cell Reports 34: 108656. Baghdadi MB, Castel D, Machado L, Fukada S-I, Birk DE, Relaix F, Tajbakhsh S, Mourikis P. 2018. Reciprocal signalling by Notch-Collagen V-CALCR retains muscle stem cells in their niche. Nature 557: 714–718. Beauchamp JR, Heslop L, Yu DSW, Tajbakhsh S, Kelly RG, Wernig A, Buckingham ME, Partridge TA, Zammit PS. 2000. Expression of CD34 and Myf5 defines the majority of quiescent adult skeletal muscle satellite cells. Journal of Cell Biology 151: 1221–1233. Cerletti M, Jurga S, Witczak CA, Hirshman MF, Shadrach JL, Goodyear LJ, Wagers AJ. 2008. Highly Efficient, Functional Engraftment of Skeletal Muscle Stem Cells in Dystrophic Muscles. Cell 134: 37–47. Chakkalakal JV, Christensen J, Xiang W, Tierney MT, Boscolo FS, Sacco A, Brack AS. 2014. Early forming label-retaining muscle stem cells require p27kip1 for maintenance of the primitive state. Development (Cambridge, England) 141: 1649–59. Chakkalakal JV, Jones KM, Basson MA, Brack AS. 2012. The aged niche disrupts muscle stem cell quiescence. Nature 490: 355–360. de Morree A, Klein JDD, Gan Q, Farup J, Urtasun A, Kanugovi A, Bilen B, van Velthoven CTJ, Quarta M, Rando TA. 2019. Alternative polyadenylation of Pax3 controls muscle stem cell fate and muscle function. Science 366: 734–738. Der Vartanian A, Quétin M, Michineau S, Auradé F, Hayashi S, Dubois C, Rocancourt D, Drayton-Libotte B, Szegedi A, Buckingham M, et al. 2019. PAX3 Confers Functional Heterogeneity in Skeletal Muscle Stem Cell Responses to Environmental Stress. Cell Stem Cell 24: 958-973.e9. Florio F, Vencato S, Papa FT, Libergoli M, Kheir E, Ghzaiel I, Rando TA, Torrente Y, Biressi S. 2023. Combinatorial activation of the WNT ‐dependent fibrogenic program by distinct complement subunits in dystrophic muscle. EMBO Molecular Medicine 15: 1–20. García-Prat L, Perdiguero E, Alonso-Martín S, Dell’Orso S, Ravichandran S, Brooks SR, Juan AH, Campanario S, Jiang K, Hong X, et al. 2020. FoxO maintains a genuine muscle stem-cell quiescent state until geriatric age. Gayraud-Morel B, Chrétien F, Jory A, Sambasivan R, Negroni E, Flamant P, Soubigou G, Coppée J-Y, Di Santo J, Cumano A, et al. 2012. Myf5 haploinsufficiency reveals distinct cell fate potentials for adult skeletal muscle stem cells. Journal of cell science 125: 1738–1749. Guardiola O, Iavarone F, Nicoletti C, Ventre M, Rodríguez C, Pisapia L, Andolfi G, Saccone V, Patriarca EJ, Puri PL, et al. 2023. CRIPTO-based micro-heterogeneity of mouse muscle satellite cells enables adaptive response to regenerative microenvironment. Developmental Cell 58: 2896-2913.e6. Kharchenko PV, Silberstein L, Scadden DT. 2014. Bayesian approach to single-cell differential expression analysis. Nat Methods 11: 740–742. Kuang S, Kuroda K, Le Grand F, Rudnicki MA. 2007. Asymmetric Self-Renewal and Commitment of Satellite Stem Cells in Muscle. Cell 129: 999–1010. Linderman GC, Zhao J, Roulis M, Bielecki P, Flavell RA, Nadler B, Kluger Y. 2022. Zero-preserving imputation of single-cell RNA-seq data. Nature Communications 13: 1–11. Liu L, Cheung TH, Charville GW, Rando TA. 2015. Isolation of skeletal muscle stem cells by fluorescence-activated cell sorting. Nature Protocols 10: 1612–1624. Machado L, Esteves de Lima J, Fabre O, Proux C, Legendre R, Szegedi A, Varet H, Ingerslev LR, Barrès R, Relaix F, et al. 2017. In Situ Fixation Redefines Quiescence and Early Activation of Skeletal Muscle Stem Cells. Cell Reports 21: 1982–1993. Montarras D, Morgan J, Collins C, Relaix F, Zaffran S, Cumano A, Partridge T, Buckingham M. 2005. Direct isolation of satellite cells for skeletal muscle regeneration. Science (New York, NY) 309: 2064–2067. Powell N, Walker AW, Stolarczyk E, Canavan JB, Gökmen MR, Marks E, Jackson I, Hashim A, Curtis MA, Jenner RG, et al. 2012. The Transcription Factor T-bet Regulates Intestinal Inflammation Mediated by Interleukin-7 Receptor+ Innate Lymphoid Cells. Immunity 37: 674–684. Relaix F, Montarras D, Zaffran S, Gayraud-Morel B, Rocancourt D, Tajbakhsh S, Mansouri A, Cumano A, Buckingham M. 2006. Pax3 and Pax7 have distinct and overlapping functions in adult muscle progenitor cells. Journal of Cell Biology 172: 91–102. Rocheteau P, Gayraud-Morel B, Siegl-Cachedenier I, Blasco MA, Tajbakhsh S. 2012. A subpopulation of adult skeletal muscle stem cells retains all template DNA strands after cell division. Cell 148: 112–125. Rodgers JT, King KY, Brett JO, Cromie MJ, Charville GW, Maguire KK, Brunson C, Mastey N, Liu L, Tsai CR, et al. 2014. MTORC1 controls the adaptive transition of quiescent stem cells from G 0 to GAlert. Nature 510: 393–396. Scaramozza A, Park D, Kollu S, Beerman I, Sun X, Rossi DJ, Lin CP, Scadden DT, Crist C, Brack AS. 2019. Lineage Tracing Reveals a Subset of Reserve Muscle Stem Cells Capable of Clonal Expansion under Stress. Cell Stem Cell 24: 944-957.e5. Stuart T, Satija R. 2019. Integrative single-cell analysis. Nat Rev Genet 20: 257–272. Svensson V, Natarajan KN, Ly L-H, Miragaia RJ, Labalette C, Macaulay IC, Cvejic A, Teichmann SA. 2017. Power analysis of single-cell RNA-sequencing experiments. Nat Methods 14: 381–387. Vetter TA, Lawlor MW. 2026. Automated Quantification of Dystrophin Expression by Immunofluorescence in Humans and Animal Models. Methods Mol Biol 2975: 67–87. Zanotti S, Magri F, Poggetti F, Ripolone M, Velardo D, Fortunato F, Ciscato P, Moggio M, Corti S, Comi GP, et al. 2022. Immunofluorescence signal intensity measurements as a semi-quantitative tool to assess sarcoglycan complex expression in muscle biopsy. Eur J Histochem 66: 3418. Zhang L, Noguchi Y-T, Nakayama H, Kaji T, Tsujikawa K, Ikemoto-Uezumi M, Uezumi A, Okada Y, Doi T, Watanabe S, et al. 2019. The CalcR-PKA-Yap1 Axis Is Critical for Maintaining Quiescence in Muscle Stem Cells. Cell Rep 29: 2154-2163.e5. Zhou L, Zhou W, Joseph AM, Chu C, Putzel GG, Fang B, Teng F, Lyu M, Yano H, Andreasson KI, et al. 2022. Group 3 innate lymphoid cells produce the growth factor HB-EGF to protect the intestine from TNF-mediated inflammation. Nature immunology 23: 251–261.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: In this manuscript, the authors explore the heterogeneity of adult muscle stem cells (MuSCs) in murine and human skeletal muscles. They identify diverse expression levels of CD90 on MuSCs and demonstrate that CD90+ MuSCs are primed for myogenic commitment during muscle regeneration. The authors show that CD90+ MuSCs rapidly enter the cell cycle upon MYOD activation, mediated by rapid AMPK phosphorylation. Furthermore, they investigate the characteristics of both CD90+ and CD90- MuSCs in the quiescent state, revealing that the quiescence of CD90+ MuSCs is maintained through the COL6-CALCR pathway-an original finding of this study. Lastly, using an antibody-mediated depletion model targeting CD90+ MuSCs, they confirm the critical role of this population in muscle regeneration. Antibody-induced depletion or imbalance of CD90+ MuSCs results in impaired or delayed muscle regeneration.

      Major comments

      1. While the manuscript provides valuable insights into the functional heterogeneity of MuSCs, there are some critical aspects that remain unclear. Specifically, the mechanism by which the quiescence of CD90+ MuSCs, maintained through the COL6-CALCR pathway, confers an advantage for their rapid activation is not sufficiently addressed. Understanding why this pathway enables such responsiveness would significantly strengthen the authors' conclusions. Additionally, the manuscript does not elucidate how the quiescence of CD90-negative MuSCs is maintained, leaving a gap in the characterization of MuSC heterogeneity. Without this information, the functional significance of this heterogeneity, particularly in the context of muscle regeneration, remains incomplete.It would be interesting to explore why CD90-negative cells appear less responsive to injury or why CD90+ cells are more readily activated. Addressing these questions would provide a more comprehensive understanding of the biological implications of MuSC heterogeneity and enhance the impact of the study.
      2. I am particularly concerned about Figure 7. As the authors mentioned, CD90 is not specific to MuSCs. Therefore, the conclusion that CD90+ MuSCs are important for muscle regeneration based on the current experiment is not fully convincing. I suggest incorporating additional approaches to confirm this point. For example, transplantation of CD90+ or CD90- MuSCs into injured muscles would provide stronger support for their findings.
      3. A previous study has demonstrated that the COL5-CALCR pathway is essential for maintaining MuSC quiescence. In this manuscript, the authors propose the COL6-CALCR pathway; however, the current study lacks specific experiments to clarify the differences and similarities between these pathways. Additionally, the discussion section does not adequately address these points, leaving the interpretation incomplete. A more thorough discussion comparing these pathways would significantly improve the manuscript.

      Minor comments

      1. It would be interesting to see the spatial localization of CD90+ and CD90- MuSCs in skeletal muscle tissue.
      2. I suggest conducting a more thorough investigation to characterize the quiescent CD90+ and CD90- MuSCs, particularly focusing on aspects such as protein translation machinery.
      3. The authors should include a discussion of previously identified markers of MuSC heterogeneity, such as CD34, to provide better context for their findings.
      4. In Figure 6, the authors used COL4 as a negative control; however, this is insufficient to conclusively demonstrate the importance of COL6 in maintaining CD90+ MuSC quiescence. Including additional substrates beyond collagen, such as fibronectin or laminin, along with COL5, would strengthen the conclusions drawn from these experiments.

      Significance

      Overall, this study provides a unique perspective on the functional heterogeneity of MuSCs by using CD90 as a marker to delineate distinct MuSC subpopulations. This approach sheds light on the specific roles of CD90+ MuSCs in muscle regeneration and offers new insights into the regulatory mechanisms governing MuSC function.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Kheir et al. explore the heterogeneity within the MuSC compartment, identifying a CD90+ subpopulation with enhanced activation and proliferative capacity compared to its CD90-ve counterpart. While functional heterogeneity in MuSCs is a well-recognized and intriguing area of study, prior research has often focused on differences inferred from transcriptional profiles. This current study advances the field by intriguingly linking CD90 expression to distinct functional outcomes, thereby providing more compelling evidence for the existence of this subpopulation. The authors further investigate potential mechanisms, suggesting a connection between CD90 expression and intrinsic pAMPK activity as a driver of proliferation, as well as a role for Col6-Calcr binding in maintaining quiescence. While the majority of experiments are rigorously conducted, additional studies are suggested to determine whether the CD90+ fraction represents a subpopulation with substantial functional relevance. Detailed critiques and recommendations for further studies are outlined below.

      Major Comments:

      1. It is perplexing that the CD90+ fraction is implicated in activation, proliferation, and differentiation (Mgn+ data) while simultaneously contributing to the CD90-ve population (Fig. 3E). However, the reverse does not seem to occur, with CD90-ve cells not replenishing the CD90+ fraction. If the CD90+ subpopulation indeed accounts for the majority of myogenesis, this provokes the question: what is the functional role of the CD90− fraction? Notably, CD90-ve MuSCs appear to divide effectively during regeneration (Fig. 2E-G), further emphasizing the need to clarify their contribution to the overall regenerative process. The presence of a substantial number of CD90-ve MuSCs across conditions suggests they cannot simply be dismissed as irrelevant and understanding their role will help clearly establish the +/- subpopulations as functionally different.
      2. The depletion of CD90+ cells (Fig. 7D-I) is the correct experimental approach to assess the function of these cells in vivo. However, the method employed, using IP injections of a CD90 antibody, can lack specificity. Even with optimal specificity, CD90 is expressed on numerous cell types across the body. This raises the possibility that observed effects may result from targeting other CD90+ cells in skeletal muscle or other tissues, both locally and systemically. To mitigate these confounding factors, the authors should attempt strategies to reduce off-target effects. While the technical challenges are acknowledged by this reviewer and may be prohibitory, addressing these limitations would substantially enhance the impact of this work. Additionally, the embryonic myosin heavy chain (eMHC) images (Fig. 7G, H) should be more representative of the quantification data to ensure consistency.
      3. Similar concerns about off-target effects noted in point #2, apply to the use of the Col6 KO mouse model, which appears to be a full body KO, meaning Col6 is absent not only in MuSCs but also in other cell types that typically express Col6. This deficiency would have been present throughout development, complicating the interpretation of the observed effects. The authors do acknowledge Col6 expression by non-MuSC cell types, but the in vivo impact remains challenging to interpret, particularly due to the potential developmental and systemic effects of removing Col6. Also, the observation that the CD90-ve subpopulation still expresses Calcr raises further questions about Col6 acting only on the CD90+ fraction and expression by MuSCs being consequential in vivo. The trend observed in Fig. 6M for CD90-ve cells suggests that this mechanism might not be exclusive to CD90+ cells, warranting further investigation or explanation since an outlier in the Col6KO CD90-ve group may have influenced interpretation.
      4. The siCD90 experiment in Fig. 5 demonstrates effective KD at both the transcript and protein levels, but the observed impact on the proliferation of CD90+ cells (Fig. 5G), while statistically significant, appears to be less than expected. This result is also confusing given the substantial reduction in pAMPK levels observed in Fig. 5L, leading to the expectation of a more pronounced effect on proliferation if the proposed CD90-pAMPK mechanism is a driving pathway. Additionally, Fig. 5N suggests that pAMPK supports proliferation in both CD90+ and CD90− subpopulations. While the AICAR treatment in CD90− cells does not achieve significance, the data exhibit a bimodal distribution among replicates, with an apparent outlier in the control group potentially skewing the analysis. This variability necessitates further clarification for the relationship between CD90, pAMPK, and MuSC proliferation.
      5. The CD90 related findings in human samples appear less robust compared to those in mice. While the sorting successfully identifies sizable CD90+ and CD90-ve populations (Fig. 4A), the sequencing data show only small regions of high CD90 expression, as highlighted in red by the authors (Fig. 4C, D). Have the authors considered replicating the sequencing experiments within their own laboratory? While it is acknowledged that sourcing human tissue may be a limitation, it may strengthen the translational impact if possible.

      Minor Comments:

      1. Fig. 1D - the MuSC population has an uncharacteristically low representation amongst cells of uninjured muscle. Can the authors comment on this in text?
      2. Fig. 2 - the g-alert phenotype corresponding with CD90 expression is interesting. Can the authors add a molecular marker to confirm this phenotype?
      3. The authors mention "significantly higher fraction of CD90+ve MuSCs incorporated EdU in vivo at 1.5, 2.75, and 6 days after injury..." in 2G, but it only seems 1.5 and 2.75 are different. The text should be corrected.
      4. 3B, C - 7dpi seems late for the analyses of Myogenin at a single-cell level since most differentiating MuSCs are fused at this point. Can the authors comment in the text why 7 dpi was chosen?
      5. Note - 7L is very interesting, but emphasizes the major Q#1 above. Mdx mice are known to lose MuSC capacity due to continuous rounds of proliferation. So, the CD90+ loss is supportive, but there are still many CD90-ve cells present. In other figures, the authors demonstrated the negative fractions still harbor decent proliferative potential, so why is there no rescue?
      6. The authors should clearly state the number of cells and number of replicates for their single cell distribution graphs in all legends.
      7. In their stats section of methods, ns= p{greater than or equal to}0.15, please clarify.

      Significance

      General Assessment: The study is well conducted and addresses MuSC functional heterogeneity. There seems to be substantial evidence that CD90 fractionates the MuSC population and is related to proliferative capacity. Functional assessment in vivo needs some clarification with additional experiments, but the study seems promising. Also, interpretation of graphs should be updated as well since some distribution of replicates may be impacting statistical significance that can alter interpretation/outcomes.

      Advance: Again, MuSC heterogeneity has been an area of intense investigation for many years. The advancement would be mechanistic/functional.

      Audience: Specialized in skeletal muscle. There is potential for the CD90 fractionation to extend to other cell and tissue types, but this extent is unknown until this work is expanded.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: Kheir and colleagues found that CD90 expression levels could be utilized to divide MuSCs into two populations. The authors demonstrated that CD90+ve MuSC became activated state faster than CD90-ve cells. Mechanistically, AMPK is more activated in CD90+ve cells in response to niche loss. To suppress the high responsiveness to activation signalings, CD90+ ve cells highly express ColVI and CALCR than CD90-ve cells. The authors carefully examined the heterogeneous expression of CD90 in MuSCs. Overall, however, the differences in the characteristics of CD90+ve and CD90-ve cells are small. In addition, most of the data were based on fluorescent intensity. This reviewer does not feel that this study will have a significant impact on our understanding of MuSC biology.

      Major comments:

      1. Data demonstrated the statistical differences in MuSC behaviors between CD90+ve and CD90-ve cells. However, the difference is small. For example, it is unclear whether the minimal difference in CALCR expression level between CD90+ve and CD90-ve cells gives rise to any biological difference.
      2. Negative controls of FACS analyses are required because different sizes of cells might exert different background intensities. (Figure 2I, 2L, and 6F)
      3. If CD90+ve MuSCs express Col6 higher than CD90-ve MuSCs, they should also highly express the primary target of Notch target genes, Hes1, Hey1, and HeyL. The authors should examine the expression levels of these genes.
      4. As described above, the quantifications of many results, including MyoD, were based on the fluorescent intensity. I know the difficulty of preparing enough cells for experiments, but the authors need to present data supporting these results.
      5. Figure 7G-H; More quantitative analyses should be included. In addition, the sample number was different between Fig7E and H. There is no significant difference in the CD90 expression in Fig7G. The authors need to confirm the reproductivity.

      Minor comments:

      1. Figure S4. The authors need to show evidence that these cells are proliferating. Without the evidence, CD90 expression my just be retained in non-dividing cells. If it is difficult, the results should be removed.
      2. Heterogeneity in cell cycle progression in MuSCs is well documented as fast and slow dividing cells. This reviewer recommends discussing the relevance of CD90 expression to these reports. PMID: 22349695 PMID: 8608871

      Significance

      The heterogeneity of muscle stem cells is of great interest to muscle stem cell biologists, including this reviewer. The orchestrated expression and regulation of activation and quiescence pathways is conceptually new. Several molecules are heterogeneously expressed in muscle stem cells, but the expression pattern of CD90 does not correlate with them. However, as noted above, the difference between CD90-positive and CD90-negative cells is relatively small.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Programmed cell death is prominent in developing nervous systems across evolution, but its function remains obscure. Recent work suggests that it might impact behavior, but an examination of its effects on behavior and underlying neuronal circuits in intact organisms has not been determined. In this manuscript we report that programmed cell death sculpts the developing nervous system and shapes innate behavior. Using synaptic labeling, in vivo calcium imaging, targeted rescue of programmed cell death, and automated high-resolution analysis of cell death mutants, we find that loss of programmed cell death alters animal behavior. These findings reveal that neuronal cell death during development provides a reservoir of fates and circuit connections that could be accessed on evolutionary time scales to modify innate behavioral programs. Our manuscript thus answers one of the major outstanding questions in developmental neuroscience—why programmed cell death is so prevalent—by identifying consequences for brain function at the subcellular, cellular, circuit, and behavioral level. This study will be of interest to those interested in evolution of the nervous system and behavior, developmental biology, and neural circuit development.

      We thank the reviewers for their careful attention to the manuscript. Both reviewers were enthusiastic about the work. Here we address their suggestions. As noted below, we have already addressed most of their points, and we discuss in detail the remaining point—whether it is possible to perform experiments for a more specific targeting of the undead RIM cell death event to provide additional evidence for its role in altering reversal behavior.

      2. Description of the planned revisions

      *Reviewer 1: “1. The argument that that differences in reversal behavior are likely attributable to the difference in RIM neuron numbers in the ced-3 rescue studies is very plausible. Nonethless, there remains the possibility that for some reason in animals with 4 RIMs there may be a more global effect on the fate of cells slated to die, unrelated to the number of RIMs. I think there are two ways to test this. (1) quantify the behavior in 2- vs. 4- RIM neurons in animals also containing a marker for other undead neurons, and see if there is any correlation between 4 RIMs and survival of unrelated neurons (but preferably reasonably closely related by lineage- in case that's the issue). (2) Since the authors are able to distinguish the undead cells, can they perform laser ablations on these cells and assess whether behavior is restored to normal values?” *

      • *We agree that this point is already very plausible. We also appreciate the reviewer’s suggestions on how to extend this conclusion.

      Regarding suggestion (1): Unfortunately there is not a reliable marker for undead neurons (although a current project in the lab is indeed to develop one). However, we note that the undead RIM sister cells adopt a RIM neuron fate in 96% of ced-3 mutants, while with other undead cells investigated neuron fate adoption ranged from 59% (ASEL) to 77% (ASER). This suggests that the undead RIM fate adoption is not strongly correlated with the fates of other undead cells.

      Regarding suggestion (2): We attempted to perform laser ablation of undead RIM neurons in ced-3 mutants, but we could not overcome the technical hurdles (despite our lab’s expertise in laser axotomy). We found that we could not reliably remove both undead RIMs without damaging the wildtype RIM that is in close proximity, especially in the quantities of animals necessary for behavioral experiments.

      As an alternative, we plan to perform more targeted experiments to manipulate cell death in the undead RIM to address the points raised by both reviewers. Our goal is to generate two strains. In one, programmed cell death is prevented specifically in the RIM neurons in wild type animals. We hope to achieve this by either transgenic expression of a gain-of-function mutation of ced-9, or else by RIM-specific RNAi against egl-1, ced-3 or ced-4. To do this we will use the RIM promoter tdc-1, which is confined to RIM and RIC. The second strain will allow cell death to occur only in RIM (and RIC) in animals that otherwise have no cell death. Here, we will drive wild-type ced-3 or ced-4 under the tdc-1 promoter in the corresponding mutant background.

      We note 2 caveats for both of these approaches: 1) RIC also has an undead sister; 2) Most probably, the tdc-1 promoter will not be active in time to block cell death. Caveat #2 is actually the reason why we did not do these experiments initially (instead we used the most specific promoter we could find that is expressed early in the RIM lineage, before RIM is born).

      However, we agree that if successful these experiments would complement the existing experiments, and we will build all these strains.

      Reviewer 2: “Mosaic rescue of RIM via stochastic loss of a rescue array helped demonstrate the contribution RIMu have to the locomotor phenotype. As the authors emphasise these animals have many other undead cells (outside of the reverse network). A conditional rescue of only the RIMu would greatly improve the strength of the claims made. Would a conditional RIM egl-1 knockdown (via RNAi) be possible to selectively inhibit apoptosis in those neurons. This experiment should be considered OPTIONAL. It may be that such specific promoters do not allow for egl-1 RNAi to function at the right time to rescue death.”

      • *We appreciate the reviewer’s suggestion. As stated above, we are working to perform an expanded version of these exact experiments, as well as their converse. However, as the reviewer notes, it is very possible that the timing of expression will prevent these approaches from working (Caveat #2 above).

      Reviewer 2: There is a slight issue with interpretation of the data with the mosaic GLR-1::tagRFP Fig 2M which reveals the postsynaptic compartment of one RIM even though there are two present. There seems to be no obvious apposition between pre/post and they somewhat seem to be floating in space. Why is this the case? One would have imagined that the structures in Fig 2L would be tiled composites of both AIB & RIM pre and postsynaptic elements coalescing. Can the authors provide an alternative explanation for this phenotype. Nevertheless, the data on Fig 2L seems solid.. that is animals with extra undead RIM cells have additional cell-type specific synaptic terminals

      We have selected a different micrograph that is more representative of the RIM post-synapses in ced-3 mutants. In this animal, the array labeling the post-synapses in RIM has been lost from one of the two RIM neurons, making it easier to discern that the post-synapses are apposed to the AIM pre-synaptic marker (Fig 1M).

      Reviewer 2: Clarity should be improved around the use of 'expected number' in figure 1. The description of the metric 'The 'expected number' is defined as the number of neurons of the type present in wild-type animals, plus the number of lineage-proximate undead cells.' suggests that expected (blue) regions of pie charts represents lineages with expected sum total of wt and extra undead cells. However, in reference to panel H 'The wild-type animal has two RIM neurons, and the ced-3(n717) animal has two additional RIMlike cells and is counted as contributing to the orange "more than expected" sector in panel (A)' it is said that the animals with 2 WT accompanied by each undead sister contributes to more than expected (orange) region. These appear inconsistent. Can you qualify?

      We thank the reviewer for this point and have added a schematic to clarify the quantification of undead cell fates (Fig. 1).

      Reviewer 2: Specific observations shown in supplemental data SI-L despite being cited in the text is not explained or formally referenced. The details of these panels should either be briefly explained/their inclusion qualified in the text or simply remove from the figure

      We have added reference to these figures in the main text “Undead cells are even capable of producing complex morphology, such as the highly branched dendrites of the PVD neurons (Figure S1I-L).” (p. 3)

      Reviewer 2: The dual image photomicrographs could be in green/ magenta or red/cyan to make colourblind friendly.

      We have updated micrograph colors to be colorblind friendly (Fig 1K-M, S1L).

      Reviewer 2: Do the authors have data with the pRIMtagRFP egl-nucGFP. If they do it would be useful to show it.

      We have added a micrograph of egl-1::GFP and RIM labeled using NeuroPAL (Fig. S2A).

      Reviewer 1: 2. The authors speculate, if I understand correctly, that the mechanism by which reversal frequencies are decreased in 4 RIM animals may be that the reversal state is stabilized, resulting in longer reversals and consequently fewer reversal events. This is a nice model that is testable. The authors could, for example, examine the connections of RIM neurons to the AVA neuron, a main command interneuron for reversal initiation, and assess whether there are indeed more such synapses. Furthermore, the authors can assess whether the frequency of AVA firing is decreased. Of course, there are other plausible mechanisms involving connectivity of other neurons onto AVA which could explain the phenomenon. The authors may wish to add a comment regarding this in the discussion.

      • *

      We thank the reviewer for this suggestion. There are multiple postsynaptic receptors expressed in AVA for RIM neurotransmitters and the contribution of each to reversal behavior is still being debated, making it challenging to dissect the contribution of each of these to the effects on reversal behavior mediated by the undead RIM. Given this, we believe that addressing this point experimentally is beyond the scope of this paper. We have added a sentence in the discussion commenting on this as a future direction for this work “The mechanism of the downstream circuit mediating the effects of the undead RIM could be determined through quantification of AVA postsynaptic receptors and examining reversal behavior of cell death mutants with knockouts of AVA receptors.”

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The blockade of programmed cell death (PCD) results in the generation of supernumerary neurons from the rescue of normally dying progeny within a lineage. Many of these are the product of a lineage branch that produce a single surviving neuronal daughter.

      The authors show that for many such divisions the rescued sibling expresses fate markers associated with their normally surviving sibling. The authors use the RIM neurons and their role in reversal behaviour as a model. By generating supernumerary RIM interneurons, they show that these undead neurons correlate with an increase in synaptic terminals, with their normal presynaptic partners (AIBs), and that these same supernumerary neurons have an activity pattern preceding reversal events.

      Using mosaic experiments, based around the stochastic differences in the segregation of an extrachromosomal array, the authors are able to demonstrate that a strong component to alterations in locomotor behaviour is likely due to the supernumerary undead RIMu neurons.

      To address the impact of blockade of PCD more broadly the authors assayed 4 independent PCD blocking alleles (2x ced-3, 2x ced-4) and show that the rescue of many neurons alters normal turning dynamics associated with foraging and free feeding behavioural states.

      Interestingly, subtle differences in the behaviours of ced-3 mutant animals and ced-4 mutant animals may also hint at the broader significance of these genes beyond simply controlling the on/off PCD in these lineages. One possible issue with work published in this paradigm is that by generating undead neurons, by removing caspase activity, the intervention may also impact non-apoptotic caspase function.

      The study presented does not pose any issues relating to reproducibility or associated statistical analysis.

      Major comments:

      • Mosaic rescue of RIM via stochastic loss of a rescue array helped demonstrate the contribution RIMu have to the locomotor phenotype. As the authors emphasise these animals have many other undead cells (outside of the reverse network). A conditional rescue of only the RIMu would greatly improve the strength of the claims made. Would a conditional RIM egl-1 knockdown (via RNAi) be possible to selectively inhibit apoptosis in those neurons. This experiment should be considered OPTIONAL. It may be that such specific promoters do not allow for egl-1 RNAi to function at the right time to rescue death.
      • The changes in the RIM/AIB synaptic organisation and the correlation of observed RIM/RIMu activity coincident with reverse locomotor bouts strongly supports the assertion that these two features are causally linked.
      • One interpretation is that when the additional undead RIM neuron is present the cell-type specific connectivities are recapitulated i.e. many of the pre and post-synaptic elements of RIM/AIB look apposed see Fig 2L.
      • There is a slight issue with interpretation of the data with the mosaic GLR-1::tagRFP Fig 2M which reveals the postsynaptic compartment of one RIM even though there are two present. There seems to be no obvious apposition between pre/post and they somewhat seem to be floating in space. Why is this the case? One would have imagined that the structures in Fig 2L would be tiled composites of both AIB & RIM pre and postsynaptic elements coalescing. Can the authors provide an alternative explanation for this phenotype. Nevertheless, the data on Fig 2L seems solid.. that is animals with extra undead RIM cells have additional cell-type specific synaptic terminals.

      Minor comments:

      • Clarity should be improved around the use of 'expected number' in figure 1. The description of the metric 'The 'expected number' is defined as the number of neurons of the type present in wild-type animals, plus the number of lineage-proximate undead cells.' suggests that expected (blue) regions of pie charts represents lineages with expected sum total of wt and extra undead cells. However, in reference to panel H 'The wild-type animal has two RIM neurons, and the ced-3(n717) animal has two additional RIMlike cells and is counted as contributing to the orange "more than expected" sector in panel (A)' it is said that the animals with 2 WT accompanied by each undead sister contributes to more than expected (orange) region. These appear inconsistent. Can you qualify?
      • Specific observations shown in supplemental data SI-L despite being cited in the text is not explained or formally referenced. The details of these panels should either be briefly explained/their inclusion qualified in the text or simply remove from the figure
      • The dual image photomicrographs could be in green/ magenta or red/cyan to make colourblind friendly.
      • Do the authors have data with the pRIMtagRFP egl-nucGFP. If they do it would be useful to show it.

      Significance

      The work presented here complements similar studies performed on insects, illustrating that this biological motif holds true outside of that class. Showing this is a more general feature within other taxa broadens the appeal and will be of great interest to those in developmental biology, neural circuit development/function and evolutionary biology.

      Although the work does not illustrate a completely novel finding, it is rigorous and well-conceived, adding support to previous studies in the field and is an important jumping off point for future studies. The authors present compelling independent support that undead developmentally 'doomed' neurons retain the ability to differentiate, show molecular hallmarks of sibling fate, can integrate within networks and function.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting paper describing the neural circuit and behavioral consequences of blocking programmed cell death in C. elegans with a mutation in the apoptotic caspase ced-3. The authors survey several reporters for neurons whose sister cells normally die, and observe that undead cells can express markers of the normally living sister cell. They also show that ced-3 mutants exhibit a large variety of behavioral defects. They interrogate the effects of inappropriate survival of two RIM sister neurons in animals carrying a complement of 4 instead of 2 RIMs. The authors demonstrate that the presynaptic neuron AIB makes synapses onto the undead RIM neurons and show GCaMP activity in these neurons that correlates with reversal behavior, a normal function of RIM neurons. Animals with 4 RIM neurons have a reduced number of reversal events compared to wild type animals, suggesting circuit defects. Importantly, restoring ced-3 expression only in the RIM lineage partially restores reversal behavior frequency. The authors conclude that undead neurons interfere with the normal function of the nervous system by making aberrant connections that interfere with circuit activity.

      This paper is beautifully written. The logic is crystal clear, and the experiments are appropriate and rigorously executed. The conclusions are generally appropriate. I have a couple of comments that may be useful for the authors to consider:

      1. The argument that that differences in reversal behavior are likely attributable to the difference in RIM neuron numbers in the ced-3 rescue studies is very plausible. Nonethless, there remains the possibility that for some reason in animals with 4 RIMs there may be a more global effect on the fate of cells slated to die, unrelated to the number of RIMs. I think there are two ways to test this. (1) quantify the behavior in 2- vs. 4- RIM neurons in animals also containing a marker for other undead neurons, and see if there is any correlation between 4 RIMs and survival of unrelated neurons (but preferably reasonably closely related by lineage- in case that's the issue). (2) Since the authors are able to distinguish the undead cells, can they perform laser ablations on these cells and assess whether behavior is restored to normal values?
      2. The authors speculate, if I understand correctly, that the mechanism by which reversal frequencies are decreased in 4 RIM animals may be that the reversal state is stabilized, resulting in longer reversals and consequently fewer reversal events. This is a nice model that is testable. The authors could, for example, examine the connections of RIM neurons to the AVA neuron, a main command interneuron for reversal initiation, and assess whether there are indeed more such synapses. Furthermore, the authors can assess whether the frequency of AVA firing is decreased. Of course, there are other plausible mechanisms involving connectivity of other neurons onto AVA which could explain the phenomenon. The authors may wish to add a comment regarding this in the discussion.

      Significance

      This is an interesting paper describing the neural circuit and behavioral consequences of blocking programmed cell death in C. elegans with a mutation in the apoptotic caspase ced-3. The authors survey several reporters for neurons whose sister cells normally die, and observe that undead cells can express markers of the normally living sister cell. They also show that ced-3 mutants exhibit a large variety of behavioral defects. They interrogate the effects of inappropriate survival of two RIM sister neurons in animals carrying a complement of 4 instead of 2 RIMs. The authors demonstrate that the presynaptic neuron AIB makes synapses onto the undead RIM neurons and show GCaMP activity in these neurons that correlates with reversal behavior, a normal function of RIM neurons. Animals with 4 RIM neurons have a reduced number of reversal events compared to wild type animals, suggesting circuit defects. Importantly, restoring ced-3 expression only in the RIM lineage partially restores reversal behavior frequency. The authors conclude that undead neurons interfere with the normal function of the nervous system by making aberrant connections that interfere with circuit activity.

      The paper should be of interest to researchers studying neural circuit assembly and function, programmed cell death, behavior, and evolution of behavior and the nervous system.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statement We thank all three reviewers for their careful and constructive evaluation of our manuscript. We are pleased that the reviewers recognised the importance of the work we describe and found the experimental approach sound.

      This manuscript reports that undesired insertion of the plasmid backbone, including vector sequences not intended to be part of the genome edit, occurs at high frequency during CRISPR/Cas9-mediated HDR in Drosophila. We document this phenomenon across multiple independent genome editing projects, using three different plasmid backbones and targeting distinct genomic loci, demonstrating that it is not an isolated or project-specific artefact. We further introduce pVID, a new donor vector incorporating a ZsGreen negative selection marker that allows straightforward identification and exclusion of lines carrying undesired insertions, providing a practical solution to avoid this genome editing issue.

      In response to the reviewers' comments, we have revised the manuscript to: (i) correct and contextualise prior descriptions of this problem, incorporating the references suggested by Reviewer 2; (ii) add a table summarising gRNA characteristics for all editing projects; (iii) expand the discussion of the underlying DNA repair mechanisms, the potential influence of Cas9 source choice, and the relevance of the findings beyond Drosophila; (iv) confirm the stability of problematic template vector insertions across multiple generations; and (v) improve figure clarity, correct typographical errors, and clarify several passages flagged by the reviewers. All responses are described in detail below.

      1. Point-by-Point Description of the Revisions

        Reviewer 1

        Major Comment 1 — DNA repair pathways underlying backbone capture • I think the authors should discuss potential DNA repair pathways (e.g., NHEJ, MMEJ) underlying plasmid backbone capture in more detail. Did you check for knockouts within your screened transformants? That could provide insight into the underlying mechanisms.

      Response: We screened humanized TDP-43 line for tbph knockouts, since our aim was to fully knock out the Drosophila gene and insert the human ortholog. However, we did not screen any of the other lines described in the manuscript for indels caused by NHEJ, since the dsRed selection we employed would not enable us to recover lines without insertion events. We hypothesise that one of the two gRNAs used being more inefficient than the other causes a single homologous recombination event and insertion of the vector template. However, the underlying mechanism is still unclear, and could be caused by NHEJ, HDR or a combination of these mechanisms as has previously observed (44). We have expanded on potential mechanisms inducing HDR template vector insertion events in the discussion of the revised manuscript.

      Major Comment 2 — gRNA characteristics and design parameters • It would be important to describe gRNA characteristics and general design parameters (GC content, distance from cut to intended edit, homology arm length) and analyze whether these correlate with correct HDR vs. plasmid insertion. A table summarizing these details could help reveal potential trends.

      Response: At the reviewers suggestion, we have added a table (Table 1) describing the all the characteristics of the gRNAs further in the material and method section. Unfortunately though, no commonality was immediately apparent to us.

      Major Comment 3 — Single versus dual gRNA strategies • Did the authors consider exploring whether using a single gRNA reduces backbone insertion frequency compared to dual-gRNA strategies? I understand that two gRNAs are needed for your strategy, but it would be interesting to know whether these outcomes are linked to the dual-gRNA design.

      Response: As stated in the discussion, we theorize that perhaps one of the two gRNAs used in our strategies cuts more efficiently and thereby causes a single homologous recombination event and insertion of the vector template. It is possible that originally using a strategy with only one gRNA could cause less insertion of the vector template, however this may be at the cost of gene editing efficiency. Indeed, when Ge et al (17) compared using one versus two gRNAs to induce HDR, they observed more reliable repair events when two gRNAs were used.

      Major Comment 4 — Stability of backbone insertions across generations • Did you evaluate whether backbone insertions are stable across generations or prone to rearrangement?

      Response: We did keep several of the lines reported in this paper stably across multiple generations, and we have added this observation to the manuscript

      Major Comment 5 — Broader applicability in non-model organisms and therapeutic settings • A broader discussion of the potential applications of this approach in non-model insects, mammalian cells, or therapeutic settings where HDR is inefficient would be valuable.

      Response: While we only investigated this effect in the creation of CRISPR/Cas9 Drosophila melanogaster models, it is very possible that this could also affect other model organisms or cells. We encourage the use of HDR template negative selection markers in all uses of HDR-mediated CRISPR/Cas9 genome editing.

      Major Comment 6 — Cas9 promoter and expression level • The authors also mentioned using a validated Cas9 line (ref #23). What promoter drives Cas9 expression in this line? Did you consider testing different promoters? Since timing of Cas9 expression can be critical, promoter choice may have influenced the results and should be discussed.

      Response: We used the nos promoter for the expression of Cas9, as this promoter is expressed in germ cells and is known to have better efficiency than the other germline promotor like vasa (Port et al 2014, Ref #23). However, it is conceivable that the high Cas9 concentration in this line could induce a higher rate of double stranded breaks and thus template vector insertion. We agree it would be interesting to test other Cas9 sources, though this would likely come at the cost of overall editing efficiency. As we describe, the use of pVID now allows negative selection against HDR template vector insertion even with this Cas9 source. We have expanded upon the potential use of other Cas9 sources in the revised discussion.

      Reviewer 2

      Major comments

      None

      Minor Comment 1 — Line 38: prior descriptions of backbone insertion in Drosophila Line 38: "this type of unwanted template vector insertion in the case of Drosophila genome editing has to our knowledge not been previously described." Insertion of vector sequences after CRISPR editing in Drosophila and strategies to mitigate such events have been previously described in multiple studies. The authors need to incorporate these into their manuscript. https://doi.org/10.1242/bio.20147682, https://doi.org/10.1080/19336934.2020.1832416, https://doi.org/10.1534/g3.116.032557.

      Response: We are very grateful to the reviewer for pointing out these prior observations of vector insertion events of which we were not aware. This prior work has now been fully incorporated and referenced in the revised manuscript, and we have removed this erroneous statement. We feel this manuscript validates and quantifies the extent of HDR template insertion across multiple genome editing strategies and templates plus, with pVID, provides a solution to this vexing problem.

      Minor Comment 2 — Line 79: PAM sequence sentence I have difficulties understanding the following sentence: Line 79: "At this location, on both sides of the insertion, the PAM sequence of the target region was edited to match the PAM sequence of the template donor plasmid." I assume what is meant here is that in the donor vector the PAM sequence was mutated to prevent recutting, but that means this sequence is no longer a PAM. Please rephrase for added clarity.

      Response: The PAM sequence was indeed edited in the template donor plasmid to prevent re-cutting, and we are referring to this edited version of the PAM sequence in this sentence. We edited this sentence this to clarify that the PAM sequences have been edited.

      Minor Comment 3 — Figure 2: panel D arrangement In Figure 2 panel D is arranged between panels E and F.

      Response: Thank you for pointing this out. We have corrected this error.

      Minor Comment 4 — Primer positions in figures In Figure 2 it would be useful to also indicate the position of the primers used in 2d in the schematic in 2e. The same applies to Fig. 3a and 4a.

      Response: We have added the position of the primers in figure 2. Since the primers are targeting the backbone of the plasmid commonly in all projects included in this manuscript, we have chosen to only include one figure of this (figure 2).

      Minor Comment 5 — Lines 89–90: duplicated sentence Lines 89, 90: Duplication of the same sentence.

      Response: Thank you, we have corrected this mistake.

      Minor Comment 6 — VGAT editing: consecutive editing and sgRNA placement Editing of the VGAT gene: In this case correct editing and plasmid insertions could be found on the same chromosomes. This might be caused by concatemer formation of repair intermediates (as has been described in multiple systems) or by consecutive editing events. Can you please specify whether the donor vector was designed to prevent consecutive editing? I'm also a bit confused about the locations of the sgRNA target sites according to Fig. 3a. It appears that part of the insertion (i.e. the ALFA tag) was encoded on the homology arm and not between the target sites. While such strategies have been described, they are often avoided as the efficiency of insertion decreases with increasing distance to the cut site. Was it not possible to us a sgRNA better matching the insertion cassette?

      Response: For Vgat genome editing, we followed an existing strategy that has been proven effective, reusing the same gRNAs and overall approach to replace the 9×V5 tag with a 1×ALFA tag (Certel et al. 2022, Ref #28)

      Minor Comment 7 — Line 133: mini-white marker unreliability Line 133: Please describe why the mini-white marker was unreliable.

      Response: In our first design of the pVID vector, we used mini-white as the negative selection marker. However in a number of white eyed lines, we could still confirm the undesired insertion of the HDR template vector. We speculate that expression of mini-white (which we confirmed was not mutated) was repressed in these lines by an unknown mechanism. Since (Nyberg et al. 2020 , Ref #35) also proposed using mini-white as a negative vector selection marker, we wanted to mention this problem with mini-white negative selection, though we remain unsure of the exact cause. In any case, the use of exogenous ZsGreen in pVID as described in the manuscript fully resolved the issue allowing reliable detection of template vector insertion events as we describe.

      Minor Comment 8 — Line 161: "varying frequency" Not sure I understand the sentence in line 161: If 54% of lines had vector insertion, what does the "varying frequency" refer to?

      Response: We have edited this sentence to clarify that 54% of lines had vector insertion.

      Minor Comment 9 — pVID availability in methods Consider highlighting the availability of pVID also in the methods section that described this plasmid.

      Response: This has been added to the methods section.

      Reviewer 3 No edits suggested.

      We thank Reviewer 3 for their positive assessment of the manuscript and for confirming that no revisions are required.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Highly frequent undesired insertional mutagenesis during Drosophila genome editing by Kallstig et al. revolves around Homology-Directed Repair (HDR) and the surprisingly high frequency of plasmid backbone insertions into the genome.

      In brief, the authors describe three independent experiments in which the intended homology regions were inserted together with plasmid backbone sequences into the Drosophila genome. Each experiment was designed with a slightly different setup: the first aimed to generate a humanized version of the TAR DNA-binding protein 43 (hTDP-43), while the second introduced an alpha tag into the Vesicular GABA transporter (VGAT) gene. In the first experiment, the pCR4 vector served as the backbone, whereas the second experiment relied on the pHSG298 vector. Both experiments resulted in relatively high frequencies of incorrectly edited genomes - 18% and even 66%, respectively. The authors hypothesized that the rate of undesired events could be even higher if the targeted gene is non-essential. To test this, the third experiment focused on mutagenesis of the Glutamate Receptor IIA (GluRIIA) gene, which is homozygous viable even in protein-null mutants. Indeed, the frequency of incorrect edits was approximately 11:1 (more than 90%). These findings suggest that plasmid backbone insertion is a common and important issue in HDR-based genome editing in Drosophila.

      To address this problem, the authors designed a new vector. While the classical eye color marker (e.g., dsRED) serves for positive identification of HDR recombination, a second fluorescent marker (ZsGreen), encoded in the plasmid backbone and also expressed in the compound eye, enables clear detection of undesired plasmid backbone insertions.

      The study is clearly written, and the plasmids are sufficiently well described in the figures. The reproducibility is somewhat limited by the use of different plasmids in combination with different target genes. Nevertheless, the number of analyzed insertions was high enough to convincingly illustrate the issue.

      Significance

      I find this manuscript to be a valuable description of an existing problem, together with a potentially efficient method for detecting undesired plasmid insertions. From an experimental perspective, I consider the comparison of three different vector backbones combined with different target genes to be rather difficult. On the other hand, as an experimental biologist, I completely understand the logic and the history of the problem-solving process. Undesired insertions were identified by different approaches (PCR and sequencing), and the authors clearly kept this issue in mind. When the problem persisted in the second experiment, and was even more pronounced in the third experiment (involving a non-lethal gene), they developed a vector that makes the screening process more efficient. Altogether this is a valuable technical study worth of reporting.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary In this manuscript Källstig, Ruchti, McCabe and colleagues report frequent undesired editing outcomes after CRISPR gene knock-ins in Drosophila. Using Cas9 for the targeted induction of DNA double strand breaks and plasmids with long homology arms as donor molecules, they find that the whole plasmid inserts with high frequency at multiple loci. To detect such events they generate a plasmid with a dominant marker encoded on the plasmid backbone, which can be used to enrich for correct insertions by negative selection.

      Major comments

      Minor comments

      Line 38: "this type of unwanted template vector insertion in the case of Drosophila genome editing has to our knowledge not been previously described." Insertion of vector sequences after CRISPR editing in Drosophila and strategies to mitigate such events have been previously described in multiple studies: https://doi.org/10.1242/bio.20147682, https://doi.org/10.1080/19336934.2020.1832416, https://doi.org/10.1534/g3.116.032557. The authors need to incorporate these into their manuscript.

      I have difficulties understanding the following sentence: Line 79: "At this location, on both sides of the insertion, the PAM sequence of the target region was edited to match the PAM sequence of the template donor plasmid." I assume what is meant here is that in the donor vector the PAM sequence was mutated to prevent recutting, but that means this sequence is no longer a PAM. Please rephrase for added clarity.

      In Figure 2 panel D is arranged between panels E and F.

      In Figure 2 it would be useful to also indicate the position of the primers used in 2d in the schematic in 2e. The same applies to Fig. 3a and 4a.

      Lines 89, 90: Duplication of the same sentence.

      Editing of the VGAT gene: In this case correct editing and plasmid insertions could be found on the same chromosomes. This might be caused by concatemer formation of repair intermediates (as has been described in multiple systems) or by consecutive editing events. Can you please specify whether the donor vector was designed to prevent consecutive editing? I'm also a bit confused about the locations of the sgRNA target sites according to Fig. 3a. It appears that part of the insertion (i.e. the ALFA tag) was encoded on the homology arm and not between the target sites. While such strategies have been described, they are often avoided as the efficiency of insertion decreases with increasing distance to the cut site. Was it not possible to us a sgRNA better matching the insertion cassette?

      Line 133: Please describe why the mini-white marker was unreliable.

      Not sure I understand the sentence in line 161: If 54% of lines had vector insertion, what does the "varying frequency" refer to?

      Consider highlighting the availability of pVID also in the methods section that described this plasmid.

      Significance

      This manuscript describes vector backbone insertions as a frequent complication of CRISPR knock-in experiments in Drosophila and introduces a cloning vector with a selectable marker on the plasmid backbone that allows counter selection of such undesired events. The manuscript is very well written and the experiments are overall well designed.

      Insertion of vector sequences during homologous recombination (often referred to as "ends-in" recombination events) has been described on multiple occasions in a wide variety of model systems. Also in Drosophila, the system used here, such events have been described by multiple groups (see comments above). Furthermore, plasmids designed to allow to counter select for such events have also been described previously (e.g. Addgene plasmids 157991, 80801).

      In summary, this manuscript highlights once more an important complication in genome engineering experiments, but does not significantly advance the knowledge in the field beyond the existing literature and the described plasmid is largely redundant with preexisting plasmids designed for the same purpose. While this overall severely limits the significance of this work, it does provide important replication of previous work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      CRISPR/Cas9 genome editing has improved the ability to introduce precise genetic modifications in multiple organisms such as Drosophila melanogaster. By coupling Cas9-induced double-strand breaks with homology-directed repair (HDR), researchers can replace, insert, or delete genomic sequences with high specificity.

      In this work, the authors explore significant concerns about the fidelity and outcomes of HDR-based editing. They identify a recurring issue since unintended insertions of the entire donor template vector into the genome was observed. These undesired events are observed across multiple genes, indicating that the problem is not locus- or construct-specific. These insertions can occur at high frequencies, complicating efforts to establish accurate transgenic lines. They not only mask intended edits but may also introduce unpredictable phenotypes unrelated to the desired genetic modification.

      The authors addressed the problem of frequent donor plasmid insertions during CRISPR/Cas9 HDR in Drosophila by redesigning their HDR template vectors. They incorporated a GFP marker into the plasmid backbone alongside a DsRed cassette. This design allowed them to distinguish correct HDR events, which carried only DsRed, from aberrant plasmid integrations, which carried both DsRed and GFP. By screening flies for marker expression, they could rapidly identify and exclude incorrect insertions.

      Please, see below my comments:

      • I think the authors should discuss potential DNA repair pathways (e.g., NHEJ, MMEJ) underlying plasmid backbone capture in more detail. Did you check for knockouts within your screened transformants? That could provide insight into the underlying mechanisms.
      • It would be important to describe gRNA characteristics and general design parameters (GC content, distance from cut to intended edit, homology arm length) and analyze whether these correlate with correct HDR vs. plasmid insertion. A table summarizing these details could help reveal potential trends.
      • Did the authors consider exploring whether using a single gRNA reduces backbone insertion frequency compared to dual-gRNA strategies? I understand that two gRNAs are needed for your strategy, but it would be interesting to know whether these outcomes are linked to the dual-gRNA design.
      • Did you evaluate whether backbone insertions are stable across generations or prone to rearrangement?
      • A broader discussion of the potential applications of this approach in non-model insects, mammalian cells, or therapeutic settings where HDR is inefficient would be valuable.
      • The authors also mentioned using a validated Cas9 line (ref #23). What promoter drives Cas9 expression in this line? Did you consider testing different promoters? Since timing of Cas9 expression can be critical, promoter choice may have influenced the results and should be discussed.

      Significance

      This paper will appeal primarily to researchers in the fields of functional genomics, insect genetics, and genome engineering, particularly those working with Drosophila or other model organisms where CRISPR/Cas9 is widely used. It is also of interest to scientists engaged in vector biology, agricultural pest control, and translational applications of genome editing, as the findings touch on broader issues of editing accuracy and unintended repair outcomes.

      The main advance of the study is the clear demonstration that unintended donor plasmid backbone insertions are not rare artifacts, but frequent and systematic events during CRISPR/Cas9-mediated HDR in Drosophila. By integrating a GFP marker into the plasmid backbone alongside the intended DsRed marker, the authors provide a straightforward and practical method to identify, separate, and exclude these erroneous events. This approach both highlights the hidden pitfalls of HDR-based editing and offers an effective solution, thereby improving the reliability of CRISPR applications. Beyond Drosophila, the work advances the field by underscoring the need for careful design and validation of donor constructs, with potential implications for genome editing strategies in other organisms where HDR efficiency and fidelity remain key challenges.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1 1. The code used for simulations is available on a public repository, but it does not directly ensure that results are reproducible. To do so would require a clear step-by-step guide referring the user to the specific pieces of code which have been used for the results and figures presented in the paper. At the moment, I could not find any such guide and the large number of scripts, executables and jupyter notebooks are not clearly linked to the paper's contents

      We agree that the code should be as accessible as possible for reproducing the results. We have updated the public repository (linked given in the 'data and code availability' section of the manuscript, lines 350-352) to include the SLURM job scripts used to run the evolutionary simulations and analyses, together with an overview of which scripts and notebooks were used for creating the figures.

      2. The methods themselves involve a number of arbitrary choices. Though this is understandable given the nature of the work, one aspect in particular that would deserve better clarity is the modeling of gene network dynamics. The stochastic model (l.516 & following) involves a nesting of "Hill-like" terms (those in Eqs. (7) and (11)) which is unusual and given without justification. There should be some explanation of how this approach relates to standard approaches such as those reviewed e.g. in: Bintu et al. Current opinion in genetics & development 15.2 (2005): 116-124.

      We agree that the formulation of the developmental model requires clearer justification and contextualisation. We have added a citation to situate our implementation within existing modelling frameworks, and a brief explanation of the choice for Hill equations in the Methods section (lines 577-579).

      1. It is also unclear at the moment how exactly the GRN dynamics is used; are time-stepping algorithms used until the system reaches a stationary regime? If so, how is stationarity assessed? This needs to be explained both in the main text and in the methods. The table of parameters suggests that there was a cut-off time, but there is no explanation whatsoever about the state of the dynamics at this time.

      We have revised the main text to briefly explain how the developmental dynamics are implemented (lines 88-90) and expanded the Methods section (Gene expression and regulation in the developmental model) to describe the integration procedure in detail (lines 617-620).

      The GRN dynamics are modelled as stochastic differential equations (SDEs), which are numerically integrated for a fixed developmental duration of T_D = 140 hours, regardless of whether a stationary state is reached.

      Instead, stationarity is indirectly favored by the fitness function. Fitness is calculated as the time average of the phenotype (protein states) over a window at the end of development (Equation 23 in the Methods). As a result, GRNs that exhibit large fluctuations or ongoing transient dynamics during this evaluation window tend to have lower fitness (and in turn, reproduction rate) than GRNs that have stabilised their expression patterns. We now mention this in the model introduction of the results section (lines 98-99).

      As a result of this, we observe that the vast majority of evolved GRNs reach a stable gene expression state by the end of development (aside from small fluctuations as expected from the SDEs).

      1. Related to the previous point, the table of parameters (Table S1) is provided without any explanation; through what process (exploratory, literature review, trial and error...) where the values selected? As there been any type of sensitivity analysis?

      We have clarified in the revised manuscript how each group of parameters was chosen (lines 618-620 and 744-746). In brief:

      Developmental time parameters (e.g., integration time, diffusion coefficient) were set to roughly match the developmental window of H. trionum from stage 0 to stage 2 (~150 hours; Riglet et al. 2024), during which pre-patterning is established. Molecular concentrations are expressed in arbitrary units Evolutionary parameters (e.g., mutation rates) are based on previous published work using this modeling framework and were slightly adjusted during an initial exploratory phase to ensure stable evolutionary dynamics. We have added citations for this. We have not performed a full global sensitivity analysis across all parameters. Such an analysis would be computationally expensive given the cost of running evolutionary simulations and the difficulty of assessing parameter effects in this multi-scale system. Importantly, the core GRN parameters (expression rates, interaction topology, and interaction strengths) are evolvable rather than fixed. We have conducted sensitivity analyses at the level of individual evolved GRNs, but a systematic analysis is beyond the scope of this paper.

      Minor Comments

      1. The fitness function used in simulations specifically encodes the desired pattern, with two zones having differential gene expression. This allows the artificial selection to evolve towards such patterns, as expected, but it is not entirely clear how this relates to natural selection itself. At the very start of the paper, the authors briefly review some possible sources of selective pressure for flowers to exhibit patterns such as bullseye, among others. None of the selective factors would likely act on the plants as a direct incentive for two regions, as specified in the cost function. Instead, one may expect a more high level criterion, such as "conspicuousness" for a pollinator, for instance. This is admittedly not naturally represented as a fitness function, but the choice of this function definitely influences the outcomes of a simulation. Some further numerical experiments may allow to demonstrate that the exact cost function is not critical for the findings of the paper, but I understand they would likely be computationally costly, to the point of unfeasibility. This limitation should be mentioned at least.

      We agree that natural selection acts on higher-level criteria such as pollinator attraction or conspicuousness rather than a predefined measure like "two distinct regions." However, our goal in this study is specifically to understand how the bullseye pattern in particular is produced, motivated by comparison to Hibiscus and other angiosperms where this pattern has documented adaptive relevance. The fitness function was therefore designed to ensure this particular pattern evolved, which results in evolving between-level novelty rather than constructive novelty (as defined in Colizzi et al., Essays Biochem 2022: of interest here is the evolved dynamics of development, not the resulting pattern). In this way, the fitness function serves as a proxy for selection on floral patterning. We have clarified this rationale more explicitly in the Results section (lines 97-98).

      The choice of fitness function does influence simulation outcomes. Within the scope of selecting for a bullseye pattern, we previously ran simulations where bullseye size was fixed rather than dynamic, and boundary cell types still evolved in those cases. This suggests our findings are robust across variations of the bullseye fitness function. Of course, selecting for a more abstract ecological criterion such as "conspicuousness" rather than a distinct spatial pattern would affect outcomes more substantially. However, translating such high-level criterion into a quantitative fitness function is a non-trivial challenge and outside the scope of this study. We have added a note on this point in the Methods section on the fitness function (lines 687-691).

      1. The number of genes used in the simulations is very small in comparison to real organisms. This is clearly justified by the complexity of the work, but one wonders if simulations could be made more efficient by using a much simplified approach for the gene network dynamics. At the time scales of interest, it seems that the use of SDEs and the numerical intricacies they require might be an unnecessary burden. Have the authors considered a much simpler approach, for instance based on Boolean models? Since the study only uses static tissues, all the GRN dynamics could be by-passed, determining steady states very quickly and using them to determine fitness. If this saved significant computational time, this would allow a more comprehensive survey of the "purely genetic" part of the model.

      While the number of genes may indeed be indeed small compared to real organisms, our simulations should be viewed as operating on subnetworks that form part of a much larger developmental GRN. This is a common approach in modelling the evolution of developmental processes, which we now highlight in the methods section. Furthermore, we find that the functional part of the GRN (which we identify by pruning away the redundant genes and interactions) always uses only a subset of the gene types, showing that we provide sufficient degrees of freedom for the evolutionary process to find a solution. We now also make note of this in a new figure (Figure S12) where we explain the pruning algorithm.

      We agree that simplified representations of the GRN, such as Boolean models or direct steady-state mappings, could substantially reduce computational cost. However, the use of stochastic differential equations (SDEs) in the present study is deliberate. Continuous, stochastic GRN dynamics allow us to capture key features that would be difficult or impossible to represent in Boolean or purely steady-state frameworks. In particular, they enable (i) gradual spatial distributions of morphogens, which are central to pattern formation, (ii) explicit treatment of gene expression noise, and (iii) consider and analyse the developmental dynamics in detail.

      Finally, in response to Reviewer 2's comment 1, we show all evolved networks (Figure S3 & S4) and perform a GRN motif comparison between noisy and deterministic simulations (Figure S15) to provide more information about the genetic part of the model.

      _Reviewer 2_

      1. There is a major missed opportunity to analyze the evolved networks. Only one of the 30 GRNs is analyzed in figure 4. Please add further analysis of the GRNs from all the populations. Within a population after 30K generations, how much variation is there in the GRNs of individuals? How similar are the optimal fitness evolved GRNs across all 35 populations? Are there common motifs across networks? Is there always an antagonism between proximal and distal proteins somewhere in the network? A lot of previous work on GRNs has established the function of common motifs, and these should be analyzed. Please provide all 30 gene regulatory networks in the supplement.

      We have substantially expanded the analysis of evolved networks across all populations. Specifically, we now (i) provide two supplementary figures showing the final pruned GRNs from all 35 simulations (Figures S3 & S4), and (ii) quantify motif frequencies across all evolved networks and compare motif distributions between GRNs evolved with and without molecular noise (Figure S15). This new analysis is summarised in a dedicated Results paragraph where we identify regulatory asymmetries and condition-dependent differences in feedback architecture, including changes in abundance of mutual inhibition and positive autoregulation (lines 233-239).

      We find that, while the evolved maximum fitnesses are very similar across simulations (Fig. 2Ai), the networks are highly variable. Nevertheless, the motif analysis shows some trends that differ between the noise and no-noise simulations, such as a bias towards mutual inhibition between PROX and DIST in the no-noise compared to the noise simulations.

      As to the variation within a population: we find that at any timepoint, all individuals are descended from a common ancestor that lived on average ~600 generations back, meaning that they form a single (quasi)species. We therefore analyse a single, highly fit individual at the last timepoint.

      1. The purpose and significance of examining the evolutionary lineage is not clear. Please explain your logic. This is most important for Figure 5 where it becomes clear that the boundary cells are often formed transiently in the evolution of the GRN. If this boundary cell type does not persist, how can it help the petal generate a bullseye. What happens after the boundary cell type is lost? Has the GRN evolved into a more stable place where it no longer needs the boundary? In several instances it looks like they come and go many times. Please explain how these transient boundary cells in the evolutionary lineage can make a difference. This point also comes up in lines 113-115 "For each simulation, we traced back the ancestral lineage of the final fittest individual and sampled 12 of its ancestors at evenly spaced generational intervals, performing this analysis on each sampled ancestor." I could understand if the boundary cell type were developmentally transient, but I have a hard time what its significance is since it is evolutionarily transient.

      The persistence of the boundary cell type over evolutionary time is used as a signal for its functional role in establishing the bullseye pattern. We observe that mostly two extremes occur: boundary cell types can be conserved over long evolutionary periods, or they can be highly transient. In our simulations, boundary cell types that are functionally important tend to persist, whereas the ones that are not involved in producing the bullseye pattern appear only transiently. The fact that both cases can occur suggests that boundary cell types are a "free" or easily accessible feature during the evolution of this patterning system: they can arise repeatedly without being strictly required, but may nonetheless become functionalised under certain evolutionary trajectories (see also our discussion of the Mimulus leaf stripe). We have added more explanation on the logic of examining the evolutionary lineage at the beginning of the results section related to Figure 5 (lines 205-209 and caption of Figure 5).

      To further clarify this point, we have added a supplementary figure (Figure S16) focusing on a deterministic simulation with a highly evolutionarily transient boundary cell type. By identifying the GRN mutations associated with the (re-)appearance of the boundary, we show that the patterning mechanism producing the bullseye slowly mutates while preserving the bullseye, while the mutational neighbourhood of the GRN contains diverse mutations that generate boundary cell types. In this case, boundary cells arise independently through distinct mutations rather than repeated rediscovery of a single change, explaining both their frequent appearance and their lack of long-term evolutionary stability.

      1. It is worth saying more about how the 9 lineages without a boundary cell types manage to make a robust bull's eye pattern because this is also interesting.

      This is indeed a good idea, we have carried out an analysis similar to that in Figure 4 for a GRN from a lineage without a boundary cell type and included it as a supplementary figure (Figure S11).

      4. How were 12 proteins chosen for the network, as opposed to 6 or 20 for instance? In the network pruning, it seems like fewer proteins are required. How many proteins are required to produce a bulls eye pattern?

      This choice is indeed somewhat arbitrary. We settled on 12 gene types to provide enough degrees of freedom while also keeping the evolutionary simulations computationally feasible. In practice, we find that pruned GRNs typically only use a subset of the 12 gene types, suggesting that the system has enough degrees of freedom to produce the bullseye pattern. For example, the smallest networks that evolved (after pruning) have 5 genes in the deterministic model and 7 in the noisy model.

      To clarify this choice, we now added a brief mention of these considerations to the relevant methods section (lines 641-643).

      Minor Comments

      1. The title needs to be changed to include computational modeling or simulation because otherwise the current version of the title implies that these boundary cell types are found in plant species evolution.

      We agree and have renamed the paper "Computational Model of Flower Pattern Evolution Predicts Spontaneous Emergence of Boundary Cell Types Across Petal Epidermis."

      1. Line 103 - 106 "We found that over a third of all simulations evolved a bullseye size of approximately 50% of the petal's central height (Figure 2A.ii). This indicates a tendency for simulations to converge toward these proportions, possibly due to the interaction between the patterning signal distribution and the tissue geometry." The phrasing here is confusing. Which proportions does "these proportions" refer to? Presumably, 50% from the preceding sentence. But the second proportion is not clear from the text. Maybe it is the peak at approximately 65% seen in the graph. Please clarify in the text.

      The 50% figure refers to the bin with the highest peak in Figure 2A.ii, reflecting a bias toward certain bullseye proportions rather than a uniform distribution across all possible sizes. We have rewritten the sentence to clarify this (lines 109-112): "This indicates a tendency for simulations to converge towards certain proportions more than others, possibly due to the interaction between the patterning signal distribution and the tissue geometry"

      1. Line 118 "To further explore cell identity in the third cluster, we analysed the gene expression profiles of the three identified cell types." It is not clear what the third cluster refers to. The previous sentence mentions 9 lineages without boundary cell types. So, a transition here back to lineages with boundary cell types, would help here.

      We agree and have improved the phrasing here by referencing back to the lineages with boundary cell types (lines 124-125):

      "Focusing on the majority of lineages in which this third boundary cell type arose, we analysed the gene expression profiles of the three identified cell types."

      1. Figures 3C-D, it would help to label these volcano plots proximal versus boundary and distal versus boundary. Although they do fit your color scheme and legend for the color scheme, it is important to specify it explicitly.

      We have added labels inside the volcano plots in Figure 3C-D to clarify proximal versus boundary and distal versus boundary.

      1. On Figure 4A it would help to label which gene is Prox and Dist. I assume they are the purple and yellow genes, but it would be easier if they were labeled.

      We have added labels in Figure 4A here to clarify.

      6. Line 185-186 "Gene 5 delays and spatially restricts the expression of gene 10, ensuring the symmetric development of the pattern." This statement needs to be supported by showing a time series simulation-movie or timepoints-revealing this timing aspect of Gene 5.

      We agree with the reviewer that this is currently lacking a clear visualisation and thank them for pointing this out. To address this, we have updated Figure 4 to include the temporal expression of genes 5 and 10 in the wild type and mutant for cells along the left-right axis in the proximal bullseye region. We have also included the following extra details in the results text (lines 194-199):

      ** Decreasing the spatial range of gene 5's regulatory influence by turning it into a TF resulted in a delay in its inhibition of gene 10 and reduced its self-activation range, explaining the smaller bullseye. In this mutant, expression of gene 5 is progressively delayed in cells located further from the origin of the patterning signal, and is ultimately absent on the right side of the proximal region of the bullseye (Figure 4C.ii). As a consequence, gene 10 becomes expressed in the right region, resulting in DIST identity instead of PROX, and leading to an asymmetric bullseye pattern.

      Reviewer 3

      1. How are the cell types defined from the simulations? Are they attractors of the dynamics of the corresponding proteins? And how are they computationally defined? Please provide more details about how the HBSCAN was used. In Figure S5, simulations #6 and #8 appear to have a 4th cell type (coloured in green), but the authors do not mention this result in the text. If cell types are defined by gene expression profiles, then the number of cell types will be dependent on the kind of clustering performed. Clarifying the definition of cell types will help resolve this issue.

      We thank the reviewer for raising this point and agree that the definition of cell types in our simulation results requires clearer explanation.

      The concept of cell type / cell identity is a complex theme which is still yielding interesting debate and discussion in the literature (see for instance Rafelski and Theriot, 2024). In our simulations, cell types are defined based on gene expression profiles rather than being explicitly identified as mathematical attractors of the underlying dynamical system. Operationally, we perform dimensionality reduction (UMAP) followed by clustering (using HDBSCAN) on the gene expression profiles across cells. This clustering serves as an initial, automated indication of distinct expression states across the petal.

      We recognise that the clustering results depend on the chosen dimensionality reduction and clustering method, as well as their parameterisation. For example, clustering applied to a smooth gradient (e.g., arising from diffusion alone) can artificially partition continuous variation into multiple discrete groups. For this reason, we do not rely solely on the clustering output: we use it as a first-pass classification and then manually verify the resulting groups by manually inspecting their gene expression profiles across the petal. This additional step ensures that identified "cell types" correspond to distinct expression states rather than arbitrary thresholds along a gradient. We have clarified both the computational procedure (dimensionality reduction + HDBSCAN clustering + manual verification) and the conceptual definition of cell types in the Methods section (lines 748-753).

      Regarding Figure S5, the fourth cell type (shown in green) in simulations #6 and #8 is indeed a distinct gene expression profile. We do occasionally observe the evolution of more and different cell types, this second boundary cell type being one of them, but also for example a salt-and-pepper type cell type (not shown). These cell types are however usually very transient and infrequent.

      * Rafelski, S.M. and Theriot, J.A., 2024. Establishing a conceptual framework for holistic cell states and state transitions. Cell, 187(11), pp.2633-2651.*

      2. In relation to the previous question, are the phenotypes used in the evolutionary simulations' steady states of the underlying dynamics?

      As clarified in response to Reviewer 1's comment 3, we do not explicitly require or enforce that phenotypes correspond to steady states of the underlying GRN dynamics. The developmental dynamics are always simulated for a fixed duration, and the fitness of a GRN is defined as the time-averaged gene expression pattern over a window at the end of this (lines 88-90) and Methods (lines 617-620).

      Because fitness is computed from this late-stage average, selection favors GRNs that produce consistent and stable expression patterns during that window. Networks that remain in strong transient or oscillatory regimes during this phase are typically penalised through reduced fitness.

      Therefore, while steady states are not imposed as a constraint, selection strongly favors solutions that are effectively stationary by the end of development. Indeed, inspection of the evolved GRNs shows that they converge to stable expression states.

      1. In Figure 3A it seems there are probably two cell types in the boundary region, is that right? Or are the elongated purple and elongated white cells basically the same cell type? Please clarify. If there are two, why did the authors choose to do the transcriptome analysis of the boundary region as one region, and not two subregions, to capture the two cell types?

      Correct, there are two different boundary cell types at the mature stage 5 petal: flat, elongated purple cells (lower boundary), and flat, elongated cream cells (upper boundary). However, the transcriptome data comes from an earlier stage (stage 2), where the boundary cells have not yet developed their characteristic shape and texture and the petal only comprises visibly pigmented (proximal) and non-pigmented (distal) cells. The morphological differences that distinguish the two boundary cell types at stage 5 are not yet apparent, hence we can only treat the boundary as one region at this stage, defined as the transition zone between pigmented and unpigmented cells

      We have made this distinction clearer in the figure caption of the Stage 2 petal (Figure 3B).

      1. I appreciate the explanation of the GRN pruning in the methods, but could the authors illustrate the network pruning process with an example and show that it works in this example?

      We have added a supplementary figure (Figure S12) depicting the pruning process for a GRN which keeps its boundary cell type during pruning and one for a GRN which loses its boundary cell type after pruning.

      1. From the methodological perspective, I suggest further clarifying what is new from this study and what is not. For instance, is the GRN pruning idea new or has it done before? The authors could consider reducing the formalities in the methods of the main text when they are not needed or when they are not new, to facilitate the readability of what is really important and novel in this work, and what is not. E.g., it is not really needed to mathematically define a Voronoi tessellation in the main methods section; this could be simplified or moved to a supplementary methods section.

      We agree that the distinction between methodological novelty and established components of the framework should be made clearer. We have therefore streamlined the description of non-novel methods and added appropriate citations to prior work where relevant, for example in the section on pruning.

      1. I believe the diffusion term used in Eqs. 14 and 17 does not conserve the total number of protein molecules; could the authors verify that? An example of a correct passive transport term for cell i of protein concentration p_i would be the sum of (p_j-p_i) for all j-cell neighbours, normalized by the area of cell i, or the formulation by Sukumar and Bolander (2003). This is especially important when noise is added, as the non-conservation of the number of proteins can lead to unwanted instabilities. Likely, these effects do not invalidate the results of the paper, but the authors should clarify the reason for their choice or double-check the conclusions using a correct, mass-conserved diffusion term.

      Thank you for pointing this out, this is indeed an error in our mathematical description. We double-checked our implementation, and confirmed our implementation correctly normalises by the area of cell i. We have a unit test which tests for mass conservation (https://gitlab.developers.cam.ac.uk/slcu/teamrv/evo-framework/-/blob/paper-2024-stoch-sims/tests/petal_test.cc?ref_type=tags#L66), which also confirms that our implementation is correct and this is only an error in the mathematical description in the paper. We have updated the equations to correctly reflect the implementation.

      1. It is important to facilitate the reproducibility of the results whenever possible, especially given that the computational framework used in this work has great value. I truly appreciate that the authors uploaded the code to a Gitlab. Please add further information in the readme file to facilitate reproducing the results, beyond the information regarding the code installation, whenever possible.

      We thank the reviewer for emphasising the importance of reproducibility. As noted in our response to Reviewer 1's comment 1, we have improved the structure and documentation of the public repository to facilitate reproduction of the results, including the SLURM scripts used for the evolutionary simulations and documenting code used for analysis and creating figures.

      Minor comments

      1. What is the reasoning behind the choice of the number of protein species? Why 12? Would the same results hold with a smaller number of proteins? As I imagine that the more species one considers, the more chances one has to get the desired phenotypes (or any desired phenotype for that matter). I could imagine that with 12 or more proteins, one could get more than 3 cell types (as defined by the clustering of their expression profiles). Is there something inherent in the creation of a boundary that leads to only 1 additional cell type and not more? Further simulations would be ideal to address this point, but otherwise, please comment on that if possible.

      As noted in our response to Reviewer 2's comment 4, the choice of 12 protein species is to some extent arbitrary. We selected this number as a compromise between providing sufficient degrees of freedom and maintaining computational feasibility of the evolutionary simulations. In a recently published manuscript from our team (van der Jagt et al., 2026), we tested the impact of reducing the number of genes and showed that important evolutionary dynamics are by and large the same.

      Regarding the possibility of obtaining more than three cell types: while rare, we do observe the emergence of additional cell types in simulation #6 and #8 in Figure S9. A larger number of proteins could in principle support more combinations of expression patterns, but the number of stable cell types that emerge is strongly determined by the fitness function and by the spatial structure of the task (i.e., generating two pre-specified domains). That is, the emergence of a single additional boundary cell type is driven primarily by the developmental and selective constraints, rather than being directly limited by the number of proteins in our simulations.

      van der Jagt, Pjotr L., Steven Oud, and Renske MA Vroomans. "System drift in the evolution of plant meristem development." PLOS Genetics 22.4 (2026): e1012089.

      2. What is the fundamental difference between Gene profiles I and II in generating cell types? If a cell type is defined by the specific expression of certain genes, then are not Gene Profiles I and II just different sides of the same coin? For instance, Gene profile I is characterized by the expression of a single gene at the boundary. Why do their simulations they do not obtain patterns where 2 genes are expressed in the boundary? Or 3? Or is there a fundamental difference in how these are generated, like the boundary being a stripe of a Turing pattern, or something similar? This also links with the work of Ding et al. and Lu et al.-which the authors mention in the introduction- where they propose that self-organized (Turing) patterns can explain anthocyanin patterning in petals. Could the authors clarify these points and maybe contextualize these results with previous works on petal patterns?

      The fundamental difference between the two gene profiles lies in how the boundary cell type is generated. In gene profile II, genes expressed in the boundary are also expressed in the proximal region, but some genes expressed proximally are not present in the boundary. The boundary cell type therefore emerges as the intersection of two differently-sized proximal bullseyes (Fig. 2B.ii). In gene profile I, by contrast, genes are more expressed in the boundary than anywhere else, producing a central striped expression pattern. While gene profile I can arise from profile II (Fig. S10), we also find cases where mechanism I appears independently, without mechanism II being present (Fig. S9; Simulation #25). This shows the two mechanisms are genuinely distinct, and we therefore treat them separately.

      Profile I includes infrequent cases where several genes are preferentially expressed at the boundary (see for example simulation #23 in Figure S9). As for why we rarely observe two or more genes uniquely expressed in the boundary, we are not sure, however we suspect this may relate to the limited number of distinct gene types available in our model, which constrains how many genes can play a flexible, boundary-specific role.

      Regarding the link to Turing patterns and the work of Ding et al. and Lu et al.: our model addresses the pre-patterning mechanism upstream of anthocyanin patterning, which subdivides the petal into distinct spatial regions. Based on evidence from Hibiscus, this pre-patterning is thought to be initiated by an asymmetric signal. The problem we investigate is therefore how an existing asymmetric signal is converted into a bullseye pattern, which is fundamentally different from Turing-type symmetry breaking from a uniform state. Our work thus complements Ding et al. and Lu et al. by addressing the upstream question of how the spatial regions that constrain these self-organised patterns to specific petal domains are first established. We have added a discussion of this connection in the Discussion section (lines 301-306).

      1. In relation to the previous point regarding the mechanisms underlying boundary formation, the authors could consider whether the theoretical works by the J. Sharpe lab on stripe formation might be relevant to cite (e.g., Cotterell and Sharpe 2010 or Jimenez et al 2015)

      We agree that they are relevant and have added a section about theoretical work on stripe formation as part of the discussion on novel phenotypes (lines 305-310).

      1. If possible, it would be ideal to have at least one video/animation of both the dynamics of each phenotype and the evolution of the phenotypes as their fitness increases, to see the evolutionary trajectories and test whether similar phenotypes can be achieved through different trajectories.

      We thank the reviewer for the suggestion, since the temporal dynamics can indeed be informative. We have added two supplementary videos (Video S1 & S2) illustrating the developmental dynamics of two GRNs: one that generates a boundary cell type via gene profile I, and one via gene profile II. These videos provide a clearer view of the developmental model's dynamics, and how boundary cell types emerge dynamically during development. References to these videos have been added to the main text immediately after introducing the two gene profiles.

      In addition, we have added two supplementary figures containing evolutionary trajectories: one tracing an individual's evolutionary trajectory including detailed changes in fitness and gene expression over time (Figure S8), and one showing the evolution of PROX and DIST expression during the early adaptive phase across the first 10 simulations (Figure S6).

      1. In the Discussion, I believe that the emergence of the novel cell type would benefit from stronger contextualization within known evo-devo frameworks. In particular, the authors describe that a new cell type emerges as a byproduct of the selection of a higher-order developmental process-the bullseye pattern with a clearly defined boundary-rather than through direct selection of the cell type itself. I am confident the authors know these phenomena have been discussed under the term spandrels (Gould & Lewontin, 1979), and have been the subject of extensive study and debate. While identifying traits as spandrels is complicated-largely because in practice we lack reliable frameworks to distinguish them from actual adaptations-the work presented here provides a plausible mechanism of how such features could arise. To me, this fact alone is interesting, as not many works (as far as I know) have addressed this problem explicitly. Maybe the authors want to emphasize this fact as a novelty of their approach. To be clear, I am not suggesting that the authors should adopt a specific terminology; rather, I believe that explicitly invoking the concept of spandrel would resonate with readers familiar with the foundations of evo-devo and would strengthen the main message of the paper.

      We thank the reviewer for this great suggestion. We have added a reference to Gould & Lewontin's seminal paper in our discussion, placing our findings in the context of spandrels (lines 320-323).

        1. *Some additional considerations related to figures

      Please change colours in the figures to be colour-blind whenever possible The stripes in the striped purple cell shown in Fig. 3A are not seen unless one zooms in on it; would it be possible to represent this differently? In Fig. 5 Aii and Bii, it would be easier for the reader to connect with the statements in the main text if the x-axis is x 1000 or x100 instead of x500 Perhaps clarify panel captions of Fig. panels 3C and 3D. Probably I am missing something basic, but I was also wondering how their numbers are connected to the numbers in the panel of Fig. 3F. Why does Fig. 3F have three subpanels? Is it because of different expression levels? Please clarify.

      We thank the reviewer for bringing this up. On revisiting our figures, we noticed some hard-to-distinguish colours for the common red-green colorblindness (deuteranopia). We have improved this by changing the reds closer to magenta, making the figures more accessible. We increased the size of the cartoon cell in Figure 3A and increased the contrast of the colours used to indicate the stripes. We have changed this to read x1000 to improve clarity. We have added the following text to the caption of Fig 3E, page 6, to clear this up: The number in the intersection indicates genes enriched in the boundary compared to both proximal and distal regions.

      The numbers within each non-overlapping portion of the circles indicate genes enriched in the boundary relative to only one region (proximal or distal), minus those shared in the intersection.

      Yes indeed, they represent different order of magnitudes in expression (high, medium, and low, respectively). We have clarified this in the caption of Figure 3F.

      1. Could the authors clarify the choice of using the Stratonovich approach in the stochastic simulations?

      We decided on the Stratonovich interpretation, as it is the interpretation that is most natural when comparing with the deterministic model, where we "turned off" the noise. With the Stratonovich interpretation, we can get a deterministic system by simply dropping the noise terms. Had we chosen the Ito interpretation, this same approach would require changing the dynamics of the deterministic system by including a noise-induced bias in the drift term.

      1. Note equations are referred to in the text as Eq. S (...) whereas they are not supplementary equations

      Thanks for pointing this out, we have fixed this in the revised manuscript.

      1. The code is very large (more than 1GB), and I believe much of the space is used by Voronoi tessellations. If the authors have the time and have the scripts generating the Voronoi tessellations, the authors could add them to the repository and ensure that these tessellations are generated during the simulations whenever needed (but I am aware that code organization takes time). I would recommend having the code also in a repository with a DOI (e.g., Zenodo or OSF).

      We have significantly reduced the repository size by removing some Voronoi tessellations that are not used in this work, and have created a DOI for the code (line 352).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Oud et al. explores the evolution of a developmental mechanism generating bullseye patterns in petals using evolutionary simulations of gene regulatory networks and transcriptomics data. The authors provide a plausible mechanism of how a novel cell type can emerge as a byproduct of selecting for a higher-order process-in this case, the establishment of a bullseye pattern with two clearly delineated regions. Moreover, the authors show that the emergence of the new cell type persists longer in their evolutionary simulations when the system is noisy, suggesting a functional role of the cell type in buffering developmental variability. The approach is very impressive, bridging in silico-generated GRNs that model a patterning process and evolve over generations, and in turn, combining them with transcriptome analysis experiments. However, precisely due to the complexity of the work done, I would like the authors to clarify and/or address key elements of the methodology, especially those related to the assumptions regarding the modelling approach and their implications for the validity of the results, as well as from the analysis.

      Major comments:

      1. There are some aspects to clarify; some are mentioned here, but others are mentioned in minor points.

      1.1. How are the cell types defined from the simulations? Are they attractors of the dynamics of the corresponding proteins? And how are they computationally defined? Please provide more details about how the HBSCAN was used. In Figure S5, simulations #6 and #8 appear to have a 4th cell type (coloured in green), but the authors do not mention this result in the text. If cell types are defined by gene expression profiles, then the number of cell types will be dependent on the kind of clustering performed. Clarifying the definition of cell types will help resolve this issue.

      1.2. In relation to the previous question, are the phenotypes used in the evolutionary simulations' steady states of the underlying dynamics?

      1.3. In Figure 3A it seems there are probably two cell types in the boundary region, is that right? Or are the elongated purple and elongated white cells basically the same cell type? Please clarify. If there are two, why did the authors choose to do the transcriptome analysis of the boundary region as one region, and not two subregions, to capture the two cell types?

      1.4. I appreciate the explanation of the GRN pruning in the methods, but could the authors illustrate the network pruning process with an example and show that it works in this example?

      1.5. From the methodological perspective, I suggest further clarifying what is new from this study and what is not. For instance, is the GRN pruning idea new or has it done before? The authors could consider reducing the formalities in the methods of the main text when they are not needed or when they are not new, to facilitate the readability of what is really important and novel in this work, and what is not. E.g., it is not really needed to mathematically define a Voronoi tessellation in the main methods section; this could be simplified or moved to a supplementary methods section. 2. I believe the diffusion term used in Eqs. 14 and 17 does not conserve the total number of protein molecules; could the authors verify that? An example of a correct passive transport term for cell i of protein concentration p_i would be the sum of (p_j-p_i) for all j-cell neighbours, normalized by the area of cell i, or the formulation by Sukumar and Bolander (2003). This is especially important when noise is added, as the non-conservation of the number of proteins can lead to unwanted instabilities. Likely, these effects do not invalidate the results of the paper, but the authors should clarify the reason for their choice or double-check the conclusions using a correct, mass-conserved diffusion term. 3. It is important to facilitate the reproducibility of the results whenever possible, especially given that the computational framework used in this work has great value. I truly appreciate that the authors uploaded the code to a Gitlab. Please add further information in the readme file to facilitate reproducing the results, beyond the information regarding the code installation, whenever possible.

      Minor comments:

      1. What is the reasoning behind the choice of the number of protein species? Why 12? Would the same results hold with a smaller number of proteins? As I imagine that the more species one considers, the more chances one has to get the desired phenotypes (or any desired phenotype for that matter). I could imagine that with 12 or more proteins, one could get more than 3 cell types (as defined by the clustering of their expression profiles). Is there something inherent in the creation of a boundary that leads to only 1 additional cell type and not more? Further simulations would be ideal to address this point, but otherwise, please comment on that if possible.
      2. What is the fundamental difference between Gene profiles I and II in generating cell types? If a cell type is defined by the specific expression of certain genes, then are not Gene Profiles I and II just different sides of the same coin? For instance, Gene profile I is characterized by the expression of a single gene at the boundary. Why do their simulations they do not obtain patterns where 2 genes are expressed in the boundary? Or 3? Or is there a fundamental difference in how these are generated, like the boundary being a stripe of a Turing pattern, or something similar? This also links with the work of Ding et al. and Lu et al.-which the authors mention in the introduction- where they propose that self-organized (Turing) patterns can explain anthocyanin patterning in petals. Could the authors clarify these points and maybe contextualize these results with previous works on petal patterns?
      3. In relation to the previous point regarding the mechanisms underlying boundary formation, the authors could consider whether the theoretical works by the J. Sharpe lab on stripe formation might be relevant to cite (e.g., Cotterell and Sharpe 2010 or Jimenez et al 2015)
      4. If possible, it would be ideal to have at least one video/animation of both the dynamics of each phenotype and the evolution of the phenotypes as their fitness increases, to see the evolutionary trajectories and test whether similar phenotypes can be achieved through different trajectories.
      5. In the Discussion, I believe that the emergence of the novel cell type would benefit from stronger contextualization within known evo-devo frameworks. In particular, the authors describe that a new cell type emerges as a byproduct of the selection of a higher-order developmental process-the bullseye pattern with a clearly defined boundary-rather than through direct selection of the cell type itself. I am confident the authors know these phenomena have been discussed under the term spandrels (Gould & Lewontin, 1979), and have been the subject of extensive study and debate. While identifying traits as spandrels is complicated-largely because in practice we lack reliable frameworks to distinguish them from actual adaptations-the work presented here provides a plausible mechanism of how such features could arise. To me, this fact alone is interesting, as not many works (as far as I know) have addressed this problem explicitly. Maybe the authors want to emphasize this fact as a novelty of their approach. To be clear, I am not suggesting that the authors should adopt a specific terminology; rather, I believe that explicitly invoking the concept of spandrel would resonate with readers familiar with the foundations of evo-devo and would strengthen the main message of the paper.
      6. Some additional considerations related to figures:

      9.1. Please change colours in the figures to be colour-blind whenever possible.

      9.2. The stripes in the striped purple cell shown in Fig. 3A are not seen unless one zooms in on it; would it be possible to represent this differently?

      9.3. In Fig. 5 Aii and Bii, it would be easier for the reader to connect with the statements in the main text if the x-axis is x 1000 or x100 instead of x500

      9.4. Perhaps clarify panel captions of Fig. panels 3C and 3D. Probably I am missing something basic, but I was also wondering how their numbers are connected to the numbers in the panel of Fig. 3F.

      9.5. Why does Fig. 3F have three subpanels? Is it because of different expression levels? Please clarify. 10. Could the authors clarify the choice of using the Stratonovich approach in the stochastic simulations? 11. Note equations are referred to in the text as Eq. S (...) whereas they are not supplementary equations. 12. The code is very large (more than 1GB), and I believe much of the space is used by Voronoi tessellations. If the authors have the time and have the scripts generating the Voronoi tessellations, the authors could add them to the repository and ensure that these tessellations are generated during the simulations whenever needed (but I am aware that code organization takes time). I would recommend having the code also in a repository with a DOI (e.g., Zenodo or OSF).

      Referee cross-commenting

      The comments by other referees are complementary to mine; there are some common aspects with my comments and other important points to look into.

      Significance

      This study provides a plausible explanation of how new cell types can emerge as byproducts of the selection of other processes. This is an important advance in understanding the mechanisms underlying the origin of evolutionary novelties, particularly from the point of view of morphogenesis and patterning, rather than from a more traditional, strictly gene-centric views which focus on changes in specific loci, gene duplications, or neofunctionalization. By highlighting evolutionary novelty as a consequence of higher-order constraints, this work broadens the frameworks through which cellular diversity can be understood.

      I believe most of the limitations of the study are conceptual and regarding improving clarity rather than methodological. For instance, the definition of what a cell type is remains, in my opinion, somewhat vague, especially if the clustering has been performed with only 12 genes. However, I am aware of the conceptual difficulty in defining cell types in general. In addition, the emergence of only a single additional cell type, rather than multiple types, might be a consequence of the limited number of proteins considered. Aside from these issues, the methodology is sound and provides a useful framework for exploring the origin of novel cell types.

      I see this work as being of substantial interest to researchers concerned with the conceptual foundations of evo-devo, particularly those interested in the origins of novelty and in the role of constraints in shaping such novelty. It should also be relevant to studying morphogenesis from a dynamical systems perspective. Finally, this work will be of interest to those investigating the ecological roles of petal patterns, especially in relation to their roles in attracting pollinators or protecting reproductive organs from environmental factors.

      Overall, I think this work represents a very valuable contribution to the evo-devo community, providing conceptual advances into our understanding of the emergence of novelty, as well as providing a complex computational framework addressing cellular patterning in evolving GRNs.

      Field of expertise: developmental biology, nonlinear dynamics, pattern formation, evo-devo.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript entitled "Recurrent emergence of boundary cell types during evolution of floral bullseye patterns" Oud et al use computational modeling to determine how gene regulatory networks can set up the prepattern for a bullseye pigmentation. They use a modeling template that is similar to the hibiscus petal primordium, create a gene regulatory network composed of the interaction of cell autonomous transcription factors, transcription factors that can diffuse from one cell to another, and cell-cell communication signals. Each simulation started from a diffusing signal from the base and all other genes with no expression. Such a signal diffusing from the base of the organ has been hypothesized many times in plant morphogenesis, so this is plausible. They started 35 populations with initially random GRNs and let them evolve for 30,000 generations selecting for simulated petals with higher bullseye fitness in each generation. All 35 generated bullseyes. The authors used a UMAP dimensionality reduction similar to single cell RNA-seq to identify different cell types in the models. I have not seen this analysis applied to modeling before, and I thought this approach was innovative. Interestingly 26 out of the 35 initiated a boundary cell type to help in the robust establishment of the symmetric bullseye, whereas 9 did not. There are two major ways these boundary cell types is established: (1) boundary specific gene expression and (2) two nested proximal genes with one extending beyond the other. Then the authors examine real hibiscus petals and identify boundary cells, which express 30 boundary specific genes. The authors then examine one of the GRNs from one of their populations and find that gene 5 is crucial for setting up the boundary. Finally, the analyze over evolutionary time in each population and see that these boundary cells come and go in the lineages, but they have a longer persistence time when there is noise in the modeling, suggesting that they add robustness to the generation of the bullseye.

      Major comments:

      There is a major missed opportunity to analyze the evolved networks. Only one of the 30 GRNs is analyzed in figure 4. Please add further analysis of the GRNs from all the populations. Within a population after 30K generations, how much variation is there in the GRNs of individuals? How similar are the optimal fitness evolved GRNs across all 35 populations? Are there common motifs across networks? Is there always an antagonism between proximal and distal proteins somewhere in the network? A lot of previous work on GRNs has established the function of common motifs, and these should be analyzed. Please provide all 30 gene regulatory networks in the supplement.

      The purpose and significance of examining the evolutionary lineage is not clear. Please explain your logic. This is most important for Figure 5 where it becomes clear that the boundary cells are often formed transiently in the evolution of the GRN. If this boundary cell type does not persist, how can it help the petal generate a bullseye. What happens after the boundary cell type is lost? Has the GRN evolved into a more stable place where it no longer needs the boundary? In several instances it looks like they come and go many times. Please explain how these transient boundary cells in the evolutionary lineage can make a difference. This point also comes up in lines 113-115 "For each simulation, we traced back the ancestral lineage of the final fittest individual and sampled 12 of its ancestors at evenly spaced generational intervals, performing this analysis on each sampled ancestor." I could understand if the boundary cell type were developmentally transient, but I have a hard time what its significance is since it is evolutionarily transient.

      It is worth saying more about how the 9 lineages without a boundary cell types manage to make a robust bull's eye pattern because this is also interesting.

      How were 12 proteins chosen for the network, as opposed to 6 or 20 for instance? In the network pruning, it seems like fewer proteins are required. How many proteins are required to produce a bulls eye pattern?

      Minor comments:

      The title needs to be changed to include computational modeling or simulation because otherwise the current version of the title implies that these boundary cell types are found in plant species evolution.

      Line 103 - 106 "We found that over a third of all simulations evolved a bullseye size of approximately 50% of the petal's central height (Figure 2A.ii). This indicates a tendency for simulations to converge toward these proportions, possibly due to the interaction between the patterning signal distribution and the tissue geometry." The phrasing here is confusing. Which proportions does "these proportions" refer to? Presumably, 50% from the preceding sentence. But the second proportion is not clear from the text. Maybe it is the peak at approximately 65% seen in the graph. Please clarify in the text.

      Line 118 "To further explore cell identity in the third cluster, we analysed the gene expression profiles of the three identified cell types." It is not clear what the third cluster refers to. The previous sentence mentions 9 lineages without boundary cell types. So, a transition here back to lineages with boundary cell types, would help here.

      Figures 3C-D, it would help to label these volcano plots proximal versus boundary and distal versus boundary. Although they do fit your color scheme and legend for the color scheme, it is important to specify it explicitly.

      On Figure 4A it would help to label which gene is Prox and Dist. I assume they are the purple and yellow genes, but it would be easier if they were labeled.

      Line 185-186 "Gene 5 delays and spatially restricts the expression of gene 10, ensuring the symmetric development of the pattern." This statement needs to be supported by showing a time series simulation-movie or timepoints-revealing this timing aspect of Gene 5.

      Referee cross-commenting

      I agree with all reviews, which are aligned.

      Significance

      How pigment patterns in petals are established is an important and fascinating question, that sheds light on broader issues of how tissues are pre-patterned. Previous studies focus on the reaction diffusion gene regulatory networks that create beautiful petal pigment spot patterns. This paper fills the gap in addressing how a prepattern is established to create a simple proximal distal bullseye pigment pattern. Overall, the use of modeling in this study raises several novel and exciting hypotheses for how a pre-pattern can be established during development. One limitation of the study as acknowledged by the authors is that the actual petal grows, whereas the model does not. Although growth is likely to make an interesting contribution to the pattern, I agree that it is beyond the scope of this manuscript. Modeling papers are always challenging to write clearly, and I point out some areas where clarifications are needed below. The figures illustrate the results well.<br /> This paper will be of interest to developmental biologists, gene regulatory network afficionados and computational biologists.

      My expertise is in plant morphogenesis and patterning as well as computational modeling.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript presents the findings of a computational investigation, whereby populations of artificial "genomes" and their products are evolved algorithmically. They are subjected to a fitness constraint defined in terms of a spatial expression pattern on a petal shaped template. The specific focus of this work is the formation of two-pigment patterns on flower petals, which give rise to "bullseye" patterned flowers. A computational survey suggests that besides the two main genetic identities which are strictly required to form such patterns, a third population is likely to emerge, as a marker located at the interface between the two main identities. This prediction is then tested by dissecting petals of Hibiscus trionum and performing an mRNA-seq survey. The resulting data set is consistent with the simulations, with a population of genes specifically expressed at the boundary between the two main regions. The paper then discusses a number of hypotheses on the evolution of underlying gene regulatory networks, testing them computationally. In particular, by comparing simulations with and without stochastic terms in the dynamics of gene regulation/expression, it is suggested that the 3rd identity is contributing to robustness of the pattern in the face of noise. Overall the main text is clear and makes an interesting case.

      Major comments:

      1. The code used for simulations is available on a public repository, but it does not directly ensure that results are reproducible. To do so would require a clear step-by-step guide referring the user to the specific pieces of code which have been used for the results and figures presented in the paper. At the moment, I could not find any such guide and the large number of scripts, executables and jupyter notebooks are not clearly linked to the paper's contents.
      2. The methods themselves involve a number of arbitrary choices. Though this is understandable given the nature of the work, one aspect in particular that would deserve better clarity is the modeling of gene network dynamics. The stochastic model (l.516 & following) involves a nesting of "Hill-like" terms (those in Eqs. (7) and (11)) which is unusual and given without justification. There should be some explanation of how this approach relates to standard approaches such as those reviewed e.g. in: Bintu et al. Current opinion in genetics & development 15.2 (2005): 116-124.

      3. It is also unclear at the moment how exactly the GRN dynamics is used; are time-stepping algorithms used until the system reaches a stationary regime? If so, how is stationarity assessed? This needs to be explained both in the main text and in the methods. The table of parameters suggests that there was a cut-off time, but there is no explanation whatsoever about the state of the dynamics at this time.

      4. Related to the previous point, the table of parameters (Table S1) is provided without any explanation; through what process (exploratory, literature review, trial and error...) where the values selected? As there been any type of sensitivity analysis?

      Minor comment:

      1. The fitness function used in simulations specifically encodes the desired pattern, with two zones having differential gene expression. This allows the artificial selection to evolve towards such patterns, as expected, but it is not entirely clear how this relates to natural selection itself. At the very start of the paper, the authors briefly review some possible sources of selective pressure for flowers to exhibit patterns such as bullseye, among others. None of the selective factors would likely act on the plants as a direct incentive for two regions, as specified in the cost function. Instead, one may expect a more high level criterion, such as "conspicuousness" for a pollinator, for instance. This is admittedly not naturally represented as a fitness function, but the choice of this function definitely influences the outcomes of a simulation. Some further numerical experiments may allow to demonstrate that the exact cost function is not critical for the findings of the paper, but I understand they would likely be computationally costly, to the point of unfeasibility. This limitation should be mentioned at least.
      2. [optional suggestion] The number of genes used in the simulations is very small in comparison to real organisms. This is clearly justified by the complexity of the work, but one wonders if simulations could be made more efficient by using a much simplified approach for the gene network dynamics. At the time scales of interest, it seems that the use of SDEs and the numerical intricacies they require might be an unnecessary burden. Have the authors considered a much simpler approach, for instance based on Boolean models? Since the study only uses static tissues, all the GRN dynamics could be by-passed, determining steady states very quickly and using them to determine fitness. If this saved significant computational time, this would allow a more comprehensive survey of the "purely genetic" part of the model.

      Referee cross-commenting

      I agree with both other reviewers. As mentioned by them, our reviews bring complementary suggestions, while being overall in good agreement.

      Significance

      Reviewer's expertise: mathematical modeling, mathematical biology.

      This paper is mostly a conceptual study, in which the majority of results are based on computer simulations. The findings are biologically interesting, but it is hard to prove these evolutionary claims through physical experiments. The complexity of the simulations requires a large number of technical assumptions and parameter choices, which overall make it very difficult to assess how plausible these simulations are, compared to the natural processes they are meant to represent. All the findings are well-argued and provide an overall convincing case, but it is by design impossible to fully assess experimentally. As such, this work will be mostly valuable to theoretical biologists, computational modelers, and researchers interested in "artificial life" and gene evolution.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      Point

      Summary

      Response

      1.1

      Overall, the study lacks well-controlled experiments comparing hypoxia induced by DMOG with hypoxia induced by 1% O₂ for assessing ERα occupancy throughout.

      To assess whether DMOG-induced changes in ERα occupancy reflect bona fide hypoxia, we measured ERα binding by ChIP-qPCR under 1% oxygen over 48 hours, compared to normoxic (21% oxygen) cells and input controls in matched cells at the GREB1 and TFF1 loci. Our findings demonstrate that 1% oxygen treatment recapitulates the ERα binding changes observed with DMOG, at the time points of our RNA-seq experiments.

      We have included these results in __Figure 1F __of the preliminary revision of the manuscript.

      1.2

      Lack of evidence for other co-transcription factors impact under hypoxia HIF's in Fig1.

      We thank the reviewer for this comment. We have clarified that motif enrichment analysis is included to characterise the sequence context of ERα binding sites and to confirm enrichment of known ER-associated motifs (e.g. EREs), rather than to infer functional involvement of additional transcription factors under hypoxia. Corresponding interpretative statements have been removed from the Results and restricted to the Discussion.

      1.3

      Lack of evidence for DMOG induce HIF protein expression in MCF7 cells.

      To confirm DMOG induces HIF-protein expression we have analysed HIF1α and HIF2α protein levels by western blot. We have included these in __Supplementary Figure S1A __within the preliminary revision to address this concern.

      1.4

      Figure 1: ATAC-seq was performed under 1% O₂, whereas ChIP-seq was conducted with DMOG treatment, making these conditions not directly comparable.

      We acknowledge that the ERα ChIP-seq (DMOG) and ATAC-seq datasets were generated under different conditions and are therefore not directly comparable. To address this, we have performed ChIP-qPCR under bona fide hypoxia (1% oxygen) at canonical ERα target loci (TFF1 and GREB1), demonstrating that the directionality of ERα binding changes observed with DMOG is recapitulated under physiological hypoxia. These data provide a direct comparison of ERα occupancy across conditions and support the use of DMOG as a proxy for hypoxia in our ChIP-seq experiments.

      If requested, we are willing to perform ATAC-seq at 16 h under 1% oxygen. However, because the original dataset was generated under 0.1% oxygen, and canonical ERα-bound sites show minimal accessibility changes under severe hypoxia, we anticipate limited additional insight from repeating this experiment.

      1.5a

      Figure S1: ERα ChIP lacks estradiol (E2) treatment in MCF7 cells with or without DMOG.

      The statement that the ERα ChIP samples lack estrogen treatment is incorrect. Estradiol was not an experimental variable and cells were intentionally maintained under estrogen-rich conditions to preserve tumour-relevant ERα activity.

      We have now clarified within the preliminary revision by stating that cells were routinely cultured in “estrogen-rich Dulbecco’s Modified Eagle Medium” in the methods section, and clarified the use of estrogen-rich conditions in the Figure S1 legend.

      1.5b

      The single-gene examples of DMOG effects shown in Fig. S1A are not significant.

      The peak illustrated in Figure S1A (now Figure S1D) __is intended to provide a visual confirmation of peak calling and enrichment patterns underlying the genome-wide redistribution observed in __Figure 1. The peak was called by the MACS2 pipeline (code available from https://doi.org/10.5281/zenodo.17221105) with a log10(q-value) = 268.5, which passes the MACS2 cut-off q

      1.6a

      Fig. S2 lacks 1% O₂ conditions,

      We wish to clarify that Figure S2 (now Figure S4) serves as quality control specifically for the DMOG-treated ChIP-seq dataset presented in Figure 1C. The purpose of the plot is to visualize unfiltered motif enrichment to confirm that the identified peaks represent bona fide ERα binding events within the DMOG condition. Motif enrichment under a 1% oxygen environment would not provide this validation. In all cases the ERE is the most significantly enriched motif.

      With respect to ERα binding under 1% oxygen, we have now assessed this via targeted ChIP-qPCR validation (Figure 1F).

      1.6b

      Fig. S3 lacks DMOG-induced HIF factor assessments.

      The DMOG-induced changes in HIF1α and HIF2α expression are shown in the__ Figure S1__ of this revision proposal and have been incorporated into the manuscript as part of the changes described in response 1.3.

      1.7a

      Figure S4: Estradiol (E2) treatment is missing from the controls, and the figure labeling is of poor quality.

      We have substantially improved the labelling of Figure S4, now__ Figure S6.__

      Additionally, we have clarified that all samples were cultured in estrogen-rich media and treated with either vehicle control or 100 nM fulvestrant; thus estrogen is present in all conditions including the controls.

      1.7b

      Hypoxic conditions for assessing ER status and appropriate controls are also lacking.

      We agree that monitoring ERα stability under hypoxic conditions is essential.

      We provided a western blot assessment of ERα protein levels at 0, 8 and 48 hours of treatment with 1% oxygen or DMOG, compared to normoxic controls, included as Supplementary Figures S1B, C in the preliminary revision.

      These demonstrate the cells remain positive for ERα protein expression at 0, 8 and 48h.

      1.8

      Figure S5: The description of fulvestrant treatments under hypoxic conditions is unclear.

      We thank the reviewer for this comment. To clarify the experimental design, we now signpost the reader in the figure legend of Figure S5 (now S7) to the schematic diagram provided in Figure 3B, and provide a summary stating the experiment employed a factorial design combining a 96-hour fulvestrant treatment with exposure to 1% oxygen for the final 48 hours.**

      1.9

      Supplemental legends: These require major revision; they are of poor quality and lack statistical details and references to biological replicates.

      We have extensively revised all supplementary figure legends to ensure clarity and precision.

      1.10

      Overall comparisons throughout the manuscript are weak; the figures appear sloppy and lack sufficient effort in presentation.

      Following this comment, we carefully reviewed the presentation of all figures throughout the manuscript. We improved the organisation and labelling of the Supplementary Figures to facilitate clearer comparison of the data. In particular, full western blots are now clearly annotated and supplementary legends have been expanded to provide sufficient context for each figure to be interpreted independently.

      1.11

      i) In general, the manuscript in its present form does not greatly contribute from published work as the ERα cistrone is well documented work studied for its role in regulating gene expression, particularly in ERα-positive breast cancer.

      ii) Additionally, a lack of a thorough comparison between DMOG and or 1 %oxygen induce hypoxia in the MCF7 ER+ model, diminished initial interest in the manuscript.

      iii) The lack of considering estradiol exposure under hypoxic conditions with either 1%oxygen and or DMOG also limits relevance to patients with ER+ BrCa.

      iv) The ERα epigenomic profile has been extensively studied including work under hypoxic conditions.

      i) We respectfully disagree that the manuscript does not extend prior work. Despite extensive characterisation of ERα, its role in shaping hypoxia-driven transcription in ER+ breast cancer has not been defined. Here, we identify an ERα-dependent hypoxic response (EDHR), demonstrating a reciprocal interaction between hypoxia and ERα activity.

      ii) In revision, we address concerns regarding DMOG by validating ERα binding under 1% oxygen using ChIP-qPCR thereby confirming our result in bona fide hypoxia. Additionally, all RNA-seq and functional assays, including ENaC targeting, were performed under 1% oxygen in the original manuscript.

      iii) All experiments were conducted under estrogen-complete conditions, now explicitly clarified, reflecting tumour-relevant ERα activity.

      iv) Together, these data establish a reciprocal interaction between ERα and hypoxia and uncover a targetable vulnerability in hypoxic ER+ breast cancer, linking transcriptional regulation to therapeutic opportunity.

      Reviewer 2

      No.

      Summary

      Response

      General Comments

      2.1

      ENAC is proposed as a therapeutic vulnerability based on amiloride sensitivity assays. Additional experiments are required, such as western blot validation of ENaC regulation under hypoxia and loss-of-function approaches to assess its contribution to the phenotype.

      We agree that further validation of ENaC involvement would strengthen this observation. We will assess ENaC protein levels under 1% hypoxia ± fulvestrant by western blot and perform siRNA-mediated depletion of ENaC subunits to test their contribution to the hypoxia-specific amiloride-sensitive phenotype by viability assay (see also response 3.3).

      2.2

      Fulvestrant is used to dissect ERa dependency. However, as a SERD, it may alter chromatin and transcription independently of a simple loss of ERα. Addition control would strengthen interpretation.

      The experimental design already controls for potential fulvestrant-specific transcriptional effects, as all four conditions (± hypoxia, ± fulvestrant) were included. EDHR genes were defined based on induction under hypoxia, loss of this induction following ERα degradation, and absence of residual hypoxic induction in the presence of fulvestrant. Consistent with this, SCNN1B and SCNN1G do not show significant fulvestrant-responsive changes under normoxia (Figure 5C,D).

      We also note that fulvestrant has been shown to induce minimal global chromatin remodelling (Guan et al., 2019), supporting its use to assess ERα dependency without broadly confounding chromatin accessibility; this reference is now included in the manuscript.

      2.3

      The molecular mechanism by which ERα modulates the hypoxic transcriptome, specifically how ERα and HIF pathways converge at ENAC loci should be more studied.

      We further examined the potential convergence of ERα and hypoxic signalling at the ENaC loci (included as __Figure 5E __in the revision proposal) showing genome browser views of the SCNN1G and SCNN1B loci, highlighting hypoxia-induced HIF1α binding and ERα association at these sites.

      To further support this, we will perform RT-qPCR validation of SCNN1G and SCNN1B expression following treatment ± IOX5 and ± fulvestrant. IOX5 is a selective PHD inhibitor that stabilises HIF proteins, enabling us to assess the contribution of HIF signalling independently of other oxygen-dependent effects associated with hypoxia.

      2.4

      In addition, to assess the relevance of this work for luminal breast cancer and ERα expression, specific validation in TNBC should be performed

      To assess the clinical relevance of SCNN1B and SCNN1G in ER-positive and ER-negative subgroups, we performed Cox proportional hazards analyses in TCGA and METABRIC cohorts individually, including ER status and stratifying by ER-positive and ER-negative cases (Figure 6C). These analyses support the association of SCNN1G with poorer relapse-free survival specifically in ER-positive patients.

      2.5

      The authors should provide RT-qPCR validation of the key EDHR genes, especially since this signature is later used for downstream analyses.

      We agree that independent validation would strengthen these findings. We will perform RT-qPCR validation of key EDHR genes (including SCNN1B and SCNN1G) under ± hypoxia and ± fulvestrant conditions to confirm ERα-dependent hypoxic induction.

      Limitations

      2.6

      Reprogramming of the ERα cistrome under cellular stress is well documented. The study extends these ideas but does not clearly demonstrate a new mechanistic paradigm, particularly because the EDHR is defined primarily through omics approaches without strong mechanistic validation. In addition, we have to keep in mind that the study uses DMOG to model hypoxia-driven chromatin changes, but DMOG inhibits many 2-oxoglutarate-dependent dioxygenases non-selectively.

      This makes it difficult to attribute ERα cistrome reprogramming specifically to hypoxia, rather than to broad off-target effects. The transcriptomic dataset is more convincing by need the validation suggested previously.

      While ERα cistrome reprogramming has been described, our study demonstrates a reciprocal interaction in which ERα not only responds to hypoxia but actively shapes hypoxia-driven transcription, defining an ERα-dependent hypoxic response (EDHR).

      We acknowledge the limitations of DMOG and have addressed this by validating key ERα binding events under bona fide hypoxia (1% oxygen) using ChIP–qPCR, confirming our findings under physiological conditions (response 1.1).

      To further strengthen mechanistic insight, we will assess the requirement for HIF stabilisation using the selective PHD inhibitor IOX5, combined with RT-qPCR analysis of SCNN1G and SCNN1B ± IOX5 ± fulvestrant (response 2.3 and 2.5). In addition, we will validate the functional relevance of ENaC through protein-level analysis and siRNA-mediated depletion, as described in__ response 2.1.__

      Together, these additions address concerns regarding DMOG specificity and provide further support for a functional interaction between ERα and hypoxic signalling.

      Audience

      2.7

      Given its reliance on omics datasets and preliminary functional assays, the paper will likely appeal to a specialized audience in transcriptional regulation, hypoxia signalling, and ER+ breast cancer biology. However, the limited mechanistic depth and uncertain translational relevance due to the lack of in vivo validation, may reduce its impact for broader oncology or therapeutic-development audiences. Without stronger validation, the findings may be perceived as niche and mainly of interest to researchers focused on ERα chromatin dynamics rather than to the wider cancer research community.

      The study incorporates multiple layers of human relevance, including spatial transcriptomic analyses demonstrating enrichment of EDHR within hypoxic tumour regions and survival analyses linking EDHR and ENaC expression to clinical outcome.

      In revision, we address the reviewer’s concerns through targeted validation (ChIP-qPCR in hypoxia, western blotting, and RT–qPCR). Together, these additions strengthen the mechanistic and translational relevance of the study.

      Reviewer 3

      No.

      Summary

      Response

      Major comments

      3.1

      The DMOG ChIP-seq provides a valuable first look at ERα redistribution. Since DMOG inhibits both HIF hydroxylases and oxygen-dependent demethylases, the driver of the observed changes remains ambiguous. It would help to include either ERα ChIP-seq under bona fide hypoxia or a selective PHD inhibitor condition (for example IOX5, as you discuss) to separate HIF stabilisation from broad demethylase inhibition. If ChIP-seq is not feasible, a brief ATAC validation at a small panel of gained and lost loci would still increase confidence.

      We acknowledge that mimetics of hypoxia can introduce off-target effects. To address this, we have validated our ERα ChIP-seq findings using ChIP-qPCR at representative loci (TFF1 and GREB1), demonstrating consistent changes in ERα binding under bona fide hypoxia (1% oxygen) (now included in Figure 1F).

      As acknowledged by the reviewer, ChIP-seq under these conditions is likely not feasible due to cell number constraints. We are willing to undertake ATAC-seq if required (as stated in response 1.1); however, we do not feel it would directly address ERα occupancy at these loci. We therefore consider our targeted ChIP-qPCR to be the most appropriate approach to validate ERα redistribution under hypoxia.

      3.2a

      The factorial RNA-seq is well designed and the attenuation analyses are clear. The EDHR selection is stringent and reproducible across two ER+ lines.

      To support the claim of ERα dependence mechanistically, a small number of targeted perturbations would go far. For example,

      i) confirm EDHR induction for SCNN1B and SCNN1G in hypoxia with and without fulvestrant by RT-qPCR

      We agree that targeted validation would strengthen the mechanistic support for ERα dependence. We will perform RT-qPCR validation of SCNN1B and SCNN1G under hypoxia ± fulvestrant to confirm ERα-dependent hypoxic induction (see also response 2.5).

      3.2b

      ii) test whether short-term ERα knockdown reproduces the effect.

      ERα dependency is already assessed through fulvestrant-mediated degradation within the factorial design, which provides a well-established and direct approach to evaluate ERα function. As EDHR genes are defined by loss of hypoxic induction following ERα degradation, this constitutes a robust assessment of ERα-dependent effects.

      We will therefore focus on orthogonal validation through RT-qPCR (response__ 2.5__), together with additional mechanistic and functional analyses using IOX5 and ENaC perturbation (responses 2.1 and 2.3), rather than introducing an ERα knockdown approach, although we would consider this if required.

      3.2c

      iii) A complementary test with a HIF-1α or HIF-2α knockdown at one time point would help position EDHR relative to HIF.

      This request aligns with point 2.3, which addresses the convergence of ERα and HIF signalling. While HIF knockdown under hypoxia would assess necessity, we will instead assess the contribution of HIF signalling using the selective PHD inhibitor IOX5, as this allows us to isolate HIF stabilisation from broader hypoxia-associated effects and avoids additional perturbation associated with transfection-based approaches. We will perform RT-qPCR analysis of SCNN1B and SCNN1G following treatment ± IOX5 ± fulvestrant to determine whether HIF stabilisation is sufficient to support ERα-dependent induction of EDHR genes.

      3.3

      The amiloride result is intriguing and consistent with a hypoxia-specific dependency. Because amiloride is pleiotropic, it would strengthen the conclusion to add one genetic and one pharmacological specificity control. A brief SCNN1B or SCNN1G knockdown in hypoxia should phenocopy the viability effect if ENaC contributes. In parallel, testing benzamil at sub-micromolar doses would provide a more ENaC-selective pharmacological readout. These can be performed in MCF7 and, resources permitting, in T47D.

      To address the reviewer’s concern regarding pleiotropic effects, we propose (aligning with our__ response to 2.1__) to apply siRNA-mediated knockdown of SCNN1B and SCNN1G under hypoxia to determine whether this reproduces our observed viability effect, thereby providing direct evidence for ENaC involvement.

      We agree that additional pharmacological validation could further support specificity, and would consider inclusion of a more ENaC-selective inhibitor if required.

      3.4

      The RFS associations for

      SCNN1B and SCNN1G are compelling. It would be helpful to report whether the associations persist in a multivariable model that at least includes ER status, grade and nodal status where available, or to state clearly when this is not possible across merged datasets. Even a sensitivity analysis in TCGA with ER+ cases only would contextualise the hazard ratios.

      We have analysed TCGA and METABRIC cohorts individually using Cox proportional hazards models, as this functionality is not available for merged datasets in KMplot. ER status was included in the models, and analyses were additionally stratified by ER-positive and ER-negative subgroups. The number of relapse events per subgroup is approximately 40; therefore, additional covariates such as grade and nodal status were not included given the limited number of events per model.

      Within ER-positive patients, high SCNN1G expression is associated with poorer relapse-free survival (TCGA HR 1.45, p = 0.0027), while SCNN1B shows a similar trend that does not reach statistical significance. These analyses are presented in Figure 6C and in the results section of the preliminary revision, and support the findings from the Kaplan–Meier analysis.

      3.5

      The spatial association of EDHR with EMT hotspots is a nice piece of the story. A short clarification of how spot-level cell type composition was handled will help readers interpret proximity results. If cell type deconvolution scores are available in the source dataset, adding a sentence on whether EDHR enrichment tracks tumour epithelial content would be useful.

      Spatial cell type composition and spot annotations were used as provided in the SpottedPy dataset, based on Cell2location-derived deconvolution scores and STARCH tumour annotations, without additional re-estimation.

      To address the reviewer’s suggestion, we examined the relationship between EDHR enrichment and epithelial content and observed no significant correlation at the neighbourhood level.

      These points have now been clarified in the manuscript.

      3.6

      Data processing for ChIP-seq and RNA-seq is documented and accessions are provided. The RNA-seq includes n=3 per condition, which is appropriate, and the correlation and LFC analyses are clearly presented. For the amiloride assay, the two-way ANOVA with interaction is appropriate; please add the exact n and whether experiments were independently repeated, and include the underlying values in a source table for transparency. These are small presentational edits rather than new experiments.

      In the preliminary revision we have added a statement to the amiloride assay figure (Figure 6D) clarifying that n = 3 independent biological replicates were performed per condition. In addition, we now provide the underlying numerical values for this assay in Table S11.

      3.7

      A small, hypothesis-driven mechanistic link from EDHR to ENaC function would substantially elevate impact without becoming a long project. For example, testing whether hypoxia increases amiloride-sensitive Na⁺ current in MCF7 and whether fulvestrant abrogates that increase would directly connect the transcriptional and functional observations. If available, patch-clamp or a simple SBFI-based Na⁺ imaging readout could suffice.

      We agree that directly linking EDHR to ENaC channel activity would further strengthen the mechanistic connection. We will prioritise genetic validation of ENaC function through siRNA-mediated depletion (response 2.1), which directly tests the requirement for ENaC in the hypoxia-specific viability phenotype.

      We are willing to explore the feasibility of measuring the amiloride-sensitive Na+ currents under normoxia and acute hypoxia (via perfusion of cells with bathing solution bubbled with nitrogen during recording) ± fulvestrant to further connect hypoxic regulation to channel activity.

      Minor Comments

      3.8

      Please show representative ERα ChIP-seq browser snapshots for at least one gained, one conserved and one lost locus alongside input for both conditions.

      We have now included representative ERα ChIP-seq browser snapshots for gained, conserved, and lost loci, together with input controls for both conditions, in Figure S3 of the revised manuscript.

      3.9

      In Figure 1D, the ATAC-seq comparison uses 0.1% O₂ for 48 h while the RNA-seq uses 1% O₂. Briefly justify the choice and discuss any expected differences.

      We thank the reviewer for this point. The ATAC-seq dataset was generated under 0.1% oxygen in the original study, whereas RNA-seq experiments in this work were performed at 1% oxygen to reflect tumour-relevant hypoxic conditions. The more severe hypoxia used for ATAC-seq would be expected to maximise detection of chromatin accessibility changes. Despite this, chromatin accessibility changes were limited, with ERα binding occurring predominantly at pre-accessible regions. This has now been clarified in the manuscript.

      3.10

      In the Methods for spatial analyses, specify the thresholds for hotspot calling and how the neighbourhood radius was chosen.

      The neighbourhood parameter was set to 8, corresponding to the immediate neighbouring spots in Visium data, consistent with package guidance. We have clarified this in the manuscript text.

      3.11

      For the EDHR heatmap, consider marking the 14 consensus genes and indicating which belong to the ENaC module to aid readability.

      We have marked the 14 EDHR consensus genes and indicated the ENaC module in the revised heatmap to aid readability.

      3.12

      Please report exact sample sizes and replicate numbers in all figure legends and provide a single table with all statistical tests, n, and p values.

      We have reported exact sample sizes and replicate numbers in all relevant figure legends and included Table S11 summarising all statistical tests, sample sizes (n), and p values.

      3.13

      A schematic summarising the experimental timelines for ChIP-seq, RNA-seq and viability would help orient readers.

      We have added timelines for these experiments as requested.

      3.14

      Minor copyedits: consistent formatting of O₂, gene symbols and reagent catalogue numbers.

      We have standardised oxygen notation throughout the manuscript to use “oxygen” in the main text and “O2” where appropriate (e.g. figures).

      Reagent catalogue numbers have now been standardised for consistency of presentation in the revised manuscript.

      Gene and protein nomenclature were already formatted according to accepted conventions and were verified for consistency.

      3.15

      The manuscript is well referenced. Where you contrast your findings with long-term CoCl₂ hypoxia, a sentence on why acute DMOG and short-term 1% O₂ may reveal different ERα behaviours would help position the novelty.

      We thank the reviewer for this suggestion. We have expanded the manuscript to clarify that acute hypoxia (1% oxygen) and DMOG treatment capture early, dynamic hypoxic responses, in contrast to chronic CoCl2 exposure, which reflects longer-term adaptation. This distinction is relevant to tumour biology, where hypoxia is often transient due to unstable vascularisation. The following statement has been added to the manuscript:

      “In addition to such chronic hypoxic adaptation, tumour hypoxia can also be dynamic, with cells experiencing acute or transient hypoxic exposure due to unstable vascularisation; an established contributor to tumour progression (Liu et al, 2022a; Koh & Powis, 2012). Thus, in contexts where both signalling pathways remain active, the dependence of the hypoxic response on ERα in ER+ cells has not been previously characterised.”

      Primary Limitations

      3.16

      DMOG vs hypoxia in the cistrome experiment,

      To address concerns regarding the use of DMOG, we have validated key ERα binding events from the ChIP-seq dataset by ChIP–qPCR at the TFF1 and GREB1 loci under bona fide hypoxia (1% oxygen) in biological triplicate__ (Figure 1F)__. These data demonstrate consistent changes in ERα binding under hypoxia, supporting that the DMOG-induced redistribution reflects hypoxia-driven changes.

      3.17

      the absence of direct HIF or cofactor perturbations

      We acknowledge the absence of direct HIF perturbation. To address this, we will assess the contribution of HIF signalling through stabilisation approaches, including RT-qPCR analysis of SCNN1B and SCNN1G ± IOX5 ± fulvestrant (response 3.2), to determine whether HIF activation is sufficient to support ERα-dependent induction.

      3.18

      and the pleiotropy of amiloride.

      To address the potential pleiotropy of amiloride, we will perform siRNA-mediated knockdown of SCNN1G and SCNN1B to provide independent validation of ENaC-dependent effects (response 3.3).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This study explores how hypoxia reshapes ERα signalling in ER-positive breast cancer and whether this cross-talk exposes targetable vulnerabilities. The authors first map ERα binding in MCF7 cells after dioxygenase inhibition with DMOG and observe a genome-wide redistribution with enrichment of ERE, FOXA1 and AP-1 motifs at gained sites while chromatin accessibility at these loci appears unchanged in public ATAC-seq after hypoxia. They then perform RNA-seq in MCF7 and T47D using a factorial design that combines fulvestrant-mediated ERα degradation with 1% O₂ to define an ERα-dependent hypoxia response (EDHR). A 14-gene consensus EDHR signature includes ENaC regulatory subunits SCNN1B and SCNN1G, whose higher expression is associated with poorer RFS in ER+ cohorts. Functionally, amiloride increases viability in normoxia but reduces viability under hypoxia in MCF7 across a dose range. Spatial transcriptomics from ER+ tumours shows EDHR expression enriched at the margins of hypoxia and estrogen-hallmark regions and adjacent to EMT hotspots. Raw data and code availability are stated for the central datasets and accessions are provided. Together the results argue that ERα helps organise a distinct hypoxic programme and suggest a context-specific sensitivity to ENaC inhibition.

      Major comments

      The paper addresses a timely question with a clear narrative arc and brings together ChIP-seq, RNA-seq, pharmacology, survival analysis and spatial transcriptomics. The EDHR concept is interesting and the ENaC angle is original. The work is already strong and with a few targeted additions and clarifications it can be made more persuasive without becoming a new project.

      1) The DMOG ChIP-seq provides a valuable first look at ERα redistribution. Since DMOG inhibits both HIF hydroxylases and oxygen-dependent demethylases, the driver of the observed changes remains ambiguous. It would help to include either ERα ChIP-seq under bona fide hypoxia or a selective PHD inhibitor condition (for example IOX5, as you discuss) to separate HIF stabilisation from broad demethylase inhibition. If ChIP-seq is not feasible, a brief ATAC validation at a small panel of gained and lost loci would still increase confidence. Estimated time: 6-8 weeks for a focused follow up with two conditions and biological duplicates/triplicates.

      2) The factorial RNA-seq is well designed and the attenuation analyses are clear. The EDHR selection is stringent and reproducible across two ER+ lines. To support the claim of ERα dependence mechanistically, a small number of targeted perturbations would go far. For example, confirm EDHR induction for SCNN1B and SCNN1G in hypoxia with and without fulvestrant by RT-qPCR and test whether short-term ERα knockdown reproduces the effect. A complementary test with a HIF-1α or HIF-2α knockdown at one time point would help position EDHR relative to HIF. Estimated time: 3-4 weeks for qPCR and siRNA validations.

      3) The amiloride result is intriguing and consistent with a hypoxia-specific dependency. Because amiloride is pleiotropic, it would strengthen the conclusion to add one genetic and one pharmacological specificity control. A brief SCNN1B or SCNN1G knockdown in hypoxia should phenocopy the viability effect if ENaC contributes. In parallel, testing benzamil at sub-micromolar doses would provide a more ENaC-selective pharmacological readout. These can be performed in MCF7 and, resources permitting, in T47D. Estimated time: 4-6 weeks.

      4) The RFS associations for SCNN1B and SCNN1G are compelling. It would be helpful to report whether the associations persist in a multivariable model that at least includes ER status, grade and nodal status where available, or to state clearly when this is not possible across merged datasets. Even a sensitivity analysis in TCGA with ER+ cases only would contextualise the hazard ratios. Estimated time: 1-2 weeks.

      5) The spatial association of EDHR with EMT hotspots is a nice piece of the story. A short clarification of how spot-level cell type composition was handled will help readers interpret proximity results. If cell type deconvolution scores are available in the source dataset, adding a sentence on whether EDHR enrichment tracks tumour epithelial content would be useful. Estimated time: 1 week.

      Reproducibility and statistics

      Data processing for ChIP-seq and RNA-seq is documented and accessions are provided. The RNA-seq includes n=3 per condition, which is appropriate, and the correlation and LFC analyses are clearly presented. For the amiloride assay, the two-way ANOVA with interaction is appropriate; please add the exact n and whether experiments were independently repeated, and include the underlying values in a source table for transparency. These are small presentational edits rather than new experiments.

      Optional

      A small, hypothesis-driven mechanistic link from EDHR to ENaC function would substantially elevate impact without becoming a long project. For example, testing whether hypoxia increases amiloride-sensitive Na⁺ current in MCF7 and whether fulvestrant abrogates that increase would directly connect the transcriptional and functional observations. If available, patch-clamp or a simple SBFI-based Na⁺ imaging readout could suffice. Estimated time: 6-8 weeks.

      Minor comments

      1. Please show representative ERα ChIP-seq browser snapshots for at least one gained, one conserved and one lost locus alongside input for both conditions.
      2. In Figure 1D, the ATAC-seq comparison uses 0.1% O₂ for 48 h while the RNA-seq uses 1% O₂. Briefly justify the choice and discuss any expected differences.
      3. In the Methods for spatial analyses, specify the thresholds for hotspot calling and how the neighbourhood radius was chosen.
      4. For the EDHR heatmap, consider marking the 14 consensus genes and indicating which belong to the ENaC module to aid readability.
      5. Please report exact sample sizes and replicate numbers in all figure legends and provide a single table with all statistical tests, n, and p values.
      6. A schematic summarising the experimental timelines for ChIP-seq, RNA-seq and viability would help orient readers.
      7. Minor copyedits: consistent formatting of O₂, gene symbols and reagent catalogue numbers.

      Prior studies

      The manuscript is well referenced. Where you contrast your findings with long-term CoCl₂ hypoxia, a sentence on why acute DMOG and short-term 1% O₂ may reveal different ERα behaviours would help position the novelty.

      Significance

      General assessment

      The strongest aspects are the carefully designed factorial RNA-seq that cleanly separates ERα and hypoxia effects, the discovery of a concise EDHR signature reproducible across two ER+ lines, and the integration with spatial transcriptomics that places EDHR near EMT-rich tumour regions. The ENaC connection is new and potentially actionable, and the context-dependent amiloride response is a practical lead. Limitations are primarily mechanistic: DMOG vs hypoxia in the cistrome experiment, the absence of direct HIF or cofactor perturbations, and the pleiotropy of amiloride.

      Advance

      To my knowledge, this is the first description of a distinct ERα-dependent hypoxic programme in ER+ breast cancer that includes ENaC regulatory subunits and links to an EMT-adjacent spatial niche. The conceptual advance is the positioning of ERα as a coordinator of a subset of hypoxia-induced genes rather than as a parallel pathway, together with an initial functional readout that suggests a therapeutic angle through ENaC modulation. With the targeted additions outlined above, the study would move from strong association to a more mechanistic and translationally relevant model.

      Audience

      The work will interest a specialised audience in nuclear receptor biology, hypoxia signalling, tumour microenvironment, and ion transport in cancer. It has potential relevance for basic researchers studying ERα cistrome dynamics, for groups using spatial transcriptomics to define micro-niches, and for translational researchers exploring metabolic and ionic vulnerabilities in ER+ disease.

      Expertise disclosure

      Keywords: nuclear receptors,, chromatin profiling, transcriptomics, spatial transcriptomics, breast cancer biology.

      I am not a domain expert in ion channel electrophysiology; my comments on ENaC pharmacology focus on specificity and study design rather than detailed channel biophysics.

      Tone

      I find the paper well conceived and already compelling. The suggested experiments are focused, realistic in scope, and primarily aim to turn several strong associations into concise mechanistic statements that would further increase confidence and impact.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      ERα drives most luminal breast cancers. However, how hypoxia reshapes ERα activity and how ERα itself might influence the hypoxic response remain unclear. Understanding this interaction is crucial, as hypoxia is strongly linked to endocrine resistance and poor outcomes. In this study, authors investigated how hypoxia modifies ERα signalling in ER+ breast cancer and whether ERα contributes to the transcriptional response to low oxygen. Using MCF7 and T47D cells, they combined genome-wide profiling of the ERα cistrome under DMOG, hypoxic transcriptomics with or without ERα degradation, and spatial transcriptomics in tumours. This revealed an ERα-dependent hypoxic response (EDHR), prominently involving regulation of epithelial sodium channel (ENaC) subunits, whose expression requires both hypoxia and active ERα signalling. Functionally, ENaC inhibition with amiloride reduced cell viability under hypoxia. Together, these findings uncover a previously unrecognised ERα-dependent layer of the hypoxic transcriptome and identify ENaC as a potential therapeutic vulnerability in hypoxic ER+ breast cancer. Although the study is interesting, the manuscript lacks several essential functional and experimental validations. ENAC is proposed as a therapeutic vulnerability based on amiloride sensitivity assays. Additional experiments are required, such as western blot validation of ENaC regulation under hypoxia and loss-of-function approaches to assess its contribution to the phenotype. Fulvestrant is used to dissect ERa dependency. However, as a SERD, it may alter chromatin and transcription independently of a simple loss of ERα. Addition control would strengthen interpretation. The molecular mechanism by which ERα modulates the hypoxic transcriptome, specifically how ERα and HIF pathways converge at ENAC loci should be more studied. In addition, to assess the relevance of this work for luminal breast cancer and ERα expression, specific validation in TNBC should be performed Finally, the authors should provide RT-qPCR validation of the key EDHR genes, especially since this signature is later used for downstream analyses.

      Significance

      General assessment strengths:

      This study uncovers a previously unrecognised ERα-dependent hypoxic response in breast cancer, revealing that ERα actively shapes the hypoxic transcriptome rather than functioning as an isolated pathway. To me, the main strength of this work is the identification of ENaC as a novel hypoxia-specific therapeutic vulnerability in ER+ breast cancer, suggesting that ion-channel regulation may play a broader and underappreciated role in endocrine resistance.

      Limitation:

      Reprogramming of the ERα cistrome under cellular stress is well documented. The study extends these ideas but does not clearly demonstrate a new mechanistic paradigm, particularly because the EDHR is defined primarily through omics approaches without strong mechanistic validation. In addition, we have to keep in mind that the study uses DMOG to model hypoxia-driven chromatin changes, but DMOG inhibits many 2-oxoglutarate-dependent dioxygenases non-selectively. This makes it difficult to attribute ERα cistrome reprogramming specifically to hypoxia, rather than to broad off-target effects. The transcriptomic dataset is more convincing by need the validation suggested previously.

      Audience:

      Given its reliance on omics datasets and preliminary functional assays, the paper will likely appeal to a specialized audience in transcriptional regulation, hypoxia signalling, and ER+ breast cancer biology. However, the limited mechanistic depth and uncertain translational relevance due to the lack of in vivo validation, may reduce its impact for broader oncology or therapeutic-development audiences. Without stronger validation, the findings may be perceived as niche and mainly of interest to researchers focused on ERα chromatin dynamics rather than to the wider cancer research community.

      Expertise:

      My evaluation is based on my background in breast cancer, ERα signaling and breast tumorigenesis. However, I have limited expertise in spacial transcriptomic analyses and advanced CHiP-seq bioinformatic analyses, which may affect my assessment of some computational analyses.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Malcom et al. present evidence that, under hypoxic conditions, hypoxia-inducible factors (HIFs) alter the estrogen receptor alpha (ERα) epigenomic landscape in a model of estrogen receptor-positive (ER+) breast cancer (BrCa). The response of ER+ BrCa to estradiol (E2) in MCF7 (ER+) cells, as well as ERα signaling in both primary and metastatic breast cancer, has been well studied, and the epigenomic landscape of ERα+ BrCa is well documented. The differentially expressed genes (DEGs) identified under treatment with the hypoxia mimetic dimethyloxalylglycine (DMOG) revealed a subset of ERα-dependent hypoxic response (EDHR) genes. The outcome was a reprogramming of the basal ERα cistrome, coinciding with sites enriched for estrogen response elements (EREs) and co-transcription factor binding motifs for ERα, including FOXA1 and AP-1. This was demonstrated by ERα ChIP-seq (i.e. DMOG) and ATAC-seq (i.e. 1% O2) performed under different hypoxic conditions. The transcripts identified following DMOG treatment were leveraged and compared to publicly available RNA-seq datasets from various breast cancer subtypes exposed to 1% hypoxic oxygen. Although the comparison methods varied, the results suggested that BrCa cell lines under 1% hypoxic oxygen conditions showed strong similarity to MCF7 cells treated with DMOG. Genes upregulated in response to DMOG correlated with poorer survival outcomes. To demonstrate the requirement for ERα in this model, MCF7 cells were treated with the selective estrogen receptor degrader (SERD) fulvestrant-the only FDA-approved SERD for ER+ BrCa-showing a dampening of the HIF response among EDHR genes. This suggests that ERα is necessary for the expression of DEGs under hypoxic conditions induced by DMOG. Finally, the sodium channel protein ENaC subunits (i.e., SCNN1B and SCNN1G) were further characterized as candidate EDHR genes. Analyses of publicly available datasets indicated that high mRNA expression levels of these subunits were associated with worse survival outcomes, supporting the clinical relevance of EDHR genes SCNN1B and SCNN1G. To further validate clinical relevance, utilize the Spatial Transcriptome in a small subset of ER+ BrCa.

      Major:

      1. Overall, the study lacks well-controlled experiments comparing hypoxia induced by DMOG with hypoxia induced by 1% O₂ for assessing ERα occupancy throughout.
      2. Lack of evidence for other co-transcription factors impact under hypoxia HIF's in Fig1.
      3. Lack of evidence for DMOG induce HIF protein expression in MCF7 cells.
      4. Figure 1: ATAC-seq was performed under 1% O₂, whereas ChIP-seq was conducted with DMOG treatment, making these conditions not directly comparable.
      5. Figure S1: ERα ChIP lacks estradiol (E2) treatment in MCF7 cells with or without DMOG. The single-gene examples of DMOG effects shown in Fig. S1A are not significant.
      6. Figures S2 and S3: Fig. S2 lacks 1% O₂ conditions, and Fig. S3 lacks DMOG-induced HIF factor assessments.
      7. Figure S4: Estradiol (E2) treatment is missing from the controls, and the figure labeling is of poor quality. Hypoxic conditions for assessing ER status and appropriate controls are also lacking.
      8. Figure S5: The description of fulvestrant treatments under hypoxic conditions is unclear.
      9. Supplemental legends: These require major revision; they are of poor quality and lack statistical details and references to biological replicates.

      Minor:

      1. Overall comparisons throughout the manuscript are weak; the figures appear sloppy and lack sufficient effort in presentation.

      Significance

      In general, the manuscript in its present form does not greatly contribute from published work as the ERα cistrone is well documented work studied for its role in regulating gene expression, particularly in ERα-positive breast cancer. Additionally, a lack of a through comparison between DMOG and or 1 %O2 induce hypoxia in the MCF7 ER+ model, diminished initial interest in the manuscript. The lack of considering estradiol exposure under hypoxic conditions with either 1%O2 and or DMOG also limits relevance to patients with ER+ BrCa. The ERα epigenomic profile has been extensively studied including work under hypoxic conditions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We thank to all reviewers on their careful consideration of our manuscript. We highly appreciate their thoughtful comments and suggestions, that helped us to improve the quality of our work. We address each comment point-by-point below.

      2. Description of the planned revisions

      __Reviewer #1 __

      Minor comments:

      Figure 5 would be more informative if it included more higher magnification images that would reveal the staining at the cellular level.

      To fulfil the suggestion, we will perform a new round of immunostaining followed by high-resolution confocal imaging. This requires additional time for laboratory work.

      __Reviewer #2: __

      Major comments

      1d. The authors tried to attribute the minor phenotype to the incomplete depletion of S100A4+ cells. However, it is possible that if the S100A4+ cells only represented a minor population, their function may be compensated by other populations. This might be confirmed by quantification of S100A4+ cells or S100A4-Cre; GFP+ cells in fibroblast or CD45 populations from images showed in Figure 5.

      We will address this comment by performing required quantifications.

      Moreover, we have now included data on the presence of S100A4+ cells in S100a4-Cre;DTA mice (Figure for Reviewers 5a,b; Supplementary Figure 7a,b in the revised manuscript), which demonstrate incomplete depletion of the S100A4+ cells in the nipple and the mammary gland. This is likely due to ongoing tissue remodeling and continuous S100A4+ replenishment/ supply. Another study using the same S100a4-Cre;DTA mouse model reported an efficient S100A4+ cell depletion in mandibular condyle (Tuwatnawanit et al., 2025), which suggests that the presence of S100A4+ cells in the S100a4-Cre;DTA mammary gland and nipple is due to tissue-specific dynamics rather than lack of depletion efficiency.

              We have included in Discussion: “Notably, we observed incomplete depletion of S100A4+ cells in the mammary gland and nipple. Interestingly, a study using the same S100a4-Cre;DTA mouse model reported complete S100A4+ cell depletion in the superficial layer of mandibular condyle46. This suggests that incomplete depletion of S100A4+ cells in nipple and mammary gland is due to tissue-specific dynamics, rather than lack of depletion efficiency, indicating a compensatory mechanism that can balance the cell loss.”
      

      The images in Figure 5 and Figure S4 are difficult to confirm colocalization. A higher magnification image would be required for each panel. Furthermore, a precise quantification based on the current images would be more supportive of the conclusion regarding the discrepancy of the composition of S100A4 lineage between epidermis and mammary gland (lines 163-165).

      To address this comment, we will perform a new round of immunostaining and high-resolution confocal imaging and quantifications and include the results in the fully revised manuscript.

      Line 163, the author hypothesis the Langerhans cells due to morphology. Those cells should be able to be confirmed by a co-staining with F4/80 in addition to the current form of Fig 5h.

      To address this comment, we will perform co-staining of GFP and F4/80 (or, eventually, AIF1, depending on antibody availability) and include the results in the fully revised manuscript.


      Reviewer #3

      Minor comments

      Figure 2c: The H&E images are not fully convincing. Immunofluorescence analysis of epithelial architecture would support the authors' interpretation and should be feasible if tissues are already available.

      We will perform immunostaining for epithelial markers, such as keratins, and include the results in the fully revised manuscript.

      Figure 4f: The proliferation data are compelling, but the authors could extend this by examining how cell differentiation and epithelial organisation are affected.

      We will perform immunostaining for epithelial markers (keratins, αSMA) and include the results in the fully revised manuscript.

      Figure 5b: To more convincingly show that GFP+ cells contact endothelial cells, co-labelling with an endothelial marker such as CD31 would be helpful.

      We will perform the requested co-labeling of GFP and CD31 and include the results in the fully revised manuscript.

      Figure 5f-h: The structures referenced in the text (lines 159-163) should be clearly indicated on the immunofluorescence images.

      We will incorporate these explanations into the new, high-resolution/detailed Figure 5 in the fully revised manuscript.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1:

      Major comments

      1. It is rather difficult to conclude whether the observed nipple phenotype reflects an early embryonic/prepubertal defect in establishing the nipple stroma, is caused by a constitutive response to ongoing cell death, or a response to continuous DTA expression (or a combination of some of these).

      The data raise a couple of additional questions: Is there a nipple phenotype at 3 wk of age? It would not be totally unsurprising if ablation of a major fraction of dermal fibroblasts in the nipple area would lead to an early embryonic/prepubertal phenotype but there is no data on this. Hence, is there a "congenital" nipple deformity, as concluded by the authors (line 191)?

      We appreciate the reviewer’s insightful comments. We have now included data on embryonic nipple development. These data demonstrate abundant S100A4-lineage cells in E15.5 and E18.5 skin of S100a4-Cre;mT/mG embryos (Figure for Reviewers 1a, corresponding to Figure S3a in the revised manuscript) and normal appearance of nipple sheath in S100a4-Cre;DTA embryos at E18.5 (Figure for Reviewers 1b, corresponding to Figure S3b in the revised manuscript), suggesting no embryonic defect.

      Unfortunately, we cannot provide data on 3-weeks old mice (we have not collected this timepoint previously and currently we do not have this mouse line alive). Instead, however, we provide in situ pictures of DTA and S100a4-Cre;DTA nipples at 7 weeks of age (Figure for Reviewers 1c; Figure S3c in the revised manuscript), which demonstrate that the phenotype of defective nipple is fully established at this timepoint. Because the late embryonic data did not support the “congenital” establishment of the nipple deformity and we could not provide any more data from early postnatal development, we have corrected the statement “we describe a congenital nipple deformity” in the discussion to “we describe a nipple deformity”.

      Are there S100a4+ cells in the nipple area of pubertal S100a4-Cre/DTA mice? I.e. is there a continuous supply of new S100a4+ cells and thereby continuous cell death and DTA expression as one might expect based on the RNA-seq data?

      The S100A4+ cells are present in the nipple area of S100a4-Cre;DTA mice, suggesting a continuous supply of new S100A4+ cells (Figure for Reviewers 1b, corresponding to Figure S3b in the revised manuscript; and Figure for Reviewers 5a,b, corresponding to Figure S7a,b in the revised manuscript). In the revised manuscript, we comment on this in Discussion: “Notably, we observed incomplete depletion of S100A4+ cells in the mammary gland and nipple. Interestingly, a study using the same S100a4-Cre;DTA mouse model reported complete S100A4+ cell depletion in the superficial layer of mandibular condyle46. This suggests that incomplete depletion of S100A4+ cells in nipple and mammary gland is due to tissue-specific dynamics, rather than lack of depletion efficiency, indicating a compensatory mechanism that can balance the cell loss.”

      Figure for Reviewers 1 (Figure S3 in the revised manuscript): Embryonic and pubertal nipple phenotype. (a) Representative images of cleared whole-mount S100a4-Cre;mT/mG nipple tissue at embryonic developmental time-points: E15.5 and E18.5. Scale bar = 100 µm. (b) Immunofluorescent labeling for S100A4 on embryonic DTA and S100a4-Cre;DTA whole-mount skin (E18.5). Scale bar = 100 µm. (c) Representative in situ photographs of nipples from DTA and S100a4-Cre;DTA pubertal (7-weeks old) mice. Scale bar = 1 mm.

      The subtitle on line 54 implies that that S100a4-Cre/DTA mice display a branching phenotype. However, it looks to me as if there is a pubertal outgrowth defect (as is also written in the body text, line 64) rather than a branching phenotype, potentially reflecting the much smaller size of S100a4-Cre/DTA mice (Fig. 2a). Unless there is a change in branch point frequency, I suggest rephrasing the title and discussion. Instead, I suggest the authors discuss the observed outgrowth delay considering the gross overall growth defect (Fig. 2a). If ductal outgrowth was normalized to the overall growth defect, would one still observe 'a delay in branching morphogenesis'?

      We apologize for the section title confusion. We have analyzed branching frequency in 7-weeks-old females and observed reduced total number of branching points in S100a4-Cre;DTA mice (Figure for Reviewers 2a, corresponding to Figure 2f in the revised manuscript). A significant difference in number of branching points remained also after their normalization to body weight, (Figure for Reviewers 2c, corresponding to Figure 2h in the revised manuscript). We have now added the new quantifications to the revised manuscript with accompanying descriptions in the main text “Analysis of mammary epithelial development using whole-mount carmine staining revealed no significant differences in the prenatal establishment of the mammary epithelial tree but did reveal significantly delayed epithelial outgrowth and reduced branching in pubertal (7 weeks old) S100a4-Cre;DTA mice (Figure 2e,f). Normalization of epithelial outgrowth and branching to body weight indicates that the observed defect represents a mammary-specific impairment rather than a consequence of reduced body growth (Figure 2g,h).”.

      __Figure for Reviewers 2 (Figure 2 in the revised manuscript): __Pubertal branching morphogenesis is delayed in S100a4-Cre;DTA. (a-c) The plots show total number of branching points (a), epithelial outgrowth [mm] normalized to body weight [g] (b), and total number of the branching points normalized to body weight [g] (c) in 7 weeks old DTA and S100a4-Cre;DTA mice. All plots show the mean ± SD, *p

      Fig. 4e shows Masson's Trichrome and Picrosirius Red staining and the authors report the findings as follows (lines 120-124): "collagen fibers were loosened in the DTA nipples and more densely packed in the S100a4-Cre;DTA nipples". Perhaps the authors could help non-specialists to observe the loosened fibers and if they wish to make quantitative statements ("more densely packed"), such statements should be backed-up by quantifications.

      Picrosirius Red staining viewed under polarized light is a classic way to assess collagen organization, thickness, and packing. Red / orange / yellow color typically marks thicker, more mature, and more tightly packed collagen fibers (often associated with type I collagen), while green color usually marks thinner, less organized, or less densely packed fibers (often associated with type III collagen or immature collagen). We had included this explanation in the Figure legend of the submitted manuscript already: “Typically, thicker collagen fibers exhibit stronger birefringence and appear red or orange, while thinner fibers exhibit weaker birefringence and appear green or yellow.” To help with the quantification, we have extracted the red channel and quantified color intensity. The results are shown in Figure for Reviewers 3, corresponding to Figure S4 in the revised manuscript. Moreover, we will also quantify the differences in pattern of the collagen fibers. The fibers in DTA nipples look shorter and more curved, while the fibers in S100a4-Cre;DTA nipples look longer and straighter, more aligned. The results will be included in the fully revised manuscript.

      Figure for Reviewers 3 (Figure S4 in the revised manuscript): Collagen fibers are densely packed in S100a4-Cre;DTA nipples contain more . (a) Representative pictures of histological sections of DTA and S100a4-Cre;DTA stained for collagen by Picrosirius red. Polarized light images and the red channel (mature/densely packed collagen) are shown alongside detail pictures of selected regions A and B. Scale bar = 200 µm and 100 µm (in detail pictures). (b) Quantification of Intensity Mean Value for the red channel (densely packed collagen), showing statistically non-significant difference. The plot shows the mean ± SD, ns p > 0.05 (Mann-Whitney test), n = 3 DTA / 4 S100a4-Cre;DTA.

      I found the Discussion on the various mouse models somewhat problematic. Overall, the paper is written is a way that it often remains unclear whether it refers to studies addressing the role of S100a4 itself, studies addressing the function of S100a4+ cells via ablation approaches (S100a4-Cre or S10 0a4-CreERT2 crossed with floxed DTA), or those where S100a4-Cre has been used to delete gene X/Y/Z. These are all very different experimental approaches where one approach is not necessarily informative when trying to understand the results from another one. The authors should make these points clear and consider whether all their discussion points are relevant.

      We apologize for the confusion. We have carefully reviewed the references and their interpretations, and corrected them as necessary.

      The abstract states S100a4 (fibroblast-specific protein 1) is "expressed by mesenchymal cells and has been implicated in the development of eccrine glands, hair follicles, and mammary branching morphogenesis". However, the study on eccrine glands (ref. 19) shows that S100A4+ cells play a role in eccrine gland development but it does not address the role of S100a4 itself, while the study on hair follicles (ref.20) in turn reports the expression pattern of S100a4 in hair follicles but does not address its function, nor the role of S100a4+ cells. Finally, I failed to find references in the paper to studies addressing the role of S100a4, or S100a4+ cells in the mammary gland.

      Instead, the paper had references to studies where S100A4-Cre had been used to delete different genes and these mice had various mammary phenotypes - which, as indicated above, is a very different approach compared to deleting S100a4 or ablating S100a4+ cells.

      Thank you for your comment. We addressed the concern in the Abstract and further in the Discussion. We revisited the present the cited studies more carefully, clearly distinguishing the different approaches and particular findings.

      In our literature review, we also considered studies that used S100a4-Cre mouse model, to manipulate gene expression within S100A4+ cells. We believe that these studies bring indirect evidence of S100A4+ cell involvement in development and/or homeostasis of a tissue, such as mammary gland. Please, find the rephrased part of Abstract in the text, and below:

      “S100A4 (S100 calcium binding protein A4, also known as fibroblast-specific protein 1) is expressed by mesenchymal cells and has been associated with hair follicle regeneration. S100A4-expressing cells have been implicated in the development of eccrine glands, and studies using S100a4-Cre to manipulate gene function have suggested that S100A4-expressing cells may contribute to mammary branching morphogenesis.”

      __In Discussion (lines 197-200), __the authors write: "We described significant delay in mammary branching morphogenesis in puberty, confirming an important role for S100A4+ cells in mammary development, as it was previously described (refs 37-39)."

      It should be noted that none of these studies addressed the role of S100A4+ cells:

      • Ref 37 used S100a4-Cre to delete sharpin

      • Ref 38 used the same Cre line to delete Ptch1, did not address the role of S100a4 or S100a4 expressing cells

      • Likewise ref 39 deleted another gene using S100a4-Cre

      Later on in Discussion, the authors compare the reported phenotype to previous studies (lines 248-255): "...targeting S100A4+ cells through knockout experiments can result in severe phenotypes, such as a reduction in adipose tissue (ref 26), skin phenotypes, a disrupted estrous cycle, reduced fertility (ref. 38), and complete infertility, hypogonadism and defects in pituitary endocrine function (ref. 28).

      Of these, Ref. 26 used the same approach as the current study (S100a4-Cre; DTA) (Fig. 7A in the paper)

      • these mice were significantly lean, with markedly reduced fat compared with the control mice - also the mice in the current study are very small, so perhaps they could also be described as 'lean'. Yet ref. 26 reports that female mice had comparable food uptake, respiratory exchange ratio and physical activity, and slightly increased energy expenditure

      Ref. 38 (as mentioned above) reports deletion of Ptch1 using S100a4-Cre lines and these mice "displayed a disrupted estrous cycle and dramatically reduced fertility over 6.5 weeks". However, this has nothing to do with the approaches where Fsp1/S100a4+ cells are depleted with DTA. Likewise, reference 28 analyzed the phenotype of S00a4-Cre;Ptch1fl/fl mice. Obviously, deleting Ptch1 using S100a4-Cre mice is quite a different approach than "targeting S100A4+ cells" through knockout experiments". Ptch1 deletion leads to a combination of gain-of-function (of Hedgehog activation) and loss-of-function (loss of Hh-independent functions of Ptch1) and hence comparisons with these phenotypes is rather challenging. I suggest the authors focus their phenotype comparisons to ref. 26 where S100a4/Fsp1+ cells were ablated with DTA, i.e. the same approach as in the current study.

      Please, find the rephrased part of Discussion in the text (lines 236-256), and below:

      “A key consideration when interpreting studies involving S100A4 is that fundamentally different experimental approaches have been used to investigate its role. These include descriptive analyses of S100A4 expression, functional studies targeting the S100A4 protein itself, genetic models using S100a4-Cre to manipulate unrelated genes in S100A4-expressing cells, and ablation models such as S100a4-Cre;DTA, which deplete S100A4⁺ cells. These approaches are not equivalent and provide distinct types of information. In the present study, we specifically assess the consequences of ablating S100A4-expressing cells, and comparisons to other studies should therefore be interpreted within this context.

      Studies using S100a4-Cre to manipulate specific signaling pathways (e.g. Wnt or Hedgehog signaling via gene deletion) in S100A4-expressing cells have reported diverse phenotypes, including effects on fertility and endocrine function28,34. However, these phenotypes primarily reflect the consequences of pathway perturbations within S100A4-expressing cells rather than the role of S100A4⁺ cells themselves. This is fundamentally different from the ablation approach used here, which removes the S100A4⁺ cell population.

      In contrast, studies employing S100a4-Cre–driven DTA–mediated ablation represent a directly comparable approach. Such studies have reported systemic phenotypes, including reduced adipose tissue and altered metabolic parameters26, indicating that S100A4-expressing cells contribute to multiple aspects of tissue homeostasis. Consistent with these previous reports, S100a4-Cre;DTA mice used in our study were significantly smaller than their littermates. Our findings extend these observations by identifying a specific and previously unrecognized role for this cell population in nipple morphogenesis.”

      I find the Discussion is somewhat off the topic by starting with WHO recommendations on breastfeeding and linking this to observed mouse phenotype. Overall, the discussion is rather long and from time-to-time more like a literature review. I would recommend keeping the Discussion more succinct and focused.

      To improve the conciseness and focus of Discussion, we have deleted this part of text.

      **Referee cross-comenting**

      I agree with the comments of other reviewers. However, to me it seems that the analysis of S100a4 knockout mice would not be feasible within a reasonable timeframe and would represent a study of its own. My understanding was that the authors were not interested in S100a4 itself. Rather, S100a4-Cre was used as a tool to understand the importance of a certain (fibroblast) cell population for mammary gland morphogenesis.

      Indeed, our goal was to study the role of a specific cell population (S100A4+ cells) in mammary gland morphogenesis, not to study the role of S100A4 protein per se.

      Reviewer #1 (Significance (Required)): General assessment:

      This study reveals the importance of the S100a4+ cell lineage for nipple formation while showing the same cells are dispensable for mammary gland morphogenesis. The main limitation is that it remains unclear whether the observed nipple phenotype is derived from an early embryonic/prepubertal defect in establishing the nipple stroma, is caused by a constitutive response to ongoing cell death, or a response to continuous DTA expression (or a combination of some of these). Hence its relevance as a model of human inverted nipple condition remains rather speculative.

      Thank you for consideration of our work and valuable feedback. We did not intend to claim that S100a4-Cre;DTA mouse represents a model of human inverted nipple condition. However, considering morphological features, it might resemble it. We now rephrased the Discussion so it is clearer and more concise.

      Reviewer #2

      Major comments:

      1. My key concern is the discussion part. I think the authors need to re-organize/re-phrase the discussion part, it confused me a bit in terms of logic, phrases and interpretation of literatures.

      We have significantly re-organized and re-phrased the Discussion.

      Here are few examples:

      1. The lines 195-199 contain lot of repeated information

      We have rephrased the paragraph and removed repeated information. The new text can be found in lines 201-206 in the revised manuscript.

      1. The authors mentioned the studies in ref 26,28 and 38 using "targeting S100A4+ cells through knockout experiment can result in sever phenotypes". This is very misleading. Those studies using the same (or similar if the origin is different) S100A4-Cre line as the current study but induced the activation of Wnt and sHH signalling pathways, respectively. The observed phenotypes are largely due to the pathway function, rather than the S100A4 gene or normal S100A4+ cell itself. This is significantly differed from the current study.

      We apologize for the confusion; we have now rephrased our claims (lines 236-256):

      “A key consideration when interpreting studies involving S100A4 is that fundamentally different experimental approaches have been used to investigate its role. These include descriptive analyses of S100A4 expression, functional studies targeting the S100A4 protein itself, genetic models using S100a4-Cre to manipulate unrelated genes in S100A4-expressing cells, and ablation models such as S100a4-Cre;DTA, which deplete S100A4⁺ cells. These approaches are not equivalent and provide distinct types of information. In the present study, we specifically assess the consequences of ablating S100A4-expressing cells, and comparisons to other studies should therefore be interpreted within this context.

      Studies using S100a4-Cre to manipulate specific signaling pathways (e.g. Wnt or Hedgehog signaling via gene deletion) in S100A4-expressing cells have reported diverse phenotypes, including effects on fertility and endocrine function28,34. However, these phenotypes primarily reflect the consequences of pathway perturbations within S100A4-expressing cells rather than the role of S100A4⁺ cells themselves. This is fundamentally different from the ablation approach used here, which removes the S100A4⁺ cell population.

      In contrast, studies employing S100a4-Cre–driven DTA–mediated ablation represent a directly comparable approach. Such studies have reported systemic phenotypes, including reduced adipose tissue and altered metabolic parameters26, indicating that S100A4-expressing cells contribute to multiple aspects of tissue homeostasis. Consistent with these previous reports, S100a4-Cre;DTA mice used in our study were significantly smaller than their littermates. Our findings extend these observations by identifying a specific and previously unrecognized role for this cell population in nipple morphogenesis.”

      1. In the lines 253-255, why the author believe complete S100A4+ depletion would leads to the fatal of mouse? Is there study suggest that? Or have authors checked the expression of S100A4 in the S100A4-Cre;DTA model to confirm the efficiency?

      We have now included, also in response to other Reviewers’ comments, data on S100A4 expression in the S100A4-Cre;DTA model (Figure for Reviewers 5, corresponding to Figure S7 in the revised manuscript), and commented on these results in lines 257-262: “Notably, we observed incomplete depletion of S100A4+ cells in the mammary gland and nipple. Interestingly, a study using the same S100a4-Cre;DTA mouse model reported complete S100A4+ cell depletion in the superficial layer of mandibular condyle48. This suggests that incomplete depletion of S100A4+ cells in nipple and mammary gland is due to tissue-specific dynamics, rather than lack of depletion efficiency, indicating a compensatory mechanism that can balance the cell loss.”

      In Fig. 1, the authors described the impaired nursing capacity of S100A4-Cre;DTA dam. However, it seems the little size is also smaller (Fig 1a). Do authors have any explanation or hypothesis?

      Thank you for this insightful observation. It is well established that metabolic and nutritional condition directly affect female reproductive functions. Adult S100A4-Cre;DTA mice are generally smaller compared to their litter counterparts, potentially because of lower body fat content or other anatomic/metabolic condition that might negatively influence fecundity, for instance, lowering ovulation rate and/or embryonic survival. In support of this, earlier studies have reported a positive correlation between growth rate/body condition and litter size (Eisen & Durrant, 1980). Unfortunately, in the case of S100A4-Cre;DTA mice, we can only speculate about the possible explanations, as we do not have supporting data which could confirm it.

      In lines 181-184, the authors states "the results showed that the tissue reacted to a foreign chemical or an endogenous compound....." , which results are referring here? I could not find any inflammation related GO terms in figure 6b. It would be more accurate to specify them in lines 179-181, which appears to be a technical statement rather than a result in current form.

      Thank you for this comment. Indeed, there are no GO terms explicitly labeled as “inflammation” and “repair”; however, several GO terms are functionally related to these processes. Our interpretation was based on broader biological context rather the explicit annotation. To clarify this, we revisited the text and included GO terms that reflect the tissue response (lines 187-193).

      “The GO terms indicated that the tissue reacted to a foreign chemical or an endogenous compound (xenobiotic metabolic process, cellular response to xenobiotic stimulus, response to xenobiotic stimulus, epoxygenase P450 pathway), and responded to inflammation and repair (actin filament-based process, actin cytoskeleton organization; eicosanoid and lipid metabolic processes) (Figure 6b).”

      The lines 182-184 was not clear. Does the author refer the "nipple tissue response" in general as malfunction of development or inflammation and tissue repair as mentioned in the previous sentence? If the later cases, the authors should consider the failure of lactation might mimic the involution, which may cause the apoptosis and inflammation as well. This might be independent of the DTA expression.

      Thank you for raising this point. Indeed, in this line, we refer to ongoing tissue inflammation and repair. We also considered the hypothesis that the ejection incapability (and consecutive milk stasis) triggers involution. However, tissues were collected within a few hours after parturition, when only very early signs of involution, if any, would be detectable; therefore, we expect minimal influence of involution. To reflect this comment, we added new text to the Discussion (lines 272– 277). “The observed tissue response can be also associated with hallmarks of mammary involution, the process which is triggered by the milk stasis. However, the tissues were collected within few hours after parturition, when the effect of involution should be minimal53. Rather, we hypothesize that immune cell recruitment, and the upregulation of the lipid skin barrier might be caused in response to the continuous apoptosis of S100A4+ cells and their replacement.”

      Minor comments:

      1. The authors demonstrated in Figure S1 and lines 92-96 that no significant differences were observed in pituitary glands and ovaries in S100a4-Cre:DTA and DTA mice. Have the authors checked the S100A4 expression or lineage cells in these organs, or have been reported by others?

      Yes, we checked the S100A4-lineage cells in the pituitary gland and ovary and have now included the results here (Figure for Reviewer 4a,b corresponding to Figure S1a,b in the revised manuscript), along with relevant text description (lines 94-95 in the revised manuscript). “We observed S100A4-lineage traced cells in pituitary gland and ovaries using S100a4-Cre;mT/mG model (Figure S1a,b).” The presence of S100A4+ cells in these organs was also reported previously (Ren et al., 2019).

      Figure for Reviewers 4 (Figure S1 in the revised manuscript): S100A4-lineage cells are abundant in the pituitary gland and ovary. (a) Representative images of a cleared whole-mount pituitary gland from a S100a4-Cre;mT/mG mouse. (b) Representative images of a cleared whole-mount ovary from a S100a4-Cre;mT/mG mouse. Scale bar = 100 µm.

      The authors have performed live imaging to evaluate the contraction of alveoli. It would be better to include a video together with the snapshots showed in Figure S2.

      We have included the videos as supplementary movies, Movie S1 (DTA) and Movie S2 (S100a-Cre;DTA).

      Since the study is mainly using S100a4, it would be better to avoid using FSP1 in the results, for example Fig 5h.

      We apologize for this oversight; it has now been corrected.

      What does L1 stand for? Lactation Day 1? It should be spelt out in the first instance.

      Yes, indeed, L1 is lactation day 1. Please note that it was already spelled out in the first version of the manuscript, now in line 48.

      Line 150. Figure S4 should be Figure S4a.

      (Please note, that by adding new Supplementary figures, this comment is referring to Figure S6 in the new version of manuscript.) Thank you for this comment. In the text, we state “GFP+ cells were spread throughout the fat pad but were also localized in the periepithelial stroma and infiltrated the epithelium”. This we show in Figure S6a and in S6b; therefore, we now changed the reference accordingly, as it might be more accurate.

      **Referee cross-comenting**

      I agree with the other reviewers, as well as the Consultation Comments. The manuscript would benefit greatly from a thoroughly optimised Discussion section to address issues raised by all reviewers.

      __ Reviewer #2__ (Significance (Required)):

      • Overall, this study is well designed and the key findings are valid, especially the role of S100A4 during nipple development is novel and interesting.

      -One limitation of the study is that RNA-seq was performed using a mixture of all cell types present in the nipple. While this approach is reasonable-given that depletion of the S100A4+ lineage may exert both direct and indirect effects contributing to nipple dysfunction-it should be more clearly acknowledged and discussed in the manuscript. Additionally, this experimental design may limit the utility of the dataset for other researchers interested in nipple development and the specific functions of S100A4.

      Reviewer #3

      Major comments:

      2) The differential systemic versus mammary-specific effects of DTA-mediated S100A4 cell ablation are intriguing. The authors should address why the mammary fat pad appears unaffected.

      Thank you for this comment. The role of S100A4+ cells in adipose tissue was previously reported (Zhang et al., 2018). Authors reported significantly smaller adipose tissue of S100a4-Cre;DTA mice (males and females), measured as the weight of the dissected fat pad. In our work, we measured the in-situ area of the fat pad, which appeared to be unaffected. It is possible that the volume (weight) of the fat pad would be different, however we do not have data to confirm / reject this hypothesis.

      Are S100A4 expressing cells present during embryonic mammary development, or are they mainly postnatal? Would an inducible S100A4CreERT model lead to similar phenotypes, or might the timing of depletion influence the outcome? Discussing these points would reinforce the conclusions regarding the contribution of S100A4-expressing cells to mammary and nipple development and could also clarify the transient nature of the ductal branching phenotype.

      S100A4-expressing cells are present during embryonic mammary development, too. Please, refer to the embryonic lineage-tracing time-points incorporated in the first version of the manuscript (Figure 5a and Figure S6a). Now, we have added Figure for Reviewers 1 corresponding to Figure S3 in the revised manuscript), which focuses on the embryonic nipple phenotype but also provides information on the presence of S100A4+ cells.

      We agree that the use of inducible S100a4-CreERT model could potentially bring new insights toward developmental stage-specific roles of S100A4+ cells, and thus would be interesting to use in a follow-up study. Currently, such experiments are beyond our capacity.

      Therefore, we have included a new subsection on Limitations of the study, where we comment:

      “A major limitation of this study is that the timing of DTA-mediated cell depletion cannot be precisely defined in the constitutive mouse model employing S100a4-Cre because recombination may occur continuously following the initial expression of S100a4 (E8.518). This limitation could be overcome by usage of inducible S100a4-CreERT instead. With this approach, it could be more feasible to determine if the nipple deformity arises as a defect of embryonic development or postnatal morphogenesis.”

      3) Although the authors attribute lactation failure primarily to defects in nipple architecture, the RNA seq data reveal downregulation of key milk production genes and luminal differentiation keratins, strongly suggesting impaired secretory activation. The authors should more explicitly discuss the relative contributions of epithelial functional maturation defects versus nipple structural abnormalities to the lactation failure observed upon S100A4+ cell depletion. Thank you for this comment. We believe that performing an immunofluorescence labeling of epithelial architecture (requested in the Minor comment 2) could bring more light into this. However, we deduce that secretory activation is not impaired, as the presence of the milk observed on in situ wholemounts, and H&E-stained alveoli (Figure 3d) implies luminal secretion of milk components. The observed phenotype of the lactating mammary gland strongly suggests there is a structural abnormality inhibiting the milk ejection.

      The downregulation of key milk production genes and luminal keratins in the bulk RNA-seq data may be influenced by differences in tissue composition between samples. In control mice, more fully developed nipples and an extended ductal network likely contribute to a greater representation of differentiated luminal epithelial cells, thereby increasing the expression of these markers.

      Minor comments:

      1. Figure 1: Including an immunohistochemistry or immunofluorescence control confirming depletion of S100A4 expressing cells would strengthen the conclusions.

      We have now included Figure for Reviewers 5 that corresponds to Figure S7 in the revised manuscript and comment on the results in sections Results (lines 169-171) and Discussion (lines 257-262).

      In Results: “Interestingly, S100A4 antibody labeling revealed presence of S100A4+ cells in S100a4-Cre;DTA tissues (Figure S3b, Figure S7a,b).”

      In Discussion: “Notably, we observed incomplete depletion of S100A4+ cells in the mammary gland and nipple. Interestingly, a study using the same S100a4-Cre;DTA mouse model reported complete S100A4+ cell depletion in the superficial layer of mandibular condyle48. This suggests that incomplete depletion of S100A4+ cells in nipple and mammary gland is due to tissue-specific dynamics, rather than lack of depletion efficiency, indicating a compensatory mechanism that can balance the cell loss.”

      Figure for Reviewers 5 (Figure S7 in the revised manuscript): S100A4+ cells are found in S100a4-Cre;DTA nipple and mammary tissues. (a) Immunofluorescent labeling for S100A4 and vimentin on FFPE sections of DTA and S100a4-Cre;DTA L1 nipples. (b) Immunofluorescent labeling for S100A4 and smooth muscle actin on FFPE sections of DTA and S100a4-Cre;DTA L1 mammary gland. Scale bar = 100 µm.

      Figure 3c: The histological defects more accurately reflect failure of secretory activation rather than "lactation failure" per se. The terminology should be refined to reflect this more precisely.

      Thank you for this comment. As explained in the response to your major comment 3, we believe our results show that the secretory activation is conserved in S100a4-Cre;DTA lactating mice. We understand that “lactation failure” might be misleading terminology, as the production of the milk is conserved as well. We therefore change the phrasing into “nursing defect” (line 51, 73, 83), as this could reflect the phenotype most precisely.

      **Referee cross-comenting**

      I agree with the Reviewer, the authors do not need to do knockout experiments in the revised manuscript. However, it would be great if they could address my comment in the discussion.

      Reviewer #3 (Significance (Required)):

      This is an important study for mammary developmental biology, addressing the relatively understudied mechanisms that govern nipple development at the stromal-epithelial interface, and the determinants of lactational performance. A major strength is the elegant integration of DTA-mediated cell ablation, advanced imaging, lineage tracing, and transcriptomics to uncover previously uncharacterised roles for S100A4-expressing stromal populations in shaping nipple morphology and function. The work lays a foundation for future studies into nipple biology and pathologies and mechanisms underlying successful lactation.

      Although the study is already mature, it could be further strengthened by incorporating more specific genetic models, such as inducible S100A4CreERT or S100A4 gene knockout/knockdown approaches.

      Thank you for appreciation of our work.

      4. Description of analyses that authors prefer not to carry out

      Reviewer #1

      Major Comment 1.

      It is rather difficult to conclude whether the observed nipple phenotype reflects an early embryonic/prepubertal defect in establishing the nipple stroma, is caused by a constitutive response to ongoing cell death, or a response to continuous DTA expression (or a combination of some of these). The data raise a couple of additional questions: Is there a nipple phenotype at 3 wk of age?...

      Unfortunately, we cannot provide data on 3 weeks old mice because we did not collect such samples before and we had to terminate our mouse colony due to an infection in the animal house (mouse line reanimation is possible because we had stored sperm of the mouse line but it would take a lot of time and resources). Nevertheless, we tried to address this comment by providing other relevant available data (see Figure for Reviewers 1).

      Reviewer #2

      Major Comment 3.

      In Fig S1c, d and lines 93-96, the authors investigated the estrus cycles to determine the potential cause of lactation failure. The data was presented as the number of mice in each stage. A more intuitive approach would be to follow the same mice for two to three cycles and observe the duration of each stage.

      We agree that the suggested approach would be more accurate in determining truly cycling females. Unfortunately, we cannot perform this experiment currently because we do not have these mice alive anymore. Nevertheless, because the S100a4-Cre;DTA females bore pups, they had cycled and were fertile.

      Reviewer #3

      Major comment 1.

      While the S100A4Cre::DTA model is powerful for evaluating the roles of S100A4 expressing cells, the authors should discuss the potential outcomes of using S100A4 knockout or knockdown approaches. If the authors have such data available, this could help distinguish phenotypes caused by loss of S100A4 function itself from those arising due to ablation of S100A4 expressing cell populations and would add mechanistic depth to the study.

      We thank the Reviewer for this insightful suggestion. We agree that genetic approaches targeting S100A4 function (e.g., knockout or knockdown) could, in principle, help disentangle cell-autonomous effects of S100A4 from those resulting from the loss of S100A4-expressing cell populations. However, we would like to clarify that the primary objective of our study is to investigate the functional contribution of S100A4⁺ stromal cells at the population level, rather than to dissect the molecular function of S100A4 protein per se. In this context, the S100A4-Cre;DTA model provides a well-established and appropriate strategy to ablate this cell population and assess its role in tissue development. Importantly, S100A4 is not only a functional protein but also a widely used marker of a heterogeneous stromal cell population. Genetic ablation of S100A4 itself would not eliminate these cells, and may result in relatively subtle or compensable phenotypes due to functional redundancy within the S100 protein family or context-dependent roles of S100A4. Therefore, such approaches would address a distinct biological question and may not directly recapitulate the phenotypes observed upon cell ablation.

      References

      Eisen, E. J., & Durrant, B. S. (1980). Genetic and Maternal Environmental Factors Influencing Litter Size and Reproductive Efficiency in Mice. Journal of Animal Science, 50(3), 428–441. https://doi.org/10.2527/jas1980.503428x

      Ren, Y. A., Monkkonen, T., Lewis, M. T., Bernard, D. J., Christian, H. C., Jorgez, C. J., Moore, J. A., Landua, J. D., Chin, H. M., Chen, W., Singh, S., Kim, I. S., Zhang, X. H. F., Xia, Y., Phillips, K. J., MacKay, H., Waterland, R. A., Cecilia Ljungberg, M., Saha, P. K., … Richards, J. A. S. (2019). S100a4-Cre–mediated deletion of Ptch1 causes hypogonadotropic hypogonadism: Role of pituitary hematopoietic cells in endocrine regulation. JCI Insight, 4(14). https://doi.org/10.1172/jci.insight.126325

      Tuwatnawanit, T., Wessman, W., Belisova, D., Sumbalova Koledova, Z., Tucker, A. S., & Anthwal, N. (2025). FSP1/S100A4-Expressing Stem/Progenitor Cells Are Essential for Temporomandibular Joint Growth and Homeostasis. Journal of Dental Research, 104(5), 551–560. https://doi.org/10.1177/00220345251313795

      Zhang, R., Gao, Y., Zhao, X., Gao, M., Wu, Y., Han, Y., Qiao, Y., Luo, Z., Yang, L., Chen, J., & Ge, G. (2018). FSP1-positive fibroblasts are adipogenic niche and regulate adipose homeostasis. PLoS Biology, 16(8). https://doi.org/10.1371/journal.pbio.2001493

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this pre-print, Belisova et al. investigate the under-explored mechanisms regulating nipple development and its essential role in offspring nourishment, focusing on the contribution of S100A4‑expressing cells in the mouse mammary gland. The authors use an elegant combination of Cre::DTA-mediated cell depletion, lineage tracing, imaging, RNA-seq, and functional assays to reveal roles for S100A4‑expressing fibroblasts and immune cells in nipple morphogenesis and lactation. The manuscript is generally well written, and the experimental design is strong, with appropriate controls supporting the overall conclusions. However, I have several comments and suggestions to improve this initial manuscript.

      Major comments:

      1) While the S100A4Cre::DTA model is powerful for evaluating the roles of S100A4 expressing cells, the authors should discuss the potential outcomes of using S100A4 knockout or knockdown approaches. If the authors have such data available, this could help distinguish phenotypes caused by loss of S100A4 function itself from those arising due to ablation of S100A4 expressing cell populations and would add mechanistic depth to the study.

      2) The differential systemic versus mammary-specific effects of DTA-mediated S100A4 cell ablation are intriguing. The authors should address why the mammary fat pad appears unaffected. Are S100A4 expressing cells present during embryonic mammary development, or are they mainly postnatal? Would an inducible S100A4CreERT model lead to similar phenotypes, or might the timing of depletion influence the outcome? Discussing these points would reinforce the conclusions regarding the contribution of S100A4-expressing cells to mammary and nipple development and could also clarify the transient nature of the ductal branching phenotype.

      3) Although the authors attribute lactation failure primarily to defects in nipple architecture, the RNA seq data reveal downregulation of key milk production genes and luminal differentiation keratins, strongly suggesting impaired secretory activation. The authors should more explicitly discuss the relative contributions of epithelial functional maturation defects versus nipple structural abnormalities to the lactation failure observed upon S100A4+ cell depletion.

      Minor comments:

      1. Figure 1: Including an immunohistochemistry or immunofluorescence control confirming depletion of S100A4 expressing cells would strengthen the conclusions.

      2. Figure 2c: The H&E images are not fully convincing. Immunofluorescence analysis of epithelial architecture would support the authors' interpretation and should be feasible if tissues are already available.

      3. Figure 3c: The histological defects more accurately reflect failure of secretory activation rather than "lactation failure" per se. The terminology should be refined to reflect this more precisely.

      4. Figure 4f: The proliferation data are compelling, but the authors could extend this by examining how cell differentiation and epithelial organisation are affected.

      5. Figure 5b: To more convincingly show that GFP+ cells contact endothelial cells, co-labelling with an endothelial marker such as CD31 would be helpful.

      6. Figure 5f-h: The structures referenced in the text (lines 159-163) should be clearly indicated on the immunofluorescence images.

      Referee cross-comenting

      I agree with the Reviewer, the authors do not need to do knockout experiments in the revised manuscript. However, it would be great if they could address my comment in the discussion.

      Significance

      This is an important study for mammary developmental biology, addressing the relatively understudied mechanisms that govern nipple development at the stromal-epithelial interface, and the determinants of lactational performance. A major strength is the elegant integration of DTA-mediated cell ablation, advanced imaging, lineage tracing, and transcriptomics to uncover previously uncharacterised roles for S100A4-expressing stromal populations in shaping nipple morphology and function. The work lays a foundation for future studies into nipple biology and pathologies and mechanisms underlying successful lactation.

      Although the study is already mature, it could be further strengthened by incorporating more specific genetic models, such as inducible S100A4CreERT or S100A4 gene knockout/knockdown approaches.

      I have expertise in mammary epithelial biology.

      I estimate that revisions would require 3-6 months if new experiments are performed, and 1-3 months if revisions focus on clarifying claims and strengthening the discussion.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Jaros Belisova et al. systematically investigated the composition and roles of S100A4+ cells during mammary gland development and identified a novel role for S100A4 for nipple development. Depletion of S100A4+ lineage using the S100a1-Cre;DTA model resulted in delayed pubertal mammary gland development but with normal morphology and milk production capacity during lactation. The authors further investigated the milk-ejection function of the alveoli using an ex vivo culture method combined with live imaging. This revealed that depletion of S100A4+ cells does not interfere with the normal function of alveoli. However, the abnormal development of the nipple, characterised by smaller size, shorter length, lacking protrusion, increased collagen composition and decreased cell proliferation at the onset of lactation, results in milk delivery failure which is responsible for the lethality of the pups. To further understand the consequences of S100A4+ cell depletion, the author utilised the S100a4-Cre;mTmG model to trace the cell types depleted in the DTA model across various developmental stages. Immunofluorescent staining revealed that S100A4 lineage cells comprised both fibroblasts and immune cells, consistent with previous studies. Interestingly, some S100A4 lineage (GFP+) retain the expression of S100A4. In addition, the RNAseq data comparing the nipple cells from S100a4-Cre;DTA and DTA lactation mice confirmed their observations in transcription level. Overall, the experiments are well designed and the key findings are valid, especially about the role of S100A4 during nipple development is novel and interesting.

      Major comments:

      1. My key concern is the discussion part. I think the authors need to re-organize/re-phrase the discussion part, it confused me a bit in terms of logic, phrases and interpretation of literatures. Here are few examples:

      a. The lines 195-199 contain lot of repeated information

      b. The authors mentioned the studies in ref 26,28 and 38 using "targeting S100A4+ cells through knockout experiment can result in sever phenotypes". This is very misleading. Those studies using the same (or similar if the origin is different) S100A4-Cre line as the current study but induced the activation of Wnt and sHH signalling pathways, respectively. The observed phenotypes are largely due to the pathway function, rather than the S100A4 gene or normal S100A4+ cell itself. This is significantly differed from the current study.

      c. In the lines 253-255, why the author believe complete S100A4+ depletion would leads to the fatal of mouse? Is there study suggest that? Or have authors checked the expression of S100A4 in the S100A4-Cre;DTA model to confirm the efficiency?

      d. The authors tried to attribute the minor phenotype to the incomplete depletion of S100A4+ cells. However, it is possible that if the S100A4+ cells only represented a minor population, their function may be compensated by other populations. This might be confirmed by quantification of S100A4+ cells or S100A4-Cre; GFP+ cells in fibroblast or CD45 populations from images showed in Figure 5. 2. In Fig. 1, the authors described the impaired nursing capacity of S100A4-Cre;DTA dam. However, it seems the little size is also smaller (Fig 1a). Do authors have any explanation or hypothesis? 3. In Fig S1c, d and lines 93-96, the authors investigated the estrus cycles to determine the potential cause of lactation failure. The data was presented as the number of mice in each stage. A more intuitive approach would be to follow the same mice for two to three cycles and observe the duration of each stage. 4. The images in Figure 5 and Figure S4 are difficult to confirm colocalization. A higher magnification image would be required for each panel. Furthermore, a precise quantification based on the current images would be more supportive of the conclusion regarding the discrepancy of the composition of S100A4 lineage between epidermis and mammary gland (lines 163-165). 5. Line 163, the author hypothesis the Langerhans cells due to morphology. Those cells should be able to be confirmed by a co-staining with F4/80 in addition to the current form of Fig 5h. 6. In lines 181-184, the authors states "the results showed that the tissue reacted to a foreign chemical or an endogenous compound....." , which results are referring here? I could not find any inflammation related GO terms in figure 6b. It would be more accurate to specify them in lines 179-181, which appears to be a technical statement rather than a result in current form. 7. The lines 182-184 was not clear. Does the author refer the "nipple tissue response" in general as malfunction of development or inflammation and tissue repair as mentioned in the previous sentence? If the later cases, the authors should consider the failure of lactation might mimic the involution, which may cause the apoptosis and inflammation as well. This might be independent of the DTA expression.

      Minor comments:

      1. The authors demonstrated in Figure S1 and lines 92-96 that no significant differences were observed in pituitary glands and ovaries in S100a4-Cre:DTA and DTA mice. Have the authors checked the S100A4 expression or lineage cells in these organs, or have been reported by others?
      2. The authors have performed live imaging to evaluate the contraction of alveoli. It would be better to include a video together with the snapshots showed in Figure S2.
      3. Since the study is mainly using S100a4, it would be better to avoid using FSP1 in the results, for example Fig 5h.
      4. What does L1 stand for? Lactation Day 1? It should be spelt out in the first instance.
      5. Line 150. Figure S4 should be Figure S4a.

      Referee cross-comenting

      I agree with the other reviewers, as well as the Consultation Comments. The manuscript would benefit greatly from a thoroughly optimised Discussion section to address issues raised by all reviewers.

      Significance

      • Overall, this study is well designed and the key findings are valid, especially the role of S100A4 during nipple development is novel and interesting.
      • One limitation of the study is that RNA-seq was performed using a mixture of all cell types present in the nipple. While this approach is reasonable-given that depletion of the S100A4+ lineage may exert both direct and indirect effects contributing to nipple dysfunction-it should be more clearly acknowledged and discussed in the manuscript. Additionally, this experimental design may limit the utility of the dataset for other researchers interested in nipple development and the specific functions of S100A4.

      My expertise:

      mammary gland development and breast cancer

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this study, Belisova et al. investigate the function of S100a4+ (a.k.a. Fsp1) cells in the mammary gland. S100a4 expressing cells were constitutively ablated using the DTA system by crossing S100a4-Cre mice with ROSA26-eGFP-DTA mice. Female mice exhibited a severe nursing defect, leading to whole-litter mortality within 1-2 days postpartum. However, no abnormalities were detected in the morphology of the mammary ductal tree, milk production, or alveolar contractility of S100a4-Cre;DTA mice. Instead, nipples were malformed, likely prevent normal suckling. Analysis of the lineage of S100a4 expressing cells in the mammary gland using the S100a4-Cre mouse in combination with a fluorescent Cre reporter identified S100a4+ cells as fibroblasts and immune cells in the nipple region, while only immune cells were labelled in the mammary gland stroma, findings that agree with previous studies.

      Major comments:

      1. It is rather difficult to conclude whether the observed nipple phenotype reflects an early embryonic/prepubertal defect in establishing the nipple stroma, is caused by a constitutive response to ongoing cell death, or a response to continuous DTA expression (or a combination of some of these). The data raise a couple of additional questions: Is there a nipple phenotype at 3 wk of age? It would not be totally unsurprising if ablation of a major fraction of dermal fibroblasts in the nipple area would lead to an early embryonic/prepubertal phenotype but there is no data on this. Hence, is there a "congenital" nipple deformity, as concluded by the authors (line 191)? Are there S100a4+ cells in the nipple area of pubertal S100a4-Cre/DTA mice? I.e. is there a continuous supply of new S100a4+ cells and thereby continuous cell death and DTA expression as one might expect based on the RNA-seq data?
      2. The subtitle on line 54 implies that that S100a4-Cre/DTA mice display a branching phenotype. However, it looks to me as if there is a pubertal outgrowth defect (as is also written in the body text, line 64) rather than a branching phenotype, potentially reflecting the much smaller size of S100a4-Cre/DTA mice (Fig. 2a). Unless there is a change in branch point frequency, I suggest rephrasing the title and discussion. Instead, I suggest the authors discuss the observed outgrowth delay considering the gross overall growth defect (Fig. 2a). If ductal outgrowth was normalized to the overall growth defect, would one still observe 'a delay in branching morphogenesis'?
      3. Fig. 4e shows Masson's Trichrome and Picrosirius Red staining and the authors report the findings as follows (lines 120-124): "collagen fibers were loosened in the DTA nipples and more densely packed in the S100a4-Cre;DTA nipples". Perhaps the authors could help non-specialists to observe the loosened fibers and if they wish to make quantitative statements ("more densely packed"), such statements should be backed-up by quantifications.
      4. I found the Discussion on the various mouse models somewhat problematic. Overall, the paper is written is a way that it often remains unclear whether it refers to studies addressing the role of S100a4 itself, studies addressing the function of S100a4+ cells via ablation approaches (S100a4-Cre or S100a4-CreERT2 crossed with floxed DTA), or those where S100a4-Cre has been used to delete gene X/Y/Z. These are all very different experimental approaches where one approach is not necessarily informative when trying to understand the results from another one. The authors should make these points clear and consider whether all their discussion points are relevant. The abstract states S100a4 (fibroblast-specific protein 1) is "expressed by mesenchymal cells and has been implicated in the development of eccrine glands, hair follicles, and mammary branching morphogenesis". However, the study on eccrine glands (ref. 19) shows that S100A4+ cells play a role in eccrine gland development but it does not address the role of S100a4 itself, while the study on hair follicles (ref.20) in turn reports the expression pattern of S100a4 in hair follicles but does not address its function, nor the role of S100a4+ cells. Finally, I failed to find references in the paper to studies addressing the role of S100a4, or S100a4+ cells in the mammary gland. Instead, the paper had references to studies where S100A4-Cre had been used to delete different genes and these mice had various mammary phenotypes - which, as indicated above, is a very different approach compared to deleting S100a4 or ablating S100a4+ cells.

      In Discussion (lines 197-200), the authors write: "We described significant delay in mammary branching morphogenesis in puberty, confirming an important role for S100A4+ cells in mammary development, as it was previously described (refs 37-39)." It should be noted that none of these studies addressed the role of S100A4+ cells:

      • Ref 37 used S100a4-Cre to delete sharpin
      • Ref 38 used the same Cre line to delete Ptch1, did not address the role of S100a4 or S100a4 expressing cells
      • Likewise ref 39 deleted another gene using S100a4-Cre

      Later on in Discussion, the authors compare the reported phenotype to previous studies (lines 248-255): "...targeting S100A4+ cells through knockout experiments can result in severe phenotypes, such as a reduction in adipose tissue (ref 26), skin phenotypes, a disrupted estrous cycle, reduced fertility (ref. 38), and complete infertility, hypogonadism and defects in pituitary endocrine function (ref. 28). Of these, Ref. 26 used the same approach as the current study (S100a4-Cre; DTA) (Fig. 7A in the paper) - these mice were significantly lean, with markedly reduced fat compared with the control mice - also the mice in the current study are very small, so perhaps they could also be described as 'lean'. Yet ref. 26 reports that female mice had comparable food uptake, respiratory exchange ratio and physical activity, and slightly increased energy expenditure

      Ref. 38 (as mentioned above) reports deletion of Ptch1 using S100a4-Cre lines and these mice "displayed a disrupted estrous cycle and dramatically reduced fertility over 6.5 weeks". However, this has nothing to do with the approaches where Fsp1/S100a4+ cells are depleted with DTA. Likewise, reference 28 analyzed the phenotype of S00a4-Cre;Ptch1fl/fl mice. Obviously, deleting Ptch1 using S100a4-Cre mice is quite a different approach than "targeting S100A4+ cells" through knockout experiments". Ptch1 deletion leads to a combination of gain-of-function (of Hedgehog activation) and loss-of-function (loss of Hh-independent functions of Ptch1) and hence comparisons with these phenotypes is rather challenging. I suggest the authors focus their phenotype comparisons to ref. 26 where S100a4/Fsp1+ cells were ablated with DTA, i.e. the same approach as in the current study. 5. I find the Discussion is somewhat off the topic by starting with WHO recommendations on breastfeeding and linking this to observed mouse phenotype. Overall, the discussion is rather long and from time-to-time more like a literature review. I would recommend keeping the Discussion more succinct and focused.

      Minor comments:

      Figure 5 would be more informative if it included more higher magnification images that would reveal the staining at the cellular level.

      Referee cross-comenting

      I agree with the comments of other reviewers. However, to me it seems that the analysis of S100a4 knockout mice would not be feasible within a reasonable timeframe and would represent a study of its own. My understanding was that the authors were not interested in S100a4 itself. Rather, S100a4-Cre was used as a tool to understand the importance of a certain (fibroblast) cell population for mammary gland morphogenesis.

      Significance

      General assessment:

      This study reveals the importance of the S100a4+ cell lineage for nipple formation while showing the same cells are dispensable for mammary gland morphogenesis. The main limitation is that it remains unclear whether the observed nipple phenotype is derived from an early embryonic/prepubertal defect in establishing the nipple stroma, is caused by a constitutive response to ongoing cell death, or a response to continuous DTA expression (or a combination of some of these). Hence its relevance as a model of human inverted nipple condition remains rather speculative.

      Advance:

      This study provides novel information on nipple morphogenesis, with potential (though with reservation) relevance to the congenital human inverted nipple condition affecting 3-5% of women.

      Audience:

      This work should appeal to mammary gland biologists interested in mammary gland development and nipple formation; those with interest on fibroblasts biology given that S100a4 was once thought to be a broad marker of fibroblasts, as well as those with interest in the inverted nipple condition.

      My expertise:

      Mammary gland morphogenesis, developmental biology, cell signaling

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their careful evaluation of our manuscript and their constructive comments. Overall, the reviewers recognize the technical value and potential impact of the pBID2 platform as a unified framework for generating transgenic tools across multiple binary expression systems in Drosophila.

      In response to the reviewer’s suggestions, we will strengthen the manuscript in two main directions. First, we will perform additional experiments to further support key claims regarding the QFG4 system, including (i) assessing temporal dynamics of transgene expression across multiple time points, (ii) extending validation to additional tissues, (iii) generating new driver lines. (iv) In addition we will confirm co-expression of pBID2-VGMN-GAL4 and pBID2-VGMN-QF2 in the same neurons. These experiments are currently underway and will directly address concerns regarding synchronicity and reporducibility. Second, we will revise the manuscript to improve clarity, accuracy, and context within the existing literature. This includes justifying claims where appropriate, refining terminology, expanding discussion of prior work, and improving data presentation.

      Reviewer # 1 __Major Comments __

      1 - Synchronous temporal control: multiple time points in Fig. 7

      Synchronous temporal control has not been rigorously demonstrated in Fig. 7. Only a single time point (7 days after temperature shift) was examined. The synchronicity of expression between the two systems remains unclear, and a temporal delay between them is possible. I suggest examining multiple time points to assess true synchronicity.

      We thank Reviewer 1 for the positive assessment of the study and for the constructive suggestions. We agree with this concern. Our current data examine only a single time point after temperature shift, which is insufficient to support claims of synchronous regulation between the GAL4 and QFG4 systems. We plan to repeat the Fig. 7 experiment and collect imaging data at multiple time points spanning approximately one week after temperature shift. The resulting data will be incorporated into a revised Fig. 7, and the text will be updated accordingly.

      2 - Coordinated control of Gal4 and QFG4 activity in additional tissues

      Expression in distinct cells or tissues is demonstrated only in VGMN and muscle. If the claim is intended to be broadly applicable, additional examples would strengthen it. Including other tissues or cell types would provide stronger support.

      We agree that demonstrating QFG4 in additional tissues is important to substantiate the generality of the approach. We will test the coordinated response of a nSyb-QFG4 driver line (in neurons) together with a Repo-GAL4 line (in glia), in the adult brain. This experiment directly addresses the reviewer's request for a second tissue context and provides a biologically meaningful example of intersectional control across two distinct cell types within the same organ.

      3 - New QFG4 driver lines: demonstrating pipeline scalability

      Demonstrating additional QFG4 lines targeting other tissues would highlight the versatility and scalability of the approach and would represent valuable community resources. Given that this is a methodological paper focused on pipeline development, such experiments would directly test the ease and efficiency of the system.

      We plan to use the pBID2 pipeline to generate at least one additional QFG4 driver line targeting another cell type, which we will image to confirm expression patterns. These lines will serve as further proof-of-concept for the scalability of the platform and will be deposited as community resources.

      __Minor Comments __

      All minor comments from Reviewer 1 will be addressed through text and figure revisions:

      • The typographical error "Kusubira" (line 111) will be corrected to "Kusabira".
      • Figure resolution will be improved: Fig. 3B and 3C panels will be enlarged to better demonstrate nuclear localisation of mKO2, and Fig. 7B will include higher magnification images to illustrate differential localisation between VGMN and muscle.

        Reviewer # 2 __Major Comments __

        1 - Novelty of this study

      There is very little novelty in this study. The gateway compatible vectors to generate LexA, Gal4 and QF drivers were generated in Janelia years ago and are currently in use. The only observable difference is the use of insulators in this manuscript.

      We thank the reviewer for their critical evaluation of our work. We respectfully disagree with the reviewer’s assessment that the study lacks novelty. While individual components such as Gateway-compatible vectors and binary expression systems have been previously developed, the pBID2 platform provides a unified, modular framework that minimizes transcriptional leakage (through the use of Gypsy insulators and a DSCP promoter) and achieves strong expression (through a p10 UTR terminator, multiple repeats of activator sequences, and a Syn21 element upstream of drivers), integrating the GAL4/UAS, LexA/LexOP, and QF/QUAS systems within a single architecture. This standardization enables the streamlined generation of complex transgenic combinations that would otherwise be fragmented. In addition, the QFG4 system introduces a GAL80-sensitive QF-based activator, enabling coordinated temporal regulation across independent binary systems. We believe this represents a conceptual advance beyond existing implementations. To better reflect this contribution, we will revise the Introduction and Discussion to more clearly position our work relative to existing tools and explicitly acknowledge prior developments, including Janelia-based constructs.

      2 - P2A versus T2A

      T2A is the 2A peptide that has been used in Drosophila research. P2A was shown to work worse than T2A in the Diao and White 2012 paper. This decreases the novelty of this finding.

      We will revise the relevant sections of the Results and Discussion to more accurately reflect the existing literature on 2A peptide performance in Drosophila, including the findings of Diao and White 2012 and their demonstration of T2A efficacy. We will clarify the rationale for our use of P2A in the pBID2 system and discuss this choice in proper context of both the Daniels et al. 2014 and Diao and White 2012 publications, as suggested also by Reviewer 3.

      3 - QFG4 co-regulation and leakiness

      The proof of principle experiment does not show how co-regulation of QF driver and Gal4 driver by Gal80 can be beneficial, and shows that the regulation of QFG4 is not as tight when used in conjunction with a Gal4 driver. The reason for the leakier QFG4 regulation is not clear and not explored.

      The reviewer raises important points regarding the functional advantages and potential limitations of QFG4, including its regulatory tightness and biological utility. To address this, we will (i) expand the Discussion to better articulate the contexts in which coordinated regulation of independent systems is advantageous, (ii) clarify that QFG4 is intended as a flexible tool whose performance may vary depending on experimental context and discuss the observed differences in repression efficiency between QFG4 and GAL4, (iii) moderate our claims where appropriate to reflect the current level of validation. In addition, the new experiments outlined above for Reviewer 1 (multiple time points and additional tissues) will provide further insight into the performance and applicability of the system.

      __Minor Comments __

      All minor comments from Reviewer 2 will be addressed through text revision:

      • The use of "permissive" and "restrictive" temperatures will be corrected throughout to align with conventional usage in the field (restrictive = 29°C, permissive = 18°C).
      • The discussion of the LexA-GAD strategy will be incorporated into the Results section where relevant, rather than appearing only in the Discussion.
      • The Diao and White 2012 reference will be appropriately cited alongside Daniels et al. 2014 in the P2A/T2A discussion.

      Reviewer # 3 Major Comments

      Reviewer 3 raised no major experimental concerns and found the data sufficient to support the main claims of the paper. All comments from Reviewer 3 will be addressed through text and figure revisions. Nevertheless, we are still planning to perform an additional experiment in response to the remarks about the VGMN dual labelling in the same cells.

      New experiment

      1 - Co-expression of VGMN-GAL4 and VGMN-QF2 in the same neurons

      It is not clear if pBID2-GAL4 and pBID2-QF2 constructs express in exactly the same neurons, e.g., with VGMN. Figure 4 shows independent labelling, but it is not clear if these were validated as the exact same expression pattern. Dual labelling experiments in the same animal would clarify this.

      We thank the reviewer for recognising the relevance of the topic and for this valuable suggestion. We plan to perform dual-labelling experiments using the VGMN enhancer to directly compare the expression patterns driven by pBID2-VGMN-GAL4 and pBID2-VGMN-QF2 within the same cells. Fig. 4 currently shows independent labelling in separate animals, which does not allow direct comparison at single-cell resolution. The dual-labelling data will allow us to confirm whether the two constructs drive expression in the same neurons and will directly support the claim that pBID2 produces equivalent and interchangeable driver lines across binary systems.

      Minor Comments

      All minor comments from Reviewer 3 will be addressed through text revision:

      • Figure 1 will be revised to add arrows indicating that Activator and Responder constructs are inserted at position 0 of pBID2.
      • The name of the MCS variant (pBID2-MCS) will be made explicit in the relevant results section (lines 86-87).
      • The contribution of Diao and White 2012 to the validation of T2A in Drosophila will be more clearly described in the Results section, and Ref #68 will be cited at line 120.
      • The Figure 3 legend labelling errors ("B)" covering panels B and C; "C)" covering panel D) will be corrected.
      • Lines 258-260 will be revised: the discussion of GAL80 binding to the GAL4 activation domain will be clarified to avoid implying a role for the middle domain without supporting experimental data. As correctly noted by Reviewer 3, demonstrating a role for the middle domain would require a QFG4 construct using only the GAL4 activation domain.
      • The typographical error "otor" in the Fig. 7 legend will be corrected to "motor".
      • A comparison of pBID2-UAS constructs with Janelia UAS constructs (e.g., pJFRC7-20XUAS-IVS-mCD8::GFP) will be added to the Discussion, including any direct comparisons we have performed.
      • Dual-labelling experiments to confirm co-expression of pBID2-GAL4 and pBID2-QF2 in the same neurons will be performed (see Major Comment 4 above).
      • The Acknowledgments will be corrected: "Christopher G. Potter" will be corrected to "Christopher J. Potter".
      • The Materials and Methods section will be corrected: "CsChrismson" will be corrected to "CsChrimson".
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In the manuscript by Ruchti and McCabe, the authors introduce and validate many new constructs for use in generating new transgenic reagents in Drosophila. The authors introduce a number of improvements, including the pBID2 plasmid for generating flies that utilize the binary expression systems GAL4/UAS, LexQF/LexOp, QF2/QUAS. The authors generated two new reporters that introduce a nucleus-membrane marker to simultaneously label both the nucleus of the cell and its processes. The authors also identify a new enhancer derived from the VGlut genomic area that drives specific expression in motor neurons (labeled as VGMN). The authors also introduce a new activator, QFG4, which utilizes the QF DNA binding domain and the Gal4 activation domain. They demonstrate that this new reagent induces robust expression and has the additional benefit of being GAL80 sensitive. Overall, this work represents new and useful additions to the Drosophila toolkit that have the potential to become widely adopted.

      Major Comments:

      I do not have any major concerns regarding the work presented. The authors demonstrate the practical use of their new reagents via a number of experiments with new transgenic flies. As such, the conclusion that these new reagents are an improvement over existing reagents is justified. Additional experiments are not necessary to support the major claims on this paper. The data and methods are presented in a way that allows reproduction as well as utilization of the newly introduced reagents. The figures are well presented and adequately demonstrate the function of the new reagents in transgenic Drosophila.

      Minor Comments:

      In Figure 1, it was not entirely clear that the Activator constructs and the Responder Constructs have been inserted at position 0 of the pBID2 construct. Perhaps adding an arrow onto the lines that point to 0 could make this point clearer.

      Line 86-87. The authors have a variation of pBID2 that uses a MCS. What is the name of these constructs? Please add this to this section so its obvious. I assume it is pBID2-MCS as reflected in Figure S2.

      Regarding T2A (lines ~106-146). T2A was first validated to be useful for transgene expression by Ref#68 (Diao and White 2012.). This paper is why many current Drosophila constructs use T2A. This should be better reflected in the results section when reporting on the use of T2A and P2A experiments. As written, it was not clear that T2A was previously validated as a useful method for expression in Drosophila. As one example that could be updated, Ref #68 should also be cited on line 120 "we used ribosomal skipping sequences (63-65)".

      Figure 3 legend. "B)" should be "B) and C)". "C)" should be "D)".

      Lines 258-260. GAL80 binds directly to the activation domain of GAL4 at its C-terminus (~aa 761- 880). The middle domain likely doesn't play a role in GAL80 binding and might just function for structural stability. To make this statement in the discussion, the authors would need to make a QFG4 that uses just the GAL4 activation domain without its middle domain, similar to what was used to make QF2.

      Figure 7 legend. 3rd to last line. "otor" should be "motor".

      To the discussion section, please comment on how pBID2-UAS constructs might compare to Janelia UAS constructs, eg., pJFRC7-20XUAS-IVS-mCD8::GFP. If the authors have made direct comparisons, it would be helpful to include their observations. The Janelia constructs have similar features, and it would be helpful to include the authors thoughs on why to choose pBID2-20xUAS vs pJFRC7-20xUAS (for example).

      To the results or discussion section, please comment if the authors have examined if pBID2-GAL4 and pBID2-QF2 constructs express in exactly the same neurons (eg., with VGMN). For example, by conducting dual labeling experiments in the same animal. Figure 4 shows independent labeling, but it is not clear if these were validated as the exact same expression pattern. As the authors correctly pointed out, the promoter can influence expression (hsp70 or DSCP), but so can sequences from the transcription factor (eg., GAL4 or QF). It is possible the gypsy insulators have addressed these issues, but if the authors have data demonstrating that the exact same expression patterns are induced by the different constructs, it would be helpful to include.

      Acknowledgments. Bibliography lists papers by a Christopher J. Potter, not a Christopher G. Potter.

      Materials and Methods, page 13, pBID Gateway responder vector series. It should be "CsChrimson" not "CsChrismson"

      Significance

      This work represents a significant technical advance to the Drosophila toolkit. It introduces and validates many new reagents (both activators and reporters) that will prove useful to the Drosophila community. These new reagents will enable both simple and complex experiments to be more efficiently performed in Drosophila, especially those interested in investigating complex tissues such as the brain.

      This work will be of primary interest to those developing new reagents for studying Drosophila biology, as well as those interested in genetic tool development. The reagents developed here could also be applied to other genetic systems, such as other insect models.

      This reviewer's expertise is in development genetic tools for use in Drosophila and other insects, and in applying these new genetic methods to the field of neuroscience.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript Ruchti and McCabe describe incremental technical developments to create Gateway compatible vectors with increased transgene expression or driver expression, a dual reporter that expresses membrane targeted fluorescent protein together with nuclear targeted fluorescent protein and a QF version that can be repressed by Gal80 (QFG4).

      Major issues:

      • There is very little novelty in this study. The gateway compatible vectors to generate LexA, Gal4 and QF drivers were generated in Janelia years ago and are currently in use. The only observable difference is the use of insulators in this manuscript. Leakiness of driver lines inserted in well characterized landing sites is not a great concern.
      • Except for the initial studies that showed P2A can work in Drosophila cells other Drosophila, T2A is the 2A peptide that has been used in Drosophila research. P2A was shown to work worse and T2A in the Diao and White 2012 paper. This decreases the novelty of this finding.
      • The proof of principle experiment in the paper do not show how having co-regulation of QF driver and Gal4 driver by using Gal80 or Gal80ts can be beneficial and if anything shows that the regulation of QFG4 is not as tight when used in conjunction with a Gal4 driver. The reason for the leakier QFG4 regulation compared to Gal4 regulation is also not clear and not explored.

      Minor issue:

      • The authors use the terms permissive temperature and restrictive temperature in a manner that is against the conventional use. These terms conventionally refer to functionality of Gal80ts in the given temperature, not the activity of Gal4 which is negatively correlated. Hence, in literature restrictive temperature typically refers to 29 C and permissive temperature is 18 C. This is confusing.
      • Although the references are cited, the information about some of the papers are not properly presented. For example, LexA-GAD strategy is only brought up in discussion but in results, the authors make the statement that no Gal80 regulation strategies exist for LexA. When discussing the use of P2A versus T2A the authors mainly refer to Daniels et al. 2014 publication whereas it was also done in Diao and White 2012. The authors do cite both of these papers but discuss the findings in an incomplete manner.

      Significance

      This manuscript has very limited novelty and is better suited for a more specialized journal such as G3.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      Ruchti et al. developed new vectors compatible with the GAL4/UAS, LexA/LexOP, and QF/QUAS systems to enable high throughput construct generation (pBID2). They validated the platform by generating responder constructs (UAS, LexOP, and QUAS lines) and VGMN (glutamatergic motor neuron) driver lines. In addition, they engineered a new hybrid binary system combining QF and GAL4, termed QFG4, and tested its capacity to modulate expression levels. This design permits regulation of QUAS by GAL80, as demonstrated by experiments examining expression of ontogenetic proteins and co expression of transgenes under VGMN and MNC control with GAL80ts. Overall, the experiments are well designed, carefully performed, and quantitatively analyzed. The vectors and fly lines generated will be valuable resources if deposited in Addgene and the Bloomington Drosophila Stock Center. However, several claims appear speculative or overstated, as outlined below.

      Major Comments

      • Are the key conclusions convincing?

      Most experiments are of high quality and generally convincing. However, two conclusions would benefit from further clarification: (i) simultaneous regulation by GAL4 and QFG4, and (ii) coordinated expression of two transgenes in distinct tissues. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The summary states that QFG4 "enables simultaneous coordinate regulation of UAS and QUAS transgenes by Gal80, allowing synchronous temporal control of independent transgene expression in distinct cells or tissues." I have several concerns regarding this claim: 1. Synchronous temporal control has not been rigorously demonstrated in Fig. 7. Only a single time point (7 days after temperature shift) was examined. The synchronicity of expression between the two systems remains unclear, and a temporal delay between them is possible. I suggest examining multiple time points to assess true synchronicity. 2. Expression in distinct cells or tissues is demonstrated only in VGMN and muscle. The title also refers to "discrete Drosophila tissues." If the claim is intended to be broadly applicable, additional examples would strengthen it. Including other tissues or cell types would provide stronger support. 3. Related to point (2), the pBID2 system is presented as a pipeline for generating new lines. Demonstrating additional QFG4 lines targeting other tissues would highlight the versatility and scalability of the approach. Such lines would also represent valuable community resources. - Would additional experiments be essential to support the claims of the paper? Additional experiments would be necessary unless the authors adopt more conservative language in their claims. Are the suggested experiments realistic in terms of time and resources? The proposed experiments appear realistic.

      For point (1), the authors could repeat the existing experiment and collect images at multiple time points over approximately one week, as the fly lines are already available. For points (2) and (3), generating two to three additional QFG4 lines and imaging their expression in distinct tissues would provide meaningful validation. Given that this is a methodological paper focused on pipeline development, such experiments would directly test the ease and efficiency of the system. Generating new lines may require approximately 2-3 months, followed by ~1 month of imaging and analysis, which is a reasonable investment. - Are the data and methods presented in a reproducible manner?

      Yes, the presentation is generally clear and detailed enough to ensure reproducibility. - Are the experiments adequately replicated and statistically analyzed?

      Overall, replication and statistical analysis appear appropriate. However, inclusion of additional time points for Fig. 7 would strengthen the conclusions.

      Minor Comments

      • Specific experimental issues that are easily addressable

      No major additional concerns beyond those noted above. - Are prior studies referenced appropriately?

      A minor typographical issue: Line 111 lists "Kusubira," which should be corrected to "Kusabira." - Are the text and figures clear and accurate?

      As noted above, certain phrases (e.g., "simultaneous coordinate regulation," "distinct cells or tissues," "discrete tissues") appear overstated relative to the current data. Clarifying or moderating this language would improve accuracy. Some figures require higher resolution presentation. In Fig. 3B and 3C, the images are too small to clearly demonstrate nuclear localization of mKO2; larger panels would help. In Fig. 7B, higher magnification images would better illustrate differential localization between VGMN and muscle. - Do you have suggestions to improve presentation?

      Overall, the data presentation is strong and well organized. Clarifying the scope of the claims and providing higher resolution images where noted would further improve the manuscript.

      Significance

      • Describe the nature and significance of the advance (e.g., conceptual, technical, clinical) for the field.

      Drosophila research relies heavily on binary expression systems for spatial and temporal control of gene function. The unified vector platform developed here, incorporating Gateway compatibility across GAL4/UAS, LexA/LexOP, and QF/QUAS systems, represents a meaningful technical advance. By streamlining construct generation across multiple systems within a single framework (pBID2), the authors lower the technical barrier for complex genetic manipulations. The previously developed pBID system for UAS/GAL4 has been widely adopted and highly cited, underscoring community demand for standardized and scalable tools. The current expansion to additional binary systems is therefore likely to have broad impact. Inclusion of a few additional validation experiments, as noted above, would further strengthen confidence in the robustness and versatility of the platform. - Place the work in the context of the existing literature (provide references, where appropriate).

      This work builds directly upon foundational binary expression systems widely used in the Drosophila field, including GAL4/UAS (Brand and Perrimon, 1993), LexA/LexOP (Lai and Lee, 2006), and QF/QUAS (Potter et al., 2010). By providing a unified and modular cloning strategy compatible with these systems, the authors enhance the practicality and interoperability of established genetic tools rather than introducing an entirely new paradigm. This technical consolidation is valuable for laboratories that routinely combine multiple binary systems for intersectional or parallel manipulations. - State what audience might be interested in and influenced by the reported findings.

      The primary audience will be researchers working with Drosophila genetics, particularly those employing complex intersectional strategies, circuit mapping, developmental biology, and functional manipulation of defined cell populations. Laboratories developing new driver or responder lines will especially benefit from the streamlined cloning pipeline. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Field of expertise: Drosophila genetics, neurobiology, binary expression systems, and circuit analysis. I have sufficient expertise to evaluate the genetic and technical aspects of the manuscript.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The study 'Excess Met1-ubiquitination leads to solid aggregate formation' by Kaypee et. al suggests a previously unrecognised role for the E3 ligase HOIL-1 in clearing protein aggregates via autophagy (e.g. aggrephagy). In their model, toxic protein aggregates in cells are modified with ubiquitin chains, including M1-linked Ub-chains catalysed by LUBAC (of which HOIL-1 is a component). The HOIL-1 ubiquitin signal is posited to induce trafficking of aggregates to lysosomes for subsequent clearance. However, when HOIL-1 is inactive (catalytic C460A mutation), the pathway is interrupted. As a result, protein aggregates fail to clear, they increase in size and shift their biophysical properties from liquid-like to more rigid, insoluble aggregates. The authors explain their observations by an increasing amount of M1-linked chains on protein aggregates, which occur as a result of 'unrestrained HOIP activity' due to HOIL-1 inactivity (based on previous work). Increasing amounts of M1-chains are posited to promote aggregate formation, aggregate growth, and prevent clearance.

      The major claims made in this manuscript are the following:

      1. Following the induction of protein aggregate formation (e.g. alpha-synuclein, tau, beta-amyloid, p62 bodies), cells that express catalytically inactive HOIL-1 fail to clear protein aggregates and end up with more, larger and more rigid protein aggregates compared to cells that express WT HOIL-1.
      2. The observations made in 1. are due to disruptions in late-stage autophagic flux. While aggregates in cells that express WT HOIL-1 co-localise with autophagy and lysosomal markers, aggregates in cells that express mutant HOIL-1 show autophagic, but not lysosomal markers.
      3. An increase in M1-chains (either due to HOIL-1 inactivity or due to OTULIN knockout) is believed to be the cause for claims 1. and 2.

      Main methodologies used:

      The authors use two cellular systems. The first one is SH-SY5Y cells in which either WT or mutant HOIL-1 are transiently overexpressed (via the pcDNA3.1 plasmid), and physiologically important aggregates (Tau, Abeta, asyn) are induced. The second cellular system is MEF cells in which either WT or mutant HOIL-1 are endogenously expressed; in these cells aggregates are formed crudely through disruption of ribosomal translation. It is questionable if both systems can be compared. Aggregate formation is mainly monitored and quantified via fluorescent microscopy in both fixed and live cells, or via sucrose gradient fractionation to separate soluble and insoluble (=aggregate) fractions. The rigidity of protein aggregates is analysed in cells via FRAP, size and circularity measurements and 1,6-HD treatment, or in vitro after aggregate formation assays via size and circularity measurements. The observations are on the whole interesting, though the authors fail to discuss their data in light of previously published work. For example, HOIL-1 KO and KI animals were shown to feature polyglucososan bodies in brain, which is not mentioned. Also, McCrory et al on HOIL-1 chain types is not cited but seems relevant (Figure S4). Yet, the manuscript reports a number of interesting findings, more or less coherently, most useful for scientists embedded in current ubiquitin, autophagy, and LLPS fields. These reviewers believe that this manuscript will make a lot of sense in due course, and be well cited for a first description of the role of HOIL-1 in cellular quality control processes. A number of improvements seem required to consolidate the findings, and improve readability and impact.

      Major:

      1. Figure 1A-C: The authors transiently overexpress either WT HOIL-1L or catalytically inactive (C460A) HOIL-1 in SH-SY5Y cells, then induce and compare the formation of protein aggregates (alpha-synuclein, tau, amyloid-beta) in those cells over 72 h. More cells with aggregates were found in cells that overexpressed mutant HOIL-1L. While these findings are interesting, the cellular system used is artificial due to the transient overexpression of HOIL-1 (in presence of endogenous HOIL-1). Crucial controls are missing:

      a. Adding a condition in which no protein is overexpressed, for example via an empty pcDNA3.1 or GFP only vector. This would help ruling out secondary effects due to the transient overexpression. It would also allow to monitor whether the same amount of aggregates form in the empty ctr compared to when WT HOIL-1 is overexpressed.

      b. Figure legends and raw data points (?) in graphs do not match. The graphs show dubious statistics from 2-3 grey dots, while the figure legend refers to n=100 cells etc. This needs to be fixed.

      c. Showing Western Blots of HOIL-1, to better understand the levels of endogenous HOIL-1 vs overexpressed HOIL-1 in these cells, and to compare overexpression levels between WT and mutant HOIL-1.

      d. The study would also improve by western blotting and IF staining for other LUBAC components such as HOIP and SHARPIN. Do alpha-synuclein aggregates in both WT and mutant conditions co-localise with the other LUBAC components, and are there any differences between WT and mutants. This would further help strengthening the claims made in Figure S1A: '...suggesting that LUBAC is recruited to or retained within α-Synuclein aggregates.' And in the discussion: 'we found that LUBAC components were sequestered in aggregates, as evidenced by microscopy and gradient fractionation of soluble and insoluble proteins, confirming the direct involvement of LUBAC in aggregate processing.' 2. Figure 2A-F: The authors change to a genetic-derived system (comparing endogenously expressed HOIL-1 WT with mutant HOIL-1 based on MEF cells from their mouse models). However, they use puromycin to produce aggregates from random protein homeostasis defects, which yes leads to aggregates, but is not as nice as the induced generation of neiurodegeneration-relevant aggregates. It was observed that after 2 h of puromycin treatment, cells accumulate p62-positive protein aggregates, and in during recovery (2 h washout), the aggregates in the HOIL1 mutant cells outgrow the aggregates in the WT HOIL1.

      a. However, the authors claim that: 'While Hoil-1+/+ MEFs efficiently cleared puromycin-induced p62 bodies,...', which is not supported by the data shown here. When comparing WT in panel C with WT in panel E, it becomes evident that the average number of p62 puncta before and after recovery is the same (around 5 puncta/cell in both pre and post washout conditions). A similar observation can be made for the mutant (around 12 puncta/cell in both pre and post washout conditions). Can the authors please amend their claims, or comment and perform a direct statistical comparison between the pre and post recovery conditions to test for clearance of p62 puncta in the WT after puromycin washout.

      b. The authors state that: 'These findings indicate that although HOIL-1 catalytic activity is dispensable for the initial formation of puromycin-induced aggregates, it is essential for their subsequent clearance.'

      As long as clearance of p62 bodies in the WT is not clearly shown, the second part of this sentence should be amended/removed.

      c. The experiments shown would improve by adding a t = 0 condition. How many p62 granules are present before puromycin treatment? Is there already a basal difference between WT and mutant HOIL-1L cells? 3. Figure S1A: The authors claim that other LUBAC components co-localise to protein aggregates, based on sucrose gradient fractionation and the presence of the respective proteins in the insoluble fractions. Could the authors perform IF and stain for other LUBAC components (SHARPIN and/or HOIP) in their MEF cell system to directly validate this claim? 4. Figure 3G-H: The authors created a GFP-mCherry-p62 reporter system in both their WT and mutant HOIL-1 MEF cells and performed live cell imaging following puromycin treatment, which allows monitoring of both aggregate formation and loss of GFP signal due to the acidic lysosomal localisation. Excitingly, the ratio of GFP/mCherry in the later timepoints is reduced in the WT compared to mutant HOIL-1, indicating that HOIL-1 activity is required to traffic p62 bodies to lysosomes.

      a. In panel G, a surprisingly large amount of p62 granules are present at t0, which (according to the relevant method section) is the time of puromycin treatment. This observation can be made for both WT and mutant cells. After 80 min of puromycin treatment in the WT, the majority of these puncta are cleared. Can the authors please comment on this high amount of p62 granules at t0 (before the effects of puromycin? And also on the observation that after 80 min there are now less granules than before puromycin? In case that t0 indicates the time of puromycin washout rather than puromycin addition, could this please be clarified in the methods or figure legend?

      b. Panel H would improve by adding the quantifications for t=0 (or ideally for all the time points).

      c. Fig S3C-D: Same comments as before but for GFP-mCherry-LC3. 5. Fig. S4B and Fig 4A-B: The authors state that circular aggregates are more soluble and have more LLP characteristics, whereas non-circular aggregates are less soluble and have more aggregate-like characteristics. However, the aggregates shown in Fig. S4B are un-circular but easily dissolve in response to 1,6-HD treatment, which seems contradictory. On the other hand, the aggregates shown in Fig 4B in HOIL-1 mutant cells appear much rounder than the ones in S4B, but do not dissolve in response to 1,6-HD treatment. Can the authors please comment on these discrepancies? 6. Fig. 4E-G. Here the authors suddenly switch to an in vitro aggregate-formation assay using mCherry-p62. In-vitro M1-chain reactions with either WT LUBAC or LUBAC with mutant HOIL-1L, together with the respective M1-chain reaction product, are added. This is not clear from the figure, and a schematic, as well as a gel (Coomassie) should be included to show component purity and indicate the biochemical in vitro nature of the experiment. It is good to have this breadth of methods, but does not help in the presentation if all figures look alike.

      a. The key difference to the cellular situation is p62 aggregates are not directly ubiquitinated here, and instead ubiquitin chains are (non-covalently) added to samples. Can the authors please make this important difference clearer in their text? Why not directly ubiquitinate mCherry-p62 via LUBAC (WT vs mutant HOIL-1) and then perform an aggregation assay on the reaction product?

      b. Can the authors please clarify whether the reaction was inactivated prior to addition to the aggregate-formation assay? If not, the enzymes might still be active at the point of aggregate formation, and the observed effects might be influenced by enzymatic activities and not only the presence of different M1-chain architectures. 7. Fig. 5B-C: The M1-specific DUB OTULIN is knocked down (again, cells) to increase the overall amount of M1-linked Ub-chains present in cells. P62 aggregate formation is induced and the authors claim that the increase in M1-chains influences aggregate size. This claim would be strengthened if it was directly shown that M1-chains form on p62 aggregates in this assay, for example via IF using an M1-antibody (and potentially a total ubiquitin antibody). This would also enable to directly compare the abundance of M1-chains between conditions (ctr vs Otulin, WT vs HOIL-1L mutant).

      Minor:

      1. Figure 1 D: The authors state that 'Notably, HOIL-1 C460A was detected within these structures, as demonstrated by its colocalization with tau aggregates' and show a co-localisation comparison between WT and mutant HOIL-1L. This sentence implies that WT HOIL-1 was not detected in aggregates, however the chosen image of the WT cells does not show any obvious tau aggregate, even though aggregates were induced in this condition according to 1A-C. The better comparison would be to pick an image that includes a tau aggregate. Moreover, this experiment would benefit from quantification, calculating the percentage of total aggregates that co-localise with WT HOIL-1L vs with mutant HOIL-1L.
      2. Figure S1A: The authors state that 'Cells expressing HOIL-1 C460A displayed a pronounced accumulation of high-molecular-weight α-Synuclein species in the insoluble pellet fraction.' While the difference seen by Western Blot is apparent and seems to match Fig 1 A-C, it is not very strong. Moreover, the relevant comparison (WT vs mutant) is made between two different blots/membranes, and it is difficult to assess equal input solely based on the TCL lane. This experiment could be improved by normalising samples (for example via BCA) and by loading and imaging the two conditions (or at least the TCL and pellet samples) on the same membrane. The authors also state that: 'LUBAC subunits, HOIP, HOIL-1, and SHARPIN, were also enriched in this fraction, suggesting that LUBAC is recruited to or retained within α-Synuclein aggregates.' Both Sharpin and HOIL-1 seem to be present in similar levels in the pellet fractions of WT and mutant HOIL-1. Overall HOIP levels seem to be significantly increased in the mutant over the WT (see TCL lane), and to a similar level in the pellet fraction. It would be great if the authors could include these observations in their interpretation.
      3. Figure 2G: Similar to before, this experiment would improve if the authors could find a way to normalise samples between conditions prior to sucrose gradient fractionation or have the most relevant samples on the same blot. It is challenging to properly interpret the results while the bands in the total cell lysate (TCL) lane do not have similar intensities between samples. A blot in which only the TCL and the pellet samples of all conditions were loaded onto the same gel would solve this, allowing for a better comparison between conditions. Based on what is shown in panel G, the authors should amend their claim: 'Consistent with microscopic observations, denser fractions from Hoil-1C458A/C458A MEFs contained increased signals of p62 specifically during the recovery phase (Figure 2G).' It is not apparent that more p62 is present in the insoluble fractions of mutant HOIL-1 cells after puromycin treatment. The band intensities look very similar (This is different for the recovery condition, which shows a strong difference, as stated).
      4. Figure S2: Similar to before, the authors induce protein aggregate formation and compare cells endogenously expressing WT vs mutant HOIL-1L. The size of aggregates increases in mutant cells under proteotoxic stress. What happens to the number of aggregates per cell in these conditions? Does it also increase, or is it just the size?
      5. Fig. S4A: Here the authors analyse the circularity of p62 aggregates in HOIL-1L mutant cells after recovery from puromycin treatment. This experiment would improve if the same analysis could be performed for the WT cells and for the pre-recovery condition (under the condition that large enough granules are present), allowing to make a comparison between WT and mutant, as well as between pre- and post-recovery.
      6. Fig. 4H-J: The authors use a p62 mutant that is known for its enhanced ubiquitin affinity, repeat Fig. 4E-G and state: 'These aberrant condensates were similar to those observed in a reaction using wild-type p62.' Can the authors please comment on what they conclude from this similarity and why this experiment was performed? A quantitative comparison (condensate size and circularity) between WT p62 and mutant p62 may further be useful here.
      7. Fig. 5A: Here the authors pulldown M1-linked Ub-chains in WT vs mutant HOIL-1 cells, with or without puromycin-induced aggregate formation. More M1-chains are observed in mutant HOIL-1 cells under puromycin treatment, but the difference is very subtle. The conclusions drawn from this experiment could be strengthened by including alternative methods, for example (if available) Ub-AQUA to measure the abundance of M1-chains, or using the M1-antibody for IF analysis.
      8. Fig. S5B: The described differences between puromycin treatment and untreated conditions are extremely subtle on the anti-Ub blot, and absent in the anti-Met1 blot. I recommend that the authors remove this sentence, based on the shown data: 'we observed a modest increase in the signal of Met1-linked ubiquitin chains after puromycin treatment'.
      9. Fig. 5F-G: The authors went back to their more-artificial system from Fig 1, in which HOIL-1L was transiently overexpressed (WT or inactive mutant) in SH-SY5Y cells, alongside alpha-synuclein aggregate formation.

      a. The claims made from this experiment would be stronger in the other cell system. Could OTULIN be transiently overexpressed in the MEF cells, to monitor the effect of aggregate formation and clearance. Again, staining aggregates with the M1-antibody would improve this experiment.

      b. The authors claim that HOIL-1 activity fine-tunes the function of HOIP within LUBAC (from discussion: 'This regulatory mechanism ensures that in the presence of functional HOIL-1, the overall quantity and potentially the architecture of Met1-linked ubiquitin chains are tightly controlled'). What is the quantity and architecture of M1-chains catalysed by LUBAC when HOIL-1 is very highly abundant, as it would be the case in this cellular overexpression system? 10. Sentence structure: 'A comprehensive understanding of how Met1-linked ubiquitination, particularly through intricate regulation by Linear Ubiquitin Chain Assembly Complex (LUBAC) components, such as HOIL-1, influences aggregate dynamics and clearance; therefore, it is crucial to develop targeted therapeutic strategies against neurodegenerative proteinopathies'.

      Significance

      The conclusions drawn from this study are very intriguing and give LUBAC (and HOIL-1) a so far unrecognised role in the clearance of protein aggregates, which are a hallmark of several neurodegenerative diseases for which there are currently no cures. Some of the findings described in this manuscript have the potential to be of very high impact and interest to a broader community, in particular researchers interested in protein homeostasis, autophagy, ubiquitin biology and neurodegeneration. In fact, those findings might even expand to autophagic pathways that target other cargo than protein aggregates. Both the novelty aspect and the potential for translational/therapeutic applications comprise the major strength of this manuscript. However, multiple of the presented experiments are currently lacking crucial controls, show weak effect sizes or were performed in artificial settings that likely do not represent relevant in vivo conditions, overall weakening or not fully supporting the claims made. Consequently, further experiments, data re-analyses and validations were recommended to fully support all the claims made here.

      This review was written from the perspective of a researcher in the ubiquitin field.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors show that the branching activity of the E3 ligase HOIL-1, a component of the LUBAC complex, contributes to the autophagic clearance of p62 bodies and protein aggregates. This activity is attributed to enhanced linear, unbranched ubiquitin chain formation by the second E3 ligase of LUBAC, called HOIP. The model systems employed are cell lines including MEFs expressing a catalytically dead version of HOIL-1. In addition, the authors perform in vitro reconstitution experiments with purified ubiquitin chains, the LUBAC complex and p62. The main message is that solid p62 bodies are poor substrates for autophagy and that linear, non-branched ubiquitin chains promote solidification. The mechanism remains unclear and some of the effects sizes are rather modest.

      Major comments:

      The key observations mentioned above are convincingly shown. Since the authors don't claim any detailed molecular mechanisms, the number of conclusions in this study are limited.

      Overall, the authors are quite careful regarding their conclusion, and therefore the ones that are made in this manuscript are generally well supported. The data regarding the clearance of the p62 bodies presented in Figure 3 should be backed up with additional data. The authors could add a macroautophagy inhibitors such as VPS34 IN1 and/or perform the clearance experiments in a ATG KO/KD cell line to corroborate the contribution of macroautophagy to the clearance. In addition, a proteasome inhibitor should be used for comparison.

      The expertise and resources for the experiments mentioned above are expected to be well within the authors' capacity and should be doable within a few weeks.

      Some of the effects sizes (e.g. Fig. 5 and S5) are very small and it is possible that some of them are below statistical significance if the number of replicates are increased.

      Minor comments:

      Figure 1D should be quantified, for example using PCC, Pearson correlation coefficient. Figure S1 should be quantified. Figure S3: It should be explained how the region for the profiles are shown were selected.

      It is suggested to include a scheme of the LUBAC complex and its E3 ligase activities in Figure 1A. This will make it more accessible for readers, who are not so familiar with this complex, in particular as HOIP and HOIL can be easily confused. The authors may also want to clarify this in the abstract.

      Significance

      As mentioned in the summary. The authors report the observation that an excess of linear ubiquitin chains produced by HOIP in the absence of HOIL-1 activity results in the solidification of p62 bodies and reduced clearance by autophagy. This observation is novel and will be interesting for the proteostasis field.

      This reviewer is expert in autophagy and protein degradation, but less so in the LUBAC complex.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Background

      GWAS analyses carried out several years ago identified over 40 genetic loci associated with increased risk of developing Alzheimers and other types of dementia. They included loci encoding the E3 ubiquitin ligase HOIL-1 and the protein SHARPIN, two of the three components of the Linear Ubiquitin Assembly Complex (LUBAC) (Bellenguez et al, 2022, cited in the paper). The third component of LUBAC, HOIP, is the only E3 ubiquitin ligase known to catalyse the formation of Met1-linked ubiquitin (also known as linear ubiquitin). HOIL-1 is one of the few E3 ubiquitin ligases that attaches ubiquitin to serine and threonine residues in proteins forming ester bonds (Kelsall et al. 2019) and has been reported to restrict the HOIP-catalysed formation of Met1-linked ubiquitin (Kelsall et al. 2019; Fuseya et al. 2020; Rodriguez Carvajal et al. 2021).

      Summary of Paper's findings:

      In this study the authors report that HOIL-1 catalytic activity prevents neurodegenerative protein aggregation (synuclein, tau, A) in the human SH-SY5Y neuroblastoma cell line or in mouse embryonic fibroblasts (MEFs) expressing a catalytically inactive mutant of HOIL-1. They argue that this is achieved by maintaining the dynamic, liquid-like properties of protein condensates through regulation of Met1-linked ubiquitin chain levels, thereby facilitating efficient clearance via the aggrephagy pathway. They report that loss of HOIL-1 activity leads to excess Met1-ubiquitylation that drives the transition to rigid, solid-like aggregates resistant to autophagic degradation. In support of this conclusion, they also report that the siRNA knock-down of Otulin, a deubiquitylase that hydrolyses Met1-linked ubiquitin specifically, produces the same effect . The reframing of HOIL-1 as a key factor for fine-tuning ubiquitylation to maintain cellular protein homeostasis is an interesting development and the paper is generally well-written, focused and concise. Further work is required however, to fully convince these reviewers that the effects observed are entirely attributable to excess Met1-linked ubiquitylation, as claimed.

      Major comments:

      1. The causal link between elevated Met1-linked chains and solid-like aggregates in cells is the central claim of the paper. Throughout the study the authors use inactive HOIL-1 to enhance aggregate formation, which they attribute to increased Met1-linked ubiquitylation, something observed by themselves and others previously (Kelsall et al. 2019; Fuseya et al. 2020; Rodriguez Carvajal et al. 2021). However, the immunoblot for Met1-linked ubiquitin (Fig 5A) is not very convincing. In addition, the authors have not excluded the possibility that the loss of HOIL-1 enzyme activity has other effects on ubiquitylation, such as a change in the architecture of the ubiquitin chains caused by the absence of HOIL-1 catalysed formation of oxyester linkages. Many/most ubiquitin chains formed in cells contain more than one ubiquitin linkage type. It is therefore important for the authors to perform immunoblots for other ubiquitin linkage types, such as Lys63-linked ubiquitin, and to include these results in Fig 5.
      2. The reviewers also think that the authors' claims that the transition of condensate property is linked to elevated Met1-linked ubiquitin chains would be strengthened by performing the biophysical assays (FRAP and 1,6-hexanediol resistance) after Otulin knockdown/knockout (and ideally also with Otulin rescue). This will provide direct biophysical evidence linking Met-1 linked chain elevation to condensate liquidity and 1,6-HD sensitivity.
      3. The authors have not shown any evidence that Met1-linked chains are more enriched at the sites of protein aggregation. Would the authors be able to demonstrate direct spatial colocalization of Met1-Ub with the analysed aggregates?
      4. Do the authors know if the effects that they are seeing are general effects on autophagy? For example, is starvation-induced autophagy similarly impaired in the cells studied? A simple flux-style experiment looking at LC3-II levels and p62 with starvations vs puromycin (-/+ bafilomycin) would be informative here.

      Minor comments:

      1. The authors show that loss of HOIL-1 catalytic activity causes p62 bodies to transition from dynamic liquid-like states to rigid solid-like states and claim this as a more general effect on protein aggregates. But the study does not directly demonstrate a liquid-to-solid transition for the disease-relevant α-synuclein, tau, or Aβ aggregates, limiting the generalisation of the claim beyond p62 bodies. Perhaps the authors should modify the text to better reflect this (or, even better, consider treating α-synuclein/tau/Aβ aggregates with 1,6-hexanediol to measure the response). [optional]
      2. Given that the blots presented in Fig S1A appear to come from different membranes, and high-molecular-weight species of α-synuclein seem to exist in the insoluble pellet fraction of both WT and C460A expressing cells, the reviewers would caution against concluding anything about differences, which can only be assessed if the samples are run side-by-side on the same gel.
      3. The Methods section says that two different total ubiquitin antibodies were purchased, but which one was used in Figure 5 and other figures are not stated. Please clarify.
      4. On page 10 ABIN1 is mentioned but it is not mentioned that it is the protein product of the TNIP1 gene that is mentioned in the Introduction. This will confuse to many readers.
      5. 1st paragraph of Discussion line 5 from bottom:- change "oof" to "of".

      Significance

      Prior research has established that the components of LUBAC are recruited to, and are components of, protein aggregates. A link between LUBAC and selective autophagy has also been established previously. The significance of this paper is that it identifies the catalytic function of HOIL-1 as a brake on the activity of LUBAC in proteostasis. The reviewer and co-reviewer are not experts in autophagy or aggregate formation in dementia but, if those reviewers who are find the data presented in these areas to be convincing, then this paper may be the first to suggest a molecular mechanism by which polymorphism/mutation of HOIL-1 leads to increased formation of the aggregates observed in Alzheimer's and other dementias. The results presented in the paper also suggested that initial autophagosome recruitment to aggregates is intact but subsequent late-stage autophagy is impaired. Hence, the study begins to identify the specific step that fails. However, as the authors themselves acknowledge, validation of these potentially exciting findings using in vivo models of neurodegeneration should be the aim of future studies. The paper combines the molecular dissection of ubiquitin and autophagy pathways to understand the causes of neurodegenerative disease. The paper will therefore be of interest to a broad audience, encompassing both the basic research and clinical research communities.

      Reviewers field of expertise: Biochemists and cell biologists with an interest in ubiquitin and cell signalling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility, and clarity (Required)):

      Summary: In this manuscript, the authors examine how peripherin-2 (PRPH2) contributes to the localization of CNGβ1 within rod outer segment structures. PRPH2 and its homolog ROM1 are structural components of rod discs and are required for disc morphogenesis. In the absence of PRPH2, rod outer segments do not form, and various outer segment materials accumulate and are released as cilia-derived ectosomes. PRPH2 is thought to be transported through an unconventional secretory pathway, whereas cGMP-gated channels follow a conventional trafficking route. Although these components reach the outer segment through distinct pathways, PRPH2 is necessary for the proper delivery of CNGB1, a subunit of the cGMP-gated channel, to its correct destination. It was previously reported that a small fraction of PRPH2 reaches the outer segments through the conventional pathway when it forms a complex with Rom1 in mouse photoreceptors. Using Rom1 KO mice, the authors show that this conventionally trafficked PRPH2 fraction is not required for CNGB1 transport to the outer segment. Using various chimeric constructs, the authors verified that tetraspanin core of PRPH2, delivered to the OS, is sufficient to promote OS localization of CNGB1. Ct and Nt cytoplasmic regions of PRPH2 are dispensable for the role. Overall, the majority of the experiments are well-executed with statistical rigor, written in a way that others can reproduce, and support the major conclusion indicated in the title, "PRPH2 is essential for OS localization of CNGB1".

      Major comments: I believe that the majority of the conclusions are well-supported in this manuscript. Below, I am listing the major points that may need additional experiments or clarifications: 1) CNGA1 subunit is transported to and enriched within ciliary exosomes or the outer segment in PRPH2 deficient mice (Figure 1). The reduced levels of CNGA1 and CNGB1 in rds-/- mice suggest limited stability of these proteins. Their diminished abundance is also influenced by decreased mRNA expression of the corresponding genes. These findings imply that CNGB1 may not be essential for outer segment delivery of cGMP-gated channels if CNGA1 alone contains adequate targeting information. Related to these points, it is unclear whether CNGB1 exhibits a trafficking defect or encounters other problems before leaving the endoplasmic reticulum. Such problems may involve deficiencies in folding, holo-channel assembly, or related quality control processes.

      RESPONSE: We agree with this reviewer and have added additional data and interpretation to address this point. Our new data finds that in fact a low level of CNGB1 can reach ectosomes in rds-/- rods, which makes sense since we and others had observed CNGA1 was present and we know that channel assembly occurs in the ER. This suggests that the CNG channel can properly fold and assemble. Furthermore, overexpressing CNGB1 did not restore ciliary localization in Rds-/-, leading to our interpretation that in the absence of an outer segment membrane compartment, there is no place to deliver the CNG channel and it is subsequently degraded. Apart from perihperin’s binding partner, ROM1, this is unique to the CNG channel. CNG channel subunits are still significantly lower at P21 than other outer segment membrane proteins, such as ABCA4 (shown here), rhodopsin, and PCDH21(shown elsewhere).

      2) CNGB1 overexpression in rds-/- mice does not result in outer segment localization of CNGB1 channels (Figure 2A). These findings do not clarify whether CNGB1 successfully transits through the Golgi apparatus or associates properly with CNGA1 subunits. Elevating expression levels alone would not compensate for problems in folding or assembly.

      RESPONSE: We recognize that our previous submission lacked clarity on this point. Therefore, we have restructured the order of figures and provided additional controls to improve our manuscript. First, the fact that CNG channel is present at P21 and even increases over time suggests that in rds-/- rods channel processing (folding and assembly) is unaffected. Second, we recognize that channel stoichiometry is important for proper channel assembly, so we added a new supplementary figure that shows endogenous CNGA1 expression increases in rds-/- rods that are overexpressing myc-CNGB1 and FLAG-peripherin-2. This adds credence to our CNGB1 overexpression experiments and shows that CNGB1 being trapped is not due to inefficient channel assembly.

      3) Claims related to Figure 6 (P45 rds-/-) need further evidence. It remains uncertain whether CNGA1 and CNGB1 are delivered to lamellar ciliary membranes or to a distinct plasma membrane compartment comparable to that observed in wild type rod outer segments, or whether they accumulate in ciliary ectosomes. Those lamellar structures could be a part of cone outer segments. The observed GARP signal may originate solely from soluble GARP proteins. It is also unclear if CNGA1 and ROM1 colocalize in P45 rds-/- mice. Clarifying these points would strengthen the conclusion that lamellar formation, rather than specific function of PRPH2, is sufficient for CNGB1 delivery to the cilium or outer segment plasma membrane.

      RESPONSE: CNGA1/B1 are not expressed in cones, so the elevated outer segment localization observed at P45 must be coming from rods. In mouse retina, cones make up only 3% of the photoreceptor population. The SEM data clearly show that the lamellar ciliary protrusions are present on the majority of the photoreceptors. We now include CNGB1 staining from Rds-/- P45 sections that corroborate these data and show that CNGB1 is present at P45 and not P21 (Supplemental Figure 2).

      Below are minor comments: 1) The study does not establish whether a direct interaction between PRPH2 and CNGB1 is required for CNGB1 delivery to rod outer segments. Prior work by the senior author (ref 13) suggests that this interaction is not essential, since the PRPH2 binding site within the GARP domain is distinct from outer segment transport signal of CNGB1. Including a discussion of the PRPH2-GARP (or CNGB1) interaction and its relevance to CNGB1 trafficking would help readers interpret the findings more fully.

      RESPONSE: We have included this in our discussion.

      2) The authors propose that the ROM1 core is sufficient for outer segment delivery of CNGB1 based on experiments with chimeric constructs. However, in Figure 1, ROM1 is present in the outer segments (or ciliary ectosomes) of rds-/- mice even though CNGB1 is not delivered to these structures.

      RESPONSE: Our new data, including MS analysis and Western analysis from an enriched ectosome preparation, reveal that, along with ROM1, low levels of the CNG channel are delivered to ciliary ectosomes in Rds-/- mice. However, at this early timepoint photoreceptor cilia do not produce a membrane protrusion, which we observe is required to augment CNG delivery. We expressed a FLAG-ROM1 construct to try to drive earlier creation of these membrane protrusions, but this was unsuccessful, as we observed ROM1 was primarily localized to the inner segment. This suggests that overexpression of ROM1 did not increase ROM1 delivery to the cilia. Luckily, we were able to overcome this bottleneck with several of our chimeric ROM1/Prph2 constructs that did localize to the cilia and restore CNG localization. All of these new results have been included in the revised manuscript.

      3) Line 80: "Theouter" A space shall be inserted between "The" and "outer".

      RESPONSE: Done

      **Referee cross-commenting**

      Both reviewer #2 and reviewer #3 express views that align with mine. They clearly described the study's limitations, and their comments are highly valuable.

      Reviewer #1 (Significance (Required)):

      Prior studies showed that CNGB1 is not present in cilia-derived ectosomes of rds-/- mice, indicating that PRPH2 is necessary for ciliary or outer segment localization of CNGB1 in rods. Building on these earlier findings, I consider this study significant for the following reasons: 1) Using detailed analysis of different PRPH2 domains and chimeric constructs, it clarifies that PRPH2 core region, delivered to OSs, is essential and sufficient for OS localization of CNGB1. 2) PRPH2 and CNGB1 are thought to travel through different post-ER transport routes, with one pathway bypassing Golgi regions and the other passing through them. This study shows that CNGB1 depends on PRPH2, which suggests that these two routes may converge or interact at later stages and opens new directions for future investigation. 3) The study is relevant to basic scientists and biologists investigating how membrane structures acquire specialized functions in neurons, and its implications extend beyond photoreceptor biology.

      Limitation of the study: I believe that clarifying these points will make the manuscript more significant. 1) Is it not clear, as mentioned above, how PRPH2 contributes to the delivery of CNGB1 to the OSs in the different secretory pathways.

      RESPONSE: In the absence of ROM1, Prph2 only travels through the unconventional secretory pathway directly from the ER. By looking at CNG trafficking and localization in ROM1-/- mice, we rule out the possibility that the small portion of PRPH2/ROM1 complexes that traffic conventionally through the Golgi are required for channel localization (Figure 3). Further, our Rho-Prph2 chimera that includes the trafficking signal from Prprh2 did not rescue CNGB1 localization (Figure 4). These findings suggest that it is unlikely that these proteins engage during secretory transport to the outer segment.

      2) The prior study using a fluorescence complementation approach (Ritter et al, 2011) suggests that PRPH2 and CNGB1 can associate within rod ISs, likely before their delivery to OSs. However, it remains unclear whether this interaction supports the potential cotransport of CNGB1 and PRPH2 or whether the authors view these proteins as being transported independently.

      RESPONSE: As described above, our experiments rule out the notion that co-transport through the Golgi is driving CNG channel ciliary localization. We now note in our discussion that this data does not rule out the possibility of an earlier association between these proteins. However, the bulk of our data supports that any early interaction is not required for ciliary delivery.

      3) At the end of the result section (Figure 6, rds-/- P45), the authors suggest that lamellar formation (evaginations?) is required for CNGB1 transport. However, CNGB1 is normally not seen in evaginations or lamellar structures, and thus the assumption is not consistent with prior findings.

      RESPONSE: Absolutely, we agree that the CNG channel does not enter newly forming disc membranes, which has been shown by multiple groups. We included this in our discussion and have now added a clearer statement of our hypothesis: “Together, these data suggest that the partitioning of disc membranes from the plasma membrane by tetraspanin proteins is a key step for localizing the CNG channel and could play a role in segregating other proteins into the plasma membrane.”

      Overall, the manuscript is insightful and has the potential to advance our field and related disciplines.

      RESPONSE: Thanks!

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Cyclic nucleotide gated channels (CNG) localize to the plasma membrane of the rod photoreceptor outer segments, and are a key component of the phototransduction cascade. Understanding how outer segment proteins are trafficked and sequestered to the outer segments is an important field of investigation as it addresses both a fundamental aspect of cell biology and mechanism of disease, many of which have trafficking defects at the core of the pathogenic process. Using primarily IHC analysis of rodent models in combination with introduction of various expression constructs to the retina (through electroporation), this study finds that two rod outer segment structural proteins, peripheral-2 and ROM1, facilitate CNG channel localization to the outer segment.

      While this conclusion is interesting, a major concern that tempers enthusiasm is that in peripherin-2 null photoreceptors, there are no outer bona fide segments. In lieu of outer segments, there are rudimentary membranous protrusions and vesicles distal to the connecting cilia where outer segments should be. So the basis for concluding that peripherin-2 is required for CNG localization to the outer segment seems a bit wobbly. It is understood that the authors assumed the membranous materials distal to cilia as proxy for outer segments in their analysis and narrative. This assumption may have some merits. However, it is well known that when outer segment morphogenesis is severely compromised, all normally outer segment-bound proteins are ectopically localized or largely absent due to increased degradation. This could be simply due to the loss of their destination compartment, among other things. It is not clear how the authors could distinguish between a direct causal relationship where loss of one protein leads to the mislocalization of another, from secondary outcomes due to loss of the outer segments. The last sentence of the Abstract is telling. "Interestingly, this notion is supported by endogenous staining of CNGB1, which reappears in aged Rds-/- rods that have produced ciliary membrane protrusions." So in aged mice CNGB1 did localize to the OS, but what changed? There was more OS like material to house the CNGB1 protein in the aged mice.

      RESPONSE: We agree that the loss of the OS compartment is likely driving downregulation of all OS proteins and have included a statement as such in our manuscript. We also performed additional qRT-PCR analysis on ROM1 and ABCA4 to show global downregulation at the mRNA level – consistent with the notion that there are reduced outer segment proteins when morphogenesis is compromised. However, our Westerns and IHC (as well as published data) clearly find a specific decrease in the CNG channel at the protein level, suggesting that not all proteins behave similarly when the outer segment is not formed. We included additional discussion on this point as well. While not directly examined in our manuscript, previous reports have shown the reverse effect: some outer segment proteins (e.g. PCDH21, Prom1) are upregulated in rds-/- retinas (Rattner et al JBC 2004). Therefore, it is an oversimplification to state that all outer segment proteins behave the same when outer segments are not formed properly. Other models of outer segment dysmorphia (e.g. RhoKO, PCDH21KO, Prom1KO, or WASF3) localize the CNG channel properly. We have added this to the discussion and hope that by restructuring our manuscript, we clearly outline that we do think that membrane retention at the tip of the cilia is driving CNG channel localization and that molecularly the tetraspanin proteins play a role in organizing these membranes.

      Reviewer #2 (Significance (Required)):

      Trafficking of nascent proteins to the outer segment in support of its renewal is an important subject, which has significant impact in understanding the mechanisms of retinal degeneration. The conclusion from this study, that peripherin-2 and ROM1 have a direct role in supporting CNG subunit trafficking may well be meritorious. However the data presented are less than fully convincing, and specifically the question of a direct vs secondary effect needs to be better addressed.

      RESPONSE: We appreciate this reviewer’s enthusiasm for investigating this process. The initial premise of our study was to investigate whether a direct effect of peripherin-2 on CNG delivery was possible, which was meritorious based on previously published data. However, we now find no direct trafficking link between CNG and peripherin-2; instead, our data largely find that CNG delivery is dependent on the presence of retained membranes at the ciliary tip – either through natural mechanisms or by driving “rudimentary” outer segment membrane lamination by overexpression of tetraspanin domains. We have restructured the manuscript to help guide the discussion.

      The following quote underpins some of the reasoning in the study. Lines 139-144, "(Figure 2A). This localization pattern suggests that the CNGB1 subunit is trapped in the biosynthetic pathway. In contrast, when FLAG-tagged rhodopsin is overexpressed in Rds-/- rods it traffics properly to outer segment ectosomes (Figure 2B, (19)). We posit that without proper exit from the biosynthetic pathway, the endogenous CNGB1 protein is rapidly degraded to undetectable levels, which we circumvent through overexpression. These data suggest the localization defect of CNGB1 in Rds-/- rods is in the trafficking of CNGB1. " This in my view is an over- interpretation of limited data. The statement implies that rhodopsin and CNGB1 qualitatively differ in their fate but I would argue that both proteins are heavily degraded intracellularly except more of rhodopsin escaped to the "OS" and shows up in IHC. In many rhodopsin mutant transgenic mice, mutant rhodopsin appeared in OS even though intracellular degradation (gumming up the system) is a major factor in the disease process. The claim "rhodopsin trafficked properly to outer segment ectosomes" is not grounded in solid data.

      RESPONSE: We do fundamentally agree that the endogenous CNG channel is heavily degraded, which we confirm by overexpressing an exogenous CNGB1-myc and finding it trapped in the biosynthetic pathway. As stated by the reviewer, this localization pattern is in contrast to what we and others have observed for endogenous rhodopsin, and now show for overexpressed FLAG-rhodopsin – that rhodopsin does traffic to the OS ectosomes. By comparing the localization of both endogenous and overexpressed constructs (using the same promoter), we feel that our conclusion is well supported. We appreciate that our wording of “rhodopsin trafficked properly to the outer segment” is misleading, as traffic of membrane proteins in Rds-/- rods is generally affected and not “proper”. Importantly, we follow up this “limited data” with additional experiments showing that at high expression levels, we are unable to drive CNGB1 localization to OS ectosomes unless we co-express with a tetraspanin domain.

      A further minor comment is that the scope of the study appear limited, with no attempted experiments on how these proteins might interact to effect facilitation of trafficking.

      RESPONSE: Our approach was to be agnostic to the outcome of our hypothesis that peripherin-2 was directly involved in CNG channel trafficking. The experiments we performed to test this (ROM1-/- analysis and Prph2 C-terminal chimeras) did not support a role for peripherin-2 in CNG trafficking. Instead, our data support a model in which membrane retention and organization at the ciliary tip drives CNG channel delivery. We feel that our approach was not limited.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      in the gene encoding tetraspanin protein peripherin 2 (Prph2), i.e., Rds-/-, examining the requirements for various portions of the Prph2 protein in the context of an assortment of chimeric constructs expressed via transfection into photoreceptor cells, to restore localization of the beta subunit of the cyclic nucleotide-gated channel (CNGbeta1) to photoreceptor outer segments (OS) (in a small number of experiments) or, in the majority of experiments, to do so for a recombinant tagged version of this protein also overexpressed by transfection.

      The concluding sentences of the Discussion, which summarize the major conclusions are as follows: "Our data clearly show that localization of the CNG channel is dependent upon peripherin-2 after biosynthetic exit, further suggesting that the necessary action is at the ciliary base. Supporting evidence for this comes from analysis of Rhodopsin knockout outer segments which have internal disc-like structures and localize CNG channel properly. Therefore, in the absence of a fully elaborated outer segment, peripherin-2's ability to delineate a disc is sufficient to drive CNG channel delivery. Together, these data suggest that the partitioning of disc membranes from the plasma membrane by tetraspanin proteins is a key step for trafficking the CNG channel and could play a role in segregating other proteins into the plasma membrane.

      The first sentence contains both reasonable conclusions and phrases whose meaning is unclear or not supported by the results presented. The statement: 'localization of the CNG channel is dependent upon peripherin-2 is supported by the data but, of course, has long been known from previous studies of Rds-/- mice. What is meant by "...after biosynthetic exit..." is unclear. If, by this term, apparently newly invented, the authors mean "after its synthesis of the protein is complete," the statement is accurate, but also a truism.

      RESPONSE: The absence of CNGB1 was reported in previous studies, but the mechanism driving its absence has not been investigated. In our resubmission, we have added additional data that now shows CNGB1 is present at very low levels in Rds-/- ectosomes but remains undetectable by IHC, which is consistent with previous studies mentioned by the reviewer, but is also a novel finding. Importantly, we find specific downregulation of CNG channel subunits in Rds-/- retinas compared to ABCA4, supported by Western blot analysis (Figure 1), and we investigate the mechanism driving this result.

      We appreciate the reviewer pointing out that “biosynthetic exit” is a niche term not broadly understood. We have removed this statement.

      The statement, "the necessary action is at the ciliary base," is NOT supported by the data presented, as the effect of the "successful" Prph2 constructs on CNGbeta1 localization is primarily to increase its levels at the distal end of cilia and at the base of OS-related structures formed in response to the presence of the Prph2 constructs. The restoration of these membranes, which, as the authors note, has been previously reported, is overwhelmingly the biggest effect of these constructs, and it could be argued that the restored localization, rather than degradation, of CNGbeta1 is merely a downstream consequence of the formation of these structures, with perhaps, an element of stabilization of CNGbeta1 toward degradation from direct binding to Prph2, which has also been previously reported.

      RESPONSE: We agree with the reviewer. Our interpretation of our data is that the presence of Prph2 (or its variants) at the distal end of the cilia localizes CNGB1, likely due to the formation of outer segment membrane structures. Previous to this work, there was a possibility that targeting information of Prph2 was required for CNGB1. That had never been explored. We definitively rule this possibility out when we express the C-terminal tail of Prph2, which is unable to rescue CNGB1 localization. Because the tetraspanin domain of Prph2 (or ROM1) can localize CNGB1, we do agree that the definition of an outer segment structure is the driving force for CNGB1 delivery – these are new findings. We’ve restructured and added additional discussion to the manuscript to clarify this point.

      The next suggested conclusion is, "Therefore, in the absence of a fully elaborated outer segment, peripherin-2's ability to delineate a disc is sufficient to drive CNG channel delivery," is partly accurate and partly misleading. If the word "localization" were to replace the term, "delivery," concerning which there are no data (aside from those confirming that Prph2 and CNGbeta1 pass through distinct secretory pathways), this statement would be an accurate summary.

      RESPONSE: We have updated to “localization”, but the fact that we confirm these two proteins do not traffic together through the Golgi would suggest that delivery is independent of trafficking.

      The final sentence, "Together, these data suggest that the partitioning of disc membranes from the plasma membrane by tetraspanin proteins is a key step for trafficking the CNG channel and could play a role in segregating other proteins into the plasma membrane," sentence, would also be accurate if the word "localization," were to replace the term, "trafficking." The key point for these qualifications is that the experiments presented measure steady state levels of CNGbeta1 constructs at certain locations, which are determined not only by rates of trafficking, but also rates of synthesis and degradation, and the data presented confirm that total levels of CNGbeta1 are greatly diminished in the absence of functional Prph2, rendering any conclusions about the relative roles of trafficking kinetics and degradation kinetics speculative in nature.

      RESPONSE: We agree and have revised.

      Aside from these major conceptual issues, there is one overriding technical question: why are almost all the experiments presented carried out with a highly over-expressed engineered version of CNGb1 with a tag, which is clearly context far from the physiological one, as opposed to examining redistribution of the endogenous CNGbeta1, which is of much greater interest. In some results relegated to a Supplemental figure (Supp. Fig. 2), the authors clearly demonstrate that sufficient signal can be obtained from immunofluorescence staining the endogenous proteins for such experiments to be readily interpretable. If the concern was cross-reactivity with non-covalently attached GARP proteins, a few experiments showing that similar results are obtained for immunostaining of the endogenous protein or of the tagged construct would haver been sufficient, and the paper could have had more physiological relevance and impact.

      RESPONSE: We agree that endogenous CNG staining is important and valuable, which is why we included it in our manuscript. We were able to confirm that overexpressed CNG recapitulated the endogenous staining. We proceeded with analyzing overexpressed, tagged CNG for the reasons stated by the reviewer. Yes, cross-reactivity with soluble GARP proteins was one consideration, as was the fact that the GARP antibody is a mouse monoclonal antibody. Increased IgG due to inflammation in the RDS-/- model can obscure the outer segment region in these retinas, confounding our quantification. The tagged versions of CNGB1 and corresponding quantification offered the most clarity and continuity for the reader; therefore, we relegate the endogenous staining to the supplement.

      The remaining concerns are generally of less significance and mostly conceptual or quite minor technical concerns. Technically, the imaging data and their quantification are of good quality and analyzed with reasonable rigor.

      RESPONSE: Thanks!

      Abstract: "In this study, we investigate how peripherin-2 is engaged in CNG channel delivery to the outer segment. Might this not be more a question of how the absence of properly formed discs impacts the formation of outer segments with plasma membranes surrounding the disks? Is this really a question of "delivery" or "lack of address to make the delivery"?

      RESPONSE: Our interpretation of this comment is that it boils down to semantics. Delivery is inclusive of both trafficking and localization, which we investigate in our manuscript.

      Page 3, "fluorescence complementation between peripherin-2 and CNGb1 in the inner segment of transgenic Xenopus rods (23) ". The wording is unclear. It should be stated clearly that they are describing results of "bimolecular fluorescence complementation assays" of highly overexpressed recombinant proteins expressed from transgenes.

      RESPONSE: We have revised.

      Page 4, "...trapped in the biosynthetic pathway," It is unclear what the authors mean by this phrase. Obviously, "biosynthesis," i.e., translation is indeed complete, but biochemical pathways are not places. Is the intention to suggest that post-translational processing, such as addition and editing of carbohydrate chains or assembly with the alpha subunit has not been completed? If so, it would be better just to say so clearly. Or, is it meant to imply that it is physically "trapped" in the ER and/or Golgi apparatus? In any case the meaning should be made clear. Co-staining with ER and Golgi markers would have been very informative with respect to the compartments in which the highly overexpressed recombinant protein is trapped.

      RESPONSE: We acknowledge that our phrasing here was indirect. We have revised. Co-staining with Calnexin (an ER-marker) was attempted, but proved to be uninformative.

      It should also be noted that accumulation of highly overexpressed membrane proteins within internal membranes and membrane aggregates is a very commonly observed experimental phenomenon, and not restricted to the highly specialized trafficking routes in photoreceptors.

      RESPONSE: We agree that exogenous expression of membrane proteins can lead to increased presence within internal membranes of the inner segment, which we routinely see in our experiments. Importantly, our analysis is restricted to the ability of these exogenously expressed proteins to reach the ciliary compartment in Rds mice. We also conduct these experiments in wild-type retinas to ensure that our constructs are expressed, and the proteins reach the ciliary outer segment under normal conditions.

      Page 4, " peripherin-2 facilitates trafficking of the CNGb1 subunit to the outer segment " The data presented to this point do not demonstrate an enhancement of transport, but only of steady-state levels. There is nothing to rule out the possibility that some beta subunit is trafficked in Rds-/-, but is unstable to degradation in the region near the cilium when peripherin-2 and outer segments are not available. An increase in transport is certainly a possible explanation for the results, but should not be taken as an unambiguous conclusion.

      RESPONSE: We have altered the description of these results to allow for more interpretation of our data, which show that CNGB1 delivery to the outer segment is reduced in Rds-/- mice and enhanced when peripherin-2 is re-expressed.

      Page 4, " We confirmed that the fraction of peripherin-2 that traffics conventionally through the Golgi is indeed absent in Rom1-/- retinas and found that trafficking of the CNG channel via the conventional pathway is unaffected (Figure 3A) . This is one of the stronger and more interesting results in this manuscript, and tilts the argument against trafficking as being the mechanism for enhancement by overexpressed peripherin-2 of beta subunit levels in the distal region of the photoreceptor layer.

      RESPONSE: We agree.

      Page 5, " Our finding that secretory trafficking of peripherin-2 and CNGb1 is distinct . Clumsy syntax- needs to be rewritten for clarity.

      RESPONSE: Revised

      Page 5, "two previously characterized fusion proteins... have been shown to localize to the outer segment and build a rudimentary membrane structure (19) " This previous result, which is critical to interpretation of the results in this manuscript, should be introduced early, before any experimental results using related constructs are presented, in order to avoid confusion.

      RESPONSE: Prior to these experiments, we used only full-length peripherin-2, rhodopsin, or CNGB1. This paragraph is the first introduction of any chimeric protein, and we explain these two constructs thoroughly. We believe this satisfies this reviewer’s request.

      Page 5, " We confirmed these data by staining for endogenous CNGb1 in Rds-/- rods electroporated with each construct (Supplemental Figure 2B,C) " This is the most informative result in this manuscript with regard to the ability of these constructs to restore proper localization of CNGB1- it is not clear that the overexpression constructs for CNGB1 present any advantage beyond stronger signal and they may not be assumed, a priori, to be faithfully reporting on interactions of Prph2 with endogenous CNGB1, which is the biologically significant question. A big problem with Supp. Fig. 2 is that there is no real control, i.e., one without any Prph2 construct electroporated. Even the Rho-Prph2CT construct has some ROS-related structures and some CNGB1 localized to the one shown at higher magnification. The Prph2-RhoCT construct seems to lead to a substantial increase in endogenous CNGB1 in inner segment membranes. This looks like a phenomenon that is potentially very interesting, although it doesn't fit with any of the models put forth in the manuscript.

      RESPONSE: We agree that endogenous staining (shown in Supplemental Figure 3 of our revised manuscript) is informative, but it was technically challenging. Once we verified that our overexpression system recapitulated results for endogenous CNGB1, we went forward with the epitope-tagged CNGB1, which was clearer when quantifying CNGB1 localization to rudimentary outer segments.

      Our electroporation method provides an excellent internal control, as all of the non-electroporated cells show no endogenous CNGB1 localization without peripherin expression (Sup Fig 3A).

      Page 5, " cytosolic N- and C-termini of peripherin-2 are dispensable for CNGb1 outer segment localization " No- if you could simply remove them and get proper localization, that would show they are "dispensable." In these experiments they are always replaced with the corresponding region of some other protein that is localized to OS, or in one case, with 3 copies of the FLAG tag at the N-terminus. There are also clear differences in the efficacy of the different "successful" constructs, but these results and their implications are not really discussed.

      RESPONSE: We make this statement in the context of these termini being dispensable to CNGB1 localization, not to peripherin-2’s stability, function, or localization. A complete truncation of either domain results in a non-functioning protein. Our supplemental data shows reduced expression with a truncated N-terminus, preventing analysis (Sup Fig 5C). The 3X-FLAG has no known function in the cell, and we believe it serves as a proxy for removing the N-terminus altogether. Removing the C-terminus would prevent proper outer segment targeting, which is key to determining how peripherin-2 impacts CNGB1 ciliary delivery. Replacing this C-terminus with an outer segment targeting domain from another protein is an established method of investigation.

      Page 6, " We then wanted to determine whether the ROM1 tetraspanin region was sufficient to facilitate CNGb1 delivery by further replacing ROM1's cytoplasmic N-terminus with that of peripherin-2 (Prph2NT/CT-ROM1) . " This experiment obviously does NOT test "sufficiency" of the TM segments, as the construct has the termini replaced with the corresponding regions of Prph2, which might functionally substitute for the missing ROM1 regions.

      RESPONSE: Our previous results had already ruled out a role for these termini in CNGB1 localization.

      Page 6, " We show a dramatic increase in GARP staining in the aged Rds-/- retinal sections " The age dependence of this phenomenon is quite interesting and puzzling. Any thoughts on the mechanism?

      RESPONSE: We agree that this natural process is very interesting. We have restructured the order of our figures and provided additional controls to support this finding. We have added this to the discussion and hope that by restructuring our manuscript, we clearly outline that we do think that membrane retention at the tip of the cilia is driving CNG channel localization and that molecularly the tetraspanin proteins play a role in organizing these membranes.

      Page 6, " Although CNGα1, known to form homotetramers, can localize to the extracellular vesicles released into the outer segment area. " Not a sentence.

      RESPONSE: Revised

      Page 6, " Our data now shows that the population of peripherin-2 in complex with ROM1 that travels through the conventional trafficking pathway does not play a role in CNGb1 localization to the outer segment. " This is an oddly accurate, albeit somewhat contradictory sentence. Yes, you have failed to answer the question you claim this work was designed to address. Apart from this negative result, nothing is learned about trafficking, per se, from the experiments in this manuscript.

      RESPONSE: Please see our response to the reviewer’s comment above that clarifies our thinking regarding our results on trafficking.

      Page 7, " anticipated " Hopefully, the authors mean to say, "hypothesized," here.

      RESPONSE: Revised

      **Referee cross-commenting**

      My impression from reading the reviewers' comments is that there is general agreement on both the strengths and the limitations of this work. In my opinion, the issues raised by the reviewers could be addressed by editing the manuscript to be more circumspect in drawing definite conclusions from data that are not fully conclusive, without necessarily adding new experiments.

      Reviewer #3 (Significance (Required)):

      This study addresses a problem of great interest in the photoreceptor field and in cell biology more generally of trafficking and localization of specialized membrane proteins to specialized ciliary membranes. The strengths are technical quality of data with good controls, in most cases. The limitations are largely conceptual in nature and derive from the rather simplistic approach to the experimental design, as described above. The rather dated, "mix and match" approach based on chimeric construct with pieces of sequences removed and replaced at will does not properly account for the conclusion reached many times from many experiments, including some this manuscript, that the "roles" of stretches of amino acid sequence depend exquisitely on the multidimensional context in which they are tested, not simply on their position in the linear sequence. The paper presents interesting and convincing results with respect to functional requirements for formation disc-like membranes, but very little with respect to 'trafficking."

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      in the gene encoding tetraspanin protein peripherin 2 (Prph2), i.e., Rds-/-, examining the requirements for various portions of the Prph2 protein in the context of an assortment of chimeric constructs expressed via transfection into photoreceptor cells, to restore localization of the beta subunit of the cyclic nucleotide-gated channel (CNGbeta1) to photoreceptor outer segments (OS) (in a small number of experiments) or, in the majority of experiments, to do so for a recombinant tagged version of this protein also overexpressed by transfection.

      The concluding sentences of the Discussion, which summarize the major conclusions are as follows: "Our data clearly show that localization of the CNG channel is dependent upon peripherin-2 after biosynthetic exit, further suggesting that the necessary action is at the ciliary base. Supporting evidence for this comes from analysis of Rhodopsin knockout outer segments which have internal disc-like structures and localize CNG channel properly. Therefore, in the absence of a fully elaborated outer segment, peripherin-2's ability to delineate a disc is sufficient to drive CNG channel delivery. Together, these data suggest that the partitioning of disc membranes from the plasma membrane by tetraspanin proteins is a key step for trafficking the CNG channel and could play a role in segregating other proteins into the plasma membrane.

      The first sentence contains both reasonable conclusions and phrases whose meaning is unclear or not supported by the results presented. The statement: 'localization of the CNG channel is dependent upon peripherin-2 is supported by the data but, of course, has long been known from previous studies of Rds-/- mice. What is meant by "...after biosynthetic exit..." is unclear. If, by this term, apparently newly invented, the authors mean "after its synthesis of the protein is complete," the statement is accurate, but also a truism. The statement, "the necessary action is at the ciliary base," is NOT supported by the data presented, as the effect of the "successful" Prph2 constructs on CNGbeta1 localization is primarily to increase its levels at the distal end of cilia and at the base of OS-related structures formed in response to the presence of the Prph2 constructs. The restoration of these membranes, which, as the authors note, has been previously reported, is overwhelmingly the biggest effect of these constructs, and it could be argued that the restored localization, rather than degradation, of CNGbeta1 is merely a downstream consequence of the formation of these structures, with perhaps, an element of stabilization of CNGbeta1 toward degradation from direct binding to Prph2, which has also been previously reported.

      The next suggested conclusion is, "Therefore, in the absence of a fully elaborated outer segment, peripherin-2's ability to delineate a disc is sufficient to drive CNG channel delivery," is partly accurate and partly misleading. If the word "localization" were to replace the term, "delivery," concerning which there are no data (aside from those confirming that Prph2 and CNGbeta1 pass through distinct secretory pathways), this statement would be an accurate summary. The final sentence, "Together, these data suggest that the partitioning of disc membranes from the plasma membrane by tetraspanin proteins is a key step for trafficking the CNG channel and could play a role in segregating other proteins into the plasma membrane," sentence, would also be accurate if the word "localization," were to replace the term, "trafficking." The key point for these qualifications is that the experiments presented measure steady state levels of CNGbeta1 constructs at certain locations, which are determined not only by rates of trafficking, but also rates of synthesis and degradation, and the data presented confirm that total levels of CNGbeta1 are greatly diminished in the absence of functional Prph2, rendering any conclusions about the relative roles of trafficking kinetics and degradation kinetics speculative in nature.

      Aside from these major conceptual issues, there is one overriding technical question: why are almost all the experiments presented carried out with a highly over-expressed engineered version of CNGb1 with a tag, which is clearly context far from the physiological one, as opposed to examining redistribution of the endogenous CNGbeta1, which is of much greater interest. In some results relegated to a Supplemental figure (Supp. Fig. 2), the authors clearly demonstrate that sufficient signal can be obtained from immunofluorescence staining the endogenous proteins for such experiments to be readily interpretable. If the concern was cross-reactivity with non-covalently attached GARP proteins, a few experiments showing that similar results are obtained for immunostaining of the endogenous protein or of the tagged construct would haver been sufficient, and the paper could have had more physiological relevance and impact.

      The remaining concerns are generally of less significance and mostly conceptual or quite minor technical concerns. Technically, the imaging data and their quantification are of good quality and analyzed with reasonable rigor.

      Abstract: "In this study, we investigate how peripherin-2 is engaged in CNG channel delivery to the outer segment. Might this not be more a question of how the absence of properly formed discs impacts the formation of outer segments with plasma membranes surrounding the disks? Is this really a question of "delivery" or "lack of address to make the delivery"?

      Page 3, "fluorescence complementation between peripherin-2 and CNG1 in the inner segment of transgenic Xenopus rods (23) ". The wording is unclear. It should be stated clearly that they are describing results of "bimolecular fluorescence complementation assays" of highly overexpressed recombinant proteins expressed from transgenes.

      Page 4, "...trapped in the biosynthetic pathway," It is unclear what the authors mean by this phrase. Obviously, "biosynthesis," i.e., translation is indeed complete, but biochemical pathways are not places. Is the intention to suggest that post-translational processing, such as addition and editing of carbohydrate chains or assembly with the alpha subunit has not been completed? If so, it would be better just to say so clearly. Or, is it meant to imply that it is physically "trapped" in the ER and/or Golgi apparatus? In any case the meaning should be made clear. Co-staining with ER and Golgi markers would have been very informative with respect to the compartments in which the highly overexpressed recombinant protein is trapped. It should also be noted that accumulation of highly overexpressed membrane proteins within internal membranes and membrane aggregates is a very commonly observed experimental phenomenon, and not restricted to the highly specialized trafficking routes in photoreceptors.

      Page 4, " peripherin-2 facilitates trafficking of the CNG1 subunit to the outer segment " The data presented to this point do not demonstrate an enhancement of transport, but only of steady-state levels. There is nothing to rule out the possibility that some beta subunit is trafficked in Rds-/-, but is unstable to degradation in the region near the cilium when peripherin-2 and outer segments are not available. An increase in transport is certainly a possible explanation for the results, but should not be taken as an unambiguous conclusion.

      Page 4, " We confirmed that the fraction of peripherin-2 that traffics conventionally through the Golgi is indeed absent in Rom1-/- retinas and found that trafficking of the CNG channel via the conventional pathway is unaffected (Figure 3A) . This is one of the stronger and more interesting results in this manuscript, and tilts the argument against trafficking as being the mechanism for enhancement by overexpressed peripherin-2 of beta subunit levels in the distal region of the photoreceptor layer.

      Page 5, " Our finding that secretory trafficking of peripherin-2 and CNG1 is distinct . Clumsy syntax- needs to be rewritten for clarity.

      Page 5, "two previously characterized fusion proteins... have been shown to localize to the outer segment and build a rudimentary membrane structure (19) " This previous result, which is critical to interpretation of the results in this manuscript, should be introduced early, before any experimental results using related constructs are presented, in order to avoid confusion.

      Page 5, " We confirmed these data by staining for endogenous CNG1 in Rds-/- rods electroporated with each construct (Supplemental Figure 2B,C) " This is the most informative result in this manuscript with regard to the ability of these constructs to restore proper localization of CNGB1- it is not clear that the overexpression constructs for CNGB1 present any advantage beyond stronger signal and they may not be assumed, a priori, to be faithfully reporting on interactions of Prph2 with endogenous CNGB1, which is the biologically significant question. A big problem with Supp. Fig. 2 is that there is no real control, i.e., one without any Prph2 construct electroporated. Even the Rho-Prph2CT construct has some ROS-related structures and some CNGB1 localized to the one shown at higher magnification. The Prph2-RhoCT construct seems to lead to a substantial increase in endogenous CNGB1 in inner segment membranes. This looks like a phenomenon that is potentially very interesting, although it doesn't fit with any of the models put forth in the manuscript.

      Page 5, " cytosolic N- and C-termini of peripherin-2 are dispensable for CNG1 outer segment localization " No- if you could simply remove them and get proper localization, that would show they are "dispensable." In these experiments they are always replaced with the corresponding region of some other protein that is localized to OS, or in one case, with 3 copies of the FLAG tag at the N-terminus. There are also clear differences in the efficacy of the different "successful" constructs, but these results and their implications are not really discussed.

      Page 6, " We then wanted to determine whether the ROM1 tetraspanin region was sufficient to facilitate CNG1 delivery by further replacing ROM1's cytoplasmic N-terminus with that of peripherin-2 (Prph2NT/CT-ROM1) . " This experiment obviously does NOT test "sufficiency" of the TM segments, as the construct has the termini replaced with the corresponding regions of Prph2, which might functionally substitute for the missing ROM1 regions.

      Page 6, " We show a dramatic increase in GARP staining in the aged Rds-/- retinal sections " The age dependence of this phenomenon is quite interesting and puzzling. Any thoughts on the mechanism?

      Page 6, " Although CNGα1, known to form homotetramers, can localize to the extracellular vesicles released into the outer segment area. " Not a sentence.

      Page 6, " Our data now shows that the population of peripherin-2 in complex with ROM1 that travels through the conventional trafficking pathway does not play a role in CNG1 localization to the outer segment. " This is an oddly accurate, albeit somewhat contradictory sentence. Yes, you have failed to answer the question you claim this work was designed to address. Apart from this negative result, nothing is learned about trafficking, per se, from the experiments in this manuscript.

      Page 7, " anticipated " Hopefully, the authors mean to say, "hypothesized," here.

      Referee cross-commenting

      My impression from reading the reviewers' comments is that there is general agreement on both the strengths and the limitations of this work. In my opinion, the issues raised by the reviewers could be addressed by editing the manuscript to be more circumspect in drawing definite conclusions from data that are not fully conclusive, without necessarily adding new experiments.

      Significance

      This study addresses a problem of great interest in the photoreceptor field and in cell biology more generally of trafficking and localization of specialized membrane proteins to specialized ciliary membranes. The strengths are technical quality of data with good controls, in most cases. The limitations are largely conceptual in nature and derive from the rather simplistic approach to the experimental design, as described above. The rather dated, "mix and match" approach based on chimeric construct with pieces of sequences removed and replaced at will does not properly account for the conclusion reached many times from many experiments, including some this manuscript, that the "roles" of stretches of amino acid sequence depend exquisitely on the multidimensional context in which they are tested, not simply on their position in the linear sequence. The paper presents interesting and convincing results with respect to functional requirements for formation disc-like membranes, but very little with respect to 'trafficking."

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Cyclic nucleotide gated channels (CNG) localize to the plasma membrane of the rod photoreceptor outer segments, and are a key component of the phototransduction cascade. Understanding how outer segment proteins are trafficked and sequestered to the outer segments is an important field of investigation as it addresses both a fundamental aspect of cell biology and mechanism of disease, many of which have trafficking defects at the core of the pathogenic process. Using primarily IHC analysis of rodent models in combination with introduction of various expression constructs to the retina (through electroporation), this study finds that two rod outer segment structural proteins, peripheral-2 and ROM1, facilitate CNG channel localization to the outer segment.

      While this conclusion is interesting, a major concern that tempers enthusiasm is that in peripherin-2 null photoreceptors, there are no outer bona fide segments. In lieu of outer segments, there are rudimentary membranous protrusions and vesicles distal to the connecting cilia where outer segments should be. So the basis for concluding that peripherin-2 is required for CNG localization to the outer segment seems a bit wobbly. It is understood that the authors assumed the membranous materials distal to cilia as proxy for outer segments in their analysis and narrative. This assumption may have some merits. However, it is well known that when outer segment morphogenesis is severely compromised, all normally outer segment bound proteins are ectopically localized or largely absent due to increased degradation. This could be simply due to the loss of their destination compartment, among other things. It is not clear how the authors could distinguish between a direct causal relationship where loss of one protein leads to the mislocalization of another, from secondary outcomes due to loss of the outer segments. The last sentence of the Abstract is telling. "Interestingly, this notion is supported by endogenous staining of CNGB1, which reappears in aged Rds-/- rods that have produced ciliary membrane protrusions." So in aged mice CNGB1 did localize to the OS, but what changed? There was more OS like material to house the CNGB1 protein in the aged mice.

      Significance

      Trafficking of nascent proteins to the outer segment in support of its renewal is an important subject, which has significant impact in understanding the mechanisms of retinal degeneration. The conclusion from this study, that peripherin-2 and ROM1 have a direct role in supporting CNG subunit trafficking may well be meritorious. However the data presented are less than fully convincing, and specifically the question of a direct vs secondary effect needs to be better addressed.

      The following quote underpins some of the reasoning in the study. Lines 139-144, "(Figure 2A). This localization pattern suggests that the CNGB1 subunit is trapped in the biosynthetic pathway. Incontrast, when FLAG-tagged rhodopsin is overexpressed in Rds-/- rods it traffics properly to outer segment ectosomes (Figure 2B, (19)). We posit that without proper exit from thebiosynthetic pathway, the endogenous CNGB1 protein is rapidly degraded to undetectablelevels, which we circumvent through overexpression. These data suggest the localization defect of CNGB1 in Rds-/- rods is in the trafficking of CNGB1. " This in my view is an over- interpretation of limited data. The statement implies that rhodopsin and CNGB1 qualitatively differ in their fate but I would argue that both proteins are heavily degraded intracellularly except more of rhodopsin escaped to the "OS" and shows up in IHC. In many rhodopsin mutant transgenic mice, mutant rhodopsin appeared in OS even though intracellular degradation (gumming up the system) is a major factor in the disease process. The claim "rhodopsin trafficked properly to outer segment ectosomes" is not grounded in solid data.

      A further minor comment is that the scope of the study appear limited, with no attempted experiments on how these proteins might interact to effect facilitation of trafficking.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: In this manuscript, the authors examine how peripherin-2 (PRPH2) contributes to the localization of CNGβ1 within rod outer segment structures. PRPH2 and its homolog ROM1 are structural components of rod discs and are required for disc morphogenesis. In the absence of PRPH2, rod outer segments do not form, and various outer segment materials accumulate and are released as cilia-derived ectosomes. PRPH2 is thought to be transported through an unconventional secretory pathway, whereas cGMP-gated channels follow a conventional trafficking route. Although these components reach the outer segment through distinct pathways, PRPH2 is necessary for the proper delivery of CNGB1, a subunit of the cGMP-gated channel, to its correct destination.

      It was previously reported that a small fraction of PRPH2 reaches the outer segments through the conventional pathway when it forms a complex with Rom1 in mouse photoreceptors. Using Rom1 KO mice, the authors show that this conventionally trafficked PRPH2 fraction is not required for CNGB1 transport to the outer segment. Using various chimeric constructs, the authors verified that tetraspanin core of PRPH2, delivered to the OS, is sufficient to promote OS localization of CNGB1. Ct and Nt cytoplasmic regions of PRPH2 are dispensable for the role. Overall, the majority of the experiments are well-executed with statistical rigor, written in a way that others can reproduce, and support the major conclusion indicated in the title, "PRPH2 is essential for OS localization of CNGB1".

      Major comments: I believe that the majority of the conclusions are well-supported in this manuscript. Below, I am listing the major points that may need additional experiments or clarifications:

      1) CNGA1 subunit is transported to and enriched within ciliary exosomes or the outer segment in PRPH2 deficient mice (Figure 1). The reduced levels of CNGA1 and CNGB1 in rds-/- mice suggest limited stability of these proteins. Their diminished abundance is also influenced by decreased mRNA expression of the corresponding genes. These findings imply that CNGB1 may not be essential for outer segment delivery of cGMP-gated channels if CNGA1 alone contains adequate targeting information. Related to these points, it is unclear whether CNGB1 exhibits a trafficking defect or encounters other problems before leaving the endoplasmic reticulum. Such problems may involve deficiencies in folding, holo-channel assembly, or related quality control processes.

      2) CNGB1 overexpression in rds-/- mice does not result in outer segment localization of CNGB1 channels (Figure 2A). These findings do not clarify whether CNGB1 successfully transits through the Golgi apparatus or associates properly with CNGA1 subunits. Elevating expression levels alone would not compensate for problems in folding or assembly.

      3) Claims related to Figure 6 (P45 rds-/-) need further evidence. It remains uncertain whether CNGA1 and CNGB1 are delivered to lamellar ciliary membranes or to a distinct plasma membrane compartment comparable to that observed in wild type rod outer segments, or whether they accumulate in ciliary ectosomes. Those lamellar structures could be a part of cone outer segments. The observed GARP signal may originate solely from soluble GARP proteins. It is also unclear if CNGA1 and ROM1 colocalize in P45 rds-/- mice. Clarifying these points would strengthen the conclusion that lamellar formation, rather than specific function of PRPH2, is sufficient for CNGB1 delivery to the cilium or outer segment plasma membrane.

      Below are minor comments:

      1. The study does not establish whether a direct interaction between PRPH2 and CNGB1 is required for CNGB1 delivery to rod outer segments. Prior work by the senior author (ref 13) suggests that this interaction is not essential, since the PRPH2 binding site within the GARP domain is distinct from outer segment transport signal of CNGB1. Including a discussion of the PRPH2-GARP (or CNGB1) interaction and its relevance to CNGB1 trafficking would help readers interpret the findings more fully.
      2. The authors propose that the ROM1 core is sufficient for outer segment delivery of CNGB1 based on experiments with chimeric constructs. However, in Figure 1, ROM1 is present in the outer segments (or ciliary ectosomes) of rds-/- mice even though CNGB1 is not delivered to these structures.
      3. Line 80: "Theouter" A space shall be inserted between "The" and "outer".

      Referee cross-commenting

      Both reviewer #2 and reviewer #3 express views that align with mine. They clearly described the study's limitations, and their comments are highly valuable.

      Significance

      Prior studies showed that CNGB1 is not present in cilia-derived ectosomes of rds-/- mice, indicating that PRPH2 is necessary for ciliary or outer segment localization of CNGB1 in rods. Building on these earlier findings, I consider this study significant for the following reasons:

      1) Using detailed analysis of different PRPH2 domains and chimeric constructs, it clarifies that PRPH2 core region, delivered to OSs, is essential and sufficient for OS localization of CNGB1.

      2) PRPH2 and CNGB1 are thought to travel through different post-ER transport routes, with one pathway bypassing Golgi regions and the other passing through them. This study shows that CNGB1 depends on PRPH2, which suggests that these two routes may converge or interact at later stages and opens new directions for future investigation.

      3) The study is relevant to basic scientists and biologists investigating how membrane structures acquire specialized functions in neurons, and its implications extend beyond photoreceptor biology.

      Limitation of the study:

      I believe that clarifying these points will make the manuscript more significant.

      1) Is it not clear, as mentioned above, how PRPH2 contributes to the delivery of CNGB1 to the OSs in the different secretory pathways.

      2) The prior study using a fluorescence complementation approach (Ritter et al, 2011) suggests that PRPH2 and CNGB1 can associate within rod ISs, likely before their delivery to OSs. However, it remains unclear whether this interaction supports the potential cotransport of CNGB1 and PRPH2 or whether the authors view these proteins as being transported independently.

      3) At the end of the result section (Figure 6, rds-/- P45), the authors suggest that lamellar formation (evaginations?) is required for CNGB1 transport. However, CNGB1 is normally not seen in evaginations or lamellar structures, and thus the assumption is not consistent with prior findings.

      Overall, the manuscript is insightful and has the potential to advance our field and related disciplines.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The study by Amal et al. investigates how signaling cues regulate epithelial permeability using Drosophila oogenesis as a model system. During mid-oogenesis, a process known as patency occurs, in which tricellular junctions within the follicular epithelium transiently open, allowing yolk proteins to be transported from the hemolymph to the oocyte. The authors demonstrate that the spatial pattern of patency along the anterior-posterior axis of the egg chamber is inversely correlated with the activity gradient of TGF-β signaling. They further show that TGF-β signaling inhibits vertex opening and influences both actomyosin contractility and DE-cadherin levels. Importantly, although DE-cadherin is required for the TGF-β-dependent suppression of vertex opening, elevated actomyosin contractility itself does not appear to be required for this effect. Overall, this is a well-executed study that links a tissue patterning signal to the regulation of epithelial permeability. The experiments are clearly presented, and the quantification and statistical analyses are rigorous. I nevertheless have several points that should be addressed, either through additional experiments or through further discussion in the manuscript.

      Main Points

      1. Suppressing the effect of activated Tkv (TkvQD) by mad depletion is indeed good yet indirect evidence for the involvement of canonical (Mad-dependent) TGF-ß signaling. I believe a more direct way to reach this conclusion would be the generation of anterior mad loss of function clones which should mimic the tkv8 phenotypes.
      2. On a more general note, most of the results of the paper are based on the hyperactivation of the pathway using TkvQD overexpression. I find this limiting for two reasons: First, the levels of TGF-ß signaling are abnormally high under these conditions. In this context, the interpretation of the contribution of TGF-ß induced MyoII and MyoII activity is unclear. The authors find that TGF-ß signaling activates MyoII activity, however inhibiting actomyosin contractility by various means did not restore vertex opening. This is however at levels of Tkv activity that are far beyond normal (TkvQD). At the same time, the same manipulations are sufficient to open vertices in cells that experience peak, endogenous levels of Tkv activity (anterior cells). Does endogenous Tkv signaling induce MyoII, MyoII activity, Rho1 in anterior levels? Addressing this in tkv8 mosaics would be helpful. I can imaging that, unlike Cadherin which seems to be epistatic to TkvQQ, it is a very difficult to exclude a contribution of TGF-ß mediated actomyosin contractility and there is probably not a good experiment to address this. However, I do not agree with the statement of line 174 "Although.... MyoII activity is dispensable for TGF-ß -mediated inhibition of vortex opening..." I think more appropriate would be to state that MyoII is dispensable for the abnormally/experimentally high TGF-ß signaling-mediated inhibition of vortex opening...". The explanation would be that under these conditions the exceptionally high TGF-ß signaling bypasses the need for MyoII (maybe through exceptionally high adhesion). This is apparently not the case at physiological levels of TGF-ß signaling at anterior cells. Second, high levels of TkvQD, a protein that has been found to localize at junction in other systems, might have secondary effects in vertex opening for example by affecting their structural integrity or even by affecting endocytosis.
      3. The effects of clonal manipulation of TGF-ß signaling within the clones are clear and solid. Although this would not affect the statements of this paper, it would be good if the authors could comment on the effects at clone boundaries. What happens to "hybrid" TCJ when wild-type cells (at the respective position and patency status) meet a clone with elevated or reduced TGF-ß signaling?
      4. From a TGF-ß signaling-centric point of view: In this and other tissues, most of the TGF-ß signaling effects are mediated through the transcriptional repressor Brinker. The pattern of Brk expression is at the patency stage inverse to the pMad/ TGF-ß signaling activity (pMad represses brk transcription) and would in principle be identical in its graded profile with the pattern of vertex opening. Did the authors tried to manipulate levels of Brk? Is it possible to restore tkv8 phenotypes by simultaneously depleting brk?

      Minor points

      • Other than stated, not all egg chambers seem to be at stage 10 A in Fig. 1. Are the eggs shown in C older ?
      • The box in 2A is very hard to see
      • It is hard to correlate the dad::GFP-nls staining of 2A with the intensity profile of 2B. Is the quantification really at the sub-apical region as stated in the legend?

      Significance

      The findings of this study are highly significant and likely to be of broad interest, as they establish a strong link between a signaling pathway (TGF-β signaling), best known for its role in gene expression and tissue patterning, and a highly dynamic cellular process-the remodeling of epithelial junctions that regulates epithelial permeability. While the involvement of TGF-β signaling in this process is not entirely new (see Row et al., iScience, 2021), the present study provides a more detailed analysis and offers a molecular explanation linking TGF-β signaling to epithelial junction patency.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Amal et al investigate how canonical TGF-β signaling regulates tricellular junction (TCJ) remodeling during follicular patency in the Drosophila ovarian follicular epithelium. Using genetic mosaics, quantitative imaging, and perturbations of signaling and cytoskeletal pathways, the authors show that TGF-β signaling suppresses patency in a cell-autonomous manner.

      The authors convincingly show that TGF-β signaling prevents remodeling of tricellular junctions (TCJs) during patency. The figures and quantitative analyses are of an excellent standard, and I commend the authors on the clarity of their data presentation. Previous work from this laboratory demonstrated that patency is regulated by actomyosin activity. In the present study, the authors show that although TGF-β signaling increases actomyosin contractility, perturbation of downstream effectors of actomyosin contractility does not rescue the patency defect caused by constitutively active TGF-β signaling. This is a surprising and interesting result.

      The authors then show that TGF-β regulates patency through effects on E-Cadherin. However, the mechanism by which TGF-β signaling regulates E-Cad remains somewhat unclear. Although the authors show that E-Cad levels appear elevated when TGF-β signaling is activated, E-Cad overexpression alone does not affect patency. The authors also test whether the effect reflects a broader change in adhesion proteins by examining Fas2 and N-Cad, which appear unchanged, suggesting that the effect is specific to E-Cad.

      The introduction and discussion are scholarly and cite the appropriate literature. Overall, the manuscript is rigorous, clearly presented, and ready for publication.

      The experimental approaches are described in sufficient detail to allow reproduction, and the statistical analysis and quantification appear appropriate. The experiments appear adequately replicated, and the presentation of the quantitative data is clear.

      Major comments:

      N numbers for experiments Cells/Egg chambers appear to be missing. Please add these details.

      Single images showing no change in the localization of Fas2 and NCad found in supplementary are not convincing. The authors should quantify this data.

      Minor comments:

      Figure 2A: Instead of sagittal sections through egg chambers, it may be more informative to show the imaging plane that highlights the surrounding follicular epithelium, which would better illustrate the spatial organization of the follicle cells.

      Lines 73-85: Consider referring the reader to Figure 1A earlier in the text to help orient the reader to the architecture of the egg chamber.

      It would also be helpful to include the abbreviation CPFC in the schematic in Figure 1A to make the terminology consistent with the text.

      Significance

      This is an exceptionally well-written and well-presented manuscript. The story presented is logical and the work is carefully executed with top-level figures and quantification. The manuscript is logically organized and controls and statistical tests are appropriate. The authors provide convincing evidence through careful genetic manipulations that TGF-β signaling suppresses vertex opening primarily by reinforcing E-Cad-dependent adhesion rather than through actomyosin contractility.

      A particular strength of the study is the clear dissection of two potential downstream pathways of TGF-β signaling regulated patency- actomyosin contractility and E-Cad-mediated adhesion - and the demonstration that the suppression of patency depends primarily on E-Cad function. The manuscript represents a conceptual advance over the lab's previous work by demonstrating that patency is regulated by an upstream signaling pathway. Whereas earlier studies from this group established the cell biological mechanism of patency, this work shows that TGF-β signaling acts as a regulatory input controlling this process.

      The main limitation of the study is that the downstream molecular mechanism linking TGF-β signaling to stabilization of E-Cad at tricellular vertices remains only partially defined. While the authors show that TGF-β signaling increases E-Cad levels and promotes its retention at vertices many questions remain unclear as to how this is achieved. The data implicate p120-catenin as a possible contributor, but it does not appear to be required, leaving the mechanistic basis of E-Cad stabilization incompletely resolved.

      The primary advance of the study is conceptual and mechanistic, showing that morphogen signaling can control TCJ integrity by stabilizing cadherin-based adhesion independently of actomyosin contractility. The work therefore advances our understanding of of how epithelial junction remodeling is regulated during development in the common model system of the Drosophila ovary.

      In my opinion, the manuscript is exceptionally well presented and appropriate for publication essentially as-is.

      The primary audience for this work will be researchers studying epithelial biology, morphogenesis and developmental cell biology, primarily those working in Drosophila. The manuscript will also be of interest to the broader cell and developmental biology community because it provides evidence for how signaling pathways and morphogen patterning regulates epithelial architecture and barrier function.

      My expertise lies in epithelial morphogenesis, cell-cell adhesion, junction dynamics, and developmental cell biology and I use the Drosophila ovary as a model system. I reviewed the previous paper from this lab that went to Current Biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors explore how TGF signaling inhibits patency in the follicular epithelia of the Drosophila ovary. In this setting, patency is the opening of the tricellular junctions within the follicular epithelium (FE) covering the ovary to allow the transfer of yolk proteins into the underlying ovary. The authors first demonstrate that there is an inverse correlation between levels of Dpp signaling (based on a Dad-GFP reporter) to both the vertex (tricellular junction) opening size and the "circularity" of the FE cells, with Dpp signaling being highest at the anterior end. They show that activated Dpp signaling (Dad-GFP signal) is highest in the most anterior FE as are the highest levels of F-actin and MyoII (mCherry reporter) and that ectopic activation of Dpp signaling (using an activated receptor) in posterior FE cells is sufficient to induce higher levels of RhoI, junctional F-actin and MyoII at the tricellular junctions. However, neither knockdown of RhoI nor expression of a dominant negative form of MyoII have any impact on whether Dpp signaling blocks patency. Thus, although activated by Dpp signaling, MyoII activation is not required for Dpp to block patency. They show that Ecad is not present in the patent tricellular junctions, although it is present earlier and that Dpp signaling is required for enhanced levels of Ecad in anterior FEs and is sufficient to induce Ecad transcription (based on a lacZ reporter in the Ecad gene) and to increase Ecad protein levels. They show that Ecad is required to block patency regardless of Dpp signaling. They show that MyoII activity is not required for Dpp enhancement of Ecad protein levels. They show that Dpp signaling can increase p120cat levels and that p120ctn can increase Ecad levels. However, knockdown of P120cat has no effect on patency in either WT or TKV activated FEs.

      The experiments are nicely down and illustrated, and the paper is well written.

      I think the authors are overstating what they can conclude in both the title and abstract.

      Significance

      I think some of the conclusions cannot be made with the data in hand. Overall, the authors have shown that Dpp signaling enhances levels of several proteins that would be thought to block patency (Rho1, MyoII, F-Actin, p120cat, and Ecad (transcriptionally). They have shown that, except for Ecad, knockdown of most of these do not affect Dpp-dependent patency. However, showing that patency is severely enhanced in both WT and Dpp-activated cells with loss of Ecad is not sufficient evidence that Dpp signaling works through Ecad. Taking away Ecad is going to cause near or complete loss of AJs - thus, it is no surprise that patency is enormously increased everywhere. Importantly, overexpression of Ecad (or of p120cat, which increases Ecad levels) did not block patency. Indeed, it seems like the only manipulation that mimics the effects of Dpp activation on patency is blocking endocytosis - so this seems a likely mechanism (it could also explain the higher levels of p120cat and/or Ecad at junctions). Overall, I agree that the authors can conclude that the Rho1 activation of MyoII observed downstream of Dpp signaling does not impact repression of patency. However, since overexpression of Ecad had no impact on patency, I think they can only conclude that the Ecad expression is enhanced downstream Dpp signaling but that this increase in Ecad expression is insufficient to block patency on its own. Thus, the the title and abstract should be modified to more accurately reflect the conclusions that can be made.

      Minor suggestions

      Figure 1G. Please clearly indicate where the clone of tkv8 null cells is located within the follicular epithelium.

      In my opinion, both supplemental figures should be included in the main body of the paper. They make important points relevant to the conclusion. Figure S1 should be included as part of Figure 3. Figure S2 should be included a stand-alone figure, as there are currently only six figures in the manuscript and the panel in that figure showing that blocking endocytosis blocks patency is an interesting and potentially relevant finding.

      In its current state, the paper is most appropriate for a specialized reader in the field of Drosophila oogenesis. If the authors were to follow up on a potential link between Dpp signaling and endocytosis and find such a link, then I think it would be of more general interest.

      The time estimate below is based on not doing major experiments. If the authors were to follow up on the observation regarding endocytosis, it would be more in the 6 month range.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      To improve the quality of this study, consider implementing strategic improvements that might enhance the significance of your intriguing findings. The results showing that pyruvate can circumvent IFT88 reliance represent a substantial change in our understanding of ciliary assembly; however, the paper would benefit from a more thorough examination of the mechanisms behind this physical development. Since IFT88 is usually seen as the main "elevator" for ciliary parts, figuring out if other proteins like IFT81 or IFT52 are being reused or increased when pyruvate is present will provide a clearer understanding of how this bypass works.

      While you have successfully outlined the signaling pathways linked to tubulin acetylation and detyrosination, the connection between histone acetylation and MAPK signaling poses a complex question. Figuring out if EP300-mediated acetylation starts the MAPK cascade or works as a feedback loop-possibly through specific inhibition tests-would improve the clarity needed for scientific publications. Furthermore, given the pronounced impact shown in colonic fibroblasts, it would be prudent to investigate if this pyruvate-induced ciliogenesis is a ubiquitous biological phenomenon by doing the same experiment in a conventional model, such as RPE1 cells. This would assist in ascertaining if you have discovered a fundamental metabolic principle of biology or a specific adaptation of the gastrointestinal system.

      Concerning the findings on tubulin detyrosination, there exists a little discrepancy: VASH inhibition influences ciliary length at elevated pyruvate concentrations, but the Western blots do not clearly show the predominant alterations in detyrosination at the same concentrations. To address this discrepancy, one may employ high-resolution immunofluorescence to assess detyrosination selectively within the ciliary axoneme, rather than examining the entire cell. This would likely disclose the localized alterations indicated by your functional data. In the discussion about the DSS-induced colitis model, understanding how pyruvate works as both an energy source for colon cells and an antioxidant, along with its effects on cilia, would strengthen the case for its potential as a treatment. Improving these detailed understandings and clarifying which cell types are involved will elevate the paper from a niche discovery to an important addition to cell biology and mucosal immunology.

      Prospective other Improvement Areas Analyzing the MAPK/Histone Acetylation Feedback Loop:

      1.The findings indicate that histone acetylation and MAPK signaling both play a role in pyruvate-induced ciliogenesis. Comment: As said, it is still unknown if histone acetylation triggers MAPK, or the other way around, or whether they create a feedback loop. Incorporating particular tests, such as assessing MAPK activity while blocking EP300 and vice versa, might elucidate this hierarchy. 2.The article suggests that pyruvate's capability to bypass IFT88 may be exclusive to colonic fibroblasts or certain cell types. Comments: Evaluating this effect in a widely utilized ciliary model such as RPE1 or IMCD3 cells will substantially enhance the paper's significance by ascertaining if this is a universal or specialized biological process. +1 3. The work demonstrates that PC forms in the absence of IFT88 when pyruvate is available, although it fails to elucidate the mechanism of structure assembly without this essential transport protein. Comment: Examining if additional IFT proteins (such as IFT81 or IFT52) or alternative transport pathways are elevated or repurposed in the presence of pyruvate will significantly enhance the understanding of the "bypass" discovery. 4.The authors noted that VASH inhibition (LV80) decreased PC length at both 2mM and 10mM pyruvate, however bulk detyrosination alterations were only observable at 2mM. Comment: Although the authors explain this to the "higher sensitivity" of PC length measures, including high-resolution immunofluorescence quantification of the ciliary axoneme, rather than overall cell levels, might furnish the necessary visual proof for detyrosination alterations at 10mM. 5.The authors appropriately recognize that pyruvate may have effects on colitis that are independent of PC. Comment: To give a more comprehensive picture of pyruvate's therapeutic advantages, it would be helpful to broaden the interaction to briefly clarify how its ciliary effects could work in conjunction with its recognized functions in antioxidant defense or epithelial energy metabolism.

      Significance

      The study identifies pyruvate as a distinctive environmental regulator of ciliary length and ciliogenesis in colonic fibroblasts (CF).

      A major discovery is that pyruvate can help produce primary cilia in cells lacking IFT88, challenging the earlier belief that IFT88 is essential for making primary cilia.

      The authors clearly explain the signaling pathways, showing that pyruvate affects the amount of primary cilia by changing tubulin acetylation (which involves acetyl-CoA and ATAT1) and influences the length of primary cilia by altering tubulin

      Strong evidence from experiments with Col6a1cre-Ift88flx/flx mice in a DSS-induced colitis model strongly backs the importance of these findings for both biology and potential treatments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The present manuscript demonstrates an important role for pyruvate in ciliogenesis and in the regulation of ciliary length/MT modifications. Previous work by the authors' group showed that primary cilia in colon fibroblasts are critical in experimental colitis: loss of cilia increases susceptibility to colitis. In the current study, the authors not only propose potential mechanisms by which pyruvate affects ciliary length and ciliogenesis, but also show that pyruvate treatment positively impacts cilia number and/or length and ameliorates experimentally induced colitis. Overall, I consider this study highly timely and very carefully executed. The quality of the data is excellent, and the findings will be highly relevant to the cilia research community. I have only a few minor points that could further strengthen the manuscript.

      Minor Comments:

      1. To better assess the generality of the findings beyond colonic fibroblasts and to determine whether the pyruvate axis also plays a role in other cell types, the authors could consider performing analogous experiments in widely used cilia model cell lines, e.g. mIMCD, MCKD, RPE1, or similar. It would also be very interesting to evaluate to what extent differences in commonly used media (e.g., RPMI versus DMEM or others) contribute to differences in cilia number and length. Even if beyond the scope of the current study, the present work will likely inspire such investigations in many laboratories.
      2. It remains unclear how the acetylation level is calculated/determined (Fig. 2B/D). The same applies to detyrosination. Please clarify the quantification method (e.g., normalization strategy, region of interest, background subtraction, and whether the readout is intensity per cilium, per cell, or population-averaged).
      3. Some inhibitors are used at relatively high concentrations compared to their EC50 values (e.g., UK-5099 at 50 µM; LV80 at 100 µM; C646 at 25 µM). At these doses, specificity may become an issue and should be validated experimentally or discussed as a limitation. For example, C646 has been reported to inhibit HDACs at higher concentrations.
      4. Can the authors exclude that β-mercaptoethanol (β-ME) in the medium interferes with the effect of pyruvate? Would it be feasible to culture the colonic fibroblasts without β-ME, at least for the treatment window, to rule out confounding effects?
      5. Are the pictures in Fig. 6C derived taken from in vivo tissue or cultured cells? Quantification would be helpful. Small typo in legend: "10μM" should be "10 µm".
      6. Excluding physiological changes due to sodium pyruvate or osmolarity-matched NaCl in vivo based solely on body weight curves may not be sufficient. Potential effects of the high-salt regimen should be discussed as a limitation, and the difference in the anion component should be discussed. For instance, in addition to renal effects such as polyuria, polydipsia, changes in blood pressure etc. eight weeks of 0.2 M salt in the drinking water could plausibly affect the immune system and, thus, indirectly influence the phenotype.

      Significance

      This study is highly relevant to the cilia-/ciliopathy field. It demonstrates that pyruvate - or, more broadly, the composition of the culture medium - can substantially influence ciliogenesis and ciliary length. Whether the observed effects are specific to colonic fibroblasts or extend to other ciliated cell types remains unclear. Nevertheless, this is a genuinely inspiring piece of work that will likely stimulate follow-up studies across the community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper entitled "Pyruvate promotes ciliogenesis bypassing IFT88 dependency and attenuates DSS-induced colitis" by Maya Sarieddine, Ilaria Cicalini, Damiana Pieragostino, Federica Dimarco, Matthieu Lacroix, Krzysztof Rogowski, Valerie Pinet and Michael Hahne, describes the impact of pyruvate concentration on the number and length of primary cilia in murine colonic fibroblasts. The authors further demonstrate that pyruvate promotes both primary cilium acetylation and the expression of acetyltransferases. These findings appear to correlate with pyruvate's regulation of genes essential for cilia assembly, as well as activation of the MAPK signaling pathway, as revealed by RNA-seq and proteomic analyses. Additionally, pyruvate modulates tubulin detyrosination at primary cilia via MAPK-dependent mechanisms. Notably, pyruvate rescues primary cilium assembly in mice deficient for IFT88, a key protein for cilia assembly and maintenance, and reduces susceptibility to experimentally induced colitis in these mice. These results are interesting and are interesting and open opportunities to explore eventual treatments to colitis. However, the manuscript requires a thorough review, as many sections lack rigor, and several conclusions are drawn based on indirect evidence. The manuscript requires that the authors address the following concerns before publication:

      Major

      • In the different sections, through the text, the authors should clearly state that immunofluorescence microscopy was used to assess the number and length of primary cilia, as well as the intensity of the various markers (including the specific antibodies used). This clarification will allow readers to properly interpret the graphical data.
      • Could the progressive increase in pyruvate and sodium acetate concentrations induce osmotic stress in the cells? If so, the inclusion of an osmotic stress control would be warranted.
      • Regarding the ATAT1 inhibitor data, it is unclear why tubulin acetylation in primary cilia (PC) was not quantified. Since microtubule acetylation is likely affected, this could also impact the proportion of ciliated cells. The authors should address and discuss this point.
      • Regarding the use of MAPK signaling inhibitors, the authors show that only primary cilium length is dependent on this pathway. Inhibition of the pathway does not appear to affect, either the number of primary cilia, or their acetylation status. Therefore, the authors should clarify how acetylation is maintained despite the reduction in primary cilium length observed in Figures 4A and 4C. -It would be nice if authors have investigated if pyruvate increases PC length in control mouse as occurs in cells. In fact, in Figure 6F they only analyzed the PC length in Ift88-deficient mice. The same occurs in experiments reported in figure 7 when they establish a relationship between pyruvate role in cilia and induced colitis.

      Regarding the discussion

      Discussion has a lack of rigor:

      1-pg 15 " This concurs with reports showing that increased tubulin acetylation, catalyzed by ATAT1, can promote cilium assembly 19,20"- Reference 19- The authors describe that ATAT1-depleted cells, following siRNA treatment, display a similar percentage of ciliated cells after 24 h of serum removal compared to control cells, despite the cilia being non-acetylated. This indicates that tubulin acetylation does not promote cilia assembly per se, but rather enhances the efficiency of the biogenesis process, as cilia formation occurs more slowly in its absence. 2-pg 15- "This model aligns with our observation that elevated tubulin acetylation enhances the proportion of ciliated CFs." This is an indirect conclusion because in experiments where aTAT1 is inhibited the authors did not measure the intensity of cilia tubulin acetylation. Additionally, when MAPK signaling is inhibited, they observed that PC acetylation decreases but % of ciliated cells is not affected.

      3-pg16- "Nonetheless, our finding that histone acetylation contributes to pyruvate-driven ciliogenesis is in agreement with previous reports indicating that such modification can modulate transcriptional programs involved in PC formation. For example, depletion of the histone acetyltransferase KAT2B (lysine acetyltransferase 2B) in mouse embryonic fibroblasts impaired ciliogenesis 27." This sentence lacks rigor because the authors forgot that many of the acetyl-transferases have distinct substrates. For example, in the cited paper (27) it is shown that KAT2B directly acetylates tubulin affecting primary cilia assembly. The same critic can be extended to sentence "This concurs with additional reports linking histone acetylation with cilia formation 28,29."

      4-The authors should clarify the consistence between their observations and those described in paper 33- "that Vash deletion affects anterograde IFT train movement, leading to ciliary elongation 33. Consistent with these findings, we observed that treatment of ciliated CF cells with LV80, a potent VASH inhibitor34 did not alter the proportion of ciliated cells but did negatively affect PC length." They observed that Vash inhibition leads to smaller cilia, but in Clamydomonas Vash deletion causes cilia elongation.

      Minor

      Figure 1

      C-The table below Graphic C should be clarified, as the +/- symbols are used simultaneously to indicate both addition and presence/absence, which may cause confusion. In the legend -" after 24h starvation in RPMI, RPMI complemented with either Glucose (Glu, 0.14mM)" - Glucose should be 14 mM??? F- The graphic is not mentioned in the text, and the information overlaps with that of D and E, therefore is redundant.

      Significance

      The results described are interesting and open opportunities to explore eventual treatments to colitis and its relationship with primary cilia. However, the manuscript needs to be profoundly reviewed since in many sites lacks rigor.

      The authors state many conclusions based on indirect evidence.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Point-by-point description of the revisions


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors investigated the effect of nutritional stress (HSD and HFD) on cardiac function by assessing multiple parameters on adult flies. They next identified the adaptive transcriptomic changes in the heart in response to these nutritional stresses and screened for their roles under ND, HSD and HFD. They identified fit gene, encoding a satiety gene, expressed by cardiomyocytes and pericardial cells.

      I think the characterisation is thorough; however, the conclusion is not well supported by the evidence. My main concern is that in many graphs, the difference between control and experiment is subtle, and, secondly, the authors showed some conflicting results (e.g. one RNAi showed a reduction of one parameter, however, the other independent RNAi did not. In this case, I believe the authors shouldn't conclude that the RNAi is functionally required, since the RNAis are meant to confirm each other.

      First, we thank the reviewer for her/his constructive comments and suggestions. We obtained new results presented in the last version of the manuscript, which consistently support our conclusions and improve the study.

      High-Sugar and High-Fat Diets modified cardiac performance

      They assessed how HSD and HFD affect Adult fly heart performance. Instead of performing 3 weeks of dietary manipulation as has been done before by other groups, they put adult flies on HSD for 7 days and HFD for only 3 days.

      We would like to clarify the nutritional challenge used. Cardiac function of flies was assessed at 10 days after emergence. Flies were put either in ND or HSD during these 10 days (ND and HSD conditions), or in ND for 7 days then transferred on HFD for 3days (HFD condition). Finally, all the females spent 10 days in a diet before being imaged or before hearts/brains dissection.

      They found: HSD increases HP and SI, and reduces AI. The difference is too small and not consistent between different control lines. Also, when the difference is this small, p value does not tell much!

      They probably intentionally induced a milder effect so that they could assess adaptive transcriptomic changes to this nutritional stress. In Fig. 1D SI is increased under HSD with control-KK, In Fig. S1C, SI is not changed under HSD with control-GD and control-GFP. Instead, DI is increased, which is also opposite to what they showed in Fig. 1 C. HFD increased ESD, EDD, SV, FS and CO.(Hypertrophy). This is not true with control-GD and control-GFP lines though! Comments: They have assessed many parameters in live animals with many different control lines, which is thorough. However, it is hard to draw any conclusions based on these conflicting results. Are these effect KK line specific?

      Globally, we agree with the reviewer that the results, presented in the first version of the manuscript, for the control lines were difficult to understand due to the inconsistency of the phenotypes. In this revised version, we performed new results in Figure 1 and __S1 __regarding the effect of 10 days HSD and 3 days HFD exposure vs ND.

      105 to 187 flies were imaged for the 3 control conditions, in the 3 diets concomitantly, to increase the power of our analysis. As mentioned in the main text (page 3, line 30-35; page 4, line 1-5), both diets deteriorate cardiac function with HFD leading to consistent phenotypes on heart diameters and rhythm and HSD milder effects. Indeed, the 3 control lines were uniformly affected by HFD after 3 days exposure, whereas 10 days in HSD was not sufficient to quantify a significant effect despite consistent the trends on several phenotypes (EDD, ESD, DI, AI and CO. These results revealed a different sensitivity of the cardiac performance when exposed to sugar and fat.

      As described in the text, we were nevertheless confident that our approach would be good to investigate the early molecular dysregulations induced by sugar. This was the purpose of our analysis, presented in the follow-up of the manuscript.

      Regarding the small differences measured in the phenotypes in HSD and HFD compared to ND, we would like to outline that the values presented are normalized values to control. Normalization is done for every independent experiment, performed at different dates, and permits the graphical representation of pooled values. Statistical analysis is performed using non-parametric Kruskal-Wallis test accordingly. Values are presented with the X axis cutting the Y axis at 0, this graphical representation also contributes to flattening the differences and p-values indicate their significance.

      Analysis of the fly cardiac transcriptome upon nutritional stress

      RNA seq to detect differentially expressed genes under HSD and HFD vs ND. Most DE genes are downregulated, which prompts them to assess how the downregulation of these genes adapts the animals to this nutritional stress.

      High Sugar Diet downregulated 1c-metabolism and Leloir galactose pathways.

      In this revised manuscript, we first present RT-qPCR validating the downregulation of Gnmt, Sardh and Galk expressions in the heart of 10days old HSD-fed females compared to ND-fed ones (Figure S3A).

      We apologize for the confused explanations in the first version of the manuscript. We show new results in Figure 3 and __S3 __on the cardiac function of both Gnmt and Sardh, where following reviewer’s suggestion, both genes were knocked down in the heart in ND and Gnmt overexpressed in HSD. No available tools allowed us to test Sardh overexpression in HSD and we could not get some for Galk.

      GNMT is downregulated under HSD and HFD.

      In ND, GNMT knockdown increased ESD, EDD and CO. Sardh knockdown did the same? However, Sardh knockdown did not affect ESD significantly.

      We reanalyze our first data and added new ones, comparing only knockdown or overexpression to the corresponding controls performed in concomitant experiments. Results are now shown in Figure 3C-E; S3C-H. Knocking down Gnmt in the heart increased HP, EDD, ESD and CO, Sardh knockdown in ND resulted in milder phenotypes but inducing significant hypertrophy in ND as Gnmt does. In both cases, FS was not impacted.

      Both genes have been previously shown as beneficial to muscular function in time-restricted feeding context (Livelo et al., 2023, Nat.Comm.), illustrating that, even if both enzymes are involved in opposite reaction, their function has the same effect on organ/tissue function, as they did for heart diameters. The text corresponding to results and discussion were updated accordingly (pages 5, 11).

      The conclusion here is: GNMT knockdown induces hypertrophy, similar to the effect of HFD.

      In HSD, further knockdown of GNMT reduced (rescued) HP, suggesting downregulation of GNMT under HSD is adaptive. Should overexpress GNMT under HSD to see if this manipulation further increases HP, to claim GNMT downregulation is an adaptive change to high sugar stress.

      We thank the reviewer for her/his suggestion. We now used UAS-GnmtWT (from FlyORF) to assess the role of Gnmt on cardiac function in HSD.

      As shown in (Figure 3C-E; S3C,F), overexpressing Gnmt in the heart in HSD was sufficient to rescue some sugar induced phenotypes or to induce other dysfunctions, when compared to corresponding controls evaluated in the same experiments in ND and HSD. Notably, HP increase and CO decrease are rescued by Gnmt cardiac overexpression in HSD. Interestingly, the cardiac diastolic constriction induced by HSD is associated to increased FS and CO in this genotype in sugar diet. These new results strengthen the positive effect of Gnmt on cardiac function, improving it in HSD and preventing its deterioration in this diet.

      Of note, Gnmt overexpression in ND did not trigger cardiac dysfunctions (data not shown).

      The results and conclusions have been corrected.

      Interestingly, HSD itself tends to decrease AI, a further knockdown of GNMT further decreases AI. This indicates GNMT downregulation under HSD contributes to AI reduction. Together, GNMT downregulation under HSD prevents HP from going higher, while its downregulation causes AI going down.

      In the manscript, the authors claim that " Gnmt KD led reduced HP and AI, suggesting that it is able to counteract the effect of HSD observed in control flies on these phenotypes". This is not true according to the logic in Results section 1. As in section 1, the effect of HSD on AI is not significant, so the authors shouldn't say" HS tended to reduce AI".

      Our reanalyzes and new results showed no Gnmt impact on AI, so these Figure panels were removed.

      Why GNMT knockdown reduced FS under ND (Fig. S3C), while increasing FS under HSD (Fig. 3F)? If GNMT knockdown induces hypertrophy, I would expect it to increase FS.

      Gnmt overexpression did not affect cardiac diameters in HSD, but it nevertheless led to an increased contractile efficacy compared to HSD controls (Figure S3F).

      These new results strengthen the positive effect of Gnmt on cardiac function, preventing its deterioration in sugar diet. The text was modified accordingly.

      High Fat Diet modulated CD36-scavenger receptor and Glut8 orthologues

      In this revised manuscript, we present RT-qPCR validating the downregulation of Snmp1 expression and the slight upregulation of nebu’s in the heart of 10days old HFD-fed females compared to ND-fed ones (Figure S3B).

      HFD: Snmp1 gene is downregulated, however, both overexpression and knockdown of Snmp1 in ND induced some phenotypes.

      Indeed, as mentioned in the revised manuscript (page 6, lines 21-24), in heart of females fed ND, both Snmp1 knockdown (Snmp1KK) and overexpression (Snmp1WT) showed a reduction of EDD and ESD (Figure 3J; S3J) but FS is increased accordingly only in Snmp1KK.

      As notified in the text, both downregulation and overexpression of Snmp1 led to side-phenotypes (page 6, lines 24-28): Snmp1KK exhibited abdominal fat increase (Figure S3K) and ostial cells seemed clearly malformed in Snmp1WT (Figure 3M). This may explain why the heart shows the same type of functional impairment in both genotypes.

      We now discussed the hypothesis that these similar cardiac dysfunctions may result from Snmp1 being a regulator of organismal or cardiac lipid homeostasis. Indeed, increasing body fat content is deleterious as is increasing the import of fat in the cardiomyocytes. Finally, both affects cardiac cells’ health and functioning.

      HFD: nebu has a role in regulating cardiac function under ND.

      HSD and HFD revealed the secretory function of the heart

      They identified diet-regulated secreted proteins that are required for cardiac dysfunction.

      Cardiac Fit expression impacted Cardiac performance.

      The author used Hand-G4 to knock down Fit using KK and GD lines, KK line showed a reduction in HP (Fig. 5A), but not GD line (Fig. S5D). How did the author conclude that Fit is required for cardiac function? Also, with the positive data, the difference is too subtle.

      We apologize and agree that the contradictory or inconsistent results obtained with the two RNAi lines were confusing.

      For this revised version, we first assess the effect of the two RNAi lines (KK and GD) on fit expression in the dissected hearts. RT-qPCR for KK line is presented in Figure S5A. GD line did not show a significant reduction of fit expression when expressed in the heart with Hand>, which can explain the former results presented (not shown but data are available). So, we removed all results obtained with the GD line in this revised version.

      To confirm the KK effects, we used fit KO allele (fit81) and truncated version of fit, without its signal peptide (fitDeltaSP), which has a dominant negative effect, both previously published and validated (Sun et al. 2017, Nat. Comm.). These two mutants were used to investigate the cardiac function of fit in our analysis. Results presented in Figure 5 and S5 confirm the phenotypes already observed with the KK line when expressed with Hand> in the heart and with Lsp2> in the fat body.

      Our results validate the effect of fit decrease on rhythmicity and contractility, the reverse effects being consistently observed in fit overexpression. In conclusion, we are confident in the requirement of Fit in the regulation of cardiac performance.

      These new data are now included in the results section “Cardiac Fit expression impacted Cardiac performance” (pages 8-9)

      **Referee cross-commenting**

      i agree with the experiments proposed by reviewer 2.

      Reviewer #1 (Significance (Required)):

      The study aims to examine the effect of diet on cardiac function.

      The strength is that a lot of characterisations were done.

      the weakness is the functional data regarding fit could not be validated in two different RNAis, thus the evidence is not strong to support the conclusions.

      We again would like to thank the reviewer for her/his remarks and suggestions. She/He highlights the weakness of the first analysis and this was an important and constructive feedbacks for us. We strengthened our results by increasing samples, reanalyzing data and performing mandatory new experiments that are now included in this revised version.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Khamvongsa-Charbonnier et al. reported a RNA-seq analysis and RNA interference screening on high-fat and high-sugar-induced cardiomyopathy in Drosophila. The authors uncovered novel genes in 1C-metabolism, galactose metabolism, CD36-scavenger receptor and glucose transporter, as adaptative factors of cardiac function under high-fat and high-sugar treatment. The authors also identified a satiety hormone, Fit, as a cardiokine to control food intake and , expressed by dilp5 secretion. In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases. I have a few concerns, as listed below.

      First, we would like to thank the reviewer for her/his comments and suggestions that deeply help us to improve the take-home messages of our manuscript. Following her/his recommendations and suggestions, we can now present a revised and stronger version of our manuscript.

      1. Quantitative RT-PCR is required to validate the expression patterns of candidate genes identified from the RNAseq analysis.

      RT-qPCR have been performed on hearts dissected from 10 days old females fed ND, HSD or HFD. Gnmt, Sardh and Galk validated downregulation are presented in Figure S3A, Snmp1 downregulation and nebu upregulation (trend but non-significant) in Figure S3B, fit downregulation in Figure S5A.

      The authors state that the dysregulated gene expression patterns reflect acute adaptation to HSD and HFD stresses. Most of the candidate genes in this study were downregulated upon HSD and HFD. However, it is recommended that overexpression of these genes, rather than knockdown, is needed to confirm whether the downregulation of these candidate genes upon stresses is an adaptative response.

      We agree with the reviewer and followed her/his recommendation when tools were accessible for our analysis.

      For example, HSD feeding induces the heart period. Knocking down Gnmt, specifically in the heart, under the HSD feeding changes can reduce the heart period. This evidence is insufficient to suggest the protective role of Gnmt under the HSD diet. Gnmt has already been downregulated under the HSD. Further knockdown of Gnmt, instead of returning the Gnmt expression to normal levels, to protect cardiac contractile performance complicates the model.

      We thank the reviewer for her/his suggestion. We used UAS-*GnmtWT * (from FlyORF) to perform these experiments.

      As shown in (Figure 3C-E; S3C,F), knocking down Gnmt in the heart increased HP, EDD, ESD and CO. In the same Figure panels and in Figure S3F, we showed that overexpressing Gnmt with Hand> in HSD was sufficient to rescue some sugar induced phenotypes or to induce some, when compared to corresponding controls evaluated in the same experiments in ND and HSD. Gnmt overexpression in ND did not trigger cardiac dysfunctions (data not shown).

      HP increase and CO decrease are rescued by Gnmt cardiac overexpression in HSD. Interestingly, the cardiac constriction induced by HSD is not rescued by Gnmt overexpression, but it is enough to increase FS and CO in sugar diet. These new results strengthen the positive effect of Gnmt on cardiac function, improving it in HSD and preventing its deterioration in this diet.

      Sardh knockdown in ND, resulted in milder phenotypes but induced significant hypertrophy in ND as Gnmt does. No available tools allowed us to test its overexpression in HSD.

      Nevertheless, as mentioned and discussed in the manuscript (page 5, line 27-30; page 11, lines 11-14), such protective role of muscular function and integrity has already been characterized in fly IFM in time-restricted feeding experiments for Gnmt and Sardh (Livelo et al., 2023, Nat.Comm.). Our experiments show that both genes encounter the same role in cardiac function upon nutritional stresses. The text was modified accordingly.

      The authors suggest that the effect of nebu on heart contractility is not dependent on diet. However, based on the result from Figure 3O-P, the HFD treatment blocks the effect of nebu knockdown on heart contractility. The authors need to further explain these results and modify their conclusions accordingly.

      We completely agree with the reviewer. We did not correctly analyze these results. We reanalyze our data, taking into account only the experiments of nebu knockdown that were performed in ND and in HFD concomitantly. Results are shown in Figure 3O-P; S3L-N.

      As mentioned in the manuscript (page 7, lines 3-8), nebu knockdown led to identical HP decrease in both diets but its constrictive effect (reduction of heart diameters) in ND is abrogated by fat diet.

      We modified the text accordingly in the results and discussion (page 7, lines 8-11; page 12, lines 7-12).

      It is a bit confusing that knockdown of fit using Hand-Gal4 induced food intake, but knockdown of fit using tin-Gal4 or Dot-Gal4 did not significantly induce food intake (Fig 6A). The author did not provide any explanation of these results. What is even more confusing is that overexpressing fit using Dot-Gal4 decreased food intake, but overexpressing fit using Hand-Gal4 or tin-Gal4 did not significantly decrease food intake (Fig 6B). Why was the strong food intake phenotype not observed using Hand-Gal4 in both experiments? These confusing results lead to a question, which cell type is responsible for the production of cardiokine, Fit?

      We apologize for the misleading results presented in the initial manuscript. We hope that our revised version will clarify Fit function regarding its remote impact.

      Concerning the requirement of Fit function and the cell types that produces Fit, the results we obtained when evaluating cardiac performance strongly suggest that both cardiomyocytes and pericardial cells are important and recapitulate the effect of Hand> (Figure 5A-C; S5G-H).

      In the case of food intake measurements, we now present results with newly performed food intake experiments for the Hand>fitWT (Figure 6D). They show a significant reduction of food intake in this condition, corroborating the results obtained with Dot>. We add a clarification in the manuscript for this point (page 10, lines 11-16).

      When testing the role of cardiac Fit in Dilp5 secretion, the authors subjected flies to starvation stress. However, the main focus of the present study is on HSD and HFD. The RNAseq analysis showed that Fit expression was downregulated by both HSD and HFD. Can the authors show that Dilp5 secretion is reduced by both HSD and HFD? Most importantly, can the authors test whether overexpression of cardiac Fit blocks HSD- or HFD-reduced Dilp5 secretion?

      We understand the point raised by the reviewer. First of all, we wanted to correlate the measured impact on food intake, when manipulating fit expression in the heart, to the level of Dilp release, as it has been used and validated in (Sun et al. 2017, Nat. Comm.). In this purpose, we used the same approach and protocol and results are shown in Figure 6 E-F.

      As mentioned by the reviewer, fit expression is downregulated in both HSD and HFD (which we confirmed by RT-qPCR in Figure S5A). As suggested by the reviewer, we performed Dilp5 immunostaining on CNS from females that were fed HSD of HFD for 10 days. Our results, in Figure 6B (left panels) and corresponding quantifications in Figure 6C, show that both diets strongly induce a decrease in Dilp5 amount in the IPCs and that it was not due to an altered Dilp2 or Dilp5 expression in the CNS (Figure S6A). In this condition, overexpressing fit, which has a promoting effect on Dilp secretion (Figure 6B, right panels ND), may only have an additive effect. This is shown in Figure 6B-C.

      Reviewer #2 (Significance (Required)):

      In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases.

      We again would like to thank the reviewer for her/his remarks and suggestions. Her/His important and constructive feedbacks helped us to improve and strengthen our study. Despite the weak points of the first version, she/he had supportive feedback and we deeply thank her/him. This revised version had improved results and analysis, thanks to the use of new genetic tools that strengthen this analysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Khamvongsa-Charbonnier et al. reported a RNA-seq analysis and RNA interference screening on high-fat and high-sugar-induced cardiomyopathy in Drosophila. The authors uncovered novel genes in 1C-metabolism, galactose metabolism, CD36-scavenger receptor and glucose transporter, as adaptative factors of cardiac function under high-fat and high-sugar treatment. The authors also identified a satiety hormone, Fit, as a cardiokine to control food intake and , expressed by dilp5 secretion. In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases. I have a few concerns, as listed below.

      1. Quantitative RT-PCR is required to validate the expression patterns of candidate genes identified from the RNAseq analysis.
      2. The authors state that the dysregulated gene expression patterns reflect acute adaptation to HSD and HFD stresses. Most of the candidate genes in this study were downregulated upon HSD and HFD. However, it is recommended that overexpression of these genes, rather than knockdown, is needed to confirm whether the downregulation of these candidate genes upon stresses is an adaptative response. For example, HSD feeding induces the heart period. Knocking down Gnmt, specifically in the heart, under the HSD feeding changes can reduce the heart period. This evidence is insufficient to suggest the protective role of Gnmt under the HSD diet. Gnmt has already been downregulated under the HSD. Further knockdown of Gnmt, instead of returning the Gnmt expression to normal levels, to protect cardiac contractile performance complicates the model.
      3. The authors suggest that the effect of nebu on heart contractility is not dependent on diet. However, based on the result from Figure 3O-P, the HFD treatment blocks the effect of nebu knockdown on heart contractility. The authors need to further explain these results and modify their conclusions accordingly.
      4. It is a bit confusing that knockdown of fit using Hand-Gal4 induced food intake, but knockdown of fit using tin-Gal4 or Dot-Gal4 did not significantly induce food intake (Fig 6A). The author did not provide any explanation of these results.

      What is even more confusing is that overexpressing fit using Dot-Gal4 decreased food intake, but overexpressing fit using Hand-Gal4 or tin-Gal4 did not significantly decrease food intake (Fig 6B). Why was the strong food intake phenotype not observed using Hand-Gal4 in both experiments?

      These confusing results lead to a question, which cell type is responsible for the production of cardiokine, Fit? 5. When testing the role of cardiac Fit in Dilp5 secretion, the authors subjected flies to starvation stress. However, the main focus of the present study is on HSD and HFD. The RNAseq analysis showed that Fit expression was downregulated by both HSD and HFD. Can the authors show that Dilp5 secretion is reduced by both HSD and HFD? Most importantly, can the authors test whether overexpression of cardiac Fit blocks HSD- or HFD-reduced Dilp5 secretion?

      Significance

      In summary, this study leverages the powerful genetic model Drosophila to uncover a number of new factors in regulating cardiac function under nutritional stresses and potentially offers new insights into molecular mechanisms underlying diet-related cardiac diseases.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors investigated the effect of nutritional stress (HSD and HFD) on cardiac function by assessing multiple parameters on adult flies. They next identified the adaptive transcriptomic changes in the heart in response to these nutritional stresses and screened for their roles under ND, HSD and HFD. They identified fit gene, encoding a satiety gene, expressed by cardiomyocytes and pericardial cells.

      I think the characterisation is thorough; however, the conclusion is not well supported by the evidence. My main concern is that in many graphs, the difference between control and experiment is subtle, and, secondly, the authors showed some conflicting results (e.g. one RNAi showed a reduction of one parameter, however, the other independent RNAi did not. In this case, I believe the authors shouldn't conclude that the RNAi is functionally required, since the RNAis are meant to confirm each other.

      High-Sugar and High-Fat Diets modified cardiac performance

      They assessed how HSD and HFD affect Adult fly heart performance. Instead of performing 3 weeks of dietary manipulation as has been done before by other groups, they put adult flies on HSD for 7 days and HFD for only 3 days. They found: HSD increases HP and SI, and reduces AI. The difference is too small and not consistent between different control lines. Also, when the difference is this small, p value does not tell much! They probably intentionally induced a milder effect so that they could assess adaptive transcriptomic changes to this nutritional stress. In Fig. 1D SI is increased under HSD with control-KK, In Fig. S1C, SI is not changed under HSD with control-GD and control-GFP. Instead, DI is increased, which is also opposite to what they showed in Fig. 1 C.

      HFD increased ESD, EDD, SV, FS and CO.(Hypertrophy). This is not true with control-GD and control-GFP lines though!

      Comments: They have assessed many parameters in live animals with many different control lines, which is thorough. However, it is hard to draw any conclusions based on these conflicting results. Are these effect KK line specific?

      Analysis of the fly cardiac transcriptome upon nutritional stress

      RNA seq to detect differentially expressed genes under HSD and HFD vs ND. Most DE genes are downregulated, which prompts them to assess how the downregulation of these genes adapts the animals to this nutritional stress.

      High Sugar Diet downregulated 1c-metabolism and Leloir galactose pathways.

      GNMT is downregulated under HSD and HFD. In ND, GNMT knockdown increased ESD, EDD and CO. Sardh knockdown did the same? However, Sardh knockdown did not affect ESD significantly.

      The conclusion here is: GNMT knockdown induces hypertrophy, similar to the effect of HFD.

      In HSD, further knockdown of GNMT reduced (rescued) HP, suggesting downregulation of GNMT under HSD is adaptive. Should overexpress GNMT under HSD to see if this manipulation further increases HP, to claim GNMT downregulation is an adaptive change to high sugar stress. Interestingly, HSD itself tends to decrease AI, a further knockdown of GNMT further decreases AI. This indicates GNMT downregulation under HSD contributes to AI reduction. Together, GNMT downregulation under HSD prevents HP from going higher, while its downregulation causes AI going down.

      In the manscript, the authors claim that " Gnmt KD led reduced HP and AI, suggesting that it is able to counteract the effect of HSD observed in control flies on these phenotypes". This is not true according to the logic in Results section 1. As in section 1, the effect of HSD on AI is not significant, so the authors shouldn't say" HS tended to reduce AI".

      Why GNMT knockdown reduced FS under ND (Fig. S3C), while increasing FS under HSD (Fig. 3F)? If GNMT knockdown induces hypertrophy, I would expect it to increase FS.

      High Fat Diet modulated CD36-scavenger receptor and Glut8 orthologues

      HFD: Snmp1 gene is downregulated, however, both overexpression and knockdown of Snmp1 in ND induced some phenotypes.

      HFD: nebu has a role in regulating cardiac function under ND.

      HSD and HFD revealed the secretory function of the heart

      They identified diet-regulated secreted proteins that are required for cardiac dysfunction.

      Cardiac Fit expression impacted Cardiac performance.

      The author used Hand-G4 to knock down Fit using KK and GD lines, KK line showed a reduction in HP (Fig. 5A), but not GD line (Fig. S5D). How did the author conclude that Fit is required for cardiac function? Also, with the positive data, the difference is too subtle.

      Referee cross-commenting

      i agree with the experiments proposed by reviewer 2.

      Significance

      The study aims to examine the effect of diet on cardiac function.

      The strength is that a lot of characterisations were done.

      the weakness is the functional data regarding fit could not be validated in two different RNAis, thus the evidence is not strong to support the conclusions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Figure 1D: It would be useful to indicate the number of embryos analyzed for these experiments (n = ?).

      Number of embryos now included in figure legend

      Figure 3B: The control condition for gcl⁻/⁻; ras-RNAi is labeled as "EV". This terminology (presumably "empty vector") is not defined in either the text or the figure legend. In addition, the magenta channel for the Ras-G37 condition appears to be flipped horizontally.

      We replaced with “-“ in figure and figure legend

      Page 7: The text states that "Ras-C40 activates the PI3K pathway," whereas the figure depicts Ras-C40 as activating the RalA pathway. This discrepancy could be confusing for the reader and should be corrected.

      The diagram has been corrected

      Figures 4 and 5: To facilitate interpretation, it may be helpful to include a schematic of the PI3K complex indicating the different subunits used in the study, along with information (potentially color-coded) about whether each construct primarily acts as an activator or inhibitor of PI3K function.

      Figure 4E and Figure 5E were added

      Figure 4A and 4B: For clarity and consistency with the text, the panels (and corresponding plots) for dp110-WT and dp110-CAAX could be placed before those for dp110-D954A and dp110-ΔRBD.

      Order of constructs was rearranged

      Figure 5C: The term "p60-TCEp3," which appears to correspond to the germ plasm-targeted p60-WT construct, is not defined in either the figure legend or the main text.

      Clarification was added to the text (p.11, line 225)

      Page 12: The reference "(Fig. S1A, Movie 1)" should be corrected to "(Fig. S2A, Movie 1)."

      Corrected

      Page 13: There is a missing word in the sentence "the biosensor appeared to be enrich to...", which should be corrected to "enriched."

      Corrected

      Figure 7A: Although the data presented are interesting and ultimately support the authors' conclusion that Torso regulates PIP3 levels, the results are somewhat counter-intuitive and may be confusing for readers. The authors might consider moving this panel to the Supplementary Figures. In addition, it could be informative to include PIP3 measurements for gcl⁻/⁻ (and possibly gcl⁺/⁻) pole buds in Figure 7B, as PIP3 appears particularly enriched in these conditions compared to wild type.

      We agree that at first the findings in the early embryos were confusing, but we prefer including them in the main figure to demonstrate changes in PIP 3 distributions in torso mutants. We are now providing a possible explanation for these findings (p13 line 270-). The differences are quite clear in the older embryos and measurements shown in 7B-D. Pole bud measurements for gcl-/- and gcl+/- are shown in figure 6 E-G.

      Reviewer #2

      Fig. legends to 1C and 1D are swapped.

      Corrected

      Why is csw not necessary for PGC formation? It acts upstream of Ras. This is not discussed.

      We now highlight this point in the text (and refer to studies on the sevenless kinase, which suggested a similar position of Csw parallel or downstream of Ras (page 6 line 107-).

      Fig 3C. Consider changing the order of the ras-variants used: S35, G37, C40 instead of S35, C40, G37.

      We changed the schematic in Figure 3C that should make the order of Ras variants more intuitive.

      Fig 4A, B: Consider changing the order of the panels. Control, dp110-wt, dp110-CAAX, dp110-D954A, dp110-deltaRBD.

      Order of constructs was rearranged

      Fig S4 is mentioned in the text before S2 and S3. Consider changing the suppl. figure order.

      Order of supplementary figures was rearranged

      Page 12: Fig S1 A does not show PIP2 dynamics. Movie 1 is not available to this reviewer. The authors most likely refer to fig. S2.

      Movie 1 was uploaded and figure calls were corrected

      Page 13, 1st para: Why do the authors use glc heterozygous embryos to look at PIP3 and PIP2? Particularly so when they report later in the MS that glc+/- behave differently to wt controls in terms of PIP3 levels (Fig. 7C). By looking at gcl+/+, they might find that now PIP2 levels are different in gcl mutant embryos or that the differences between PIP3 levels in +/+ and -/- are larger than compared with +/-.

      Since gcl+/- embryos form the same number of PGCs as WT but show a statistically significant increase in PI3K activity when comparing membrane to cytoplasm staining intensity, we favor using gcl+/- embryos, as these embryos may represent a more sensitive test for PIP2 and PIP3 levels.

      Pages 15 and 16: revise figure calls in the text.

      Figure calls were revised

      M+M: How were gcl+/- and gcl-/- embryos identified?

      Since all genetic manipulations in this alter the maternal contribution to the embryo, we us the term ‘mutant’ embryos referring to the maternal genotype (indicated on page 3 line 33 and more clearly stated in material and methods and reagent table). Embryos derived from mother of a specific maternal genotype are all identical, thus we can easily distinguish between embryos derived from homozygous mutant mothers (gcl-/-) or heterozygous mutant mothers (gcl-/+) In the reagents table we include the precise genotype description. “CyO” refers to the balancer chromosome commonly used to identify heterozygotes on the second chromosome. Flies with the CyO balancer have curly wings.

      Reviewer #3

      Figure 1B: The authors describe that embryos with OptoSos still form buds which protruded from the cortex, but PGCs largely fail to cellularize (described in pg. 5). I'm not sure what they meant by "fail to cellularize" as this is not obvious to me when looking at the figure. The authors should describe how they know it's cellularized in the controls and not in the OptoSos or change the wording to "suggesting a failure to cellularize".

      We used the word ‘protruded’ to describe our live observations. PGCs were quantified in fixed embryos, immunostained with anti-Vasa antibody to count Vasa positive cells (Fig 1C and D. We observe a lack of Vasa-positive PGCs, only in the light-activated OptoSos condition.

      Fig. 1B, lines 4-5: at what stage are these embryos? Cycle 9? Cycle 14? Both?

      Nuclear cycles of embryos for each panel are noted on the left side of each panel

      Fig. 4A: add dp110-CAAX results to Results section

      dp110-CAAX results are included in the Results section (p.9. line 177)

      Figure 5C: The hyper-clustered phenotype they describe is hard to visualize in this figure (described in pg. 11). The authors should describe what is meant by "hyper-clustered".

      We agree and re-worded the description of this observation to be clearer, page 11, line 226-.

      Figure 7: When comparing Fig. 7A and 7B torsoHH/WK images, we can see that in Fig. 7A that PIP3 pattern changes such that PIP3 is now at the most posterior end where PGC will eventually form (compared to control that has low PIP3 in this region), but then in Fig. 7B they are looking at the buds and they say PIP3 levels decrease, which does not correspond to Fig. 7A. Are these simply different stages and PIP3 levels change over time (looking at Fig. 7C, PIP3 does not seem to change a lot over time)?

      The figure legend now states more clearly that embryos were of different ages. We also explain in the text the apparent discrepancy in the patterns before and during budding (page13 line 266). The time points in figure 7C span nuclear cycle 10, not earlier (page14 line 274). By measuring membrane to cytoplasmic distribution, a more accurate comparison is possible at this stage.

      p. 5, line 5: "Optosos" is written "OptoSos" elsewhere (suggest using OptoSos throughout)

      Corrected

      Is it possible that inhibition of myosin II recruitment is due to conversion of PIP2 -> PIP3, thus loss of PIP2, or is it that myosin is specifically recruited to regions where PIP2 is high? This seems like a point that should be added to the discussion.

      This point is now discussed on page 20, line 403

      p. 5, line 6: suggest adding a comma after "Ras" for clarity

      Corrected

      p. 5, last line: the genotype is "w^1118" (with ^ indicating a superscript), not "w^-1118", and is italicized (this should be corrected throughout)

      Corrected

      p. 6, line 2: replace "cellularizing" with "cellularization"

      Corrected

      p. 6, lines 11-13: Where is it shown that knockdown of csw, dsor1 and rolled did not restore PGC formation? The data are not present in Fig. 2C (could include in supp fig?)

      We added these data as Supplementary figure 1

      p. 7, line 1: replace "interfere" with "interferes"

      Corrected

      p. 7, last three lines: what is stated here, "Ras-G37 [activates] both the RalA and the PI3K pathways, and Ras-C40 activates the PI3K pathway" is not consistent with what is diagrammed in Fig. 3C, where Ras-C40 is indicated as activating RalA (please correct either the text or the diagram)

      We apologize and corrected the figure

      p. 11, lines 1-2: the Pi3K21B gene and transcript should be italicized (note that Pi3K21B is the official gene name on FlyBase)

      Gene name was italicized

      p. 11, lines 6-10: it might be helpful to explain how the p60 construct was overexpressed (current lines 9-10) before describing the results (current lines 7-8)

      Clarification on p60 construct was added to p.11, line 215-

      p. 12, paragraph 2, line 2: the PIP2 biosensor should be written as "PLCgamma[PH]:mCherry" throughout, not "PLCy[PH]:mCherry"; this should be changed in the figures as well as the text (Symbol font can be used to turn "g" into lower-case "gamma", both in Word and in Illustrator)

      Gamma symbol was added

      It would also be helpful to show the overlap of the PIP2 and PIP3 signals in control vs. gcl mutants at different stages so the relative distribution and intensity of the signals can be better appreciated (consider adding this as a supplementary figure).

      Our data show that PIP2 is not affected by lack of GCL (Fig 6 B-D). We thus do not think that simultaneous imaging of PIP2 and PIP3 in gcl-/- would add to our conclusions. Furthermore, these experiments would require a significant time investment to generate the respective genotypes. Thus, we agree with the reviewer that this is experiment is beyond the scope of the paper.

      p. 12, paragraph 2, line 3: it does not appear that the two PIP markers were used "simultaneously" in Fig. 6A; however, this is evident from Fig. S2 and Movie 1 (consider placing callouts to these earlier in the paragraph or moving the description of simultaneous expression and observation of the two markers later in the paragraph to avoid confusion)

      We did simultaneously image PIP2 and PIP3 sensors and have added this as Movie 1 and also in supplementary Figure S4, which are now clearly referred to in the text.

      p. 12, paragraph 2, line 7: replace "Fig. S1A" with "Fig. S2" (this was confusing)

      Figure call was updated

      p. 16: change "Fig. 7G-I" to "Fig. 8G-I"

      Figure call was updated

      p. 20, Deming reference: there appears to be a stray asterisk in the title

      Asterisk was removed from reference

      Fig. 1D: need to explain that the colors in the graph indicate the numbers of PGCs formed (this could also be added as a label across the top of the graph); in addition, the number of embryos examined for each genotype should be included in the legend

      We added a label at the top of the graph and ‘n’ were added to figure legend

      Fig. 2B: spell out where csw, dsor1 and rolled data are shown; also, "n" is not defined; was this the number of embryos per genotype?

      We added these data as Supplemental Figure 1

      Fig. 3B: "EV" should be defined in the legend; is this "empty vector"?

      We are using a “-“ to mark controls without transgene

      Fig. 3C: see previous comment re: mistake in the diagram; I believe Ras-C40 was described as activating PI3K, not RalA

      We apologize and corrected the figure

      Fig. 4B, line 2: was the graph plotted from the data in panel (C) or panel (A)? panel (A) seems more likely, because the data in C is plotted in D; please correct the panel callout

      Figure legend was updated to refer to the correct panel

      Fig. 5C: describe "p60-TCEp3" in the legend

      We added germplasm-targeting 3’UTR (TCEp3) to legend and the construct and reference are provided in Material and Methods section

      Figure 6: In Fig. 6E-G, the "brightness" of PIP3 at the membrane corresponds to the images even with different views (posterior and orthogonal) and agrees with the graph.

      However, when looking at Fig. 6B, it looks to me that PIP2 is brighter in gcl+/-, but the opposite is true when looking at Fig. 6D (i.e., PIP2 looks brighter in gcl-/-). The authors might want to comment on this.

      We have updated the figure to better reflect our observations.

      Fig. 6A: define "(fire)" here or in the first figure legend where this is used

      We added an inset for the fire lookup table to clearly define the pseudcolor scheme used in the image

      Figure 8 title: "Actin fluorescence is increased in gcl-/- pole buds",But their graph in Fig. 8B comparing actin in gcl+/- to -/- is not significant

      Thanks for catching our mistake, myosin not actin is changed

      Fig. 8I: replace "Scarlett" with "Scarlet"

      Corrected

      Fig. 8D-F: Although the plots in panel E agree with the images in panel D, it is unclear why those in panel F are not more concordant. In F, myosin appears enriched at the cortex relative to the cytoplasm in gcl-/- mutants, which is hard to reconcile with the data in D-E.

      We have updated the figure to better reflect our observations.

      Fig. S2A: define the three time points shown here, and clarify that these are shown left to right (if this is indeed the case)

      We removed S2A and updated the movie to replace it

      Fig. S4: change "P60" to "p60" in the figure title

      Corrected

      Movie: The movies showing PIP2 and PIP3 in whole embryos are nice, but it would also be helpful to also include merged images of the two channels, so the reader can examine the relative accumulation of the two PIPs over time.

      Merged images panel was added to the movie.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Although Torso is known to antagonize primordial germ cell (PGC) formation, the underlying mechanisms remain unclear. Canonical Torso signalling typically results in activation of Ras. However, the authors show that Ras-mediated suppression of PGC formation is independent of the Raf/MEK/ERK pathway. Instead, they uncover an unexpected role for Torso in activating phosphoinositide 3-kinase (PI3K) that promotes formation of PIP3 enriched posterior membrane domains. The resulting increase in PI3K activity disrupts PGC formation. Furthermore, they show that by promoting Torso degradation, the ubiquitin ligase adaptor Germ Cell-Less (GCL) primes the posterior membrane with reduced PIP3 to facilitate PGC formation. Lastly, the authors suggest a model where antagonistic relationship between GCL and Torso influences actomyosin contractility that may allow the bud to constrict for proper PGC formation.

      Major comments:

      Figure 1B: The authors describe that embryos with OptoSos still form buds which protruded from the cortex, but PGCs largely fail to cellularize (described in pg. 5). I'm not sure what they meant by "fail to cellularize" as this is not obvious to me when looking at the figure. The authors should describe how they know it's cellularized in the controls and not in the OptoSos or change the wording to "suggesting a failure to cellularize".

      Figure 5C: The hyper-clustered phenotype they describe is hard to visualize in this figure (described in pg. 11). The authors should describe what is meant by "hyper-clustered".

      Figure 6: In Fig. 6E-G, the "brightness" of PIP3 at the membrane corresponds to the images even with different views (posterior and orthogonal) and agrees with the graph. However, when looking at Fig. 6B, it looks to me that PIP2 is brighter in gcl+/-, but the opposite is true when looking at Fig. 6D (i.e., PIP2 looks brighter in gcl-/-). The authors might want to comment on this.

      It would also be helpful to show the overlap of the PIP2 and PIP3 signals in control vs. gcl mutants at different stages so the relative distribution and intensity of the signals can be better appreciated (consider adding this as a supplementary figure).

      Figure 7: When comparing Fig. 7A and 7B torsoHH/WK images, we can see that in Fig. 7A that PIP3 pattern changes such that PIP3 is now at the most posterior end where PGC will eventually form (compared to control that has low PIP3 in this region), but then in Fig. 7B they are looking at the buds and they say PIP3 levels decrease, which does not correspond to Fig. 7A. Are these simply different stages and PIP3 levels change over time (looking at Fig. 7C, PIP3 does not seem to change a lot over time)?

      Page 15, last paragraph: "If myosin II recruitment is inhibited when PIP3 levels are high" Is it possible that inhibition of myosin II recruitment is due to conversion of PIP2 -> PIP3, thus loss of PIP2, or is it that myosin is specifically recruited to regions where PIP2 is high? This seems like a point that should be added to the discussion.

      Overall, I think their claim that antagonistic activities of GCL and Torso is crucial for PGC formation is well justified. The combination of optogenetic tools with activation and lof mutants is nicely done. Some clarification regarding the PIP3 and PIP2 levels will be helpful to the reader (see my comments above). The myosin claim is less convincing (see my comment on Fig. 8D-F below).

      Minor comments on the text:

      p. 5, line 5: "Optosos" is written "OptoSos" elsewhere (suggest using OptoSos throughout) p. 5, line 6: suggest adding a comma after "Ras" for clarity p. 5, last line: the genotype is "w^1118" (with ^ indicating a superscript), not "w^-1118", and is italicized (this should be corrected throughout) p. 6, line 2: replace "cellularizing" with "cellularization" p. 6, lines 11-13: Where is it shown that knockdown of csw, dsor1 and rolled did not restore PGC formation? The data are not present in Fig. 2C (could include in supp fig?) p. 7, line 1: replace "interfere" with "interferes" p. 7, last three lines: what is stated here, "Ras-G37 [activates] both the RalA and the PI3K pathways, and Ras-C40 activates the PI3K pathway" is not consistent with what is diagrammed in Fig. 3C, where Ras-C40 is indicated as activating RalA (please correct either the text or the diagram) p. 11, lines 1-2: the Pi3K21B gene and transcript should be italicized (note that Pi3K21B is the official gene name on FlyBase) p. 11, lines 6-10: it might be helpful to explain how the p60 construct was overexpressed (current lines 9-10) before describing the results (current lines 7-8) p. 12, paragraph 2, line 2: the PIP2 biosensor should be written as "PLCgamma[PH]:mCherry" throughout, not "PLCy[PH]:mCherry"; this should be changed in the figures as well as the text (Symbol font can be used to turn "g" into lower-case "gamma", both in Word and in Illustrator) p. 12, paragraph 2, line 3: it does not appear that the two PIP markers were used "simultaneously" in Fig. 6A; however, this is evident from Fig. S2 and Movie 1 (consider placing callouts to these earlier in the paragraph or moving the description of simultaneous expression and observation of the two markers later in the paragraph to avoid confusion) p. 12, paragraph 2, line 7: replace "Fig. S1A" with "Fig. S2" (this was confusing) p. 16: change "Fig. 7G-I" to "Fig. 8G-I" p. 20, Deming reference: there appears to be a stray asterisk in the title

      Minor comments on the figures and figure legends:

      Fig. 1B, lines 4-5: at what stage are these embryos? Cycle 9? Cycle 14? Both? Fig. 1C: see previous comment about "w^1118" genotype nomenclature Fig. 1D: need to explain that the colors in the graph indicate the numbers of PGCs formed (this could also be added as a label across the top of the graph); in addition, the number of embryos examined for each genotype should be included in the legend Fig. 2B: spell out where csw, dsor1 and rolled data are shown; also, "n" is not defined; was this the number of embryos per genotype? Fig. 3B: "EV" should be defined in the legend; is this "empty vector"? Fig. 3C: see previous comment re: mistake in the diagram; I believe Ras-C40 was described as activating PI3K, not RalA Fig. 3E: fix "w^1118" as described above Fig. 4A: add dp110-CAAX results to Results section Fig. 4B, line 2: was the graph plotted from the data in panel (C) or panel (A)? panel (A) seems more likely, because the data in C is plotted in D; please correct the panel callout Fig. 5C: describe "p60-TCEp3" in the legend Fig. 6A: define "(fire)" here or in the first figure legend where this is used Figure 8 title: "Actin fluorescence is increased in gcl-/- pole buds",But their graph in Fig. 8B comparing actin in gcl+/- to -/- is not significant Fig. 8D-F: Although the plots in panel E agree with the images in panel D, it is unclear why those in panel F are not more concordant. In F, myosin appears enriched at the cortex relative to the cytoplasm in gcl-/- mutants, which is hard to reconcile with the data in D-E. Fig. 8I: replace "Scarlett" with "Scarlet" Fig. S2A: define the three time points shown here, and clarify that these are shown left to right (if this is indeed the case) Fig. S4: change "P60" to "p60" in the figure title

      Movie: The movies showing PIP2 and PIP3 in whole embryos are nice, but it would also be helpful to also include merged images of the two channels, so the reader can examine the relative accumulation of the two PIPs over time.

      Referees cross-commenting

      I agree enthusiastically with the comments of the other reviewers, who often came to the same conclusion I did about the manuscript and the data, including some of the detailed points about the figures, etc.

      Significance

      General assessment:

      The many strengths of this manuscript include elegant genetic and optogenetic approaches using well-designed transgenes.

      The main weakness is the lack of experiments showing simultaneous live imaging of the PIP2 and PIP3 sensors in gcl-/- and other genetic backgrounds, which would help the reader better envision how regulators of this pathway affect phospholipid distribution at the level of whole embryos and prospective pole cells. Note that because of the time required, I do not insist that they do this.

      Advance:

      Study demonstrates for the first time an unexpected role of Torso in PI3K regulation

      Audience:

      germ cell afficionados, developmental biologists, cell biologists, PI3K researchers

      My field of expertise:

      Drosophila, germ cell development, genetics, cell biology, live imaging, phosphoinositides

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The MS by Saiduddin et al. investigates the connection between the germ cell-less (gcl) germplasm component, the RTK Torso and cortical PI3P levels in the formation of a normal complement of primordial germ cells (PGCs) in the Drosophila embryo. The authors find that GCL regulates Torso levels, which in turn activate the PI3-kinase in a ras-dependent (but raf-independent) manner. It then follows that the realm of action of gcl defines a region at the posterior pole of the embryo where PI3P levels are sufficiently low to allow PGC formation.

      Specific points:

      • Fig. legends to 1C and 1D are swapped.
      • Why is csw not necessary for PGC formation? It acts upstream of Ras. This is not discussed.
      • Fig 3C. Consider changing the order of the ras-variants used: S35, G37, C40 instead of S35, C40, G37.
      • Fig 4A, B: Consider changing the order of the panels. Control, dp110-wt, dp110-CAAX, dp110-D954A, dp110-deltaRBD.
      • Fig S4 is mentioned in the text before S2 and S3. Consider changing the suppl. figure order.
      • Page 12: Fig S1 A does not show PIP2 dynamics. Movie 1 is not available to this reviewer. The authors most likely refer to fig. S2.
      • Page 13, 1st para: Why do the authors use glc heterozygous embryos to look at PIP3 and PIP2? Particularly so when they report later in the MS that glc+/- behave differently to wt controls in terms of PIP3 levels (Fig. 7C). By looking at gcl+/+, they might find that now PIP2 levels are different in gcl mutant embryos or that the differences between PIP3 levels in +/+ and -/- are larger than compared with +/-.
      • Pages 15 and 16: revise figure calls in the text.
      • M+M: How were gcl+/- and gcl-/- embryos identified?

      Significance

      This work reveals a novel, transcription-independent role for Torso in the regulation of cortical lipid compartmentalization and provides a molecular explanation for how Tor activity at the embryo posterior delimits the area where PGCs arise. The experiments are superbly documented, the MS is a pleasure to read and the hypotheses are elegantly tested. While the broader generality of the findings remains uncertain, particularly since the role(s) of gcl in germ cell development do not seem to be evolutionary conserved, the study sheds light on a ras-dependent, transcription-independent function of RTKs in cellularization, a function most likely to be essential also in other contexts.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an outstanding and elegant study that addresses an important question in developmental and germline biology: how the soma-germline boundary is established during embryogenesis. This represents one of the most fundamental cell-fate decisions during organismal development. The authors combine elegant genetics, optogenetics, quantitative live imaging, and lipid biosensors to provide a compelling mechanistic framework linking receptor degradation, lipid signaling, and cytoskeletal dynamics. They show that Torso signaling via PI3K and PIP3 antagonizes primordial germ cell (PGC) formation and promotes somatic cell fate. Furthermore, they demonstrate that GCL-mediated degradation of Torso at the posterior pole creates a PIP3-low membrane domain that permits myosin II recruitment and pole bud constriction, thereby enabling PGC formation. Together, these results clearly demonstrate how the soma-germline boundary is established.

      We have only minor comments on the manuscript, primarily aimed at improving clarity for non-specialist readers:

      1. Figure 1D: It would be useful to indicate the number of embryos analyzed for these experiments (n = ?).
      2. Figure 3B: The control condition for gcl⁻/⁻; ras-RNAi is labeled as "EV". This terminology (presumably "empty vector") is not defined in either the text or the figure legend. In addition, the magenta channel for the Ras-G37 condition appears to be flipped horizontally.
      3. Page 7: The text states that "Ras-C40 activates the PI3K pathway," whereas the figure depicts Ras-C40 as activating the RalA pathway. This discrepancy could be confusing for the reader and should be corrected.
      4. Figures 4 and 5: To facilitate interpretation, it may be helpful to include a schematic of the PI3K complex indicating the different subunits used in the study, along with information (potentially color-coded) about whether each construct primarily acts as an activator or inhibitor of PI3K function.
      5. Figure 4A and 4B: For clarity and consistency with the text, the panels (and corresponding plots) for dp110-WT and dp110-CAAX could be placed before those for dp110-D954A and dp110-ΔRBD.
      6. Figure 5C: The term "p60-TCEp3," which appears to correspond to the germ plasm-targeted p60-WT construct, is not defined in either the figure legend or the main text.
      7. Page 12: The reference "(Fig. S1A, Movie 1)" should be corrected to "(Fig. S2A, Movie 1)."
      8. Page 13: There is a missing word in the sentence "the biosensor appeared to be enrich to...", which should be corrected to "enriched."
      9. Figure 7A: Although the data presented are interesting and ultimately support the authors' conclusion that Torso regulates PIP3 levels, the results are somewhat counter-intuitive and may be confusing for readers. The authors might consider moving this panel to the Supplementary Figures. In addition, it could be informative to include PIP3 measurements for gcl⁻/⁻ (and possibly gcl⁺/⁻) pole buds in Figure 7B, as PIP3 appears particularly enriched in these conditions compared to wild type.

      Significance

      The biological question is highly interesting, the experimental design is very clear, and the data are convincing throughout. The imaging, quantification, and movies are of very high quality and strongly support the authors' conclusions. Overall, this manuscript represents a significant conceptual and technical advance and will be of broad interest to the fields of germline biology, membrane biology, and embryonic morphogenesis.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, the authors investigate the role of the tubulin polyglutamylase TTLL6 in maintaining colonic epithelial homeostasis and its potential role in colorectal cancer (CRC). Using transcriptomic analyses, mouse genetics, histology, and proteomics, the authors report that TTLL6 is highly expressed in colonic epithelial cells and decreases during CRC progression. Constitutive and epithelial-specific deletion of Ttll6 in mice leads to elongated colonic crypts, expansion of proliferative and stem cell compartments, and increased susceptibility to chemically induced colitis-associated carcinogenesis. Mechanistically, the authors identify the nucleic acid-binding protein PurA as a potential non-tubulin substrate of TTLL6. They propose that TTLL6-mediated polyglutamylation of PurA regulates its nuclear localization, thereby contributing to epithelial homeostasis in the colon. Together, the study suggests a TTLL6-PurA axis that may restrain early colorectal tumorigenesis.

      Major comments

      1. Evidence that PurA is a physiologically relevant TTLL6 substrate remains incomplete. A central conclusion of the manuscript is that PurA is a substrate of TTLL6 whose polyglutamylation regulates nuclear localization. While the authors present several lines of evidence (PolyE immunoprecipitation, co-transfection experiments, and mutagenesis of the PurA C-terminal glutamate residues), the physiological relevance of this modification remains somewhat indirect. For example, polyglutamylation of endogenous PurA in colonic epithelial cells is inferred but not directly demonstrated. The PolyE antibody detects glutamate chains but does not identify the specific modified protein in tissue. Direct evidence that PurA is polyglutamylated in vivo (e.g., MS identification of the modification site on PurA or PurA immunoprecipitation followed by PolyE detection) would strengthen the mechanistic claim. At present, the data convincingly show that TTLL6 can glutamylate PurA in an overexpression system, but the endogenous modification remains less clearly demonstrated.
      2. Mechanistic link between PurA localization and the epithelial phenotype is not established. The authors propose that loss of TTLL6 disrupts PurA nuclear localization and thereby alters epithelial homeostasis. However, the manuscript does not establish a causal relationship between PurA localization and the observed crypt phenotypes. Specifically, it is not shown whether PurA loss phenocopies Ttll6 deficiency in the colon. No experiments test whether restoring nuclear PurA rescues the Ttll6 phenotype. Downstream transcriptional or signaling pathways regulated by PurA are not explored. Thus, while the TTLL6-PurA relationship is intriguing, the study remains largely correlative with respect to functional consequences.
      3. Interpretation of the tumorigenesis data should be tempered. The authors conclude that Ttll6 deficiency promotes colon carcinogenesis. However, the tumor data appear somewhat limited. Increased tumor numbers are reported only at an early time point (day 40) and are described as a trend toward significance. By day 70, tumor numbers and sizes appear comparable between groups. The increased incidence of vimentin-positive crypts is interesting but does not clearly establish increased tumor burden. Given these results, the conclusion that TTLL6 restrains tumorigenesis may be stronger than supported by the data. The authors may wish to frame this as enhanced early tumor development or altered tumor progression rather than increased tumorigenesis per se.
      4. Expansion of multiple epithelial cell populations requires clarification. The authors report that Ttll6-deficient colons exhibit expansion of stem/progenitor compartments as well as increased numbers of differentiated cells (e.g., goblet cells and enterocytes). While these findings are interesting, the biological interpretation is somewhat unclear. For example, expansion of stem/progenitor compartments typically accompanies reduced differentiation rather than increased differentiation. It is not clear whether the increased numbers of differentiated cells reflect overall crypt enlargement or altered lineage allocation. Quantification of cell-type proportions rather than absolute cell numbers would help clarify whether differentiation programs are altered.
      5. Nuclear polyglutamylation requires further clarification The authors report nuclear PolyE staining in colonic epithelial cells and propose that this reflects polyglutamylation of non-tubulin substrates such as PurA. However, it is not clear whether other nuclear proteins could account for this signal. The specificity of the nuclear PolyE signal should be better validated. Additional controls (e.g., peptide competition or validation with alternative approaches) would strengthen the interpretation.

      Minor comments

      1. The manuscript would benefit from clearer distinction between tubulin vs non-tubulin glutamylation throughout the text.
      2. Some conclusions in the Discussion appear slightly overstated relative to the data (e.g., the role of the TTLL6-PurA axis in tumor suppression).
      3. The description of the Ttll6 mouse models (constitutive vs conditional deletion) could be clarified earlier in the Results section.
      4. Quantification methods for histological analyses (crypt length, cell counts, marker-positive cells) should be described in greater detail in the Methods.
      5. It would be useful to include representative images of PurA localization in control vs Ttll6-deficient colon tissue in the main figures.
      6. Several minor typographical issues appear throughout the manuscript and should be corrected during revision.

      Significance

      General assessment

      This study investigates the role of the polyglutamylase TTLL6 in intestinal epithelial biology and colorectal cancer. The identification of a potential non-tubulin substrate (PurA) is conceptually interesting and expands the emerging view that tubulin-modifying enzymes can regulate additional cellular proteins. The study combines mouse genetics, histological analysis, transcriptomic datasets, and proteomics, which together provide a substantial dataset supporting a role for TTLL6 in regulating crypt architecture and epithelial proliferation. However, the mechanistic link between TTLL6 activity, PurA modification, and epithelial homeostasis remains incompletely resolved. The tumorigenesis data also suggest only modest effects on carcinogenesis.

      Advance relative to previous literature

      Previous studies have linked members of the TTLL family primarily to microtubule regulation and ciliary biology. This work extends these findings by suggesting a tissue-specific function of TTLL6 in the colon, and the existence of non-tubulin substrates regulating epithelial biology. If further validated, the identification of PurA polyglutamylation could represent an interesting conceptual advance.

      The manuscript will likely be of interest to researchers working in cytoskeletal post-translational modifications, intestinal epithelial biology, colorectal cancer biology

      My expertise lies in cytoskeletal regulation, epithelial biology, and intestinal tissue organization, which are directly relevant to the central themes of this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this study, the authors investigate novel functions of the tubulin typrosine ligase-like protein 6 (TTLL6), which covalently adds glutamate residues to the C-terminus of a given protein. The authors have already published previously on this topic. In the current study, the role of TTLL6 in colon function and pathologies was investigated. The study consists of two major parts. In the first part, a mouse model is used to show that TTLL6 is expressed at elevated levels and activity in epithelial cells of the colon. A database search indicated that TTL6 expression positively correlates with prognosis of patients with colorectal cancer. The authors generated a TTLL6 KO mouse and showed that induced tumor growth at 40 days was more positive for vimentin in the crypts of these mice, which should correlate with tumor aggressiveness. This difference was not observed anymore after 70 days. Morphological analyses of the crypts showed that in TTLL6 KO mice the crypts increased in length, a difference in proliferation markers, and a change of cell types in the crypts was observed.<br /> In the second part, the authors used a modification-specific antibody to immunoprecipitate (IP) proteins modified by TTLLs. To identify TTLL6-dependently modified target proteins, they compared these results with IPs from TTLL6 KO mice. A total of 43 proteins were identified this way. Because of their similarity to the tubuline tail sequence, two of the enriched proteins, PurA and PurB, were further analyzed. The authors provide evidence that PurA but not PurB is modified by TTLL6, which as a result changes its subcellular localization. While the first part of this work provides convincing novel insights into TTLL6's function with potential pathological relevance, the second part raises some concerns. I would therefore tend to rate the quality of the first part significantly higher than the second part.

      Major comments:

      1) When considering the results of the induced colorectal cancer test, the only significant difference between WT and KO was the moderately higher expression of Vimentin (figure 5E-F). Since this is the main evidence for a pathological relevance of TTLL6 in cancer, it is important to understand how the quantification of Vimentin in the complex tissue shown in figure 5E was done. A detailed description of how these images were analysed and perhaps a table with raw data would be essential to convince the reader of the conclusions. In the currently presented form, I find the analyses not too convincing.

      2) Figure 7A: It was somewhat surprising that two of the least significant (PurA is just below the cutoff) were used for further analyses. Although the authors explain that both proteins have strong sequence similarity to the know TTLL6 target, tubuline, the C-terminal, genomically encoded protein sequence of PurA and PurB already contain several glutamates. This raises the concern that the polyE antibody in the IP possibly detected the non-modified C-terminal tail of PurA and PurB and that both proteins may not be modified by TTLL6. Because of this and the lower significance than other candidates, the authors should consider focussing on other hits (OPTIONAL). Besides being much more significant, they lack an accumulation of glutamates in their C-terminus (at least the ones I looked at). Alternatively, the concern of having potentially IP-ed unmodified proteins should be addressed.

      3) Figure 8A: this figure compares PurA with a modified PurA that lacks the C-terminal EEE stretch. The authors conclude that the subcellular localization is different between both and that the nuclear localization of WT PURA must be due to modification by the co-expressed TTLL6. There are two major concerns with this conclusion: Firstly, the expression of PurA without TTLL6 co-expression is a missing essential control. This would show if PurA itself is already predominantly located in the nucleus regardless of potential modifications (PurA seems to have different nucleocytoplasmic localization in different cell types). Secondly, both depicted cells look very different. In PurA the nuclei are much smaller and the cytoplasm seems also much smaller than in the PurA DDD-expressing cells. Furthermore, IF staining without proper quantification of several cells seem less than ideal for such conclusions. In case, the authors want to convincingly validate this conclusion such a quantification with several cells would be required. OPTIONAL: an alternative approach would be a nucleo-cytoplasmic fractionation experiment followed by a western blot.

      Figure 8B: it seems that the contrast between the images of the upper and lower panel is very different. For this reason, I find it difficult to follow the conclusions. However, even when ignoring this aspect, I have great problems coming to the same conclusions as the authors.

      Minor comments:

      1) In figure 3A it would help if the legend describes what exactly "Control (+ or -)" means.

      2) In figure 3E-F, a label inside of the figure (what is the red bar, what the blue) would help the reader to faster grasp the subfigures.

      3) Figure 7C-D: these experiments are based on strong overexpression of TTLLs, which might result in unphysiological modifications of PurA. I would suggest to include a note of caution in the discussion that this is a possibility.

      4) In the discussion (page 9, last paragraph), it is stated: "Our findings suggest that the polyglutamylation of PurA is essential for maintaining colonic homeostasis". I do not understand this statement, as this study does not provide any evidence that modification of PurA does play a functional role in the colon (expression itself is not an evidence for function importance or even being "essential"). I recommend to remove this statement.

      5) Not all abbreviations are introduced properly (like CRC).

      Significance

      In general, this study addresses a very interesting aspect - i.e. the covalent addition of multiple glutamate residues to the C-terminus of a target gene by the enzyme TTLL6. The authors convincingly show that this protein regulates the morphology and composition of crypts in subregions of the colon. This is certainly a new and important finding that expands our knowledge about the functional breadth of this class of enzymes. If convincingly validated (see major concerns), also the pathological relevance of this enzyme for cancer progression would be of general interest. However, this statement has to be considered with a note of caution as this is not my area of expertise.

      The validation of novel targets of TTLL6 after IP is - at this stage of the manuscript - not very convincing to me. In particular the claim that PurA does play a functional role in the TTLL6-dependent regulation (of crypts) is not justified by the data. However, given that the list of other candidates contains several important gene regulators, this work might have the potential to open up to open up the field for new research directions.

      The reviewer's areas of expertise: cell biology, biochemistry, histology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This study shows that loss of TTLL6 affects colonic epithelial homeostasis (crypt architecture and proliferation/differentiation markers) and proposes that TTLL6 contributes to a nuclear glutamylation signal, with PurA presented as a candidate non-tubulin substrate. The authors also connect TTLL6 mRNA levels to human CRC progression and outcome. Overall, the observations are potentially interesting, but the manuscript currently does not establish a clear mechanistic link between TTLL6 activity, PurA, and the in vivo phenotype.

      Major comments:

      1. The "TTLL6-PurA link" framing is currently too strong. The pull-down data indicate multiple candidate substrates, and the study does not show that PurA is the key functional mediator of the epithelial phenotype. As written, the manuscript reads as though PurA is the central downstream effector, which is not yet supported. Requested change: Either add substrate-specific functional evidence (additional KO/rescue-type experiments) or soften the language throughout (title/abstract/discussion) to reflect that PurA is one candidate among several.
      2. PurA glutamylation should be demonstrated directly by MS. PolyE/GT335 immunoblotting and enrichment in PolyE pull-downs are suggestive, but they do not conclusively establish glutamylation of PurA at the C-terminal end (antibody specificity and/or co-precipitating glutamylated proteins remain possible explanations). Essential experiment: MS/MS identification of glutamylated residue(s) on PurA, ideally with evidence that the modification is TTLL6-dependent (WT vs KO or epithelial-inducible KO).
      3. Regional TTLL6 expression vs phenotype needs to be reconciled. TTLL6 expression is reported to be highest in distal colon, yet the strongest crypt-length phenotype appears in transverse colon (as presented). Proximal colon data are not shown in the main text. Requested revision: Provide complete regional analyses (proximal/transverse/distal) with consistent quantification and statistics, and discuss explicitly why TTLL6 expression levels and phenotype do (or do not) align.
      4. Several internal inconsistencies and missing statistics.
        • Fig. 1A vs 1B: CEC enrichment appears ~80-fold in panel A and ~4-fold in the panel B; if these reflect the same enrichment workflow, this discrepancy needs a clear explanation (normalization, starting material, ....).
        • Fig. 2A: statistics are missing.
        • Fig. 5D: the effects appear borderline; the conclusions should match the statistical support/significance. Requested revision: Ensure complete statistical reporting in the manuscript (n, definition of replicates, test used, p-values/thresholds) and avoid interpretive language where differences are not significant.
      5. PurA Localization claims would benefit from stronger imaging and quantification. For nuclear localization/redistribution conclusions (main Fig. 8 and related supplement), confocal imaging with Z-stacks (and orthogonal views) would be more convincing than representative single-plane images. In addition, conditions with PurA-only expression need a clear baseline description and quantification. Requested additions: confocal Z-stacks + blinded quantification of nuclear/cytosolic localization across replicates; ideally support with subcellular fractionation and quantitative immunoblotting.
      6. Overexpression artifacts should be considered more carefully. If TTLL6 has been described as an elongase in prior work (Mahalingan, NSMB, 2020, DOI: 10.1038/s41594-020-0462-0) high-level overexpression may generate non-physiological modifications or localization patterns. Requested revision: Soften conclusions drawn from overexpression experiments and provide appropriate expression controls and/or supportive evidence in more physiological settings.
      7. Mouse tumor data should be interpreted more cautiously relative to the human correlations. The human datasets suggest a correlation between TTLL6 mRNA levels and clinical features/outcome (including recurrence-free survival), which is potentially interesting. In contrast, the mouse CAC results appear modest/borderline and, in places, are interpreted as stronger evidence than the data support. Requested change: Avoid strong claims about TTLL6 promoting or suppressing tumor growth unless supported by robust, clearly significant differences and comprehensive burden metrics.

      Minor comments:

      • Every figure should clearly state n (biological vs technical), statistical test, and multiple-comparison correction where applicable.
      • Where effects are segment-specific, the text should reflect that specificity and avoid global statements.
      • The Discussion would benefit from a clearer separation of what is directly shown versus what is proposed (especially near the end).
      • TTLL6 expression is largely presented at the transcript level; it would help to make this explicit throughout and avoid wording that implies protein-level validation where it is not shown.

      Significance

      The manuscript has the potential to be of interest because it points to a possible role for TTLL6 in non-tubulin, nuclear glutamylation in the intestinal epithelium, and it links TTLL6 expression to human CRC datasets. At present, however, the broader impact is limited by (i) insufficient direct evidence that PurA is glutamylated in vivo and (ii) the lack of a causal connection between PurA and the epithelial phenotype. In addition, while the human data show correlations between TTLL6 expression and clinical parameters/outcome, the mouse CAC phenotype appears comparatively modest/borderline and should be interpreted with appropriate caution. With stronger biochemical validation (MS), improved localization quantification, and more restrained framing (or additional functional data), the work could appeal to readers in intestinal epithelial biology, post-translational modification biology, and CRC research.

      Expertise: enzymology; post-translational modifications; microscopy; cancer mechanisms.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for taking the time to review our manuscript and for the insightful comments given us that will help to improve our manuscript. Please find below a point-by-point answer to each reviewer.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): **

      The authors have set up a mouse embryonic sensory neuron system to study impact of complete loss of frataxin (using a nice cre-based AAV approach). There is careful delineation of the phenotype of these cells upon complete frataxin loss using a significant range of relevant endpoints (e.g. OCR, oxidative stress, mitochondrial imaging at EM level). A major finding is the failure of neurons lacking frataxin to undergo full soma maturation - so smaller cells. In addition, AMPK is activated (maybe not surprisingly given the severe loss of mitochondrial function and drop in ATP). Solid mechanistic experiments reveal that AMPK activation when blocked prevents the suppression of soma size (we do not get the same data with regard to alanine supplementation). There are interesting studies with alanine that, in part, reverse indices of oxidative stress (mitochondrial stress, specifically). The experiments are well designed with mechanistic insight and the data clearly presented with appropriate statistical analysis. A major problem is the culture system. The labelling studies and soma size analysis reveal that this is not a truly representative population of DRG neurons. It seems all the small neurons are missing - I assume all trkA positive and GDNF-dependent neurons have been lost somewhere (this comprises 80% of the neurons at the lumbar level). The methods section covering the mouse DRG culture is sparse in terms of details and refers to a text book which I cannot access. Another issue is the background glucose concentration - growing such cells at 25mM is standard I know - but its still sub-optimal. Glucose at this concentration represents a hyperglycemic state - normal glucose is 5-10mM - its not really correct to term it glycolysis inhibitory since hexokinase, the rate limiting enzyme, has a Kd around 0.3-1mM glucose. When studying AMPK this system will exhibit suppressed AMPK activity/expression due to the high background glucose concentration of 25mM.*

      * Reviewer #1 (Significance (Required)):

      The use of this unrepresentative culture system does lower the significance. While large caliber sensory neuron, e.g. proprioceptive, dysfunction is important during development and into the adult it seems rather unfortunate that the authors ignore all other sensory neurons! Persons with Friedreich ataxia (FA) also suffer from small fiber abnormalities, e.g. pain, and these neurons actually express a higher density of mitochondria (since they are unmyelinated). So, when the authors state this model "faithfully recapitulates key hallmarks of FA...." I have to say I disagree. In terms of general significance the work is well performed with some good mechanistically strong studies, however, it does still contain a major purely descriptive component. The focus on AMPK is understandable but we learn nothing really novel about its function and role in sensory neurons. *

      We sincerely thank Reviewer #1 for the careful evaluation of our work and for the positive appreciation of the experimental design, mechanistic approach, and data presentation. We are grateful for the reviewer’s comments, which helped us clarify several aspects of the manuscript and improve the description of our culture system and metabolic conditions.

      Comment on alanine/ALA

      We would first like to clarify a terminology issue. In our study, we did not use alanine supplementation, but alpha-lipoic acid (ALA). We have checked and revised the text to avoid any possible ambiguity on this point.

      Comment on the DRG culture system and representation of sensory subtypes:

      We appreciate the reviewer’s concern regarding the representativeness of the embryonic dorsal root ganglia (DRG) culture system. We agree that this in vitro model does not fully reproduce the cellular diversity and maturation state of the in vivo DRG environment, and we have revised the manuscript to make this limitation more explicit. That said, we respectfully do not think our cultures are devoid of small sensory neurons. In the original submission, Supplementary Fig. 1D-E already showed a substantial population of CGRP-positive neurons__, supporting the presence of peptidergic small-diameter sensory neurons. In addition, we performed TrkA immunostaining,__ which showed that a large proportion of neurons in our cultures are also TrkA-positive. We can add these TrkA data to the revised manuscript if the reviewer and editor consider that this would strengthen the characterization of the culture system.

      More broadly, the reviewer raises an important point: dissociated embryonic DRG cultures maintained under simplified trophic conditions cannot be expected to preserve the full in vivo balance of mature sensory neuron subtypes. Embryonic and neonatal DRG neurons are known to depend strongly on trophic support in vitro, and sensory subtype maturation normally requires both neurotrophic cues and interactions with the native microenvironment. We therefore agree that our system should be viewed as a reductionist model of frataxin loss in developing sensory neurons rather than a complete reconstruction of the mature DRG. We have now expanded the methods section to better describe the culture conditions and revised the discussion to acknowledge more explicitly that future work using more complex conditions, such as combined trophic factor regimens, neuron–glia co-cultures, or organotypic approaches, may help preserve a more physiological sensory subtype composition.

      Comment on glucose concentration and “glycolysis-inhibitory” conditions:

      We thank the reviewer for prompting us to clarify this point. We agree that chronic exposure to 25 mM glucose can influence neuronal metabolism and AMPK signaling, and this issue has been discussed in the literature for neuronal culture systems. However, we believe there was a misunderstanding regarding the specific experiment referred to in our manuscript. In the condition that we termed “glycolysis-inhibitory,” the neurons were not maintained in high glucose. Rather, these experiments were performed in glucose-free medium supplemented with galactose, i.e. in the absence of glucose. Galactose substitution is commonly used to reduce ATP production from glycolysis and increase dependence on mitochondrial oxidative phosphorylation. We have revised the methods and results sections to make this point much clearer and now explicitly distinguish between low-glucose conditions and glucose-free/galactose conditions__.__

      Comment on significance and disease relevance:

      We appreciate the reviewer’s concern regarding the extent to which this model recapitulates the full spectrum of sensory pathology in FA. We agree that our culture system is rather artificial and might therefore not model the entire peripheral phenotype of FA.

      That being said, we believe the model remains highly relevant to a major and well-established component of FA neuropathology. Multiple neuropathological and clinical studies indicate that FA is characterized predominantly by a dorsal root ganglionopathy / sensory neuronopathy with marked involvement of large myelinated sensory neurons and their projections, which is central to the loss of proprioception and sensory ataxia that define the disease. Reviews of FA neuropathology consistently emphasize DRG hypoplasia/atrophy and loss of large myelinated fibers as hallmark features.

      We agree that small-fiber abnormalities have also been reported, including reduced intraepidermal nerve fiber density in some studies, and we do not wish to dismiss that aspect of the disease. However, the current literature still supports that the dominant and most characteristic peripheral lesion in FA affects large sensory neurons and large myelinated fibers more prominently than small fibers. We have therefore revised our wording and no longer state that the model “faithfully recapitulates” the full disease.

      * *Comment on novelty of AMPK findings:

      We agree that AMPK is a canonical metabolic stress sensor and that its activation in the context of severe mitochondrial dysfunction is not, by itself, unexpected. We have therefore revised the discussion to better frame the novelty of our study. In our view, the main contribution is not the mere observation of AMPK activation, but the demonstration, in frataxin-deficient primary sensory neurons, that AMPK activation is functionally linked to the defect in soma growth/maturation and that pharmacological AMPK inhibition can rescue this phenotype. We hope this distinction is now clearer in the revised manuscript.

      * Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In the Present study, the authors develop a new model of FA in cultured DRG neurons, and show its relation with Fe_S deficiency. It is also associated with defects in mTOR signaling, ALA synthesis and AMPKs

      The conclusions convincing and the work is thorough. The results are well presented and easily understood and repeatable.

      Reviewer #2 (Significance (Required)):

      While there have been hints at some of the findings ( references to AMPK), there have not been so well documented before. Thus they are important Is there any evidence of the present finding on cell size in the clinical literature ( pt size, cell size) in non DRG tissue? ( Patient size etc) Might the present findings reflect a developmental event that drives the spinal cord hypoplasia.*

      We sincerely thank Reviewer #2 for the very positive evaluation of our work. We are grateful for the recognition of the rigor, clarity, and reproducibility of the study, as well as for highlighting the relevance of our findings linking frataxin deficiency to Fe-S cluster impairment, mitochondrial dysfunction, and alterations in AMPK and mTOR signaling, as well as lipoic acid metabolism.

      We also thank the reviewer for the insightful comment regarding the potential relevance of our observations on reduced neuronal soma size.

      To our knowledge, there is no direct clinical evidence describing reduced neuronal cell size per se in patient tissues outside of the DRG. However, neuropathological studies of FA consistently report hypoplasia and atrophy of the DRG__, characterized by a marked reduction in the size and number of sensory neurons, particularly affecting large neurons. These features are widely interpreted as reflecting a developmental defect rather than purely degenerative loss.__

      More broadly, several studies have described spinal cord hypoplasia__,__ including reduced cross-sectional area of the cord and thinning of posterior columns, which are thought to arise early in disease progression. These observations support the idea that impaired neuronal growth and maturation may be a key component of the pathology.

      In this context, we agree with the reviewer that our findings may reflect a developmental mechanism contributing to the hypoplasia observed in FA__, __rather than solely a degenerative process. Our in vitro data showing reduced soma size in frataxin-deficient sensory neurons, together with the involvement of AMPK/mTOR signaling pathways known to regulate cellular growth, are consistent with this hypothesis.

      We have now revised the discussion to incorporate this point and to more explicitly propose that bioenergetic stress and AMPK activation in frataxin-deficient neurons may limit neuronal growth and maturation during development__,__ thereby contributing to the structural deficits observed in patients.

      At the same time, we have moderated our conclusions to emphasize that our model primarily captures cell-autonomous mechanisms in developing sensory neurons__,__ and that further in vivo studies will be required to directly establish the contribution of these mechanisms to human pathology.

      • Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary In the present study, the authors develop a new model of Friedreich ataxia (FA), a disease caused by frataxin deficieny, using primary cultures of embryonic mouse Dorsal Root Ganglia neurons with complete frataxin depletion. This model reproduces key biochemical hallmarks of FA, including Fe-S enzyme deficiency, mitochondrial iron dysregulation, and oxidative stress. They also observe that these frataxin-deficient neurons exhibit a reduction in soma size. They claim that this defect is mediated by AMP-activated protein kinase (AMPK) hyperactivation and suppression of mTOR signaling, which occurs in response to mitochondrial dysfunction and redox imbalance. They are able to restore soma growth by genetic inhibition of AMPK or treatment with lipoic acid (ALA). The study is carried out meticulously, and the results are generally well presented, with the exception of a few specific experiments that will be noted below.

      Major points: - Mitochondrial iron was measured using the fluorescent iron sensor RPA. However, when using this probe loss of signal can be caused by either increased iron or by loss of membrane potential. Thus, as mitochondrial membrane potential is decreased in the model used, it can not be concluded from the results obtained that mitochondrial iron is increased. To confirm that mitochondrial iron is increased, authors should either use a dequenching approach (as indicated in Petrat F, et al., Biochem J. 2002 362:137-47), or use another mitochondrial iron specific probe.

      • Authors describe that ALA treatment improves mitochondrial function and reduces oxidative stress, and they hypothesize that restored mitochondrial activity may contribute to AMPK downregulation. However, to provide a more mechanistic insight into this observation, it would be advisable to assess whether the indicated treatment is able to restore mitochondrial functionality by performing a Seahorse assay

      • Authors state that their data supports a model in which full frataxin depletion first induces a deficit of Fe-S synthesis, subsequently triggering downstream consequences such as iron dysregulation and oxidative stress. This may be plausible for oxidative stress, as it has been measured at 15 div. However, as alterations in iron homeostasis have not been measured at 15 div. it can not be concluded that they appear later than deficiency in FeS proteins. The authors should measure TfR and FT-L expression at 15div, or alternatively indicate in the discussion that it cannot be concluded whether the alteration in iron metabolism occurs after the deficiency in Fe‑S proteins

      Minor points: Previous studies have reported dysregulation of the AMPK and mTOR signaling pathways in various models of Friedreich's ataxia. It would therefore be appropriate to highlight these findings in the discussion According to authors, Immunofluorescence confirmed efficient mitochondrial localization of mtLplA delivered via AAV9-mediated transduction (Fig. S5A). However, the image provided suggests partial co-localization. This should be acknowledged in the description of the results, or either provide further data or measures confirming such efficient mitochondrial localization.

      Reviewer #3 (Significance (Required)):

      General assessment: Authors present a new model of Friedreich ataxia (FA) in Dorsal Root Ganglia neurons. This new model offers the advantage of being conditional, allowing frataxin deficiency to be induced and enabling the analysis of the emergence of various alterations across different generations. However, it also presents the limitation of inducing a complete loss of frataxin, a condition that does not occur in patients, who typically exhibit only a partial deficiency of this protein. Although the experimental work presented is of generally good quality (aside from some minor issues previously noted), it remains unclear whether the study provides substantial advances to the field of Friedreich's ataxia. The conditional nature of the model would, in principle, allow for a deeper exploration of mechanistic aspects underlying how frataxin deficiency leads to the observed phenotypes; however, this potential is not fully exploited in the current manuscript. In this context, the proposed relationship among energy deficiency, AMPK hyperactivation, and treatment with lipoic acid would be considerably strengthened by analyzing the effects of this compound on mitochondrial respiration Advance: The effects of frataxin deficiency on DRGs had been previously addressed by other authors. In this new model, the authors describe a series of phenotypes, most of which have already been reported in other models of the disease (including models using DRGs). On the one hand, this reinforces the validity of the model, but on the other, it reduces the novelty of the observations presented.*

      • *

      We thank Reviewer #3 for the careful evaluation of our manuscript and for the constructive and insightful comments. We are grateful for the positive appreciation of the overall quality of the study and for the suggestions that helped us improve the rigor and clarity of our work.

      Major points:

      Iron probe

      We thank the reviewer for this important remark. We agree that RPA fluorescence depends both on mitochondrial membrane potential and iron-dependent quenching. To address this point, we performed iron modulation experiments. Treatment with a membrane-permeant iron chelator strongly increased RPA fluorescence in both CT and KO neurons, whereas iron loading with ferric ammonium citrate (FAC) decreased the signal in both conditions. These bidirectional changes demonstrate that RPA is efficiently targeted and remains fully responsive to mitochondrial iron in KO neurons, arguing against impaired probe loading as the primary cause of the reduced basal signal.

      Nevertheless, to exclude any potential contribution of mitochondrial membrane potential differences, we propose to complement these experiments with an independent mitochondrial iron probe, Mito-FerroGreen, which detects mitochondrial Fe²⁺ via a distinct mechanism, independent of mitochondrial membrane potential. We would need about 8 weeks to perform these experiments.

      Effect of ALA on mitochondrial function

      We thank the reviewer for this suggestion. We agree that assessing mitochondrial respiration would provide additional mechanistic insight into the effect of alpha-lipoic acid (ALA). In the original version, we had data showing that ALA treatment restores intracellular ATP levels, suggesting an improvement of mitochondrial function. However, we agree that this is not formal proof. We propose for a revised version to look at mitochondrial membrane potential as a proxy for mitochondrial function. While we agree that Seahorse-based analysis of oxygen consumption would be highly informative, these experiments require substantial time in primary DRG cultures and would significantly delay the revision. But if the reviewer or editor consider this essential, this could be performed.

      Temporal relationship between Fe-S deficiency and iron dysregulation

      We thank the reviewer for this important comment.

      In response, we have now analyzed markers of iron homeostasis (TFR1 and FRTL) at 15 DIV, the same time point at which Fe-S protein deficiency is already evident. These new data show that iron homeostasis is not significantly altered at this stage, supporting our interpretation that Fe-S deficiency precedes detectable changes in iron metabolism.

      We have included these new results in the revised manuscript (Fig. S2E) and clarified the temporal sequence in the results and discussion sections.

      Minor points:

      1. We thank the reviewer for this suggestion. We have expanded the discussion to better acknowledge previous studies reporting dysregulation of AMPK and mTOR signaling pathways in various models of Friedreich ataxia, and we now position our findings within this existing body of work.
      2. We thank the reviewer for this important observation. We agree that the immunofluorescence data indicate partial, rather than complete, co-localization of mtLplA with mitochondrial markers. We believe this is most likely due to high levels of mtLplA overexpression, leading to partial saturation of the mitochondrial import machinery and consequently incomplete mitochondrial targeting. This interpretation is supported by our western blot analysis (Fig. S5B), which shows the presence of two bands corresponding to processed (mitochondrial) and unprocessed (non-imported) forms of the protein. We have revised the text accordingly to more accurately reflect these observations. We thank the reviewer for the thoughtful evaluation of the significance of our work and for highlighting both the strengths and limitations of our model. We agree that our model, based on complete frataxin depletion, does not fully recapitulate the partial deficiency observed in patients with FA. However, we believe that this approach provides a valuable experimental advantage, allowing us to: precisely control the timing of frataxin loss, investigate early cellular events, and dissect cell-autonomous mechanisms in sensory neurons. We have revised the manuscript to more clearly acknowledge this limitation.

      Regarding novelty, we agree that several individual phenotypes observed in our study (e.g., Fe-S deficiency, oxidative stress, mitochondrial dysfunction) have been reported in previous models. However, we would like to emphasize that our model enables the integration of these features within a single conditional system in primary sensory neurons, and importantly allows us to uncover a functional link between bioenergetic stress, AMPK activation, and impaired neuronal growth.

      In particular, our data identify AMPK as a key mediator of soma size reduction, and demonstrate that its inhibition can rescue this phenotype. We believe this provides a novel mechanistic connection between mitochondrial dysfunction and neuronal growth regulation in frataxin-deficient sensory neurons.

      Finally, we have revised the discussion to better highlight both the strengths and limitations of the model, and to more clearly position our findings as contributing to the understanding of early pathogenic mechanisms and developmental aspects of sensory neuron dysfunction in FA.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary In the present study, the authors develop a new model of Friedreich ataxia (FA), a disease caused by frataxin deficieny, using primary cultures of embryonic mouse Dorsal Root Ganglia neurons with complete frataxin depletion. This model reproduces key biochemical hallmarks of FA, including Fe-S enzyme deficiency, mitochondrial iron dysregulation, and oxidative stress. They also observe that these frataxin-deficient neurons exhibit a reduction in soma size. They claim that this defect is mediated by AMP-activated protein kinase (AMPK) hyperactivation and suppression of mTOR signaling, which occurs in response to mitochondrial dysfunction and redox imbalance. They are able to restore soma growth by genetic inhibition of AMPK or treatment with lipoic acid (ALA). The study is carried out meticulously, and the results are generally well presented, with the exception of a few specific experiments that will be noted below.

      Major points: - Mitochondrial iron was measured using the fluorescent iron sensor RPA. However, when using this probe loss of signal can be caused by either increased iron or by loss of membrane potential. Thus, as mitochondrial membrane potential is decreased in the model used, it can not be concluded from the results obtained that mitochondrial iron is increased. To confirm that mitochondrial iron is increased, authors should either use a dequenching approach (as indicated in Petrat F, et al., Biochem J. 2002 362:137-47), or use another mitochondrial iron specific probe.

      • Authors describe that ALA treatment improves mitochondrial function and reduces oxidative stress, and they hypothesize that restored mitochondrial activity may contribute to AMPK downregulation. However, to provide a more mechanistic insight into this observation, it would be advisable to assess whether the indicated treatment is able to restore mitochondrial functionality by performing a Seahorse assay

      • Authors state that their data supports a model in which full frataxin depletion first induces a deficit of Fe-S synthesis, subsequently triggering downstream consequences such as iron dysregulation and oxidative stress. This may be plausible for oxidative stress, as it has been measured at 15 div. However, as alterations in iron homeostasis have not been measured at 15 div. it can not be concluded that they appear later than deficiency in FeS proteins. The authors should measure TfR and FT-L expression at 15div, or alternatively indicate in the discussion that it cannot be concluded whether the alteration in iron metabolism occurs after the deficiency in Fe‑S proteins

      Minor points: Previous studies have reported dysregulation of the AMPK and mTOR signaling pathways in various models of Friedreich's ataxia. It would therefore be appropriate to highlight these findings in the discussion According to authors, Immunofluorescence confirmed efficient mitochondrial localization of mtLplA delivered via AAV9-mediated transduction (Fig. S5A). However, the image provided suggests partial co-localization. This should be acknowledged in the description of the results, or either provide further data or measures confirming such efficient mitochondrial localization.

      Significance

      General assessment: Authors present a new model of Friedreich ataxia (FA) in Dorsal Root Ganglia neurons. This new model offers the advantage of being conditional, allowing frataxin deficiency to be induced and enabling the analysis of the emergence of various alterations across different generations. However, it also presents the limitation of inducing a complete loss of frataxin, a condition that does not occur in patients, who typically exhibit only a partial deficiency of this protein. Although the experimental work presented is of generally good quality (aside from some minor issues previously noted), it remains unclear whether the study provides substantial advances to the field of Friedreich's ataxia. The conditional nature of the model would, in principle, allow for a deeper exploration of mechanistic aspects underlying how frataxin deficiency leads to the observed phenotypes; however, this potential is not fully exploited in the current manuscript. In this context, the proposed relationship among energy deficiency, AMPK hyperactivation, and treatment with lipoic acid would be considerably strengthened by analyzing the effects of this compound on mitochondrial respiration

      Advance: The effects of frataxin deficiency on DRGs had been previously addressed by other authors. In this new model, the authors describe a series of phenotypes, most of which have already been reported in other models of the disease (including models using DRGs). On the one hand, this reinforces the validity of the model, but on the other, it reduces the novelty of the observations presented.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In the Present study, the authors develop a new model of FA in cultured DRG neurons, and show its relation with Fe_S deficiency. It is also associated with defects in mTOR signaling, ALA synthesis and AMPKs

      The conclusions convincing and the work is thorough. The results are well presented and easily understood and repeatable.

      Significance

      While there have been hints at some of the findings ( references to AMPK), there have not been so well documented before. Thus they are important Is there any evidence of the present finding on cell size in the clinical literature ( pt size, cell size) in non DRG tissue? ( Patient size etc) Might the present findings reflect a developmental event that drives the spinal cord hypoplasia.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors have set up a mouse embryonic sensory neuron system to study impact of complete loss of frataxin (using a nice cre-based AAV approach). There is careful delineation of the phenotype of these cells upon complete frataxin loss using a significant range of relevant endpoints (e.g. OCR, oxidative stress, mitochondrial imaging at EM level). A major finding is the failure of neurons lacking frataxin to undergo full soma maturation - so smaller cells. In addition, AMPK is activated (maybe not surprisingly given the severe loss of mitochondrial function and drop in ATP). Solid mechanistic experiments reveal that AMPK activation when blocked prevents the suppression of soma size (we do not get the same data with regard to alanine supplementation). There are interesting studies with alanine that, in part, reverse indices of oxidative stress (mitochondrial stress, specifically).

      The experiments are well designed with mechanistic insight and the data clearly presented with appropriate statistical analysis.

      A major problem is the culture system. The labelling studies and soma size analysis reveal that this is not a truly representative population of DRG neurons. It seems all the small neurons are missing - I assume all trkA positive and GDNF-dependent neurons have been lost somewhere (this comprises 80% of the neurons at the lumbar level). The methods section covering the mouse DRG culture is sparse in terms of details and refers to a text book which I cannot access. Another issue is the background glucose concentration - growing such cells at 25mM is standard I know - but its still sub-optimal. Glucose at this concentration represents a hyperglycemic state - normal glucose is 5-10mM - its not really correct to term it glycolysis inhibitory since hexokinase, the rate limiting enzyme, has a Kd around 0.3-1mM glucose. When studying AMPK this system will exhibit suppressed AMPK activity/expression due to the high background glucose concentration of 25mM.

      Significance

      The use of this unrepresentative culture system does lower the significance. While large caliber sensory neuron, e.g. proprioceptive, dysfunction is important during development and into the adult it seems rather unfortunate that the authors ignore all other sensory neurons! Persons with Friedreich ataxia (FA) also suffer from small fiber abnormalities, e.g. pain, and these neurons actually express a higher density of mitochondria (since they are unmyelinated). So, when the authors state this model "faithfully recapitulates key hallmarks of FA...." I have to say I disagree.

      In terms of general significance the work is well performed with some good mechanistically strong studies, however, it does still contain a major purely descriptive component. The focus on AMPK is understandable but we learn nothing really novel about its function and role in sensory neurons.

  4. Mar 2026
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This study identifies CD81 as an anti-viral host factor that restricts EBOV infection. In-turn, EBOV encodes CD81 antagonism activity in GP and VP40. The two major lines of inquiry of this study are how CD81 restricts EBOV infection and how EBOV antagonizes CD81. These are evaluated using BSL-2 level models and authentic EBOV infection in different cell types, including human primary macrophages. This study underscores the intricate interplay between mechanisms of host innate immunity and mechanisms of virus immune evasion and antagonism.

      CD81 restriction of EBOV

      A previous study published by this group demonstrated that CD81 antagonizes NF-κB signaling. In this study and a very recently published study by another group, VP40 is demonstrated to activate NF-κB signaling. This study shows that VP40 activates NF-κB through a TLR4-independnet mechanism that is antagonized by CD81. This study also demonstrated that CD81 suppresses authentic EBOV infection in HEK293T cells and human primary blood monocyte derived macrophages. Using a transcription and replication-competent tetracistronic minigenome system (trVLPs), this study found that CD81 reduces the abundance of all viral RNA species (mRNA, vRNA genome, cRNA genome complement) but does not interfere with the NP-driven formation of inclusion bodies where viral RNA synthesis occurs. CD81 was also found to suppress cellular micropinocytosis activity and restrict EBOV entry. The CD81-stimulating antibody 5A6 suppressed both trVLP and authentic EBOV infection.

      EBOV antagonism of CD81

      CD81 (as well as the other tetraspanins CD63 and CD9) was identified to be downregulated on the cell surface of HeLa and HEK293 cells transfected with an EBOV GP expression plasmid as a part of a larger screen for EBOV GP's impact on cell surface receptor expression. Using a combination of bimolecular fluorescent complementation (BiFC), fluorescence resonance energy transfer (FRET), and proximity ligation assays, EBOV GP was shown to directly interact with CD81. Imaging analysis showed that EBOV GP colocalizes with CD81 at the cell surface. Here, GP blocks accessibility of CD81 through a glycan shielding mechanism that is reversed with PNGase F treatment. EBOV GP also reduces the total cellular abundance of CD81 protein by inducing CD81 degradation through both lysosomal and proteasome mediated degradation pathways without altering CD81 mRNA transcript levels. The GPs of other relative filoviruses (Marburg virus, Sudan virus, Reston virus, and Taï Forest virus) also downregulated accessible cell surface CD81, CD63, and CD9 in HEK293T cells. This CD81 downregulating activity appears to be both conserved amongst and specific to filovirus GPs, as none of the glycoproteins from 14 other RNA viruses tested (including arenaviruses, rhabdoviruses, influenza viruses, coronaviruses, and retroviruses) significantly altered cell surface CD81, CD63, or CD9 in HEK293T cells. In human primary blood monocyte derived macrophages, authentic EBOV infection was found to reduce the abundance of CD81 and CD9 accessible at the cell surface. EBOV VP40 was the only other EBOV structural protein that downregulated total CD81, though its effect was mild. In contrast to the mechanisms utilized by GP, VP40 was found to induce CD81 degradation mainly through the proteasome mediated degradation pathway.

      Major Comments:

      The claims made by the authors are appropriate and are supported by their data and their use of appropriate controls which yielded the expected results based on references from the literature. There are no new experiments that must be introduced to support the claims made by the authors. The methods section is excellent and provides extensive detail for techniques and organized lists of plasmid and antibody reagents used and their original sources. Graphed data shows individual replicates, representative flow scatter plots and images are show, and appropriate statistical analyses were used and reported. Excitingly, this study opens many new lines of question which can be addressed in future studies.

      The only major comment:

      • It would be valuable to add more discussion around something the data presented in Figure 7 b-d suggest, and that is that EBOV entry appears to be targeted by CD81 by multiple mechanisms. Figure 7d demonstrates CD81 suppresses cellular macropinocytosis activity, which would yield less uptake of EBOV which utilizes PS receptors to be internalized through the macropinocytosis pathway. Since PS receptors recognize the PS in the viral envelope, and not the viral GP, it makes sense that trVLPs pseudotyped with VSV-G were restricted like those with EBOV GP (Fig 7C). However, in the pseudotyped lentivirus system, EBOV GP-mediated entry was significantly suppressed by CD81 while VSV-G mediated entry was not (Fig 7B). Together this data shows that CD81 restricts EBOV entry in both viral envelope-targeted and GP-targeted mechanisms, demonstrating the vast innate immune mechanisms of CD81 against EBOV. I think this is of great impact and should be discussed more.

      Minor Comments:

      Some modest experimental and analysis suggestions that are not required to support the claims of the paper but would add additional depth are:

      • The experiments in Figure 8E would have benefited from collection and titering of supernatants from infected cells.
      • While bafilomycin treatment or MG132 treatment can partially rescue CD81 from the degradation induced by EBOV GP or VP40, neither drug is able to fully rescue this for either viral protein. It would be insightful to assess if co-treatment of bafilomycin and MG132 would yield a full rescue of total CD81. Based on the presented results, I would expect co-treatment to fully restore total CD81 in VP40-expressing cells, but I would expect co-treated GP-expressing cells to still have increased CD81 surface downregulation equivalent to the strength of which the glycans of EBOV GP shield CD81 from recognition. These experiments would give valuable insight into the relative strengths of the glycan shielding effect and induced degradation effects in EBOV GP's antagonism of CD81.
      • The impact of the imaging experiments shown in Fig. 5D would be strengthened with a quantitative colocalization analysis such as Pearson's Coefficient.
      • Another publication (citation 132 in this study) shows cooperativity of GP in VP40's ability to activate NF-ΚB. This study shows in Fig. 6E that CD81's suppressive activity can overcome VP40's activation of NF-ΚB. It would be valuable to assess (in the same format as the experiment done in Fig 6E) if this would remain true in cells co-expressing EBOV GP and VP40; ie, would CD81 still be able to overcome, and at a similar rate, the NF-ΚB-activating activity of VP40 in the presence of the CD81-antagonizing activity of GP.
      • Understandably, the VP40 gene was used as a probe in the Fig. 6 trVLP experiments because it is encoded within the tetracistronic minigenome. However, this became mildly confusing when reading Figure 6 because other parts of the manuscript discuss and measure the effects and activities of VP40. One thought is to probe the VP24 gene. However, a simple way to reduce the initial confusion in interpreting the data in the figure is to just remove "(VP40)" from Fig.6 A-C. The methodology of using VP40 primers to probe for the viral RNAs is adequately detailed in the figure caption and methods section.

      Significance

      This paper gives more insight into another mechanism of innate immune responses of the host and mechanisms of evasion/antagonism by EBOV VP40 and GP. This is the first report of CD81 as an anti-viral host factor of EBOV. This study shows that CD81 interferes with EBOV viral entry and reduces the abundance of EBOV RNAs using different mechanism. Additionally, CD81 was found to have a previously unrecognized negative regulatory role in cellular micropinocytosis. In terms of how EBOV antagonizes CD81, this study demonstrates multiple mechanisms of antagonism and more broadly demonstrate the diverse mechanisms of innate immune evasion encoded by EBOV. Building on previous reports of the N-linked glycans of EBOV having a steric shielding effect that blocks accessibility of cell surface human leukocyte antigen class-1 (HLA-I) and MHC class I polypeptide-related sequence A (MICA), this study found that these glycans also sterically shield the accessibility of cell surface CD81. This opens important new lines of inquiry, such as: what else do the EBOV glycans sterically interfere with at the cell surface?

      This group previously demonstrated that CD81 suppresses NF-κB activation and this study builds on those observations by demonstrating that despite the CD81-antagonizing activity of VP40, VP40's ability to activate NF-κB can be overcome by CD81. This is in line with some of the most impactful findings in this paper, namely that despite encoding CD81 antagonism activity in at least GP and VP40, EBOV remains susceptible to CD81 activation, as demonstrated by inhibition of both the trVLP system and authentic EBOV infection by the CD81-stimulating antibody 5A6.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript entitled "CD81 is an Ebola virus inhibiting factor that is antagonized by GP and VP40" by Hu et al. investigates the role of the tetraspanins CD81, CD63, and CD9 during Ebola virus infection. They found that CD81, among the three tetraspanins, plays a major role as a cellular antiviral factor by interfering with EBOV glycoprotein and VP40. CD81 suppressed NFkB signaling and was found to restrict EBOV replication and VLP uptake. Overall, the study design, choice of experimental approaches, and presentation of the data meet an excellent scientific standard. The figures and their legends are comprehensive and very clearly written. However, I recommend that the authors make the following clarifications:

      Major:

      1. I recommend including a figure that depicts the domain organization of the tetraspanins, as this would help readers better appreciate how these three tetraspanins differ from one another. Did the authors determine the minimal region of CD81 required for interaction with EBOV GP or VP40? How are tetraspanins trafficked from the plasma membrane to intracellular compartments or into extracellular vesicles, and are these trafficking pathways also altered during EBOV infection?
      2. Given the clear role of VP40 in this CD81-dependent mechanism, it is important to demonstrate whether VP40 and CD81 interact directly. As the BiFC assay did not resolve this question, I recommend using a complementary approach, such as co-immunoprecipitation followed by western blotting, to address it.

      Minor:

      1. Please clarify how the '+' and '++' GFP categories are quantitatively defined.
      2. In Figure 4b (right panels), what explains the different effects of the DMSO control on surface versus total CD81?
      3. For clarity, I suggest defining the exact numerical boundaries of the individual domains shown in Figure S2C.
      4. In Figure S3C, the data presentation could be improved by using different colors for the control and KO groups, or by increasing the size of the symbols representing the data points.

      Significance

      This article is well written, and the study underscores the critical role of the tetraspanin CD81 as a cellular antiviral factor during EBOV infection and defines its role in filoviral immune response regulation. This article can be accepted for publication after the minor revision.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      To identify cell surface receptors modulated by GP, the authors performed a flow cytometry-based screen using the LEGENDScreen Human Cell PE Kit, which targets 332 host cell surface markers. Among the validated hits in two human cell lines, the authors focused on the tetraspanin CD81 as its expression was selectively reduced by EBOV GP and other filovirus GPs, but not by GPs from unrelated viruses, suggesting a filovirus GP-specific mechanism. This is analogous to the downregulation of CD81 expression by viral proteins such as Vpu and Gag from HIV, and NS5A from HCV. The authors also found that VP40 overexpression reduced CD81 levels, potentially enhancing VP40-mediated NF-κB activation. The authors suggest that CD81 reduction may result from degradation led by GP-CD81 interaction. CD81 downregulation was also observed in infected monocyte-derived macrophages (MDMs). Detailed analysis using CD81-KO cells and the transcription- and replication-competent VLP (trVLP) system demonstrated that CD81 is involved in EBOV entry and replication steps. While these data provide key insights, concerns remain regarding their statistical significance and interpretation.

      1. Have the authors investigated the functional consequences of CD81 downregulation by GP, VP40, or viral infection? In particular, could this enable superinfection? This can be examined using the approaches used in the manuscript.
      2. "Fold of modification (GP-/GP+)" in Figure 1a does not appear to match the results presented in Figure S2 and Table S2.
      3. Where appropriate, please indicate 'n.s.' for comparisons that are not statistically significant. With n = 3, the results may be unreliable; increasing the number of replicates to five would be recommended, as this is critical for supporting the manuscript's conclusions.
      4. Although the authors conclude that GP suppresses or counteracts CD81-mediated inhibition of viral replication (e.g., VP40 protein expression) based on experiments using trVLPs with or without GP, the presented data do not support this conclusion. In fact, higher VP40 expression was detected in trVLPΔGP-infected cells compared to trVLP-infected control cells (Fig. 3a and 3b), or no statistically significant differences was provided. These results seem inconsistent with the authors' interpretation and require clarification.
      5. Increased p65 expression does not necessarily indicate activation of p65 or NF-κB signaling. Indeed, VP40-induced or infection-induced increase in p65 expression level was not significantly different between wild-type and CD81-KO cells (Fig. 6c and 6d). To properly assess the NF-κB activation, the phosphorylation status of p65 and/or nuclear translocation should be examined.
      6. The authors suggest that CD81 is involved in macropinocytosis based on experiments using CD81-KO cells (Fig. 7) and anti-CD81 antibody (5A6 clone) (Fig. 8). Have the authors examined whether CD81 regulates macropinocytosis-associated signaling pathway (e.g., the PI3K/AKT1 pathway)? It is possible that AKT1 is constitutively activated in CD81-KO cells, given the increased dextran uptake. Such analysis would strengthen the authors' claim.
      7. In the experiments assessing viral entry in CD81-KO and control cells, both cells were co-transfected with Tim-1. Have the authors confirmed that Tim-1 expression levels were comparable between KO and control cells?
      8. The VP40-CD81 interaction was assessed only by PLA, but the results were not shown due to high background signals. Other methods, such as co-IP or the BiFC assay used in the manuscript, could yield clear data and deepen the discussion.
      9. In the Fig.4b, both the number of GFP-positive cells and the GFP intensity are noticeably lower than in other similar experiments (e.g., Fig. 1c and Fig. S4a). The "GP ++" population is much smaller and difficult to define or gated. Please clarify this discrepancy.
      10. The statement that "VP40-mediated downregulation of surface CD81 was strongly blocked by MG132 and partially by BafA1 (Fig. 4b)" is not supported by the data shown.
      11. In Fig.4c, assessing the role of GP glycan shield, is there a statistically significant difference between GP and GPΔmucin? It appears that deletion of the mucin domain does not affect the structural shielding of CD81, whereas PNGase treatment does.
      12. In Fig.5c, please enlarge the PLA image for better visibility.
      13. CD81 localization in the presence of GP differs between Fig. 5c and Fig. 5d under similar conditions. In the Fig. 5d, GP redistributes CD81 to both the cytoplasm and the cell surface. Please clarify this discrepancy.
      14. In Fig. 8a, the percentage of GFP-positive cells (infected cells) are very low, up to 5%. What MOI was used, and can the effect of the CD81 antibody on infection be reliably evaluated under this condition? Statistical significance should compare CD81 antibody with the isotype control, not with no antibody.
      15. The authors use the term "multiple" (e.g., "multiple cell lines" and "CD81 inhibits multiple steps throughout the viral life cycle"); however, this wording feels overstated as two cell lines were used and the two steps (entry and replication steps) are inhibited.
      16. Please cite the following article where relevant: Nanoscale organization of tetraspanins during HIV-1 budding by correlative dSTORM/AFM (Nanoscale, 2019).
      17. What primary antibodies are used in the PLA. Please describe them in the method section.
      18. Lines 468-471. Please provide the relevant references for the GP mutations tested.

      Significance

      To identify cell surface receptors modulated by GP, the authors performed a flow cytometry-based screen using the LEGENDScreen Human Cell PE Kit, which targets 332 host cell surface markers. Among the validated hits in two human cell lines, the authors focused on the tetraspanin CD81 as its expression was selectively reduced by EBOV GP and other filovirus GPs, but not by GPs from unrelated viruses, suggesting a filovirus GP-specific mechanism. This is analogous to the downregulation of CD81 expression by viral proteins such as Vpu and Gag from HIV, and NS5A from HCV. The authors also found that VP40 overexpression reduced CD81 levels, potentially enhancing VP40-mediated NF-κB activation. The authors suggest that CD81 reduction may result from degradation led by GP-CD81 interaction. CD81 downregulation was also observed in infected monocyte-derived macrophages (MDMs). Detailed analysis using CD81-KO cells and the transcription- and replication-competent VLP (trVLP) system demonstrated that CD81 is involved in EBOV entry and replication steps. While these data provide key insights, concerns remain regarding their statistical significance and interpretation.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Quantitative interactome mapping of skeletal muscle insulin resistance Ng et al present a series of proteomics/interactomics studies in skeletal muscle to identity insulin regulated complexes/interactions and changes ot these in insulin resistant muscle. More mechanistically, the Authors focus on changes in interactions involving chaperones in the ER/SR, presenting interesting data on the effect of PDIA6 overexpression alters insulin sensitivity in muscle ex vivo.

      Major Comments:

      The section entitled "Validating the regulation of PPIs with insulin resistance in C2C12 myotubes with quantitative XL-MS". This is not really a validation of th previous data as presented, but more an orthologous assay that helped pinpoint the interest in the ER. Suggest adjusting the title.

      Figure 3B - the "decrease" in AS160 pS588 regulation appears to be due to increased basal, not decreased phosphorylation in after insulin. This should be commented on or clarified.

      PDIA6 is down-regulated in muscle from people with T2D - so why did the authors decide to overexpress PDIA6? I note this rationale is explained in the discussion, and could be articulated better in the results.

      Figure 5J and K. The TA muscles are substantially larger from PDIA6 OE mice. Are the muscle fibres also larger? Tbhis relates to the normalisation of data in K. This appears to be normalised to g tissue. If so, is the difference between control, and OE mice being driven by the increase in muscle mass - with uptake per muscle or per fibre the same?

      Minor Comments:

      For the PCP-MS data form C2C12 cells. The authors use an analysis of AUC to assess protein abundance, which, as they state, is important for chronic treatments if total protein is not separately quantified. However, the analysis of changes in protein distribution is less clear from the text in the results section. Intuitively, a profile that is normalised to total intensity in all fractions would provide a protein abundance-independent read-out for changes in protein distribution. Does the "local analysis" capture this same information? Could the Authors provide a little more information here?

      Figure 1M - are the Authors sure that VPS41 should be in this panel. It doesn't seem to be insulin regulated, and the arrow appears to refer to movement between insulin sensitive and insulin resistant.

      Figure 1N - "This includes an array of TBC1 domain-containing proteins (TBC1D15, 195 TBC1D17, TBC1D8B) that are consistently reduced with IR". Do the Authors mean the abundance was less, or that complex formation was reduced?

      Optional. In general, there is a lot of text discussing the literature around proteins highlighted in the analysis. This is useful to an extent, but the Authors might consider streamlining this a little (perhaps moving some of the information ot supp tables?).

      Why do the Authors think the crosslinking MS was not able to capture acute PPI changes like the PCP-MS was?

      For the EDL crosslinking data. Are the Authors able to provide a comparison with C2C12 data - to highlight the differences and similarities between tissue and the cell model? This may be a challenge if the authors think most differences may be technical.

      Please check - "reduces free-glycerol levels essential for fatty acid synthesis". Glycerol does not directly contribute to FA synthesis. But is needed for triglyceride synthesis.

      Do the Authors think that the change in PDIA6 interactions may be a general/indirect indication of changes in ER redox and/or protein misfolding in insulin resistance?

      Is PDIA6 an ER luminal protein? If so, it being phosphorylated is interesting.

      Referees cross-commenting

      Similarly, reviewer #1 raises important points on the description of key parts of the analysis, that will need to be addressed. I think we agree that the manuscript emcpmpasses a great deal of data, and that it is somewhat difficult to follow why PDIA6 was selected for validation. Overall, the reviews pick up on different aspects of the manuscript that could be improved.

      Significance

      Overall, the strength of the paper is in the underlaying proteomics workflows and analysis. The work presented of very high technical quality, and I have no doubt the data presented will be of use to the field beyond the analysis in this current publication.

      However, a weakness is doubts over the relevance of the data on PDIA6 overexpression in muscle insulin resistance.

      This will be of interest to those in the proteomics, interactomics and metabolism fields.

      My expertise is in glucose metabolism, insulin signalling and insulin resistance.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Ng et al. use a combination of quantitative (structural) proteomics tools to study the effect of insulin resistance (IR) on protein interactions in mouse muscle cells (C2C12 myotubes) and muscle tissue. First, the authors use protein correlation profiling (PCP; co-fractionation) to separate protein complexes from C2C12 cells with and without induced insulin resistance. The PCP data is then used to reconstruct protein complexes and networks based on binary interactions and to uncover changes upon IR, this is illustrated by several examples (Fig. 1). As an orthogonal method quantitative cross-linking (XL) is used to study protein interactions in C2C12 cells. A multidimensional enrichment/fractionation scheme is designed to increase the number of cross-links/protein-protein interactions (PPIs), and again interaction networks are generated and changes between no IR vs. IR conditions assessed (Fig. 2). The XL approach is then carried forth into a mouse model of IR. Muscle tissue of mice fed on regular chow or on a high-fat diet are compared (Fig. 3). Selected XL data is validated on known structures and quantitative data from PCP and XL is compared to integrate the different interactome data (Fig. 4). Finally, the interactions of PDIA6, a key protein found to be affected by IR, are studied in more detail in the mouse model. The effect of overexpression of PDIA6 is studied using different readouts, including redox proteomics (Fig. 5). These data connect the redox imbalance / cysteine oxidation with the role of PDIA6 in insulin resistance.

      Overall, the manuscript is impressive with respect to the methodological effort undertaken, generating and combining many large-scale proteomics data sets, both from a cell line and from mouse tissue. The large amount of data accumulated can be seen as a strength and as a weakness, because it is impossible to follow up on all findings (changes of interactions induced by IR observed in any sample and with any method). Nevertheless, the PDIA6 example was evaluated in more detail and with dedicated follow-up experiments. The conclusions from this experiment are plausible and presented logically. It is, however, difficult for a reviewer to quickly judge whether there would have been more promising leads for validation experiments than PDIA6.

      Given the large amount of data already generated, I do not have suggestions for additional experiments. Instead, some analysis and interpretation of existing data need further clarification.

      Major comments

      The search strategy and statistical treatment of the cross-linking data need to be explained more clearly. The authors write that the data were searched with pLink2 and the FDR was controlled at 1%. However, it remains unclear whether the FDR was controlled at the level of peptide/cross-link-spectral-matches (PSMs/CSMs), or at the level of non-redundant site pairs or even protein-protein interactions for inter-protein links, which would be more appropriate. Controlling the FDR only at the PSM/CSM level will lead to an inflation of false positives when aggregating results at the interaction level. Moreover, it remains unclear whether all data sets were searched together or whether individual data sets or subsets of the entire data were searched individually. For example, if single files or only files belonging to a certain higher-order fraction would be searched individually, this would again lead to an underestimation of false positives. What is particularly noteworthy in this context is that in a typical dataset inter-protein links are underrepresented compared to intra-protein links for statistical reasons, while the numbers in this manuscript are much more balanced (almost equal numbers in the cell line data set, and even more inter-protein links than intra-protein links in the tissue). The authors should consider articles that discuss FDR control in XL-MS, for example by the Rappsilber group. Minimally, more details about what was searched together and at what level the FDR was controlled need to be provided.

      On a similar note, it has been discussed in the literature that validating large-scale XL data on selected structures of complexes is a poor proxy for accurate FDR control, as such complexes are commonly not representative for more transient or substoichiometric PPIs, or PPIs involving low-abundant proteins.

      Minor comments

      Figure 1: CCT is a complex composed of eight subunits, but only seven are shown. What happened to the remaining one (CCT6)?

      The authors performed a redox proteomics experiment in a PDIA6 overexpression system. However, the statement in the Conclusion section that PDIA6 overexpression promotes disulfide bond formation in interacting proteins is not directly justified because the method only quantifies cysteine oxidation, not S-S bond formation directly.

      All supplementary data is not provided in an independent repository, but in a repository of the authors' institution. It is unclear whether the data could be accessed anonymously. Proteomics data need to be provided in an independent, community accepted repository such as from the members of the proteomeXchange consortium (PRIDE etc.).

      A clear description of what is shown in the SI tables is missing, e.g. in the form of figure legends. In their present form, SI data are difficult to interpret. For example, I did not find information about cross-link identifications, only quantitative data on cross-link changes. However, if the identification of a cross-link is not confident in the first place (see my comments above), then the quantification will be irrelevant.

      Referees cross-commenting

      I trust the expertise of reviewer #2 on matters related to insulin resistance. It seems that we both agree that the PDIA6 example might require a more consistent justification throughout the manuscript.

      Significance

      The study is one of only a few so far that combines PCP and XL on such a large scale for a mammalian system. There are also very few studies of cross-linking on tissue. Therefore, from a methodological point of view, the study is highly innovative. The application to the muscle cell system and insulin resistance as a biological research question is furthermore very novel. As such, the study is valuable to different communities - those developing and refining experimental methods and those using them to uncover regulatory mechanisms. Another strength is that the authors made serious efforts at each step to optimize the XL method and adapt it to their sample types of interest.

      The wealth of data is both a strength and a weakness of the work. Inevitably, a reviewer might argue that some aspects of the work could have been done differently. Unless someone spends a lot of time going deep into the result tables, it will be difficult to make constructive suggestions on additional targets for further investigation. Nevertheless, some statistical aspects of data analysis need to be clarified, and parts of the data analysis might need to be repeated. This, in turn, may require some reinterpretation of findings related to the XL data.

      Advances: Conceptual, methodological, mechanistic

      Audience: Specialized, basic research, translational

      Reviewer expertise

      My background is in proteomics, structural proteomics, mass spectrometry, analytical sciences, experimental methodology, and computational data analysis. I have general knowledge of biological processes, but I am not an expert on insulin resistance.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements

      Our study identifies characteristics of secretory signal peptides in fungi, and how their sequence determines which alternative pathways that proteins take to the endoplasmic reticulum. All 3 reviewers grasp this, and agree that the study is publishable. Reviewer 3 puts it well, that we "convincingly show that the length of the hydrophobic helix in a signal peptide is the main factor distinguishing [...] pathways. This simplifies a previous model [...] provides a modest but important advancement to the field of protein secretion. ... The study extends its computational analysis beyond the model yeast Saccharomyces cerevisiae to a diverse range of fungal species."

      Thank you to all the reviewers: we found the reviews fair and constructive. and have addressed them in full.

      In the process of responding to reviews, we softened the claim in the title to "Protein secretion routes in fungi are predicted by the length of the hydrophobic helix in the signal sequence". We also reorganised the manuscript to put the cross-fungal analysis first, followed by the more detailed mechanistic analysis. We feel that this leads a broader audience through the story more effectively. This reorganisation also moved some material from introduction to discussion. Also on larger-scale changes, we reformatted the materials and methods section as requested.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript the authors analyze characteristics of secretory signal peptides in fungi. They identify length of the hydrophobic core rather than overall hydrophobicity as the parameter that determines whether proteins use SRP-dependent cotranslational import through the Sec61 channel, or SRP-independent posttranslational translocation through the hetero-heptameric Sec complex to enter the ER.

      Major comments

      1. The authors need to adequately use the existing nomenclature in the field:

        There is no 'Sec63 translocon'. Proteins with more hydrophobic signal sequences are targeted to the ER by SRP and its receptor, and these proteins are translocated cotranslationally by the Sec61 channel (aka the translocon). Proteins with less hydrophobic signal sequences are imported into the ER postranslationally by the Sec complex consisting of the Sec61 channel and hetero-tetrameric Sec63 complex (Sec62, Sec63, Sec71, Sec72).

        Sec63 on its own also contributes to co-translational import (Brodsky et al, PNAS, 1995), so the term 'Sec63 translocon' is really confusing and should be replaced by the standard nomenclature as above throughout the paper.

      We sincerely appreciate the advice in correctly navigating terminology in the secretion and translocation field. We now say "Sec complex", and not the incorrect "Sec63 translocon". In the same spirit, we have replaced the terminology "Sec63-dependent" with "Sec-dependent", which is a more accurate description of the overall role of the Sec complex. For example, Ast et al. primarily assayed dependence on the Sec complex using sec72∆ strains.

      The paper should contain a proper methods section.

      We have reformatted the manuscript with a separate materials and methods section in the main manuscript, per Genetics/G3 journal family guidelines.

      The authors should explain more explicitly the differences of the Phobius and DeepTMHMM algorithms. Why was that particular algorithm chosen for comparison to Phobius?

      We initially focused on algorithms that distinguish SPs and TM sequences in a single tool, which both Phobius and DeepTMHMM do. This differs from other algorithms such as the SignalP family, that do not also predict TM sequences - SignalP version 4.0 onwards was indeed trained to exclude TM sequences from their predictions (PMID: 21959131).

      In response to this and the similar comment from reviewer 2, we expanded our analysis to compare with the SignalP6.0 algorithm as well as DeepTMHMM.

      Minor comments

      • p2, para 2: ER protein import has been studied for 50 years, and its complexity been obvious for well over a decade

      We corrected this to "However, detailed functional investigations of secretion mechanisms in eukaryotes have focused on a handful of model yeasts and mammalian cells, revealing unexpected complexity"

      • p2, para 3: ref for the signal sequence should be one of the original Blobel papers instead of [8]

      We added the citation to Blobel and Sabatini, 1971, and kept the 1979 citation as we find the additional context is helpful to readers.

      • p3, para 1: ref for SRP should be Walter, Ibrahimi, & Blobel, JCB 1981, instead of [11]

      We added the original citation, and again kept the more modern citation that summarizes the field in decades following initial discovery.

      • p3, para 1: NB: SRP and its receptor do NOT translocate anything, they TARGET proteins to the ER

      We have corrected this, thank you.

      Reviewer #1 (Significance (Required)):

      The authors report an interesting observation which is of interest to the field and sufficiently well documented in this manuscript to be convincing. The paper does extend our understanding of the critical characteristics of secretory signal peptides.

      A limitation of all signal peptide prediction by current algorithms is that they are trained on 'standard' signal peptides and tend to miss ones that do not sufficiently conform to the standard parameters.

      Thank you for this point, the "standard/non-standard" conceptualization is helpful and we now mention this in our expanded discussion. We agree that testing the limits of these models would involve experimental screening of non-standard or non-natural sequences.

      Reviewer's expertise: SRP and Sec61 channel structure/function analysis, cell-free assays for ER protein import, yeast genetics

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Review of manuscript of Sones-Dykes et al. entitled: 'Protein secretion routes in fungi are mostly determined by the length of the hydrophobic helix in the signal peptide'

      This manuscript deals with the important question of how different fungi exhibit variety in protein targeting to the secretory pathway mostly using bioinformatic sequence analysis. This is important for understanding the evolution of the diverse targeting routes within the early secretory pathway, but also for biotechnology since diverse fungi are used as "biofactories" in biotechnological production of secreted proteins. While the results of the current study mostly confirm the analyses already carried out in S.cerevisiae, the work is important and warrants publication in a suitable journal.

      We appreciate this positive and balanced appraisal.

      Major points:

      1. Could the authors elaborate what was the motivation to use Phobius and not some other signal peptide predictor? I am wondering because of the cited Ast et al. paper is already several years old and new improved prediction tools such as the latest SignalP iteration have been developed since that study.

      The main motivation to use Phobius, and check with DeepTMHMM, was that these tools simultaneously predict cleaved signal peptides and transmembrane helices, unlike other tools that predict only cleaved signal peptides and can give false positives with N-terminal transmembrane helices.

      To clarify this point, we also emailed Prof. Henrik Nielsen, the lead developer of SignalP. I asked: "Although we mostly used Phobius prediction and also compared to DeepTMHMM, reviewers have asked us to also compare to SignalP. A critical part of our argument is about predictions of the h-region length, so we would like to compare h-region lengths to SignalP4.1 HMM mode in addition to SignalP6.0."

      Prof. Nielsen replied:

      As for your question, I must tell you that SignalP 4.1 does not have an HMM mode at all. The last SignalP version to have an HMM mode was 3.0. Therefore, 4.0, 4.1, and 5.0 do not output signal peptide regions; this was first reintroduced with version 6.0. See also the FAQ tab at the website.

      *You could try to install version 3.0, but for your purpose, I would not recommend it. The old HMM module had a strong preference for certain h-region lengths because of a specific kind of overtraining. This was, at least partially, solved in Phobius through regularization of the length distribution. Since h-region length is a crucial parameter in your analysis, I would not trust the region assignments by SignalP 3.0. You are welcome to cite me for that to the reviewers, if needed. *

      But comparing the region assignments between Phobius and SignalP 6.0 will be interesting.**

      Regarding SignalP3.0, we now cite Liaci et al., who analysed all experimentally verified eukaryotic signal peptides using SignalP 3.0, and Xue et al., who analysed S. cerevisiae signal peptides, and both arrived at similar conclusions that cleaved signal peptides have hydrophobic regions of length 8-14 amino acids.

      Also, we have expanded our analysis to also compare Phobius and SignalP6.0 predictions of entire signal peptides and of h-regions. The comparisons are now in Figures 4, S3, and S4.

      I am slightly puzzled by the analysis of the annotation of the Sec63- and SRP-dependent targeting sequences presented in Fig. 1. Could the "SRP-dependent" sequences with long hydrophobic sequences simply be called transmembrane helices? Based on structure of the SPC, it has been proposed that cleavable signal peptides with h-regions beyond 18 residues are extremely rare so I would imagine that majority of these sequences are longer transmembrane segments.

      The point of this figure is to compare lists of proteins that are experimentally verified to be Sec-dependent or SRP-dependent in their targeting, so that's the correct way to refer to them for the purpose of this analysis. Yes, the conclusion of this paper and other work (e.g. Ast et al.) is that these SRP-dependent sequences with long hydrophobic sequences are mostly transmembrane (TM) helices.

      I appreciate the analysis of protein targeting features in evolutionarily distinct fungal species, but since the authors highlight importance of fungi in heterologous industrial protein production, it would have been satisfying to see some of these fungi included in this analysis. In particular, Pichia pastoris and Trichoderma reesei are commonly used fungi with apparently a highly specialized secretory machinery capable of very high production levels of different secretory proteins. I would urge the authors to consider the aspect of selecting optimal secretion signals for these industrial fungi and perhaps include some discussion of it in this manuscript.

      We added Pichia pastoris (Komagataella phaffii) and Trichoderma reesei to the analysis. We appreciate the suggestion to discuss optimal secretion signals, however, our analysis doesn't directly address that so we chose to leave that point out.

      Minor points:

      1. The authors state that both Sec63 and SRP pathways converge at the Sec61 translocon. However, we now know that targeting of proteins to Sec61 is even more complicated and for example the EMC is a complex that delivers some proteins to Sec61. It might be appropriate to cite some recent reviews on complexity of early protein targeting to Sec61 in the Introduction.

      As a review of complexity of early protein targeting, we cite a Aviram and Schuldiner 2017 (Targeting and translocation of proteins to the endoplasmic reticulum at a glance). We could add other citations if the reviewer considers this to be necessary.

      Page 5. The authors repeat the compound hydropathy analysis of Ast et al. and used the earlier reported 9-amino acid window for this. Is this analysis result robust with other window sizes?

      Ast et al., checked that this result is robust to window sizes of 9, 11, or 19 aa, in their Figure S1A, which we now specifically mention. In our manuscript, we instead check robustness to different hydropathy scales and prediction algorithms.

      Page 12. Authors state that "cleaved signal peptides do not need to span a membrane". A recent structure of the signal peptidase complex (PMID: 34388369) directly suggests that the signal peptide does span the membrane immediately before its final cleavage. Importantly, the SPC thins the membrane in this region to accommodate the shorter signal peptide h-region and this is proposed as a basis for SPC discriminating between signal peptides and longer transmembrane segments. It would be appropriate to cite this paper in the Discussion.

      Thank you for bringing this important paper to our attention. We have clarified our wording here and cited Liaci et al (PMID: 34388369) in the updated manuscript. Both for the detailed structural discussion, and for similarly concluding that in mammals "Signal peptides possess short h-regions".

      Reviewer #2 (Significance (Required)):

      Protein targeting into the early secretory pathway is an important general concept, and recent years have revealed many new aspects into the diverse mechanisms that cells employ for targeting of proteins with diverse folding needs by use of protein-specific targeting sequences. Also, how proteins are targeted is an important biotechnological question as choice of e.g. the signal peptide can have a dramatic impact on quantity and quality of the produced protein.

      This work is generally interesting to cell biologists studying mechanisms of protein targeting, but the results are mostly confirmatory. Still, no-one has carried out such analysis and fungi are remarkably diverse with potential for new innovations in protein targeting and therefore, the work should be published in my opinion. The suitable audience in my view is quite specialized and could be cell biologists with high interest in fungal protein secretion or biotechnologists using fungi for heterologous expression. For the latter, I would request the authors to extend the data analysis to a few more most biotechnologically relevant fungi and add some discussion on choice of signal peptide in biotechnological protein production in fungi.

      We appreciate this fair perspective. Indeed, we have added analyses of the biotechnologically relevant fungi Komagataella phaffii (Pichia pastoris), and Trichoderma reesei.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      This manuscript revisits the analysis of hydrophobic forces driving endoplasmic reticulum translocation in fungi. Sones-Dykes and Wallace convincingly show that the length of the hydrophobic helix in a signal peptide is the main factor distinguishing SRP-dependent and Sec63-dependent pathways. This simplifies a previous model that relied on a compound hydropathy score, which incorporated both length and hydrophobicity. The analysis, confirmed by Phobius and DeepTMHMM, indicates that length alone is an equally effective and simpler metric for predicting the translocation route in fungi. The study extends its computational analysis beyond the model yeast Saccharomyces cerevisiae to a diverse range of fungal species. It finds that the bimodal distribution of hydrophobic helix lengths-short for predicted Sec63-dependent and long for SRP-dependent proteins-is highly conserved. By broadly identifying proteins with short hydrophobic helixes, the research suggests that the Sec63 translocation route is crucial for cell wall biogenesis and secretion (likely encompassing and the secretion of virulence factors). This provides a functional and pathological context for the translocation pathway choice.

      The manuscript was well written, and its central messages were clear.

      We appreciate this, and are glad that the messages came across clearly.

      Major points:

      • Extension of analysis to human secretome: In Fig 4, the helix length analysis is extended to additional organisms, among them Homo sapiens. It is observed that 'h-region lengths in humans had a similar distribution'. However, as the authors themselves note in the introduction, the functional thresholds of signal peptides are dramatically different in mammalian cells. Without overlaying 'ground truth' data of Sec63-dependence in humans, it is difficult to draw any conclusions about the meaning of h region length on human translocation preferences. I would suggest either: (1) Performing an analysis similar to that done in Fig 1 for the human secretome (2) Removing the human outgroup from the analysis in Fig 4.

      We appreciate the reviewer's point, but decided to keep the human analysis as an outgroup in Fig 4. only. This manuscript focuses on fungi by extrapolating and testing results from S. cerevisiae on other fungi. A mechanistic interpretation of signal peptides in human cells is out of scope due to the mentioned differences in functional thresholds of signal peptides in human cells. However, including humans gives a context that we feel readers would ask for if we did not include it.

      If we wanted to analyse the human signal peptides thoroughly then it would be interesting to extend to a more diverse range of eukaryotes, and extend beyond signal peptide prediction algorithms to structural modeling of signal peptides into cognate translocon structures. That's a whole different project.

      • Incorporate additional cross-validation: Since the key findings from this paper stem from hydrophobic segment predictions, it would be beneficial to augment the conclusions with another independent analysis. The Hessa scale (PMID: 15674282) has the advantage of being a 'biological' hydrophobicity scale defined by transmembrane helix insertion. It would be important to show that the findings obtained with Phobius (e.g. no improvement in categorization with compound score) also hold with this scale.

      Thank you for this helpful and important point. We also performed the analysis with the Hessa scale, included in the updated manuscript as Figure S2. The Hessa scale looks like a better predictor than the Kyte-Doolittle or Rose scales in that the distributions are clearly different for SRP-dependent and Sec63-dependent proteins. However, there is no improvement in classification, both because the Hessa maximum hydrophobicity distributions for SP and TM groups overlap, and also because the 97.5% accuracy of the length-based prediction is already so good that there's no room to improve in classifying this set of S. cerevisiae sequences.

      Minor points:

      • Incorporate GO analysis in Fig 4: Visualization of the GO analysis referenced in the text (Fig 4) may be useful to drive home the point of .

      We have indicated the top enriched GO terms in the paper, and also provided the full GO results in the supplementary data at https://github.com/TristanSones-Dykes/TMSP_Pub. There's not really more information in these GO analyses that makes it worth plotting. For example, for predicted signal peptides in all annotated fungi, "extracellular region" and "cell wall" come up as very highly enriched with extremely low p-values.

      • Cite origin of 'ground truth' protein list: The authors cite 83 and 107 bona-fide Sec63-dependent and SRP-dependent proteins which were used to define the 'ground truth' lists. It would be informative to define how these lists were collected; for example, the Ast et al. paper referenced appears to validate ~40-50 proteins as Sec63-dependent.

      The 'ground truth' protein list was collected and curated in the paper by Ast et al., and thoroughly explained there. In our expanded methods section, we now explain their classification based on localisation/mislocalisation of GFP-tagged proteins in sec72∆ (Sec63 complex deficient) strains. After careful checking, we didn't find any flaws in their analysis or any better yeast datasets more recent than 2013. So, we think the approach of giving a brief description here and referring to Ast et al. for a thorough description is most helpful for readers.

      Reviewer #3 (Significance (Required)):

      This manuscript by Sones-Dykes and Wallace provides a modest but important advancement to the field of protein secretion. While previous work has already identified that Sec63-dependent proteins in baker's yeast have moderately hydrophobic signal peptides, this paper refines this concept and extends it for additional fungal species. It will be of interest to researchers studying protein translocation/secretion pathways and fungal biology.

      Thank you for supporting the main point of our paper. We agree with the assessment, and that this analysis needed to be done to discover if and how results from S. cerevisiae extend to other fungi. We hope that this paper will encourage new work on mechanisms of protein secretion in other fungi, especially of the role of the Sec63 complex.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscript revisits the analysis of hydrophobic forces driving endoplasmic reticulum translocation in fungi. Sones-Dykes and Wallace convincingly show that the length of the hydrophobic helix in a signal peptide is the main factor distinguishing SRP-dependent and Sec63-dependent pathways. This simplifies a previous model that relied on a compound hydropathy score, which incorporated both length and hydrophobicity. The analysis, confirmed by Phobius and DeepTMHMM, indicates that length alone is an equally effective and simpler metric for predicting the translocation route in fungi. The study extends its computational analysis beyond the model yeast Saccharomyces cerevisiae to a diverse range of fungal species. It finds that the bimodal distribution of hydrophobic helix lengths-short for predicted Sec63-dependent and long for SRP-dependent proteins-is highly conserved. By broadly identifying proteins with short hydrophobic helixes, the research suggests that the Sec63 translocation route is crucial for cell wall biogenesis and secretion (likely encompassing and the secretion of virulence factors). This provides a functional and pathological context for the translocation pathway choice. The manuscript was well written, and its central messages were clear.

      Major points:

      • Extension of analysis to human secretome: In Fig 4, the helix length analysis is extended to additional organisms, among them Homo sapiens. It is observed that 'h-region lengths in humans had a similar distribution'. However, as the authors themselves note in the introduction, the functional thresholds of signal peptides are dramatically different in mammalian cells. Without overlaying 'ground truth' data of Sec63-dependence in humans, it is difficult to draw any conclusions about the meaning of h region length on human translocation preferences. I would suggest either: (1) Performing an analysis similar to that done in Fig 1 for the human secretome (2) Removing the human outgroup from the analysis in Fig 4.
      • Incorporate additional cross-validation: Since the key findings from this paper stem from hydrophobic segment predictions, it would be beneficial to augment the conclusions with another independent analysis. The Hessa scale (PMID: 15674282) has the advantage of being a 'biological' hydrophobicity scale defined by transmembrane helix insertion. It would be important to show that the findings obtained with Phobius (e.g. no improvement in categorization with compound score) also hold with this scale.

      Minor points:

      • Incorporate GO analysis in Fig 4: Visualization of the GO analysis referenced in the text (Fig 4) may be useful to drive home the point of .
      • Cite origin of 'ground truth' protein list: The authors cite 83 and 107 bona-fide Sec63-dependent and SRP-dependent proteins which were used to define the 'ground truth' lists. It would be informative to define how these lists were collected; for example, the Ast et al. paper referenced appears to validate ~40-50 proteins as Sec63-dependent.

      Significance

      This manuscript by Sones-Dykes and Wallace provides a modest but important advancement to the field of protein secretion. While previous work has already identified that Sec63-dependent proteins in baker's yeast have moderately hydrophobic signal peptides, this paper refines this concept and extends it for additional fungal species. It will be of interest to researchers studying protein translocation/secretion pathways and fungal biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of manuscript of Sones-Dykes et al. entitled: 'Protein secretion routes in fungi are mostly determined by the length of the hydrophobic helix in the signal peptide'

      This manuscript deals with the important question of how different fungi exhibit variety in protein targeting to the secretory pathway mostly using bioinformatic sequence analysis. This is important for understanding the evolution of the diverse targeting routes within the early secretory pathway, but also for biotechnology since diverse fungi are used as "biofactories" in biotechnological production of secreted proteins. While the results of the current study mostly confirm the analyses already carried out in S.cerevisiae, the work is important and warrants publication in a suitable journal.

      Major points:

      1. Could the authors elaborate what was the motivation to use Phobius and not some other signal peptide predictor? I am wondering because of the cited Ast et al. paper is already several years old and new improved prediction tools such as the latest SignalP iteration have been developed since that study.
      2. I am slightly puzzled by the analysis of the annotation of the Sec63- and SRP-dependent targeting sequences presented in Fig. 1. Could the "SRP-dependent" sequences with long hydrophobic sequences simply be called transmembrane helices? Based on structure of the SPC, it has been proposed that cleavable signal peptides with h-regions beyond 18 residues are extremely rare so I would imagine that majority of these sequences are longer transmembrane segments.
      3. I appreciate the analysis of protein targeting features in evolutionarily distinct fungal species, but since the authors highlight importance of fungi in heterologous industrial protein production, it would have been satisfying to see some of these fungi included in this analysis. In particular, Pichia pastoris and Trichoderma reesei are commonly used fungi with apparently a highly specialized secretory machinery capable of very high production levels of different secretory proteins. I would urge the authors to consider the aspect of selecting optimal secretion signals for these industrial fungi and perhaps include some discussion of it in this manuscript.

      Minor points:

      1. The authors state that both Sec63 and SRP pathways converge at the Sec61 translocon. However, we now know that targeting of proteins to Sec61 is even more complicated and for example the EMC is a complex that delivers some proteins to Sec61. It might be appropriate to cite some recent reviews on complexity of early protein targeting to Sec61 in the Introduction.
      2. Page 5. The authors repeat the compound hydropathy analysis of Ast et al. and used the earlier reported 9-amino acid window for this. Is this analysis result robust with other window sizes?
      3. Page 12. Authors state that "cleaved signal peptides do not need to span a membrane". A recent structure of the signal peptidase complex (PMID: 34388369) directly suggests that the signal peptide does span the membrane immediately before its final cleavage. Importantly, the SPC thins the membrane in this region to accommodate the shorter signal peptide h-region and this is proposed as a basis for SPC discriminating between signal peptides and longer transmembrane segments. It would be appropriate to cite this paper in the Discussion.

      Significance

      Protein targeting into the early secretory pathway is an important general concept, and recent years have revealed many new aspects into the diverse mechanisms that cells employ for targeting of proteins with diverse folding needs by use of protein-specific targeting sequences. Also, how proteins are targeted is an important biotechnological question as choice of e.g. the signal peptide can have a dramatic impact on quantity and quality of the produced protein.

      This work is generally interesting to cell biologists studying mechanisms of protein targeting, but the results are mostly confirmatory. Still, no-one has carried out such analysis and fungi are remarkably diverse with potential for new innovations in protein targeting and therefore, the work should be published in my opinion. The suitable audience in my view is quite specialized and could be cell biologists with high interest in fungal protein secretion or biotechnologists using fungi for heterologous expression. For the latter, I would request the authors to extend the data analysis to a few more most biotechnologically relevant fungi and add some discussion on choice of signal peptide in biotechnological protein production in fungi.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors analyze characteristics of secretory signal peptides in fungi. They identify length of the hydrophobic core rather than overall hydrophobicity as the parameter that determines whether proteins use SRP-dependent cotranslational import through the Sec61 channel, or SRP-independent posttranslational translocation through the hetero-heptameric Sec complex to enter the ER.

      Major comments

      1. The authors need to adequately use the existing nomenclature in the field: There is no 'Sec63 translocon'. Proteins with more hydrophobic signal sequences are targeted to the ER by SRP and its receptor, and these proteins are translocated cotranslationally by the Sec61 channel (aka the translocon). Proteins with less hydrophobic signal sequences are imported into the ER postranslationally by the Sec complex consisting of the Sec61 channel and hetero-tetrameric Sec63 complex (Sec62, Sec63, Sec71, Sec72).

      Sec63 on its own also contributes to co-translational import (Brodsky et al, PNAS, 1995), so the term 'Sec63 translocon' is really confusing and should be replaced by the standard nomenclature as above throughout the paper. 2. The paper should contain a proper methods section. 3. The authors should explain more explicitly the differences of the Phobius and DeepTMHMM algorithms. Why was that particular algorithm chosen for comparison to Phobius?

      Minor comments

      • p2, para 2: ER protein import has been studied for 50 years, and its complexity been obvious for well over a decade
      • p2, para 3: ref for the signal sequence should be one of the original Blobel papers instead of [8]
      • p3, para 1: ref for SRP should be Walter, Ibrahimi, & Blobel, JCB 1981, instead of [11]
      • p3, para 1: NB: SRP and its receptor do NOT translocate anything, they TARGET proteins to the ER

      Significance

      The authors report an interesting observation which is of interest to the field and sufficiently well documented in this manuscript to be convincing. The paper does extend our understanding of the critical characteristics of secretory signal peptides.

      A limitation of all signal peptide prediction by current algorithms is that they are trained on 'standard' signal peptides and tend to miss ones that do not sufficiently conform to the standard parameters.

      Reviewer's expertise: SRP and Sec61 channel structure/function analysis, cell-free assays for ER protein import, yeast genetics

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      we thank the reviewers for their close reading of the manuscript and detailed comments.

      __Reviewer #1 __

      1. The idea that Xrp1 induction switches around 16 h post-IR, becomes RpS12-dependent, and subsequently engages cell competition is interesting and potentially important. However, the evidence supporting RpS12-dependence of Xrp1 induction is currently not sufficiently convincing. For example, based on the images in Figure 6F-supplement 1, the conclusion that Xrp1 is induced in an RpS12-dependent manner appears difficult to support. The authors should strengthen and quantify this result or provide the raw image data. In addition, because this point is central to the authors' model, they should move the key supporting data from the supplementary figures to the main figures to ensure that this critical claim is clearly supported and readily accessible to readers.

      We apologize for confusing all three reviewers with this figure. Actually, Figure 6F supplement 1 does not compare RpS12-dependent and -independent Xrp1-HA expression. Instead, it shows that the rps12-independent Xrp1-HA expression is only mildly p53-dependent, which is consistent with our idea. We had not compared RpS12-dependence or Xrp1 expression in this manuscript because we had published that previously and found a substantial dependency (Fig 1N-P of Ji et al 2021). Because that previous paper used an anti-Xrp1 antibody, and the present paper measures an HA-tagged Xrp1 protein, it is probably a good idea to include the RpS12-dependence of late Xrp1 expression again, using the Xrp1-HA reagent. We have this data, which shows ~75% dependence, which is highly significant statistically. We will include this data in the revised manuscript, within one of the main figures.

      • The authors suggest a model in which Xrp1 executes two qualitatively distinct "modes"(pro-repair/acute DDR and elimination of aneuploid cells), but this remains only partially convincing as currently presented. The authors should at least (i) provide quantitative evidence that could explain how Xrp1 might produce distinct outcomes across phases(e.g., comparing Xrp1-HA levels and/or the fraction of Xrp1-HA-positive cells at 2-4 h versus 16-24 h post-IR), and (ii) explicitly discuss plausible mechanisms in the Discussion. Even if the molecular "switch" is not fully resolved experimentally, a clearer, data-grounded discussion of how Xrp1 could mediate these temporally distinct functions is needed. In addition, since ISR signaling (e.g., eIF2α phosphorylation) has been implicated as a single feature associated with Xrp1-dependent loser elimination, the authors should consider assessing p-eIF2α levels in Xrp1-HA positive cells at early versus late time points after IR(e.g., 4 h vs 24 h).

      We thank the reviewer for highlighting the need for this discussion. We will clarify these issues in the revised manuscript but do not think further experiments are necessary.

      1. It was well established previously and confirmed here that little DNA damage remains ~24h after IR. This is sufficient to explain why there is little DDR at this stage. We will make this clear in the revision.
      2. We did not intend to claim that no cell competition happens during the acute DDR ~4h after IR. We are not aware of experiments showing the DDR is strictly cell autonomous and not influenced by neighboring cells. If the acute DDR is indeed cell autonomous, or mostly so, this could be due to the additional genes induced directly by p53 that are not induced by Xrp1 ~24h after IR. The cell death gene Rpr is one example reported in our paper. We will discuss this in the revision.
      3. The reference to ISR as the single feature inducing Xrp1 expression is referring to two Nature Cell Biology papers published in 2021 (Baumgartner et al 2021; Recasens-Alvarez et al 2021). This idea has not stood the test of time. The ISR reporter activities shown in these papers were later shown to be downstream of Xrp1, not upstream (Langton et al 2021; Kiparaki et al 2022). Langton et al argued that there could be an initial ISR that was too small to be detectable, but this is hypothetical. There are now multiple papers and preprints showing that it is long isoforms of Xrp1 are ISR responsive, but that short isoforms of Xrp1 initiate cell competition, and that RpS12-dependent alternative splicing produces the short isoform. The short Xrp1 isoforms lack the uORF that responds to ISR (Elife 2021 Oct 4:10:e74047; bioRxiv 06.15.659587; bioRxiv 2025.10.29.685279). This is not consistent with the ISR initiating cell competition idea. Because we and others have shown that it is Xrp1 activity that induces eIF2α phosphorylation (Ochi et al 2021, Langton et al 2021, Kiparaki et al 2022), eIF2α phosphorylation in Xrp1 expressing cells would not prove a role for ISR and we do not propose to make these measurements. We are undecided whether to include this discussion of the ISR in the paper. It would lengthen the paper and we do not think it is directly relevant.
      4. The idea that aneuploid cells-or cells with altered ribosomal gene dosage-could be removed via Xrp1-mediated cell competition is intriguing. However, the manuscript does not currently provide any evidence that such cells are, in fact, being eliminated. The authors should therefore (i) quantify cell-level overlap metrics, such as the fraction of γH2Av-positive cells that are Xrp1-HA-positive (and vice versa), as well as the fraction of γH2Av-positive cells that are cleaved Dcp-1-positive (and vice versa) at 24 h post-IR. These quantitative analyses would clarify whether the late Xrp1-HA-positive population corresponds to persistently damaged cells and whether it is enriched for cells undergoing apoptosis/clearance. The authors should also (ii) directly assess aneuploidy/segmental copy-number imbalance in the late Xrp1-HA-positive clusters (e.g., by DNA FISH targeting one or two chromosome arms/regions), and if these experiments cannot be completed within a reasonable revision timeframe, the authors should temper their wording and present aneuploidy and selective elimination as a plausible interpretation supported byRpS12 dependency and prior literature, rather than as a demonstrated conclusion in the current study.

      We agree that aneuploidy is not demonstrated in the current study. Elimination of aneuploid cells with altered Rp gene dose was already established by previous papers. We cited previous work in the manuscript but did not summarize the evidence explicitly, so we are not sure whether the referee was fully aware. Ji et al (2021) created 17 different segmental aneuploidies using Flp/FRT recombination including or abutting 10 different Rp genes, together covering >20% of the euploid genome. The results showed that segmental aneuploidies are largely removed by Rp gene dose-dependent cell competition using the RpS12 and Xrp1 genes. Others have since confirmed that aneuploidies are removed by cell competition and that the effects of Rp gene dose depend on Xrp1 (Fusari et al Cell Genomics 2025). Therefore, we consider it established that aneuploid cells with altered Rp gene dosage are removed by this mechanism. We will discuss this explicitly in the revised manuscript.

      The question of whether cells dying in a p53-independent manner ~24h after irradiation are aneuploid cells undergoing cell competition was also addressed previously. Ji et al 2021 already showed that most of these cells are eliminated by RpS12 and Xrp1, consistent with altered Rp gene dosage, and that preventing cell competition leads to persistence into adulthood of cells that can be recognized at Rp+/- from their bristle phenotype. Evidence was shown that most such cells are segmental aneuploids, consistent with earlier studies of DNA repair mutants (Baker, 1978). We will summarize this in the revised manuscript so that it is not necessary to read the cited references to appreciate the evidence. The only new observation being made in this paper about the ~24h cell death stage is that loss of p53 increases the number of these cells, which could be because inadequate DNA repair leads to more aneuploid cells.

      It is important to appreciate that we do not claim that cells labeled by the DNA damage marker γH2Av are aneuploid, or being removed by cell competition. On the contrary, γH2Av labels cells with unrepaired DNA damage, whereas segmental aneuploidy can only occur as a consequence of completed DNA repair. Thus γH2Av-labeled cells are not generally expected to be Xrp1 positive or undergoing cell competition. Some may be, if they are cells that have both unrepaired DNA damage and repaired DNA damage that led to aneuploidy. We cannot quantify overlap in the existing data, since mouse antibodies for γH2Av and HA-tag were used in separate experiments. Repeating the experiments with different antibodies to measure the overlap would not address any outstanding questions.

      We doubt FISH would be effective at measuring aneuploidy because only gene dose corresponding to the probes would be detected. Only small portions of the genome could be assessed at a time so the frequency at which aneuploidy could be detected would be low. We will make it clear in the revised manuscript that cell competition of aneuploid cells is not a new claim of this paper but something that has been studied before.

      • Regarding the statistical analysis, revisions are warranted. In multiple panels, Student's t-tests are repeatedly performed against the same control, which inflates the family-wise error rate and increases the risk of false-positive findings. In such cases, an overall ANOVA (one-way) followed by an appropriate multiple-comparison procedure-such as Dunnett's-test would be more appropriate.

      This concern applies in particular to:

      Figure 1A- Supplement 1

      Figure 2M-R

      Figure 3Q, R

      Figure 5D

      Figure 5J- Supplement 1

      Figure 6G- Supplement 1

      1. Figure 6I- Supplement 2

      We agree and will apply Anova with multiple comparison procedures in the revised manuscript.

      Minor comments:

      1. Figure 2E is not cited in the text, and it is difficult to tell from the images as presented whether p53DN overexpression suppresses the Gstd-lacZ signal at 4 h post-IR.

      We will replace Fig 2E with a clearer example, and add a quantification of all our data, with statistics, as a supplemental figure. Note that the conclusion is already substantiated by qRT-PCR data (Figure 2M)

      In Figure 4, rpr150-lacZ does not appear to be upregulated by Xrp1 overexpression. Therefore, the authors should revise the figure title to avoid misleading readers, because rpr, a well-known p53-responsive pro-apoptotic gene, is not induced under this condition.

      We will change the Figure title. Failure to induce rpr150-LacZ here is a control to show that Xrp1 overexpression does not induce p53 activity.

      In Figure 6E, based on the data as presented, it is difficult to determine whether cleaved Dcp-1 (cDCP1)-positive cell counts are reduced upon Xrp1 knockdown. The authors should provide clearer representative images and/or include the underlying raw images as supplementary source data to support the conclusion.

      We will replace Fig 6E with a clearer example, and add a quantification of all the data.

      The authors should (i) show raw data points overlaid on summary plots (e.g., dot plots on top of bar graphs/box plots) to convey data distribution and (ii) include higher-magnification insets and/or quantitative localization/overlap analyses where colocalization is central to the interpretation (e.g., Xrp1-HA relative to γH2Av).

      We agree regarding the data display. As discussed later, colocalization is not relevant to the interpretation.

      __Reviewer #2 __

      1. First, authors present evidence that Xrp1 is induced in wing discs exposed to ionizing radiation (IR, known to cause DSBs) and that this induction relies on p53 regulating Xrp1transcription (Figure 1 and S1). Data are clear but there is a puzzling result. Xrp1-lacZ (a reporter of Xrp1 transcription) is induced by IR but independently of p53. These results need attention as they appear to be contradictory (why Xrp1-mRNA but not Xrp1-lacZ relies on p53). Nicely, authors show that Xrp1-lacZ induction relies on Xrp1/Irbp18 autoregulatory feedback. Is the lacZ insertion somehow interfering with the capacity of p53 to bind and regulate Xrp1 expression?

      We agree that it is a puzzling result. We have also noted elsewhere that Xrp1-LacZ does not always reflect Xrp1 mRNA and protein expression (Kumar and Baker 2022). We can add the reviewer's hypothesis to the manuscript, although it does not explain why Xrp1-LacZ is induced by IR

      • Second, authors use a collection of reporter genes and show that Xrp1 regulates, most but not all, Dp53 target genes. It is really unclear whether the reaper-lacZ used in Figure 3L-P recapitulates the induction of reaper by p53. I know this reporter was claimed by other do so, but NOT in the wing disc. I would then remove it as mRNA data are clear.

      rpr150-lacZ was used as a p53 reporter in wing imaginal discs by Wells et al. 2011 (PMC3296280). We will cite this in the revised manuscript. We prefer not to remove it as we also use this reporter for the experiment shown in Fig 4.

      3 Third, authors show that Xrp1, as expected from the previous data in Figure 2 and 3, also mediated the role of Dp53 in inducing cell death, although only partially, and these differences are attributed to the gene reaper (p53 but not Xrp1 target). Dcp1 should be cDcp1 and clones should be magnified in Fig 5E-G.

      We will follow this advice in the revised manuscript

      • First, the impact of Xrp1 on the levels of DNA damage and cell death after 24h of IR are shown in a p53 mutant background (6E1-6E3). Authors should present the data in a clean +/+ background. Quantification of 6F should also be done in the same background.

      This data was presented in a the p53 mutant background to focus on the p53-independent removal of cells by cell competition. We can perform an experiment in the presence of wild type p53 for completeness if desired, but a mixture of DDR and cell competition effects may result.

      Second, hid-GFP is being induced by IR already at 4 h after IR and this induction and this induction relies on p53 and Xrp1 activities as shown in previous figures. Thus, the data presented in 6G-J could be a trivial consequence of the strong perdurance of the GFP protein.

      hid-GFP is not expressed at 4 hours in p53DN and Xrp1 K/D (Fig 3D,E), so the expression in 6G-J cannot be explained by GFP perdurance from the earlier timepoint.

      Third, the role of cell competition (driven by Minute aneuploids) is not demonstrated and relies simply on the potential role of Xrp1 in the late wave of cell death, proposal that has not been demonstrated in this paper either. Indeed, the no-role of RpS12 in the late induction (24 h wave) of Xrp1 (Figure 6 S1-F) reinforces my doubts. Authors should reflect in the introduction and discussion sections the most recent literature in the field.

      The role of Xrp1 in the late wave of p53-independent cell death is shown in Fig 6D-F. As discussed above (reviewer 1 point 1), Fig 6S1-F shows the limited role of p53 in rpS12-independent Xrp1 induction, not the role of RpS12. We will add a figure to the revised manuscript showing the strong RpS12 dependence of the late induction of Xrp1-HA and explain this more clearly. We did not include this in the first manuscript version because we had already published this result, albeit with an anti-Xrp1 antibody (Ji et al Fig 1 N-P). As also discussed above (reviewer 1 point 3), we agree that the role of cell competition in removing aneuploid cells is not demonstrated in the present manuscript, but we considered this had been demonstrated previously (Ji et al 2021), and parts of that study recently confirmed by others (Fusari 2025 Cell Genomics), so it is not necessary to add further experimental support here, although it will be useful to explain the published literature more fully.

      Reviewer #3

      1. Figure 2E. Based on the text, I think the authors are claiming that the expression of GStD-LacZ is reduced in the posterior compartment of panel 2E compared to 2D. This is unconvincing. If at all, the expression along the DV boundary in the posterior compartment is stronger in E than in D. Am I missing something?

      We will replace Fig 2E with a clearer example, and add a quantification of all our data, with statistics, as a supplemental figure. Note that the conclusion is already substantiated by qRT-PCR data (Figure 2M)

      Figure 3I - K. The expression in the posterior compartment is supposed to be reduced compared to the anterior compartment. Once again, these differences are not easily apparent to me. Perhaps these images need to be quantified to illustrate the supposed difference.

      We are sorry that the reviewer found the images unconvincing. We will replace these figures with other examples, and add quantifications of all data, with statistics, as a supplemental figure. Note that the conclusions are already substantiated by qRT-PCR data (Figure 3R)

      • . *

      Line 286. The heading "Xrp1 is sufficient for the expression of p53-dependent DDR genes" is misleading. As stated in the final sentence of paragraph 2 of this section, the authors show that Xrp1 functions downstream of p53 and is sufficient for expressing a subset of p53-dependent DDR genes.

      We apologize for misleading the reviewer. We will change the heading to "Xrp1 is sufficient for the expression of many p53-dependent DDR genes", which is the meaning we intended.

      Figure 5, panels F and G could be made much easier for the reader to follow. The labels in these two panels are very difficult to see and understand. It might be better to show some high magnification regions (e.g. insets) that show the differences in the prevalence of cell death in regions with different genotypes. Also, why is Xrp1 +/- not quantified in panel H since the authors claim that cell death is reduced even in the heterozygous cells?

      It is a good idea to add enlarged figures, and we will do so. We can quantify the Xrp1+/- genotype as well.

      Line 363 and Figure 6D, E. The authors argue that the increase in H2Av in the posterior compartment implies that cells with damaged DNA are not being eliminated when Xrp1 function is reduced. An alternative explanation is that the p53 mutation together with the Xrp1 knockdown impairs the DDR even more resulting in increased H2Av staining. I don't know how that authors' data can exclude this possibility.

      We agree with the reviewer and did not intend to exclude this possibility. We will rewrite this text to make both explanations clear.

      Line 365. Is the resolution of the "double labeling" sufficient to conclude that some of the H2Av cells upregulate Xrp1-HA? A more conservative interpretation would be that in these regions that have increased H2Av, that there is more expression of Xrp1-HA.

      We apologize for a mistake in the submitted manuscript. In fact the anti-H2Av and anti-HA primary antibodies used were both raised in mouse, and Fig 6G,H show distinct wing discs, not double labels. We will replace line 365 with the sentence suggested by the reviewer.

      Figure 6 - supplement 1. The expression of Xrp1-HA is reduced in the p53DN cells when they are a loss mutant for rps12. Although statistically significant, this reduction is modest. If this induction were due to a cell competition like phenomenon, would you not expect the induction to be completely abolished since rpS12 mutations abolish cell competition completely? Please explain.

      We apologize for confusing all three reviewers with Figure 6F supplement 1. This figure does not compare RpS12-dependent and -independent Xrp1-HA expression. Instead, it shows that the rps12-independent Xrp1-HA expression is only mildly p53-dependent, which is consistent with our conclusions. We will add a figure to the revised manuscript showing the strong RpS12 dependence of the late induction of Xrp1-HA and explain this more clearly. We did not include this in the initial manuscript version because we had already published this result, albeit with an anti-Xrp1 antibody (Ji et al Fig 1 N-P).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Previous work has shown that when Drosophila imaginal discs are irradiated with X-rays that there are two phases of cell death. Within a few hours of irradiation, cells die in a p53-dependent manner. There is a much later phase of cell death that occurs approximately 20 hours after irradiation which seems to be mostly p53-independent. There is evidence that this latter phase of cell death might occur due to a phenomenon resembling cell competition where aneuploid cells are eliminated. In this manuscript, Chaitali Khan and colleagues explore the mechanistic basis of these two waves of cell death, focusing on the key regulator of cell competition Xrp1 and its relationship to p53. They make several conclusions: 1) Xrp1 appears to function downstream of p53 in activating the transcription of a number of genes involved in the DNA damage response. 2) Some pro-apoptotic genes but not others seem to be regulated via Xrp1 3) When p53 function is inhibited, cells with damaged DNA accumulate and Xrp1 expression is increased at the late time points. 4) Xrp1 contributes to the late death in the absence of p53 function consistent with its role in elimination of these cells by a mechanism that resembles cell competition. Overall the data are clean and the conclusions are mostly justified. Some conclusions appear a little overstated (see below). The authors could address most of these issues by more careful presentation of data and by more conservative interpretations of some of their experiments.

      1) Figure 2E. Based on the text, I think the authors are claiming that the expression fo GStD-LacZ is reduced in the posterior compartment of panel 2E compared to 2D. This is unconvincing. If at all, the expression along the DV boundary in the posterior compartment is stronger in E than in D. Am I missing something?

      2) Figure 3I - K. The expression in the posterior compartment is supposed to be reduced compared to the anterior compartment. Once again, these differences are not easily apparent to me. Perhaps these images need to be quantified to illustrate the supposed difference.

      3) Line 286. The heading "Xrp1 is sufficient for the expression of p53-dependent DDR genes" is misleading. As stated in the final sentence of paragraph 2 of this section, the authors show that Xrp1 functions downstream of p53 and is sufficient for expressing a subset of p53-dependent DDR genes.

      4) Figure 5, panels F and G could be made much easier for the reader to follow. The labels in these two panels are very difficult to see and understand. It might be better to show some high magnification regions (e.g. insets) that show the differences in the prevalence of cell death in regions with different genotypes. Also, why is Xrp1 +/- not quantified in panel H since the authors claim that cell death is reduced even in the heterozygous cells?

      5) Line 363 and Figure 6D, E. The authors argue that the increase in H2Av in the posterior compartment implies that cells with damaged DNA are not being eliminated when Xrp1 function is reduced. An alternative explanation is that the p53 mutation together with the Xrp1 knockdown impairs the DDR even more resulting in increased H2Av staining. I don't know how that authors' data can exclude this possibility.

      6) Line 365. Is the resolution of the "double labeling" sufficient to conclude that some of the H2Av cells upregulate Xrp1-HA? A more conservative interpretation would be that in these regions that have increased H2Av, that there is more expression of Xrp1-HA.

      7) Figure 6 - supplement 1. The expression of Xrp1-HA is reduced in the p53DN cells when they are alos mutant for rps12. Although statistically significant, this reduction is modest. If this induction were due to a cell competition like phenomenon, would you not expect the induction be be completely abolished since rpS12 mutations abolish cell competition completely? Please explain.

      Minor issue:

      Line 152: I assume you mean "p53-dependent apoptosis" and not p-53-dependent DDR".

      Significance

      Overall this manuscript clarifies the role of Xrp1 in DNA-damage repair and cell death following X-ray irradiation. Since mammals do not have an Xrp1 ortholog and mammalian p53 seems to function in cell competition similar to Xrp1 in Drosophila, this raises the interesting possibility that the tumor-suppressive function of p53 could, at least in part, be due to its role in cell competition that eliminates aneuploid cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this ms, authors utilize the Drosophila wing epithelium as model system to analyze the role of Xrp1 in DNA-damage induced cell death. The p53 gene is well known to have a conserved role (in mammals and flies) in driving cell death (and DNA repair) upon DNA damage induction (double stranded breaks, DSBs, in particular). Xrp1, a transcription factor mostly known for its role in cell competition induced by haploinsufficiency of ribosomal encoding genes (Minute genes), was indeed identified as a target of p53, but its role in the DNA damage response pathway was not addressed. In this ms, Baker and colleagues fill this gap. The paper is subdivided into the following chapters/figures: First, authors present evidence that Xrp1 is induced in wing discs exposed to ionizing radiation (IR, known to cause DSBs) and that this induction relies on p53 regulating Xrp1 transcription (Figure 1 and S1). Data are clear but there is a puzzling result. Xrp1-lacZ (a reporter of Xrp1 transcription) is induced by IR but independently of p53. These results need attention as they appear to be contradictory (why Xrp1-mRNA but not Xrp1-lacZ relies on p53). Nicely, authors show that Xrp1-lacZ induction relies on Xrp1/Irbp18 autoregulatory feedback. Is the lacZ insertion somehow interfering with the capacity of p53 to bind and regulate Xrp1 expression? Second, authors use a collection of reporter genes and show that Xrp1 regulates, most but not all, Dp53 target genes. It is really unclear whether the reaper-lacZ used in Figure 3L-P recapitulates the induction of reaper by p53. I know this reporter was claimed by other do so, but NOT in the wing disc. I would then remove it as mRNA data are clear. Third, authors show that Xrp1, as expected from the previous data in Figure 2 and 3, also mediated the role of Dp53 in inducing cell death, although only partially, and these differences are attributed to the gene reaper (p53 but not Xrp1 target). Dcp1 should be cDcp1 and clones should be magnified in Fig 5E-G. The last figure (6 and the two supplementary figures) are devoted to address the impact of Xrp1 in the well-known p53 independent second wave of cell death caused by IR (24 h later) induced by JNK and attributed by the Brodsky lab to the induction of aneuploid karyotypes (as a result of mistakes in DNA repair). Many of the results this section might be an artefactual consequence of GFP perdurance, some of the genetic tests are not clean enough, and lastly, the role of cell competition in this process relies on correlation (Xrp1 induction) but not clear functional data has been provided so far. I will go point by point

      (1) First, the impact of Xrp1 on the levels of DNA damage and cell death after 24h of IR are shown in a p53 mutant background (6E1-6E3) Authors should present the data in a clean +/+ background. Quantification of 6F should also be done in the same background

      (2) Second, hid-GFP is being induced by IR already at 4 h after IR and this induction and this induction relies on p53 and Xrp1 activities as shown in previous figures. Thus, the data presented in 6G-J could be a trivial consequence of the strong perdurance of the GFP protein.

      (3) Third, the role of cell competition (driven by Minute aneuploids) is not demonstrated and relies simply on the potential role of Xrp1 in the late wave of cell death, proposal that has not been demonstrated in this paper either. Indeed, the no-role of RpS12 in the late induction (24 h wave) of Xrp1 (Figure 6 S1-F) reinforces my doubts.<br /> Authors should reflect in the introduction and discussion sections the most recent literature in the field.

      Significance

      Overall, data presented in Figures 1-5 fills an important gap (a role of Xrp1 in mediating the activity of p53 in tissues subjected to IR and DNA damage) and data are convincing and need minor revision.

      However, the proposed role of Xrp1 and/or cell competition in this mysterious second wave of cell death attributed to the generation of aneuploid karyotypes is not demonstrated. Genetics are not clean, impact on reporters might by affected by GFP perdurance and the contribution of cell competition is correlative and lacks solid functional validation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study reports that the Drosophila transcription factor Xrp1 plays two temporally distinct and crucial roles in maintaining genome integrity: a p53-dependent acute DNA damage response (DDR) and subsequent p53-independent cell competition. The authors show that immediately following ionizing radiation (IR), Xrp1 is induced in a p53-dependent manner. During this acute phase, Xrp1 acts as an effector of the p53-driven DDR by promoting the expression of target genes, including the pro-apoptotic gene hid and DNA repair genes such as rad50, mus205, lig4, and Ku80. Approximately 16 hours after IR, as the acute DDR winds down, Xrp1 is induced in a largely independent manner of p53 but in a RpS12-dependent manner. This second phase of Xrp1 induction serves to eliminate affected cells by cell competition, possibly through acquired aneuploidy (specifically segmental monosomies affecting Rp gene dose) due to defective DNA repair. Furthermore, the authors show that reducing p53 function increases the persistence/accumulation of γH2Av-positive cells at 24 h post-IR, supporting a model in which Xrp1 contributes both to early DDR outputs and to later tissue-level quality control after irradiation.

      Major comments

      1. The idea that Xrp1 induction switches around 16 h post-IR, becomes RpS12-dependent, and subsequently engages cell competition is interesting and potentially important. However, the evidence supporting RpS12-dependence of Xrp1 induction is currently not sufficiently convincing. For example, based on the images in Figure 6F- supplement 1, the conclusion that Xrp1 is induced in an RpS12-dependent manner appears difficult to support. The authors should strengthen and quantify this result or provide the raw image data. In addition, because this point is central to the authors' model, they should move the key supporting data from the supplementary figures to the main figures to ensure that this critical claim is clearly supported and readily accessible to readers.
      2. The authors suggest a model in which Xrp1 executes two qualitatively distinct "modes" (pro-repair/acute DDR and elimination of aneuploid cells), but this remains only partially convincing as currently presented. The authors should at least (i) provide quantitative evidence that could explain how Xrp1 might produce distinct outcomes across phases (e.g., comparing Xrp1-HA levels and/or the fraction of Xrp1-HA-positive cells at 2-4 h versus 16-24 h post-IR), and (ii) explicitly discuss plausible mechanisms in the Discussion. Even if the molecular "switch" is not fully resolved experimentally, a clearer, data-grounded discussion of how Xrp1 could mediate these temporally distinct functions is needed. In addition, since ISR signaling (e.g., eIF2α phosphorylation) has been implicated as a single feature associated with Xrp1-dependent loser elimination, the authors should consider assessing p-eIF2α levels in Xrp1-HA positive cells at early versus late time points after IR (e.g., 4 h vs 24 h).
      3. The idea that aneuploid cells-or cells with altered ribosomal gene dosage-could be removed via Xrp1-mediated cell competition is intriguing. However, the manuscript does not currently provide any evidence that such cells are, in fact, being eliminated. The authors should therefore (i) quantify cell-level overlap metrics, such as the fraction of γH2Av-positive cells that are Xrp1-HA-positive (and vice versa), as well as the fraction of γH2Av-positive cells that are cleaved Dcp-1-positive (and vice versa) at 24 h post-IR. These quantitative analyses would clarify whether the late Xrp1-HA-positive population corresponds to persistently damaged cells and whether it is enriched for cells undergoing apoptosis/clearance. The authors should also (ii) directly assess aneuploidy/segmental copy-number imbalance in the late Xrp1-HA-positive clusters (e.g., by DNA FISH targeting one or two chromosome arms/regions), and if these experiments cannot be completed within a reasonable revision timeframe, the authors should temper their wording and present aneuploidy and selective elimination as a plausible interpretation supported by RpS12 dependency and prior literature, rather than as a demonstrated conclusion in the current study.
      4. Regarding the statistical analysis, revisions are warranted. In multiple panels, Student's t-tests are repeatedly performed against the same control, which inflates the family-wise error rate and increases the risk of false-positive findings. In such cases, an overall ANOVA (one-way) followed by an appropriate multiple-comparison procedure-such as Dunnett's-test would be more appropriate. This concern applies in particular to: Figure 1A- Supplement 1 Figure 2M-R Figure 3Q, R Figure 5D Figure 5J- Supplement 1 Figure 6G- Supplement 1 Figure 6I- Supplement 2

      Minor comments

      1. Figure 2E is not cited in the text, and it is difficult to tell from the images as presented whether p53DN overexpression suppresses the Gstd-lacZ signal at 4 h post-IR.
      2. In Figure 4, rpr150-lacZ does not appear to be upregulated by Xrp1 overexpression. Therefore, the authors should revise the figure title to avoid misleading readers, because rpr, a well-known p53-responsive pro-apoptotic gene, is not induced under this condition.
      3. In Figure 6E, based on the data as presented, it is difficult to determine whether cleaved Dcp-1 (cDCP1)-positive cell counts are reduced upon Xrp1 knockdown. The authors should provide clearer representative images and/or include the underlying raw images as supplementary source data to support the conclusion.
      4. The authors should (i) show raw data points overlaid on summary plots (e.g., dot plots on top of bar graphs/box plots) to convey data distribution and (ii) include higher-magnification insets and/or quantitative localization/overlap analyses where colocalization is central to the interpretation (e.g., Xrp1-HA relative to γH2Av).

      Significance

      This study puts forward an appealing conceptual framework in which Xrp1 exhibits temporally distinct activation patterns after ionizing radiation and may connect cell-autonomous DDR outputs with non-cell autonomous tissue-level quality control. In particular, the idea that Xrp1 can function both as a downstream effector of p53-associated DDR programs and as a mediator of the subsequent elimination of damaged cells is potentially important for understanding how epithelia maintain homeostasis under genotoxic stress.

      The study should be of interest to DDR researchers because it dissects p53 downstream outputs in a genetically tractable in vivo tissue context and provides a temporal framework for how p53-linked programs are coordinated after irradiation.

      The manuscript will be of particular interest to the cell competition community. By proposing that RpS12-dependent Xrp1 induction engages a damaged-cell elimination program after IR, the study raises the possibility that tissues exposed to genotoxic stress might exploit cell competition as a quality-control machinery. Even if some mechanistic aspects require stronger support, this framework could broaden the contexts in which cell competition is thought to contribute to tissue homeostasis.

      The reviewer's expertise: mechanism of tissue growth control in Drosophila.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Lymphatic vessels drain tissue fluid, absorb lipids, and traffic immune cells. Recent studies on adaptive immunity have identified lymphatics as a potential key target to treat inflammation-associated diseases. In this context, studies on lymphatic sprouting, i.e. the process by which lymphatics expand, are timely. Although Zebrafish lymphatics are somewhat different from mammalian lymphatics, still, the zebrafish has been a useful model for the identification of the key players regulating lymphatic vessel growth, thus, presenting potential targets for pre-clinical studies.

      Woutersen et. al. have studied the shp2a and shp2b douple mutant zebrafish and identified a requirement for shp2 in lymphatic vessel formation 3-5 days post fertilization. The authors state that the shp2 is required for migration and differentiation of the future lymphatic vessels but not the formation of the venous intersegmental vessels (in contrast to other relevant genes, such as vegfr3). The phenotype is rescued by the expression of wild-type but not mutant shp2.

      Major comments:

      The authors use shp2 deleted strains, live imaging and mRNA rescue experiments. The results, as such, are convincing and the reporting is accurate, allowing reproduction of the experiments. Still, some of the conclusions are not fully backed up by the presented results and would need further experimentation as outlined below:

      1. The other "lymphatic vessel mutants", such as vegfr3, vegfc, and grb2, also cause blood vessel phenotypes, i.e. have an effect on venous intersegmental vessels. The authors state that the shp2 mutants are the first ones to have a lymphatic vessel-specific phenotype. Authors should discuss whether this is due to maternal contribution, i.e. long maternal shp2 mRNA or protein half-life? To back up the statement, authors should investigate later angiogenesis events (developmental or induced) to show that shp2 is not required. * We cannot exclude the possibility that maternally contributed Shp2 is responsible for normal venous intersegmental formation. However, this is unlikely, because at the same time, we did observe defects in lymphangiogenesis. It is unlikely that the half-life of Shp2 is regulated differentially in endothelial cells that contribute to future vISVs compared to future ISLVs.

      To show that shp2 has a lymphatic endothelium autonomous role, the authors show that the vegfc mRNA expression is not altered. Authors should quantify the in situ signals (vegfc and vegfr3) and use non-specific probes to show the level of non-specific staining. It is still possible that shp2 would have a lymphatic endothelium-independent role, for example, in Vegf-c processing. Authors should discuss this or delete shp2 in an endothelium-specific manner. Authors should also stain, use in situ hybridization or qPCR (of extracted flt4 reporter-expressing cells) to show that shp2 is expressed in lymphatic endothelial cells.

      * Expression of vegfc was assessed to establish whether loss of Shp2 affected its expression, not to show that Shp2 has a lymphatic endothelium autonomous role. In situ hybridization is semi-quantitative at best. The vegfc in situ hybridizations are similar between wild type and knock-out and do not provide an indication that vegfc expression is altered, warranting further investigation by qPCR. On the other hand, the flt4 in situ hybridizations show a clear reduction in signal in Shp2 double knockout embryos, which was confirmed by qPCR experiments (Fig. 3g). We cannot exclude the possibility that Shp2 has a role in Vegfc processing as suggested by the reviewer and we have included a statement to this effect in the Discussion of the revised version (line 411, 412). In situ hybridization patterns are not very informative for Shp2, because Shp2 is expressed in most, if not all cells, which results in rather indiscriminate expression patterns (Bonetti et al. 2014, PLoS ONE 9, e94884. doi:10.1371/journal.pone.0094884).

      Authors highlight lymphatic endothelial cells and precursors with flt4 (vegfr3) reporter. Furthermore, authors write "a pivotal role for Shp2 signaling in the migration and differentiation of lymphatic endothelial" but do not provide any evidence for the differentiation expect the presence of flt4 (vegfr3) reporter expressing cells. To use a second method for detecting lymphatic vessels and to investigate the differentiation, the authors should show and quantify Prox1 expression in PCV endothelial cells prior to sprouting and in migrating future lymphatic endothelial cells.

      * We changed “differentiation” in the title and in the abstract to “formation”, because we do not provide formal proof that Shp2 is involved in differentiation of lymphatic endothelial cells. We routinely use Tg(flt4:mCitrine; flt1:tdTomato) reporters to highlight lymphatic endothelial cells. We have also used Tg(fli1a:GFP; kdrl:mCherry) to highlight lymphatic endothelial cells. Because the signals were more robust, we mainly used the former transgenic line. We have included representative images of the Tg(fli1a:GFP; kdrl:mCherry) line in Supplementary Figure 1 as a second method for detecting lymphatic vessels. We included a statement to this effect in the text (line 182-188).

      SHP2 has not been linked to VEGFR3 earlier, but has been shown to control VEGFR2. However, it is not obvious whether SHP2 is a positive or a negative regulator of VEGFR2. Here, authors should try to stain pErk in sprouting control and shp2 deleted cells, similar to their previous study (Mauri et al. 2021), to show the effect of shp2 loss on the growth factor receptor downstream signaling.

      * We have considered staining pErk using whole mount immunohistochemistry. However, subsequent imaging of the target cells is extremely difficult, because we would be interested in a subset of endothelial cells, the ones that are sprouting. Timing is also an issue, because we would be interested to image these cells around the time they are sprouting. Only a small number of endothelial cells sprouts and these cells will be hard to discern from surrounding endothelial cells. Some of the surrounding endothelial and non-endothelial cells may express high levels of pErk as well. Hence, interpretation of the pErk immunohistochemistry data is extremely difficult. It would be interesting to use a reporter line for MAPK activation, which might allow for imaging specifically of the target cells in double or triple transgenic backgrounds, but this is beyond the scope of this paper.

      Reporting the sample numbers: In most of the experiments/figures, the authors do not have sufficient information. The number of independent experiments and biological replicates should be shown for each, even representative, experiment. Data should always be derived from more than one independent experiment.

      * We have included the number of experiments for the different experiments and we have increased the number of embryos for the different conditions to include the data of at least 8 samples for each experiment.

      Minor comments:

      P.13 rows 269-271: "In addition, we observed normal perfusion and blood flow in the established vISV connections of the ptpn11a-/-ptpn11b-/- embryos and their siblings, suggesting that Shp2 is dispensable for the formation of vISVs.". The authors should show all the data mentioned in the manuscript. If this is shown in a provided movie, please, indicate which one.

      * In the revised version, we refer to Figure 7d, where perfusion of vISVs is evident (line 278).

      Figure legend 6: change "arrow" to "arrowhead".

      * This has been corrected

      **Referee cross-commenting** No further comments

      Reviewer #1 (Significance (Required)):

      The current manuscript is focused on the characterization of the shp2 mutant embryo phenotype and the rescue experiments. Upon completion of the above-mentioned experiments, the manuscript presents shp2 as a novel regulator of lymphatic vessel formation/lymphatic endothelial cell survival. As such, this notion is quite isolated, since there is no biochemical evidence of, for example, VEGFR3-SHP2 interaction. Broader impact (and audience) would be reached if the authors could show the molecular mechanisms governed by Shp2. Now, in the absence of this data, the impact is moderate. Still, lymphangiogenesis researchers would find the results interesting, thus potentially opening new avenues.

      Reviewer's field of expertise: Lymphatic endothelium. No expertise in zebrafish.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Woutersen et al. describe the effect of single and double knockouts of the zebrafish SHP2 orthologs ptpn11a and ptpb11b. Although some effects of single deletion of ptpn11a are observed, compound deletion results in profound ablation of VEGFR3 (flt4 in zebrafish)-dependent but interestingly, not Tie1-dependent lymphangiogenesis. Rescue experiments with genes encoding WT and mutant forms of SHP2 indicate that intact SH2 domains, PTP activity, and C-terminal tyrosines are required. They also observe differential rescue by the zebrafish analogs of Noonan syndrome (NS) and Noonan syndrome with multiple lentigines (NS-ML) mutants.

      Overall, this is a comprehensive analysis of the effects of WT and mutant SHP2 in lymphatic development in zebrafish. I support its publication with minimal revisions addressing the points below.

      1) For the general reader, it would be helpful to include (in the Supplementary Materials or in Fig. 1) a diagram showing the steps in lymphatic development described in the Introduction that shows the position of the various structures that are subsequently referred to only by abbreviations.

      * In the introduction, we refer to Hogan and Schulte-Merker 2017 Dev Cell 46, 567-583, a review that shows schematics and all the abbreviations we use in our manuscript.

      2) For several figures, there is no statement of what the arrowheads and asterisks point to either in the text or figure legends (e.g. Fig. 2, Fig. 5, Fig. 7). Also, Fig. 6 has "arrowheads", not "arrows". Please check all figure legends carefully to ensure that they fully describe the results shown).

      * We have included statements of what the arrowheads and asterisks in all figures indicate in the revised version.

      3) In the legend to Fig. 1, the authors state that ptpn11a-/- embryos have a "slim" phenotype. How was this assessed-and can it be quantified?

      * We have not systematically quantified this trait of ptpn11a-/- fish and we have not studied the functional consequences, if any. This is a qualitative characteristic that is obvious when analyzing the embryos. We do not want to put much emphasis on the slim phenotype and we have removed the statement from the legend of Fig. 1 in the revised version (line 738).

      4) In the experiments shown in Fig. 6 (and Supplemental movie 1), the authors show that initial sprouting occurs in double mutant embryos, but the sprouts are unable to connect to an aiSV. There are clearly sprouts in the double mutant embryos shown, but there appear to be fewer of them. Do normal numbers of initial sprouts form?

      * Close analysis of the imaging data indicates that normal numbers of initial sprouts form in the double mutant, one sprout for each intersegmental vessel.

      5) If possible, the authors should show immunoblots for all the rescue experiments to convince the reader that each construct was expressed appropriately.

      * Whereas this is an interesting suggestion, this is technically not feasible, because the amount of material from individual embryos is not sufficient for detection of microinjected Shp2 protein by immunoblotting. In fact, only part of the embryo would be available, because a part is needed for genotyping, as we use incrosses of heterozygous fish to generate embryos for the injections. Instead, we expressed constructs encoding GFP and the autoproteolytic peptide 2A linker to the N-terminal side of Shp2a and variants. In line 121, we provide a reference to the paper where we first used this construct, which includes a schematic representation of the construct (Bonetti et al., 2014, Development 141, 1961-1970, DOI: 10.1242/dev.106310). We assessed GFP fluorescence at 1 dpf and discarded embryos that did not express GFP, thus selecting for embryos that did express Shp2 (variants).

      6) The finding of incomplete, or in the case of ptpn11D61G, lack of rescue of lymphangiogenesis by RASopathy-associated mutants is particularly interesting. Have the authors looked at why this is so-i.e., does sprouting occur in D61G-reconstituted embryos? Is migration then blocked or accelerated? Is fusion to aiSVs defective? Although not necessary for the current publication, such information would certainly strengthen the paper. Also, I am not sure that I agree with the authors' statement that the two NS-ML mutants rescue equally to WT; A462T, in particular, is at least nominally less effective and if the n was higher, it might well show statistically lower rescue. The authors should consider tempering this statement.

      * We are planning to investigate in-depth the effects of Shp2-D61G and other NS-associated genes on lymphangiogenesis, but this is beyond the scope of this paper. Here we demonstrate that Shp2 variants rescue or not, upon expression of synthetic mRNA encoding Shp2 variants by microinjection at the one-cell stage. We have tempered our statement about the NS-ML mutants in the text (line 369-372): “Both NSML variants rescued the lymphangiogenesis defects in ptpn11a-/-ptpn11b-/- embryos to the extent that there was no significant difference with their wild type and heterozygous siblings anymore (Figure 10b).”

      7) In the Discussion, the authors reference recent papers on lymphatic defects in NS patients. Although there is no harm in citing these papers, lymphatic abnormalities have been noted in NS patients since the initial descriptions of the syndrome. Either those papers or a review should be cited as well.

      * We have included a reference (line 486) to the review by Roberts et al. 2013 Lancet 381,333-342, https://doi.org/10.1016/S0140-6736(12)61023-X in addition to the recent papers we cited that report lymphatic anomalies in human NS patients, based on lymphangiograms.

      8) The authors might want to note that peripheral edema has been universally associated with SHP2 inhibitor treatment in patients.

      * It is an interesting notion that peripheral edema is the second most frequently occurring side effect in response to SHP2 inhibitor treatment in human subjects (Johnson ML et al. 2024 Mol Cancer Ther 2025;24:384–91 doi: 10.1158/1535-7163.MCT-24-0466). We have included a statement to this effect in the Discussion of the manuscript (line 423-430).

      9) Also, why do the authors think that Tie1 signaling does not require SHP2? It would be interesting to note for the reader that SHP2 has been reported to bind to activated Tie1 and discuss anything known about SHP2 requirements for Tie1 action in mammalian systems.

      * SHP2 interacts with many RTKs that are involved in many developmental processes. Zebrafish embryos lacking functional Tie1 display reduced endothelial and endocardial cell numbers and reduced heart size (Carlantoni et al. 2021 Dev Biol. 469:54-67. doi: 10.1016/j.ydbio.2020.09.008). Whereas we have not investigated this in detail, we have not observed obvious defects in cardiac development. Yet, Tie1 signaling has been implicated in lymphangiogenesis and we cannot exclude involvement of defective Tie1 signaling due to lack of functional Shp2 in the Shp2 double knockouts.

      **Referee cross-commenting** No further comments

      Reviewer #2 (Significance (Required)):

      Thie is a comprehensive study of the role of SHP2 in lymphatic development, using zebrafish as a model. Although descriptive, this paper is important because mutations in SHP2 are associated with lymphatic abnormalities and SHP2 inhibitors cause lymphedema. Also, the unique features of the zebrafish system allow the authors to define the steps and signaling pathways defective in these models.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      SHP2 is an adaptor protein that plays an important role in the RAS/MAPK pathway. Abnormal activity in this pathway has been involved in various cancer as well as developmental disorders like Noonan Syndrome. Here, the authors show the important role of Shp2 in physiological lymphatic development in zebrafish using various Shp2 mutants. This promising manuscript, however, needs some adjustments and further clarifications.

      Results section:

      • Transmitted light images of ptpn11a-/- ptpn11b-/- embryos are not consistent throughout the figures. Larvae in figure 1 is particularly severe compared to images of the same line at 5dpf in the rest of the article (ex. Supp fig1 c, Supp fig4 c&l). Authors should have a consistent representative images. Was there a range of phenotype severity in this model ? Additional phenotype details and quantifications should be included about this double knockout model.

      * We consistently observed a range of phenotypes in the double mutant embryo since the first description of the phenotype (Bonetti et al. 2014, PLoS ONE 9, e94884. doi:10.1371/journal.pone.0094884). The variation depends on the families that are being used to generate the embryos. This is why we include non-injected controls for all injection experiments. Whereas not all double homozygous embryos show edemas, edemas are representative of the phenotype.

      • Line 165-167 : "Loss of functional Shp2a in ptpn11a-/- ptpn11b+/+ embryos induced a pleiotropic phenotype from 4 days post fertilization (dpf) onwards (Figure 1a-d) and was previously shown to be embryonic lethal". Line 178 : "Wild-type siblings and single mutants showed normal lymphatic vasculature...". There is a discrepancy between these 2 sections because one of the single mutant is embryonically lethal. What was the cause of lethality in this model and was it vascular-related ? Could the authors provide more detail about that ?

      * In our view, there is no discrepancy between these sections. The ptpn11a-/-ptpn11b+/+ embryos start to show a morphological phenotype at 4 dpf, but lymphangiogenesis is normal in these embryos. The embryos lacking functional Shp2a do not survive long after reaching 5 dpf and we have never obtained adult ptpn11a-/- fish. Hence, Shp2a is required for normal zebrafish embryogenesis, but lymphangiogenesis is only impaired in embryos lacking all Shp2. We have not investigated lethality of ptpn11a-/-ptpn11b+/+ embryos or larvae in detail, but the absence of a functional swim bladder (Fig. 2c) is likely causing lethality. We have no indication that lethality was vascular related.

      • Authors managed to create various mutant zebrafish model crossed with the double transgenic flt4:mCitrine;flt1:tdTomato. In the double mutant, it is surprising to see an important decrease in the tdTomato arterial expression. Please choose a more representative image or add further explanations.

      * The tdTomato signal in this particular experiment is reduced in the double mutant compared to the other genotypes we show here. We believe that by coincidence the embryo in Figure 2d is heterozygous for tdTomato, whereas the other embryos are homozygous. The conclusion of this experiment is not affected by this apparent difference in expression: double homozygous embryos lack the lymphatic vasculature.

      • Authors had shown clear defects in the zebrafish model in figure 1. It is confusing since zebrafish were imaged at 4dpf (line 176) but figure 2 shows images at 4dpf whereas the TD is fully visible and developed at 5dpf. Authors should correct that or show both set of images at 4 and 5 dpf (one can be placed in supplementary). Also, text refers the presence of TD at 5 dpf (line 184-185) and correlated quantification (figure 2e) whereas images from figure 2 are from 4dpf fish.

      * The thoracic duct is detectable in all segments of zebrafish embryos at 4 dpf (Fig. 2a). Morphological defects do not necessarily correlate with defective development of the thoracic duct. However, severe edemas in the double knockouts distort the vasculature and/or interfere with imaging of the thoracic duct and therefore we assessed the presence of the thoracic duct at 4 dpf. Line 193 – the quantifications were done using embryos at 4 dpf. We have corrected this mistake in the text of the revised version.

      • Line 167 & 173: authors mentioned embryonically lethal model without explaining how old the larvae were, could you please add the information.

      * The term “embryonic lethal” is technically not correct, because the embryos do not die in significant numbers before they reach 5 dpf. We have rephrased this to “lethal after the embryonic stage” (line 168 and 174) to be more accurate. We have not established exactly when the larvae died. Most embryos survive until 5 dpf, and we never obtained adult ptpn11a-/- fish. Establishing when the larvae die is considered an animal experiment under European law. We have chosen not to sacrifice larvae just to establish when they died.

      • Authors claim that no significant lymphatic deficiencies were observed in the single Shp2a or Shp2b alone. Is this result due to compensatory mechanisms from one isoform to the other ? Further molecular quantifications such as qPCR or Western blot could be performed in both single mutant to characterize this phenomenon.

      * Indeed, we believe that redundancy between Shp2a and Shp2b is the cause that there are no lymphatic deficiencies in the single mutants. Previously, we have shown that Shp2a and Shp2b are both functional, that both Shp2a and Shp2b rescue developmental defects and that Shp2a and Shp2b are both expressed in zebrafish embryos (Bonetti et al., 2014 PLoS ONE 9: e94884, doi:10.1371/journal.pone.0094884). Moreover, expression of either Shp2a or Shp2b rescued defects in the lymphatic vasculature in double knockout embryos (Fig. 4), which is consistent with Shp2a and Shp2b having compensatory roles.

      • Figure 3 - the authors show differential development of the head vasculature. It would be consistent with the rest of the figures to keep the same labelling and colors rather than black and white images. Authors nicely added figure 3c and 3f as great schematic, it would be helpful to highlight all of them in the zebrafish images (ex. BLEC) and add different colors of arrows for each structure. Adding single mutant images as supplementary figures would be important to confirm that there are no significant defects.

      Measurements and quantification should be performed to validate the authors claim of missing and impaired lymphatic structures. Could the authors provide details about the vascular vessels of the head, is there any consequence in the blood vasculature ?

      Additionally, using a nuclear line or a nuclear staining is essential before making any conclusion about lymphatic cell population abnormality.

      * We provide the representation as shown in Figure 3, because the contrast of the flt4:mCitrine signal is superior in this black and white representation compared to the green signal on black background representation. We have included differently colored arrowheads to indicate the different lymphatic structures and we have included representative images of the single mutants in Supplementary Figure 2.

      Our conclusions regarding the lymphatic vasculature in the head are qualitative. Most lymphatic structures are missing altogether in the double mutant, which does not allow meaningful quantification. We have not observed obvious defects in the blood vasculature in the double mutant.

      We conclude that lymphatic vasculature does not develop normally. A nuclear reporter line would be required to conclude that the number of lymphatic cells is aberrant in the double mutant, which is interesting, but is not what we conclude from these experiments.

      • Figure 4 - Authors performed rescue experiments with injection of mRNA to demonstrate that the lymphatic KO phenotype was due to the lack of functional Shp2. Successful mRNA injection and so Shp2a/Shp2b increased expression should be confirmed using qPCR to validate the experiment in the first place. Representative images correlating with quantifications should be added in the figure to support the authors results.

      * The constructs we used for the rescue experiments contain GFP fused to the autoproteolytic peptide 2A and Shp2 (variant) (Bonetti et al., 2014, Development 141, 1961-1970, DOI: 10.1242/dev.106310). These constructs drive expression of the fusion protein, which is cleaved into GFP and the Shp2 variant. Hence, expression of GFP is indicative of expression of Shp2. We routinely discarded embryos that did not express GFP at 1 dpf, thus selecting embryos that express the Shp2 (variants).

      • Figure 5 - Authors should perform experiment with a nuclear line or a nuclear staining in the fish lines before making any conclusion about the number of PL cells. Additional clarifications about the methods of quantification should be included. The authors should count the number of segments/missing segments instead. Individual values with standard deviation should be shown in the graph instead of the total mean value and standard variation and should be specified in the figure legend.

      * We agree with the reviewer that counting cells with a nuclear reporter would be superior to the way we quantified the number of PL cells in the transgenic flt4:mCitrine reporter line. It is possible that if two PL cells are very close together, they will be counted as one and hence that the numbers we provide are an underestimate of the total number of PL cells. We feel that this potential intrinsic error in counting would be the same for all conditions/ genotypes. The point of Figure 5 is that the double mutants have no PL cells and the other genotypes have similar numbers of PL cells. The potential intrinsic error would not alter the conclusion of this figure. We have included how we counted the number of PL cells in the legend to Fig. 5 and we included the standard deviation in Fig. 5e.

      • Figure 6 - Time-lapse imaging shows aberrant sprouting in the double mutant compared to control larvae. However, it is not clear if that process is just delayed or completely impaired in the mutant : time-lapses experiment should be performed in later stages. It seems that the chosen time-points images are different from the wild-type and the mutant groups, it would be best to have the same time-point to highlight the difference between the two groups. Authors affirm that vISV formation is unaffected in the double mutant larvae, however, it is hard to confirm that statement with black and white images and supplementary movies. Raw confocal images and movies should be included instead to distinguish lympho-venous and arterial structures.

      * The supplementary movies and Fig. 6, which is derived from these movies, show lack of PL cell formation in the double mutant (Fig. 6B). PL cell formation is clearly visible in wild type embryos (Fig. 6A). The sprouts that (are supposed to) give rise to PL cells are indicated with arrowheads. In both embryos, vISV formation is evident in the ISVs next to the ones where PL cells start to form, i.e. the ISVs next to the ones indicated with arrowheads. Sprouting of the endothelial cells is best observed in the time lapse movies. Whereas the exact timing may be different due to the exact conditions, the developmental timing of the sequence of images is similar between the wild type and the double mutant. The black and white representation gives higher contrast than the original fluorescent movies/ pictures, which is why we prefer this representation.

      • Figure 7 - Figure 7d does not correlate with previous imaging included in figure 2, in fact, fluorescent expressions appear inverted between the two figures. Please standardize this as they are not comparable. Quantification of the percentage of veins may not be the best parameter to investigate the normality of the vISV. Measurements of the diameter of the vISV would be more relevant. Individual values with standard deviation should be shown in the graph instead of the total mean value and standard variation and should be specified in the figure legend.

      * We believe the intensities of the signals in Figure 7d and Figure 2d may be different, because the embryo in Figure 2d may be heterozygous for the flt1:tdTomato transgene, whereas the embryo in Figure 7d is homozygous. Whereas the intensities of tdTomato are different, we clearly observe the absence of the lymphatic vasculature in Figure 2d and normal formation of vISVs in Figure 7d. We have indicated in the legend of the figure that the percentage of vISVs was determined in the number of embryos indicated and that the average percentage is plotted in the graph with the error bars indicating the standard deviation (lines 787-789).

      • Figure 8 - Authors have analyzed flt4 and vegfc expression in the mutant embryos to further characterize Lymphangiogenesis processes in the model. Fold change expression of flt4 appears to be decreased in the double mutant compared to control. It would be useful to also quantify it in uninjected and ptpn11a+/- ptpn11b-/- groups as additional appropriate control groups. Images of ptpn11a+/+ ptpn11b+/+ embryos should be added. Lack of consistency between images and quantification are confusing.

      Considering that quantifications in other figures were performed in a high number of larvae and only 3 were included in this figure in the double mutant group, it would be important to increase the number of ptpn11a-/- ptpn11b-/- embryos for this experiment. To confirm that vegfc expression is normal, fold change expression should be included as performed for flt4 expression.

      Figure number is missing.

      QPCR was done with ptpn11a+/+ptpn11-/- and ptpn11a-/-ptpn11b-/- embryos, correlating to the genotypes that were used for in situ hybridization. There were no injections performed in the framework of this experiment. Because ptpn11a+/+ptpn11b-/- embryos formed lymphatic vasculature like wild type embryos (Figure 2), we focused on embryos derived from an incross of ptpn11a+/-ptpn11-/- fish, generating ptpn11a-/-ptpn11b-/- double mutant embryos as well as ptpn11a+/+ptpn11-/- and ptpn11a+/-ptpn11b-/- siblings. In situ hybridization indicated that flt4 expression was reduced, which was confirmed by QPCR. We have not included vegfc in the QPCR experiments, because the in situ hybridization experiments did not suggest a difference in expression between the genotypes. The Figure number was added.

      • Figure 9: A different background line was used for this figure (fli1a:eGFP;kdrl:mCherry vs flt4:mCitrine;flt1:tdTomato), could the authors explain the purpose of this change and add a brief experiment to confirm the findings and phenotype do not change from one line to another. The overall purpose of this set of experiment is not very clear, maybe one or two sentences of transition as well as rephrasing parts of this section could help better understand the objective and results.

      * A different transgenic background was used for this figure. Like Tg(flt4:mCitrine;flt1:tdTomato), the Tg(fli1a:eGFP;kdrl:mCherry) line allows analysis of the lymphatic vasculature (all lymphatic vessels are labeled with eGFP, not mCherry). The results were the same between the two transgenic lines. The flt4:mCitrine signal is more robust than the flia:eGFP signal, which is why we showed images of the former in most of the figures. Representative images of the Tg(fli1a:eGFP;kdrl:mCherry) line are shown in Supplementary Figure 1. We have included a statement to explain the objective of this part (line 311-312): “We used mutants of Shp2a to assess which signaling functions of Shp2 are required for normal lymphangiogenesis.”

      • Figure 10 - Correlating zebrafish data with human disease is very interesting and highlight the importance of this work. The authors characterize the effect of NS and NSML variants on morphological and lymphatic defects in zebrafish embryos and find that these variants significantly rescued anomalies in double mutant larvae. Since these variants have opposite effects (increase signaling activity in NS and decreasing activity in NSML), authors should add a few words about how two opposite variants could have the same outcome on the zebrafish model. It may also be helpful to include information about these diseases in the introduction, including the lymphatic complications.

      * In the discussion, we included a paragraph where we discuss the effects of the NS and NSML variants and why both variants may rescue the phenotype in Shp2 double knockout embryos (lines 458-488).

      • On supplementary figure 4, double mutant expressing Shp2a A462T fish seems to develop edema. Similarly to figure 8, on all supplementary figures, data were collected from only 3 larvae per group in some groups (2 in supplementary fig 2l) is weak considering that this in vivo model allows to generate a very high number of embryos. Authors should increase the number of larvae per group to reach at least N=10/group to be more robust.

      Line 357 "... was observed more frequently in Shp2a-D61G injected double mutant embryos" this statement should be supported by the appropriate quantifications and statistical analysis.

      * We increased the number of embryos that we evaluated for each condition of the injection experiments to at least 9.

      Line 361-362 " (cf. Figure 4, 10b)" incorrect typo?

      * We have altered the statement (line 369-372) to: “Both NSML variants rescued the lymphangiogenesis defects in ptpn11a-/-ptpn11b-/- embryos to the extent that there was no significant difference with their siblings anymore (Figure 10b).

      Materials and Methods section :

      Overall, this section needs significant clarifications considering the amount of work and data that have been collected. Additionally, each reagent, material, solution, objective, need to be rigorously referenced with reference number and supplier name.

      * The catalog numbers of special reagents have been added.

      Each software should also have the version specified and be correctly cited (ex: ImageJ software version 2.14.0/1.54f. and reference: Schneider, C. A., Rasband, W. S., & Eliceiri, K. W. (2012). NIH Image to ImageJ: 25 years of image analysis. Nature Methods, 9(7), 671-675) .

      * We have indicated the version number and included a reference to the Image J software in the revised version (line 136, 137)

      • Constructs, mRNA synthesis : Were the sequences validated ? If yes, how? Please explain.

      * The constructs were validated by sequencing. The mRNA synthesis was verified by running aliquots of the mRNA on agarose gels. Based on the signal on gel, the concentration was adjusted to ensure that equal amounts of mRNA of each Shp2 variant were injected at the one-cell stage.

      Microscopy : Precise references of the objectives that were used to capture images.

      * We included references to the objectives that were used in microscopy in the Materials and Methods section.

      • Quantification: Please specify how all quantifications were made. How figure 5e and 7e were collected?

      * In the legend to Fig. 5, we indicated how the data were quantified (line 772-774): “Quantification of the number of PL cells in the trunk at 54 hpf. The number of PL cells was counted in the trunk of 54 hpf embryos over the length of 10 somites and the average number of PL cells is depicted. The error bars indicate the standard deviation..” In the legend to Fig. 7 we have included a statement how the percentage of venous ISVs was determined (line 787-789): “The percentage of veins in siblings and double homozygous mutants was determined in the indicated number of embryos (n) and is depicted. The error bars indicate the standard error.”

      Statistical analysis: Specify how data are expressed (ex. Mean {plus minus} s.e.m). The authors have made a serious confusion in choosing the statical tests. Differences between the experimental groups should be evaluated with the use of the Mann-Whitney test only when two groups are compared. Differences between three or more experimental groups (your case in this paper) should be evaluated with the use of an analysis of variance test (ANOVA), followed by a Tukey-Kramer post hoc test when the results were significant (P* We use the Mann-Whitney test to compare the groups in pairs, i.e. the ptpn11a+/+ptpn11b-/- control group compared to ptpn11a+/-ptpn11b-/-, or compared to ptpn11a-/-ptpn11b-/- double knock-out. This is reflected in the brackets we use to indicate significance or the lack thereof between samples, e.g. Figure 4.

      Suggestions on additional supplemental figures :

      • Beginning of introduction gives an impression of a review article about vascular development in larvae, authors should shorten it and/or add a supplementary schematic to support this long description.

      * We try to be complete to help the reader understand the rest of the paper better.

      • Alignment of the different proteins of the study both in human and zebrafish to show homology

      * For an alignment of the Shp2a and Shp2b proteins with human SHP2, we refer to our previously published paper: Bonetti et al., 2014, PLoS One 9, e94884, doi:10.1371/journal.pone.0094884).

      Schematic of protein domains, binding domains and location of variants

      * This is an interesting suggestion, but for space reasons, we decided not to include such schematics.

      **Referee cross-commenting** No further comments

      Reviewer #3 (Significance (Required)):

      SHP2 is an adaptor protein that plays a critical role in regulating the RAS/MAPK signaling pathway. Dysregulation of this pathway has been implicated in various cancers and developmental disorders, including Noonan Syndrome. In this study, the authors demonstrate the essential function of Shp2 in physiological lymphatic development in zebrafish by examining multiple Shp2 mutant models. This promising manuscript, however, needs some adjustments and further clarifications.

      I believe the appropriate audience for this research is specialized - primarily scientists and researchers working in basic biomedical research, particularly in molecular biology, developmental biology, and signaling pathways. The study's focus on zebrafish models and the mechanistic role of Shp2 in lymphatic development positions it within the scope of fundamental biology rather than translational or clinical application, though it has relevance to both.

      As a member of a vascular malformations laboratory, my research focuses on advancing biomedical research through an integrative approach combining in vivo research, molecular biology, translational medicine, and public health. More specifically, my current work focuses on specific genes causing complex lymphatic anomalies and drug discovery using zebrafish models.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      SHP2 is an adaptor protein that plays an important role in the RAS/MAPK pathway. Abnormal activity in this pathway has been involved in various cancer as well as developmental disorders like Noonan Syndrome. Here, the authors show the important role of Shp2 in physiological lymphatic development in zebrafish using various Shp2 mutants. This promising manuscript, however, needs some adjustments and further clarifications.

      Results section:

      • Transmitted light images of ptpn11a-/- ptpn11b-/- embryos are not consistent throughout the figures. Larvae in figure 1 is particularly severe compared to images of the same line at 5dpf in the rest of the article (ex. Supp fig1 c, Supp fig4 c&l). Authors should have a consistent representative images. Was there a range of phenotype severity in this model ? Additional phenotype details and quantifications should be included about this double knockout model.
      • Line 165-167 : "Loss of functional Shp2a in ptpn11a-/- ptpn11b+/+ embryos induced a pleiotropic phenotype from 4 days post fertilization (dpf) onwards (Figure 1a-d) and was previously shown to be embryonic lethal". Line 178 : "Wild-type siblings and single mutants showed normal lymphatic vasculature...". There is a discrepancy between these 2 sections because one of the single mutant is embryonically lethal. What was the cause of lethality in this model and was it vascular-related ? Could the authors provide more detail about that ?
      • Authors managed to create various mutant zebrafish model crossed with the double transgenic flt4:mCitrine;flt1:tdTomato. In the double mutant, it is surprising to see an important decrease in the tdTomato arterial expression. Please choose a more representative image or add further explanations.
      • Authors had shown clear defects in the zebrafish model in figure 1. It is confusing since zebrafish were imaged at 4dpf (line 176) but figure 2 shows images at 4dpf whereas the TD is fully visible and developed at 5dpf. Authors should correct that or show both set of images at 4 and 5 dpf (one can be placed in supplementary). Also, text refers the presence of TD at 5 dpf (line 184-185) and correlated quantification (figure 2e) whereas images from figure 2 are from 4dpf fish.
      • Line 167 & 173: authors mentioned embryonically lethal model without explaining how old the larvae were, could you please add the information.
      • Authors claim that no significant lymphatic deficiencies were observed in the single Shp2a or Shp2b alone. Is this result due to compensatory mechanisms from one isoform to the other ? Further molecular quantifications such as qPCR or Western blot could be performed in both single mutant to characterize this phenomenon.
      • Figure 3 - the authors show differential development of the head vasculature. It would be consistent with the rest of the figures to keep the same labelling and colors rather than black and white images. Authors nicely added figure 3c and 3f as great schematic, it would be helpful to highlight all of them in the zebrafish images (ex. BLEC) and add different colors of arrows for each structure. Adding single mutant images as supplementary figures would be important to confirm that there are no significant defects. Measurements and quantification should be performed to validate the authors claim of missing and impaired lymphatic structures. Could the authors provide details about the vascular vessels of the head, is there any consequence in the blood vasculature ? Additionally, using a nuclear line or a nuclear staining is essential before making any conclusion about lymphatic cell population abnormality.
      • Figure 4 - Authors performed rescue experiments with injection of mRNA to demonstrate that the lymphatic KO phenotype was due to the lack of functional Shp2. Successful mRNA injection and so Shp2a/Shp2b increased expression should be confirmed using qPCR to validate the experiment in the first place. Representative images correlating with quantifications should be added in the figure to support the authors results.
      • Figure 5 - Authors should perform experiment with a nuclear line or a nuclear staining in the fish lines before making any conclusion about the number of PL cells. Additional clarifications about the methods of quantification should be included. The authors should count the number of segments/missing segments instead. Individual values with standard deviation should be shown in the graph instead of the total mean value and standard variation and should be specified in the figure legend.
      • Figure 6 - Time-lapse imaging shows aberrant sprouting in the double mutant compared to control larvae. However, it is not clear if that process is just delayed or completely impaired in the mutant : time-lapses experiment should be performed in later stages. It seems that the chosen time-points images are different from the wild-type and the mutant groups, it would be best to have the same time-point to highlight the difference between the two groups. Authors affirm that vISV formation is unaffected in the double mutant larvae, however, it is hard to confirm that statement with black and white images and supplementary movies. Raw confocal images and movies should be included instead to distinguish lympho-venous and arterial structures.
      • Figure 7 - Figure 7d does not correlate with previous imaging included in figure 2, in fact, fluorescent expressions appear inverted between the two figures. Please standardize this as they are not comparable. Quantification of the percentage of veins may not be the best parameter to investigate the normality of the vISV. Measurements of the diameter of the vISV would be more relevant. Individual values with standard deviation should be shown in the graph instead of the total mean value and standard variation and should be specified in the figure legend.
      • Figure 8 - Authors have analyzed flt4 and vegfc expression in the mutant embryos to further characterize Lymphangiogenesis processes in the model. Fold change expression of flt4 appears to be decreased in the double mutant compared to control. It would be useful to also quantify it in uninjected and ptpn11a+/- ptpn11b-/- groups as additional appropriate control groups. Images of ptpn11a+/+ ptpn11b+/+ embryos should be added. Lack of consistency between images and quantification are confusing. Considering that quantifications in other figures were performed in a high number of larvae and only 3 were included in this figure in the double mutant group, it would be important to increase the number of ptpn11a-/- ptpn11b-/- embryos for this experiment. To confirm that vegfc expression is normal, fold change expression should be included as performed for flt4 expression. Figure number is missing.
      • Figure 9: A different background line was used for this figure (fli1a:eGFP;kdrl:mCherry vs flt4:mCitrine;flt1:tdTomato), could the authors explain the purpose of this change and add a brief experiment to confirm the findings and phenotype do not change from one line to another. The overall purpose of this set of experiment is not very clear, maybe one or two sentences of transition as well as rephrasing parts of this section could help better understand the objective and results.
      • Figure 10 - Correlating zebrafish data with human disease is very interesting and highlight the importance of this work. The authors characterize the effect of NS and NSML variants on morphological and lymphatic defects in zebrafish embryos and find that these variants significantly rescued anomalies in double mutant larvae. Since these variants have opposite effects (increase signaling activity in NS and decreasing activity in NSML), authors should add a few words about how two opposite variants could have the same outcome on the zebrafish model. It may also be helpful to include information about these diseases in the introduction, including the lymphatic complications.

      On supplementary figure 4, double mutant expressing Shp2a A462T fish seems to develop edema. Similarly to figure 8, on all supplementary figures, data were collected from only 3 larvae per group in some groups (2 in supplementary fig 2l) is weak considering that this in vivo model allows to generate a very high number of embryos. Authors should increase the number of larvae per group to reach at least N=10/group to be more robust. Line 357 "... was observed more frequently in Shp2a-D61G injected double mutant embryos" this statement should be supported by the appropriate quantifications and statistical analysis. Line 361-362 " (cf. Figure 4, 10b)" incorrect typo?

      Materials and Methods section:

      Overall, this section needs significant clarifications considering the amount of work and data that have been collected. Additionally, each reagent, material, solution, objective, need to be rigorously referenced with reference number and supplier name. Each software should also have the version specified and be correctly cited (ex: ImageJ software version 2.14.0/1.54f. and reference: Schneider, C. A., Rasband, W. S., & Eliceiri, K. W. (2012). NIH Image to ImageJ: 25 years of image analysis. Nature Methods, 9(7), 671-675) .

      • Constructs, mRNA synthesis : Were the sequences validated ? If yes, how? Please explain.
      • Microscopy : Precise references of the objectives that were used to capture images.
      • Quantification: Please specify how all quantifications were made. How figure 5e and 7e were collected?
      • Statistical analysis: Specify how data are expressed (ex. Mean {plus minus} s.e.m). The authors have made a serious confusion in choosing the statical tests. Differences between the experimental groups should be evaluated with the use of the Mann-Whitney test only when two groups are compared. Differences between three or more experimental groups (your case in this paper) should be evaluated with the use of an analysis of variance test (ANOVA), followed by a Tukey-Kramer post hoc test when the results were significant (P<0.05). It is crucial that further investigation should be made regarding the accurate statistical analysis to perform to avoid any false negative (or false positive) results.

      Suggestions on additional supplemental figures :

      • Beginning of introduction gives an impression of a review article about vascular development in larvae, authors should shorten it and/or add a supplementary schematic to support this long description.
      • Alignment of the different proteins of the study both in human and zebrafish to show homology
      • Schematic of protein domains, binding domains and location of variants

      Significance

      SHP2 is an adaptor protein that plays a critical role in regulating the RAS/MAPK signaling pathway. Dysregulation of this pathway has been implicated in various cancers and developmental disorders, including Noonan Syndrome. In this study, the authors demonstrate the essential function of Shp2 in physiological lymphatic development in zebrafish by examining multiple Shp2 mutant models. This promising manuscript, however, needs some adjustments and further clarifications.

      I believe the appropriate audience for this research is specialized - primarily scientists and researchers working in basic biomedical research, particularly in molecular biology, developmental biology, and signaling pathways. The study's focus on zebrafish models and the mechanistic role of Shp2 in lymphatic development positions it within the scope of fundamental biology rather than translational or clinical application, though it has relevance to both.

      As a member of a vascular malformations laboratory, my research focuses on advancing biomedical research through an integrative approach combining in vivo research, molecular biology, translational medicine, and public health. More specifically, my current work focuses on specific genes causing complex lymphatic anomalies and drug discovery using zebrafish models.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Woutersen et al. describe the effect of single and double knockouts of the zebrafish SHP2 orthologs ptpn11a and ptpb11b. Although some effects of single deletion of ptpn11a are observed, compound deletion results in profound ablation of VEGFR3 (flt4 in zebrafish)-dependent but interestingly, not Tie1-dependent lymphangiogenesis. Rescue experiments with genes encoding WT and mutant forms of SHP2 indicate that intact SH2 domains, PTP activity, and C-terminal tyrosines are required. They also observe differential rescue by the zebrafish analogs of Noonan syndrome (NS) and Noonan syndrome with multiple lentigines (NS-ML) mutants.

      Overall, this is a comprehensive analysis of the effects of WT and mutant SHP2 in lymphatic development in zebrafish. I support its publication with minimal revisions addressing the points below.

      1. For the general reader, it would be helpful to include (in the Supplementary Materials or in Fig. 1) a diagram showing the steps in lymphatic development described in the Introduction that shows the position of the various structures that are subsequently referred to only by abbreviations.
      2. For several figures, there is no statement of what the arrowheads and asterisks point to either in the text or figure legends (e.g. Fig. 2, Fig. 5, Fig. 7). Also, Fig. 6 has "arrowheads", not "arrows". Please check all figure legends carefully to ensure that they fully describe the results shown).
      3. In the legend to Fig. 1, the authors state that ptpn11a-/- embryos have a "slim" phenotype. How was this assessed-and can it be quantified?
      4. In the experiments shown in Fig. 6 (and Supplemental movie 1), the authors show that initial sprouting occurs in double mutant embryos, but the sprouts are unable to connect to an aiSV. There are clearly sprouts in the double mutant embryos shown, but there appear to be fewer of them. Do normal numbers of initial sprouts form?
      5. If possible, the authors should show immunoblots for all the rescue experiments to convince the reader that each construct was expressed appropriately.
      6. The finding of incomplete, or in the case of ptpn11D61G, lack of rescue of lymphangiogenesis by RASopathy-associated mutants is particularly interesting. Have the authors looked at why this is so-i.e., does sprouting occur in D61G-reconstituted embryos? Is migration then blocked or accelerated? Is fusion to aiSVs defective? Although not necessary for the current publication, such information would certainly strengthen the paper. Also, I am not sure that I agree with the authors' statement that the two NS-ML mutants rescue equally to WT; A462T, in particular, is at least nominally less effective and if the n was higher, it might well show statistically lower rescue. The authors should consider tempering this statement.
      7. In the Discussion, the authors reference recent papers on lymphatic defects in NS patients. Although there is no harm in citing these papers, lymphatic abnormalities have been noted in NS patients since the initial descriptions of the syndrome. Either those papers or a review should be cited as well.
      8. The authors might want to note that peripheral edema has been universally associated with SHP2 inhibitor treatment in patients.
      9. Also, why do the authors think that Tie1 signaling does not require SHP2? It would be interesting to note for the reader that SHP2 has been reported to bind to activated Tie1 and discuss anything known about SHP2 requirements for Tie1 action in mammalian systems.

      Significance

      Thie is a comprehensive study of the role of SHP2 in lymphatic development, using zebrafish as a model. Although descriptive, this paper is important because mutations in SHP2 are associated with lymphatic abnormalities and SHP2 inhibitors cause lymphedema. Also, the unique features of the zebrafish system allow the authors to define the steps and signaling pathways defective in these models.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Lymphatic vessels drain tissue fluid, absorb lipids, and traffic immune cells. Recent studies on adaptive immunity have identified lymphatics as a potential key target to treat inflammation-associated diseases. In this context, studies on lymphatic sprouting, i.e. the process by which lymphatics expand, are timely. Although Zebrafish lymphatics are somewhat different from mammalian lymphatics, still, the zebrafish has been a useful model for the identification of the key players regulating lymphatic vessel growth, thus, presenting potential targets for pre-clinical studies.

      Woutersen et. al. have studied the shp2a and shp2b douple mutant zebrafish and identified a requirement for shp2 in lymphatic vessel formation 3-5 days post fertilization. The authors state that the shp2 is required for migration and differentiation of the future lymphatic vessels but not the formation of the venous intersegmental vessels (in contrast to other relevant genes, such as vegfr3). The phenotype is rescued by the expression of wild-type but not mutant shp2.

      Major comments:

      The authors use shp2 deleted strains, live imaging and mRNA rescue experiments. The results, as such, are convincing and the reporting is accurate, allowing reproduction of the experiments. Still, some of the conclusions are not fully backed up by the presented results and would need further experimentation as outlined below:

      1. The other "lymphatic vessel mutants", such as vegfr3, vegfc, and grb2, also cause blood vessel phenotypes, i.e. have an effect on venous intersegmental vessels. The authors state that the shp2 mutants are the first ones to have a lymphatic vessel-specific phenotype. Authors should discuss whether this is due to maternal contribution, i.e. long maternal shp2 mRNA or protein half-life? To back up the statement, authors should investigate later angiogenesis events (developmental or induced) to show that shp2 is not required.
      2. To show that shp2 has a lymphatic endothelium autonomous role, the authors show that the vegfc mRNA expression is not altered. Authors should quantify the in situ signals (vegfc and vegfr3) and use non-specific probes to show the level of non-specific staining. It is still possible that shp2 would have a lymphatic endothelium-independent role, for example, in Vegf-c processing. Authors should discuss this or delete shp2 in an endothelium-specific manner. Authors should also stain, use in situ hybridization or qPCR (of extracted flt4 reporter-expressing cells) to show that shp2 is expressed in lymphatic endothelial cells.
      3. Authors highlight lymphatic endothelial cells and precursors with flt4 (vegfr3) reporter. Furthermore, authors write "a pivotal role for Shp2 signaling in the migration and differentiation of lymphatic endothelial" but do not provide any evidence for the differentiation expect the presence of flt4 (vegfr3) reporter expressing cells. To use a second method for detecting lymphatic vessels and to investigate the differentiation, the authors should show and quantify Prox1 expression in PCV endothelial cells prior to sprouting and in migrating future lymphatic endothelial cells.
      4. SHP2 has not been linked to VEGFR3 earlier, but has been shown to control VEGFR2. However, it is not obvious whether SHP2 is a positive or a negative regulator of VEGFR2. Here, authors should try to stain pErk in sprouting control and shp2 deleted cells, similar to their previous study (Mauri et al. 2021), to show the effect of shp2 loss on the growth factor receptor downstream signaling.
      5. Reporting the sample numbers: In most of the experiments/figures, the authors do not have sufficient information. The number of independent experiments and biological replicates should be shown for each, even representative, experiment. Data should always be derived from more than one independent experiment.

      Minor comments:

      1. P.13 rows 269-271: "In addition, we observed normal perfusion and blood flow in the established vISV connections of the ptpn11a-/-ptpn11b-/- embryos and their siblings, suggesting that Shp2 is dispensable for the formation of vISVs.". The authors should show all the data mentioned in the manuscript. If this is shown in a provided movie, please, indicate which one.
      2. Figure legend 6: change "arrow" to "arrowhead".

      Significance

      The current manuscript is focused on the characterization of the shp2 mutant embryo phenotype and the rescue experiments. Upon completion of the above-mentioned experiments, the manuscript presents shp2 as a novel regulator of lymphatic vessel formation/lymphatic endothelial cell survival. As such, this notion is quite isolated, since there is no biochemical evidence of, for example, VEGFR3-SHP2 interaction. Broader impact (and audience) would be reached if the authors could show the molecular mechanisms governed by Shp2. Now, in the absence of this data, the impact is moderate. Still, lymphangiogenesis researchers would find the results interesting, thus potentially opening new avenues.

      Reviewer's field of expertise: Lymphatic endothelium.

      No expertise in zebrafish.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors use c.elegans intestinal cells to understand the molecular mechanisms underlying the transition between canonical embryonic cell cycles and the single endomitosis cycle that happens in Larval 1 stage, when daughter nuclei are separated but cytoplasmic division does not occur, forming of a binucleated cell. They first show that intestinal cells do not form a midzone after DNA segregation, and lack Zen4/Mklp1 protein expression. They also show by RNAfish that endomitotic cells have reduced mRNA levels of several genes essential during regular cytokinesis, such as Zen4/Mklp1, Cyk4/RacGap1 and Spd1/Prc1. They then explore by single cell RNAseq the transcriptome of intestinal cells as compared to regularly cycling ones, and reveal that not only cytokinesis genes are downregulated in L1 intestinal cells, but also many G2/M genes. Using transgenic animals, they show that the protein level of 2 of these essential mitotic genes, air-2/aurB and Plk-1/Plk1, are not affected. They then explore the function of the conserved DREAM complex, which is known to regulate G2/M gene expression in several animal models, in the intestine lineage. Using RNAi and temperature sensitive alleles of the DREAM components, they show that the DREAM complex does participate in zen-4/Mklp1 transcriptional repression. However, the low expression of zen-4 observed in L1 intestinal cells of DREAM mutants is not sufficient to promote cytokinesis. Moreover, DREAM mutants analysis reveals that the DREAMS complex acts during embryogenesis to restrict canonical cell cycle number, and during larval stages to maintain binucleation of the intestinal cells.

      Major comments:

      The downregulation of the transcription of cytokinetic genes (section 2 and fig 2) is convincing. However, the authors only mention genes that show a downregulation. This could be interpreted as if transcription in general is downregulated. Thus, a control gene that do not show downregulation should be presented on this figure (such as rho-1, presented in fig3). Also, would it be possible to express zen-4 with a ubiquitous promoter to test if remains expressed in intestinal cells. That would clearly show that the regulation is transcriptional and not post-transcriptional (the protein looks completely absent).

      Section 6A-C showing that the DREAM complex represses zen-4 transcription is convincing but a few results should be detailed. For example, the authors quantify the percentage of GFP positive intestines (GFP as a readout of zen-4 transcription) in several DREAM components RNAi (fig 6A-B). To evaluate the ability of the DREAM to repress zen-4 expression, the quantification should also show the percentage of GFP positive intestinal cells. Was the quantification of zen-4 mRNA shown in Fig 6C done on all the intestinal cells of the different strains?

      Section 6D: To test if the loss of DREAM is sufficient to induce cytokinesis in L1 intestinal cells, the authors count the number of cells in L3 and newly hatched L1. These counting, although giving clear results, are not the most obvious way to test the presence vs absence of cytokinesis. Is it possible to perform live imaging on the zen-4::GFP positive cells (in efl-1RNAi for example) to follow their ability to perform cytokinesis?

      Section 7: the authors show that the loss of lin-54 leads to 30% of uni-nucleated cells, and 2 or 3% of cells with more than 2 nuclei (compared to control where 100% of L3 cells are strictly binucleated). Then the authors focus on the origin of these 3% of cells with more than 2 nuclei and conclude that they originate from a switch from endoreplication to endomitosis. They entitled section 7 accordingly to this conclusion. Although this conclusion is valid, the authors don't investigate the origin of the phenotype of the mono nucleated cells (which represent a big fraction of the cells). Do lin-54 depleted cells skip the endomitosis step and start endoreplication from L1? This would suggest a role of the DREAM complex to promote endomitosis, and not only to "regulate the cell-cycle switch from endomitosis to endoreplication". Can the authors follow the cell cycle of lin-54 depleted cells over several cycles?

      Minor comment:

      Fig1A: This very nice scheme, that allows non specialists to understand the developmental aspect of the intestinal cells. However, it would benefit to have the time indicated as hours (could it be that the cell cycle duration influences the ability to perform cytokinesis?)

      Section 3: The authors perform RNAseq on L1 intestinal cell and found that most G2/M genes (84%) are downregulated compared to canonically cycling cells. They focus on few kinases that are important for mitotic entry. It may be interesting to mention/discuss the RNA levels of late mitotic phosphatases as well.

      Section 4 is entitled "mitotic protein abundance...". The author did not quantify PLK-1 and AIR-2 protein levels. This should be done if "abundance" is in the section title.

      Section 6 is entitled "...in the repression of cytokinesis genes...". However, the authors only study zen-4. The title should be changed accordingly.

      Section 6: it is striking that the loss of lin-54 has only a very minor effect on zen-4 expression, while it is the only member of the DREAM complex whose loss leads to an excessive proliferation in embryos. Can lin-54 act independently of the DREAM complex?

      Section 6 and 7: What is the expression level of the DREAM complex members in the intestinal cell lineage? (optional)

      Referee cross-commenting

      I agree with all the major points raised by the other reviewers. In particular, it is really important to show that cell cycle genes are specifically down-regulated in intestinal cells.

      Significance

      This article explores the molecular mechanisms of a particular type of cell cycle where no cytokinesis happens: the endomitotic cycle. This type of cycle exists in many animals, but is not well characterized. Therefore, studying its mechanisms, in particular in a developmental context, is important. The results presented here will be of interest to the large community of scientists working on cytokinesis or atypical cell cycles. This study uses sophisticated C. elegans genetic experiments, and beautiful single molecule FISH to analyze cytokinetic gene expression of the intestinal cells in several background. The authors convincingly show the downregulation of those genes in endomitotic cells compared to cells with a regular cell cycle. Moreover, they provide strong evidence that the transcription of Zen-4 is negatively regulated by the DREAM complex. However, the expression of zen-4 in the DREAM complex mutants is not as high as cells having a regular cell cycle, and the loss of DREAM components is not sufficient to induce cytokinesis. One of the limitations is the inability to overexpress the cytokinetic genes in the intestinal cells to test if this would be sufficient to induce cytokinesis as many genes might be required. A function of the DREAM in regular embryonic cycles and in later larval stages is also presented, but could be developed further if live imaging was used.

      As a drosophila cell cycle scientist, I have little expertise in transcriptomic and chromatin analysis, and was not fully able to evaluate these aspects of the work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The work by Barrull-Mascaró and colleagues investigates how polyploidy is established in the C. elegans intestine, where cells have an invariant cell-cycle pattern, characterized by one endomitosis cell cycle at the end of L1 followed by an endoreplication cycle at the end of each larval stage (L1 to L4). They find that cells undergo endomitosis but fail to initiate cytokinesis. This is due to the inability to assemble a central spindle, which arises from a DREAM-driven transcriptional repression of central-spindle genes.

      The authors use live imaging to observe L1 endomitosis and observe the complete absence of cleavage furrowing (Figure 1B) and of a central spindle (1C). Moreover, endomitotic cells express low levels of ZEN-4MKLP1 protein (a component of the centralspindlin complex, 1E), and low mRNA levels of centralspindlin components and contractile ring regulators (2A-C). The reduced levels of ZEN-4MKLP1 are shown to be due to reduced transcription, which is initially activated and then silenced as embryos progress through development.

      Genome-wide mRNA sequencing confirmed the lower expression of cytokinesis genes (zen-4, cyk-4, spd-1, ect-2 and nmy-2) and other G2/M genes in intestinal cells compared to other 4n cell population (enriched for germline, Q-cell lineage). G1/S genes were also downregulated in endomitotic cells, albeit to a lesser extent, and without being downregulated at the protein level. Interestingly, transcriptionally repression of cytokinesis genes is impervious to the induction of additional M-phases forced through the overexpression of CYE-1, CDK-2AF, and CDC-25.1GF.

      Finally, the authors show that downregulation of G2/M genes in intestinal endomitotic genes is not linked to common markers of epigenetic silencing (Figure 5). It depends, at least in part, on the activity of the DREAM complex (Figure 6). However, additional mechanisms are in place to inhibit central spindle formation and cytokinesis in endomitotic cells, since DREAM inactivation is insufficient to induce cytokinesis.

      Major comments:

      Can the authors make a stronger case for cell-cycle related genes being selectively downregulated in intestinal endomitotic cells? What about genes involved in the DNA-damage response, apoptosis, or other housekeeping cellular functions? My point being - RhoA aside - is there a more generic downregulation of transcription in endomitotic cells?

      Minor comments:

      Figure 2B-C: What does N represent? The number of animals or of independent experiments? Please clarify. Figure 3B: it is my understanding that cells were labeled with mCherry and Hoechst. I am confused as to what the GFP signal refers to? Figure S2: How do the authors explain the double Hoechst peak in the mCherry+ population? Is that a 4n and a sub-4n population? "...we generated a strain carrying an intestine-specific transgene expressing cye-1 (cyclin E) and a gain-of-function allele of cdk-2 (CDK2), which have previously been shown to force ectopic divisions in mammalian cells as well as in C. elegans (39-41). By examining newly-hatched L1 larvae, we found that intestinal overexpression of CYE-1 and CDK-2AF during embryogenesis resulted in many additional intestinal cell divisions, but also gave rise to binucleated cells, suggesting endomitosis can already occur during embryogenesis...": it is unclear to me what this experiment is meant to show, and what is the evidence of endomitosis already occurring during embryogenesis. What were the authors trying to enforce when overexpressing CYE-1 and CDK-2AF? Just the appearance of excess number of cells? And is the evidence of endomitosis already occurring during embryogenesis the fact that newly hatched larvae should not have undergone endomitosis yet? Page 12: mention that animals are zen-4p::NLSsfGFP before "Strikingly, 50% of efl-1 RNAi animals and 85% of dpl-1 RNAi animals showed detectable nuclear GFP expression in their intestinal cells" The paragraph "The DREAM complex is involved in the repression of cytokinesis genes during endomitosis" is very long and filled with complex information. I strongly believe it could benefit from restructuring, and from including additional information that can help the logical flow for a non-C. elegans readership. In endomitotic cells, CDK activity is still sufficiently elevated as to ensure mitotic entry. However, CDK activity disrupt the DREAM complex. How do the authors reconcile this observation with the role of the DREAM complex in suppressing cytokinesis in endomitotic cells?

      Significance

      The authors define a clear gap in knowledge that they set out to fill: define the mechanism uncoupling nuclear division from cytokinesis, which are normally initiated synchronously by APC activity. They contextualize this gap in knowledge with the evidence present in the literature about transcriptional down-regulation of cytokinesis genes as a strategy cells can use to skip cytokinesis.

      Overall, I find that the authors claims and conclusions are supported by the data. Within the limits of my knowledge, the authors appear to contextualize their work within the current literature and are upfront about the limitation of their study throughout the presentation of their results and their discussion. The authors use several state-of-the-art techniques (Genome-wide mRNA seq, chromatin immunocleavage sequencing) and convincingly show a role for the DREAM complex in regulating expression of G2/M genes. With some modifications to make the article more accessible to a non-C. elegans audience, the article can be of interest to a broad readership.

      My expertise encompasses polyploidy and cell death.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The study by Barrull-Mascaró et al. uses C. elegans intestinal cells as a model to investigate the mechanisms underlying endomitosis. The authors find that endomitotic cells fail to form a central spindle. By comparing the transcriptomic profiles of endomitotic intestinal cells and seam cells which do not display endomitosis, they show that the expression of many cytokinesis regulators is repressed in endomitotic cells. Furthermore, they reveal that this repression is in part mediated by the DREAM pathway. Overall, their findings strongly suggest that the transition to endomitosis is a switch that depends on the transcriptional regulation of cytokinetic genes.

      Major comments:

      The key conclusions are convincing. The methods and data are clear and rigorous and the manuscript is very well written. Most of the quantification are shown. We have some suggestion that could improve the manuscript

      1. Regarding smFISH and the mRNA spots quantification (e.g., Figure 2A,B for zen-4 mRNA), although the RNA seq and protein fluorescence data are consistent with low zen-4 mRNA levels, we suggest that the authors should perform a negative control to validate the quantified mRNA signal. For example, they can perform ZEN-4 RNAi, and assess whether the detected smFISH dots or cytoplasmic signals disappear in early embryos, as shown in Fig. S1A.
      2. In Figure 4G, is the ectopic division of seam cells accompanied by a higher level of zen-4 mRNA compared to WT condition? The key question is whether the induction of cye-1, cdk-2, and cdc25.1 is sufficient to induce zen-4 or other cytokinesis gene expression in cell types other than intestinal cells.
      3. This is an optional point. A general question is whether the DREAM complex specifically represses multiple cytokinesis genes beyond zen-4. This could potentially be addressed by performing smFISH in relevant mutants. Additionally, is the role of the DREAM complex restricted to intestinal cells, or does it also play a broader role in seam cells? For example, this could be tested by measuring zen-4 mRNA levels in seam cells and assessing whether seam cell numbers increase in larvae, analogous to the increased intestinal cell counts shown in Figures 6D and 6E.

      Minor comments:

      1. In Figure 1 and the corresponding results text, the authors state that "Many of the proteins needed for central spindle assembly, such as Aurora B and Cdk1.... it is unlikely that these proteins are absent during endomitosis." Indeed, AuroraB and PLK-1 localization and intensity were presented in Figure 3G. We suggest moving these results to Figure 1 to improve the story flow and then re-mentioning them later when discussing the RNA-seq data. In addition, to strengthen the conclusion that "We did not notice apparent changes in PLK-1 abundance or localization during endomitosis M phase", the authors could provide quantification of AuroraB and PLK-1 in M-phase.
      2. Questions can be addressed by discussion: In Figure 4H and the corresponding text, the authors state that "for these experiments, we induced ectopic M phases in early L1 stage animals by heat-shock", whereas in Figures 4C-G the induction was performed in L2 worms. Could the authors clarify the rationale for using different developmental stages for these experiments? Additionally, if the same L1 induction protocol were applied, would they expect to observe more intestine cells as shown in Figure 4C,D?
      3. Questions can be addressed by discussion: It is intriguing that lin-54 RNAi causes such a strong phenotype in Figure 6D, yet its depletion leads to only a minor increase in zen-4 promoter signal in Figures 6A,B. Could the authors discuss possible explanations for this discrepancy? Furthermore, would it be feasible to test co-depletion of lin-54 with efl-1 as an example, to assess whether such combinatorial knockdowns lead to a stronger induction of promoter activity in Figures 6A,B?
      4. Regarding if CDK1 in C. elegans also regulates the DREAM complex, is there a tagged CDK-1 strain to see if it is rapidly degraded in intestine cells but not in seam cells for example? Or is there a similar approach to induce CDK-1 ectopic expression to assess if there is more canonical division in intestinal cells (This part it OPTIONAL).
      5. This is an optional point: Since mRNA levels do not always reflect protein levels, such as the case for PLK1 and AIR2, the authors could also check if the other key central spindle proteins are downregulated as ZEN-4, if there are available fluorescence tagged strains.
      6. Mislabeling of Figure 3G and H AIR-2 and PLK-1 and in the corresponding text.
      7. In the text statement "had mononucleated intestinal cells and did not show any signs of M-phase entry (Figure 4G)", I believe it means Figure 4H.
      8. It is interesting that there is discrepancy of downregulated rho-1 in RNA seq Fig s4B and upregulated mRNA spots in M phase Fig 3F. The authors could comment on that.
      9. The authors should also provide the gene list of their RNA-seq for data availability.
      10. In some key experiments, although the sample size (n) appears sufficient, only two biological replicates (N) are reported. It may be helpful to clarify how reproducibility was assessed in these cases. In particular, please indicate whether a consistent trend was observed across the biological replicates. Additional replicates could strengthen the conclusions, depending on the journal's guidelines.

      Referee cross-commenting

      Overall, we agree with the points raised by the other reviewers. They highlight several important areas where the findings could be strengthened and the interpretation clarified. Addressing these questions will help the authors present their story more clearly.

      Our main concern is the lack of a suitable control for mRNA spot quantification. This could be addressed relatively easily by performing zen-4 RNAi as a negative control.

      Other issues can likely be resolved through textual revisions and/or by adding a limitations section to the discussion. For instance, the authors should phrase their conclusions more cautiously if they have not quantified Aurora B and PLK-1 levels, and similarly soften their conclusions about cytokinesis if only zen-4 is analysed.

      Significance

      This study on the repression of cytokinesis genes provides valuable insights into the mechanisms underlying both canonical mitosis and endomitosis. The findings are likely to attract significant interest from researchers in the fields of embryonic development and cell cycle regulation.

      My expertise: cell biology, cell division, embryo development

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      1) Summary

      This study investigates the mechanochemistry of Arp2/3-mediated branched actin networks at the level of individual branch junctions under load. Using microfluidic single-filament/branch force assays (including constant-force flow and open-chamber imaging) the authors quantify debranching, re‑nucleation, and mother- vs daughter‑interface stability across nucleotide states of Arp2/3 (ADP-Pi, ADP, and an ADP-BeFx proxy for ADP-Pi). They further test effects by two branch regulators (GMF and cortactin). Key findings include: (i) ADP-Pi and ADP complexes share similar force dependence but differ markedly (~20×) in intrinsic dissociation rate; (ii) phosphate turnover on the Arp2/3 complex is rapid ii) affinity for Pi drops when Arp2/3 loses its daughter filament; (iii) quantification from model fits uncovers large stability differences between daughter and mother interfaces of the Arp2/3 complex; (iv) extraordinary high stability of ADP-Pi-like Arp2/3 on the mother filament; and (v) distinct effects of GMF and cortactin on force‑dependent stability. Overall, the work combines technically demanding measurements with mechanistic modeling to probe how nucleotide state and regulatory factors tune branch mechanics.

      2) Major comments:

      1. Low force kinetics and completeness of survival curves (Figure 1). "For all forces, the surviving curves exhibited a clear single exponential behavior...." While the data can be fitted to monoexponential decay curves, data at low forces is clearly incomplete. >90% of branches have not dissociated by the end of the experiment. For the particular data shown in 1C (F00nN, n=60 total branches) it means that the time information is coming from

      Essential; experiment might already be performed. Otherwise straightforward to do (weeks time).

      In figure 1B, we indeed show a Survival curve for ADP-Arp2/3 complex branch dissociation at 0 pN up to 900 seconds. As now shown in updated supp figure S2, the data was in fact acquired for at least 5000 seconds for ADP-Arp2/3 and ADP-Pi states (N=2 repeats for each condition, with n = 60 and 90 branches for ADP-Arp2/3 branches, and 90 and 132 branches for ADP-Pi-Arp2/3 branches). The debranching rates reported in the initial submission were already obtained by fitting the surviving curves over the whole duration of the experiments.

      1. Stability Analysis (Figure 4). I can follow much of the arguments presented in the stability analysis of the daughter vs mother interfaces, which is in principle extremely interesting! However, there are some concerns here:

      i) The authors emphasize the zero force ratio derived from fits (which is linked to the stability difference of the two interfaces in the absence of force) despite this being only weakly constrained by data. Intuitively in the model, the stability difference should grow to very large values as the re-nucleation ratio approaches 1 at low force. This combined with the noise in the data poses an issue in my opinion. Looking at the data and the error margin, I think that the authors cannot state with high confidence that there is a real difference between the relative stability of the daughter and mother interfaces between the two nucleotide states of the complex.

      Essential; analysis and textual revision only

      We thank the reviewer for this comment. The difference in stability between the two interfaces is strongly constrained by the shape of the branch renucleation ratio versus force curve, and its value at 0 pN. This is illustrated in the figure shown below (new Supp Fig. S8), showing the dissociation rates of the two interfaces (in ‘dashed’ and ‘point-dashed’ style) that contribute to the overall debranching rate in each nucleotide condition. Despite the limited force range at which we probed the debranching rate, the branch renucleation ratio curve informs us on which interface is the weakest, and how this evolves with force.

      We have assessed the confidence intervals of the parameters obtained from the fits, taking into account the error bars on our experimental datapoints. It seems to indicate that the simultaneous fits of the debranching rate and the branch renucleation ratio curves indeed constrain the parameters quite strongly. These confidence intervals are now reported in the main text and in the summarizing table.

      We have repeated branch renucleation experiments for ADP-BeFx- and ADP-Pi-Arp2/3 complex branches (see new figure 4C&D, and our response to the next point). We believe these new measurements allow a better assessment of the relative stability between the two interfaces for Arp2/3 complex branch junctions in the ADP-BeFx state.

      Still, we agree with the reviewer that the dispersion of the experimental data does not allow us to have a strong confidence on the crossover force and relative stability difference of the interfaces. Therefore, we have slightly toned down the way we present and discuss the differences in stability when comparing the two nucleotide states.

      ii) For ADP-Pi, the renucleation ratio essentially remains flat over the measured force range. Hence, the data can only provide little leverage to estimate both the zero force ratio and, more importantly, the differential distance to the transition state in the slip-bond model in my opinion, which will show in the crossover force. Consequently, the quoted ">100×" stability difference at F=0 and the crossover force >20pN are driven largely by extrapolation rather than direct constraint by data. Given the high number of free parameters in the model, I would anticipate that several crossover forces and differential distances might explain the data nearly equally well. Instead of loosely reporting exact number from fits, I would have hoped for some sort of sensitivity analysis, for instance relying on profile likelihoods. Also parameter values could be reported as bounds (e.g crossover force≫measured range) rather than precise point estimates. This issue re-occurs (albeit not as drastically) for the cortactin experiments (Figure 6).

      Essential; analysis and textual revision only

      As mentioned in our response to the previous point, we have repeated renucleation experiments for ADP-BeFx- (and also for Arp2/3 complex branches in the presence of 50 mM Pi) (see new figure 4C&D) to better characterize the differential distance between to the transition force. The crossover force for the ADP-BeFx state is now 13.5 pN and the ratio of the stability between the two interfaces is roughly 100 times.

      We agree with the reviewer that the dispersion of the experimental data does not allow us to have a strong confidence on the crossover force and relative stability difference of the interfaces. We have thus toned down the way we report these values. We do believe though that the difference we report between the ADP and ADP-BeFx state appears to be significant and needs to be acknowledged.

      As a side note, it has proven to be challenging to pull on branches at forces higher than 7 pN. To apply a large force on the branch junction, we need to have a high flow rate. In this case, it appeared that the height of the filaments (both mother and daughter filaments) above the surface seem to deviate from what we have established in our previous studies (Jegou et al, Nat. Comm. 2013 & Wioland et al, PNAS 2019). This may originate from the fact branched filaments have a more complex shape than an individual filament. Characterizing accurately the evolution of the branch height as a function of the flow rate and applied force would require quite extensive additional characterization, which, we believe, is beyond the current focus of this study on the stability of Arp2/3 complexes.

      iii) One important expectation from the "two slip bond" model is that branch dissociation rates should not necessarily scale mono-exponentially as they mostly do over the accessible force range of the paper. However, once the "minor" pathway of dissociation from the mother starts to dominate at high forces, rates become more force sensitive. This is nicely recaptured by the model fits in Figure S6 but deserves some explanation in the text. Otherwise, people will simply remember the "ADP-Pi is 20-fold more stable than ADP at all forces" message.

      Essential; textual revision only

      We now have rephrased the key sentences (in the Abstract and Results sections) to more clearly state that the debranching rate is not increasing mono-exponentially with force.

      In the Abstract: “Remarkably, we find that branch junctions are over 30-fold more stable when the Arp2/3 complex is in the ADP-Pi rather than ADP state, and that force accelerates debranching with similar exponential factors in both states.”

      In the Results section: “The debranching rate seems to increase exponentially with the applied pulling force, in the range of 0 to 6 pN (Fig. 1F; see more refined analysis below). This behaviour is predicted by the Bell-Evans model for a slip bond.”

      iv) One important prerequisite for the model is that isolated Arp2/3 complexes (without a daughter filament) should dissociate with equal rates from mother filaments at all flow rates. Since the Arp2/3 complex prefers mother filament curvature, forces experienced by the mother might change its off-rate. It would be good to refer to this assumption in the text and experimentally verify it. I could not find it in the paper nor in Ghasemi et al 2024.

      Essential; simple experiment (a weeks time).

      We thank the reviewer for this important comment.

      First, we investigated whether the viscous drag force, applied on the ADP-Arp2/3 complexes which remain bound to mother filaments could affect their stability. We have performed branch renucleation experiments at different flow rates but with the same pulling force on branch junctions (average force 3.9 pN) by adapting the length of the daughter filament. As shown in new supp. figure S11 (shown below), we did not observe any significant differences between ‘low’ and ‘high’ flow rates. If the off-rate of the surviving Arp2/3 was significantly affected by the flow, this would have led to a variation of the renucleation ratio with the flow rate.

      Second, we have investigated the impact of the tension experienced by the mother filament at the location of the branch junction for ADP-Arp2/3 complex branches, with the same pulling force on the branches (average 4.1 pN pulling force on branches). We have quantified the debranching rate from three groups of branches depending on their position along mother filaments. As shown in new supp. figure S12 (shown below), we can observe a small trend, where the debranching rate decreases with the tension on the mother filament at the branching point.

      Doubling the tension on the mother filament from 15 to 30 pN decreases the debranching rate by a third. Though, pairwise logrank tests performed between the survival fractions of the three binned groups do not report any statistical significant difference (all p values > 0.05). One possible explanation for this is the height of the mother filament in the microfluidics flow that increases linearly from the anchoring point to the free barbed end. As a consequence the pulling force on the branches will be higher, as branches experience faster flows.

      For these same groups, upon branch dissociation, all remaining-bound Arp2/3 complexes are exposed to the same flow rate; the branch renucleation ratios were similar. Thus branch renucleation ratio seems to not significantly depend on the tension experienced by the mother filament at the branching point.

      Similarly, Pandit et al PNAS 2020, Extended figure S1, also reported no detectable impact of the mother filament tension on the debranching rate in their assay.

      v) The force dependence of the branch re-nucleation rate (Fig 3D) has been measured previously by the same group (Ghasemi et al). While the data in the older paper has not been fitted by a model, the trend of the data in the previous paper looks conspicuously different. Are there any explanations for this? I speculate that it might be related to actin and ATP not being saturated (low-force re-nucleation rate rarely exceeds 80%) in Ghasemi et al., but it would be good to know what the authors think about this. Essential; textual revision only

      This is a good point. We have plotted the data of the renucleation ratio from ADP-Arp2/3 complex from figure 1F of Ghasemi et al, Sc. Adv. 2024 (performed at 0.3 and 1 µM actin), together with the data of the current study from figure 4D (performed at 1.5 µM actin). We feel this comparison could be of interest to the readers, and have thus integrated it in the manuscript as new supp. figure S13 (shown below).

      As expected, the branch renucleation ratio is lower with lower concentrations of actin. The experimental data points from Ghasemi et al are similarly well fitted by the branch renucleation function obtained for 1.5 µM multiplied by a scaling parameter, which reflects the fact that the branch renucleation ratio is actin concentration dependent (Fig. 6A in Ghasemi et al). This scaling parameter was the only free parameter of those fits.

      Since the branch renucleation ratio depends on the actin concentration as follows, 0.97.kon.([actin] - Cc)kon.([actin] - Cc)+koffATP-Arp2/3 , with kon = 3.4 µM-1.s-1 and koff ATP-Arp2/3 = 0.66 s-1 from (Ghasemi et al. 2024), the scaling parameter obtained by the fits give estimates of the actin concentration in these experiments, of 0.6(±0.05) and 0.9(±0.2) µM for the experiments performed at 0.3 and 1 µM respectively in (Ghasemi et al. 2024).

      1. Stability of the authentic ADP-Pi-Arp2/3 complex on the mother filament. The extraordinary stability of the isolated ADP-BeFx-Arp2/3 complex on mother filaments is surprising, especially considering that both ATP and ADP states are much more labile (Ghasemi et al 2024). I would recommend repeating this experiment in the authentic ADP-Pi state with labelled Arp2/3 complexes as a more direct readout, even if this would require working with very high phosphate concentrations.

      Essential; simple experiment (a weeks time).

      We have followed the recommendation of the reviewer and have performed new experiments using fluorescent Arp2/3 complexes for ADP, ADP-BeFx and ADP-Pi states, now displayed in new figure 5C (also shown below).

      For fluorescent Arp2/3 complexes remaining bound to the mother filament, the Arp2/3 complex - mother filament interface is ~ 100 times more stable in the ADP-BeFx state (0.0046 s-1) compared to the ADP state (0.56 s-1). We also assessed the dissociation of surviving ADP-BeFx-Arp2/3 complexes using unlabelled Arp2/3 complexes (previously in figure 4B, repeated experiment shown in new supp. figure S10), which also indicates a remarkable stability.

      The dissociation curve of surviving Arp2/3 complexes in the presence of 50 mM Pi and 200 µM ATP in solution reflects the mixture of Arp2/3 dissociating in the ADP/ATP state and ADP-Pi-Arp2/3 that can either dissociate in the ADP-Pi state or lose their Pi and dissociate in the ATP state. Despite the presence of 50 mM Pi, the rate at which ADP dissociates and ATP reloads rate is much faster than Pi binding. Fitting this survival curve with a function that accounts for the initial double populations and the evolution of the ADP-Pi population (see Methods) gives a good estimate of the Pi release rate.

      OPTIONAL: Further, but beyond the scope of the present paper, would be titrating phosphate in these experiments, which would even allow the authors to independently verify the reduced Pi affinity for Arp2/3 in the mother filament. Of note, this affinity difference is needed to satisfy detailed balance in the reaction scheme (Fig 4 D)!

      We thank the reviewer for this suggestion. High concentrations of phosphate in the buffer renders glass surfaces quite sticky in our assays. We’ve tried several different passivation strategies (BSA, PLL-PEG, K-casein, …) but none gave satisfactory results. So titrating phosphate, by going beyond 50 mM phosphate, proved to be quite challenging.

      Detailed balance, considering the two possible routes connecting the ADP-Pi-Arp2/3 complex branch junction state and the surviving ADP-Arp2/3 complex state, can be written as KPi rel.branch junction . Kdebranching ADP-Arp2/3 = KdebranchingADP-Pi-Arp2/3 . KPi rel.surviving Arp2/3.. Some of these affinity constants are not known, because of the inability to determine reverse reactions rates such as the rebinding of a daughter filament to a surviving Arp2/3. It is thus hard to determine how the affinity of Pi for Arp2/3 complex changes between Arp2/3 complexes at branch junctions and surviving Arp2/3 complexes on mother filaments.

      While we cannot determine the affinity constant of Pi for a surviving Arp2.3 complex, our data indicates that the dissociation rate of Pi is higher from Arp2/3 complexes at branch junction (koff = 0.21 s-1) than from surviving Arp2/3 complexes (koff = 0.05 s-1). This unexpected finding indicates that surviving Arp2/3 complexes adopt a conformation where the nucleotides are readily exchanged, but where the ‘back door’ for Pi release is less open. We now discuss this point in our revised manuscript.

      1. Importance of "surviving" ADP-Pi-Arp2/3 complexes. The authors show a) rapid turnover of Pi on the ADP-Arp2/3 complex in both branch- or mother filament-bound state and b) the lowered Pi affinity of the latter. Nonetheless, they emphasize the importance of long-lived "surviving" ADP-Pi bound complexes on the mother (even stated in the abstract). I understand that this fraction shows under some experimental conditions (BeFx), but unless I am missing something, most complexes should rapidly lose their phosphate and either exchange nucleotide or dissociate from the mother under physiological conditions. Please clarify or tone done.

      Essential; textual revision only

      We thank the reviewer for their remark. We have tried to clarify this aspect in the manuscript.

      As shown now with the departure rate of fluorescent surviving Arp2/3 complexes together with branch renucleation data, we show that surviving ADP-Pi-Arp2/3 complexes are quite stable on mother filaments, because they detach and release their Pi slowly, such that branch regrowth will occur provided there is actin in solution. In the absence of actin monomers, as the reviewer correctly points out, the surviving ADP-Pi-Arp2/3 will predominantly release its Pi and thus become a surviving ADP-Arp2/3 complex. We have modified the text to avoid any confusion.

      1. GMF mechanism. The authors claim that GMF "...accelerates the departure of the surviving Arp2/3 complex from the mother...". I assume that they infer this from decrease in the re-nucleation ratio. However, alternatively GMF could simply dwell on the complex, inhibiting re-nucleation without promoting dissociation from the mother. The authors should either monitor Arp2/3 dwell times directly to discriminate between these possibilities or be more cautious in their conclusions.

      Essential; simple experiment (a weeks time) or textual revision.

      In Ghasemi et al. Sci. Adv. 2024, we examined the departure of Arp2/3 from the mother filament after GMF-induced debranching using fluorescent Arp2/3. Most of the fluorescent Arp2/3 dissociated from mother filaments within the same frame as the branch, i.e. within 0.5 seconds after the debranching event, and none were visible after another second . This could be due to Arp2/3 departing with the branch or an accelerated departure after branch dissociation. In any case, this rules out the possibility that GMF would dwell on the surviving complex for a substantial amount of time without promoting dissociation from the mother.

      In the present manuscript, we now show that increasing the ATP concentration 10-fold (from 0.2 to 2 mM) is sufficient to restore the branch renucleation ratio to its level without GMF. This shows that GMF does not cause Arp2/3 to leave with the branch, but rather that it (also) acts on the surviving Arp2/3 complex, in a way that is countered by high concentrations of ATP. More specifically, it suggests that GMF accelerates the departure of the surviving ADP-Arp2/3 complex, either directly and by hindering the reloading of ATP, and that GMF does not affect the surviving Arp2/3 complex once it has reloaded ATP.

      We now discuss these two non-mutually exclusive possibilities for the accelerated dissociation of the surviving ADP-Arp2/3 complex in the manuscript.

      6.Cortactin mechanism and the "leash model". I must say that the cortactin data are the most puzzling part of the paper and hard to reconcile with what we know from structure. I was hoping to find some of this resolved in the discussion. However, I do not understand the "leash model" in the discussion section for cortactin-mediated branch stabilization: "This would explain the observed increase in branch survival compared to the absence of cortactin. As the pulling force is increased, this rebinding mechanism becomes less efficient." According to my understanding of the data, this is opposite to what happens. Cortactin only stabilizes the labile interface at elevated forces! Some re-writing might help here.

      Essential; textual revision.

      We thank the reviewer for having us think more thoroughly about the model we initially proposed. We now believe that our ‘leash’ mechanism is not able to fully recapitulate our observations in a simple and satisfactory manner.

      We now propose a much simpler model, where the binding of cortactin to the Arp2/3 complex at the branch junction simply changes the energy landscape of the Arp2/3-daughter interface without the need to invoke a rebinding of the daughter filament upon branch departure. We have updated our interpretation of the data in the Discussion section accordingly.

      Overall, our results on the impact of cortactin on branch renucleation highlights a surprising behaviour that would require further investigation to fully decipher the underlying molecular mechanism.

      3) Minor comments

      Organization: - I do not want to impose on how to best tell the story, but I felt that Fig1 A-D and Fig 2 A-B belong to one logical unit (nucleotide dependence), whereas Fig 1 E-F and Fig 2 C belong to the other (Pi binding and exchange). Perhaps consider re-organizing to streamline presentation?

      We thank the reviewer for their suggestion. We agree that it flows more naturally as suggested, and have made the changes! Thank you.

      Semantics/Typos: - Abstract: „... ADP-Pi and ADP-Arp2/3 detach with the same exponential increase as a function of force...". Increase should refer to the dissociation rate, which should be added to the sentence.

      We have corrected this.

      Results page 8: "...and the majority of Arp2/3 complexes detach from the mother filament while remaining bound to the branch at the debranching time." "Branch" should likely be daughter here, as there is no branch after dissociation of either interface.

      We have corrected this, thank you.

      Results page 13: "Exposing ADP-BeFx-Arp2/3 complex branch junctions to a saturating amount of GMF...". It is strange to imply saturation, because GMF likely simply does not bind to the complex in this nucleotide state with appreciable affinity. Suggest to change to "high".

      We have made the changes accordingly.

      Discussion page 18: "Moreover, in mammalian Arp2/3, His80 in Arp3 (corresponding to His73 in mammalian actin) is not methylated, and corresponds to residue N77 in Arp3, which is also not modified." N77 likely belongs to Arp2?

      We have made the changes accordingly.

      Discussion page 19: "We showed that Pi affinity for Arp2/3 complexes at branch junctions is around 3.7 mM (Fig. 1), a value which lies within the reported 1-10 mM Pi concentration measured in the cytosol in different mammalian cell types". Notably, this is not too different from F-actin, which should be mentioned. By this measure alone, free inorganic phosphate could also directly regulate actin filament stability!

      We now mention this and discuss that intracellular Pi can also impact actin filament nucleotide state.

      Future interest (non essential): - It would be utterly exciting (but beyond current scope) to quantify how instantaneous debranching rates evolve for naturally aging branches starting from ATP-Arp2/3 complexes!

      We thank the reviewer for this remark. It is indeed quite beyond the scope of the current study, as this would require a way to probe ATP-Arp2/3 complex branches while daughter filaments are still quite short (so pulling on them is difficult). An interesting alternative could be to use ATP analogs, such as App-NHp (aka AMP-PNP), to stabilize this state. However, some studies have mentioned that App-NHp is not very stable.

      Significance

      General assessment:

      This is a compelling and carefully executed study that delivers a clear mechanistic framework for how Arp2/3 branch junctions fail and re‑form under load. The central strength is the tight integration of state‑of‑the‑art reconstitutions with careful and original kinetic analysis. The experimental design is elegant and experiments have been carried out to a masterful standard. The figures are clear, the statistics are appropriate with some exceptions as detailed above. There are very few labs in the world that could have achieved this feat!

      A few aspects could be further strengthened, most notably the explanation and application of the "two slip bond" model as well as slightly more restraint in speculating around specific regulatory mechanisms. However, these are minor refinements that do not detract from the important contributions of the paper.

      Overall, the clearly work merits publication with high priority after revision; most requested changes are textual/analytical with very few targeted experiments, which would substantially strengthen core claims.

      We thank the reviewer for their positive evaluation of our manuscript. We hope that our responses to the detailed points above, along with the corresponding revisions of the manuscript, will alleviate their concerns.

      Advance relative to prior literature: The major novel findings of the paper are already summarized above. There is some recent work done on the subject of branch mechanics by the authors (Ghasemi et al 2024, PMID: 38277459) and others (Pandit et al 2020 PMID: 32461373), but the focus of the present work is clearly unique and the there is plenty of novel insight.

      Audience and impact: Primary audience: specialists in cytoskeleton dynamics, in vitro reconstitution single molecule biophysics, and mechanobiochemistry. Secondary: researchers in cell motility, morphogenesis and mechanobiology, physicists working on active matter and modelers studying force producing and load-bearing biopolymer networks. The results and analysis framework should inform quantitative models of branched network turnover under load and the interpretation of regulatory factor action in vivo and in cells.

      Reviewer expertise: Actin dynamics; biochemical reconstitution; single molecule approaches; biophysics.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Xiao et al examine the molecular events occurring when Arp2/3 complex-mediated actin filament branches are removed from mother actin filaments. They do this using microfluidics assay with purified proteins combined with single filament TIRF imaging of branched actin filaments with distinct fluorescent labels. The contribution of different nucleotide states of Arp2/3 complex are tested in conjunction with the relationship force exerted on the branches and regulatory protein involvement from GMF and cortactin. The data seem comprehensive and highly quantified in response to concentration, force, fraction of branches and survival times and branching rates. They find that ADP-BeFx and high phosphate concentrations (leading to the ADP-Pi state) leads to a slower debranching rate at a given level of force applied. The ability to rapidly switch the buffer gives powerful information about response times of debranching compared with other actin remodelling events. They use renucleation experiments to determine that the previous debranching event most often occurs at the Arp2/3 complex/daughter interface, showing that filaments will be ready to re-branch in the stable ADP-Pi bound state. GMF addition allows debranching of the ADP state to occur at a lower force. Cortactin acts similarly to the ADP-Pi state to increase branch stability.

      Specific comments

      The pulling force on the branches seems to arise from different flow rates in the microfluidics. Viscous drag is mentioned and I can see there is methylcellulose in the buffer. It would be helpful to have the explanation of the conversion between flow and force, even if it has been standard in previous work.

      We apologize if this was unclear: in microfluidics experiments, the buffer does not contain methylcellulose. Methylcellulose is only used for ‘open chamber’ experiments, where no force is applied to Arp2/3 branches, to maintain them in the TIRF field of excitation (Figure S2).

      To better clarify the conversion between flow and force, we have rephrased and extended the Methods section to explain how the force on the branch junction is computed based on the local flow velocity and the length of the daughter filament.

      Pg 5 - what was the motivation to titrate phosphate? It seems a stretch that intracellular Pi levels are tuning branching inside cells more than protein-mediated control (GMF or cortactin) - can the authors evidence this at all?

      We are not claiming that the level of Pi plays a stronger regulatory role than proteins. We show that inorganic phosphate tunes the state of the Arp2/3 complex, which in turn modulates the action of regulatory proteins, such as GMF and cortactin.

      Nonetheless, we do show that the contribution of inorganic phosphate is quite central as it can (1) strongly stabilize branch junctions (~30-fold decrease in the dissociation rate), and (2) tune the activity of GMF and cortactin on Arp2/3 complexes at branch junctions as well as on the ‘surviving’ Arp2/3 complexes that remain bound to mother filaments.

      We thus titrated phosphate and found that its impact on Arp2/3 complex stability is significant in the range of Pi concentration that is explored in cells. For the sake of completeness, and following a comment from reviewer #1, we now also mention the affinity of Pi for actin subunits in filaments in the Discussion, and discuss the impact of intracellular Pi on actin itself.

      Minor comments

      • In the introduction, while the structural and mutagenesis evidence is clearly stated, in other cases a bit more detail would be helpful e.g. 'biochemical studies', which referred measurement of hydrolysis rates using radiolabelling

      We have made changes to more precisely define which biochemical assays were used in previous studies.

      • Page 3 Figures shouldn't be referenced in the introduction

      We have removed the references to the figures from the introduction.

      • Page 3 slip bond behaviour needs explanation

      We now explain the concept when first using this concept in the manuscript, as follows: “The debranching rate seems to increase exponentially with the applied pulling force, in the range of 0 to 6 pN (Fig. 1F; see more refined analysis below). This behaviour of accelerated debranching with the increase of the applied force is similar to the ‘slip bond’ concept, as predicted by the Bell-Evans model of the force-dependent lifetime of the interaction between two proteins”.

      • Figure 1B seems to be a theoretical schematic which is superfluous

      We suppose that the reviewer is actually referring to figure 3B of the initial manuscript, describing the energy potential of a molecular interaction as a function of the reaction coordinate. We agree with the reviewer that it is not absolutely required and we have removed it.

      • Figure 4D is helpful, different weight lines might help even more to explain the dominant pathways

      We have made modifications to the biochemical reaction scheme in this figure (now figure 5F in the revised version). We hope we succeeded in improving its readability. Since the different paths depend on mechano-chemical parameters, there is no real dominant pathway per se.

      **Referee cross-commenting**

      Rev1 sounds like the specialist here. I can't comment on their requests. Some similar points arise between the reviewers which need addressing.

      Reviewer #2 (Significance (Required)):

      Significance

      Taking a look at references 16 and 19, I do not find it clear what is achieved differently in the current work compared to these papers and what agrees and what disagrees. If it's a species difference I might expect the two species would be analysed side-by-side in this paper.

      We thank the reviewer for this important comment. The goal of our study was not to compare the behaviour of mammalian and yeast Arp2/3 complexes.

      We now try to better explain that the motivation of the present work is to address how the nucleotide state of the Arp2/3 complex tunes actin branch mechanosensitive stability, and regulates interactions with well known Arp2/3 complex binding proteins. Most of the reactions are quantified here for the first time. Moreover, the experiments with branch junctions in different nucleotide states are done under controlled mechanical conditions, providing the first direct measurements of the force-dependence of the debranching reactions. Our detailed kinetic analysis of the full reaction scheme allows us to model the different binding interfaces of the Arp2/3 complex.

      In addition, it is worth noting that:

      1. Species matter and this is why ref 16 and 19 can give the impression to disagree on the ability to renucleate branches thanks to the stability of surviving Arp2/3 complexes on mother filaments.
      2. In ref 16 (Pandit et al, PNAS 2020) species are mixed (yeast Arp2/3 and mammalian alpha actin from skeletal muscle), likely leading to a different behaviour compared to the only mammalian protein situation we examine in our current work. In particular, with mixed species one misses the ability to renucleate, as shown in our previous study Ghasemi et al (ref 19). However, since mixing species does not correspond to anything physiological, we do not think it is worth repeating these conditions alongside our experiments.
      3. Further, the analysis carried out in ref 16 suffers from important limitations: the force was unknown (not calibrated) and the data was fitted by a model that compounded several reactions, providing only an indirect estimation of the rates, in particular at zero force. In contrast, we have worked with calibrated forces (including dedicated experiments at zero force) and we have carried out specific experiments to directly measure several rates.
      4. In ref 19 (our earlier work) we did not investigate the impact of the nucleotide state of the branch junction at all, and we did not systematically measure the dissociation rates as a function of force.

      Contrary to Pandit et al, we directly measure the difference in branch stability at zero force between ADP and ADP-Pi states and show that the ~ 30 fold difference holds true at all probed forces. Last, the force dependence of the branch renucleation success rate gives us crucial information on which of the two Arp2/3 complex interfaces ruptures first.

      I'm not understanding how the authors can distinguish effects of adding phosphate and BeFx on Arp 2 and 3 compared to effects on actin. Importantly, are possible accompanying changes in the actin filament a confounding factor?

      We have checked that the nucleotide state (ADP-BeFx and ADP-Pi versus ADP) of the mother and daughter filaments have no impact on branch stability:

      • In the experiments shown in figure 2F, where the buffer condition to which branches are exposed is quickly changed from phosphate buffer to buffer without phosphate, we observe a rapid change of branch stability. Actin subunits at the branch junction are in F-actin conformation according to recent cyroEM observations (ref. Chavani et al, Nat Comm. 2024; Liu et al, NSMB 2024). These actin subunits, initially in the ADP-Pi state, are expected to age and become ADP with a rate of ~ 0.007 s-1 (ie half-time of 100 s; ref. Jegou et al, PLoS Biology 2011, Ooosterhert et al, NSMB 2023), a much lower rate than the observed change of the debranching rate (0.21 s-1). This means that the debranching rate is independent of the nucleotide state of daughter and mother filaments.

      • In new supp. Figure S4, we show that the debranching rate is similar for ADP-Arp2/3 complex branch junctions initiated from ADP- or ADP-BeFx-actin mother filaments.

      • In new supp. Figure S9, we initially exposed branch junctions to a BeFx solution then monitored debranching and branch renucleation in our standard buffer (ie without BeFX or Pi). We observed multiple rounds of branch renucleation, the first with ADP-BeFx-actin daughter filaments, and the following with daughter filaments never exposed to BeFx. They all had the same debranching rates and renucleation success rates.

      The paper is quite specialist to read and the advance appears to be incremental. My expertise is in molecular pathways to actin regulation outside the main area of the paper.

      The results we present in this study are often unexpected, and some go counter long-standing assumptions. The regulation of Arp2/3-nucleated branches is of importance for the stability and the force-generating capabilities of many actin networks in cells. Last, most of the measurements that we present had never been done, mainly because experiments are difficult to achieve, and require specific tools to monitor several events while controlling the applied force.

      We believe our results are of broad interest as they go counter long-standing assumptions. We have rewritten the text in several instances to convey our message more clearly.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Please find enclosed the review of the manuscript "Inorganic phosphate in Arp2/3 complex acts as a rapid switch for the stability of actin filament branches" by Xiao et al.

      The authors provide a detailed investigation of how the nucleotide bound to the Arp2/3 complex affects branch stability under flow force. From a kinetic perspective, this is an elegant study with generally high-quality data, although some conclusions rest on assumptions rather than direct experimental evidence.

      We thank the reviewer for their positive feedback. We have improved our manuscript and performed important additional experiments to provide more direct experimental evidence of our conclusions.

      A key question concerns the physiological relevance of these findings. For instance, the concept of branch regrowth may not be applicable in cellular contexts, since forces by actin polymerization would displace existing branches away from sites where they generate this active forces. The authors should clarify the relevance of regrowth during active force generation by branched networks.

      We thank the reviewer for this comment. Our in vitro results indeed point to a previously unreported property of branched actin networks, i.e. the ability of Arp2/3 complexes to readily renucleate branches in the ADP-Pi state and that it does require reloading ATP within Arp2/3.

      Branched actin networks, especially the lamellipodia or endocytotic patches, do exert active force thanks to actin polymerization of the individual branches at the forefront. Though, the whole actin network is exposed to stress, and the architecture of the network (inter-branch distance, crosslink between branches, …) presumably strongly impact its mechanical properties.

      In the case of other types of branched actin networks, such as the actin cortex, myosin motor put the whole network under tension. Such pulling forces on actin branches, depending on the amplitude of the pulling force, can lead to branch regrowth, and network self-repair.

      We have modified the text to make the physiological relevance clearer.

      Additionally, all experiments employ flow conditions that branches would probably not experience in cells-notably, the flow direction in the cellular context would be reversed. Altering the flow direction relative to the branches could affect not only the relationship between flow rate and branch stability, but potentially other system properties as well.

      We agree with the reviewer that in cells branches will not experience flow conditions similar to the ones we use in our in vitro assay. Nonetheless, in cells we expect mechanical stress on the branch junction to be applied in all directions. In lamellipodia, the compressive force applied at the leading edge is expected to result in diverse local orientations of the force on individual branch junctions within the network (as explained in Lappalainen et al. Nat Rev MBC 2022). Also, branch junctions are found in the cell cortex, where they are exposed to pulling forces resulting from the action of myosin motors and crosslinkers on mother and daughter filaments.

      This impact of the direction of the flow was addressed in our previous publication (Ghasemi et al, Sc. Adv. 2024, figure 2) and, to a lesser extent, by the lab of Enrique de la Cruz in Pandit et al, PNAS 2020 (ref. 16). We reported that flow direction has a minimal effect, if any, on branch dissociation rate and renucleation ratio.

      Reviewer #3 (Significance (Required)):

      Furthermore, the study appears not to account for the mother filament (particularly its nucleotide state) or the actin subunit bound to the Arp2/3 complex. The authors should discuss why their interpretation focuses exclusively on the Arp2/3 complex rather than on the actin filaments or Arp2/3-bound actin subunit.

      We have checked that the nucleotide state (ADP-BeFx and ADP-Pi versus ADP) of the mother and daughter filaments has no impact on branch stability :

      • In the experiments shown in figure 2F, where the buffer condition to which branches are exposed is quickly changed from phosphate buffer to buffer without phosphate, we observe a rapid change of branch stability. Actin subunits at the branch junction are in F-actin conformation according to recent cyroEM observations (ref. Chavani et al, Nat Comm. 2024; Liu et al, NSMB 2024). These actin subunits, initially in the ADP-Pi state, are expected to age and become ADP with a rate of ~ 0.007 s-1 (ie half-time of 100 s; ref. Jegou et al, PLoS Biology 2011, Ooosterhert et al, NSMB 2023), a rate much lower than the observed change of the debranching rate (0.21 s-1). This means that the debranching rate is independent of the nucleotide state of daughter and mother filaments.

      • In new supp. Figure S4, we show that the debranching rate is similar for ADP-Arp2/3 complex branch junctions initiated from ADP- or ADP-BeFx-actin mother filaments.

      • In new supp. Figure S9, we initially exposed branch junctions to a BeFx solution then monitored debranching and branch renucleation in a regular buffer. We observed multiple rounds of branch renucleation, the first with ADP-BeFx-actin daughter filaments, and the following with daughter filaments never exposed to BeFx. They all had the same debranching rates and renucleation success rates.

      An important concern involves the use of KPi (inorganic phosphate). Based our experience, KPi appears to have effects beyond simply impacting nucleotide state-actin filaments seem to assemble differently in the presence of KPi. The authors should exercise caution in their interpretation of KPi-based experiments.

      Concentration of KPi (up to 50 mM Pi) did not slow down barbed end elongation rate in our experiments.

      Overall, while the technical quality and kinetic analyses are state-of-the-art, relating this work to physiological contexts remains challenging, and some conclusions appear overstated.

      We have made changes in the discussion to try to more clearly relate our in vitro observations and conclusions with the cellular context where branch renucleation could have a strong impact on the architecture and mechanics of actin networks.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Please find enclosed the review of the manuscript "Inorganic phosphate in Arp2/3 complex acts as a rapid switch for the stability of actin filament branches" by Xiao et al.

      The authors provide a detailed investigation of how the nucleotide bound to the Arp2/3 complex affects branch stability under flow force. From a kinetic perspective, this is an elegant study with generally high-quality data, although some conclusions rest on assumptions rather than direct experimental evidence.

      A key question concerns the physiological relevance of these findings. For instance, the concept of branch regrowth may not be applicable in cellular contexts, since forces by actin polymerization would displace existing branches away from sites where they generate this active forces. The authors should clarify the relevance of regrowth during active force generation by branched networks.

      Additionally, all experiments employ flow conditions that branches would probably not experience in cells-notably, the flow direction in the cellular context would be reversed. Altering the flow direction relative to the branches could affect not only the relationship between flow rate and branch stability, but potentially other system properties as well.

      Significance

      Furthermore, the study appears not to account for the mother filament (particularly its nucleotide state) or the actin subunit bound to the Arp2/3 complex. The authors should discuss why their interpretation focuses exclusively on the Arp2/3 complex rather than on the actin filaments or Arp2/3-bound actin subunit.

      An important concern involves the use of KPi (inorganic phosphate). Based our experience, KPi appears to have effects beyond simply impacting nucleotide state-actin filaments seem to assemble differently in the presence of KPi. The authors should exercise caution in their interpretation of KPi-based experiments.

      Overall, while the technical quality and kinetic analyses are state-of-the-art, relating this work to physiological contexts remains challenging, and some conclusions appear overstated.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Xiao et al examine the molecular events occurring when Arp2/3 complex-mediated actin filament branches are removed from mother actin filaments. They do this using microfluidics assay with purified proteins combined with single filament TIRF imaging of branched actin filaments with distinct fluorescent labels. The contribution of different nucleotide states of Arp2/3 complex are tested in conjunction with the relationship force exerted on the branches and regulatory protein involvement from GMF and cortactin. The data seem comprehensive and highly quantified in response to concentration, force, fraction of branches and survival times and branching rates. They find that ADP-BeFx and high phosphate concentrations (leading to the ADP-Pi state) leads to a slower debranching rate at a given level of force applied. The ability to rapidly switch the buffer gives powerful information about response times of debranching compared with other actin remodelling events. They use renucleation experiments to determine that the previous debranching event most often occurs at the Arp2/3 complex/daughter interface, showing that filaments will be ready to re-branch in the stable ADP-Pi bound state. GMF addition allows debranching of the ADP state to occur at a lower force. Cortactin acts similarly to the ADP-Pi state to increase branch stability.

      Specific comments

      The pulling force on the branches seems to arise from different flow rates in the microfluidics. Viscous drag is mentioned and I can see there is methylcellulose in the buffer. It would be helpful to have the explanation of the conversion between flow and force, even if it has been standard in previous work.

      Pg 5 - what was the motivation to titrate phosphate? It seems a stretch that intracellular Pi levels are tuning branching inside cells more than protein-mediated control (GMF or cortactin) - can the authors evidence this at all?

      Minor comments

      • In the introduction, while the structural and mutagenesis evidence is clearly stated, in other cases a bit more detail would be helpful e.g. 'biochemical studies', which referred measurement of hydrolysis rates using radiolabelling
      • Page 3 Figures shouldn't be referenced in the introduction
      • Page 3 slip bond behaviour needs explanation
      • Figure 1B seems to be a theoretical schematic which is superfluous
      • Figure 4D is helpful, different weight lines might help even more to explain the dominant pathways

      Referee cross-commenting

      Rev1 sounds like the specialist here. I can't comment on their requests. Some similar points arise between the reviewers which need addressing.

      Significance

      Taking a look at references 16 and 19, I do not find it clear what is achieved differently in the current work compared to these papers and what agrees and what disagrees. If it's a species difference I might expect the two species would be analysed side-by-side in this paper.

      I'm not understanding how the authors can distinguish effects of adding phosphate and BeFx on Arp 2 and 3 compared to effects on actin. Importantly, are possible accompanying changes in the actin filament a confounding factor?

      The paper is quite specialist to read and the advance appears to be incremental. My expertise is in molecular pathways to actin regulation outside the main area of the paper.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study investigates the mechanochemistry of Arp2/3-mediated branched actin networks at the level of individual branch junctions under load. Using microfluidic single-filament/branch force assays (including constant-force flow and open-chamber imaging) the authors quantify debranching, re‑nucleation, and mother- vs daughter‑interface stability across nucleotide states of Arp2/3 (ADP-Pi, ADP, and an ADP-BeFx proxy for ADP-Pi). They further test effects by two branch regulators (GMF and cortactin). Key findings include: (i) ADP-Pi and ADP complexes share similar force dependence but differ markedly (~20×) in intrinsic dissociation rate; (ii) phosphate turnover on the Arp2/3 complex is rapid ii) affinity for Pi drops when Arp2/3 loses its daughter filament; (iii) quantification from model fits uncovers large stability differences between daughter and mother interfaces of the Arp2/3 complex; (iv) extraordinary high stability of ADP-Pi-like Arp2/3 on the mother filament; and (v) distinct effects of GMF and cortactin on force‑dependent stability. Overall, the work combines technically demanding measurements with mechanistic modeling to probe how nucleotide state and regulatory factors tune branch mechanics.

      Major comments:

      1. Low force kinetics and completeness of survival curves (Figure 1). "For all forces, the surviving curves exhibited a clear single exponential behavior...." While the data can be fitted to monoexponential decay curves, data at low forces is clearly incomplete. >90% of branches have not dissociated by the end of the experiment. For the particular data shown in 1C (F00nN, n=60 total branches) it means that the time information is coming from <6 observations, which is rather low for the single molecule field. I am slightly worried by this point, since the debranching rates under ADP-Pi conditions at zero force, are even by one magnitude slower. Yet, no raw data is shown. Given that the dissociation rate at low forces is a contentious point, the authors should show the raw data and the corresponding fits. At present, they only show an experimental scheme and images for these "open chamber" assay (Fig S2). Ideally, they would image for much longer than 900s with lower sampling time in those assays, to firmly establish that 20-fold difference also holds at 0 force.

      Essential; experiment might already be performed. Otherwise straightforward to do (weeks time).

      1. Stability Analysis (Figure 4). I can follow much of the arguments presented in the stability analysis of the daughter vs mother interfaces, which is in principle extremely interesting! However, there are some concerns here:

      i) The authors emphasize the zero force ratio derived from fits (which is linked to the stability difference of the two interfaces in the absence of force) despite this being only weakly constrained by data. Intuitively in the model, the stability difference should grow to very large values as the re-nucleation ratio approaches 1 at low force. This combined with the noise in the data poses an issue in my opinion. Looking at the data and the error margin, I think that the authors cannot state with high confidence that there is a real difference between the relative stability of the daughter and mother interfaces between the two nucleotide states of the complex.

      Essential; analysis and textual revision only

      ii) For ADP-Pi, the renucleation ratio essentially remains flat over the measured force range. Hence, the data can only provide little leverage to estimate both the zero force ratio and, more importantly, the differential distance to the transition state in the slip-bond model in my opinion, which will show in the crossover force. Consequently, the quoted ">100×" stability difference at F=0 and the crossover force >20pN are driven largely by extrapolation rather than direct constraint by data. Given the high number of free parameters in the model, I would anticipate that several crossover forces and differential distances might explain the data nearly equally well. Instead of loosely reporting exact number from fits, I would have hoped for some sort of sensitivity analysis, for instance relying on profile likelihoods. Also parameter values could be reported as bounds (e.g crossover force≫measured range) rather than precise point estimates. This issue re-occurs (albeit not as drastically) for the cortactin experiments (Figure 6).

      Essential; analysis and textual revision only

      iii) One important expectation from the "two slip bond" model is that branch dissociation rates should not necessarily scale mono-exponentially as they mostly do over the accessible force range of the paper. However, once the "minor" pathway of dissociation from the mother starts to dominate at high forces, rates become more force sensitive. This is nicely recaptured by the model fits in Figure S6 but deserves some explanation in the text. Otherwise, people will simply remember the "ADP-Pi is 20-fold more stable than ADP at all forces" message.

      Essential; textual revision only

      iv) One important prerequisite for the model is that isolated Arp2/3 complexes (without a daughter filament) should dissociate with equal rates from mother filaments at all flow rates. Since the Arp2/3 complex prefers mother filament curvature, forces experienced by the mother might change its off-rate. It would be good to refer to this assumption in the text and experimentally verify it. I could not find it in the paper nor in Ghasemi et al 2024.

      Essential; simple experiment (a weeks time).

      v) The force dependence of the branch re-nucleation rate (Fig 3D) has been measured previously by the same group (Ghasemi et al). While the data in the older paper has not been fitted by a model, the trend of the data in the previous paper looks conspicuously different. Are there any explanations for this? I speculate that it might be related to actin and ATP not being saturated (low-force re-nucleation rate rarely exceeds 80%) in Ghasemi et al., but it would be good to know what the authors think about this.

      Essential; textual revision only 3. Stability of the authentic ADP-Pi-Arp2/3 complex on the mother filament. The extraordinary stability of the isolated ADP-BeFx-Arp2/3 complex on mother filaments is surprising, especially considering that both ATP and ADP states are much more labile (Ghasemi et al 2024). I would recommend repeating this experiment in the authentic ADP-Pi state with labelled Arp2/3 complexes as a more direct readout, even if this would require working with very high phosphate concentrations.

      Essential; simple experiment (a weeks time).

      OPTIONAL: Further, but beyond the scope of the present paper, would be titrating phosphate in these experiments, which would even allow the authors to independently verify the reduced Pi affinity for Arp2/3 in the mother filament. Of note, this affinity difference is needed to satisfy detailed balance in the reaction scheme (Fig 4 D)! 4. Importance of "surviving" ADP-Pi-Arp2/3 complexes. The authors show a) rapid turnover of Pi on the ADP-Arp2/3 complex in both branch- or mother filament-bound state and b) the lowered Pi affinity of the latter. Nonetheless, they emphasize the importance of long-lived "surviving" ADP-Pi bound complexes on the mother (even stated in the abstract). I understand that this fraction shows under some experimental conditions (BeFx), but unless I am missing something, most complexes should rapidly lose their phosphate and either exchange nucleotide or dissociate from the mother under physiological conditions. Please clarify or tone done.

      Essential; textual revision only 5. GMF mechanism. The authors claim that GMF "...accelerates the departure of the surviving Arp2/3 complex from the mother...". I assume that they infer this from decrease in the re-nucleation ratio. However, alternatively GMF could simply dwell on the complex, inhibiting re-nucleation without promoting dissociation from the mother. The authors should either monitor Arp2/3 dwell times directly to discriminate between these possibilities or be more cautious in their conclusions.

      Essential; simple experiment (a weeks time) or textual revision. 6. Cortactin mechanism and the "leash model". I must say that the cortactin data are the most puzzling part of the paper and had to reconcile with what we know from structure. I was hoping to find some of this resolved in the discussion. However, I do not understand the "leash model" in the discussion section for cortactin-mediated branch stabilization: "This would explain the observed increase in branch survival compared to the absence of cortactin. As the pulling force is increased, this rebinding mechanism becomes less efficient." According to my understanding of the data, this is opposite to what happens. Cortactin only stabilizes the labile interface at elevated forces! Some re-writing might help here.

      Essential; textual revision.

      Minor comments

      Organization:

      • I do not want to impose on how to best tell the story, but I felt that Fig1 A-D and Fig 2 A-B belong to one logical unit (nucleotide dependence), whereas Fig 1 E-F and Fig 2 C belong to the other (Pi binding and exchange). Perhaps consider re-organizing to streamline presentation?

      Semantics/Typos:

      • Abstract: „... ADP-Pi and ADP-Arp2/3 detach with the same exponential increase as a function of force...". Increase should refer to the dissociation rate, which should be added to the sentence.
      • Results page 8: "...and the majority of Arp2/3 complexes detach from the mother filament while remaining bound to the branch at the debranching time." "Branch" should likely be daughter here, as there is no branch after dissociation of either interface.
      • Results page 13: "Exposing ADP-BeFx-Arp2/3 complex branch junctions to a saturating amount of GMF...". It is strange to imply saturation, because GMF likely simply does not bind to the complex in this nucleotide state with appreciable affinity. Suggest to change to "high".
      • Discussion page 18: "Moreover, in mammalian Arp2/3, His80 in Arp3 (corresponding to His73 in mammalian actin) is not methylated, and corresponds to residue N77 in Arp3, which is also not modified." N77 likely belongs to Arp2?
      • Discussion page 19: "We showed that Pi affinity for Arp2/3 complexes at branch junctions is around 3.7 mM (Fig. 1), a value which lies within the reported 1-10 mM Pi concentration measured in the cytosol in different mammalian cell types". Notably, this is not too different from F-actin, which should be mentioned. By this measure alone, free inorganic phosphate could also directly regulate actin filament stability!

      Future interest (non essential):

      • It would be utterly exciting (but beyond current scope) to quantify how instantaneous debranching rates evolve for naturally aging branches starting from ATP-Arp2/3 complexes!

      Significance

      General assessment:

      This is a compelling and carefully executed study that delivers a clear mechanistic framework for how Arp2/3 branch junctions fail and re‑form under load. The central strength is the tight integration of state‑of‑the‑art reconstitutions with careful and original kinetic analysis. The experimental design is elegant and experiments have been carried out to a masterful standard. The figures are clear, the statistics are appropriate with some exceptions as detailed above. There are very few labs in the world that could have achieved this feat!

      A few aspects could be further strengthened, most notably the explanation and application of the "two slip bond" model as well as slightly more restraint in speculating around specific regulatory mechanisms. However, these are minor refinements that do not detract from the important contributions of the paper.

      Overall, the clearly work merits publication with high priority after revision; most requested changes are textual/analytical with very few targeted experiments, which would substantially strengthen core claims.

      Advance relative to prior literature:

      The major novel findings of the paper are already summarized above. There is some recent work done on the subject of branch mechanics by the authors (Ghasemi et al 2024, PMID: 38277459) and others (Pandit et al 2020 PMID: 32461373), but the focus of the present work is clearly unique and the there is plenty of novel insight.

      Audience and impact:

      Primary audience: specialists in cytoskeleton dynamics, in vitro reconstitution single molecule biophysics, and mechanobiochemistry. Secondary: researchers in cell motility, morphogenesis and mechanobiology, physicists working on active matter and modelers studying force producing and load-bearing biopolymer networks. The results and analysis framework should inform quantitative models of branched network turnover under load and the interpretation of regulatory factor action in vivo and in cells.

      Reviewer expertise:

      Actin dynamics; biochemical reconstitution; single molecule approaches; biophysics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the three reviewers for their thoughtful and constructive comments which help us to improve the manuscript. Please find our responses below. * *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary This study investigates how altered expression of cleavage and polyadenylation (CPA) factors affects alternative polyadenylation (APA), transcription termination and cellular phenotypes in colorectal cancer (CRC) cell lines. The authors combine genetic perturbations of CPA factors with chemical inhibition of CPSF73 and assess effects on clonogenic potential, transcription-replication conflicts, APA profiles, and transcription termination-associated RNAPII phosphorylation patterns. The main comparisons are performed between healthy (1CT), primary tumour (SW480, HCT116) and metastatic (SW620) cell lines, which are reported to contain altered expression levels of CPA factors. The data suggest differential dependence on CPA factors between primary tumour-derived and metastatic CRC cell lines, as well as changes in transcription termination patterns. The data are overall well-presented with clear figures. However, in several cases the strength of the conclusions appears to exceed the support provided by the data, and alternative interpretations should be considered.

      Major comments 1. Clonogenic sensitivity to CPA factor perturbation and comparability of clonogenic assays between cell lines: -The data indicate that clonogenic potential in SW480 is strongly dependent on CPSF73 and PCF11, whereas SW620 appear less sensitive. However, the interpretation is complicated by differences in depletion efficiencies. In SW620 cells, PCF11 depletion appears inefficient, and protein levels remain higher than in siLUC-treated SW480 cells (Fig. 1D and S1C; also in comparison to 1CT by inference of Fig. 1C). Thus, the apparent resistance of SW620 cells could reflect insufficient depletion rather than true biological tolerance. The effectiveness of siCPSF73 treatments is difficult to assess from the presented data. Quantification of protein knockdown levels should be provided and incorporated into the interpretation.

      -In Fig. 1D, 1E, and S1D, colony formation of DMSO- or siLUC-treated SW620 and SW480 cells differs markedly in absolute terms. However, the graphs are normalized separately for each cell line, which obscures this difference. This raises two concerns: First, the baseline clonogenic capacity differs between the lines and should be discussed. Second, it is unclear whether direct comparisons between cell lines are valid when normalization is performed independently. For example, in absolute terms, 1 µM JTE-607 appears to have a similar effect in SW620 cells as 5 µM in SW480 cells, which would contradict the conclusion that metastatic cells are more tolerant to CPA perturbations. This issue should be explicitly addressed.

      We thank the reviewer for those thoughtful comments.

      a) Assessing the biological meaning of differences in PCF11 depletion efficiency between SW480 and SW620 cell lines is inherently tricky, because the two cell lines differ 3-fold in their baseline PCF11 level (Fig. 1C). Even with equal efficiencies of knock-down, the number of PCF11 molecules per cell left after the treatment will differ. We haven't mentioned this in our original manuscript but will highlight this issue in the revised version - as we agree it is an important consideration for the interpretation of the results.

      b) As requested, we will add quantification of western blots from 3 biological replicates to the revised manuscript, to demonstrate the depletion efficiencies. We agree that the single western blot presented by itself was not sufficient; the efficiency of SW620 knock-down is not lower compared to SW480.

      c) The baseline clonogenic capacity of SW480 and SW620 has been previously calculated and compared in two publications (PMID: 31961892 and 29796953). In both cases, the SW620 cells showed higher clonogenic potential than SW480, which was calculated based on the number of clones containing more than 50 cells.

      d) The reason behind normalization of our data to a control sample is the difference in cell size between the cell lines, which prohibits their direct comparison.

      For the colony formation assays, we seeded the same number of cells and cultured them for the same amount of time. However, the difference in cell size, leads to a huge difference in colony sizes (Figure 1D), therefore it was not possible to set the same parameters for counting colonies of SW480 and SW620 cells. Therefore, we decided to use an approach frequently used in high profile cancer studies (e.g. Li at al., 2023, PMID: 37620362, Waterhouse et al., 2025, PMID: 40328966, Yang et al., 2026, PMID: 41484364) and normalize each biological replicate to the control sample to analyze the response to the treatment only.

      e) During revision, we might additionally perform CellTiter 96® Non-Radioactive Cell Proliferation Assay (MTT) to test how another cancerous characteristic of SW480 and SW620 cells are affected by JTE-607.

      f) We will also perform colony formation and/or MTT assays for 3 additional cell lines: HCT116 (primary tumor-derived) and T87 and COLO-205 (metastasis-derived, which we are currently in the process of obtaining) to assess their sensitivity to JTE-607.

      g) The result of higher sensitivity of SW620 cells compared to SW480 cells has been obtained not only for PCF11 knock-down, where inter-cell line differences of baseline protein level make interpretations more difficult, but also for CPSF73 knock-down (Fig. 1D), which baseline level was similar and knock-down was equally efficient in both cell lines, and for CPSF73 inhibition (Fig. 1E); with the use of normalization procedures used frequently in literature (see point d).

      Therefore, we argue that our conclusion that SW480 cells are more sensitive than SW620 to the abrogation of 3' pre-mRNA cleavage and transcription termination is valid. However, we are willing to weaken our conclusion if the reviewer does not agree with our point of view.

      For the additional cancer-specific experiments proposed above, we suggest the usage of JTE-607 as drug treatment is more robust, reproducible, and medically relevant compared to knock-down experiments.

      1. Interpretation of transcription termination markers: -The study uses RNAPII T4ph as a marker of transcription termination, which is well justified based on the ref. [30], but still the mechanistic basis of this modification is not fully understood. Changes in T4ph localization are interpreted as consequences of CPA activity, but possible differences in kinase or phosphatase activities between cell lines are not considered that could affect the T4ph levels or localization. Therefore, conclusions based solely on T4ph redistribution should be presented with greater caution, and alternative explanations should be acknowledged.

      While in our experience RNAPII T4ph is the most sensitive and useful termination marker, we agree with the referee that its metabolism and function is insufficiently understood - this is an important and interesting direction for future investigation.

      In order to increase the robustness of our study, during revision we will additionally perform nascent transcriptomics on SW480 and SW620 using a different method, POINT-seq. POINT-seq in contrast to T4ph mNET-seq relies neither on RNAPII modification status nor is affected by pausing. We will also probe global T4ph-RNAPII levels in our cellular model by western blot. We will then adjust our manuscript accordingly.

      -Line 240 states that premature termination is increased in primary tumour cells. However, the data show increased T4ph signal (Fig. 4B) but no change in total RNAPII occupancy in gene bodies (Fig. 4A). This does not directly demonstrate increased termination. Additional evidence or a more cautious interpretation would be appropriate.

      The reviewer is right in pointing out the difference between the Total-RNAPII and T4ph-RNAPII signals across the gene body. We will provide a clearer description and explanation in the revised manuscript.

      T4ph-RNAPII is present at low levels in human cells. S2ph and S5ph are the dominant modifications, accounting for ~75% of phospho-counts, whereas T4ph has a relative abundance of ~15% (PMID: 26799765). In addition, T4ph is concentrated at gene ends and typically very low in the gene body (PMID: 28017589, 30819644, doi: 10.1101/2025.07.14.664659). Consequently, it is very easy to spot its gene-body increase in metagene analysis (Figure 4B), even when it happens only on a subset of genes in cancer samples (e.g. Fig. 4D).

      Total-RNAPII signal in the gene body largely reflects S2ph-modified RNAPII levels so its metagene analysis is not sufficiently sensitive to detect differences in gene-body T4ph-RNAPII.

      Consequently, RNAPII-T4ph and RNAPII-total mNET-seq show distinct metagene patterns and different responses to termination changes. RNAPII-T4ph mNET-seq is a sensitive method to detect changes in termination patterns, while total-RNAPII is much less specific and sensitive with respect to transcription termination.

      1. Cleavage-termination distance as a predictor of transcript levels: -Figure 5A presents median distances across all genes. It would be informative to perform a gene-wise comparison between cell lines (difference in cleavage-T4ph distance for the same gene, e.g. in 1CT vs. HCT116, individual differences plotted across all genes). This analysis could help clarify how frequently individual genes experience the effect (shortening of the cleavage-T4ph distance between 1CT and tumour cells) that is observed globally.

      Thank you for this valuable suggestion. We have performed the gene-wise comparison which is indeed very informative. Firstly, we observed the same trend as for all active protein-coding genes - shorter distance in all CRC cell lines compared to 1CT cells with the lowest values of the cleavage-termination distance in the primary tumor cells. Secondly, and even more importantly, this analysis additionally shows that the shortening effect is global - only a small percentage of genes do not undergo shortening of the cleavage-T4ph distance between 1CT and tumor cells.

      We will incorporate the results of this analysis into the figures of the revised manuscript.

      -The manuscript claims that proximity between pre-mRNA 3′-end cleavage and transcription termination predicts increased nuclear transcript levels. However, the correlation coefficients are small (Spearman r ~ -0.2 at most), indicating weak predictive power. Therefore, the use of the term "predicts," especially in the manuscript title, appears to overstate the strength of the relationship. The authors should either moderate this claim or provide additional analysis to support stronger predictive value.

      We agree with the reviewer that the term "predicts" is not ideal in this context and are happy to substitute "is associated with". The title would then read: "Proximity of pre-mRNA 3′ end processing and transcription termination is associated with enhanced gene expression".

      Minor comments -Figures 1B and S1A: The discontinuous y-axis makes it difficult to assess relative protein level differences between normal and cancer samples. Statistical testing should be included to evaluate significance.

      We had decided against statistical testing due to the problems with biological interpretation of such analyses and its limitations for proteins present in the cell at low levels and/or highly variable between samples. PCF11 is such protein. It is an order of magnitude less abundant compared to other RNA 3' processing factors, and its levels are variable as shown in our Fig. 1B (re-analyzed proteomics data from Wiśniewski et al., 2015). Therefore, the increase in PCF11 levels in this dataset is not statistically significant in Mann-Whitney test, while it is significant for CPSF73.

      The variability of PCF11 levels can be also observed in the Clinical Proteomic Tumor Analysis Consortium (CPTAC) data in the Human Protein Atlas (while no absolute quantification was performed there).

      In two independently obtained proteomics patient datasets (Wiśniewski et al., 2015; CPTAC), as well as in Western blot assays from our cell culture model, an increase in PCF11 protein abundance is observed in cancer cells. This consistency across different datasets and our model holds greater biological relevance than the statistical analysis of highly varied samples. Nevertheless, if the reviewer requires statistics, we will include them in the revised manuscript.

      The discontinuous y-axis was applied due to broad range of protein molecules. Presentation of data with linear continuous scale did not allow to present the difference between normal and cancer samples for all the proteins on the same graph.

      Alternatively, if the reviewer and editor prefer, we are happy to present the data with log10 transformed scale. The disadvantage of log-scale is that the differences between normal and cancer samples are less obvious to the eye, the advantage is a continuous y-axis.

      -Lines 217-218: The text should emphasize that nuclear RNA abundance may not reflect cytoplasmic mRNA levels, particularly when APA alters 3′UTRs and may affect mRNA stability.

      We agree and will incorporate the reviewer's suggestion in our revised manuscript.

      -Lines 261-264: The cleavage-termination distance metric should be more clearly defined as the distance between the polyadenylation site and the T4ph signal peak.

      We plan to incorporate a drawing into the figure, to better explain our cleavage-termination definition.

      We also performed the cleavage site to T4ph signal peak (highest signal in the termination window) distance calculations, and they show the same trend as our original method (Figure 5A), with no changes to the conclusions we made. We will incorporate this additional analysis into a supplementary figure.

      **Referees cross-commenting**

      Reviewer #3:

      On the contrary to implied in the reviewer report, this manuscript does not report the effects of CPSF73 inhibitor JTE-607 on APA. On this note, as the authors discuss uncoupling of cleavage and transcription termination, they could consider (this is not a request) testing how the cleavage inhibitor JTE-607 impacts the distribution of transcription termination marker T4ph, and whether the effects would be different in different cell lines where the coupling appears to be different. This could give mechanistic insights into the sources of the differences between cell lines.

      In order to get a mechanistic idea why shorter cleavage-termination distance is associated with higher gene expression, we plan to test the cleavage efficiency on genes, which show differences in cleavage-termination distance and expression levels, between SW480 and SW620 cell lines. To this end, we will perform POINT-seq, checking differences between those cell lines in control conditions and with JTE-607. We believe that this new experimental approach will provide a deeper mechanistic insight, compared to performing further correlation analyses repeating the same experiment types.

      Reviewer #1 (Significance (Required)):

      This study addresses an important question in RNA biology and cancer research: how altered expression or pharmacological targeting of CPA factors affects alternative polyadenylation, transcription termination, and cellular phenotypes in CRC models. This topic is timely, as CPSF73 has been proposed as a therapeutic target, making it important to understand the molecular and cellular consequences of modulating CPA factor activities. A key strength and robust finding of the study is the identification of unexpected relationships between pre-mRNA 3′-end processing and transcription termination during CRC progression. Notably, the authors report that changes in alternative polyadenylation and transcription termination appear to be uncoupled and may even occur in opposite directions. This challenges simplified models in which these processes are tightly coordinated and suggests that their (mis)regulation in cancer cells may be more complex than previously appreciated. Secondly, the study provides an interesting observation that gene-specific changes in cleavage-T4ph distance correlate negatively with changes in nuclear levels of processed transcripts. This suggests a potential relationship between the spatial coupling of 3′-end processing and transcription termination and transcript abundance. If validated mechanistically, this could represent a conceptual advance in understanding how transcription termination dynamics influence gene expression outputs. However, the observed correlations are relatively weak, and the mechanistic basis of this relationship remains unclear. As such, this advance is primarily descriptive at this stage.

      As indicated in response to the cross-commenting point above, one possible mechanistic explanation why shorter cleavage-termination distance could be associated with higher gene expression, is increased cleavage efficiency when the cleavage-termination distance is short. To test this hypothesis, we will perform POINT-seq on SW480 and SW620 cell lines, in control and CPSF73 inhibition conditions. We have previously demonstrated that POINT-seq technique allows calculation of cleavage efficiencies, and its alterations (doi: 10.1101/2025.07.14.664659).

      So far, our data (Fig. 5F, G) indicates that PCF11 is involved in this process since PCF11 downregulation resulted in lengthening the distance between 3′-end cleavage and RNAPII terminal pausing. This lengthening was in parallel correlated with the decrease of the nuclear RNA levels. However, PCF11 participates in multiple steps of gene expression - pre-mRNA cleavage, alternative polyadenylation, RNAPII pausing, and mRNA export - making the underlying mechanism difficult to pinpoint without additional experiments.

      Importantly, our work provides the first clear evidence that changes in cleavage site usage and termination region usage can become uncoupled. We hope that continued tool development, together with studies like ours, will ultimately enable a full mechanistic understanding.

      Several interpretations of experimental data would benefit from more cautious framing or additional analysis. In particular, the relationship between changes in CPA factor expression levels and sensitivity to the CPSF73 inhibitor JTE-607 across CRC cell lines remains unclear from the presented data.

      During the revision we will explain more clearly the rationale for our interpretation of the data. In cases where more cautious framing would still be needed, we will include alternative interpretations.

      This work will be of interest primarily to basic researchers in RNA processing and transcription regulation, gene expression control, cancer cell biology and pharmacological targeting of RNA-processing machineries.

      Reviewer field of expertise: My expertise is in RNA processing and gene regulation. I do not have specific expertise clinical oncology or cancer biology.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Factors involved in pre-mRNA cleavage and polyadenylation (CPA) are upregulated in many cancers and have been found to be associated with poor prognosis. In their manuscript "Proximity of pre-mRNA 3′ end processing and transcription termination predicts enhanced gene expression", Stepien et al. use colorectal cancer (CRC)-derived cell lines as a model of CPA overexpression to study its biological consequences. To this end, the authors initially confirm increased expression of CPA factors in these cell lines and demonstrate that their knock-down strongly decreases the colony-forming ability of primary tumour-derived CRC cells. They further assess various phenotypes that are expected to depend on CPA activity based on the current knowledge in the field, including poly-A site selection, occurrence of transcription-replication conflicts, and the site of transcription termination. Contrary to expectations, they find a proximal shift in transcription termination to be the most prominent change in CRC ell lines with high CPA levels, despite no clear preference for proximal poly-A site usage in these cells, suggesting an uncoupling of both processes. The authors combine their 3'-end mapping data and T4P-mNET-seq data mapping terminating RNAPII to score cleavage-termination distance at individual genes and find shorter distances to correlate with increased gene expression in the different cell lines. Overall, this is a carefully conducted study, and the claims and conclusions are well supported by the data.

      I have some minor comments: 1. PLA assay to quantify transcription-replication conflicts (Figure 2). The quantified data looks very convincing and is also in good agreement with the proximal shift in transcription termination that is demonstrated later in the paper. However, the PLA channel signal in the microscopy image examples shown in panel A looks very blurry, and it is hard to imagine that one would be able to count # foci based on this. This may just be an issue with the resolution of the image provided. Apparently, there are much less foci in the treated samples shown in panel B - maybe microscopy images for these could be provided as well? Also, since none of the treatments impact the # of TRCs, it would have been nice to include a positive control known to induce TRCs to demonstrate that the assay works (if such a control is known) - this is optional, and I would not ask to repeat the entire experiment just for this additional control (but maybe the authors have done it and the data is already available?).

      We apologize for the low resolution of the picture presented in Figure 2. We were unable to upload high resolution picture file during the first submission, for technical reasons. We will improve it in the revised manuscript.

      The difference in baseline PLA foci between Fig. 2A and 2B reflects a known sensitivity of the PLA assay to cell confluency. As these two experiments were performed at different confluences, direct cross-panel comparison is not appropriate. For this reason, all quantitative comparisons in the manuscript are made strictly within the same plate, the same PLA reaction, and between wells with comparable confluency, which avoids introducing bias from these technical variables. For clarity, we plan to incorporate the above information into the Methods section. To validate assay specificity within each experiment, we confirmed that EdU-positive cells consistently showed higher PLA foci counts than EdU-negative cells from the same wells, demonstrating that quantification reflects genuine PCNA-associated signal above background. With this internal validation in place, each panel's comparisons remain valid and interpretable on their own terms.

      No classical positive control exists for a PolII-pThr4/PCNA PLA interaction, as this is a relatively unexplored proximity event with no established positive control condition. We used single-antibody negative controls to establish assay specificity, although we didn't quantify and show it. We also used EdU-negative cells within the same wells as an internal background baseline, ensuring that measured foci reflect genuine signal above background. As a proxy for positive controls, we relied on the detection of changes in PLA foci number between the tested conditions, such as the effect of 4h XRN2 degradation. Also, the consistency of biological replicates and the differences between cell lines made us quite secure we were detecting reproducible and biologically relevant differences.

      1. Figure 2A-C: please include information on number of cells quantified

      We will incorporate this information into the revised manuscript.

      1. Figure 2C: In the label, please include degron, e.g. HCT116 CPSF73-AID rather than just HCT116

      We will modify the label according to the reviewer's suggestion.

      1. Figure 5C: When quantifying nascent txn based on mNET-seq, to which extent would one expect terminally paused RNAPII along the gene body (premature termination events) to contribute to the increased signal? That is, could an increase in stalling be mistaken for an increase in transcription? Based on the metagene plot in Fig 2A it doesn't look like it, but the authors may be able to estimate the effect (if any) from their data.

      We thank the reviewer for pointing this out.

      As reviewer #1 observed, and we comment above (Rev.1 point 2b), the increase of premature termination events in cancer cells, which can be readily detected by RNAPII T4ph mNET-seq increase in the gene body, does not globally perturb total RNAPII mNET-seq profiles (see metagenes in figure 4A and 4B).

      Nevertheless, mNET-seq method does indeed detect both nascent transcription levels and RNAPII pausing, which is particularly relevant when wanting to make conclusions on a single gene level. In order to increase the robustness of our study and make stronger conclusions about nascent transcription rates, independent of stalling, during revision we will perform POINT-seq experiments in SW480 and SW620 cells. That method, in contrast to mNET-seq, is not pausing sensitive.

      Reviewer #2 (Significance (Required)):

      The observed uncoupling of poly-A site selection and size of termination window is unexpected and raises important questions on how these coupled processes can be regulated independently.

      Strengths of the study: i) Parallel assessment of different CRC-based cell lines provides evidence of phenotype stability across patients. ii) Brings together strong technical expertise combining different state-of-the-art methodologies to map and correlate poly-A site usage, site of transcription termination, and levels of nascent transcription within the same cell lines under the same conditions, providing a comprehensive dataset.

      Limitations: i) For the time being, observation limited to CRC cell lines.

      While this is the first time that we are able to show the pre-mRNA 3' cleavage and transcription termination uncoupling so clearly, we have previously reported findings in other cell types which pointed to this direction. We found in HeLa cells (PMID: 30819644) that genes preferentially using distal polyadenylation sites exhibit more proximal RNAPII terminal pausing compared to genes that predominantly use proximal polyadenylation sites. Recently, we also found in U2OS cells after SETD2 KO and renal cell carcinoma cell lines with SETD2 mutation, that readthrough transcription occurs independently of APA (doi: 10.1101/2025.07.14.664659). This phenomenon could be frequent, but it has not been investigated until now, as cleavage and termination were usually studied separately.

      In terms of the correlation between cleavage-termination distance and expression levels, in our study so far, we found it in CRC (HCT116, SW480, SW620) and cervical cancer (HeLa) cell lines. During revision we plan to test it additionally in pancreatic cell lines, with high sensitivity to JTE-607 treatment (BxPC3), medium (Panc1), and low sensitivity (MiaPaCa2).

      ii) Mechanism behind proximal shift of termination to be determined.

      We agree with the reviewer that the mechanism underlying the proximal transcription termination is missing. Our unpublished data show correlation between RNAPII pausing and transcription termination factors occupancy on chromatin. However, since more factors are involved, such as elongation speed and chromatin architecture, resolving the mechanism requires further extensive studies.

      I expect this work to be of interest to an audience interested in transcription and regulation of gene expression more broadly, with potential translational relevance for cancer therapy.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Stępień et al. aims to investigate the roles of pre-mRNA 3′ end processing and transcription termination using colorectal cancer (CRC) cell lines (normal colon epithelial cells: 1CT; primary tumours: HCT116 and SW480; metastatic tumour: SW620). By using publicly available proteomic datasets and their cellular models, the authors first demonstrate elevated expression of several cleavage and polyadenylation (CPA) and termination factors, including CPSF73 and PCF11, in CRC cells. They further assess the functional relevance of CPSF73 and PCF11, showing that siRNA-mediated knockdown of these factors reduces colony formation, particularly in primary cancer cell lines. However, they do not observe a clear association between CPA/termination factors and transcription-replication collisions (TRCs), suggesting that TRCs may not underlie the altered colony formation phenotype.

      The authors also examine alternative polyadenylation (APA) using the CPSF73 inhibitor JTE-607 and report complex APA patterns: primary tumour cell lines display a bias toward distal APA site usage, whereas the metastatic SW620 line preferentially uses proximal sites. They further evaluate transcription termination and observe a proximal shift (early termination) in primary CRC cells, and to a lesser extent in SW620 cells. Noting the apparent discrepancy between APA site shifting and proximal termination, the authors introduce a new metric termed cleavage-termination distance, defined as the distance between the coordinates of the major PAS and RNAPII termination site. They report an association between shortening of cleavage-termination distance and increased gene expression, which may contribute to the upregulation of cancer-related genes.

      Overall, this is a well-written manuscript that highlights potential roles of pre-mRNA processing and transcription termination in gene expression control, with implications for cancer biology. Nevertheless, several issues should be addressed to strengthen the study:

      Overall, this is a well-written manuscript that reveals potential roles of pre-mRNA processing and transcription termination in gene expression control, with implications for cancer biology. Nevertheless, I have a few comments that may help strengthen the study.

      Major comments: 1. The study includes two primary tumour cell lines but only one metastatic cell line (SW620), which is derived from the same patient as SW480. It remains unclear whether the observed effects represent general characteristics of metastatic tumour cells or are specific to this particular cell line.

      Our primary workhorse in this study are the cell lines SW480 and SW620, which are derived from the same patient, to avoid the confounding variable of genetic diversity between cell lines. Unfortunately, these are the only paired CRC cell lines currently available in cell banks.

      We would not want to perform further (expensive and time-consuming) genomic assays on additional CRC metastatic cell lines since the cell lines available were isolated from other types of metastasis (liver or lung, while SW620 comes from lymph node) and other patients - which would make interpreting any results obtained with them difficult. However, we plan to check the sensitivity of one more primary (HCT116) and two more metastatic (T87 and COLO-205) cancer cell lines to JTE-607 treatment in colony formation or MTT assay to find out whether the differences in CRC cell sensitivity are more cancer-stage or patient specific.

      Further on, we plan to check whether our finding of alterations in cleavage-termination distance might have clinically relevant prognostic value, even outside of the context of CRC. To this end, we will test the hypothesis that a short cleavage-termination distance could be a prognostic marker for sensitivity of cells to JTE-607 treatment. It has been previously demonstrated that pancreatic cancer (PC) cell lines differ in sensitivity to JTE-607 (PMID: 38191171). We will perform T4ph-mNET-seq and nuclear 3'mRNA-seq experiments on PC cell lines to check the cleavage-termination distance in JTE-607-sensitive (BxPC3), medium sensitive (Panc1) and least JTE-607 sensitive (MiaPaCa2) cells, and for presence or absence of correlation of this distance with the cell sensitivity to JTE-607.

      The rationale for focusing on colorectal cancer in this study requires further clarification. Although the Introduction provides a comprehensive review of CPSF73 and PCF11 in other cancer types, evidence specific to colorectal cancer is limited. Are these factors known to be mutated or dysregulated in CRC? Is their expression associated with patient survival? The authors could strengthen their rationale by performing a basic analysis using publicly available datasets (e.g., TCGA), such as evaluating expression levels in tumour versus normal tissue and generating Kaplan-Meier survival curves.

      We will respond to these questions in the revision.

      1. In Figure 5 and Supplementary Figure 5, the authors analyse cleavage-termination distance across oncogenes and tumour suppressor genes and observe a negative correlation between cleavage-termination distance and gene expression level. This is an interesting finding and suggests a possible mechanism for enhancing expression of cancer-related genes. It would be valuable to extend this analysis more systematically-for example, by stratifying genes based on cleavage-termination distance and performing gene ontology enrichment analysis / GSEA to identify functional categories enriched among genes with shorter or longer distances. The authors could further relate these gene sets to, for example, distinct phenotypes between primary vs metastatic tumours.

      This is an excellent suggestion. We will perform the above analyses carefully during the revision. Our initial analysis done upon receiving the reviews suggests that the genes, whose cleavage-termination distance decreases during tumorigenesis, while gene expression increases, are enriched for RNA processing, DNA damage response, chromatin organization and ribosome biogenesis factors. On the other hand, increased cleavage-termination distance and decreased gene expression are mostly associated with organelle assembly and protein localization. We will deepen this analysis and discuss the implication to cancer biology in our revised manuscript.

      Minor comments: 4. In Figure 2A, the number of RNAPII-PCNA PLA foci appear comparable between SW480 and SW620, whereas in Figure 2B this seems to be much lower in SW620 compared to SW480. Could the authors clarify this discrepancy?

      The difference in baseline PLA foci between Fig. 2A and 2B reflects a known sensitivity of the PLA assay to cell confluency. As these two experiments were performed at different confluencies, direct cross-panel comparison is not appropriate. For this reason, all quantitative comparisons in the manuscript are made strictly within the same plate, the same PLA reaction, and between wells with comparable confluency, which avoids introducing bias from these technical variables. For clarity, we plan to incorporate the above information into the Methods section. To validate assay specificity within each experiment, we confirmed that EdU-positive cells consistently showed higher PLA foci counts than EdU-negative cells from the same wells, demonstrating that quantification reflects genuine PCNA-associated signal above background. With this internal validation in place, each panel's comparisons remain valid and interpretable on their own terms.

      1. Is the cleavage-termination distance metric influenced by gene length? If so, should this parameter be normalised to gene length to avoid potential bias?

      No, gene length is not a bias in the cleavage-termination distance.

      • We performed correlation analysis and there is no significant correlation between the cleavage-termination distance and gene length, in any of cell line pairs in our model: HCT116 vs 1CT (spearman r=0.001, p=0.945); SW480 vs 1CT (spearman r=0.036, p=0.0654); SW620 vs 1CT (spearman r=-0.018, p=0.325).
      • Additionally, we quantified the decrease in cleavage-termination distance on the very same gene, just in different cell lines. We will incorporate this result into the manuscript.
        1. The data and analysis scripts generated in this study have not yet been made publicly available and therefore cannot be fully evaluated.

      We apologize for this omission. The revised manuscript will contain the link to our publicly available scripts in GitHub and the GEO access.

      **Referees cross-commenting**

      I agree with the reports from both Reviewer #1 and Reviewer #2.

      I would like to thank Reviewer #1 for pointing out my mistaken. The authors did not use JTE-607 to study APA; rather, they studied the differences in APA between cell lines. I apologise for the confusion.

      Reviewer #3 (Significance (Required)):

      General assessment: This study investigates the contribution of pre-mRNA 3′ end processing and transcription termination to colorectal cancer (CRC) biology using a combination of cell line comparisons (primary versus metastatic tumours), chemical, and RNAi perturbations, and bioinformatic analyses.

      The major strengths of the work include: • The use of CRC cell lines representing normal, primary, and metastatic states, including matched primary and metastatic lines derived from the same patient. • A systematic analysis of alternative polyadenylation (APA) and transcription termination, revealing a potential uncoupling between these two closely related processes. • The introduction of a novel quantitative metric-cleavage-termination distance-to examine the relationship between PAS usage and RNAPII termination. • The identification of a negative association between cleavage-termination distance and gene expression, suggesting an additional regulatory layer influencing gene expression.

      However, certain limitations should be considered: • The generalisability of conclusions regarding metastatic CRC is limited by reliance on a single metastatic cell line.

      We believe that the experiments we outlined above in response to Reviewer #3 point 1 will allow us to extend the generalizability of conclusion.

      • The translational relevance of the findings could be further strengthened through patient-level or clinical data analysis.

      We agree with the reviewer. Due to technical limitations, it is not possible to perform nascent transcriptomic experiments on patient material at this time. However, we will attempt to strengthen the translational relevance by additional experiments and analysis as indicated in response to Reviewer #3 points 1-3.

      Advance: The study proposes potentially novel roles for 3′ end cleavage and transcription termination in regulating gene expression in colorectal cancer. In particular, the conceptual distinction between APA site shifting and transcription termination, together with the introduction of the cleavage-termination distance metric, represents a conceptual advance.

      Audience: The work is primarily positioned within basic research. With additional translational context, it may also attract interest from a broader audience.

      Field of expertise: transcriptional regulation and bioinformatics

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Stępień et al. aims to investigate the roles of pre-mRNA 3′ end processing and transcription termination using colorectal cancer (CRC) cell lines (normal colon epithelial cells: 1CT; primary tumours: HCT116 and SW480; metastatic tumour: SW620). By using publicly available proteomic datasets and their cellular models, the authors first demonstrate elevated expression of several cleavage and polyadenylation (CPA) and termination factors, including CPSF73 and PCF11, in CRC cells. They further assess the functional relevance of CPSF73 and PCF11, showing that siRNA-mediated knockdown of these factors reduces colony formation, particularly in primary cancer cell lines. However, they do not observe a clear association between CPA/termination factors and transcription-replication collisions (TRCs), suggesting that TRCs may not underlie the altered colony formation phenotype.

      The authors also examine alternative polyadenylation (APA) using the CPSF73 inhibitor JTE-607 and report complex APA patterns: primary tumour cell lines display a bias toward distal APA site usage, whereas the metastatic SW620 line preferentially uses proximal sites. They further evaluate transcription termination and observe a proximal shift (early termination) in primary CRC cells, and to a lesser extent in SW620 cells. Noting the apparent discrepancy between APA site shifting and proximal termination, the authors introduce a new metric termed cleavage-termination distance, defined as the distance between the coordinates of the major PAS and RNAPII termination site. They report an association between shortening of cleavage-termination distance and increased gene expression, which may contribute to the upregulation of cancer-related genes. Overall, this is a well-written manuscript that highlights potential roles of pre-mRNA processing and transcription termination in gene expression control, with implications for cancer biology. Nevertheless, several issues should be addressed to strengthen the study:

      Overall, this is a well-written manuscript that reveals potential roles of pre-mRNA processing and transcription termination in gene expression control, with implications for cancer biology. Nevertheless, I have a few comments that may help strengthen the study.

      Major comments: 1. The study includes two primary tumour cell lines but only one metastatic cell line (SW620), which is derived from the same patient as SW480. It remains unclear whether the observed effects represent general characteristics of metastatic tumour cells or are specific to this particular cell line. 2. The rationale for focusing on colorectal cancer in this study requires further clarification. Although the Introduction provides a comprehensive review of CPSF73 and PCF11 in other cancer types, evidence specific to colorectal cancer is limited. Are these factors known to be mutated or dysregulated in CRC? Is their expression associated with patient survival? The authors could strengthen their rationale by performing a basic analysis using publicly available datasets (e.g., TCGA), such as evaluating expression levels in tumour versus normal tissue and generating Kaplan-Meier survival curves. 3. In Figure 5 and Supplementary Figure 5, the authors analyse cleavage-termination distance across oncogenes and tumour suppressor genes and observe a negative correlation between cleavage-termination distance and gene expression level. This is an interesting finding and suggests a possible mechanism for enhancing expression of cancer-related genes. It would be valuable to extend this analysis more systematically-for example, by stratifying genes based on cleavage-termination distance and performing gene ontology enrichment analysis / GSEA to identify functional categories enriched among genes with shorter or longer distances. The authors could further relate these gene sets to, for example, distinct phenotypes between primary vs metastatic tumours.

      Minor comments: 4. In Figure 2A, the number of RNAPII-PCNA PLA foci appear comparable between SW480 and SW620, whereas in Figure 2B this seems to be much lower in SW620 compared to SW480. Could the authors clarify this discrepancy? 5. Is the cleavage-termination distance metric influenced by gene length? If so, should this parameter be normalised to gene length to avoid potential bias? 6. The data and analysis scripts generated in this study have not yet been made publicly available and therefore cannot be fully evaluated.

      Referees cross-commenting

      I agree with the reports from both Reviewer #1 and Reviewer #2. I would like to thank Reviewer #1 for pointing out my mistaken. The authors did not use JTE-607 to study APA; rather, they studied the differences in APA between cell lines. I apologise for the confusion.

      Significance

      General assessment:

      This study investigates the contribution of pre-mRNA 3′ end processing and transcription termination to colorectal cancer (CRC) biology using a combination of cell line comparisons (primary versus metastatic tumours), chemical, and RNAi perturbations, and bioinformatic analyses.

      The major strengths of the work include:

      • The use of CRC cell lines representing normal, primary, and metastatic states, including matched primary and metastatic lines derived from the same patient.
      • A systematic analysis of alternative polyadenylation (APA) and transcription termination, revealing a potential uncoupling between these two closely related processes.
      • The introduction of a novel quantitative metric-cleavage-termination distance-to examine the relationship between PAS usage and RNAPII termination.
      • The identification of a negative association between cleavage-termination distance and gene expression, suggesting an additional regulatory layer influencing gene expression. However, certain limitations should be considered:
      • The generalisability of conclusions regarding metastatic CRC is limited by reliance on a single metastatic cell line.
      • The translational relevance of the findings could be further strengthened through patient-level or clinical data analysis.

      Advance:

      The study proposes potentially novel roles for 3′ end cleavage and transcription termination in regulating gene expression in colorectal cancer. In particular, the conceptual distinction between APA site shifting and transcription termination, together with the introduction of the cleavage-termination distance metric, represents a conceptual advance.

      Audience: The work is primarily positioned within basic research. With additional translational context, it may also attract interest from a broader audience.

      Field of expertise: transcriptional regulation and bioinformatics

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Factors involved in pre-mRNA cleavage and polyadenylation (CPA) are upregulated in many cancers and have been found to be associated with poor prognosis. In their manuscript "Proximity of pre-mRNA 3′ end processing and transcription termination predicts enhanced gene expression", Stepien et al. use colorectal cancer (CRC)-derived cell lines as a model of CPA overexpression to study its biological consequences. To this end, the authors initially confirm increased expression of CPA factors in these cell lines and demonstrate that their knock-down strongly decreases the colony-forming ability of primary tumour-derived CRC cells. They further assess various phenotypes that are expected to depend on CPA activity based on the current knowledge in the field, including poly-A site selection, occurrence of transcription-replication conflicts, and the site of transcription termination. Contrary to expectations, they find a proximal shift in transcription termination to be the most prominent change in CRC ell lines with high CPA levels, despite no clear preference for proximal poly-A site usage in these cells, suggesting an uncoupling of both processes. The authors combine their 3'-end mapping data and T4P-mNET-seq data mapping terminating RNAPII to score cleavage-termination distance at individual genes and find shorter distances to correlate with increased gene expression in the different cell lines. Overall, this is a carefully conducted study, and the claims and conclusions are well supported by the data.

      I have some minor comments:

      1. PLA assay to quantify transcription-replication conflicts (Figure 2). The quantified data looks very convincing and is also in good agreement with the proximal shift in transcription termination that is demonstrated later in the paper. However, the PLA channel signal in the microscopy image examples shown in panel A looks very blurry, and it is hard to imagine that one would be able to count # foci based on this. This may just be an issue with the resolution of the image provided. Apparently, there are much less foci in the treated samples shown in panel B - maybe microscopy images for these could be provided as well? Also, since none of the treatments impact the # of TRCs, it would have been nice to include a positive control known to induce TRCs to demonstrate that the assay works (if such a control is known) - this is optional, and I would not ask to repeat the entire experiment just for this additional control (but maybe the authors have done it and the data is already available?).
      2. Figure 2A-C: please include information on number of cells quantified
      3. Figure 2C: In the label, please include degron, e.g. HCT116 CPSF73-AID rather than just HCT116
      4. Figure 5C: When quantifying nascent txn based on mNET-seq, to which extent would one expect terminally paused RNAPII along the gene body (premature termination events) to contribute to the increased signal? That is, could an increase in stalling be mistaken for an increase in transcription? Based on the metagene plot in Fig 2A it doesn't look like it, but the authors may be able to estimate the effect (if any) from their data.

      Significance

      The observed uncoupling of poly-A site selection and size of termination window is unexpected and raises important questions on how these coupled processes can be regulated independently.

      Strengths of the study:

      i) Parallel assessment of different CRC-based cell lines provides evidence of phenotype stability across patients.

      ii) Brings together strong technical expertise combining different state-of-the-art methodologies to map and correlate poly-A site usage, site of transcription termination, and levels of nascent transcription within the same cell lines under the same conditions, providing a comprehensive dataset.

      Limitations:

      i) For the time being, observation limited to CRC cell lines.<br /> ii) Mechanism behind proximal shift of termination to be determined.

      I expect this work to be of interest to an audience interested in transcription and regulation of gene expression more broadly, with potential translational relevance for cancer therapy.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study investigates how altered expression of cleavage and polyadenylation (CPA) factors affects alternative polyadenylation (APA), transcription termination and cellular phenotypes in colorectal cancer (CRC) cell lines. The authors combine genetic perturbations of CPA factors with chemical inhibition of CPSF73 and assess effects on clonogenic potential, transcription-replication conflicts, APA profiles, and transcription termination-associated RNAPII phosphorylation patterns. The main comparisons are performed between healthy (1CT), primary tumour (SW480, HCT116) and metastatic (SW620) cell lines, which are reported to contain altered expression levels of CPA factors.

      The data suggest differential dependence on CPA factors between primary tumour-derived and metastatic CRC cell lines, as well as changes in transcription termination patterns. The data are overall well-presented with clear figures. However, in several cases the strength of the conclusions appears to exceed the support provided by the data, and alternative interpretations should be considered.

      Major comments

      1. Clonogenic sensitivity to CPA factor perturbation and comparability of clonogenic assays between cell lines:
        • The data indicate that clonogenic potential in SW480 is strongly dependent on CPSF73 and PCF11, whereas SW620 appear less sensitive. However, the interpretation is complicated by differences in depletion efficiencies. In SW620 cells, PCF11 depletion appears inefficient, and protein levels remain higher than in siLUC-treated SW480 cells (Fig. 1D and S1C; also in comparison to 1CT by inference of Fig. 1C). Thus, the apparent resistance of SW620 cells could reflect insufficient depletion rather than true biological tolerance. The effectiveness of siCPSF73 treatments is difficult to assess from the presented data. Quantification of protein knockdown levels should be provided and incorporated into the interpretation.
        • In Fig. 1D, 1E, and S1D, colony formation of DMSO- or siLUC-treated SW620 and SW480 cells differs markedly in absolute terms. However, the graphs are normalized separately for each cell line, which obscures this difference. This raises two concerns: First, the baseline clonogenic capacity differs between the lines and should be discussed. Second, it is unclear whether direct comparisons between cell lines are valid when normalization is performed independently. For example, in absolute terms, 1 µM JTE-607 appears to have a similar effect in SW620 cells as 5 µM in SW480 cells, which would contradict the conclusion that metastatic cells are more tolerant to CPA perturbations. This issue should be explicitly addressed.
      2. Interpretation of transcription termination markers:
        • The study uses RNAPII T4ph as a marker of transcription termination, which is well justified based on the ref. [30], but still the mechanistic basis of this modification is not fully understood. Changes in T4ph localization are interpreted as consequences of CPA activity, but possible differences in kinase or phosphatase activities between cell lines are not considered that could affect the T4ph levels or localization. Therefore, conclusions based solely on T4ph redistribution should be presented with greater caution, and alternative explanations should be acknowledged.
        • Line 240 states that premature termination is increased in primary tumour cells. However, the data show increased T4ph signal (Fig. 4B) but no change in total RNAPII occupancy in gene bodies (Fig. 4A). This does not directly demonstrate increased termination. Additional evidence or a more cautious interpretation would be appropriate.
      3. Cleavage-termination distance as a predictor of transcript levels:
        • Figure 5A presents median distances across all genes. It would be informative to perform a gene-wise comparison between cell lines (difference in cleavage-T4ph distance for the same gene, e.g. in 1CT vs. HCT116, individual differences plotted across all genes). This analysis could help clarify how frequently individual genes experience the effect (shortening of the cleavage-T4ph distance between 1CT and tumour cells) that is observed globally.
        • The manuscript claims that proximity between pre-mRNA 3′-end cleavage and transcription termination predicts increased nuclear transcript levels. However, the correlation coefficients are small (Spearman r ~ -0.2 at most), indicating weak predictive power. Therefore, the use of the term "predicts," especially in the manuscript title, appears to overstate the strength of the relationship. The authors should either moderate this claim or provide additional analysis to support stronger predictive value.

      Minor comments

      • Figures 1B and S1A: The discontinuous y-axis makes it difficult to assess relative protein level differences between normal and cancer samples. Statistical testing should be included to evaluate significance.
      • Lines 217-218: The text should emphasize that nuclear RNA abundance may not reflect cytoplasmic mRNA levels, particularly when APA alters 3′UTRs and may affect mRNA stability.
      • Lines 261-264: The cleavage-termination distance metric should be more clearly defined as the distance between the polyadenylation site and the T4ph signal peak.

      Referees cross-commenting

      Reviewer #3: On the contrary to implied in the reviewer report, this manuscript does not report the effects of CPSF73 inhibitor JTE-607 on APA. On this note, as the authors discuss uncoupling of cleavage and transcription termination, they could consider (this is not a request) testing how the cleavage inhibitor JTE-607 impacts the distribution of transcription termination marker T4ph, and whether the effects would be different in different cell lines where the coupling appears to be different. This could give mechanistic insights into the sources of the differences between cell lines.

      Significance

      This study addresses an important question in RNA biology and cancer research: how altered expression or pharmacological targeting of CPA factors affects alternative polyadenylation, transcription termination, and cellular phenotypes in CRC models. This topic is timely, as CPSF73 has been proposed as a therapeutic target, making it important to understand the molecular and cellular consequences of modulating CPA factor activities. A key strength and robust finding of the study is the identification of unexpected relationships between pre-mRNA 3′-end processing and transcription termination during CRC progression. Notably, the authors report that changes in alternative polyadenylation and transcription termination appear to be uncoupled and may even occur in opposite directions. This challenges simplified models in which these processes are tightly coordinated and suggests that their (mis)regulation in cancer cells may be more complex than previously appreciated. Secondly, the study provides an interesting observation that gene-specific changes in cleavage-T4ph distance correlate negatively with changes in nuclear levels of processed transcripts. This suggests a potential relationship between the spatial coupling of 3′-end processing and transcription termination and transcript abundance. If validated mechanistically, this could represent a conceptual advance in understanding how transcription termination dynamics influence gene expression outputs. However, the observed correlations are relatively weak, and the mechanistic basis of this relationship remains unclear. As such, this advance is primarily descriptive at this stage. Several interpretations of experimental data would benefit from more cautious framing or additional analysis. In particular, the relationship between changes in CPA factor expression levels and sensitivity to the CPSF73 inhibitor JTE-607 across CRC cell lines remains unclear from the presented data. This work will be of interest primarily to basic researchers in RNA processing and transcription regulation, gene expression control, cancer cell biology and pharmacological targeting of RNA-processing machineries. Reviewer field of expertise: My expertise is in RNA processing and gene regulation. I do not have specific expertise clinical oncology or cancer biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      See Response to reviewers.docx

      Revision added:

      TOGGLE Cloud.docx

      Response to reviewers.docx

      High-resolution image.zip

      If the files are not displayed correctly, please have the editor contact the authors. As the submission system may compress images, a file named "High-resolution image.zip" has been uploaded.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors develop a method that extracts functional properties clusters cells from single-cell RNA sequencing using machine learning techniques.

      As it stands, there are several major shortcomings in the presentation of the work:

      • The motivation for the method is not well explained. Certainly analyses of single cell transcriptomics data do not capture the full state or trajectory, however there isn't any concrete example of the problem that this method intends to solve, nor how/why existing methods fail to capture this.
      • No motivation is given for any of the approaches that feature in the method, and therefore there is no consistent logical thread that a reader can follow to be convinced that the results are plausible.
      • Methods are not sufficiently explained.

      For example: There is no indication of how the original graph is obtained, nor what is diffusing in the 'graph diffusion' method. Statements like "The term 'sparse' indicates the process of sparsifying the matrix." does not explain anything about the actual process of making the matrix sparse. This could perhaps be understood in terms of a 'sparse' function in a particular linear algebra package, which would implicitly make this procedure concrete, but this is not mentioned.

      Section 2.2.2 mentions a reinforcement learning scheme, however none of the following explanation describes anything related to the commonly accepted reinforcement learning literature, and several quantities (such as the loss) remain undefined. Similarly, section 2.2.3 mentions the BERT pre-trained transformer without indicating how specifically it was modified or trained for this particular purpose, except perhaps in Figure 4 which itself is intensely confusing. Again in this section the authors mention a 'genetic algorithm' with no reference to any commonly accepted approaches used in the development of genetic algorithms for optimisation, and with no explanation of what exactly is optimised or how convergence is monitored.

      No code implementation is provided, and therefore it is impossible to use this to understand any of the methodology, and renders it impossible to reproduce.

      • Where mathematical notation is used it is incredibly confusing to read, using multiple symbols for different concepts, and not appearing to conform to any commonly accepted convention. In some cases, these are missing completely, for example on page 6, rendering it entirely impossible to follow.
      • The results do not support the assertions made about the method.

      No explanation of the alternative methods is given in section 3.1, nor why they are expected to perform the tasks chosen, or what the configuration of these models are and whether these have been optimised. In Table 2 many alternative methods are listed, however there is no explanation why only a small subset were chosen for comparison, nor what information the authors base their conclusions on (whether these were actually executed for this purpose, or whether they were interpreted from the paper).

      Metrics such as 'accuracy' are not defined, and are the only numerical evaluation of the method, whereas one would expect considerably more detailed evaluation of the claims, such as in the CoSpar paper.

      Section 2.4 mentions 'Details of scRNA-seq data processing and experimental methods are shown in the Supplementary Appendix 6 - Animal Processing.', however I have no supplementary material titled 'Appendix 6', and nothing at all that documents the scRNA-Seq pipeline.

      Section 3.2 seems to describe a very manual procedure for identifying these clusters, and seems to bear no relation to the TOGGLE procedure defined previously, so it's not clear how good an indication this is of the performance of the algorithm. Furthermore, the subsequent results seem to rarely refer to the TOGGLE method at all, and lack any meaningful comparison to alternative methods or why TOGGLE is essential for obtaining these results.

      In many cases the plots in this section are so distorted by compression that making out the text and the points is essentially impossible, and so I cannot comment on any of these.

      I would strongly recommend that the methodology of the paper is greatly expanded to cover exactly what is done, such that it is possible to reproduce in its entirety. Asking a third party who is an expert in machine learning to read through the descriptions and the mathematics would also be highly beneficial to ensure their correctness. Furthermore, it is essential that all of the claims made in the introduction and throughout the paper be systematically and explicitly validated in the results. If this cannot be done on real data, where ground-truth labels and trajectories are hard to come by, some evidence for these claims could be acquired by evaluation on simulated data.

      Significance

      The inference of cell state and trajectories from single cell RNA sequencing is a timely and important task in computational biology, with many important downstream applications. The method described in this paper aims to distinguish functionally distinct cell populations that exhibit small differences in transcript counts. However, it is not precisely articulated why the complicated approach proposed here is advantageous over simpler more conventional approaches, such as graph clustering, random-walk based methods such as CoSpar, or trajectory inference based on ODE kinetics such as in scVelo.

      Furthermore, the methods described are exceptionally vague and hard to follow, with mathematical descriptions and naming schemes that are inconsistent with the commonly accepted literature of the techniques referenced. Therefore it is also difficult for even a well-prepared reader to come to their own conclusions as to the performance and applicability of the proposed approach. This issue is compounded by the fact that there is very little in the way of validation of the specific claims made of the method, let alone in relation to alternative methods.

      The study would be greatly improved by expanded methods sections, documenting in detail what is done at each stage. Where existing work is referenced without an exact description, how the implementation differs from the reference must be addressed. Most of the description is currently in text form only, which is wholly insufficient for the kind of complex mathematical operations described. Furthermore, many of the claims throughout the paper go unaddressed in the results, where there are only a few accuracy metrics and comparisons of results, and there is no attempt to rigorously demonstrate an advantage of any of the novel components presented (for example, in an ablation study). Expanding these numerical metrics and comparisons across all applications of the method is essential for demonstrating the assertions of the paper - for example, constructing a metric for the comparison of the TOGGLE and ground-truth UMAP comparisons. In cases where there is no real ground truth in the data, simulated datasets could be used to demonstrate the plausible performance of TOGGLE in ideal scenarios.

      My expertise is in computational biology and machine learning, with a background in physics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Overall Assessment

      The manuscript addresses an important problem-inferring subtle functional states from single-time-point scRNA-seq. However, essential methodological details are missing, several claims lack rigorous support, and key computational steps cannot be reproduced from the current description. The biological experiments validate ferroptosis itself, but do not validate the correctness of the inferred trajectories or cluster boundaries.

      Major Comments

      1. In Step 1 of TOGGLE (Figure 1), the method employs graph diffusion and emphasizes that the resulting asymmetric distance matrix encodes additional information. In fact, the entire downstream TOGGLE framework is built upon this graph and its diffusion-based similarity. However, the manuscript currently lacks essential information regarding this core component: 1) it does not explain how the asymmetric distance matrix is generated, nor does it provide explicit formulas or computational steps; 2) it does not specify which type of diffusion operator is used; 3) it is unclear how the underlying graph is constructed from the expression matrix-e.g., whether a standard kNN graph is used and whether edge weights are normalized.
      2. Although the manuscript demonstrates the biological usefulness of TOGGLE across several datasets and provides experimental validation in multiple systems, the method still lacks essential ablation analyses and performance benchmarking. These components are critical for establishing the stability, robustness, and necessity of each part of the proposed framework. Without systematic evaluation-such as comparisons to existing trajectory or state-inference methods, or ablations of key modules (diffusion construction, boundary detection, masking, GA-based merging)-it remains difficult to assess whether TOGGLE's improvements are due to the core algorithmic design or to dataset-specific behaviors. Incorporating these analyses would substantially strengthen the evidence supporting the method's reliability and generalizability.
      3. I find the methodological description insufficiently clear. The overall algorithmic framework appears to be assembled from multiple existing computational components without a unified or coherent theoretical formulation. As a result, the rationale behind each module and the mathematical connections between them are not rigorously established. In addition, the mathematical expressions throughout the manuscript lack standardized notation and clarity. For example, vectors and matrices should be consistently denoted using bold symbols, and expressions such as "cov_D" are not appropriate for describing covariance matrices in a formal mathematical context. The absence of precise notation and properly structured equations makes it difficult for readers to understand, evaluate, or reproduce the proposed method. A clearer and more rigorous mathematical exposition is necessary to support the validity of the algorithmic design.
      4. Key components (δ computation, binary assignment, recursion criteria, optimization objective) lack formal definitions or pseudocode. The "reinforcement-like" description is conceptual rather than methodological.
      5. Both masking and GA introduce stochasticity and complexity. Their necessity is not demonstrated, and no ablation study tests whether they contribute to performance or stability.
      6. Masking ratio, GA population size, mutation rate, stopping criteria, and the details of pseudotime usage are all unspecified, making the computational procedure difficult to reproduce. Moreover, given the presence of multiple stochastic components-including masking, genetic-algorithm iterations, and graph-related randomness-the manuscript should evaluate the stability of the method under different random seeds or bootstrap resampling. Without clearly defined parameters and robustness analyses, it is challenging to assess the reliability and reproducibility of the proposed framework.

      Minor Comments

      1. "cellular neighborhoods" requires a precise definition.
      2. Some figures (e.g., Fig. 2) are schematic and would benefit from quantitative clarification.

      Significance

      Advancement:If clarified and rigorously validated, TOGGLE could become a useful tool for trajectory-free state inference. Currently, the novelty lies more in application breadth than in methodological rigor.

      Audience: Likely audiences include computational biologists, neuroscientists studying ferroptosis, and researchers working on NSC epigenetics. Usage beyond these areas depends on methodological clarification.

      Expertise: Keywords: single-cell transcriptomics, graph diffusion, clustering algorithms, trajectory inference, statistical modeling. I am comfortable evaluating the computational components; biological assays are evaluated from standard computational-biology perspective.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Guo et. al, Wnt/β-catenin in the muscle spindle

      Guo and colleagues investigated a role of canonical wnt signaling in the muscle spindle. Muscle spindles are formed by myofibers upon innervation by proprioceptive sensory neurons. The signal for induction of the spindle, Nrg1, is provided by the proprioceptive neuron and induces the expression of immediate early genes like Egr3 in the emerging spindle. Subsequently, bag1/2 and chain fibers are formed, and a capsule is generated around the spindle. Besides the inducing signal, little is known about spindle development, and to my knowledge all other work remained descriptive.

      To work with muscle spindles is demanding as the spindles are very rare in muscles. The authors have developed new approaches like collecting spindles for RNAseq analyses that allow them to molecularly analyze the spindle.

      In the manuscript, the authors characterize wnt signaling in muscle spindle development. They show data from the Axin2-GFP mouse strain (Axin2 is a well-known target of canonical wnt signaling). Their data indicate that both extra- and intrafusal fibers express Axin2 at P0 and P5; at P25 and P40 Axin2 is maintained in bag2 fibers and capsule cells of spindles and downregulated in other myofibers. This indicates that canonical wnt signaling is initially active in all fibers, and subsequently restricted to bag1 fibers and capsule cells.

      Two mouse strains are used to conditionally mutate β-catenin, the transducer of canonical wnt signals, in the spindle: the first strain relies on the use of Egr3-Cre (mutates β-catenin in intrafusal fibers and capsule cells; recombination sets in presumably shortly after E15.5). The second strain uses HSACreERT2 to mutate β-catenin (recombination is induced in all myofibers after tamoxifen treatment). In the data provided, tamoxifen was injected at P5 and the animals were analyzed at P25 or later. Molecular phenotyping is done on the Egr3-Cre strain using RNAseq showing around 750 down- and 300 upregulated genes in isolated muscle spindles from β-catenin mutants.

      Other phenotyping data: Histology (Egr3-Cre):1) changes in the distribution of GLUT1 in the spindle, 2) VCAN downregulation in capsule cells of the mutant; 3) abnormal aggregation of nuclei in bag2 fibers, 4) abnormal annulospiral morphology, i.e. proprioceptive neuronal endings are abnormal.<br /> Histology (HSACreERT2): 1,2) GLUT1 and VCAN unaffected; 3) bag2 nuclear aggregation is normal; 4) abnormal annulospiral morphology, 5) abnormal gait. The authors assign the differences in phenotypes to the differences in cell type specificity of recombination (Egr3-Cre: intrafusal fibers and capsule cells; HSACreERT2: intra- and extrafusal fibers). This indicates that 1) Wnt/β-catenin affects annulospiral endings indirectly via a primary deficit in bag2 fibers and 2) nuclear aggregation phenotype in bag2 fibers is caused indirectly via a primary deficit in capsule cells and 3) a cell autonomous function Wnt/β-catenin exists in capsule cells. Overall, the work is carefully done, and the data are presented clearly. The phenotypes are relatively mild, in particular the behavioral consequences of the mutation.

      I have some major points that should be discussed and taken into account in the writing of the paper.

      1. Developmental phenotypes. The authors claim the phenotypes observed are caused by developmental deficits, but the animals are only analyzed at P25 (histology and RNAseq) or later time points. From the data shown it cannot be excluded that the spindle is formed correctly but that spindle maintenance is impaired. Additional time points would be needed to convincingly argue a developmental phenotype. Specifically, analysis of a time point when control and mutant spindles have similar histology is needed, in order to argue that subsequent developmental steps are impaired.
      2. Differences in phenotypes in the two strains. Can the authors be sure that differences in phenotypes observed in Egr3-Cre and HSACreERT2 lines are exclusively due to cell type specificity of recombination, and not due to differences in recombination efficacies? This is particularly important for the syncytial fibers. Incomplete recombination in a fiber might allow nuclei that have not recombined to provide sufficient β-catenin for signaling in the entire fiber.
      3. Please provide data that show that β-catenin is expressed in capsule cells.

      Minor

      The following sentence refers to the wrong figure (should refer to Fig. 4): While mutant loops had similar widths, loop number was reduced and the distance between loops increased (Figure 3G-G').

      Significance

      Guo and colleagues investigated a role of canonical wnt signaling in the muscle spindle. Muscle spindles are formed by myofibers upon innervation by proprioceptive sensory neurons. The signal for induction of the spindle, Nrg1, is provided by the proprioceptive neuron and induces the expression of immediate early genes like Egr3 in the emerging spindle. Subsequently, bag1/2 and chain fibers are formed, and a capsule is generated around the spindle. Besides the inducing signal, little is known about spindle development, and to my knowledge all other work remained descriptive.

      To work with muscle spindles is demanding as the spindles are very rare in muscles. The authors have developed new approaches like collecting spindles for RNAseq analyses that allow them to molecularly analyze the spindle.

      In the manuscript, the authors characterize wnt signaling in muscle spindle development. They show data from the Axin2-GFP mouse strain (Axin2 is a well-known target of canonical wnt signaling). Their data indicate that both extra- and intrafusal fibers express Axin2 at P0 and P5; at P25 and P40 Axin2 is maintained in bag2 fibers and capsule cells of spindles and downregulated in other myofibers. This indicates that canonical wnt signaling is initially active in all fibers, and subsequently restricted to bag1 fibers and capsule cells.

      Two mouse strains are used to conditionally mutate β-catenin, the transducer of canonical wnt signals, in the spindle: the first strain relies on the use of Egr3-Cre (mutates β-catenin in intrafusal fibers and capsule cells; recombination sets in presumably shortly after E15.5). The second strain uses HSACreERT2 to mutate β-catenin (recombination is induced in all myofibers after tamoxifen treatment). In the data provided, tamoxifen was injected at P5 and the animals were analyzed at P25 or later. Molecular phenotyping is done on the Egr3-Cre strain using RNAseq showing around 750 down- and 300 upregulated genes in isolated muscle spindles from β-catenin mutants.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their study the authors address an important aspect in developmental neurobiology. In particular, they investigate the molecular underpinnings of muscle spindle development in the mouse. Muscle spindles are essential components to transmit muscle stretch and proprioceptive feedback to the spinal cord. They first analyze preexisting muscle spindle specific gene expression patterns that have been established before. They find an intriguing enrichment of expression of components of the Wnt/beta-catenin pathway. Inspired by these findings the authors next genetically deleted specific components from capsule and intrafusal spindle fibers during embryogenesis. They found profound gene expression changes and morphological alterations in the spindle fibers but also at the sensory proprioceptive nerve endings. Finally, the authors deleted beta-catenin at postnatal stages and detected significant defects in proprioception function. Altogether, they conclude that beta-catenin signaling exerts important function in muscle spindle development through cell-autonomous (spindle intrinsic) and non-cell-autonomous (affecting nerve terminals and proprioceptive functions) mechanisms.

      Overall the study is excellently conceived, the experiments performed at very high standards and the results were interpreted with great care. The manuscript is very well written and the data is presented neatly. In my opinion there are just a few very minor items that the authors could address to improve the reading experience.

      1. In Figure 1B, the font color in the blue boxes are not clearly readable and I recommend to use darker color tone or even black.
      2. Figure 1C-H, it would be useful to outline the capsule and fiber compartments in the fluorescent panels to improve the orientation and better appreciation of the expressed genes.
      3. Figure 3C'-3C'', the authors should define the meaning of the red and black labelled gene names.
      4. Figure 3C', the yellow writing is hard to read, I suggest to use darker color tone.
      5. Figure 4, the authors should write the proper genotype in the boxes and in italic font.
      6. Figure 5, the authors should write the proper genotype in the boxes and in italic font.
      7. In the introduction, the authors could cite a few more (perhaps major reviews) about the Wnt/beta-catenin biochemical functions. Ideally after the first sentence in the respective paragraph.

      Significance

      Proprioception is an essential process for controlling postures and movement. The anatomical development of the muscle spindles, that are responsible for detecting muscle stretch and transmitting proprioceptive feedback to the spinal cord, has been quite well described. However, the molecular mechanisms that regulate the development of the muscle spindles with the attached proprioceptive nerve endings are not well understood. To address this gap in our knowledge the authors evaluated muscle spindle specific gene expression and probed the function of the Wnt/beta-catenin pathway (highly specifically expressed in spindle components) in muscle spindle development and function. They found striking and significant deficits in muscle spindle development and proprioception upon muscle spindle specific ablation of beta-catenin. Altogether, they conclude that beta-catenin signaling exerts important function in muscle spindle development through cell-autonomous (spindle intrinsic) and non-cell-autonomous (affecting nerve terminals and proprioceptive functions) mechanisms. Conclusively, the data and findings in the present study reflect a true advance and provide new insights into the molecular mechanisms that drive muscle spindle development and therefore proprioception.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors demonstrate the importance and roles of Wnt/β-catenin signalling in mammalian muscle spindle development and maintenance.

      Major comments:

      The paper is very well and clearly written with full details of the data and methods. The results, statistical analyses, and conclusions are convincing without any need for further experiments.

      Minor comments:

      • In the Introduction (paragraph 3) the authors state "Extensive morphological studies have shown that muscle spindle development begins around embryonic day (E) 14, when slow myofibers first contact proprioceptive axons and differentiate into intrafusal fibers in a sequential process." I would suggest "Extensive morphological studies have shown that muscle spindle development in the mouse begins around embryonic day (E) 14, when proprioceptive axons first contact primary myotubes initiating the differentiation of primary and secondary myotubes into intrafusal fibers in a sequential process." In the same paragraph it is stated that "Recent work has shown that Lrp4 expression in intrafusal fibers is necessary to maintain the sensory synapses of annulospiral endings..." Sensory endings, including those of muscle spindles are not usually, nor conventionally, regarded as synapses.
      • Legend to Fig 1 (F,G; inset in G shows enlargement of Fzd2) Fzd2 to read Fzd10.
      • Legend to Fig 2 (A) "Dotted lines demarcate equatorial region of spindles." I suggest "Dotted lines demarcate areas enlarged in B'-C', including equatorial regions of spindles."
      • Paragraph beginning "Next, to associate these changes..." "Surprisingly, for intrafusal genes, the most enriched GO term was "neuron projection morphogenesis,..." Why is this surprising?
      • Legend to Fig 4 "a shorter spindle height in mutants" This is unclear; I suggest "a smaller spindles diameter" would be clearer. Similarly "and shorter nucleus height" is unclear; I suggest "and smaller nuclear accumulation diameter".
      • Legend to Fig 5 Again, I think "spindle height" would be clearer as "spindle diameter". Specific experimental issues that are easily addressable.
      • The figures are all clear, in some cases when sufficiently enlarged, but careful attention needs to be paid to their final enlargements to ensure that the essential details remain clearly visible.

      Referees cross-commenting

      It is satisfying to see that all three reviewers agree on the importance of this paper, and that two reviewers clearly agree that no further experimental work is necessary to support the conclusions reached by the authors.

      Significance

      This is an important work of major significance in the area of muscle spindle studies, and in the wider field of the genetic basis of the integrated development of complex sense organs.

      My expertise is in the structure, ultrastructure, immunohistochemistry, and physiology of muscle spindles.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point response to Reviewer comments:

      We copied the Reviewer comments below in italics. Revisions we propose in response to Reviewer comments are underlined.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Yin et al investigates how epidermal cells shape somatosensory neuron (SSN) morphology and function through selective ensheathment in Drosophila. This study builds on earlier work by another group showing that the phagocytic receptor Draper (Drpr) as a crucial epidermal factor that is important for dendrite pruning and clearance. In the present study, the authors how that Drpr also functions in the epidermis to establish the characteristic stretches of epidermal ensheathment of dendrite arborization neurons in the fruit fly Drosophila melanogaster. This is particularly true for highly branched types of dendrites but ont dendrites that show simpler branching patterns. Overexpression of Drpr increases ensheathment and nociceptor sensitivity, linking molecular recognition to sensory modulation. Further, Drpr is known to recognize phosphatidylserine (PS) on neurites to promote ensheathment and the authors show localization of a reporter for PS with epidermal membranes. Genetic manipulations that reduce PS results in a reduction in epidermal sheaths and the chemokine-like protein Orion promoting Drpr/PS interactions is required for these processes. Overall, the manuscript is well written, although at times maybe primarily for a fly audience. Reach could be improved by making it more accessible to a non-fly audience. The observation that Drpr is not only required for removing damaged or degenerating dendrites but also for their correct ensheathment of highly branched dendrites presents an important finding that could be of interest for a wider audience provided the following points are adequately addressed:

        • The Introduction could be further elaborated to help readers understand the significance of epidermal dendrite ensheathment. Addressing the following points may achieve this: (i) The Introduction would benefit from including details on developmental disorders and neurological diseases associated with defects or abnormalities in dendrite ensheathment.*

      We appreciate this suggestion. We allude to possible connections between ensheathment defects and human disease in the discussion but agree that it would be appropriate for the introduction; we will underscore this possible connection more clearly in our revised manuscript. We note studies of epidermal ensheathment are limited in mammalian systems, so links between dysregulation of epidermal ensheathment and disease have not been firmly established.

      (ii) In lines 74-79, it is unclear whether the described findings are conserved across evolution or were demonstrated in a specific model organism.

      The Reviewer refers to our statement about similarities in the cellular mechanism of epidermal ensheathment and phagocytosis. Indeed, these features are evolutionarily conserved in vertebrates, and we agree that it is worthwhile to emphasize this point. We added a statement underscoring the evolutionary conservation of the morphogenetic mechanism along with the relevant citation.

      (iii) Including a description of the known literature on phagocytosis in this process would help readers better understand the novelty and significance of this study.

      We agree with the Reviewer. In our revised introduction we will include a more detailed description of key features of phagocytic engulfment and highlight the salient differences between ensheathment and phagocytosis including the failure to complete the endocytic event in ensheathment and the persistence of PIP2 at the membrane.

      (iv) Details of published Draper function in Han et al 2014 should be elaborated along with unanswered question that is addressed in this study.

      The Han et al 2014 study established that epidermal cells, not Drosophila hemocytes (professional phagocytes), are primarily responsible for phagocytic clearance of damaged dendrites in the periphery. Similarly, the Rasmussen et al 2015 study we cite established that skin cells in vertebrates (zebrafish) act as primary phagocytes in removal of damaged peripheral neurites. These studies demonstrate the phagocytic capacity of epidermal cells, particularly in recognition of somatosensory neurites, and the Han study demonstrates that Draper is required for this epidermal phagocytosis. Neither of these studies addresses mechanisms of epidermal ensheathment; we will clarify this point in our revised introduction.

      • It is unclear why the authors focus exclusively on Drpr and Crq, without addressing emp and CG4006, both of which show higher expression levels than the former. Moreover, the conclusion that 14 out of 16 engulfment receptor genes have no role based solely on RNAi knockdown experiments is a very strong statement that may requires additional validation. The authors should provide evidence that the RNAi knockdowns achieved complete loss of gene function to support their claim about 16 engulfment receptors. In addition, at most the authors can conclude that the tested genes are individually not required.*

      The Reviewer makes several points that warrant discussion. First, the Reviewer asks “why the authors focus exclusively on Drpr and Crq, without addressing emp and CG40066.” The rationale for focusing on Drpr and Crq in our discussion of the expression data is that both Drpr and Crq function in phagocytic engulfment of damaged neurites. Our focus on Drpr for the remainder of the study is guided by the knockdown phenotypes; if either emp, CG40066, or any other receptor showed robust and reproducible effects on ensheathment we would have discussed them at length. Indeed, we identified a potentially novel ensheathment phenotype for NimB4 and devote a small portion of our discussion to its possible function. However, our primary focus in this study was to identify phagocytic receptors required for epidermal ensheathment of somatosensory neurites and drpr was the top hit from our RNAi screen.

      Second, we acknowledge that RNAi knockdown is often incomplete and without additional validation a negative result using RNAi is difficult to interpret. In our original text we state: “epidermal RNAi of 14/16 engulfment receptor genes had no significant effect on the extent of dendrite ensheathment in third instar larvae (Figure 1, F and G), consistent with the notion that most epidermal engulfment receptors are dispensable for dendrite ensheathment.” We do not claim that other receptors have “no role”, simply that our results are consistent with the interpretation that most receptors are dispensable. Furthermore, we acknowledge that multiple other receptors likely contribute to other aspects of ensheathment (lines 131-145; NimB4 knockdown causes an “empty sheath” phenotype). However, the Reviewer’s comments convince us that we should more clearly word our interpretation of the negative RNAi results more to reflect the limitations of the approach; we will incorporate this into our revision.

      Third, the Reviewer brings up the very important point that receptor redundancy could mask phenotypes. Indeed, our studies suggest that additional pathways likely function in parallel with Drpr. We agree that potential redundancy is an important consideration and absolutely warrants discussion in this section of the results; we will add this to our revised text and we have already updated the statement in the results to read “most epidermal phagocytic receptors are individually dispensable for dendrite ensheathment.”

      The final point the Review makes is that analysis of the knockdown efficacy is warranted if we want to make strong claims about gene function for other receptors. We agree that this would be an important first step, but in many cases protein perdurance masks RNAi phenotypes as well. So, efficient knockdown alone is not enough to make concrete conclusions about gene function in this developmental context.

      • What kind of genes are crq and ea?*

      Crq is a Scavenger receptor and Eater is a Nimrod-family receptor (indicated in Figure 1A).

      • Comparing Figures 1C and 1E, it appears that drpr knockdown has a differential effect on epidermal dendrite ensheathment between main and secondary branches. If this observation is correct, separate quantification for each branch type would be more appropriate, along with an explanation for the observed differences.*

      We agree with the Reviewer’s assessment that ensheathment appears to be largely absent on terminal dendrites following drpr knockdown but some ensheathment persists on major dendrites. In prior published studies we demonstrated that terminal branches are less extensively ensheathed than primary dendrites in wild-type larvae (Jiang et al 2019 eLife). We will provide this important context in our revised submission. We hypothesize that Drpr uniformly affects ensheathment across the arbor but agree with the Reviewer that quantification is warranted to evaluate this hypothesis. We will add this analysis to our revised submission.

      • For Figure 1K, it would be informative to examine how drpr knockdown affects dendrite length in these neurons.*

      We agree with the Reviewer. We demonstrate that drpr null mutants have exuberant terminal branching, but we have not yet analyzed effects of epidermal drpr RNAi. We will add this analysis to our revised manuscript.

      • For Drpr expression (Figure 3), it would be valuable to highlight any differences in expression between primary and secondary dendritic branches.*

      The Reviewer’s question about Drpr distribution at sites of ensheathment will be particularly relevant if we observe differential impacts of Drpr knockdown on ensheathment at primary and higher order dendrites. In our initial submission we showed that >70% of PIP2+ (Fig. 3B) and cora+ (Fig. 3D) epidermal sheaths also exhibited Drpr accumulation; we likewise showed that Drpr accumulation adjacent to dendrites only occurred at sites labeled by the sheath marker cora (Fig. 3G). In our revised submission, we will examine whether Drpr accumulation is more prevalent at sites of PIP2 accumulation on main branches compared to terminal branches.

      • Removing drpr leads to excessive branching of SSN dendrites. Does overexpression of drpr affect dendrite morphology in the opposite manner?*

      The Reviewer asks an intriguing question about effects of drpr overexpression. We have not examined effects of epidermal drpr overexpression on dendrite morphogenesis, but we will add these experiments to our revised manuscript.

      • Although drpr role in dendrite ensheathment is well explored, the interactions between drpr and PS seem underexplored. For example, do the changes in ensheathment as a result of manipulating PS levels require drpr? Does changing PS levels affect Drpr localization or levels?*

      The Reviewer raises two questions about the relationship between PS exposure and Drpr.

      First, they inquire whether changes in ensheathment resulting from manipulating PS levels require Drpr. We show that overexpressing the ATP8a flippase in C4da neurons, which limits PS exposure, limits the extent of ensheathment. Similarly, we show that sheath formation requires Drpr. In principle, we could assay effects of simultaneously overexpressing ATP8a in neurons and inactivating Drpr (using the Drpr null mutation), but such an experiment will likely be difficult to interpret because the individual treatments cause an almost complete loss of sheaths. We did not investigate whether increasing PS exposure increases ensheathment because prior studies demonstrated that ectopic PS exposure induces membrane shedding in C4da dendrites.

      Second, they inquire whether PS levels affect Drpr localization or levels. We demonstrate that inactivation of the PS bridging molecule Orion prevents Drpr localization at sheaths, hence we predict that neuronal overexpression of the ATP8a flippase should have a similar effect. In the revised manuscript, we will examine this possibility (monitoring Drpr distribution at epidermal contact sites with neurons overexpressing ATP8a).

      Minor Points:

        • Why there is no gene in bold category for hemocytes in Figure 1A*

      The bold type was used to indicate the receptors that were selected for screening, using a relaxed criteria for identifying receptors that were “expressed”: any receptor detected at a level of 0.1 TPM. To this point, the figure legend states: “Epidermal candidate genes in bold exhibited a TPM value > 0.1 in at least one biological sample and were selected for inclusion in RNAi screen for epidermal phagocytic receptors required for ensheathment.”

      We acknowledge that this is a relaxed criteria for “expression” and likely includes receptors that are not appreciably expressed in epidermal cells. Within the text we compare the repertoire of hemocyte and epidermal phagocytic receptors using a more standard (albeit still relatively relaxed) threshold of 0.5 TPM. We added shading to the histograms in Fig. 1A to facilitate comparison of phagocytic receptor gene expression in hemocytes and epidermal cells.

      • Line 67: "neurons BEING the most extensively..."*

      • Line 126: should read "epidermal engulfment receptors are INDIVIDUALLY dispensable"*

      • Line 216: "THE DrprD 5 mutation had no significant..."*

      • Line 230: "overexpression" instead of "overexpressed"*

      • Line 385: similar "TO"*

      These grammatical errors have been corrected. We thank the Reviewer for their careful reading of the manuscript.

      Reviewer #1 (Significance (Required)):

      This is an interesting study that adds to our understanding of the role of phagocytic receptors in shaping dendrites. Specifically, the role of Drpr (Draper) is studied, a gene previously known as an important for removal degenerating dendrites. The limitations of the manuscript as is is that it seems to be written primarily for a fly audience. Contextualizing the results and in the significance of this like conserved pathway could increase the significance.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Innervation of the skin by somatosensory neurons is a conserved process that enables perception and discrimination of mechanical stimuli. How do molecules exposed by neurons and skin cells collaborate to promote neurite-induced epidermal sheath formation? Here, the authors combine fruit fly molecular and genetic tools with high resolution imaging to address this fundamental question. Based on morphological similarity between phagocytosis and SSN ensheathment, the authors hypothesized that one or more phagocyte receptors might promote ensheathment through ligand-driven interactions with neurites. To test this hypothesis, the authors systematically screened phagocytic receptors expressed in the epidermis for functional roles in ensheathment. Through this screening approach, the authors found that the Draper (Drpr) receptor functions in epidermal cells as a significant factor required to promote ensheathment. They support this conclusion using a suite of cell- and tissue-specific RNAi tools and mutant fly lines in conjunction with elegant mechanistic work that establishes a role for the conserved "eat-me" signal phosphatidylserine (PS) in driving ensheathment.

      Major comments:

      Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      The seven key claims presented in the abstract are strongly supported by experimental data and analyses presented in the manuscript. At least one experimental result displayed in a main figure in support of the indicated key claim is summarized below. This summary does not present a comprehensive list of all data in support of a particular claim. Rather, it is an effort to confirm that each key result presented to the readership in the abstract is supported by at least one rigorously analyzed experimental result.

      We concur with the Reviewer’s interpretations of our work and appreciate the clarity of their summaries below.

        • Drpr functions in epidermal cells to promote ensheathment: Expressing a Draper RNAi under control of a larval epidermal driver (A58) led to significant reduction in total sheath length (Fig 1H), average sheath length (Fig 1I), and fraction ensheathed (Fig 1J). Similar results were obtained using two different Draper RNAi constructs.*

      The argument presented through RNAi results in Fig 1 is bolstered by data using an existing validated Draper mutant line in Fig 2A-E. A question of interest to this reviewer upon receiving the paper was whether Draper functions at initial stages of sheath formation, maintenance of existing sheaths, or both. The timelapse data in Fig 2F suggests that Draper activity is dispensable for maintaining existing sheaths.

      • ...that Draper accumulates at sites of epidermal ensheathment but not contact sites of unsheathed neurons:*

      Immunostaining experiments demonstrate that Drpr immunoreactivity is enriched at PIP2-positive membrane domains in epidermal cells (Fig 3A-B). Is this accumulation selective for epidermal sheaths? Yes. In Fig. 3E-G, the authors show that Drpr enrichment overlaps with the sheath marker cora but not with dendrites of C1da neurons or from unsheathed portions of C4da dendrite arbors. The authors confirm specificity of Drpr immunoreactivity through control experiments using a Drpr mutant (Supplementary Fig 2).

      • ...that Drpr overexpression increased ensheathment:*

      Enforced overexpression of Draper in epidermal cells via Epidermal GAL4 driving UAS-Drpr (Fig 5A) shows significantly higher levels of ensheathment of C4da neurons as compared to controls. The authors demonstrate specificity by showing that epidermal Drpr overexpression did not induce ectopic sheath formation in C1da neurons (Fig 5E-G).

      • ...that extracellular PS accumulates at sites of ensheathment:*

      Using a previously developed secreted AnnV-mScarlet reporter (Ji et al. 2023 https://doi.org/10.1073/pnas.2303392120), the authors demonstrate that PLC-PH-GFP labeled stretches were also labeled by AnnV-mScarlet (Fig 6A-B), consistent with their model that ensheathment by Drpr is mediated by PS exposure on dendrites.

      • ...that overexpression of the PS Flippase ATP8a blocks ensheathment:*

      This claim is supported by demonstrating that overexpression of ATP8A, a protein that drives drives unidirectional PS translocation from the outer to the inner leaflet of the plasma membrane, impacts C4da neurite ensheathment. Selective overexpression of ATP8A in C4da neurons using a ppk-GAL4 induced a significant reduction in epidermal sheaths (Fig 6C).

      • ...that Orion is required for sheath formation:*

      Inactivation of the chemokine-like PS bridging molecule Orion significantly reduces fraction of ensheathment (Fig 6I-L).

      • Overexpression of Draper enhanced nociceptor sensitivity to mechanical stimulus*

      Consistent with a functional role for epidermal ensheathment in responses to mechanical stimuli, the authors report a significant reduction in nocifensive responses in a behavioral assay presented in Fig 6H.

      In conclusion, the authors' claims are supported by the data as presented in this version of the manuscript.

      • Please request additional experiments only if they are essential for the conclusions. Alternatively, ask the authors to qualify their claims as preliminary or speculative, or to remove them altogether.

      n/a

      • If you have constructive further reaching suggestions that could significantly improve the study but would open new lines of investigations, please label them as "OPTIONAL".

      n/a

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated time investment for substantial experiments.

      n/a

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes. The quality of the cell imaging data presented in the figures is high. The figure legends are sufficient to follow the investigators' conceptual approach and technical progress as they build their model. Transparent presentation of the screening data in Fig. 1 F-G was particularly appreciated by these reviewers.

      Are the experiments adequately replicated and statistical analysis adequate?

      Yes. We specifically commend the table outlining all statistical tests presented in the supplementary methods and linked to each figure.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Minor comments:

      1. Could the authors further clarify Drpr's anticipated window of activity during sheath formation and/or speculate further on this point in the discussion? Live imaging in Fig. 2 suggest that Drpr is dispensable for maintenance of existing sheaths. Given that Drpr is proposed to be activated through transient phosphorylation that recruits the binding partner Shark (PMCID PMC2493287), it might be useful to clarify Drpr's window of activation (ie transient or constitutive) for an audience more familiar with Drpr's canonical functions in engulfment. The section prior to speculation about a possible role for negative regulators of phagocytosis (Line 360) might be a possible location for this addition.

      We appreciate the insightful suggestion. As the Reviewer notes, our results are consistent with a model in which Drpr is required for formation but not maintenance of sheaths. Our original hypothesis was that Drpr would transiently localize to sheaths and be largely absent from mature sheaths. However, our antibody staining suggests that Drpr persists at mature sheaths (signal from endogenously labeled Drpr protein was too dim for live imaging in our hands). We therefore favor a model in which Drpr is transiently activated to promote sheath assembly.

      In the context of engulfment, Src42A-dependent tyrosine phosphorylation of Draper promotes association of Shark and Draper pathway activation. Src42A activation is regulated by integrins and RTKs, providing a potential point of crosstalk with other pathway(s) likely involved in ensheathment. Intriguingly, membrane recruitment and activation of Talin depends in part on PIP2, and Talin promotes both Integrin activation and recruitment of PIP2-prodicing PIP5K Kinases, providing a potential feed-forward mechanism for increasing PIP2 accumulation, Talin recruitment, and Integrin activation, which can promote Src42A activation. In our revised discussion we will provide a more thorough treatment of mechanism(s) of Drpr activation.

      • The authors might consider developing their conclusion a bit further for a broad audience. For example, the gesture to Piezo dependence in the current final sentence might provide an opening to discuss an exciting future avenue focused on integrating molecular mechansensors into a comprehensive model of selective SSN ensheathment important for the perception and discrimination of touch and pain.*

      We appreciate the suggestion and agree that it is worthwhile to expand on the potential links between ensheathment and sensory neuron function in our revised discussion. Our studies thus far have largely explored mechanosensation, but it’s worth noting that the nociceptive neurons under study here are polymodal, and other functional classes of somatosensory neurons are ensheathed to differing degrees, so an intriguing open question is whether ensheathment selectively potentiates the function of mechanosensors or more generally enhances functional coupling of somatosensory neurons to the epidermis. Our finding that ensheathment levels can be bidirectionally regulated by drpr levels provides an entry point to more broadly characterizing functions for ensheathment.

      • Word missing or extra "in" in Line 69 after ECM?*

      Corrected.

      • In Fig 1 and Fig 3, the PLC(delta)-PH-GFP reporter contains the delta symbol, in other throughout the paper it does not. In addition, Fig 5 is denoted "PIP2 (PLC-PH-GFP)". For consistency the authors might consider using PLC(delta)-PH-GFP across all figures.*

      As suggested, we updated the figures and text to include the delta symbol in the reporter PLC(delta)-PH-GFP.

      • Fig 6P - do the authors suggest Orion is distributed at high concentration throughout the entire upper portion of the figure? Perhaps the coloration could be changed if Orion binding is suggested to occur between Drpr and PS.*

      We have not examined Orion distribution in the periphery, though prior studies demonstrate that it is secreted into the hemolymph from multiple sources. Our schematic focuses on sites of contact between epidermal cells and dendrites but omits the hemolymph, muscle, and other cell types in the periphery. In our initial schematic epidermal cells and Orion were shaded similarly; in our revision we chose a different color for epidermal cells to prevent confusion.

      Optional suggestions for consideration to provide further context for a broad audience:

      Optional 6. The authors might consider placing their work in the context of an emerging literature focused developmental roles for immune cell signaling molecules/other phagocyte receptors at steady state. While the present study focused on epidermal ensheathment of SSNs stands on its own as a notable contribution and does not require these citations to support its conclusions, context from an emerging literature bridging immunity and development might be of interest to a broad readership. Should the authors wish to strengthen the link between their work and findings from other systems indicating a shared role in immunity and development for key immunoreceptors and their binding partners, they might consider adding citations/phrasing indicating that Draper's molecular collaborator Shark kinase (PMCID PMC2493287) was initially discovered as a developmental gene required for dorsal closure (PMCID PMC316420). They might also consider highlighting the role of Draper's mammalian orthologs Megf10/Megf11 in regulating mosaic spacing of retinal neurons (PMCID PMC3310952).

      We appreciate the Reviewer’s suggestions, in particular the value of further highlighting relevant links between immunity and development. Not including Megf10/Megf11 (Drpr vertebrate orthologue) in our discussion was an oversight as we predict that Megf10/Megf11 serves a similar role in ensheathment of vertebrate somatosensory neurons. In our revised manuscript we will incorporate a more thorough discussion of the emerging literature bridging immunity and development.

      Optional 7. The authors might consider tying their extended discussion of integrins (~Line 320-Line 335) into their overall argument in a more cohesive manner. For example, how (if at all) do the authors see Drpr collaborating with other receptors to regulate initiation versus maintenance of sheaths? Is a model in which Drpr initiates ensheathment maintained by other molecules possible? Speculation on this point in the discussion might integrate other molecules into the authors' model in a cohesive manner and/or bolster the authors' discussion of Drpr's window of activation/deactivation during ensheathment.

      Indeed, we envision a model in which Drpr cooperates with other receptors; we discussed one possible connection to integrins above and will incorporate a fuller treatment of the possible crosstalk between these pathways in our discussion. Regarding a model in which Drpr initiates ensheathment maintained by other molecules: yes, we agree that this is possible, but our results suggest that additional receptors likely participate in sheath initiation as well. Drpr inactivation substantially reduces but does not totally eliminate ensheathment, however the sheaths that form in drpr mutants are structurally distinct from mature sheaths (shorter, narrower, appear to recruit less Cora). Hence, we favor a model in which drpr signaling cooperates with a parallel, partially redundant pathway for initiating sheath formation in response to sheath-promoting signals. Integrin signaling is a plausible candidate for this parallel pathway for reasons we discuss in our original submission (and above); in our revised discussion we will more extensively discuss the potential cross-talk between Drpr signaling and Integrin signaling in initiation and maintenance of epidermal sheaths.

      Reviewer #2 (Significance (Required)):

      This study provides a new link between a conserved phagocyte receptor (Drpr) and epidermal ensheathment of somatosensory neurons, an important process at the heart of the regulated development and function of the nervous system. As such, the Yin et al. submission is a significant contribution to a rapidly moving research area of broad interest to an intellectually diverse readership interested in the molecular and cellular basis of neurodevelopment and interactions between the nervous system and the immune system.* *

      An important strength of this study is the striking degree of the epidermal ensheathment phenotypes observed when normal Drpr expression is disrupted either through depletion, mutation, or targeted overexpression. For example, depletion of Drpr via RNAi induces a ~three fold reduction in total sheath length (Fig 1F - ~1.45 mm in controls as compared to ~0.5 mm with Drpr RNAi). Notably, epidermal enforced overexpression of Drpr induces a notable increase in the fraction of ensheathed neurons (Fig 5A-D). This strength of phenotype enables the investigators to deploy an elegant sequence of molecular and genetic tools to further probe mechanism and implicate extracellular PS in this process.* *

      Reviewer area keywords as requested: phagocytes, immune cell signaling, signal transduction

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The study by Yin and colleagues investigates how epidermal cells recognize and ensheath somatosensory neuron (SSN) dendrites in Drosophila larvae. The authors identify the phagocytic receptor Draper (Drpr) as a key mediator of selective epidermal ensheathment and demonstrate that this process relies on phosphatidylserine (PS) exposure on dendrites and the bridging molecule Orion. The work significantly advances our understanding of neuron/epidermis interactions and reveals a novel role for phagocytic recognition pathways in non-glial ensheathment.

      The manuscript is clearly written, methodologically solid and supported by compelling data. The authors combine genetic, imaging and functional approaches to uncover a mechanism of structural and functional modulation of nociceptive neurons. The results will interest researchers studying neuronal morphogenesis, epithelial biology and non-glial phagocytic pathways.

      Specific Critiques:

      While the study is strong and timely, several issues should be addressed prior to acceptance:

      Figure 1: The authors refer to the receptors as "engulfment receptors." I recommend calling them "phagocytic receptors" since not all are required for the engulfment step (e.g., Crq).

      The Reviewer makes an important distinction. We have updated our manuscript to reflect this point, replacing “engulfment receptor” with “phagocytic receptor” in the text and in our title.

      Figure 2: The title states "Drpr is required in epidermal cells..." yet the authors analyze a drpr null mutant, which lacks Drpr in all expressing cells (glia, macrophages and epidermal cells). The rationale for using the null mutant instead of epidermal-specific RNAi should be explained.

      The increased dendrite number in drpr RNAi larvae should also be noted here.

      We agree – the title is not appropriate for this version of the figure; we changed the title to better reflect the experiments being portrayed.

      Our RNAi experiments in Figure 1 and 2 demonstrate that drpr is cell autonomously required in epidermal cells for dendrite ensheathment. Here, we include analysis of an amorphic drpr allele to (1) provide further genetic support underscoring the requirement for drpr in dendrite ensheathment and (2) to underscore the observation that a small number of immature sheaths form in the complete absence of drpr, arguing for the presence of an additional pathway that contributes to sheath formation.

      Effects of epidermal drpr RNAi on dendrite number is not something we evaluated with our time-lapse studies in Figure 2. Instead, we monitored the effects of drpr knockdown on growth behavior of epidermal sheaths and found that epidermal drpr RNAi triggered an increase in the frequence of sheath retraction events and a decrease in sheath growth events.

      Figure 3: Explain the numbers on the X-axis in panels B and D. Add a panel without blue dashed outlines to better visualize Drpr expression. Adjust the red boxes to precisely match the enlarged regions.

      Each bar represents a single neuron; the numbers denote the number of sheaths sampled from each neuron. We added this to the figure and figure legend in our manuscript. We thank the Reviewer for identifying this oversight.

      We appreciate the Reviewer’s perspective on the blue hatched lines; we removed the hatched lines from the ROI and adjusted the position of the red hatched box.

      Figure 4: Why is the drpr mutant used here rather than RNAi? Please clarify the reasoning for choosing mutants in some experiments and knockdown in others.

      In Figure 2, we show analysis of the amorphic allele to further corroborate our RNAi studies, as described above. We chose to use the drpr amorphic mutant for these studies because we have no GAL4-independent reporter to label C1da neurons for analysis of dendrite arborization patterns. Although we could use HRP staining in combination with epidermal drpr RNAi, live imaging of dendrite arbors labeled by a C1da neuron GAL4 driver provides a more sensitive and reliable readout for morphogenesis studies.

      In our revised manuscript we will add analysis of C4da dendrite patterns in larvae expressing drpr RNAi in epidermal cells to evaluate whether the dendrite defects reflect epidermal requirements for drpr function.

      Figure 5: Correct the placement of white boxes in panels E-F′.

      We thank the Reviewer for identifying the mismatch. We corrected the placement to match the size of the ROIs.

      *Figure 6: AnnV staining in B is difficult to detect. Please add a version of the panel showing AnnV alone. *

      In our initial submission we include the overlay of PLC-PH-GFP and AnnV-mScarlet (B), an image showing the PLC-PH-GFP alone (B’) and an image showing the AnnV-mScarlet alone (B”).

      AnnV labeling appears weak on sheaths. Since epidermal membranes are strongly labeled, confirm PS exposure on dendrites with a commercial fluorescent Annexin V reagent.

      We appreciate the suggestion to use a commercial fluorescent Annexin V reagent and agree that it would strengthen our findings if such a reagent labeled sheaths. However, we intentionally prioritized analysis using the in vivo reporter because numerous studies indicate that epidermal sheaths are inaccessible to large molecules in solution (in the absence of detergent). One of the first assays used to monitor the in vivo distribution of sheaths was based on the inaccessibility of antibodies to ensheathed neurites (Kim et al, Neuron, 2012; also Tenenbaum et al, Current Biology, 2017; Jiang et al, eLife, 2019). More recently, we demonstrated that 10kDa dextran dyes are excluded from epidermal sheaths (Luedke et al, PLoS Genetics, 2024). Nevertheless, as part of our revision we will examine whether commercially available Annexin V reagents label sheaths.

      In F and F" sheaths are labeled in areas without visible dendrites. Please clarify.

      We note that although C4da dendrites are the most extensively ensheathed among da neurons, other neurons (most prominently C3da neurons) also exhibit significant ensheathment (Jiang et al, eLife, 2019). We use established markers of epidermal sheaths (Cora immunoreactivity in this panel; PIP2 reporters and/or Cora-GFP localization in other panels), hence Drpr accumulates at Cora+ sheaths on C4da neurons and Cora+ sheaths that form on other da neurons. We will clarify this point in the text of our revised manuscript.

      In O and P, show Drpr staining without blue dashed sheath outlines.

      We have removed the blue dashed outlines from the figure panels.

      The legend contains numerous labeling errors: there is no B′ or B"; C-G should be E-G; G-I should be H-J; I-L should be K-N; M-O should be O-R. Please revise carefully.

      The labeling errors have been corrected.

      Sup Fig 1: Add a panel with only c4da labeling to visualize dendrites.

      We have added a panel displaying only C4da dendrites to this figure.

      Sup Fig 2: The anti-Drpr signal is unexpected in the null mutant. Validate with an additional antibody (e.g., mouse monoclonal anti-Drpr from the DSHB).

      We appreciate the suggestion and have already tested the mouse monoclonal anti-Drpr antibody from DSHB and found that it is unsuitable for use in our preparations (ie, no Drpr-dependent immunoreactivity, even in specimens overexpressing Drpr).

      With respect to the comment about the unexpected signal in the null mutant, we note that the antibody is a rabbit polyclonal and is not purified. In our experience it is not uncommon for rabbit serum (even pre-immune serum) to recognize multiple antigens in the larval skin. Nevertheless, our control studies demonstrate that Drpr immunoreactivity is eliminated at epidermal sheaths in Drpr null mutants.

      Sup Fig 3: No panels A or B are shown; no PIP2 marker is present despite the legend. Please revise. Drpr overexpression appears to increase Cora levels in some cell. Could Drpr affect Cora expression or distribution? This should be addressed. Also dendrite number appears higher in Drpr-overexpressing larvae. Please state whether this is significant.

      The labeling errors in the legend have been corrected; the corresponding studies with the PIP2 marker are presented in Figure 5.

      All epidermal drivers we have characterized exhibit a low level of variegation in expression within a hemisegment that we have previously documented (Jiang et al 2014 Development; Jiang et al 2019 eLife), and we suspect that it may be related to epidermal endoreplication (epidermal cells do not synchronously endoreplicate). However, we have not observed any systematic difference in epidermal GAL4 driver or Cora-GFP expression in larvae overexpressing Drpr. We note that a single cell in the field of view in Supplemental Figure 3 exhibits a higher level of GFP fluorescence. We occasionally observe this, independent of background genotype.

      All gene names must be italicized and lowercase (e.g., drpr), including in figure labels and legends.

      All protein names must be capitalized and non-italic (e.g., Drpr, Cora).

      We appreciate the Reviewer’s feedback. We used Drpr in keeping with many recent reports, but the Reviewer is correct in outlining the standard naming conventions. We have changed the gene names to reflect convention (lowercase, italics for genes that were initially identified according to phenotypic characterization; uppercase, italics for genes named according to homology to orthologues in other species such as NimB4 and ATP8A)

      Define ROI on first use.

      Done. We defined ROI in the methods section.

      Ensure consistent phrasing: use "anti-Cora or anti-Drpr immunoreactivity" uniformly.

      We have done so.

      There a few typos which must be corrected:

        • Line 196: "containing" → "contain"*
        • Line 205: "antibodies Drpr" → "antibodies to Drpr" or "anti-Drpr antibodies"*
        • Line 331: "predominan" → "predominant"*
        • Line 353: "phagocyting" → "phagocytic"*
        • Line 385: "similar the effect" → "similar to the effect"*
        • Line 432: Title should be underlined*
        • Line 544: "drpr∆5" is missing the 5*
        • Line 569: "immunoreactivity a" → "immunoreactivity of"*

      The typographical errors have been corrected. We thank the Reviewer for the close reading of the manuscript.

      Reviewer #3 (Significance (Required)):

      The manuscript makes a meaningful contribution to the field of neuron/epidermal cells interactions by demonstrating that recognized phagocytic machinery components can be co-opted for ensheathment of sensory neurites. This not only expands our understanding of skin innervation and mechanosensation but also raises intriguing implications for how similar mechanisms might operate in vertebrates (e.g., epidermal/nerve interactions, peripheral neuropathy). Given the functional link to nociceptive sensitivity, the work may have broader relevance for pain biology and sensory disorders.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The study by Yin and colleagues investigates how epidermal cells recognize and ensheath somatosensory neuron (SSN) dendrites in Drosophila larvae. The authors identify the phagocytic receptor Draper (Drpr) as a key mediator of selective epidermal ensheathment and demonstrate that this process relies on phosphatidylserine (PS) exposure on dendrites and the bridging molecule Orion. The work significantly advances our understanding of neuron/epidermis interactions and reveals a novel role for phagocytic recognition pathways in non-glial ensheathment. The manuscript is clearly written, methodologically solid and supported by compelling data. The authors combine genetic, imaging and functional approaches to uncover a mechanism of structural and functional modulation of nociceptive neurons. The results will interest researchers studying neuronal morphogenesis, epithelial biology and non-glial phagocytic pathways.

      While the study is strong and timely, several issues should be addressed prior to acceptance: Figure 1: The authors refer to the receptors as "engulfment receptors." I recommend calling them "phagocytic receptors" since not all are required for the engulfment step (e.g., Crq).

      Figure 2: The title states "Drpr is required in epidermal cells..." yet the authors analyze a drpr null mutant, which lacks Drpr in all expressing cells (glia, macrophages and epidermal cells). The rationale for using the null mutant instead of epidermal-specific RNAi should be explained. The increased dendrite number in drpr RNAi larvae should also be noted here.

      Figure 3: Explain the numbers on the X-axis in panels B and D. Add a panel without blue dashed outlines to better visualize Drpr expression. Adjust the red boxes to precisely match the enlarged regions.

      Figure 4: Why is the drpr mutant used here rather than RNAi? Please clarify the reasoning for choosing mutants in some experiments and knockdown in others.

      Figure 5: Correct the placement of white boxes in panels E-F′.

      Figure 6: AnnV staining in B is difficult to detect. Please add a version of the panel showing AnnV alone. AnnV labeling appears weak on sheaths. Since epidermal membranes are strongly labeled, confirm PS exposure on dendrites with a commercial fluorescent Annexin V reagent. In F and F" sheaths are labeled in areas without visible dendrites. Please clarify. In O and P, show Drpr staining without blue dashed sheath outlines. The legend contains numerous labeling errors: there is no B′ or B"; C-G should be E-G; G-I should be H-J; I-L should be K-N; M-O should be O-R. Please revise carefully.

      Sup Fig 1: Add a panel with only c4da labeling to visualize dendrites. Sup Fig 2: The anti-Drpr signal is unexpected in the null mutant. Validate with an additional antibody (e.g., mouse monoclonal anti-Drpr from the DSHB). Sup Fig 3: No panels A or B are shown; no PIP2 marker is present despite the legend. Please revise. Drpr overexpression appears to increase Cora levels in some cell. Could Drpr affect Cora expression or distribution? This should be addressed. Also dendrite number appears higher in Drpr-overexpressing larvae. Please state whether this is significant.

      All gene names must be italicized and lowercase (e.g., drpr), including in figure labels and legends. All protein names must be capitalized and non-italic (e.g., Drpr, Cora). Define ROI on first use. Ensure consistent phrasing: use "anti-Cora or anti-Drpr immunoreactivity" uniformly. There a few typos which must be corrected:

      • Line 196: "containing" → "contain"
      • Line 205: "antibodies Drpr" → "antibodies to Drpr" or "anti-Drpr antibodies"
      • Line 331: "predominan" → "predominant"
      • Line 353: "phagocyting" → "phagocytic"
      • Line 385: "similar the effect" → "similar to the effect"
      • Line 432: Title should be underlined
      • Line 544: "drpr∆5" is missing the 5
      • Line 569: "immunoreactivity a" → "immunoreactivity of"

      Significance

      The manuscript makes a meaningful contribution to the field of neuron/epidermal cells interactions by demonstrating that recognized phagocytic machinery components can be co-opted for ensheathment of sensory neurites. This not only expands our understanding of skin innervation and mechanosensation but also raises intriguing implications for how similar mechanisms might operate in vertebrates (e.g., epidermal/nerve interactions, peripheral neuropathy). Given the functional link to nociceptive sensitivity, the work may have broader relevance for pain biology and sensory disorders.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Innervation of the skin by somatosensory neurons is a conserved process that enables perception and discrimination of mechanical stimuli. How do molecules exposed by neurons and skin cells collaborate to promote neurite-induced epidermal sheath formation? Here, the authors combine fruit fly molecular and genetic tools with high resolution imaging to address this fundamental question. Based on morphological similarity between phagocytosis and SSN ensheathment, the authors hypothesized that one or more phagocyte receptors might promote ensheathment through ligand-driven interactions with neurites. To test this hypothesis, the authors systematically screened phagocytic receptors expressed in the epidermis for functional roles in ensheathment. Through this screening approach, the authors found that the Draper (Drpr) receptor functions in epidermal cells as a significant factor required to promote ensheathment. They support this conclusion using a suite of cell- and tissue-specific RNAi tools and mutant fly lines in conjunction with elegant mechanistic work that establishes a role for the conserved "eat-me" signal phosphatidylserine (PS) in driving ensheathment.

      Major comments:

      Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      The seven key claims presented in the abstract are strongly supported by experimental data and analyses presented in the manuscript. At least one experimental result displayed in a main figure in support of the indicated key claim is summarized below. This summary does not present a comprehensive list of all data in support of a particular claim. Rather, it is an effort to confirm that each key result presented to the readership in the abstract is supported by at least one rigorously analyzed experimental result.

      1. Drpr functions in epidermal cells to promote ensheathment: Expressing a Draper RNAi under control of a larval epidermal driver (A58) led to significant reduction in total sheath length (Fig 1H), average sheath length (Fig 1I), and fraction ensheathed (Fig 1J). Similar results were obtained using two different Draper RNAi constructs. The argument presented through RNAi results in Fig 1 is bolstered by data using an existing validated Draper mutant line in Fig 2A-E. A question of interest to this reviewer upon receiving the paper was whether Draper functions at initial stages of sheath formation, maintenance of existing sheaths, or both. The timelapse data in Fig 2F suggests that Draper activity is dispensable for maintaining existing sheaths.
      2. ...that Draper accumulates at sites of epidermal ensheathment but not contact sites of unsheathed neurons: Immunostaining experiments demonstrate that Drpr immunoreactivity is enriched at PIP2-positive membrane domains in epidermal cells (Fig 3A-B). Is this accumulation selective for epidermal sheaths? Yes. In Fig. 3E-G, the authors show that Drpr enrichment overlaps with the sheath marker cora but not with dendrites of C1da neurons or from unsheathed portions of C4da dendrite arbors. The authors confirm specificity of Drpr immunoreactivity through control experiments using a Drpr mutant (Supplementary Fig 2).
      3. ...that Drpr overexpression increased ensheathment: Enforced overexpression of Draper in epidermal cells via Epidermal GAL4 driving UAS-Drpr (Fig 5A) shows significantly higher levels of ensheathment of C4da neurons as compared to controls. The authors demonstrate specificity by showing that epidermal Drpr overexpression did not induce ectopic sheath formation in C1da neurons (Fig 5E-G).
      4. ...that extracellular PS accumulates at sites of ensheathment: Using a previously developed secreted AnnV-mScarlet reporter (Ji et al. 2023 https://doi.org/10.1073/pnas.2303392120), the authors demonstrate that PLC-PH-GFP labeled stretches were also labeled by AnnV-mScarlet (Fig 6A-B), consistent with their model that ensheathment by Drpr is mediated by PS exposure on dendrites.
      5. ...that overexpression of the PS Flippase ATP8a blocks ensheathment: This claim is supported by demonstrating that overexpression of ATP8A, a protein that drives drives unidirectional PS translocation from the outer to the inner leaflet of the plasma membrane, impacts C4da neurite ensheathment. Selective overexpression of ATP8A in C4da neurons using a ppk-GAL4 induced a significant reduction in epidermal sheaths (Fig 6C).
      6. ...that Orion is required for sheath formation: Inactivation of the chemokine-like PS bridging molecule Orion significantly reduces fraction of ensheathment (Fig 6I-L).
      7. Overexpression of Draper enhanced nociceptor sensitivity to mechanical stimulus Consistent with a functional role for epidermal ensheathment in responses to mechanical stimuli, the authors report a significant reduction in nocifensive responses in a behavioral assay presented in Fig 6H.

      In conclusion, the authors' claims are supported by the data as presented in this version of the manuscript.

      • Please request additional experiments only if they are essential for the conclusions. Alternatively, ask the authors to qualify their claims as preliminary or speculative, or to remove them altogether.

      n/a - If you have constructive further reaching suggestions that could significantly improve the study but would open new lines of investigations, please label them as "OPTIONAL".

      n/a - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated time investment for substantial experiments. n/a - Are the data and the methods presented in such a way that they can be reproduced?

      Yes. The quality of the cell imaging data presented in the figures is high. The figure legends are sufficient to follow the investigators' conceptual approach and technical progress as they build their model. Transparent presentation of the screening data in Fig. 1 F-G was particularly appreciated by these reviewers.

      Are the experiments adequately replicated and statistical analysis adequate?

      Yes. We specifically commend the table outlining all statistical tests presented in the supplementary methods and linked to each figure.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Minor comments:

      1. Could the authors further clarify Drpr's anticipated window of activity during sheath formation and/or speculate further on this point in the discussion? Live imaging in Fig. 2 suggest that Drpr is dispensable for maintenance of existing sheaths. Given that Drpr is proposed to be activated through transient phosphorylation that recruits the binding partner Shark (PMCID PMC2493287), it might be useful to clarify Drpr's window of activation (ie transient or constitutive) for an audience more familiar with Drpr's canonical functions in engulfment. The section prior to speculation about a possible role for negative regulators of phagocytosis (Line 360) might be a possible location for this addition.
      2. The authors might consider developing their conclusion a bit further for a broad audience. For example, the gesture to Piezo dependence in the current final sentence might provide an opening to discuss an exciting future avenue focused on integrating molecular mechansensors into a comprehensive model of selective SSN ensheathment important for the perception and discrimination of touch and pain.
      3. Word missing or extra "in" in Line 69 after ECM?
      4. In Fig 1 and Fig 3, the PLC(delta)-PH-GFP reporter contains the delta symbol, in other throughout the paper it does not. In addition, Fig 5 is denoted "PIP2 (PLC-PH-GFP)". For consistency the authors might consider using PLC(delta)-PH-GFP across all figures.
      5. Fig 6P - do the authors suggest Orion is distributed at high concentration throughout the entire upper portion of the figure? Perhaps the coloration could be changed if Orion binding is suggested to occur between Drpr and PS.

      Optional suggestions for consideration to provide further context for a broad audience: Optional 6. The authors might consider placing their work in the context of an emerging literature focused developmental roles for immune cell signaling molecules/other phagocyte receptors at steady state. While the present study focused on epidermal ensheathment of SSNs stands on its own as a notable contribution and does not require these citations to support its conclusions, context from an emerging literature bridging immunity and development might be of interest to a broad readership. Should the authors wish to strengthen the link between their work and findings from other systems indicating a shared role in immunity and development for key immunoreceptors and their binding partners, they might consider adding citations/phrasing indicating that Draper's molecular collaborator Shark kinase (PMCID PMC2493287) was initially discovered as a developmental gene required for dorsal closure (PMCID PMC316420). They might also consider highlighting the role of Draper's mammalian orthologs Megf10/Megf11 in regulating mosaic spacing of retinal neurons (PMCID PMC3310952).

      Optional 7. The authors might consider tying their extended discussion of integrins (~Line 320-Line 335) into their overall argument in a more cohesive manner. For example, how (if at all) do the authors see Drpr collaborating with other receptors to regulate initiation versus maintenance of sheaths? Is a model in which Drpr initiates ensheathment maintained by other molecules possible? Speculation on this point in the discussion might integrate other molecules into the authors' model in a cohesive manner and/or bolster the authors' discussion of Drpr's window of activation/deactivation during ensheathment.

      Significance

      This study provides a new link between a conserved phagocyte receptor (Drpr) and epidermal ensheathment of somatosensory neurons, an important process at the heart of the regulated development and function of the nervous system. As such, the Yin et al. submission is a significant contribution to a rapidly moving research area of broad interest to an intellectually diverse readership interested in the molecular and cellular basis of neurodevelopment and interactions between the nervous system and the immune system.

      An important strength of this study is the striking degree of the epidermal ensheathment phenotypes observed when normal Drpr expression is disrupted either through depletion, mutation, or targeted overexpression. For example, depletion of Drpr via RNAi induces a ~three fold reduction in total sheath length (Fig 1F - ~1.45 mm in controls as compared to ~0.5 mm with Drpr RNAi). Notably, epidermal enforced overexpression of Drpr induces a notable increase in the fraction of ensheathed neurons (Fig 5A-D). This strength of phenotype enables the investigators to deploy an elegant sequence of molecular and genetic tools to further probe mechanism and implicate extracellular PS in this process.

      Reviewer area keywords as requested: phagocytes, immune cell signaling, signal transduction

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Yin et al investigates how epidermal cells shape somatosensory neuron (SSN) morphology and function through selective ensheathment in Drosophila. This study builds on earlier work by another group showing that the phagocytic receptor Draper (Drpr) as a crucial epidermal factor that is important for dendrite pruning and clearance. In the present study, the authors how that Drpr also functions in the epidermis to establish the characteristic stretches of epidermal ensheathment of dendrite arborization neurons in the fruit fly Drosophila melanogaster. This is particularly true for highly branched types of dendrites but ont dendrites that show simpler branching patterns. Overexpression of Drpr increases ensheathment and nociceptor sensitivity, linking molecular recognition to sensory modulation. Further, Drpr is known to recognize phosphatidylserine (PS) on neurites to promote ensheathment and the authors show localization of a reporter for PS with epidermal membranes. Genetic manipulations that reduce PS results in a reduction in epidermal sheaths and the chemokine-like protein Orion promoting Drpr/PS interactions is required for these processes. Overall, the manuscript is well written, although at times maybe primarily for a fly audience. Reach could be improved by making it more accessible to a non-fly audience. The observation that Drpr is not only required for removing damaged or degenerating dendrites but also for their correct ensheathment of highly branched dendrites presents an important finding that could be of interest for a wider audience provided the following points are adequately addressed:

      1. The Introduction could be further elaborated to help readers understand the significance of epidermal dendrite ensheathment. Addressing the following points may achieve this:

      (i) The Introduction would benefit from including details on developmental disorders and neurological diseases associated with defects or abnormalities in dendrite ensheathment.

      (ii) In lines 74-79, it is unclear whether the described findings are conserved across evolution or were demonstrated in a specific model organism.

      (iii) Including a description of the known literature on phagocytosis in this process would help readers better understand the novelty and significance of this study.

      (iv) Details of published Draper function in Han et al 2014 should be elaborated along with unanswered question that is addressed in this study. 2. It is unclear why the authors focus exclusively on Drpr and Crq, without addressing emp and CG4006, both of which show higher expression levels than the former. Moreover, the conclusion that 14 out of 16 engulfment receptor genes have no role based solely on RNAi knockdown experiments is a very strong statement that may requires additional validation. The authors should provide evidence that the RNAi knockdowns achieved complete loss of gene function to support their claim about 16 engulfment receptors. In addition, at most the authors can conclude that the tested genes are individually not required. 3. What kind of genes are crq and ea? 4. Comparing Figures 1C and 1E, it appears that drpr knockdown has a differential effect on epidermal dendrite ensheathment between main and secondary branches. If this observation is correct, separate quantification for each branch type would be more appropriate, along with an explanation for the observed differences. 5. For Figure 1K, it would be informative to examine how drpr knockdown affects dendrite length in these neurons. 6. For Drpr expression (Figure 3), it would be valuable to highlight any differences in expression between primary and secondary dendritic branches. 7. Removing drpr leads to excessive branching of SSN dendrites. Does overexpression of drpr affect dendrite morphology in the opposite manner? 8. Although drpr role in dendrite ensheathment is well explored, the interactions between drpr and PS seem underexplored. For example, do the changes in ensheathment as a result of manipulating PS levels require drpr? Does changing PS levels affect Drpr localization or levels?

      Minor Points:

      1. Why there is no gene in bold category for hemocytes in Figure 1A
      2. Line 67: "neurons BEING the most extensively..."
      3. Line 126: should read "epidermal engulfment receptors are INDIVIDUALLY dispensable"
      4. Line 216: "THE DrprD 5 mutation had no significant..."
      5. Line 230: "overexpression" instead of "overexpressed"
      6. Line 385: similar "TO"

      Significance

      This is an interesting study that adds to our understanding of the role of phagocytic receptors in shaping dendrites. Specifically, the role of Drpr (Draper) is studied, a gene previously known as an important for removal degenerating dendrites. The limitations of the manuscript as is is that it seems to be written primarily for a fly audience. Contextualizing the results and in the significance of this like conserved pathway could increase the significance.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Wang et al. use Difteria Toxin (DT) to cause hair cell (HC) death in transgenic mice expressing the DT receptor in the HC of the inner ear. This model is assumed to cause HC loss in a selective way. The lesioned mice are assessed for translational vestibulo-ocular reflex (tVOR), vestibular sensory evoked potential (VsEP), rotational vestibulo-ocular reflex (rVOR), and single-unit recordings of vestibular afferents from cristae and maculae. Numbers of surviving HC, including total HC, type I HC (HCI) and type II HC (HCII) were also obtained at short and long times after DT exposure. By comparing the functional and histological results, the authors conclude that DT cause dose-dependent HC loss and vestibular function loss, that limited but significant HC regeneration occurs and that vestibular organs display variable but ample redundancy because robust physiological responses were obtained despite loss of high percentages of HCs.

      However, there are several limitations in the experimental design, methodological choices and analysis of the results that weaken the conclusions stated by the authors. Also, some important aspects of the work are not clear enough for an in deep scrutiny.

      The following list of weaknesses is not arranged in order of importance.

      1. Choice and use of the Pou4f3DTR/+ transgenic on FVB and C57/Bl6 backgrounds.

      1.a. Literature descriptions of the Pou4f3-DTR model used C57/Bl6 and CBA/J backgrounds and low mortality rates were found after DT administration. The present study generated Pou4f3-DTR mice on a FVB background and found that DT cause high mortality rates in this background. Comparison of the C57/Bl6 and FVB backgrounds are included in Figures 1 and 2 and the conclusion was that the C57/Bl6 background is more suitable for studying vestibular HC degeneration/regeneration. However, data are presented in Figures 3 to 7 without informing the reader whether these are from C57/Bl6 or FVB animals. Because of the information given in table 1, at least part of the data in these Figures is from the less suitable FVB mice. It is also possible that some data sets contain unbalanced numbers of animals from each strain in the different experimental conditions, with a potential impact on the robustness of the results. The strain identity of animals should be clarified across all data sets.

      1.b. Why the Pou4f3-DTR transgene was introduced in the FVB strain? The FVB strain is frequently used in transgenesis because the prominent pronuclei in their fertilized eggs and large litter size. While generation of transgenic lines in FVB mice is common, why would you want to bring an already established transgenic modification to a FVB background? It is known that FVB mice become blind by wean age due to a mutation in the Pde6b gene. Were the authors trying to have the Pou4f3-DTR model in a strain of blind animals? It is anomalous that the rationale for the FVB derivation is not provided and that the blindness of this strain is not even mentioned in an article containing VOR data. 2. Toxicity of the DT.

      2.1. The non-HC toxicity of DT is not evaluated. One of the stated reasons of the choice of the Pou4f3-DTR model to ablate HC is that other alternative models (aminoglycosides, cisplatin, IDPN) cause other toxic effects besides HC toxicity. However, the lack of evidence of other toxicities in Pou4f3-DTR mice after DT administration may simply be due to lack of assessment. Besides the inner ear, Pou4f3 is expressed in several structures including the genitourinary system, the retina, Merkel cells and subsets of somatosensory and brain neurons (https://www.ncbi.nlm.nih.gov/datasets/gene/18998/; PMID: 20826176; PMID: 22262898; PMID: 34266958; PMID: 33135183), so one would expect DT toxicity in these Pou4f3 expressing cells. Also, DT may cause other toxicities not explored in the model. The fact that the DT treatment is toxic beyond the intended HC toxicity is proven by the high (strain-dependent) mortality rate recorded in this study. A more detailed analysis of the effects of DT in the Pou4f3-DTR mice is needed before stating that the treatment is selective for the inner ear HCs. By the way, hyperactivity is not an additional toxic effect of ototoxic chemicals, it is a consequence of the vestibular function loss.

      2.2. The dose-response relationship of the DT treatment is unclear. The authors state that DT caused a dose-dependent loss of HC. However, the effects across different DT doses were not compared directly. Instead, each DT dose was compared with a different set of controls, and then the percentage of HC loss was qualitatively compared without statistical comparison. Looking at the numbers, the percent loss after the 35x2 dose is greater than that recorded after the 50x2 dose, contradicting the conclusion of a dose-dependent effect. One possible explanation is that the DT treatment has an inverted-U dose-response relationship, and the 25x2, 35x2 and 50x2 doses draw the bottom of the U. Alternatively, you have a dose-dependent effect with a dose causing a moderate effect (15) and 3 doses (25x2, 35x2, 50x2) causing near-maximal effects with differences among these groups more related to experimental variability than to dose-dependency. <br /> 3. Experimental design, use of animals, role of batch-to-batch variability in apparent results.

      3.a. The number of animals used in each experimental condition, their assignment to each assessment and participation in each dataset must be clarified. The reader is not informed on whether the animals used for physiological and histological analyses were the same or separate sets of animals were used. Also, the distribution of animals in different batches is not clarified and this may have originated apparent results through experimenter-generated bias. For instance, the HC count data are presented as two different, independent experiments, one evaluating different doses in the two strains at 14 days after exposure (Figures 1 and 2) and a second one comparing the HC counts at 2 weeks and 6 months after exposure (Figures 3 and 4). However, these were not separate experiments because at least some animals were shared in the two "experiments". This is demonstrated by the duplicate images between figures 1 and 2 and figures 3 and 4 (for instance, images D to D' in Figure 1 are the same than images C to C' in Figure 3). Therefore, at least part of the data for 2-week animals in Figure 3 have already been used as data of day-14 animals in Figure 1. This makes this reviewer suspect that 6-month animals in Figure 3 were treated with DT at different dates than 2-month animals in the same figure. Therefore, the small but significant "regeneration" could be simply due to differences in experimental outcome due to batch-to-batch experimental variability. In this kind of models, batch-to-batch experimental variability may be large and generate apparent group differences. For instance, in Figure 1, HC loss seems to be deeper after 35x2 than after 50x2. Although no statistical comparison is made between these groups, there seems to be an inversion of the dose-effect relationship that may simply depend on experimental (batch-to-batch) variability.

      3.b. The aim of revealing the relationship between HC loss and function retention should ideally be addressed using an experimental design providing subject-based data for comparison. That is, you cause the lesion, next you evaluate the function, and then you obtain the tissues for histological assessment, so the individual functional values can be matched to the individual HC numbers for a robust assessment of the relationship. In this work, group data from functional analyses are compared to group data from histological analyses, but no information is given on whether the same or different animals were used. If the same animals were used, the lack of direct comparison of the individual data is surprising and suggest that perhaps the comparison was made and conflicting results were observed. Alternatively, if different sets of animals were used, the conclusions on the "redundancy" of the vestibular organs are severely weakened because batch-to-batch variability in the extent of the lesion may be large and the lesions in the animals used for physiological assessment were in fact not assessed. As noted above, the possibility of a large batch-to-batch variability in the extent of the lesion is supported by the observation that lesions in 35x2 mice were deeper than lesions in 50x2 mice.<br /> 4. The conclusions on HC regeneration needs a deeper scrutiny and the conclusion on its dose-dependency is not supported by the data.

      4.1. The animals used for the experiments are too young to sustain claims on adult HC regeneration. DT was administered in "4-6 weeks old" animals. In rats and mice, many HC are generated at the early postnatal days and they mature over the first month. At 4 weeks after birth (postnatal day 28), the number of immature HCs in the rat utricle is small but significantly higher than at day 60 (PMID: 38895157). Therefore, 4-week-old animals may contain a higher reserve of immature cells to show up as "new HC" after damage than 6-week or 8-week-old animals. One possible origin of the differences between 2-week and 6-month DT animals would be that the 6-month group included more animals treated at 4 weeks while the 2-week group included more animals treated at 6 weeks.

      4.2. The conclusions on regeneration are based on percentages of HC densities. In the first 2-week experiment the area of the epithelium is assessed, but areas are not taken into consideration when comparing HC densities at 2 weeks and 6 months after DT. Is it possible that the increase in HC density is caused by epithelial shrinkage rather than by emergence of new HC?

      4.3. The spontaneous HC regeneration is stated to be "dose-dependent", meaning that more extensive lesions caused more vigorous regeneration. However, this is only an apparent effect caused by the use of percent data. Thus, the increase in HC counts in the utricle is said to represent a 52% after 25X2 and 118% after 50X2. However, if you look at the numbers instead of percentages, the mean number of HCs is 130 vs 86 (an increase of 44) after 25X2 and 78 vs 36 (an increase of 42) after 50X2. So, the cell counts indicate tat a similar number of "new" HCs appear after either dose. 5. The use of antibodies and the exact methodology for HC counts is unclear and perhaps defective.

      5.1. The immunohistochemical protocol did not include a specific marker for HCI, so HCI were defined as MYO7A+/Sox2- cells, HCII were MYO7A+/Sox2+ cells and supporting cells were MYO7A-/ Sox2+cells. The use of additional markers for the HCI (Spp1) or the calyx (Caspr1, tenascin-C) would have provided a more robust dataset. Also, striola/central versus peripheral regions were simply defined by approximate anatomical comparison, when positive markers of the central region are available (oncomodulin, calretinin+ calyces).

      5.2. The primary and secondary antibodies listed do not match. Two Myosin7a antibodies were used (mouse monoclonal from DSHB and rabbit polyclonal from Proteus) and a goat anti-Sox2. However, the secondaries listed are one anti-goat and two anti-rabbits. No anti-mouse is listed.

      5.3. In the figures, the reader is not informed whether the data are from the mouse anti-MYO7A or the rabbit anti-MYO7A, or whether the figure includes mixed data from both antibodies. This is highly relevant because MYO7A was used as the only positive marker for HCI, MYO7A expression may be reduced in stressed HCs (PMID: 37195449), and the two anti-MYO7A antibodies have different affinity for the target. Thus, if the 2-week samples were labelled with the mouse anti-MYO7A and the 6-month samples were labelled with the rabbit antibody, added to the possibility of reduced MYO7A expression at 2 weeks, then the apparent regeneration may be simply apparent, not real regeneration.

      5.4. The images were similarly obtained with the 63X objective in both the utricle and the crista. Why two different measures (per 10,000 square micrometres in utricle and per 2500 in crista) were computed if the original area used for counts was the same? The counts are said to be derived from these 63X square images or from merged images spanning the whole utricle. However, the results section does not include the information on the particular kind of image used for any of the counts, and all are presented similarly. The method used to obtain each count should be indicated and valid comparisons should only include counts obtained with the same method. 6. The presentation of the results and its interpretation is biased. Unbiased interpretation of the results do not support conclusions such as "we found that utricle function is largely preserved until hair cell loss exceeded 90%".

      6.1. "...a trend of increase....1.2+/-0.4 to 2.7+/-0.6...". These are similar very low numbers, close to zero, not a trend of increase.

      6.2. The reader is informed that VsEP "is particularly dependent...striolar type I hair cells". However, the next sentence stresses that measures "remained unchanged at low dose (15 ng/g), with 54% HC survival in striola" when the percentage survival of HCI was 62.7 %. The 54% survival was for total HCs.

      6.3. Lack of statistical significance is interpreted as lack of significant biological effect, when this may simply result from lack of power of the experimental design. For instance, it is concluded that the 15 ng/g dose has no effect on VsEP amplitudes, because control and DT animals did not sow statistically significant differences in this parameter. However, the comparison was made using only 4 control animals, with one of them showing a value much lower than the other 3. Also, 7 of the 8 DT animals had amplitude values below these 3 control values, and the mean value in the DT group was about 30-40% lower than the control mean. Clearly, larger groups were necessary to conclude that the 15 DT dose had no effect. Or, as suggested above, use individual animal-based comparisons to compare HC loss to loss of function. Lack of statistical significance in experiments with an insufficient number of controls can't be used to conclude that responses "were intact".

      6.4. "At 25 ng/g x2.....Notably, only 3 out of 13 exhibited elevated VsEP thresholds at this dose". However, looking at Fig 5C it seems more accurate to say that 8 out of 13 exhibited elevated thresholds. "At the highest dose (35 ng/g x2), 53.8% (7 out of 13) of the animals showed elevated VsEP thresholds", but in fact all 13 DT animals showed thresholds above the mean threshold value in the control group. 7. A total of 198 vestibular afferents were measured in 5 DT mice and 195 afferents in 4 control mice. An explanation is lacking about the representativeness of these populations, whether they represent a biased or unbiased representation of the total population of afferents. 8. Information of vehicle and volume of injection of DT is lacking. 9. Vestibular organs were "harvested". How? In PBS, fixative?<br /> 10. Why was the anterior crista used for HC counts? The VOR test used examines the reflexes generated in the lateral crista, and the lateral crista is easier to image. 11. There are several reference errors, including formal errors (duplicate o missing references) and content errors (references that do not include the information that you would expect from the text where they are cited).

      Referees cross-commenting

      While Referee #1 states that the experiments were carefully executed, in my opinion there are many details of the experimental design and execution that need to be better explained before this statement can be made.

      Significance

      The question addressed is of great interest for several reasons. To explain one, the degree of redundancy in the system greatly influences the possibilities of significant functional recovery that can be achieved by therapeutic interventions aimed at triggering HC regeneration after HC loss from any cause. The DT/transgenic mouse model is certainly an interesting model to address the question.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Wang et al. presents a detailed analysis of dose‑dependent vestibular hair cell damage induced by diphtheria toxin (DT) in Pou4f3‑DTR knock‑in mice. The authors examine type I and type II hair cell survival, vestibular functional outcomes, single‑unit recordings from vestibular ganglion neurons, and dose‑dependent regenerative responses. Two mouse strain backgrounds were compared, revealing similar vestibular phenotypes but markedly different survival rates. The authors conclude that ancient vestibular functions are redundant with respect to surviving hair cells across vertebrate systems.

      The experiments are carefully executed, and the data are consistent with their previous work using the same model. I recommend publication after the authors address the following minor points:

      1. Synaptic Damage Not Addressed<br /> No data are presented regarding synaptic integrity, despite the well‑established vulnerability of hair‑cell synapses across ototoxic and genetic models. Because single‑unit recordings cannot resolve synaptic morphology, additional discussion is needed-beyond the brief mention on Page 16, line 4-regarding potential synaptic loss, its expected relationship to hair‑cell degeneration, and how it might influence the interpretation of afferent responses.
      2. Higher DT Dose (50 ng/g ×2) Producing Less Damage<br /> In several datasets, the highest DT dose appears to induce less damage than the 35 ng/g ×2 dose. The authors should comment on possible explanations, such as DT solubility limits, receptor saturation, nonlinear pharmacodynamics, or strain‑specific physiological responses.
      3. Clarification of Redundancy Concept (Page 13, lines 13-15)<br /> The manuscript states that the increase in DT‑induced unresponsive afferents supports the redundancy concept. The logic behind this connection is not fully explained. Please elaborate on how the presence of unresponsive afferents aligns with or strengthens the argument for functional redundancy in vestibular systems.
      4. Therapeutic Potential of Reactivating Silent/Reserve Hair Cells<br /> The idea of reactivating silent or reserve hair‑cell populations is intriguing but underdeveloped. Expanding this section-perhaps by discussing potential molecular pathways, precedents in other sensory systems, or feasibility in mammalian vestibular organs-would strengthen the translational relevance of the work.
      5. Different DT Doses Used Between Strains (e.g., Fig. 2E-G)<br /> Although the two strains are described as having similar vestibular phenotypes, some figures use 25 ng/g ×2 for one strain and 50 ng/g ×2 for the other. Please clarify the rationale for using different doses-whether due to survival differences, pilot data, or strain‑specific sensitivity.
      6. Typographical Error (Page 8, line 8)<br /> A closing parenthesis appears to be missing.
      7. Define IDPN at First Mention<br /> Please spell out IDPN (β‑iodopropionitrile) at its first appearance in the text.

      Significance

      The manuscript by Wang et al. presents a detailed analysis of dose‑dependent vestibular hair cell damage induced by diphtheria toxin (DT) in Pou4f3‑DTR knock‑in mice. The authors examine type I and type II hair cell survival, vestibular functional outcomes, single‑unit recordings from vestibular ganglion neurons, and dose‑dependent regenerative responses. Two mouse strain backgrounds were compared, revealing similar vestibular phenotypes but markedly different survival rates. The authors conclude that ancient vestibular functions are redundant with respect to surviving hair cells across vertebrate systems.

      The experiments are carefully executed, and the data are consistent with their previous work using the same model. I recommend publication after the authors address the suggested minor points.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01578R

      Corresponding author(s): Sabine Costagliola

      1. General Statements

      We are pleased to submit the revised version of our manuscript entitled “____Foxe1 deficiency impairs thyroid fate while supporting a lung differentiation program____” (Review Commons Refereed Preprint #RC-2022-01578R).

      We are grateful for the careful and constructive evaluation provided by the reviewers. Their insightful comments have significantly strengthened the manuscript, both conceptually and experimentally.

      We sincerely apologize for the delay in submitting this revision. Addressing the reviewers’ comments required additional experimental work, and during this period, the postdoctoral researcher who initiated and led the project completed her training and left the laboratory, requiring a reorganization of responsibilities within the team to ensure rigorous completion of the requested studies. We appreciate your patience and believe that the manuscript has been considerably strengthened as a result.

      Collectively, these modifications move the manuscript beyond a descriptive study and provide new mechanistic insight into the role of Foxe1 in thyroid specification, late chromatin regulation of Pax8 expression, and the permissive state originated in the Foxe1 absence leading to Nkx2.1 differentiation into lung.

      In addition, we would like to inform you that the author order has been modified in this revised version to accurately reflect contributions made during the revision process. As a result, Mírian Romitti has been moved to co–last author. All authors have reviewed and approved this change as well as the final version of the manuscript.

      We are excited to resubmit this substantially improved version and believe it now provides a clearer and more mechanistically grounded contribution to the field.

      2. Point-by-point description of the revisions

      This section is mandatory. Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript.

      • *

      We would like to thank all reviewers for their constructive comments and valuable suggestions, which have helped us improve the quality and clarity of our manuscript. Below, we provide a point-by-point response to all comments. The corresponding revisions have been incorporated into the transferred manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      • *

      Summary

      The authors investigate the effect of Foxe1KO primarily on thyroid differentiation of mouse ES cells following a previously established protocol based on sequential endoderm induction, Nkx2-1/Pax8 overexpression and stimulation of the TSHR/cyclicAMP pathway. Silencing of Foxe1 expression significantly suppresses the generation of functional thyroid follicles. By single cell profiling a great number of Foxe1 targeted genes are identified, some confirmed from previous studies and some are new candidates. Embryonic bodies lacking Foxe1 instead accumulate various lung lineage cells characterized by known cell type markers, which appear to organize in lung tissue-like structures. Based on these findings, it is suggested that Foxe1 might be involved in endoderm cell fate decisions.

      • *

      Major comments

      The title and abstract hold promise that Foxe1 is also a regulator of lung development, and that Foxe1 transcriptional activity might be decisive for thyroid versus lung fate decisions. However, there are no experimental support suggesting that one and the same ES cells at a certain critical time point may switch fate from thyroid to lung (or vice versa). Since lung markers are induced in Nkx2-1/Pax8/cAMP+ ESC it is likely that "control" organoids with maintained Foxe1 expression already contain lung lineage cells, which might expand simply by clonal selection as the thyroid lineage is suppressed by subsequent Foxe1 deletion. Although authors discuss some in this direction, it is not obvious to readers without very careful reading that this possibility and explanation is feasible and should be considered and problematized.

      We thank the reviewer for this important and thoughtful comment. We agree that our data do not demonstrate a direct fate switch of individual ES cells from thyroid to lung identity at a defined developmental time point. We have revised the title, abstract, and discussion to clarify that our findings support a model of lineage stabilization and transcriptional competition rather than active binary fate conversion.

      Our chromatin accessibility data argue against induction of a de novo lung program upon Foxe1 loss. In Foxe1 KO cells, we observe:

      • A marked reduction in chromatin accessibility at the Pax8 locus (see Figure 6B)
      • No significant gain in accessibility at canonical lung program loci (see Figure 6F) Thus, lung gene activation does not require establishment of new accessible chromatin regions. Instead, lung-associated loci appear to be in a permissive chromatin configuration in Nkx2-1+ foregut progenitors.

      Importantly, quantitative lineage analysis further supports destabilization of thyroid commitment rather than emergence of a new lineage. In wild-type organoids, approximately 80% of Nkx2-1+ cells co-express Pax8, indicating strong thyroid commitment. In contrast, in Foxe1 KO organoids, only ~20% of Nkx2-1+ cells retain Pax8 expression (see data below). This substantial reduction in Nkx2-1⁺/Pax8⁺ double-positive cells indicates collapse of thyroid lineage reinforcement, leaving a larger fraction of Nkx2-1-positive cells transcriptionally permissive and capable of engaging alternative Nkx2-1-dependent programs such as lung.

      Mechanistically, our data support the following model:

      1. Early during differentiation, Pax8 induces Foxe1 expression.
      2. Foxe1 subsequently becomes required to sustain chromatin accessibility at the Pax8 locus (supported by Figure 6B and predicted biding site, Foxe1 motif, Table S1).
      3. In Foxe1 KO cells, accessibility at the Pax8 locus collapses, reducing Pax8 expression and weakening thyroid super-enhancer activity.
      4. As the thyroid transcriptional network destabilizes, Nkx2-1, still expressed, can cooperate with lung-associated cofactors at already accessible lung loci.
      5. Lung transcription increases without requiring de novo chromatin opening, consistent with redistribution of limiting transcriptional machinery. Supporting a more direct regulatory role, motif analysis revealed a predicted Foxe1 binding site within regulatory regions of the Pax8 locus (Table S1). This is consistent with the possibility that Foxe1 directly binds Pax8-associated enhancers, potentially recruiting chromatin remodelers and/or stabilizing enhancer-promoter interactions required to maintain high Pax8 expression. While functional validation of this binding will require future studies, this observation further supports a model in which Foxe1 actively maintains Pax8 chromatin accessibility rather than indirectly affecting thyroid identity.

      Interestingly, our newly added data (Figure S8A-C) show that complete absence of Pax8 (Pax8KO mESCs) does not result in the same phenotype, displaying a complete absence of thyroid or lung organoids. This finding reinforces the hypothesis that Foxe1 is not regulating Pax8 expression at early stages of thyroid specification.

      Furthermore, our previous single-cell RNA-seq analysis of mouse thyroid organoids (Romitti et al., Frontiers in Endocrinology, 2021) did not reveal substantial lung cell population under wild-type conditions, with only a small Nkx2.1-Krt5 cluster, called non-thyroid epithelial cells being identified. This suggests that high Pax8 levels in the presence of Foxe1 effectively commit most Nkx2.1+ progenitors toward thyroid fate.

      Despite this, we agree that expansion of rare lung-competent cells, even if unlikely, cannot be formally excluded. Definitive resolution of whether a bipotent Nkx2.1+ progenitor with dual thyroid and lung potential exists would require dedicated lineage tracing at single-cell resolution. Such experiments would be necessary to distinguish between fate conversion and expansion of lineage-competent progenitors and lie beyond the scope of the current study.

      Ultimately, we have extensively revised the manuscript to clarify these points and to avoid implying direct fate switching. Our data instead support a model in which Foxe1 stabilizes thyroid commitment by maintaining Pax8 enhancer accessibility, thereby functionally restricting Nkx2.1 from engaging alternative foregut programs.

      All the above-mentioned information and discussion have been incorporated to the new version of the manuscript

      Observations that Foxe1KO did not at all influence gene expression in expanding lung-like cells are consistent with the idea that lung and thyroid specification in the model are independent phenomena, and argue against the existence of a common bipotent progenitor. If authors disagree, this issue and question should be more thoroughly discussed and argued for with more supporting experimental data than found in the current manuscript version

      We thank the reviewer for this important comment. As stated above, we agree that our current data do not formally demonstrate the existence of a common bipotent progenitor, and we have revised the manuscript to avoid overinterpretation in this regard.

      Regarding lung genes expression, we observe significant differences between WT and Foxe1 KO organoids at day 22, as assessed by qPCR (see Figure S3). In addition, single-cell RNA sequencing reveals the presence of distinct lung cell populations in the Foxe1 KO condition, characterized by high expression of specific lung lineage markers (see Figure 4). Importantly, these lung populations were not detected in our previous single-cell RNA-seq analysis of WT thyroid organoids (Romitti et al., Frontiers in Endocrinology, 2021), except for a small population of Nkx2-1+Krt5+ cluster, indicating that their emergence is specifically associated with Foxe1 loss.

      Despite the appearance of these lung-like cell types in Foxe1 KO organoids, ATAC-seq analysis does not reveal increased chromatin accessibility at canonical lung regulatory loci compared to WT (see Figure 6). This suggests that Foxe1 does not act as a direct negative regulator of the lung program. Rather, our data support a model in which Foxe1 primarily maintains thyroid lineage stability by sustaining chromatin accessibility at the Pax8 locus. In its absence, Pax8 expression is reduced, thyroid enhancer activity collapses, and thyroid differentiation is compromised.

      Consequently, Nkx2-1+ cells remain in a transcriptionally permissive state in which lung-associated loci, already epigenetically accessible in foregut-derived progenitors, can be engaged. Thus, lung differentiation appears to arise not through active induction by Foxe1 loss, but through destabilization of the thyroid program, allowing Nkx2-1 to cooperate with alternative cofactors within an already permissive chromatin landscape.

      To prevent misunderstanding, we have modified the title and substantially clarified the results and discussion sections to better reflect this model and to avoid implying direct lineage instruction or proven bipotency.

      Minor comments

      What is the fraction of. Nkx2-1+ cells that organize into follicles vs lung structures? Based on provided overview images (e.g. Figs. S1, S4) the general impression is that most cells do not form 3D-structures (i.e. do not differentiate). Please explain this and provide information in paper.

      We thank the reviewer for this helpful comment and for the opportunity to clarify this point.

      First, the images shown in Figs. S1B–C correspond to day 7 and Fig. S4E to day 10 of the differentiation protocol. As indicated in the figure legends, these represent early stages of the culture during which cells are still a pool of progenitor-like cells. At these time points, organized 3D thyroid follicles or lung-like epithelial structures are not yet formed. We have revised the figure legends to ensure this is clearly stated and to avoid the impression that full differentiation has already occurred at these stages.

      Regarding the fraction of Nkx2-1⁺ cells that organize into follicles vs. lung structures at later stages, we acknowledge that we are not able to provide an exact quantitative proportion. Due to the 3D nature of the culture system and the size heterogeneity of the structures, precise counting of all Nkx2-1⁺ cells within organoids are technically challenging. However, based on representative images (e.g., Fig. 1C) and repeated observations across independent experiments, a subset of Nkx2-1⁺ cells clearly organize into epithelial 3D structures, while others remain unorganized or in less structured aggregates.

      In the Foxe1 KO condition, the larger size and morphology of the epithelial structures suggest that a substantial proportion of Nkx2-1⁺ cells contribute to lung-like structures. Morphologically, these structures are typically larger (approximately 70–600 µm) compared to thyroid follicles (approximately 30–50 µm), supporting the impression that lung-like structures represent a significant fraction of organized epithelia in the KO condition.

      Importantly, our single-cell RNA-seq data provide additional support for epithelial organization within defined clusters. The Nkx2-1/lung clusters express high levels of epithelial markers such as Epcam and Cdh1 (E-cadherin), consistent with structured epithelial identity. In contrast, only the Thyroid 1 cluster expresses these epithelial markers robustly, whereas the Thyroid 2 and Nkx2-1⁺/Pax8⁻ clusters show low or absent expression, suggesting that not all Nkx2-1⁺ cells acquire a fully organized epithelial state.

      Fig. 1C: Supposed follicles are not shown in this graph.

      We thank the reviewer for pointing this out. We agree that, due to the low magnification, individual follicular structures are not clearly discernible in Fig. 1C. The purpose of these images was not to illustrate fully formed thyroid follicles, but rather to highlight the relative proportion of Nkx2-1⁺/Pax8⁺ double-positive cells in control versus Foxe1 KO conditions.

      To avoid confusion, we have revised the figure legend and the text and replaced the term “thyroid follicles” with “thyrocytes,” which more accurately reflects what is shown at this magnification. We believe this clarification better aligns the description with the intent of the figure.

      Why does not thyroglobulin accumulate in lumen (which if present would be a good means for quantification by counting follicles)?

      We thank the reviewer for this valuable suggestion and agree that luminal thyroglobulin (Tg) accumulation would, in principle, represent an informative readout for follicle quantification.

      However, our organoids display a fetal-like developmental state and exhibit heterogeneity in maturation and functional competence (as expected in vivo at early development). As we have previously demonstrated (Carvalho et al., Advanced Healthcare Materials, 2023), even in highly mature thyroid organoid systems, not all morphologically defined follicles are functionally active. Thus, the absence or variability of luminal Tg or iodinated Tg (Tg-I) accumulation does not necessarily indicate absence of follicle formation at this developmental stage. In other words, Tg accumulation is not a fully reliable surrogate marker for follicle presence in this context. Here we included an example of Tg staining in mouse thyroid organoids, where we can observe some regions with Tg accumulated in the lumen, while most of the cells also show (or exclusively) cytoplasmatic staining. This image further confirms the variability in Tg accumulation among derived organoids.

      To more accurately identify follicular structures, we relied on epithelial polarity and architectural markers. Specifically, we used E-cadherin and ZO-1 staining in combination with Pax8 to define organized epithelial thyroid structures. In addition, we employed an iodinated-thyroglobulin antibody (mouse anti–Tg-I, gift from C. Ris-Stalpers) and improved the quality of the Tg-I staining in Fig. 1E. This was further complemented by the Tg-EGFP reporter signal to better visualize thyroid follicular organization.

      Nevertheless, due to the intrinsic 3D nature of the culture system and structural heterogeneity of the organoids, precise quantitative assessment remains technically challenging.


      Indeed, follicles should be quantified to estimate induction success. Please also explain rounded structures in Foxe1KO image (are they distal lung buds?). Or are Control and Foxe1KO images confused in this panel?!?

      We thank the reviewer for this important comment and for raising the need for quantitative assessment.

      To estimate induction efficiency and directly compare control and Foxe1 KO conditions, we quantified Nkx2-1⁺ and Nkx2-1⁺/Pax8⁺ populations by flow cytometry (Fig. S6A-B), using the Nkx2-1_mKO2 reporter in combination with Pax8 antibody staining. We observed a marked reduction in the total number of Nkx2-1⁺ cells in Foxe1 KO organoids compared to controls, beginning at day 11 and becoming progressively more pronounced over time. By day 21, approximately 40-50% of cells in the control condition are Nkx2-1⁺, whereas only ~10-15% are Nkx2-1⁺ in the Foxe1 KO.

      Importantly, co-staining with Pax8 further revealed that in control organoids, the majority of Nkx2-1⁺ cells are also Pax8⁺ (41.9% of total cells), consistent with efficient thyroid commitment. In contrast, in Foxe1 KO organoids, only 3.1% of total cells are double positive, indicating a profound reduction in thyroid lineage. These quantitative data provide a robust measure of induction success and lineage specification efficiency.

      Regarding the rounded structures shown in Fig. 1D in the Foxe1 KO condition, the images are correctly assigned and not confused. These rounded epithelial structures represent the few thyroid follicles that form in the absence of Foxe1, as confirmed by Pax8 and Tg positivity. Although markedly reduced in frequency, follicle formation is not completely abolished in the KO condition. However, as highlighted in Fig. 1D, these self-organized follicles are not functionally mature, as evidenced by the absence of Nis/Slc5a5 expression. An additional example of a follicle derived in the Foxe1 KO condition is shown in Fig. S5B.

      Fig. 1E: text on Fig. legend is erroneously given under (F), whereas a dedicated and relevant text for (F) is missing.

      We thank the reviewer for this careful observation. The figure legend has been corrected to properly assign the text to panel (E), and a dedicated legend describing panel (F) has now been added. In addition, we have ensured that the corresponding figure panels are appropriately referenced in the main text.

      Fig. 1F. Immunostaining of iodinated thyroglobulin (Tg-I) is very poor. Is it due to a bad antibody (does it work well in in vivo thyroid stainings?) or is organification simply inefficient? Again, poor content of Tg in lumen (as also suggested by Fig. S5A), it is puzzling. Or are in vitro-generated follicles leaky (i.e. do not behave as natural thyroid follicles)?

      We thank the reviewer for this helpful comment. Following this suggestion, we have improved the quality of the iodinated thyroglobulin (Tg-I) immunostaining and included new images at higher quality and different magnifications in Fig. 1E. These revised images more clearly show the accumulation of Tg-I within the luminal compartment, particularly in the WT control condition.

      Regarding the apparent variability in Tg accumulation, we believe this reflects the fetal-like developmental state of the organoids and the heterogeneity in their maturation and functional competence. As discussed above, not all follicles generated in vitro reach the same level of functional maturity, which may influence the degree of Tg accumulation within the lumen.

      Importantly, we do not believe that the in vitro–derived follicles are structurally leaky. First, the luminal localization of iodinated Tg is clearly detectable in Fig. 1E, indicating that Tg can accumulate within the follicular lumen. Second, functional assays presented in Fig. 1F demonstrate robust iodide uptake and organification, supporting the presence of an active thyroid hormone biosynthetic machinery in these organoids.

      Figs. 2A-E: Comments on lung cell markers. A: E-cad is unspecific, Sox9 would better label branching morphogenesis

      We thank the reviewer for this helpful comment. The purpose of the first panel in Fig. 2 (A) was to highlight the presence of Nkx2-1⁺ cells organizing into epithelial structures, as indicated by E-cadherin staining. In this context, E-cadherin was used to visualize epithelial organization rather than to specifically identify lung lineage cells. This also allowed us to emphasize the clear morphological differences between thyroid follicles, which are typically smaller, and the larger epithelial structures observed in the Foxe1 KO condition that are consistent with lung-like structures.

      The presence and identity of specific lung cell populations are further addressed in the subsequent panels of Fig. 2 (B-H) and more comprehensively in the single-cell RNA-seq dataset presented in Fig. 4.

      While we agree that Sox9 staining would provide an additional marker for bud tip progenitors and branching morphogenesis, our single-cell RNA-seq analysis shows Sox9 expression within the Nkx2-1⁺/Epcam⁺/Pax8⁻/Tg⁻ population in Foxe1 KO organoids (Fig. 4B), supporting the presence of this lung progenitor population in our system.

      Finally, it is important to note that our culture system (media) is not designed to promote lung development in vitro, which probably impairs the proper physiological lung tissue formation and differentiation progress observed in optimal systems and in vivo. In addition, we believe that we have fetal-like lung organoids in vitro, as comparison to scRNAseq of E17.5 suggests. These aspects were also discussed in the new version of the manuscript.

      C: co-staining for E-cad would help differentiate cell types. D: Goblet cells seem Nkx2-1 negative, please explain.

      We thank the reviewer for these helpful comments.

      Regarding the suggestion to include E-cadherin co-staining to better distinguish cell types, we agree that this would provide additional spatial information. However, due to technical limitations related to the species of the primary antibodies used for several lung lineage markers, we were unable to include E-cadherin co-staining in many of the panels. To address epithelial identity at the transcriptomic level, in our single-cell RNA-seq analysis we specifically filtered for Nkx2-1⁺ cells that were also Epcam⁺, thereby focusing the analysis on epithelial populations present in the organoids (Fig. 4A). Consistent with this approach, the lung-related clusters identified in the dataset (Fig. 4B) show clear expression of epithelial markers, including Epcam and Cdh1 (E-cadherin) (Fig. 3E), supporting their epithelial nature.

      Regarding the observation that goblet cells appear Nkx2-1 negative, we note that the Muc5ac staining shown in Fig. 2D primarily reflects secreted mucin that accumulates within the lumen of the lung-like epithelial structures rather than intracellular staining confined to individual goblet cells. As a result, the signal is predominantly detected in the luminal space, which may give the impression that it is not associated with Nkx2.1-expressing cells. To clarify this point, we provide images highlighting Muc5ac accumulation within epithelial structures that express Nkx2.1 (Fig. 2D) and Sox2 (Fig. 2F). In addition, Fig. S5C shows a large Nkx2-1_mKO2⁺/Sox2⁺ epithelial structure with clear Muc5ac accumulation in the lumen, supporting the presence of goblet-like secretory activity within these Nkx2.1–derived lung structures.

      E: Diffuse pattern. Are assumed club cells really Nkx2-1 pos? CC10 immunostaining might help.

      • *

      We thank the reviewer for this helpful comment. The diffuse pattern observed in Fig. 2E is largely due to the 3D reconstruction of the image, which can reduce the apparent sharpness of individual cellular boundaries. Nevertheless, the image indicates that Scgb3a2+ cells are located within epithelial structures containing Nkx2.1–expressing cells.

      Following the reviewer’s suggestion, we have now included additional immunostaining for Cc10/Scgb1a1 in the revised manuscript (Fig. 2G), which further supports the presence of club-like cells in the organoids. Although we were unable to show direct co-staining with Nkx2-1, our single-cell RNA-seq analysis confirms that all Scgb3a2⁺ and Scgb1a1/Cc10⁺ cells identified in the organoids belong to a Nkx2-1⁺/Epcam⁺ epithelial population (Fig. 4A–B and Fig. S7A). This is further illustrated in the corresponding UMAP plots shown below.

      Together, these data support the interpretation that the Scgb3a2⁺ and Cc10⁺ cells detected in the organoids correspond to Nkx2.1-derived epithelial club-like cells.

      F: I doubt that SEM is conclusive for identification of specific (lung) cell types unless tissue architecture (e.g. proximal-distal positions) is considered for comparison to the natural branching process of the developing lung.

      We agree with the reviewer that SEM alone is not sufficient for the definitive identification of specific lung cell types. In this study, SEM was used to visualize ultrastructural features and morphological characteristics suggestive of differentiated epithelial cell types, based on comparisons with SEM images from human/mouse lungs. Importantly, our organoids do not represent adult lung tissue, but most likely fetal stages of lung development, this is an important aspect since cells might not display full features of adult lungs; e.g. ciliated cells show rather short cilia, compatible with early development. Similar aspect is observed with alveolar structures, that are most likely developing-alveolar sacs. This important aspect of developmental stage is now described in the figure legend (Fig. 2H).

      To improve the clarity of our SEM images, we modified the figure and replaced images that had not very clear features by new ones. We included a new image showing mucus accumulation in the luminal compartment, a larger view of developing-alveolar sacs and alveolar cells, with a zoomed image of AT2 cell. In addition, epithelium containing secretory cells and mucus blobs was included.

      Importantly, cell identity in our study was not inferred from SEM alone. We used several complementary approaches, including immunostaining, qPCR analysis, and single-cell RNA sequencing, to support the identification of the different lung epithelial populations present in the organoids.

      Nevertheless, we have decided to retain instead improving the SEM images in Fig. 2H, as they provide valuable ultrastructural characterization of the organoids and illustrate morphological features consistent with differentiated lung epithelial cells.

      Line 161: Is it really "spontaneous" generation? Please rephrase.

      We thank the reviewer for this suggestion. The term “spontaneous” has been replaced with “unexpected” to more accurately describe the generation of these structures.

      Fig. S3A. According to Major Comment above, please explain in more detail why and how lung marker expression is evident in induced "Controls" (i.e. organoids without Foxe1KO). Is it due to parallel/independent lung and thyroid differentiation? Is phenotype of rather Foxa1KO a matter of clonal selection?

      Back to our previous response, the low lung marker expression observed in control organoids likely reflects the presence of Nkx2-1⁺ foregut progenitors that remain transcriptionally permissive to alternative Nkx2.1–dependent programs. In wild-type conditions, the majority of Nkx2.1⁺ cells co-express Pax8 (~80%), indicating robust thyroid commitment, still with around 20% of the cells not committing to thyroid, what could explain an “inefficient” parallel lung differentiation in presence of Foxe1. In contrast, in Foxe1 KO organoids this proportion drops to ~20%, reflecting destabilization of the thyroid transcriptional network rather than induction of a new lineage. Consistent with this, chromatin accessibility analyses show reduced accessibility at the Pax8 locus in Foxe1 KO cells without significant gain at canonical lung loci. Together, this process could allow the expansion of the non-thyroid committed progenitors and acquisition of lung cell fate due to the permissive state of the chromatin. While expansion of rare lung-competent progenitors cannot be formally excluded, distinguishing between lineage plasticity and clonal expansion would require dedicated lineage-tracing experiments beyond the scope of this study.

      Figs. S3B-M. Scanning electron micrographs. Are these from one single (lung-like) structure imaged at different angles and magnitude or selected from multiple/different structures? If the latter, there a bias of selection that raises concern about cell identity. See similar SEM comment above.

      We thank the reviewer for this important point. The SEM images in the old Figures S3B–M did represent distinct lung-like structures rather than multiple angles of a single organoid, as we could not obtain representative images of all cell types from the same structure. However, the SEM data presented in Figure 2 already sufficiently highlight the distinct cell types and structures. To avoid redundancy, we have therefore removed panels S3B-M in the revised version of the figure.

      Line 181: Text states that cells additionally were visualized by microscopy, but this is not shown in Fig. 4.

      We thank the reviewer for pointing this out. The sentence has been revised to clarify that the reporter fluorescence can be used to track differentiation by microscopy, while the efficiency of Nkx2.1⁺ cell generation is quantified by flow cytometry, as shown in Figure S4D–E rather than Figure 4. The updated sentence reads:

      “The reporter fluorescence allowed tracking the Nkx2-1+ cells appearance by microscopy and quantification of the differentiation efficiency by flow cytometry (Figure S4D-E).”

      • *

      Fig. 4. Data based/biased on computationally Pax8-negative selected Foxe1KO cells. Are Pax8 negative cells present in "Control" (Foxe1+) organoids and a potential source of enrichment independent of the thyroid lineage?

      We thank the reviewer for raising this important point, which prompted us to further examine the Nkx2.1⁺/Pax8⁻ cell populations in both control and Foxe1 KO samples. Flow cytometry analysis (shown below) indicates that the proportion of Pax8+ and Pax8- cells among mKO2⁺ (Nkx2-1⁺) cells was comparable between control and Foxe1 KO organoids at day 9, two days after completion of doxycycline induction. This suggests that both thyroid and lung lineages were initially induced at similar levels in the two cell lines.

      This trend persists until day 12, when a clearer divergence between thyroid and lung fates begins to emerge in control versus Foxe1 KO organoids. Overall, these results indicate that Foxe1 expression reinforces thyroid lineage specification, whereas Foxe1 knockout results in an expansion of Nkx2.1+/Pax8- cells. Importantly, the PCA analysis of ATAC-seq data presented in Fig. 5G supports this conclusion.

      The paper by Fagman et al. (Am J. Pathol, 2004), which shows aberrant/ectopic thyroid differentiation in airway respiratory epithelium in ShhKO mouse embryos, may by cited and discussed with reference to the possible existence of bipotent lung/thyroid progenitors/stem-like cells in vivo.

      • *

      We thank the reviewer for this valuable suggestion and apologize for not citing this highly relevant study in the previous version of the manuscript. We have now incorporated a discussion of this work in the final paragraph of the revised manuscript.

      Added text in the manuscript: "In conclusion, the present work advances our understanding of the critical role of Foxe1 in initiating and sustaining proper thyroid tissue formation and function, while also highlighting novel molecular players for future investigation in thyroid biology. Beyond the thyroid, our findings underscore the intricate relationships among endodermal lineages during differentiation, particularly between thyroid and lung. Supporting this concept, in vivo studies by Fagman and collaborators (2004) showed that loss of Shh signaling during early organogenesis leads to thyroid dysgenesis and the appearance of aberrant thyrocytes expressing Nkx2-1, Foxe1, and Tg in the presumptive trachea, emphasizing the need to repress inappropriate thyroid programs in non-thyroid anterior foregut endoderm (Fagman et al., 2004). Building on this, it is intriguing to speculate that transient thyroid/lung bipotent progenitors may exist in vivo, analogous to the transient bipotent progenitors described during liver and pancreas development (Deutsch et al., 2001; Xu et al., 2011). Future studies using lineage tracing approaches could directly test the existence and fate of such progenitors, providing a deeper understanding of early endodermal plasticity and the mechanisms that safeguard lineage fidelity."

      Reviewer #1 (Significance (Required)):

      • *

      The results are indeed of great value mainly for developmental biologist interested in regenerative medicine and specifically concerning in vitro systems for lung and thyroid differentiation. The provided single cell data sets of thyroid progenitors undergoing differentiation and the impact of Foxe1KO are a major achievement and resource.

      • *

      This reviewer´s expertise is mainly in vivo thyroid development.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • *

      Summary: This study by Fonseca et al investigated how the specification of mouse ESCs towards thyroid lineage was regulated by the presence or absence of Foxe1, a thyroid specific transcriptional factor. Compromised thyroid induction was observed when Foxe1 was knocked out. Interestingly, the author found increased induction of lung cells in the absence of Foxe1, suggesting its role in regulating the balance of thyroid-versus-lung specification. While interesting, the main issue with this study is the lack of quantitative analysis of cellular specification, and the lack of comprehensiveness regarding the markers used to characterize each cell lineage, especially for the lung lineages.

      • *

      Major points:

      • *

      For analyzing the outcome of lineage specification in the comparison of with and without Dox or in the comparison of control versus Foxe1 KO, the only quantitative readout is qPCR. The author should perform additional characterization using flow cytometry for NKX2.1, Pax8, Tg, Tg-I, Ecad, and ZO-1 to reveal more clear mechanism: reduced number/percentage of cellular specification into thyroid lineage, or immature phenotypes in specified thyroid cells.

      • *

      We thank the reviewer for raising this important point. We agree that incorporating quantitative analyses is essential to confirm the phenotype driven by the loss of Foxe1 expression. To address this comment, we have added additional flow cytometry analyses at different time points throughout the culture in the revised manuscript (Fig. S6A–B). Specifically, we now include quantification of Nkx2.1/mKO2⁺ cells and Tg/GFP reporter⁺ cells in both control and Foxe1KO organoids from day 7 to day 21 of the differentiation protocol.

      These data show that up to day 10 there is no significant difference in the proportion of Nkx2.1⁺ cells generated under the two conditions. However, from day 11 onwards the trajectories diverge clearly: in control organoids, Nkx2.1⁺ cells reach approximately 50% of the population, whereas only 10–15% of cells become Nkx2.1⁺ in the Foxe1KO condition (Fig. 6A and Fig. S6B). These findings are consistent with the reduced proportion of Nkx2.1⁺Pax8⁺ cells observed in Foxe1KO organoids (Fig. 6B and Fig. S6B), confirming the impairment in thyroid cell generation caused by the loss of Foxe1 expression. In addition, although not the most precise measure, we also observed a similar reduction in the proportion of Tg-GFP⁺ cells in the Foxe1KO condition compared with controls (Fig. 6A).

      While these new results provide additional quantitative insight, accurately assessing the maturation state of the generated thyroid cells by flow cytometry remains challenging due to several technical limitations:

      1. Tg quantification: Despite testing several anti-thyroglobulin antibodies for flow cytometry, we were unable to obtain reliable staining. For this reason, we included quantification of the Tg/GFP reporter described above. Despite the clear reduction in Tg+ cells among Foxe1 KO organoids, we previously demonstrated (Romitti and Eski et al., 2021; Fig. 2E) that the GFP reporter captures only a fraction of the Tg⁺ cell population present in the culture, not being the most accurate method for quantification.
      2. Tg-I and ZO-1 quantification: Due to their intraluminal and apical localization within thyroid follicles, quantification of Tg-I is not possible by FC and ZO-1 staining has demonstrated to be technically difficult and did not yield reliable results.
      3. Assessment of immature vs. mature thyrocytes: We believe that the combined datasets presented in Fig. 1 and the scRNA-seq analysis (Fig. 3) provide sufficient evidence to interpret the Foxe1KO phenotype. Together, these results indicate that: (i) Foxe1KO organoids show a reduced efficiency in generating thyrocytes and Nkx2.1+ cells compared with the control line; and (ii) the few thyrocytes that form in the absence of Foxe1 display impaired maturation.

      The authors claimed that in the absence of Foxe1, lung organoid can be observed. Quantitative analysis, such as organoid count or flow cytometry, should be provided to assess this comparing organoid identities in the presence and absence of Foxe1.

      We thank the reviewer for this important comment and we agree that a precise quantification would reinforce our findings on organoid identities. As described above, we performed flow cytometry analyses to track Nkx2.1/Pax8 cell populations over time in both WT and Foxe1KO conditions. In the WT condition, approximately 80% of Nkx2.1⁺ cells are also Pax8⁺, consistent with thyroid lineage specification. In contrast, in the Foxe1KO condition, only ~20% of Nkx2.1⁺ cells co-express Pax8, indicating a strong reduction in thyroid lineage commitment.

      Although this approach does not directly quantify lung organoids, our scRNA-seq data show that the majority of Nkx2.1⁺Pax8⁻ cells in the Foxe1KO condition display an epithelial transcriptional profile, with a substantial proportion exhibiting a lung-like signature.

      Regarding a direct quantification of the proportions of each organoid type, we encountered several technical limitations inherent to organoid systems. In particular, variability between wells and differentiations, combined with the three-dimensional complexity of the cultures, makes reliable counting of distinct organoid identities challenging.

      With respect to flow cytometry-based quantification of lung identity, the diversity of lung epithelial cell types represents an additional challenge. Available markers often label only specific subpopulations and can overlap with thyroid markers. For example, Sox2 labels airway epithelial cells but not alveolar cells, whereas Sox9, which can mark distal lung progenitors, is also highly expressed in thyrocytes. Similarly, assays with secretory cell markers (Scgb3a2, Scgb1a1, and Muc5ac) did not yield reliable staining in our system. Hopx, an alveolar marker, is also detected in the thyroid population. Although thyroid cells can be specifically identified by Pax8 staining, this overlap further complicates the combination of markers required for reliable flow cytometry quantification of lung lineages.

      Taken together, and considering that in our previous work we demonstrated by scRNA-seq that lung differentiation is not clearly observed in the control line, with only a small subset of Nkx2-1+Krt5+ cluster been detected (Romitti and Eski et al., 2021), our quantitative analyses rely primarily on Nkx2.1/Pax8 flow cytometry together with the transcriptional evidence provided by scRNA-seq.

      In Figure 2, the claim of lung cell identities is not well supported. (1) SEM data on alveolar and goblet cells is not conclusive;

      We agree with the reviewer that SEM alone is not sufficient for the definitive identification of specific lung cell types. In this study, SEM was primarily used to visualize ultrastructural features and morphological characteristics suggestive of differentiated epithelial cell types, based on comparisons with previously reported SEM images of human and mouse lung tissue.

      Importantly, our organoids do not represent adult lung tissue but rather likely correspond to early developmental stages of lung formation. This is an important consideration, as cells at these stages may not display all the morphological hallmarks observed in mature lungs. For example, the ciliated cells observed in our organoids present relatively short cilia, which is consistent with early stages of airway epithelial development. Similarly, the structures resembling alveoli are more consistent with developing alveolar sacs rather than fully mature alveoli. This developmental context is now clarified in the figure legend (Fig. 2H).

      To improve the clarity and interpretability of the SEM data, we revised the figure and replaced images in which the features were not sufficiently clear. The updated panel now includes images showing mucus accumulation within the luminal compartment, a broader view of developing alveolar sac–like structures, and a higher-magnification image highlighting cells with morphology consistent with alveolar type II–like cells. In addition, we included images of epithelial regions containing secretory cells and mucus deposits.

      Importantly, cell identity in our study was not inferred from SEM alone. Instead, we used several complementary approaches, including immunostaining, qPCR analyses, and single-cell RNA sequencing, to support the identification of the different lung epithelial populations present in the organoids.

      For these reasons, we chose to retain the SEM data in Fig. 2H while improving the image selection and annotations, as these images provide valuable ultrastructural information and illustrate morphological features consistent with differentiated lung epithelial structures.

      In addition, it’s important to note that our system is not designed (culture media composition) for optimal generation of lung organoids and we believe that despite of the indications of fetal-like lung organoids generated they might not follow the expected physiological path observed in vitro optimal models and in vivo. It could impact the maturity and the proportions of the cells derived. This discussion is also now present in the updated version of the manuscript.

      (2) Alveolar type 1 cells should be characterized by AGER and AQP5 besides HOPX

      We thank the reviewer for this valuable suggestion and agree that additional markers such as AGER and AQP5 would further support the identification of alveolar type I (AT1) cells. Following the reviewer’s recommendation, we performed additional immunostainings using AQP5 and AGER antibodies. However, we were unfortunately unable to obtain reliable staining that would clearly demonstrate AT1 cells in our organoid system.

      Nevertheless, both AQP5 and AGER transcripts are detected in the lung-like populations in our scRNA-seq dataset (Fig. 4 and examples shown below). Interestingly, their expression is not restricted to a single well-defined cluster, which may reflect the transitioning/immature state of the lung-like cells present in the organoids. Additional comparison to in vivo dataset suggests an enrichment in AT1 signature in cluster 0, which contains Foxe1KO-derived cells, however it might not reflect fully maturation of this cell type.

      Taken together, these observations further reinforce that while lung epithelial populations are present, the organoids likely represent an early developmental stage of lung differentiation rather than fully mature lung tissue, and therefore may not yet exhibit the clear marker segregation characteristic of adult alveolar cell types.

      (3) Alveolar types 2 cells should be characterized by NKX2.1 and SFTPC co-staining;

      Dear reviewer, as mentioned in the previous comment, we encountered similar technical difficulties when attempting SFTPC immunostaining, and we were unfortunately unable to obtain reliable staining in our organoid system.

      In contrast to Aqp5 and Ager, Sftpc transcripts were not detected in our scRNA-seq dataset. However, several other markers commonly associated with AT2 cells, such as Napsa, Abca3, and Lpcat1, are expressed in the lung-like populations (examples shown below). In addition, comparative analyses with an in vivo mouse lung dataset indicate transcriptomic similarities between E17.5 AT2 cells in vivo and a subset of cells present in the Foxe1KO organoids (Fig. 4C). This analysis also highlights the possible presence of AT2 precursors, reinforcing the immaturity of the system.

      Taken together, these observations suggest the presence of AT2-like cells at an early developmental stage, rather than fully differentiated or functional AT2 cells. This interpretation is consistent with the overall developmental immaturity of the lung-like structures observed in our organoid system.

      (4) For showing proximal lung identities, it would be helpful if the authors can co-stain more than one lineage, such as basal cell together with goblet cell/ciliated cells to reveal potential pseudostratified epithelium.

      We thank the reviewer for this insightful suggestion. Addressing the spatial organization of proximal lung epithelial cell types within the organoids is indeed an interesting aspect. Based on our observations, multiple epithelial cell types do not appear to consistently coexist within the same organoid structure.

      Our analyses indicate that many organoids co-express basal cell markers (p63 and Krt5) together with Sox2, but not together with Muc5ac, a marker of goblet cells. This observation may suggest that the in vitro system does not fully recapitulate the progressive epithelial maturation and spatial organization seen in vivo, such as the formation of a pseudostratified airway epithelium.

      Ideally, this question would be addressed through three-dimensional immunostaining within individual organoid structures to visualize the spatial arrangement of the different epithelial lineages. However, despite several attempts, we were unable to obtain images that would allow reliable interpretation of such co-localization.

      Regarding ciliated cells, analysis of the scRNA-seq dataset indicates that they represent a relatively rare population in our cultures, which likely further limits the ability to visualize their spatial organization within organoids.

      Minor points:

      • *

      All characterization of in vitro induced thyroid cells should be accompanied by parallel analysis of native thyroid cells (from in vivo mice) that serve as a benchmark for the maturity of the induced cells. Some staining, such as Fig 1F on Tg-I remains quite different from what is reported from in vivo findings.

      We thank the reviewer for this important comment. In our previous work (Antonica et al., Nature, 2012), the characterization of thyroid organoids was extensively performed in direct comparison with native mouse thyroid tissue, and all antibodies used in the study were benchmarked using mouse thyroid as a positive control. Regarding the maturity of the thyroid organoids generated in vitro, we previously demonstrated both in vitro and in vivo thyroid hormone (TH) production, confirming the functional capacity of the derived thyroid cells. Although a certain degree of heterogeneity in maturation is observed within WT thyroid organoids, likely reflecting their fetal-like developmental state, these findings support the presence of functionally mature thyrocytes.

      To further address the reviewer’s concern, we have now included new Tg-I staining images in Fig. 1F, which more clearly illustrate the accumulation of the thyroid hormone precursor within the luminal compartment of follicles derived from WT mESCs.

      In addition, we would like to note that the specificity and suitability of the antibodies used to stain native mouse thyroid cells have been validated in several previous studies, including Dathan et al., Dev Dyn, 2002; Gérard et al., Am J Pathol, 2008; Hartog et al., Endocrinology, 1990.

      The labeling of panel E and F in Figure.1 should be switched.

      We thank the reviewer for bringing this to our attention. The labeling of panels E and F in Fig. 1 has been corrected accordingly in the revised manuscript.

      Reviewer #2 (Significance (Required)):

      • *

      This study provided direct in vitro evidence regarding the critical role of Foxe1 for thyroid lineage induction, and suggested its role in balancing thyroid versus lung fate determination. It is thus important to the field of both thyroid and lung developmental and stem cell biology. However, the significance of this study in hindered by the lack of comprehensiveness in the analysis.

      We thank the reviewer for the positive evaluation of our study and for recognizing its relevance to both thyroid and lung developmental biology. To address the concern regarding the comprehensiveness of the analysis, we have carefully revised the manuscript to improve clarity and to better present and discuss the results of our analyses. We believe that these revisions have strengthened the manuscript and improved the overall quality of the study.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      • *

      Costagliola et al. have demonstrated that Foxe1, a transcription factor, plays a key role in the proper differentiation of Nkx2-1 (+) cells into thyroid follicles. They have also revealed that some Foxe1-null/Nkx2-1 (+) cells differentiate into the lung, including airway and alveolar epithelia, in their ES cell-derived organoid system. Although it has already been appreciated that Foxe1 contributes to the thyroid development in mice and humans, this excellent study has clarified that its absence, as a result, enhances the differentiation of Nkx2-1 (+) cells into the lung. I have no serious criticisms regarding methodology, results, and interpretation of results. I' d like you to elucidate whether similar findings are obtained even from human ES cell lines in the future.

      We would like to express our sincere gratitude to Reviewer #3 for the positive feedback on our work. We fully agree that it will be important to determine whether similar findings can also be observed using human embryonic stem cell (ESC) systems.

      While the mouse model used in this study was first reported in 2012 (Antonica et al., Nature, 2012), our group has more recently developed a corresponding system to generate functional thyroid follicular cells from human pluripotent stem cells (Romitti et al., Nature Communications, 2022). Using this human platform, we are currently investigating the role of FOXE1, as well as other genes associated with congenital hypothyroidism, in human thyroid development. We anticipate that these studies will provide further insight into the mechanisms controlling thyroid lineage specification and will be the focus of future work.

      Minor comment:

        • Fig 3C-E, Fig 6B, D, and F: These figures are so small that the words are almost illegible.*

      We thank the reviewer for bringing this to our attention. The figures have been revised to improve readability, and the font sizes have been increased in Fig. 3C–E and Fig. 6B, D, and F in the updated version of the manuscript.

      Reviewer #3 (Significance (Required)):

      I'm a pathologist who specialize in lung cancer and the stem cells in the distal airway. This paper will probably attract those who are interested in the development of the thyroid or the lung, because the authors have revealed that 1) Foxe1 contributes to the proper thyroid development, and 2) its absence consequently enhances the differentiation of Nkx2-1 (+) cells into the lung.

      We thank the reviewer for this thoughtful comment and for highlighting the potential interest of our study for researchers working in thyroid and lung development. We agree that our findings provide new insight into the role of Foxe1 in thyroid lineage specification and suggest that its absence can shift the differentiation potential of Nkx2.1⁺ progenitors toward a lung epithelial fate. We hope that these results will contribute to a better understanding of the mechanisms regulating cell fate decisions within the anterior foregut endoderm and will be of interest to both the thyroid and lung developmental biology communities.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Costagliola et al. have demonstrated that Foxe1, a transcription factor, plays a key role in the proper differentiation of Nkx2-1 (+) cells into thyroid follicles. They have also revealed that some Foxe1-null/Nkx2-1 (+) cells differentiate into the lung, including airway and alveolar epithelia, in their ES cell-derived organoid system. Although it has already been appreciated that Foxe1 contributes to the thyroid development in mice and humans, this excellent study has clarified that its absence, as a result, enhances the differentiation of Nkx2-1 (+) cells into the lung. I have no serious criticisms regarding methodology, results, and interpretation of results. I' d like you to elucidate whether similar findings are obtained even from human ES cell lines in the future.

      Minor comment:

      1. Fig 3C-E, Fig 6B, D, and F: These figures are so small that the words are almost illegible.

      Significance

      I'm a pathologist who specialize in lung cancer and the stem cells in the distal airway. This paper will probably attract those who are interested in the development of the thyroid or the lung, because the authors have revealed that 1) Foxe1 contributes to the proper thyroid development, and 2) its absence consequently enhances the differentiation of Nkx2-1 (+) cells into the lung.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This study by Fonseca et al investigated how the specification of mouse ESCs towards thyroid lineage was regulated by the presence or absence of Foxe1, a thyroid specific transcriptional factor. Compromised thyroid induction was observed when Foxe1 was knocked out. Interestingly, the author found increased induction of lung cells in the absence of Foxe1, suggesting its role in regulating the balance of thyroid-versus-lung specification. While interesting, the main issue with this study is the lack of quantitative analysis of cellular specification, and the lack of comprehensiveness regarding the markers used to characterize each cell lineage, especially for the lung lineages.

      Major points:

      For analyzing the outcome of lineage specification in the comparison of with and without Dox or in the comparison of control versus Foxe1 KO, the only quantitative readout is qPCR. The author should perform additional characterization using flow cytometry for NKX2.1, Pax8, Tg, Tg-I, Ecad, and ZO-1 to reveal more clear mechanism: reduced number/percentage of cellular specification into thyroid lineage, or immature phenotypes in specified thyroid cells.

      The authors claimed that in the absence of Foxe1, lung organoid can be observed. Quantitative analysis, such as organoid count or flow cytometry, should be provided to assess this comparing organoid identities in the presence and absence of Foxe1.

      In Figure 2, the claim of lung cell identities is not well supported. (1) SEM data on alveolar and goblet cells is not conclusive; (2) Alveolar type 1 cells should be characterized by AGER and AQP5 besides HOPX; (3) Alveolar types 2 cells should be characterized by NKX2.1 and SFTPC co-staining; (4) For showing proximal lung identities, it would be helpful if the authors can co-stain more than one lineage, such as basal cell together with goblet cell/ciliated cells to reveal potential pseudostratified epithelium.

      Minor points:

      All characterization of in vitro induced thyroid cells should be accompanied by parallel analysis of native thyroid cells (from in vivo mice) that serve as a benchmark for the maturity of the induced cells. Some staining, such as Fig 1F on Tg-I remains quite different from what is reported from in vivo findings.

      The labeling of panel E and F in Figure.1 should be switched.

      Significance

      This study provided direct in vitro evidence regarding the critical role of Foxe1 for thyroid lineage induction, and suggested its role in balancing thyroid versus lung fate determination. It is thus important to the field of both thyroid and lung developmental and stem cell biology. However, the significance of this study in hindered by the lack of comprehensiveness in the analysis.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors investigate the effect of Foxe1KO primarily on thyroid differentiation of mouse ES cells following a previously established protocol based on sequential endoderm induction, Nkx2-1/Pax8 overexpression and stimulation of the TSHR/cyclicAMP pathway. Silencing of Foxe1 expression significantly suppresses the generation of functional thyroid follicles. By single cell profiling a great number of Foxe1 targeted genes are identified, some confirmed from previous studies and some are new candidates. Embryonic bodies lacking Foxe1 instead accumulate various lung lineage cells characterized by known cell type markers, which appear to organize in lung tissue-like structures. Based on these findings, it is suggested that Foxe1 might be involved in endoderm cell fate decisions.

      Major comments

      The title and abstract hold promise that Foxe1 is also a regulator of lung development, and that Foxe1 transcriptional activity might be decisive for thyroid versus lung fate decisions. However, there are no experimental support suggesting that one and the same ES cells at a certain critical time point may switch fate from thyroid to lung (or vice versa). Since lung markers are induced in Nkx2-1/Pax8/cAMP+ ESC it is likely that "control" organoids with maintained Foxe1 expression already contain lung lineage cells, which might expand simply by clonal selection as the thyroid lineage is suppressed by subsequent Foxe1 deletion. Although authors discuss some in this direction, it is not obvious to readers without very careful reading that this possibility and explanation is feasible and should be considered and problematized. Observations that Foxe1KO did not at all influence gene expression in expanding lung-like cells are consistent with the idea that lung and thyroid specification in the model are independent phenomena, and argue against the existence of a common bipotent progenitor. If authors disagree, this issue and question should be more thoroughly discussed and argued for with more supporting experimental data than found in the current manuscript version.

      Minor comments

      What is the fraction of. Nkx2-1+ cells that organize into follicles vs lung structures? Based on provided overview images (e.g. Figs. S1, S4) the general impression is that most cells do not form 3D-structures (i.e. do not differentiate). Please explain this and provide information in paper.

      Fig. 1C: Supposed follicles are not shown in this graph. Why does not thyroglobulin accumulate in lumen (which if present would be a good means for quantification by counting follicles)? Indeed, follicles should be quantified to estimate induction success. Please also explain rounded structures in Foxe1KO image (are they distal lung buds?). Or are Control and Foxe1KO images confused in this panel?!?

      Fig. 1E: text on Fig. legend is erroneously given under (F), whereas a dedicated and relevant text for (F) is missing.

      Fig. 1F. Immunostaining of iodinated thyroglobulin (Tg-I) is very poor. Is it due to a bad antibody (does it work well in in vivo thyroid stainings?) or is organification simply inefficient? Again, poor content of Tg in lumen (as also suggested by Fig. S5A), it is puzzling. Or are in vitro-generated follicles leaky (i.e. do not behave as natural thyroid follicles)?

      Figs. 2A-E: Comments on lung cell markers. A: E-cad is unspecific, Sox9 would better label branching morphogenesis C: co-staining for E-cad would help differentiate cell types. D: Goblet cells seem Nkx2-1 negative, please explain. E: Diffuse pattern. Are assumed club cells really Nkx2-1 pos? CC10 immunostaining might help. F: I doubt that SEM is conclusive for identification of specific (lung) cell types unless tissue architecture (e.g. proximal-distal positions) is considered for comparison to the natural branching process of the developing lung.

      Line 161: Is it really "spontaneous" generation? Please rephrase.

      Fig. S3A. According to Major Comment above, please explain in more detail why and how lung marker expression is evident in induced "Controls" (i.e. organoids without Foxe1KO). Is it due to parallel/independent lung and thyroid differentiation? Is phenotype of Foxa1KO rather a matter of clonal selection?

      Figs. S3B-M. Scanning electron micrographs. Are these from one single (lung-like) structure imaged at different angles and magnitude or selected from multiple/different structures? If the latter, there a bias of selection that raises concern about cell identity. See similar SEM comment above.

      Line 181: Text states that cells additionally were visualized by microscopy, but this is not shown in Fig. 4.

      Fig. 4. Data based/biased on computationally Pax8-negative selected Foxe1KO cells. Are Pax8 negative cells present in "Control" (Foxe1+) organoids and a potential source of enrichment independent of the thyroid lineage?

      The paper by Fagman et al. (Am J. Pathol, 2004), which shows aberrant/ectopic thyroid differentiation in airway respiratory epithelium in ShhKO mouse embryos, may by cited and discussed with reference to the possible existence of bipotent lung/thyroid progenitors/stem-like cells in vivo.

      Significance

      The results are indeed of great value mainly for developmental biologist interested in regenerative medicine and specifically concerning in vitro systems for lung and thyroid differentiation. The provided single cell data sets of thyroid progenitors undergoing differentiation and the impact of Foxe1KO are a major achievement and resource.

      This reviewer´s expertise is mainly in vivo thyroid development.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Ruby Ponnudurai

      Scientific Editor

      Review Commons

      February 16th, 2026

      Dear Dr. Ponnudurai,

      Please see below for a detailed response to reviewers for manuscript #RC-2025-03108: "Short chain fatty acids regulate the chromatin landscape and distinct gene expression changes in human colorectal cancer cells".

      __Authors' Summary: __We thank all the reviewers for the constructive and immensely helpful reviews of our manuscript. We have revised the manuscript addressing the reviewers' comments, which we feel has substantially strengthened our paper. Please see below for our point-by-point responses to the comments, which are all indicated in blue text. All changes in the manuscript are also indicated in blue text.


      Reviewer #1 Evidence, reproducibility and clarity


      In this manuscript, Kabir et al. explore the impact of microbiota-derived short-chain fatty acids (SCFAs) on chromatin structure and gene expression in human cells. They show that SCFAs, particularly butyrate, contribute to specific histone modifications such as butyrylation at H3K27, detectable in human colon tissue. Additional modifications like acetylation, butyrylation, and propionylation at H3K9 and H3K27 respond to SCFA levels and are enriched at active regulatory regions in colorectal cancer cells. Treatment with individual or combined SCFAs mimicking gut conditions alters gene expression patterns, with butyrate playing a dominant regulatory role. Butyrate's effects on gene expression are claimed to be independent of HDAC inhibition and instead rely on the p300/CBP complex through histone butyrylation. These findings underscore SCFAs as crucial modulators of epigenetic regulation in the human colon and highlight butyrate's dominant role in shaping chromatin and gene regulation beyond its known metabolic functions.

      The authors used two human cell lines and an in vivo murine model paired with RNA and ChIP sequencing approaches to identify target genes and chromatin modifications in response to SCFAs.

      While the findings are interesting and could provide important insights into the epigenetic influence of SCFAs in human cells, the study would benefit from additional experiments to strengthen the conclusions. Comments and suggestions are listed below:

      Response: We sincerely thank the reviewer for their thoughtful and constructive comments. In addition, we appreciate the recognition of the potential impact of our findings. We have addressed all comments below.

      1. Figure 1: The H3K27bu expression in human biopsies highlights the clinical significance of the current study. However, the authors need to provide more information on the human colon samples, e.g., how many total patients were analyzed, and what were the age and/or sex. Only the methods mention the use of benign TMA; this should also be clarified in the figure legends. It would also be helpful to show histone butyrylation levels in normal vs. cancer human tissues.

      Response: We completely agree that analysis of additional patient samples is important. In light of this comment, we have expanded our analysis of human colon samples. In the original manuscript, we showed IF images from patient intestinal sections. Patient demographic information (age and sex) is now included in the figure legend. While we analyzed two patients by IF, we realized that images from only one patient are shown. We also felt it was important to add additional rigor to our patient analysis. Therefore, we have incorporated additional patient samples and performed H3K27bu staining using IHC across normal and colon cancer sections obtained from 40 different patients. This is now included as Supplemental Figure 1. In addition, we have included information about age, sex, staging, and grading in Supplemental Figure 1C. Interestingly, we observed that adenocarcinoma patients have significantly decreased levels of H3K27bu compared to normal colon or normal adjacent tissues (Supplemental Figure 1B). We speculate that this may be due to alterations with the microbiota composition and dysbiosis associated with colorectal cancer (PMIDs: 26515465, 25758642, 25699023). Very interestingly, this is in contrast to reports of elevated H3K27ac in colon cancer samples (PMID: 24994966). We are excited to explore this further, and this is something we plan to follow-up on in future studies.

      1. Figure 1: In addition, given that the butyrate level descends towards the base of the colonic crypt (with the highest at the top of the crypt where mature intestinal epithelial cells reside) (Kaiko et al., 2016), it is important to show how the H3K27bu signature is distributed along the crypt. This data would further emphasize the clinical relevance of this study, given that most colorectal cancers (CRCs) arise from stem and progenitor cells.

      Response: We agree that this is an important question and recognize the elegant study by Kaiko et al. However, our human samples are obtained from commercially available tissue microarrays and the sectioning is not consistent across samples, resulting in a minimal amount of samples that we could analyze for staining patterns from crypt to villi (please see Supplemental Figure 1A for example sections). This unfortunately prevents us from completing rigorous image analysis. In future studies, we plan to perform this analysis after we obtain an IRB protocol that will allow us to answer this question in the most rigorous way possible.

      Throughout the manuscript: The rationale for selecting the two CRC cell lines (HCT 116 and Caco2) should be explained. While commonly used, providing background on their genetic differences (e.g., driver mutations) is important, as this could greatly influence the PTM landscape.

      Response: We chose to use both HCT-116 and Caco-2 cancer cell lines throughout our studies, since as noted these cells are the most commonly used lines in the literature. In addition, having consistent results across distinct genetic backgrounds strengthens our results: using both cell lines tells us whether observed PTM patterns are conserved across genetically diverse CRC contexts, as HCT-116 is characterized by mutations in KRAS and PIK3CA, while Caco-2 has mutations in APC and TP53 (PMIDs: 17088437, 24755471, and 16418264). We have added this information into the text in lines 106-107.

      The study lacks additional controls, such as a normal colon epithelial cell line and a non-colonic cell type. Including these would help determine whether the observed butyrate effects are tissue- or disease-specific. This data would also help assess whether SCFA effects, and specifically butyrate's effects, on histone acylation and gene expression are systemic or local.

      Response: Thank you for this insightful comment. We have now included additional data using normal colon cells in the form of mouse colon organoids and a distinct non-intestinal cell line, the embryonic kidney cell line HEK 293T. Importantly, we observe similar changes to chromatin after treatments with different SCFAs in both colon organoids and HEK 293T cells as shown in the cancer cell lines (Figure 1E, 1F). Interestingly, we also observe that the colon cancer cell lines have visible signal of histone butyrylation without treatment, while we only observe these modifications in HEK 293T cells following treatment.

      As for understanding systemic vs. local effects of butyrate on chromatin, we additionally treated cells with different concentrations reflecting the intestinal lumen or serum concentrations of SCFAs: 5 mM and 5 µM, respectively. While the concentrations of SCFAs can vary across individuals, we felt that these numbers reflected differences in intestinal vs. serum levels based on the literature (summarized in PMID: 27259147). Importantly, we observe that only the 5 mM SCFA treatment reflecting levels in the intestinal lumen results in induction of histone acetylation and butyrylation, while the 5 µM treatment reflecting serum SCFA levels failed to induce increased levels of these histone modifications (Figure 1F).

      Together, this data suggests that the response on chromatin to SCFAs is more universal at high concentrations. However, based on local vs. systemic concentrations throughout the body, we expect that responses on chromatin will largely be restricted to the intestine or in other areas or conditions where high concentrations of metabolites are localized.

      Figure 2: The authors show ChIP-seq results in the HCT 116 cell line. To exclude the possibility that the demonstrated chromatin signatures are cell line-specific, results from Caco2 should also be shown. In addition, the 2D environment and multiple passaging alter gene expression in cell lines; using human colonic organoids would provide a more clinically and physiologically relevant model.

      Response: We have now added Cut&Run analysis for the histone acyl marks of interest in Caco-2 cells, which is a technique analogous to ChIP to map genomic localization. Please see now Figure 2C-D. Importantly, we observe very similar localization of these histone modifications across the different cell lines. We also agree that the question of how 2D vs. 3D environment may impact localization of these modifications is important. In organoids, ChIP-seq and Cut&Run are technically difficult. In addition, we feel that using human organoids is currently beyond the scope of our manuscript. However, we previously characterized H3K27bu and H3K27ac occupancy from primary epithelial cells isolated from the mouse intestine (PMID: 38413806). Importantly, in this study we observed similar genomic enrichment of H3K27bu and H3K27ac. This suggests that the general patterns of localization of these modifications across species and across cells isolated from both 2D vs. 3D systems are similar.

      Figure 4 is very confusing. Entinostat itself, as an HDAC inhibitor (iHDAC), increases butyrylation. The data shown are insufficient to draw conclusions. First, the authors should use additional iHDACs, and second, they should illustrate the overlap in gene expression changes between all treatments using a Venn diagram to clarify which genes/signatures are specific to each treatment.

      Response: We agree that testing additional iHDACs is important. We have now included an additional iHDACs (Tucidinostat) in our studies to make more widespread conclusions beyond the activity of Entinostat. We have performed additional treatments, demonstrating that all iHDACS tested increase both histone butyrylation and acetylation (Supplemental Figure 8A-B). We also have performed qPCR for candidate differential genes and demonstrated that expression changes following our treatments with Tucidinostat phenocopy changes observed with Entinostat (Figure 5F). These dynamic gene changes show examples of genes that are responsive to butyrate treatment and p300/CBP inhibition, yet differ from other iHDAC treatment. As requested, we have additionally added a Euler plot to Figure 4 depicting the overlap between treatments in this figure (Figure 5C).

      Figure 4: The authors use an HDAC inhibitor to rule out butyrate's effect on gene expression via HDAC inhibition. However, butyrate can also modulate gene expression through activation of GPR109a. Using GPR109a antagonists is necessary to address this possibility. These data are essential to validate the specific role of histone butyrylation in gene regulation.

      Response: We thank you for this comment and completely agree that butyrate can act through multiple mechanisms, including activation of GPR109a. However, it has previously been demonstrated that this receptor is silenced via DNA methylation in human colon cancer samples and colon cancer cell lines, including HCT-116 (PMID: 19276343). Supporting this notion, we observed very low expression levels of this receptor in our HCT-116 cells (please note the very low TPM values), with minimal differences in response to butyrate treatment (Supplemental Figure 6E, included below). We have additionally included gene expression data for two other potential GPCRs activated by butyrate or other SCFAS (FFAR2 and FFAR3), and also observe very low expression of these genes. Therefore, we concluded that the butyrate effects on gene expression independent of HDAC inhibition in our data are not likely to be dependent on GPR109A or FFAR2/3 signaling.


      New ____Supplemental Figure 6E____: mRNA expression of GPCR genes that are known SCFA targets. Levels of mRNA expression (transcript per million, TPM) as assayed by RNA-seq of GPR109A (official gene name HCAR2), FFAR2, and FFAR3 in HCT-116 cells. Expression levels related to Figure 3. Statistical significance was determined using ANOVA adjusting for multiple comparisons with p

      Supplementary Figure 4 and manuscript: There is no in vivo methods section describing the tributyrin-gavaged mice. The authors should clarify how the experiment was performed, how cells were isolated, whether sorting was performed, and which markers were used.

      Response: We apologize for this confusion. The in vivo data is from previously published work that is publicly available (PMID: 38413806). We analyzed data from mice that were gavaged with tributyrin, where non-sorted IECs were analyzed for RNA-seq. We have clarified this and have added this information in the figure legend (now Supplemental Figure 6).

      Supplementary Figure 4: The GO analysis results show that lipid catabolism is among the top differentially enriched pathways. Butyrate is a known PPARγ agonist (Litvak et al., 2018), and activation of PPARγ is known to drive expression of genes involved in lipid metabolism. The authors need to rule out this function of butyrate before attributing this signature solely to histone butyrylation.

      Response: We appreciate this point and have performed additional analysis to identify whether canonical PPARγ target genes are enriched or not in our data. Additionally, we recognize that our data may reflect the combined effects of both PPARγ activation and histone butyrylation. In Supplemental Figure 6 (Supplemental Figure 4 in the previous version), we especially acknowledge that the differential genes changing may be due to varied mechanisms of butyrate action. Therefore, to address this comment, we performed additional analysis on data related to Figure 5 (previously Figure 4), where we have additional treatments including using a p300/CBP inhibitor to identify potentially more chromatin related mechanisms of action.

      We have now extended our analysis of RNA-seq data related to Figure 5 to include gene ontology enrichment that is not dependent on clustering (Supplemental Figure 9A). While we do not observe PPARγ target genes as top enriched categories, we have also specifically tested the enrichment of PPARg-related MsigDb groups using publicly available datasets (Supplemental Figure 9B). Here, we observe some enrichment of different gene sets related to PPARγ activity across different tissue systems. Together, this new data suggests that some PPARg targets are enriched with our different cell treatments, including butyrate, but they are not the predominant gene categories that we observe changing.

      Most PPARg target genes have been identified in tissue systems beyond the gut, such as adipose tissue and immune cells. To specifically analyze genes in the intestine that are PPARg-dependent, we identified select genes in the literature (PMIDs: 29182565, 28798125, and 28798125). In PMID: 29182565, these genes include lipid transport (Cd36), lipolysis (Hsl, and Atgl), and various lipid metabolism pathways (Cact, Fasn, Mlycd, Dgat2, and Agpat9). In PMID: 28798125, these genes include HMOX1, PDK4, ANGPTL4, UCP2, AQP8, and PLIN2 as butyrate/ PPARg targets. PMID: 28798125 identified Nos2 as a butyrate and PPARg target. Their expression levels following butyrate and other treatments in Figure 5 (formerly Figure 4) are now included as Supplemental Figure 9C (also included below). Interestingly, these genes respond differently compared to the other iHDAC tested (Entinostat) and are only mildly impacted by p300/CBP inhibition (please see A485_Butyrate column vs. Butyrate alone). This suggests that the major impacts on this pathway are not through p300/CBP activity or histone butyrylation, but may be due to other mechanisms of butyrate action. We have also included additional discussion of butyrate and potential roles of PPARg signaling in lines 243-256.

      New Supplemental Figure 9C.

      It would be helpful to include a table of differentially abundant genes as a supplement to the heatmaps and GO analysis.

      Response: We are happy to include tables of differentially expressed genes from all our analysis as supplemental files. This is now included as Supplemental Table 1.

      Significance

      This study explores how microbiota-derived SCFAs, particularly butyrate, influence histone acylation and gene regulation. While the topic is relevant, the work lacks important controls (e.g., normal epithelial and non-colonic cells) and omits mechanistic validation (e.g., GPR109a signaling, PPARγ involvement). The rationale for cell line selection is unclear, and in vivo methods are insufficiently described.

      Audience: The study will mainly interest specialists in microbiota-chromatin interaction. Broader impact is limited by the narrow model scope and underdeveloped mechanistic insight.

      My Expertise:

      Cancer biology, in vivo models, microbiota-host interactions.

      Response: We sincerely thank the reviewer for their very helpful comments. We hope that the above point-by-point responses adequately addresses concerns regarding controls, mechanistic validation, and methods description. We really appreciate their note that the topic is relevant, yet we also feel that our work will have broader impacts due to the interdisciplinary nature of the research and the inclusion of additional model systems (intestinal organoids and additional cell lines) and mechanistic experiments.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This study presents a novel finding that short-chain fatty acids (SCFAs) produced by microbial metabolism regulate gene transcription in human colon cancer cells by modulating histone H3K9 and H3K27 butyrylation and propionylation, both of which are associated with an open chromatin state. The authors further reveal that the major effect of the SCFA mixture is driven by butyrate and identify p300/CBP-dependent, rather than HDAC inhibition-dependent, gene regulation by butyrate. Overall, this is a well-organized study that provides valuable insight into the role of metabolites in human cells.

      Response: Thank you for your positive review of our manuscript. We really appreciate the reviewer pointing out the novelty and organization of our study. Please see below for point-by-point responses to your comments.

      Major comments:

      1. In Figures 1C and 1D, why did the SCFA mixture not increase histone butyrylation or propionylation to the same level as single butyrate treatment? Response: Thank you for this question. We believe that this effect is observed due to differences in butyrate concentrations, as we aimed to keep the total concentration of SCFAs equal across all treatments at 5 mM. In the single treatment, butyrate is at 5 mM while in the mixtures, butyrate is at 1.67 mM (1:1:1) or 1 mM (3:1:1). In addition, in Figure 3A we included a 15 mM mixture for RNA-seq analysis, where butyrate and the other SCFAs are all at 5 mM concentrations. Since we observed highly similar patterns of gene expression with 15 mM or 5 mM final SCFA mixture concentrations, we did not include the 15 mM treatment in our other experiments.

      In Figure 3B, how does butyrate block the effects of acetate and propionate on transcription?

      Response: This is a great question, but we are not necessarily claiming that butyrate is blocking effects of acetate and propionate on transcription. For example, it is also possible that butyrate induces more gene expression changes compared to acetate or propionate, as the number of differentially expressed genes is greater in butyrate-treated cells (Response Table 1).

      Comparison vs. vehicle

      __Upregulated __

      (log2FC > 0)

      __Downregulated (log2FC

      __Upregulated __

      (log2FC > 1)

      __Downregulated (log2FC

      Acetate

      3160

      3518

      433

      352

      Propionate

      3402

      3854

      1304

      735

      Butyrate

      4600

      4539

      2082

      1727

      __Response Table1. Number of differentially expressed genes for each SCFA treatment group, related to Figure 3. __RNA-seq was performed on HCT-116 cells grown in DMEM and treated with 5 mM of single SCFAs for 6 hours. Differential genes were identified using DESeq2 Wald test and statistically significant genes were defined using a padj To fully understand mechanistic differences of butyrate vs. acetate or propionate, we would need to perform additional experiments that we feel are beyond the scope of this current manuscript. However, we speculate that several mechanisms could account for these differences: for example, different histone acylations could have differential impacts on chromatin structure, reader binding, or transcription factor recruitment. As for blocking effects, select longer acylations (butyrylation and crotonylation) have been demonstrated to have repressive effects in transcription or reader protein binding in specific cell contexts (example PMIDs: 27105113, 31676231, 37311463). These are important future studies for our group and will likely shed light on additional mechanistic insights of different histone acylation functions. We have highlighted some of these concepts in the discussion (lines 301-310):

      "We also observe that butyrate and propionate treatment have both overlapping and distinct effects on gene regulation (Figure 3, Supplemental Figure 4, Supplemental Figure 8D). Propionate appears to have more modest effects compared to butyrate, as it induces a smaller number of differential gene changes and these genes do not display enrichment in ATP and nucleotide metabolism categories. These differences in gene regulatory responses to the different SCFA treatments could be due to multiple mechanisms. For example, we speculate that there could be chromatin-independent functions through distinct alterations in metabolic or signaling pathways or chromatin-dependent mechanisms through potential distinct structural effects on chromatin or differences in reader protein binding."

      Which pathways are associated with acetate- and propionate-specific DEGs?

      Response: Thank you for this insightful question. We have performed gene ontology analysis for acetate and propionate DEGs. Interestingly, there is largely overlap between the different SCFA treatments (Supplemental Figure 4A). However, propionate treatment fails to enrich for select gene ontology categories that we observe in acetate treatment (Supplemental Figure 4B, __included below). For example, by gene set enrichment analysis, acetate enriches for gene categories related to nucleotide and ATP synthesis, while propionate does not. However, both acetate and propionate (and all SCFA treatments) are enriched in categories related to the ribosome and rRNA (__Supplemental Figure 4B-C). We have added this analysis to the manuscript as Supplemental Figure 4 and included additional discussion of this analysis in the text in lines 163-171 (included below), as well as additional speculation about differences between propionylation and butyrylation in lines 301-310 (included above).

      *"We further analyzed gene programs changing with different SCFA treatments. All SCFA treatments regulated largely overlapping gene programs including those related to RNA metabolism, ATP synthesis, and ribosome function (Supplemental Figure 4a). Since butyrate overlapped greatly with the combination SCFA treatment, we specifically analyzed acetate and propionate gene programs (Supplemental Figure 4b-c). Interestingly, propionate treatment failed to enrich for select gene ontology categories that we observe in other SCFA treatments. Specifically, propionate-dependent gene programs did not include those related to ATP and nucleotide metabolism, highlighting some differences in gene expression changes following different SCFA treatments." *

      • *

      New__ Supplemental Figure 4B.__

      Which genes are related to growth inhibition in butyrate-treated cells? Does the 1:1:1 SCFA mixture have a similar impact on cell growth as single butyrate treatment?

      Response: Butyrate has previously been shown to inhibit cell growth in colon cancer cells (PMIDs 9125124, 33017771, 38398853). These include differential regulation of key cell cycle regulators, such as p21 and Cyclin D1. We have included both GO term enrichment for the 1:1:1 SCFA mix and gene expression data for select cell cycle regulators in Supplemental Figure 7C-D (7D also included below). This demonstrates that both butyrate and the SCFA mixtures, and to a lesser extent propionate, differentially regulate key cell cycle genes including CDKN1C, CDK2, CDK4, WEE1, and RB1. We have additionally performed a GLO assay for the 1:1:1 SCFAs treatment to investigate its impact on growth inhibition, which is now included as Supplemental Figure 7B. Here, we observe that the 1:1:1 and 3:1:1 mixtures of SCFAs significantly decrease cell viability. However, this is not to the same extent as butyrate treatment alone. Together, this data suggest that butyrate reduces cell viability at least in part through altering key cell cycle genes. This effect is mimicked with the SCFA mixture treatments, but to a lesser extent compared to butyrate alone.

      New Supplemental Figure 7D.


      Reviewer #2 (Significance (Required)):

      General assessment: This study clearly demonstrates the role of butyrate in gene regulation and elucidates its underlying regulatory mechanisms. However, it does not provide insight into how butyrate counteracts the effects of acetate and propionate, despite these metabolites often being detected together. In addition, it remains unclear which specific histone PTMs are associated with the distinct gene expression changes induced by different short-chain fatty acids. Lastly, the observation that histone butyrylation and propionylation correlate with active transcription is not novel.

      Advance: This study advances understanding of short-chain fatty acids in chromatin and gene regulation, highlighting butyrate's dominant role and its p300/CBP-dependent rather than HDAC inhibition-dependent mechanism.

      Audience: This work may attract significant interest in both the epigenetics and metabolism fields.

      My expertise: histone acetylation, HATs, transcriptional regulation, cancer.

      Response: We very much appreciate all of these thoughtful comments. We are thankful for the recognition that this story advances our understanding of SCFA function through chromatin and may be of significant interest to the epigenetics and metabolism fields. We hope that we have now provided additional insight into roles of propionate and acetate (Supplemental Figure 4). We also recognize that similar to other studies, we observe colocalization of the different histone marks and it is difficult to tease apart specific functions. We plan to further address this important question in future studies.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors explore the effects of short-chain fatty acids (SCFAs) acetate, propionate, and butyrate on chromatin and gene expression in human colon cancer cells. The authors first characterize the presence of histone propionylation and histone butyrylation in different colon cancer cell lines as a function of SCFA treatments. Then, they perform ChIP-seq to determine the genomic localization of these marks and observe that these marks are deposited on euchromatic regions similar to H3K4Me3 and to one another, consistent with previous reports. The authors then performed gene expression analysis to determine the contribution of the SCFAs. Interestingly, they observe that butyrate treatment alone mimicked the gene expression profile of an equimolar mixture of short-chain fatty acids treatment, at least in the tested cell lines. Finally, the authors designed an experiment to try to separate the functions of butyrate on gene expressions that are dependent on p300/CBP and are independent of the HDAC inhibition property. The following aspects of the paper need addressing-

      Response: We sincerely thank the reviewer for their very helpful and constructive comments. We appreciate the notes on interesting aspects of our study. We hope that we have addressed all concerns as described below.

      Major comments

      1. There is no confirmation of the validity of the results seen from ChIP-seq (Figure 2) and RNA-seq (Figure 3). The majority of the findings of the paper are derived from ChIP-seq and RNA-seq data, and hence, experiments validating such results need to be established. ChIP-qPCR for representative gene(s) with adequate controls needs to be performed for different acyl marks (H3K27bu, H3K27pr, H3K4Me3, H3K9pr, H3K9bu) to support the ChIP-seq results, and RT-qPCR for representative gene(s) for different treatment conditions (vehicle, acetate, propionate, butyrate, and 5 mM 1:1:1 mixture) for validating RNA-seq results. Response: We are happy to include validation by qPCR of our ChIP and RNA-seq results. The qPCR validation for Figure 3 is now included as Figure 3F and qPCR validation for ChIP-seq is included as Figure 4C. We have selected genes that are differentially expressed and also display occupancy of different histone acyl marks. In addition, we performed additional qPCR validation for our RNA-seq data related to Figure 5 (previously Figure 4), which is now included as Figure 5F-G. Lastly, we performed orthogonal analysis of ChIP using Cut&Run in Caco-2 cells, which is now included as Figure 2C-D. This further supports our findings with HCT-116 cells.

      The authors describe an interesting strategy to differentiate the different functions of butyrate (Figure 4). The authors propose that differential genes that change with p300/CBP inhibitor treatment, that are separate from HDAC inhibitor treatment, are potential genes that are a function of histone butrylation. An important control that is missing in this experiment is cells treatment with propionate. In their previous findings (Figure 1C-D), they note that both propionate and butyrate treatments elevate the levels of histone acetylation, propionylation, and butyrylation. But the HDAC inhibitory activity of propionate is not very well established, and performing experiments to prove it is are beyond the scope of this paper. Importantly, p300/CBP has been shown to catalyze histone propionylation with higher efficiency compared to histone butyrylation (PMID: 27820805, PMID: 29070843). Therefore, it would be ideal to include differentially expressed genes from propionate-treated cells in the analysis to rule out any discrepancy.

      Response: Thank you for this insightful comment. We agree that propionate also elevates histone butyrylation and may have important effects. We have therefore included our differentially expressed genes with propionate treatment from Figure 3 in our analysis related to HDAC inhibition: we have plotted these differentially expressed genes in a matched, ordered column to our clustering analysis in Figure 4 (now Figure 5) as Supplemental Figure 8D (also included below). This demonstrates that overall propionate has similar gene expression changes to butyrate, but the extent of these changes is less pronounced compared to butyrate. In addition, our qPCR validation analysis in Figure 3F demonstrates that propionate similarly regulates some differentially expressed genes affected by butyrate (such as PHOSPHO1 and HOXB9) but fails to differentially regulate other targets (such as CYSRT1). This suggests that propionate and butyrate have both overlapping and distinct targets, which is consistent with our global analyses in Figure 3A-D. Lastly, we now have included specific analysis of gene program changes related to propionate treatment (Supplemental Figure 4). Interestingly, there is largely overlap between the different SCFA treatments (Supplemental Figure 4A). However, propionate treatment fails to enrich for select gene ontology categories that we observe in other SCFA treatments (Supplemental Figure 4A-B). For example, by gene set enrichment analysis, other SCFA treatments enrich for gene categories related to nucleotide and ATP synthesis, while propionate does not. However, all SCFA treatments are enriched in categories related to the ribosome and rRNA (Supplemental Figure 4B-C). Together, this data suggests that propionate has largely similar effects to butyrate treatment in regulating gene expression programs with some distinct differences.

      New Supplemental Figure 8D.

      Along the same lines as comment #2, other possible "functions" of propionate and/or butyrate that could explain why treatment with them increase histone acetylation, propionylation, and butyrylation are not discussed. This work was not cited/discussed: PMID 34677127 despite being very closely related and relevant. Indeed, there seems to be some redundancy of efforts between that paper (2021) and this one even in terms of the specific experiments performed.

      Response: Thank you for this comment, and we sincerely apologize for our oversight in not citing this important work. We are very familiar with this paper, and this was an unfortunate accidental oversight. We have now cited it throughout the text in lines 51, 123, and 330. In addition, we expanded our discussion about how our single treatments of butyrate or propionate increase levels of multiple histone acyl marks including acetylation, butyrylation, and propionylation. We now include activation of p300 as a potential mechanism for this observation in lines 327-330: "This is consistent with the role of butyryl-CoA and propionyl-CoA functioning as activators of p300 acetyltransferase activity, where these molecules can directly stimulate p300 auto-acylation and acetylation activity on histones and other substrates12" Lastly, while we agree that many of our treatments are similar to this paper, we also feel that our downstream analysis is distinct, as we are focusing on genomic localization and gene expression changes, in addition to changes in levels of the histone marks themselves. We believe that this distinction lessens the redundancy between our papers and may be of interest to the chromatin field.

      An analysis for correlations between the ChIP-seq data for H3K27bu (Fig 2) and RNA-seq data following butyrate treatment (Fig 3) would provide further insights into whether the genes/pathways that are enriched/downregulated in H3K27bu ChIP-seq data correlate with genes/pathways that are upregulated/downregulated in RNA-seq data.

      Response: We really appreciate this suggestion and agree that this analysis would add important additional insights. We have therefore performed this analysis through binning genes by expression level and analyzed occupancy of H3K27bu according to gene expression quartiles, which is now included as Figure 4B. Additionally, we included the other histone butyrylation and propionylation marks that are the focus of our manuscript. We have found that levels of H3K27bu occupancy are correlated with high gene expression quartiles. Importantly, this is also consistent with our earlier work in primary mouse intestinal cells (PMID: 38413806).

      Minor comments

      1. All the images appear to be very low resolution. This could be due to the online submission system. Response: We apologize for this issue and believe it is due to the submission system.

      For Fig 2, the caption says "...treated with different SCFAs for 24 hours," but it is unclear precisely what the treatment was. Were the cells treated with the SCFA mix, and then ChIP-seq was performed for the 5 different marks tested? Or were there different SCFA treatments performed for each mark that was ChIPed?

      Response: We have revised the text of the figure legend to make it clear that we treated cells with individual SCFAs (propionate for propionylation marks and butyrate for butyrylation marks).

      Line 99-100: "Treatment with butyrate, propionate, or a mixture of all three SCFAs resulted in a global increase in histone butyrylation or propionylation" is misleading. The authors test only specific sites on Histone H3 using site-specific antibodies and do not test whether these treatments increase global levels of acylation on other histones and sites using pan-acyl antibodies. So, this sentence needs to be rephrased to clearly indicate that the treatments only increased at the tested sites.

      Response: Thank you for this comment. We understand this was misleading and that was not our intention at all. By writing "global levels," we simply meant levels of immunoblotting signal at these specific lysine residues. We have therefore revised the text to make it clearer (now in lines 102-104): "Treatment with butyrate, propionate, or a mixture of all three SCFAs resulted in significant increases of histone butyrylation and propionylation at select residues of histone H3, as assayed by immunoblotting".

      Reviewer #3 (Significance (Required)):

      Strengths and limitations: The experiments in the study were performed with a high degree of rigor, including appropriate controls. The discussion of the -seq data in Figs 2-4 avoided focusing on or following up on specific genes, which limited the conclusions from these data to being very broad. A key paper (that was not recent) was missing from the context presented in the paper, weakening the discussion of the data presented.

      Advance: The advance is pretty conceptually incremental. Similar experiments as in Fig 1-3 in similar models have been performed in other papers already (e.g., PMID 39789354 in 2025 and PMID 34677127 in 2021), although Fig 4 was an interesting experiment that helps differentiate the work from existing literature.

      Audience: This work would be interesting to a chromatin audience as well as a microbiome audience, but the scope of the conclusions from this paper, and it's redundancy with other literature, will limit its profile.

      My expertise is in histone PTM biochemistry and biology, including non-canonical histone acyl PTMs.

      Response: We really appreciate the thoughtful and constructive comments and the recognition that this story may be of interest to the chromatin and microbiome audiences. In addition, we acknowledge other similar recent work that is also very interesting, but we also feel that our manuscript is distinct in several important ways from these studies. In particular, the analysis of gene expression changes that we propose to be histone butyrylation dependent vs. through HDAC inhibition (Figure 5, previously Figure 4) and the finding that butyrate drives SCFA combination gene expression changes (Figure 3). We are very grateful for the recognition of these interesting findings by this reviewer. Furthermore, we also want to highlight that we have expanded our analysis of human tissues (Supplemental Figure 1), which adds additional novelty to this work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: The authors explore the effects of short-chain fatty acids (SCFAs) acetate, propionate, and butyrate on chromatin and gene expression in human colon cancer cells. The authors first characterize the presence of histone propionylation and histone butyrylation in different colon cancer cell lines as a function of SCFA treatments. Then, they perform ChIP-seq to determine the genomic localization of these marks and observe that these marks are deposited on euchromatic regions similar to H3K4Me3 and to one another, consistent with previous reports. The authors then performed gene expression analysis to determine the contribution of the SCFAs. Interestingly, they observe that butyrate treatment alone mimicked the gene expression profile of an equimolar mixture of short-chain fatty acids treatment, at least in the tested cell lines. Finally, the authors designed an experiment to try to separate the functions of butyrate on gene expressions that are dependent on p300/CBP and are independent of the HDAC inhibition property. The following aspects of the paper need addressing-

      Major comments

      1. There is no confirmation of the validity of the results seen from ChIP-seq (Figure 2) and RNA-seq (Figure 3). The majority of the findings of the paper are derived from ChIP-seq and RNA-seq data, and hence, experiments validating such results need to be established. ChIP-qPCR for representative gene(s) with adequate controls needs to be performed for different acyl marks (H3K27bu, H3K27pr, H3K4Me3, H3K9pr, H3K9bu) to support the ChIP-seq results, and RT-qPCR for representative gene(s) for different treatment conditions (vehicle, acetate, propionate, butyrate, and 5 mM 1:1:1 mixture) for validating RNA-seq results.
      2. The authors describe an interesting strategy to differentiate the different functions of butyrate (Figure 4). The authors propose that differential genes that change with p300/CBP inhibitor treatment, that are separate from HDAC inhibitor treatment, are potential genes that are a function of histone butrylation. An important control that is missing in this experiment is cells treatment with propionate. In their previous findings (Figure 1C-D), they note that both propionate and butyrate treatments elevate the levels of histone acetylation, propionylation, and butyrylation. But the HDAC inhibitory activity of propionate is not very well established, and performing experiments to prove it is are beyond the scope of this paper. Importantly, p300/CBP has been shown to catalyze histone propionylation with higher efficiency compared to histone butyrylation (PMID: 27820805, PMID: 29070843). Therefore, it would be ideal to include differentially expressed genes from propionate-treated cells in the analysis to rule out any discrepancy.
      3. Along the same lines as comment #2, other possible "functions" of propionate and/or butyrate that could explain why treatment with them increase histone acetylation, propionylation, and butyrylation are not discussed. This work was not cited/discussed: PMID 34677127 despite being very closely related and relevant. Indeed, there seems to be some redundancy of efforts between that paper (2021) and this one even in terms of the specific experiments performed.
      4. An analysis for correlations between the ChIP-seq data for H3K27bu (Fig 2) and RNA-seq data following butyrate treatment (Fig 3) would provide further insights into whether the genes/pathways that are enriched/downregulated in H3K27bu ChIP-seq data correlate with genes/pathways that are upregulated/downregulated in RNA-seq data.

      Minor comments

      1. All the images appear to be very low resolution. This could be due to the online submission system.
      2. For Fig 2, the caption says "...treated with different SCFAs for 24 hours," but it is unclear precisely what the treatment was. Were the cells treated with the SCFA mix, and then ChIP-seq was performed for the 5 different marks tested? Or were there different SCFA treatments performed for each mark that was ChIPed?
      3. Line 99-100: "Treatment with butyrate, propionate, or a mixture of all three SCFAs resulted in a global increase in histone butyrylation or propionylation" is misleading. The authors test only specific sites on Histone H3 using site-specific antibodies and do not test whether these treatments increase global levels of acylation on other histones and sites using pan-acyl antibodies. So, this sentence needs to be rephrased to clearly indicate that the treatments only increased at the tested sites.

      Significance

      Strengths and limitations: The experiments in the study were performed with a high degree of rigor, including appropriate controls. The discussion of the -seq data in Figs 2-4 avoided focusing on or following up on specific genes, which limited the conclusions from these data to being very broad. A key paper (that was not recent) was missing from the context presented in the paper, weakening the discussion of the data presented.

      Advance: The advance is pretty conceptually incremental. Similar experiments as in Fig 1-3 in similar models have been performed in other papers already (e.g., PMID 39789354 in 2025 and PMID 34677127 in 2021), although Fig 4 was an interesting experiment that helps differentiate the work from existing literature.

      Audience: This work would be interesting to a chromatin audience as well as a microbiome audience, but the scope of the conclusions from this paper, and it's redundancy with other literature, will limit its profile.

      My expertise is in histone PTM biochemistry and biology, including non-canonical histone acyl PTMs.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study presents a novel finding that short-chain fatty acids (SCFAs) produced by microbial metabolism regulate gene transcription in human colon cancer cells by modulating histone H3K9 and H3K27 butyrylation and propionylation, both of which are associated with an open chromatin state. The authors further reveal that the major effect of the SCFA mixture is driven by butyrate and identify p300/CBP-dependent, rather than HDAC inhibition-dependent, gene regulation by butyrate. Overall, this is a well-organized study that provides valuable insight into the role of metabolites in human cells.

      Major comments:

      1.In Figures 1C and 1D, why did the SCFA mixture not increase histone butyrylation or propionylation to the same level as single butyrate treatment? 2.In Figure 3B, how does butyrate block the effects of acetate and propionate on transcription? 3.Which pathways are associated with acetate- and propionate-specific DEGs? 4.Which genes are related to growth inhibition in butyrate-treated cells? Does the 1:1:1 SCFA mixture have a similar impact on cell growth as single butyrate treatment?

      Significance

      General assessment: This study clearly demonstrates the role of butyrate in gene regulation and elucidates its underlying regulatory mechanisms. However, it does not provide insight into how butyrate counteracts the effects of acetate and propionate, despite these metabolites often being detected together. In addition, it remains unclear which specific histone PTMs are associated with the distinct gene expression changes induced by different short-chain fatty acids. Lastly, the observation that histone butyrylation and propionylation correlate with active transcription is not novel.

      Advance: This study advances understanding of short-chain fatty acids in chromatin and gene regulation, highlighting butyrate's dominant role and its p300/CBP-dependent rather than HDAC inhibition-dependent mechanism.

      Audience: This work may attract significant interest in both the epigenetics and metabolism fields.

      My expertise: histone acetylation, HATs, transcriptional regulation, cancer

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Kabir et al. explore the impact of microbiota-derived short-chain fatty acids (SCFAs) on chromatin structure and gene expression in human cells. They show that SCFAs, particularly butyrate, contribute to specific histone modifications such as butyrylation at H3K27, detectable in human colon tissue. Additional modifications like acetylation, butyrylation, and propionylation at H3K9 and H3K27 respond to SCFA levels and are enriched at active regulatory regions in colorectal cancer cells. Treatment with individual or combined SCFAs mimicking gut conditions alters gene expression patterns, with butyrate playing a dominant regulatory role. Butyrate's effects on gene expression are claimed to be independent of HDAC inhibition and instead rely on the p300/CBP complex through histone butyrylation. These findings underscore SCFAs as crucial modulators of epigenetic regulation in the human colon and highlight butyrate's dominant role in shaping chromatin and gene regulation beyond its known metabolic functions.

      The authors used two human cell lines and an in vivo murine model paired with RNA and ChIP sequencing approaches to identify target genes and chromatin modifications in response to SCFAs. While the findings are interesting and could provide important insights into the epigenetic influence of SCFAs in human cells, the study would benefit from additional experiments to strengthen the conclusions. Comments and suggestions are listed below:

      1. Figure 1: The H3K27bu expression in human biopsies highlights the clinical significance of the current study. However, the authors need to provide more information on the human colon samples, e.g., how many total patients were analyzed, and what were the age and/or sex. Only the methods mention the use of benign TMA; this should also be clarified in the figure legends. It would also be helpful to show histone butyrylation levels in normal vs. cancer human tissues.
      2. Figure 1: In addition, given that the butyrate level descends towards the base of the colonic crypt (with the highest at the top of the crypt where mature intestinal epithelial cells reside) (Kaiko et al., 2016), it is important to show how the H3K27bu signature is distributed along the crypt. This data would further emphasize the clinical relevance of this study, given that most colorectal cancers (CRCs) arise from stem and progenitor cells.
      3. Throughout the manuscript: The rationale for selecting the two CRC cell lines (HCT 116 and Caco2) should be explained. While commonly used, providing background on their genetic differences (e.g., driver mutations) is important, as this could greatly influence the PTM landscape.
      4. The study lacks additional controls, such as a normal colon epithelial cell line and a non-colonic cell type. Including these would help determine whether the observed butyrate effects are tissue- or disease-specific. This data would also help assess whether SCFA effects, and specifically butyrate's effects, on histone acylation and gene expression are systemic or local.
      5. Figure 2: The authors show ChIP-seq results in the HCT 116 cell line. To exclude the possibility that the demonstrated chromatin signatures are cell line-specific, results from Caco2 should also be shown. In addition, the 2D environment and multiple passaging alter gene expression in cell lines; using human colonic organoids would provide a more clinically and physiologically relevant model.
      6. Figure 4 is very confusing. Entinostat itself, as an HDAC inhibitor (iHDAC), increases butyrylation. The data shown are insufficient to draw conclusions. First, the authors should use additional iHDACs, and second, they should illustrate the overlap in gene expression changes between all treatments using a Venn diagram to clarify which genes/signatures are specific to each treatment.
      7. Figure 4: The authors use an HDAC inhibitor to rule out butyrate's effect on gene expression via HDAC inhibition. However, butyrate can also modulate gene expression through activation of GPR109a. Using GPR109a antagonists is necessary to address this possibility. These data are essential to validate the specific role of histone butyrylation in gene regulation.
      8. Supplementary Figure 4 and manuscript: There is no in vivo methods section describing the tributyrin-gavaged mice. The authors should clarify how the experiment was performed, how cells were isolated, whether sorting was performed, and which markers were used.
      9. Supplementary Figure 4: The GO analysis results show that lipid catabolism is among the top differentially enriched pathways. Butyrate is a known PPARγ agonist (Litvak et al., 2018), and activation of PPARγ is known to drive expression of genes involved in lipid metabolism. The authors need to rule out this function of butyrate before attributing this signature solely to histone butyrylation.
      10. It would be helpful to include a table of differentially abundant genes as a supplement to the heatmaps and GO analysis.

      Significance

      This study explores how microbiota-derived SCFAs, particularly butyrate, influence histone acylation and gene regulation. While the topic is relevant, the work lacks important controls (e.g., normal epithelial and non-colonic cells) and omits mechanistic validation (e.g., GPR109a signaling, PPARγ involvement). The rationale for cell line selection is unclear, and in vivo methods are insufficiently described.

      Audience:

      The study will mainly interest specialists in microbiota-chromatin interaction. Broader impact is limited by the narrow model scope and underdeveloped mechanistic insight.

      My Expertise:

      Cancer biology, in vivo models, microbiota-host interactions.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this report, Segal and coworkers describe the identification of the CD99 receptor as a marker whose expression discriminates betweeen two populations of a EWS tumor cell model, with distinct morphologies, behavior and xenograft tumor progression. As a top differentially expressed gene, they identify caveolin-1, a pervasive but highly contextual modulator of tumor cell behavior, as a potential driver of Pi3K/AKT-mediated thriving and survival.

      Major conceptual comments:

      The study uses state-of-the-art technology and their observations are potentially relevant to the field. However, the description of the mechanistic link between CAV1 and CD99 is not clear. The system they developed to stably manipulate CAV1 levels would be ideal to test the relevance of their claims in vivo. Certain additional experiments might clarify the dependency on caveolae or CAV1 (cholesterol/metabolic intervention). I am also intrigued by the proceedings they used to "evolve" CD99hi and CD99lo populations: mechanical stimuli can modulate caveolae assembly, and this should be addressed at least in one independent cell model, if not assessing whether primary tumors exhibit these diverging populations.

      Major technical comments:

      • the completeness of the "mechanical passaging" should be somehow demonstrated. Does this procedure progressively affect CD99 and CAV1 expression?

      • To my understanding, knockdown assays are performed using only one shRNA sequence. this should be validated with at least one independent shRNA intervention, or a similar approach like CRISPr gene ablation

      • The "aspect ratio" morphology score should be better explained and biologically contextualized (and if possible, correlated with Akt reporter signal) in the main figures

      • Drug resistance assays should be better detailed, first in vitro and then in vivo

      Significance

      The study aims to identify contextual drivers of EWS tumor cells, and identify two potential molecules as associated with a proliferative/survival phenotype. As such, the aim of the study is important. However, the differential phenotypes investigated arise from a non-standard form of cell passaging in a single established cell line model, and no exploration on the link of their findings to actual human EWS disease is clearly explored. The proposed model of interdependence among the potential drivers found is neither clear at mechanistic level, limiting the potential of these findings to identify specific biomarkers and/or interventions. For publication, to the very least

      (1) The mechanistic ties among the different drivers explored should be better characterized (also at technical level, see above) and

      (2) the relevance of CAV1 KD in vivo should be shown. I strongly encourage the exploration of public data on EWS for the relevance of their findings, even if preliminary (i.e. expression levels associated with increased or decreased patient survival, for example).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The submitted article identifies a distinct subpopulation of Ewing Sarcoma (EwS) cells characterized by high CD99 and elevated Caveolin-1 expression; shows that Caveolin-1 in these cells orchestrates PI3K/AKT signaling by specifically modulating the spatial organization of PI3K activity on the plasma membrane; demonstrates that the CD99-High and CD99-Low states are reversible, providing a flexible mechanism for survival-oriented plasticity in response to chemotherapy; and proposes that unlike CD99-Low cells,
 CD99-High cells use a Caveolin-1-driven signaling architecture to survive.

      Mayor comments:

      • The conclusion are well substantiated by the data and no additional experiments are needed to support the claims of the paper. Data and methods are presented in such a way that they can be reproduced. Experiments replicates and statistical analysis appear adequate.

      Minor comments:

      While the bibliography is appropriate, the manuscript should clarify its novelty from the outset. Specifically, the Introduction must acknowledge that the role of increased Akt activity in Cav-1-mediated cell survival is already established (Li et al., 2003). This study, currently listed as reference 42 in the Discussion, should be moved to the Introduction and discussed alongside references 31 and 32 to properly frame the study's context.

      Regarding the role of caveolin-1 facilitating mechanosensitive Akt signaling, Sedding et al. (doi: 10.1161/01.RES.0000160610.61306.0f.) should be cited.

      Likewise, Results: "Knockdown of Cav-1 in TC71 cells drastically reduced phosphorylated Akt levels in CD99Hi cells (Figure 5A), suggesting a dependence on Caveolin-1 for their Akt signaling". Rather than suggesting, these results are consistent with published data (Yang, H et al 2016; reference 32)

      The nature of the CD99high state remains ill-defined. While the authors identify CD99 as a standard biomarker for Ewing Sarcoma (EwS) (Ref 38), they simultaneously suggest that this specific CD99high subpopulation may have been previously overlooked due to its sensitivity to conventional enzymatic dissociation. To avoid confusion, the authors should explicitly clarify the distinction between baseline CD99 expression and this highly sensitive, high-expression state.

      Figure 4C. In addition to caveolae, CD99Hi, CD99Lo, and CD99Hi+Cav-1KD cells also present conspicuous differences in ER and ribosome content. These should be taken into account and the possibility be considered that the effect of Cav-1 may actually be caveolin-independent.

      Significance

      This manuscript adds further support to the role of caveolin-1 facilitating mechanosensitive Akt signaling, and establishes a novel link to CD99 expression in Ewing sarcoma.

      These findings will be of interest for researchers across the fields of mechanobiology and preclinical oncology, particularly those investigating caveolar scaffolds and PI3K/Akt-driven malignancies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript identified a subpopulation of Ewing sarcoma TC71 cells following mechanical passaging that express high levels of the CD99 marker. This CD99-hi TC71 population was shown by multiple in vitro and in vivo assays to be more aggressive than the CD99-lo population (Figures 1-3). The authors then show that Caveolin-1 is upregulated at the transcriptional level in CD99-hi cells and that these cells have caveolae, show that Cav1 expression in the CD99-hi cells reduced pAkt signaling and they propose that this affects survival of this cell population.

      The strength of the paper is the very exhaustive in vitro and in vivo phenotypical analysis of the CD99-hi population (Figures 1-3).

      There are however multiple important weaknesses and omissions in this paper:

      1. The study is based on a single cell line, TC71, and conclusions are extended to Ewings sarcoma in general. Attributing conclusions generally to Ewings sarcoma must necessarily be based on analysis of multiple Ewings sarcoma cell lines and preferably supported by patient tumor data.

      2. The statement that caveolin-1 is a molecular signature of the CD99-hi state is not supported by the Western blot in Supp fig 4E, that shows that CD99-hi cells have lower Cav1 levels than CD99-lo cells, even though the CD99-hi have caveolae. This must be explained mechanistically and functionally and it cannot be argued that caveolin-1 is a "molecular signature of the CD99-hi state" if caveolin-1 expression levels are reduced in the CD99-hi population.

      3. The mechanistic role of caveolin-1 selectively in the CD99-hi population needs to be better established. Data supporting a role for Cav1 in survival is weak (Fig 4D) and not supported by the data presented in Supp fig 4G where Cav1 KD shows no effect on survival. A selective role for caveolin-1 in the CD99-hi cells must be demonstrated by parallel analysis of CD99-lo cells. Similarly ,the effects of caveolin-1 knockdown on AKT signaling are restricted to CD99-hi cells (Fig 5AB) and must be also shown for CD99-lo cells.

      4. There is no data linking AKT signaling to the CD99-hi phenotype elaborately detailed in Figs 1-3. If as the authors claim " the CD99High state establishes a Caveolin-1-driven signaling architecture that supports tumor cell survival through mechanisms distinct from those used by CD99Low cells" then: 1. caveolin-1 dependent Akt signaling must be shown to be specific to CD99-hi and not CD99-lo cells; and 2. Akt signaling shown to selectively regulate survival of CD99-hi cells in a caveolin-1 dependent manner, based on the in vivo assays developed in figures 2 and 3.

      Significance

      The strength of the paper is the very exhaustive in vitro and in vivo phenotypical analysis of the CD99-hi population. There are however multiple important weaknesses and omissions in this paper which when addressed, could considerably improve the significance of the manuscripts findings for the community.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the Reviewers for their comments on our manuscript “Structural insights into mitotic-centrosome assembly”. As described below, we have substantially revised the manuscript in response to their comments and are hoping you would consider the revised manuscript “Phosphorylation relieves autoinhibition to drive Cnn centrosome scaffold assembly” at The EMBO Journal. Our specific responses (black text) to the Reviewer’s comments (blue text) are detailed below

      Reviewer #1

      Main Points:

      1) From previous studies, it seems to me that for the residues potentially relevant for the hairpin regulation there is direct evidence of phosphorylation only for S567 (mass spec, phospho-antibody). Have the authors tested single site mutants (S567A and E)? Also, have they tested D mutations? If so, this should be commented on and shown. If not, it should be tested, in particular since the 2E phospho-mimetic is not functioning properly in vivo. If S571 is indeed crucial, it should be demonstrated that it is also phosphorylated. Otherwise it is possible that the mutation of this residue simply impairs important interactions (e.g. PReM-CM2, others), independent of phosphorylation.

      As requested, we have now tested individual S567A and S571A mutations and found that they both perturb Cnn scaffold assembly, but to a lesser extent than the 2A double mutant (New Fig.S3A). We also now confirm by MS that recombinant Polo can phosphorylate both S567 and S571 in vitro, and we have examined the behaviour of a 2D mutant and find that it behaves very similarly to the 2E mutant (New Fig.S3B).

      2) It is unclear why in vitro only A mutations have been tested and not phospho-mimetics. This should be tested for the interaction between PReM and CM2. This would allow to probe the model that phosphorylation opens the hairpin to allow interaction. Currently, such proof is missing in the study. Alternatively, the authors could phosphorylate the recombinant protein in vitro. The in vivo data is harder to interpret due to the complexity of the model and the authors should take advantage of the in vitro system.

      As requested, we now show in New Fig.S5 that whereas in vitro WT Cnn490-608 and Cnn-2A490-608 behave as dimers, Cnn-2E490-608 elutes in two major fractions—a tetramer species and a much larger species that elutes in the void volume (meaning that 2E can form very large species even in the absence of CM2) (Figure S5A). In the presence of CM2, Cnn-2E490-608 forms a tetramer (that eluted slightly later than the Cnn-2E490-608 tetramer) and larger complexes that contained CM2 and eluted in the void volume with a profile similar to Cnn-2E490-608 on its own (Figure S5B). These results are consistent with the possibility that the 2E substitutions open the helical hairpin to allow self-interactions that drive homo-tetramer and larger complex assembly in vitro.

      3) Regarding the worm PReM and CM2 domains, the authors mention that they have tested in vitro phosphorylation by PLK-1, but I could not find any data showing this. They should demonstrate successful phosphorylation or test candidate site by phospho-mimetic mutation. It is possible that the worm proteins depend more strongly on phosphorylation to relieve autoinhibition compared to the fly proteins.

      This is a good point, and we apologise for this omission. We now state that we confirmed by MS analysis that the recombinant worm PLK-1 we used in these in vitro experiments phosphorylates the putative SPD-5 PReM domain on the three sites (S627, S653 and S658) known to be important for promoting SPD-5 scaffold assembly in vivo (Figure Legend, Figure 6). Thus, the lack of detectable binding between these proteins is not due to the lack of phosphorylation.

      Minor Point:

      4). Fig. 6C, D: the labeling of the chimeric constructs using "+" symbols is confusing, since it suggests that separate proteins were expressed. If I understand this correctly, with the current labeling, deltaCM2+DmCM2 means WT? The authors should write the full name of the wildtype or chimeric construct in each case and use a more standard/less confusing nomenclature. Also, I suggest to start the panels and graphs with the WT sample.

      We thank the Reviewer for this suggestion and have re-labelled this Figure to clarify this point. We understand the point about putting the WT panels first in Figure 6C,D (now Figure 5C,D) but think that this is not the correct comparison to emphasise. We are testing the ability of the various CM2 domains to “rescue” the lack of a CM2 domain, so we feel Drosophila Cnn lacking CM2 is the correct baseline for this comparison.

      Reviewer #2

      Main Comments:

      1. The title is too vague. Any number of existing papers could be said to provide "structural insights into mitotic centrosome assembly". The authors need to narrow down to a defined conclusion and state this as the title.
      2. I think the strongest and most novel aspects of this study relate to the mechanism of Cnn assembly via relief of the auto-inhibited PReM. The effort to elucidate assembly mechanisms of SPD-5 and CDK5RAP2 are comparatively light and there are no accompanying experiments in worms or human cells. Without the in vivo experiments, it's hard to know if the in vitro experiments are valid. It's speculative for the authors to say they found the true PReM for CDK5RAP2; they do not demonstrate that PLK-1 phosphorylation potentiates assembly in Figure 8. Thus, I suggest re-writing the paper to focus on Cnn. Experiments in Figure 6 are still valid if reframed. For example, substituting Cnn's CM2 with the CM2 from CDK5RAP2 vs. the C-term of SPD-5 illustrates that a simple coiled-coil with open ends (H.s.CM2) is sufficient to interact with PReM whereas a coiled-coil with a closed end (SPD-5 C-term, predicted by Figure 6A) cannot. We thank the Reviewer for these helpful comments and have re-written and re-organised the manuscript in accord with these suggestions—most importantly providing a more specific title and re-ordering the data to better focus the paper on the relief of Cnn autoinhibition.

      The purpose of Figure 1 is unclear. None of the other figures examine SPD-5 and CNN in the condensate form, which required using 4% PEG in this paper. The other assays look at the network form, which could behave differently and have different dependence on specific domains. I think they should perform the condensate assay for all other figures, otherwise leave it out. Furthermore, CDK5RAP2 is mentioned, yet not examined in Figure 1. It must be noted that CDK5RAP2 will also condense into droplets under crowding conditions or with a synthetic nucleator (Rios et al., 2025 J Cell Sci). Thus, it seems that condensation potential is a universal feature of known PCM scaffold proteins.

      The original Figure 1 has been moved to end of the paper (now Figure 8) and we now more thoroughly explain the logic of these experiments. Briefly, given that the PReM and CM2 domains in flies and worms seem to function in different ways in vivo, we sought here to test whether this was also the case in vitro—where the behaviour of full-length SPD-5 and of these domains of Cnn have been extensively studied, but never directly compared. We believe such a direct comparison will be of some interest to the field (the Woodruff et al., 2017 paper describing these in vitro SPD-5 condensates has been cited >700 times). We now also cite the Rios et al., 2025 paper but note that, despite extensive efforts, we were unable to purify enough well-behaved CDK5RAP2 for our experiments and so could not include it in this analysis. We think Rios et al., used an MBP-fusion of CDK5RAP2 in their experiments, which may explain this difference.

      The study uses different species without doing the same types of experiments on each. Sometimes human CDK5RAP2 is thrown in, sometimes not. They solve crystal structures of PReM from Cnn but not from the other proteins. This gets confusing, especially since the authors state that they seek to test if fly Cnn and worm SPD-5 assemble through different mechanisms (see last sentence of the intro). Also, if the focus is on worm vs. fly PCM assembly mechanisms, why include the human protein, especially Figure 8?

      On re-reading our original manuscript we appreciate this confusion. We hope that in re-writing the manuscript along the lines suggested by the Reviewer the logical flow of our experiments will be clearer.

      The conclusion that SPD-5's narrow PReM and "CM2" domains don't interact is consistent with the cross-linking mass spectrometry data from Rios et al. 2024. They showed only one X-link with low occurrence (1 out of 6 samples) between these two regions, even in the phosphorylated state (Fig. 1G). However, Nakajo et al (2022) claimed the opposite, showing that a larger PReM-containing construct (a.a. 272-732) interacts with a C-terminal construct (a.a. 1061-1198) after PLK-1 phosphorylation. Can the authors comment on this? Perhaps there is another site in SPD-5, outside of a.a. 541-677, that acts like the Cnn PReM?

      These are good points and we now mention this last possibility in the Discussion. We also now mention the supporting cross-linking Mass Spec data from Rios et al., 2024.

      I have serious doubts that the C-terminus of SPD-5 has a CM2 domain. To me, there is no real sequence homology with the traditional CM2's from humans and flies, and the AF3 predictions support this. Ohta et al. (2021) called this region "CM2-like" based on very poor homology, which a is questionable practice. Any coiled-coil region will appear somewhat homologous due to the heptad repeat pattern that defines them (e.g., leucines line up quite nicely). Thus, is it fair to say that SPD-5 doesn't assemble through a PReM-CM2 interaction? There may be a different region in SPD-5 that looks more like the canonical CM2. I think the authors have compelling evidence to give the C-terminal coiled-coil region in SPD-5 its own name rather than calling it CM2.

      This is a fair point, although the literature is already quite confusing on the nomenclature for the C-terminal region of SPD-5 (e.g., Ohta et al., JCB, 2021; Nakajo et al., JCS, 2022), so we are reluctant to add another name to the mix. Given that we draw comparisons with the fly and human CM2 domains (that are clearly related by sequence), we think it is easiest for readers if we use the “CM2” nomenclature throughout, although making clear our conclusion that SPD-5 “CM2” does not appear to function in the same way as fly/human CM2.

      Figure 3E. Would measuring scaffold mass be more appropriate? The PReM(deltaH1,NTH2) leads to more compact scaffolds, but maybe they assemble just as well as the deltaH1 mutant. As it stands, there is a discrepancy between panel E and F in terms of what is measured (area vs. intensity) and the outcome.

      In several previous papers we use fluorescence intensity to measure the “amount” of protein at centrosomes in vivo but, in our original paper (Feng et al., Cell, 2017), we quantified PReM::CM2 scaffold assembly in vitro by measuring the area of scaffold assembly. Thus, we prefer to present the current data in this way for consistency across publications, and we believe either measure is valid. We could measure the area and intensity of the PReM∆H1 and PReM∆H1∆NTH2 scaffolds to compare scaffold density, but we think this would unnecessarily complicate this data. The main point is not how much or how dense each scaffold is, but rather that the PReM∆H1∆NTH2 protein doesn’t really make a scaffold at all—but rather makes smaller “blobs” that tend to bunch together (further characterised in Fig.S2).

      Minor Comments:

      1. In one version of the PDF there are images missing in Fig 1F, 4C, 4D. I opened another version (source version) and the images were there. Just FYI.
      2. Figure 4A. The blue coloration makes it difficult to read the black letters.
      3. Figure 4A. Why is part of the protein colored in green? This coloration isn't defined, nor does it show up again in panel B.
      4. The layout of Figure 4 is confusing. It took me a few minutes to realize that the big red box inset belonged to panel B and not panel A.
      5. Figure 4C,D. The sample size is not mentioned in the legend.
      6. The title for Figure 4 seems too speculative. How can the authors say that phosphorylation relieves the autoinhibition without structural data?
      7. Figure 5B. The sample size is not mentioned in the legend.
      8. Figure 6B,D. The sample size is not mentioned in the legend.
      9. The text in Figure 7B is hard to read because it is too small. Please make this bigger.
      10. Figure 8C. What is colored in magenta? Is there an additional labeled protein besides mNG-CM2?
      11. Figure 8C. What is the sample size? How many images were taken? Also, why are there data points off to the right of the last column?
      12. The wording of these sections needs improving. I found them complicated and difficult to understand. We thank the Reviewer for taking the time to make these helpful comments. We have addressed all these points in the revised manuscript. On point 10, the magenta objects were fiduciary beads that were inadvertently included on this panel (and are no longer shown).

      Reviewer #3

      Major Comments: 1. The title, "Structural Insights into Mitotic-Centrosome Assembly," is overly broad. The study primarily focuses on CM2-PReM intramolecular interactions in D. melanogaster Cnn and does not comprehensively address mitotic centrosome assembly across species. A more specific title reflecting the fly-centric and structural focus would better align with the manuscript's scope and conclusions.

      As described at the start of our response to Reviewer #2, the title and focus of the manuscript have been extensively revised along these lines.

      The authors analyze condensate formation by Cnn and SPD-5 but overlook condensate formation by CDK5RAP2, which was recently reported by Rios et al. (2025, PMID: 40454523). Including CDK5RAP2 would enable a more balanced and informative comparison across fly, worm, and human homologs.

      As described in point 3 of our response to Reviewer #2, we now cite Rios et al., 2025 but note that, despite extensive efforts, we were unable to purify enough well-behaved CDK5RAP2 for our experiments and so could not include it in this analysis. We believe Rios et al., used a full-length MBP-fusion of CDK5RAP2 in their experiments, which may explain this difference as MBP is very good at keeping proteins soluble (but would not be appropriate in our experiments where we compare full-length untagged proteins).

      In Figure 3, reconstitution of Cnn scaffolds using purified CM2 and PReM fragments yields "macromolecular scaffolds," but their physical properties are not defined. It remains unclear whether these assemblies are ordered or amorphous, and whether they exhibit solid- or gel-like behavior. Moreover, the heterogeneous, scattering particles observed by negative-stain EM (Figure S3B), likely corresponding to the Cnn490-608-CM2 complex, raise the possibility of nonspecific aggregation rather than organized scaffold formation. Appropriate controls lacking CM2 are needed to exclude spontaneous aggregation of PReM fragments. In addition, testing shorter truncations of the PReM H2 helix could help define the minimal requirements for scaffold assembly. Finally, the rationale for including the CnnΔExPReM construct only in vivo (Figure 3F), but not in the in vitro assays (Figure 3A-E), should be clarified.

      We apologise, as our presentation of this data has clearly led to some confusion on these points.

      First, as we now clarify, the amorphous solid-like physical properties of the PReM::CM2 scaffolds were described in our previous paper where we also showed that these scaffolds are not simply non-specific aggregates—as several single point mutations that disrupt the LZ::CM2 tetramer also prevent PReM::CM2 scaffold assembly in vitro as well as Cnn scaffold assembly in vivo (see Fig.5, Feng et al., Cell, 2017). Also, in all in vitro scaffolding experiments we always perform a negative control (-CM2) to confirm that none of the scaffolds are aggregates of the PReM domain being tested. We don’t usually show this control now as there would be lots of empty black boxes on the Figures. We do, however, show this control for the human putative PReM domain (Figure 7C), as we are testing this here for the first time.

      Second, the request to test shorter truncations of the PReM H2 helix to define the minimal requirements for scaffold assembly is unnecessary as PReM∆H1∆NTH2 already cuts H2 at the start of the LZ, and we previously showed the LZ is required for PReM::CM2 scaffold assembly in vitro (Feng et al., Cell, 2017). Thus, any further truncation of H2 will start to remove the LZ, which we already know is essential. We have now made this point more clearly.

      Finally, the Cnn∆ExPReM construct the Reviewer mentions was tested in both the in vitro (now Figure 2B) and in vivo (now Figure 2F) assays, but the labelling was confusing so this was not clear. We have now clarified this point.

      The coarse-grained (CG) simulation methodology is insufficiently described. Given that CG approaches sacrifice atomic detail and may oversimplify interactions, readers require more information to evaluate the model's reliability and limitations. A comparison with the framework used by Ramirez et al. (2024, PMID: 38356260) would be informative. It is also unclear why available crystal structures of WT and 2A Cnn (Figure 2C; Figure S4) were not used as simulation inputs, or why the structure of Cnn490-579 2E was not determined to complete the structural comparison.Furthermore, mutation of Ser567 and Ser571 to alanine markedly stabilizes the PReM domain (Figure 5C, D), implying that these residues maintain domain flexibility. Back-mapping CG models to atomic resolution could reveal the interactions altered by these mutations. The exclusive focus on double mutants (2A and 2E) is also limiting; analysis of single-point mutants at S567 or S571 would clarify whether both residues contribute equally or play distinct roles.

      We performed coarse-grained simulations because although they simplify atomic interactions and capture overall conformational dynamics, which is what we are trying to assess here (Fig.4C,D). We now clarify this point and provide more detail of our simulation methodology in the main text and Materials and Methods. We used the full helical hairpin (i.e., H2+H3+H4) prediction in these simulations—rather than the crystal structure of the partial helical hairpin (i.e., H2+most of H3)—as we reasoned that the presence of the full H3 and H4 might influence breathing, and the full helical hairpin (see Video S1) seems likely to be the relevant biological fold. As we now show (new Figure S5), and as discussed above, the 2E mutants do not behave well in vitro so we were unable to solve their structure. We agree that we could perform atomic resolution simulations to better understand how the 2A/E and single A/E mutations might suppress/enhance breathing, but we believe such an analysis is beyond the scope of the current manuscript and would distract from our main conclusions.

      The discussion lacks sufficient integration with prior studies and often presents conclusions without adequate citation. For example, the claim that flies and humans rely on related PReM-CM2 interactions whereas worms use distinct phosphorylation-regulated mechanisms is not supported by appropriate references. In addition, limited cross-referencing to the manuscript's own data weakens the connection between results and conclusions. Expanding and better grounding the discussion in existing literature would significantly enhance its depth and clarity. We thank the Reviewer for this general point and have tried to better integrate our results with prior studies—particularly in the Discussion section.

      Minor Comments: 1. In Figure 1B, the molecular weight units for the protein marker are missing and should be included. Fixed.

      In Figures 1E and 1F, readability would be improved by including x-axis labels on all graphs, rather than only on the bottom panels.Fixed. The protein structures shown in Figures 2C and 2D sh7w b b∫ybb ould be explicitly labeled as dimers to avoid confusion. Fixed. In Figures 3A-D, using fluorescently labeled CM2 would help validate both the interaction with the PReM domain and its localization within the scaffold.We have previously tried fluorescently tagging the CM2 domain, but scaffold formation is much less robust. We do not think this invalidates this assay, as the evidence supporting the PReM::CM2 interaction is very strong—including assessing the physiological influence of multiple point mutations in both domains in residues at the heart of the interaction interface identified by crystallography (e.g., see Fig.4, Feng et al., Cell, 2017).

      In Figure 3E, no statistical comparisons are presented between the original PReM construct and other samples. In addition, information regarding sample size and the number of experimental replicates is missing from the figure legend. Fixed. In Figure 3F, the absence of a pixel intensity scale bar makes the data difficult to interpret, as color values corresponding to high and low signal intensities are unclear. Moreover, no additional centrosome marker is included, nor is there evidence that PReM fragment expression levels are comparable across samples. These concerns also apply to Figures 4C and 4D.We now include pixel intensity scales in all relevant Figures. We think we do not need to show additional centrosome markers in our images as centrosomes exhibit a very reproducible behaviour in these embryos so we can be very confident that the objects we show here are genuine centrosomes. Considering expression levels, the images in Fig.4C,D (now 3C,D) are derived from stable transgenic lines so we can measure protein expression levels and show that the 2A and 2E mutants are expressed at similar levels to WT (new Figure S6). The images in 2F are from mRNA injections, so cannot be quantified in this way. However, we have vast experience with this assay (used in >15 publications since 2014) and can tell when, very occasionally, an injected mRNA is not expressed well (as this leads to a lack of general fluorescence in the cytoplasm). In addition, we know that deletions in Cnn do not generally destabilise the protein as we have analysed many such transgenic lines (see, for example, Reviewer Figure 1). Thus, the differences in centrosomal levels observed and quantified in 2F are almost certainly not caused by differences in the stability of the proteins being generated from the injected mRNAs.

      In Figure 4A, the interacting residues of PReM and CM2 shown in the red inset would be clearer if residue annotations for each domain were displayed in distinct colors. Additionally, the legends for Figures 4C and 4D do not specify the scale bar length.Fixed. The authors state that interactions between CM2 and PReM-2A462-608 could not be detected in vitro based on SEC chromatograms (Figure 5A), yet the figure does not clearly show this result. The accompanying SDS-PAGE images are too small and lack lane labels, making interpretation difficult (a similar issue applies to Figure 7B). Furthermore, the SEC chromatogram x-axis lacks volume annotations, hindering correlation between chromatographic peaks and SDS-PAGE results (in contrast to Figure 7B, which provides an appropriate example).We thank the reviewer for these points, all of which have now been fixed/adjusted.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study by Mohamad et al. builds on prior work by Conduit et al. (2014, PMID: 24656740) and Feng et al. (2017, PMID: 28575671), which established the essential role of intramolecular interactions between the phospho-regulated multimerization (PReM) domain and centrosomin motif 2 (CM2) of Drosophila Cnn in pericentriolar matrix (PCM) expansion during mitosis. Extending these studies, the authors investigate the structural properties of Cnn's PReM and CM2 domains and compare them with homologous proteins in C. elegans (SPD-5) and humans (CDK5RAP2). Their analyses suggest a phosphorylation-dependent mechanism that relieves Cnn autoinhibition, with particular emphasis on Ser567 and Ser571 within the PReM domain. The authors further propose that, whereas Cnn and CDK5RAP2 share conserved CM2-PReM interactions, SPD-5 has diverged to employ distinct mechanisms for PCM scaffold assembly.

      Although these conclusions rely heavily on AlphaFold3-predicted models (Abramson et al., 2024, PMID: 38718835), they are supported by a combination of in vitro and in vivo experiments, including live-cell imaging and molecular dynamics simulations. However, inconsistencies between in vitro and in vivo observations weaken some interpretations and warrant more careful discussion. Addressing the concerns below would substantially strengthen the manuscript.

      Major Comments

      1. The title, "Structural Insights into Mitotic-Centrosome Assembly," is overly broad. The study primarily focuses on CM2-PReM intramolecular interactions in D. melanogaster Cnn and does not comprehensively address mitotic centrosome assembly across species. A more specific title reflecting the fly-centric and structural focus would better align with the manuscript's scope and conclusions.
      2. The authors analyze condensate formation by Cnn and SPD-5 but overlook condensate formation by CDK5RAP2, which was recently reported by Rios et al. (2025, PMID: 40454523). Including CDK5RAP2 would enable a more balanced and informative comparison across fly, worm, and human homologs.
      3. In Figure 3, reconstitution of Cnn scaffolds using purified CM2 and PReM fragments yields "macromolecular scaffolds," but their physical properties are not defined. It remains unclear whether these assemblies are ordered or amorphous, and whether they exhibit solid- or gel-like behavior. Moreover, the heterogeneous, scattering particles observed by negative-stain EM (Figure S3B), likely corresponding to the Cnn490-608-CM2 complex, raise the possibility of nonspecific aggregation rather than organized scaffold formation. Appropriate controls lacking CM2 are needed to exclude spontaneous aggregation of PReM fragments. In addition, testing shorter truncations of the PReM H2 helix could help define the minimal requirements for scaffold assembly. Finally, the rationale for including the CnnΔExPReM construct only in vivo (Figure 3F), but not in the in vitro assays (Figure 3A-E), should be clarified.
      4. The coarse-grained (CG) simulation methodology is insufficiently described. Given that CG approaches sacrifice atomic detail and may oversimplify interactions, readers require more information to evaluate the model's reliability and limitations. A comparison with the framework used by Ramirez et al. (2024, PMID: 38356260) would be informative. It is also unclear why available crystal structures of WT and 2A Cnn (Figure 2C; Figure S4) were not used as simulation inputs, or why the structure of Cnn490-579 2E was not determined to complete the structural comparison.

      Furthermore, mutation of Ser567 and Ser571 to alanine markedly stabilizes the PReM domain (Figure 5C, D), implying that these residues maintain domain flexibility. Back-mapping CG models to atomic resolution could reveal the interactions altered by these mutations. The exclusive focus on double mutants (2A and 2E) is also limiting; analysis of single-point mutants at S567 or S571 would clarify whether both residues contribute equally or play distinct roles. 5. The discussion lacks sufficient integration with prior studies and often presents conclusions without adequate citation. For example, the claim that flies and humans rely on related PReM-CM2 interactions whereas worms use distinct phosphorylation-regulated mechanisms is not supported by appropriate references. In addition, limited cross-referencing to the manuscript's own data weakens the connection between results and conclusions. Expanding and better grounding the discussion in existing literature would significantly enhance its depth and clarity.

      Minor Comments

      1. In Figure 1B, the molecular weight units for the protein marker are missing and should be included.
      2. In Figures 1E and 1F, readability would be improved by including x-axis labels on all graphs, rather than only on the bottom panels.
      3. The protein structures shown in Figures 2C and 2D should be explicitly labeled as dimers to avoid confusion.
      4. In Figures 3A-D, using fluorescently labeled CM2 would help validate both the interaction with the PReM domain and its localization within the scaffold.
      5. In Figure 3E, no statistical comparisons are presented between the original PReM construct and other samples. In addition, information regarding sample size and the number of experimental replicates is missing from the figure legend.
      6. In Figure 3F, the absence of a pixel intensity scale bar makes the data difficult to interpret, as color values corresponding to high and low signal intensities are unclear. Moreover, no additional centrosome marker is included, nor is there evidence that PReM fragment expression levels are comparable across samples. These concerns also apply to Figures 4C and 4D.
      7. In Figure 4A, the interacting residues of PReM and CM2 shown in the red inset would be clearer if residue annotations for each domain were displayed in distinct colors. Additionally, the legends for Figures 4C and 4D do not specify the scale bar length.
      8. The authors state that interactions between CM2 and PReM-2A462-608 could not be detected in vitro based on SEC chromatograms (Figure 5A), yet the figure does not clearly show this result. The accompanying SDS-PAGE images are too small and lack lane labels, making interpretation difficult (a similar issue applies to Figure 7B). Furthermore, the SEC chromatogram x-axis lacks volume annotations, hindering correlation between chromatographic peaks and SDS-PAGE results (in contrast to Figure 7B, which provides an appropriate example).

      Significance

      This work will be of interest not only to cell biologists studying centrosomes, but also to molecular biologists investigating how protein modifications regulate protein behavior.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Mohamed et al. set out to compare the assembly mechanisms of pericentriolar material (PCM) in flies and nematodes. They reveal that the main PCM scaffold protein in each species (Cnn in flies, SPD-5 in nematodes) are sufficient to form supramolecular droplets (with a crowding agent) or networks (without a crowding agent). However, they diverge in one key aspect: Cnn scaffold assembly relies on the interaction between a C-terminal CM2 domain and a central phospho-regulated domain (PReM), whereas SPD-5 does not. The authors solve the crystal structure of a region within Cnn's PReM. With the help of modeling, they speculate that this region is auto-inhibited through backfolding of alpha helices, thus preventing its interaction with the CM2 domain. This auto-inhibition would be relieved by phosphorylation, which modeling suggests would increase "breathing" of the backfolded structure. The author end by presenting evidence to suggest that the human PCM scaffold protein CDK5RAP2 may assemble through a PReM-CM2 interaction.

      Major Comments:

      1. The title is too vague. Any number of existing papers could be said to provide "structural insights into mitotic centrosome assembly". The authors need to narrow down to a defined conclusion and state this as the title.
      2. I think the strongest and most novel aspects of this study relate to the mechanism of Cnn assembly via relief of the auto-inhibited PReM. The effort to elucidate assembly mechanisms of SPD-5 and CDK5RAP2 are comparatively light and there are no accompanying experiments in worms or human cells. Without the in vivo experiments, it's hard to know if the in vitro experiments are valid. It's speculative for the authors to say they found the true PReM for CDK5RAP2; they do not demonstrate that PLK-1 phosphorylation potentiates assembly in Figure 8. Thus, I suggest re-writing the paper to focus on Cnn. Experiments in Figure 6 are still valid if reframed. For example, substituting Cnn's CM2 with the CM2 from CDK5RAP2 vs. the C-term of SPD-5 illustrates that a simple coiled-coil with open ends (H.s.CM2) is sufficient to interact with PReM whereas a coiled-coil with a closed end (SPD-5 C-term, predicted by Figure 6A) cannot.
      3. The purpose of Figure 1 is unclear. None of the other figures examine SPD-5 and CNN in the condensate form, which required using 4% PEG in this paper. The other assays look at the network form, which could behave differently and have different dependence on specific domains. I think they should perform the condensate assay for all other figures, otherwise leave it out. Furthermore, CDK5RAP2 is mentioned, yet not examined in Figure 1. It must be noted that CDK5RAP2 will also condense into droplets under crowding conditions or with a synthetic nucleator (Rios et al., 2025 J Cell Sci). Thus, it seems that condensation potential is a universal feature of known PCM scaffold proteins.
      4. The study uses different species without doing the same types of experiments on each. Sometimes human CDK5RAP2 is thrown in, sometimes not. They solve crystal structures of PReM from Cnn but not from the other proteins. This gets confusing, especially since the authors state that they seek to test if fly Cnn and worm SPD-5 assemble through different mechanisms (see last sentence of the intro). Also, if the focus is on worm vs. fly PCM assembly mechanisms, why include the human protein, especially Figure 8?
      5. The conclusion that SPD-5's narrow PReM and "CM2" domains don't interact is consistent with the cross-linking mass spectrometry data from Rios et al. 2024. They showed only one X-link with low occurrence (1 out of 6 samples) between these two regions, even in the phosphorylated state (Fig. 1G). However, Nakajo et al (2022) claimed the opposite, showing that a larger PReM-containing construct (a.a. 272-732) interacts with a C-terminal construct (a.a. 1061-1198) after PLK-1 phosphorylation. Can the authors comment on this? Perhaps there is another site in SPD-5, outside of a.a. 541-677, that acts like the Cnn PReM?
      6. I have serious doubts that the C-terminus of SPD-5 has a CM2 domain. To me, there is no real sequence homology with the traditional CM2's from humans and flies, and the AF3 predictions support this. Ohta et al. (2021) called this region "CM2-like" based on very poor homology, which a is questionable practice. Any coiled-coil region will appear somewhat homologous due to the heptad repeat pattern that defines them (e.g., leucines line up quite nicely). Thus, is it fair to say that SPD-5 doesn't assemble through a PReM-CM2 interaction? There may be a different region in SPD-5 that looks more like the canonical CM2. I think the authors have compelling evidence to give the C-terminal coiled-coil region in SPD-5 its own name rather than calling it CM2.
      7. Figure 3E. Would measuring scaffold mass be more appropriate? The PReM(deltaH1,NTH2) leads to more compact scaffolds, but maybe they assemble just as well as the deltaH1 mutant. As it stands, there is a discrepancy between panel E and F in terms of what is measured (area vs. intensity) and the outcome.

      Minor Comments

      1. In one version of the PDF there are images missing in Fig 1F, 4C, 4D. I opened another version (source version) and the images were there. Just FYI.
      2. Figure 4A. The blue coloration makes it difficult to read the black letters.
      3. Figure 4A. Why is part of the protein colored in green? This coloration isn't defined, nor does it show up again in panel B.
      4. The layout of Figure 4 is confusing. It took me a few minutes to realize that the big red box inset belonged to panel B and not panel A.
      5. Figure 4C,D. The sample size is not mentioned in the legend.
      6. The title for Figure 4 seems too speculative. How can the authors say that phosphorylation relieves the autoinhibition without structural data?
      7. Figure 5B. The sample size is not mentioned in the legend.
      8. Figure 6B,D. The sample size is not mentioned in the legend.
      9. The text in Figure 7B is hard to read because it is too small. Please make this bigger.
      10. Figure 8C. What is colored in magenta? Is there an additional labeled protein besides mNG-CM2?
      11. Figure 8C. What is the sample size? How many images were taken? Also, why are there data points off to the right of the last column?
      12. The wording of these sections needs improving. I found them complicated and difficult to understand.

      "Fly and worm Spd-2/SPD-2 and Polo/PLK-1 are clear homologues, but Cnn and SPD-5 share little sequence homology-although they are both predicted to be large coiled-coil-rich proteins. Thus, it remains unclear whether these two, largely unrelated, molecules form mitotic-PCM scaffolds that assemble and function in a similar manner"

      "We first focused on Drosophila Cnn as, although the full structure of the original PReM domain (Cnn403-608) is unknown, this domain contains an internal leucine-zipper (LZ) dimer (Cnn490-544) whose crystal structure, in a tetrameric complex with a CM2 dimer, had been solved (Figure 2A) (Feng et al., 2017)."

      "When the full PReM and CM2 domains are mixed in vitro, they form large micron-scale assemblies and point mutations that perturb the LZ::CM2 tetramer perturb PReM::CM2 scaffold assembly in vitro and Cnn scaffold assembly in vivo."

      Significance

      Overall Assessment:

      While I find the premise of this study to be interesting, its execution and presentation are not fully convincing. The study is a collection of experiments connected by a thread that can be difficult to follow. One concern is the lack of focus and a clearly stated conclusion, which is ultimately embodied by the vague title. For example, the research question at the beginning doesn't match with the outcome in the end. At the end of the introduction, the authors state they wish to compare assembly mechanisms of Cnn and SPD-5. However, at the end of the results, they present data on CDK5RAP2 and speculate on its assembly. Why introduce the human protein here? Another concern is the lack of symmetry in the experiments. There is much more in vitro characterization of Cnn than SPD-5 or CDK5RAP2, and all in vivo work is performed in flies. Finally, this study does not address if the best-established model for SPD-5 assembly-multimerization via specific, multivalent coiled-coil interactions-applies to fly Cnn. Thus, to me, this is study is a deeper dive into the mechanism of Cnn assembly, not necessarily a fair cross-species comparison. I do not have major issues with the results, but I recommend that this paper undergo significant re-writing before being re-reviewed. There are also issues with data display and reporting of experimental details (e.g., sample sizes) that should be easily fixed.

      Advance: this study provides new insight into how two specific domains interact within PCM scaffold proteins to promote scaffold assembly. It provides some new structural insight into the mechanism of Cnn auto-inhibition. However, there is limited conceptual advance, as the bigger ideas (e.g., auto-inhibition as a regulatory control, PCM scaffold assembly through condensation of coiled-coil proteins) were already established.

      Audience: this study will be of interest to cell biologists studying centrosome assembly, mitosis, and evolution.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Mohamad et al. investigates the structural basis and regulatory role of phosphorylation in the assembly of the mitotic pericentriolar material (PCM) scaffold, which nucleates microtubules and organizes the poles of the mitotic spindle. They use structure determination, biochemical reconstitution and in vivo experiment in flies to address how fly, worm, and human homologs of a key scaffold protein (Cnn, SPD-5, and CDK5RAP2, respectively) are relieved from auto-inhibition in a phosphorylation-dependent manner to form extended scaffolds through interactions between PReM and CM2 domains. An important discovery is a helical hairpin structure in the PReM domain that is the basis of autoinhibition and is regulated by phosphorylation. The work addresses the fundamental question how the centrosome matures in preparation for mitosis, by increasing the size and activity of the PCM scaffold that surrounds the centrioles. It also addresses how conserved the underlying molecular mechanism are among flies, worms, and humans. The study is overall of high quality, building on previous works by the authors and other groups, and adding new structural and biochemical insight. Most of the conclusions are supported by the data. I have a few concerns though that should be addressed. An important issue is the analysis of phosphorylation sites, which appears incomplete. For example, it lacks demonstration that both of the two studied phosphorylation sites are indeed phosphorylated. Kinase motif identification and mutation is not sufficient, considering that phosphorylation is integral to the proposed model of how autoinhibitory intra-molecule interactions are relieved, and considering that phospho-mimetics have not been tested in vitro and function poorly in vivo.

      Main:

      1) From previous studies, it seems to me that for the residues potentially relevant for the hairpin regulation there is direct evidence of phosphorylation only for S567 (mass spec, phospho-antibody). Have the authors tested single site mutants (S567A and E)? Also, have they tested D mutations? If so, this should be commented on and shown. If not, it should be tested, in particular since the 2E phospho-mimetic is not functioning properly in vivo. If S571 is indeed crucial, it should be demonstrated that it is also phosphorylated. Otherwise it is possible that the mutation of this residue simply impairs important interactions (e.g. PReM-CM2, others), independent of phosphorylation.

      2) It is unclear why in vitro only A mutations have been tested and not phospho-mimetics. This should be tested for the interaction between PReM and CM2. This would allow to probe the model that phosphorylation opens the hairpin to allow interaction. Currently, such proof is missing in the study. Alternatively, the authors could phosphorylate the recombinant protein in vitro. The in vivo data is harder to interpret due to the complexity of the model and the authors should take advantage of the in vitro system.

      3) Regarding the worm PReM and CM2 domains, the authors mention that they have tested in vitro phosphorylation by PLK-1, but I could not find any data showing this. They should demonstrate successful phosphorylation or test candidate site by phospho-mimetic mutation. It is possible that the worm proteins depend more strongly on phosphorylation to relieve autoinhibition compared to the fly proteins.

      Minor:

      4). Fig. 6C, D: the labeling of the chimeric constructs using "+" symbols is confusing, since it suggests that separate proteins were expressed. If I understand this correctly, with the current labeling, deltaCM2+DmCM2 means WT? The authors should write the full name of the wildtype or chimeric construct in each case and use a more standard/less confusing nomenclature. Also, I suggest to start the panels and graphs with the WT sample.

      Significance

      The study's strength is the use of a combination of structural and biochemical approaches with in vivo model testing. Its main limitation is that the analyses of the role of phosphorylation lacks depth and is not fully conclusive, despite its importance for centrosomal scaffold assembly. The study advances our understanding of centrosomal scaffold assembly and maturation at a molecular level, and how specific molecular aspects of these processes are conserved or differ among different organisms. The findings are of interest to cell biologists. My expertise is in centrosome and microtubule biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      _Overall, we were encouraged by the comments of the reviewers, who mostly agreed that the study advance our understanding of two component system signaling mechanisms. The most substantive critique raised was the lack of mechanistic insight into the specific binding sites of Cu and NO on the PdtaS protein and the lack of examination of additional ligands such as cyclic di-GMP and zinc. We agree with this critique and cannot, and did not, make specific statements about the location of ligand binding. However, w_e draw a clear distinction in the manuscript between the functional effects of a chemical entity (ligand) on kinase activity and knowledge of the precise binding site of that ligand on the protein. As acknowledged, we did not determine the binding site. However, we do demonstrate the functional effect of the ligands, and these effects cannot occur without physical interaction between the ligand and the protein, so we believe the statement that the ligands are having this effect through binding is accurate, without knowledge of the precise location of that binding. ____

      Reviewer 1:

      • The primary concern pertains to ligand recognition by PdtaS. While PdtaS constitutive autophosphorylation is shown to be dependent on dimerization, there is no direct evidence of ligand binding. How Cu and NO inhibits PdtaS activity remains uncharacterized. Is it unclear if there are specific binding pockets inducing PdtaS conformational switch, if both substrates compete for a single binding pocket, or if Cu and NO inhibit dimerization by binding to the dimer interface. Similarly, it is unclear if NO does not covalently modify the key cysteine residues by S-nitrosylation, nor if Cu induces a distinct and reversible thiol-switch by site-specific oxidation that regulates PdtaS dimerization and activity.
      • __Response: Our discussion already contained this sentence: "Although we identify mutations with both positive and negative effects on dimer affinity which have effects of ligand inhibition of the kinase, this data does not identify specific molecular details on how dimerization is inhibited and whether Cu and NO both interact with the same regions of the dimer interface." We have added a sentence acknowledging that we do not determine whether NO is covalently modifying a thiol (line 343). __

      • Given the focus on ligand effects on PdtaS dimerization and activity, zinc and c-di-GMP should also be considered, as prior studies have suggested they may be sensed by PdtaS. Similarly, given the claim of multiligand sensing, it would be valuable to examine the combined effects of NO and Cu. Do they act additively, synergistically, or interfere with each other?

      • Response: We have added data to the manuscript examining the effect of c-di-GMP on kinase activity in combination with Cu. We do not observe a substantial synergistic effect. This new data is now Figure S3.

      • PdtaS variants and mutants are neither introduced nor adequately described. For example, in lines 144-150, PdtaS-H303Q and G443 are mentioned without citation, and their construction is not described in the Materials and Methods section. As a result, it is difficult to determine which experiments and constructs are specific to this manuscript. Please provide a detailed Materials and Methods section, and include as supplementary material a complete list of all strains, primers, and constructs used in this study, along with their origins.

      • __Response: We have added a section to the methods detailing the construction of the PdtaS mutant protein expression plasmids. __

      • References: Xing J et al 2023 is duplicated. Please correct in the text and in the references list.

      • Response: We have deleted the duplicate reference

      • Please provide molecular weights on gels (fig. 1C, D, E, 2A, 5C, D, 7A). Please provide incubation time for kinase reactions in figure legends (e.g. Fig 1C, D, E, ...).

      • __Response: All of these incubation times are included in the materials and methods. We ____will add to the figure legends depending on journal style. We have added selected MW numbers to the MW markers in 1D,E, 5D,7A.

      __

      • Please indicate whether representative experiments are shown, and specify the number of replicates performed for each assay (e.g. Fig 1C, D, E, ...). This information is essential for assessing the reproducibility and robustness of the findings.
      • __Response: We are somewhat confused by this comment. For each claim made about quantitative effects, we include a quantitation panel that contains experimental replicates (1F, 2D, 4D, 5C, 7B). For MST graphs, we state the number of replicates for each time point. __

      • Please clarify the discrepancy in Figure 2A regarding the calcium concentration used. The results section (line 163) refers to 10 µM, whereas the figure legend (line 393) states 1 mM.

      • __Response: We have corrected the figure legend to 10____m __

      • Figure 2A should include zinc, as previous work by the authors has shown that zinc directly inhibits the kinase activity of PdtaS. It would also be informative to test c-di-GMP in Fig. 2, given that c-di-GMP has been described to binds PdtaS (PMID: 33772870), and that c-di-GMP binding at dimer interfaces has been demonstrated in transcription factors (e.g., PMID: 25171413).

      • __Response: We did not test zinc because in our prior studies the effects of Zn and Cu were identical. We have tested c-di-GMP as noted above (see new Fig S3). __

      • The interpretation in lines 206-207 is not convincing. PdtaS homologs may differ in ligand specificity, precluding the presence of a conserved ligand-binding cavity but not of a specific ligand binding cavity in the GAF/PAS domains. Functional divergence of the binding site can occur, and this possibility should be acknowledged.

      • Response: We are somewhat confused by this comment. This is the sentence in question: "____This analysis suggests that the PdtaS kinase family has evolved to conserve the dimerization interface, shown above to be important for autokinase activity, but that the putative ligand binding domains do not have a conserved ligand cavity, arguing against a specific ligand that binds in the GAF or PAS pocket in this family of histidine kinases." The sentence does not argue that there is no ligand binding in the GAF PAS cavity, only that the cavity is not conserved, and this argues against a single ligand. To clarify this point, we will insert the word "single" before "specific ligand"

      Reviewer 2:

      • The dimer model is consistent with trans phosphorylation, but I did not see model quality described, especially in the H303-ATP binding interface. Can the authors provide AlphaFold PAE and pLDDT scores?
      • __Response: We have added a SI figure (Figure S4) with this data. __

      • Although the effect of Cu and NO on the two mutant PdtaS is clear, why the WT activity in Fig. 2A is not also inhibited is not obvious to me, especially since WT dimerization is affected by Cu and NO (Fig. 2B, C). Is there also cis-autophosphorylation that masks reductions in trans phosphorylation? Is the WT signal saturated on this autorad?

      • __Response: This assay, as noted in the figure legend, was done with 10_m_M Cu. This dose is only mildly inhibitory to the wild type kinase, as demonstrated in Figure 5D-E, which clearly demonstrates Cu inhibition. We don't have an explanation for why the trans phosphorylation mutant pair is inhibited by lower doses of Cu. It is possible this reflects some cis-autophosphorylation, but the strong inhibition of trans autophosphorylation is consistent with our model. __

      • The two Cys residues in PdtaS were previously found to affect kinase activity. Here, the authors show they also modestly affect dimerization. Since ~1/3 of mycobacteriales have both Cys, a double mutant would have been interesting for the in vitro characterization (it is used in live bacteria in Fig. 7A) and might show a more pronounced effect (not critical).

      • __Response: Although we agree, we attempted to purify the double cysteine mutant from * coli* but unable to due to insolubility, so we were unable to test the protein. __

      • Although competition data and structural model clearly indicate trans phosphorylation, some cis-phosphorylation can probably not be ruled out, especially since the dimer mutant H67A shows some activity. Although that mutation does not seem to fully disrupt the dimer, the H67A activity could be indicative of some cis-phosphorylation.

      • __Response: The H67A mutant is a dimerization mutant that weakens, but does not completely disrupt the dimer. This mutant cannot be used to distinguish cis vs trans phosphorylation and therefore we cannot rule out a mixture of cis vs trans autophosphorylation. The data in figure 1 argues for trans phosphorylation being the dominant mechanism. __

      • The Cys residues destabilize the dimer, and mutating the Cys stabilizes it, even canceling out the effect of the chemical destabilizers Cu and NO. In Fig. 4A, it looks like all Cys are too far apart to form disulfides, but Cu2+ can cause formation of disulfides. Can the authors comment on the distance of the Cys and the likelihood that disulfides have a role in this mechanism? If this were plausible, thiol-to-disulfide ratios with and without Cu could be directly measured. Although a bit more of a stretch, NO could also contribute to disulfide formation through ROS, and disulfides could be a way by which these two disparate ligands have the shared effect on activation shown here.

      • __Response: As stated above, we are not able to comment about whether direct modification of these cysteines is occurring. We do not believe the proximity of the cysteines would allow disulfide formation. __

      • The interdomain mutation Arg261Ala is quite nice and shows a specific effect on activity, but not dimerization, indicating that this interdomain ion bond somehow transfers the dimerization signal from the GAF to the PAS domain. Were there any other interdomain bonds? For completeness, was the basal autophosphorylating or phosphotransfer activity to PdtaR affected by the 261 mutation?

      • __Response: We did not detect any other bonds in our modeling. The basal level of autophosphorylation of the R261A protein compared to WT is visible at the lower end of the Cu inhibition curve in 6E and is comparable. We did not observe a difference in autophosphorylation at 0 Cu in the gels supporting this curve. __

      • First sentence in the Discussion, wording: The study investigates kinase activation, not signal sensing in a strict sense.

      • __Response: We have edited this sentence. __

      • Although this is primarily a biochemical, mechanistic study, one or two sentences on the biological significance of PdtaS/R in M. tuberculosis in the Introduction would be nice

      • __Response: We believe these sentences in the introduction already satisfies this request: "PdtaS and PdtaR were implicated in the Rip1 pathway of * tuberculosis signal transduction by a genetic suppressor screen in which inactivation of either PdtaS or PdtaR reverted the copper and nitric oxide sensitivity of M. tuberculosis lacking rip1. Copper and NO directly inhibit the kinase activity of PdtaS, an inhibition that requires the N terminal GAF and PAS domains _[39]_, indicating that the GAF-PAS are necessary to transmit the inhibitory signal to the kinase domain." We would also note that although much of the data is biochemical, we test the in vivo relevance of our model using M. tuberculosis* strains carrying PdtaS mutations

      __

      Reviewer 3


      • Limited characterization and validation of dimerization measurements: (a) while MST is an established technique, the central thesis relies heavily on dimerization measurements using this single method. Given the importance of this finding, at least one additional orthogonal approach would strengthen the conclusions significantly. Analytic size exclusion chromatography (SEC) could be a very simple, accessible and reliable approach to address this core mechanistic question. By choosing the right size resolution separation matrix, the authors should be able to separate complete monomers, from partial complexes (e.g. dimers only held through the kinase domain) and full dimers (the species the authors expect for the constitutively active wt protein). Ready advantage of having the wt protein can be taken, as well as several dimerization mutants (C53A, C57A, H67A), and presence/absence of cognate ligands (NO, Cu). For necessary reference standards, a dilution series should be able to reveal the elution position for wt monomers (and if this approach reveals to be difficult, mild chaotropic conditions can always be attempted, often times also pH shifts can do the job). Other techniques can point in the same direction as SEC, such as SAXS (best coupled to a SEC, or SEC-SAXS), native polyacrylamide gel electrophoresis, and/or dynamic light scattering.
      • __Response: We appreciate the reviewers' careful suggestions for additional experimental approaches. Although logical, we are unable to undertake them at this time and further exploration will hopefully be stimulated by our study. __

      (b) More importantly, additional techniques should be chosen such that a clear distinction can be made between two different scenarios, namely: that only the sensory domains (PAS/GAF) undergo ligand-triggered dissociation; or, instead, that the entire protein dissociates into separate monomers (i.e. including the kinase domains). This seems like an extremely important distinction, so that the proposed kinase-regulation mechanism is well understood/described. The first scenario would be less "disruptive" wrt previous paradigms (sensory domain dissociation could well be linked to a conformational rearrangement that allosterically inhibits kinase auto-phosphorylation).

      • __Response: We agree that this is an important distinction. We would note here that our prior data (Buglino et al eLife 2021) demonstrated that the isolated kinase domain of PdtaS is not inhibited by copper or NO, indicating that the effect of these ligands both requires the GAF-PAS and that the kinase dimer itself is not sensitive to ligand induced inhibition. This result does not directly address the reviewer's question, which is whether there is localized inhibition of dimerization in the GAF-PAS dimer, which, via an allosteric mechanism, inhibits phosphorylation by the kinase domain, which we have shown is in trans, without actual dissociation. We are not aware of a technique that could distinguish what would presumably be a type of allosteric localized dimer disruption from full dissociation. Our data clearly indicates that the kinase inhibition effect is mediated by the dimer dissociation effect on the GAF/PAS and full characterization of the effects of that on the kinase domain will await further studies outside of this paper. __
      • R261A mechanistic inconsistency: The manuscript shows that the R261A mutant has attenuated copper inhibition in vitro, albeit remaining functional in vivo (Figure 7B). While the authors acknowledge this suggests their "interdomain coupling model is incomplete or compensated by other mechanism in vivo," this significant discrepancy undermines confidence in the proposed mechanism and deserves more thorough investigation and/or discussion.
      • __Response: We thank the reviewer for this comment, which relates to the R261A mutant. We disagree that this result "undermines confidence in the proposed mechanism". We rigorously interrogated our in vitro findings by genetic complementation in M. tuberculosis cells using epitope tagged proteins and these results largely confirmed the model in that C53A, C53A/C57A, and H67A all inactivated signaling, as predicted from our model. R261 is the exception and, as we discuss, it indicates that we do not completely understand the in vivo determinants of coupling between the GAF-PAS dimer and the kinase domain, which is dependent on R261A in vitro. __

      Insufficient evidence about signal integration: While the authors argue this mechanism enables "integration of multiple inputs into the kinase without the constraints of specific ligand recognition" (lines 342-344), this appears conceptually flawed to me. The ligands (Cu and NO) must still be specifically sensed and bound somewhere on the protein to trigger dimerization disruption - the mechanism simply uses dimerization modulation as the output rather than the more typical allosteric conformational changes. The conservation pattern (interface > binding sites) may reflect selective pressure to maintain dimerization capability across the family, while individual species evolved different ligand specificities. The authors should clarify that their mechanism represents a novel output mode for ligand sensing rather than an alternative to specific ligand recognition, and discuss how this distinction affects their evolutionary interpretation.

      • Response: We thank the reviewer for this comment. The issue raised is the difference between ligand "recognition" and "sensing" with the former implying a specific binding site (which we acknowledge above and in the paper that we do not identify) and the functional output modified by ligands. Our data supports that dimerization is an important mechanism of sensing, but we do not claim that the dimer interface is the binding site for the ligands. We would note that the following sentences were in the reviewed version of the paper and we believe clearly make the exact distinction that the reviewer requests: __ Abstract: "These results indicate that a single bacterial kinase can __sense chemically diverse inputs through inhibition of dimerization dependent phosphorylation"

      Line 110: "Ligand binding pockets of GAF and PAS domains can bind a wide variety of ligands[38], but it remains to be determined whether multi-ligand sensing by PdtaS represents a manifestation of specific chemical recognition by the GAF-PAS domains or some other mechanism."

      Line 119: "Mutations in the GAF dimer interface that alter dimerization also impair multi-ligand sensing __of Cu and NO in vitro and in M. tuberculosis cells. Our findings establish a mechanism of __multi ligand sensing through alteration of sensor oligomeric state."

      Line 330: "Taken together, these data are consistent with a model in which modulation of dimer affinity is the sensing mechanism of the mycobacterial clade of PdtaS kinases, rather than specific recognition of Cu or NO by the ligand binding pockets of the GAF or PAS domains."

      __We have edited one instance (line 337) in the discussion where the use of "recognition" might have been misconstrued. __

      Minor Comments 1. The manuscript could better explain why PdtaS is described as "constitutively active" - the distinction between showing autophosphorylation activity in vitro versus true constitutive activity could be clearer. Can the authors show or refer to evidence of live constitutive PdtaR phosphorylation by PdtaS? (e.g. PhosTag electrophoresis gels of whole protein extracts and Western blotting revealed by anti-PdtaR; the use of NO and Cu can easily be used as inhibitors in such experimental setup).

      • Response: We thank the reviewer for this question. Our basis for claiming that the kinase is constitutively active, both for autophosphorylation and phosphotransfer to PdtaR, is the following:
      • __In the work of others and our prior work, PdtaS autophosphorylates without added ligand, which is contrary to most histidine kinases which are ligand activated. __
      • __PtdaS phosphorylates PdtaR without added ligand in vitro (see Figure 5C of this paper) __
      • __In terms of in vivo demonstration of PdtaR phosphorylation, this is very challenging in all response regulators given the unstable nature of the aspartate phosphorylation. We have been unsuccessful in visualizing PdtaR phosphorylation in vivo using phostag or western blotting. However, we note that our prior work demonstrated that mutation of the phosphoacceptor residue in PdtaR (D65A) phenocopied loss of both PdtaS and PdtaR (Buglino et al eLife 2021). __

      • Figure 5D shows some gel quality issues and also limited detail in the legend to know what exactly each panel represents and labels' definitions (e.g. "Ca" on the first lane, etc). The difference between wt and mutant is not clearcut to me, difficult from these data alone to derive a reliable Ki. Furthermore, the control gel on the bottom for the wt (I believe this is a cold control gel to see loaded quantities of protein on each lane?), seems to have less protein in the higher Cu concentrations.

      • __Response: The calcium lane is the divalent ion control as in the other figures. The legend of this figure refers to figure 1, which is more detailed and the identical assay. As noted in the methods, the phosphorylation signal is normalized to the total protein in the lower gel, so the lower amount of protein in these lanes in incorporated into the quantitation, which itself is derived from triplicate experiments, as noted in the legend. __

      Enough experimental detail should be included on figure legends so that the experiments are self-explanatory.

      • __Response: This is a journal style question that will be addressed depending on the identity of the eventual journal. __

      Lines 184-185 : they only refer to the fact that c-di-GMP binds to the GAF domain of PdtaS, yet the paper by Hariharan et al 2021 also shows that it activates PdtaS's autokinase activity. This should be double-checked and taken into account for the discussion of cognate ligands' effects.

      • Response: As noted above, we have added a supplemental figure (S3) that examines the effect of c-di-GMP on autokinase activity. Hariharan reported activation of PdtaS by c-di-GMP (Figure 6A of that publication). We do not see similar activation and do not observe an effect of cDG with Cu. Line 233: "Dimerization separation of function mutations" title is unclear

      • Response: Edited The structural model source (AlphaFold3) should be mentioned in the main text, not just figure legends. AF3-predicted models should be illustrated according per-residue pLDDT reliability indices (typically with a color ramp).

      • __Response: The paper contains this sentence:_ "_We performed bioinformatic analyses of the conservation of PdtaS across the Actinomycetota phyla and mapped this conservation onto the predicted full length PdtaS dimer structure predicted using AlphaFold". We have added the specific AlphaFold model (3) to this sentence. __

      • __We have also added an SI figure containing the pLDDT data (Figure S4). __
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Sankhe and collaborators, investigates the mechanism by which the Mycobacterium tuberculosis two-component system PdtaS/PdtaR senses copper and nitric oxide. The authors demonstrate that PdtaS is a constitutively active histidine kinase that autophosphorylates in trans, and that ligand-triggered inhibition occurs through disruption of dimerization rather than typical allosteric conformational changes of the dimeric species. Through phylogenetic analysis, mutagenesis, and biochemical assays, they show that conservation occurs primarily at the dimer interface rather than putative ligand binding sites, supporting a novel mechanism of multi-ligand sensing through modulation of oligomeric state. The experimental design is generally sound, with appropriate controls and multiple lines of evidence supporting the main conclusions. The trans-autophosphorylation experiments are particularly elegant and convincing. While there are some mechanistic concerns that should be addressed (particularly around R261A, and the actual dimerization extent/effect), the core findings are significant, and the work represents an important contribution to understanding bacterial signal transduction.

      Major Comments:

      1. Limited characterization and validation of dimerization measurements: (a) while MST is an established technique, the central thesis relies heavily on dimerization measurements using this single method. Given the importance of this finding, at least one additional orthogonal approach would strengthen the conclusions significantly. (b) More importantly, additional techniques should be chosen such that a clear distinction can be made between two different scenarios, namely: that only the sensory domains (PAS/GAF) undergo ligand-triggered dissociation; or, instead, that the entire protein dissociates into separate monomers (i.e. including the kinase domains). This seems like an extremely important distinction, so that the proposed kinase-regulation mechanism is well understood/described. The first scenario would be less "disruptive" wrt previous paradigms (sensory domain dissociation could well be linked to a conformational rearrangement that allosterically inhibits kinase auto-phosphorylation). Analytic size exclusion chromatography (SEC) could be a very simple, accessible and reliable approach to address this core mechanistic question. By choosing the right size resolution separation matrix, the authors should be able to separate complete monomers, from partial complexes (e.g. dimers only held through the kinase domain) and full dimers (the species the authors expect for the constitutively active wt protein). Ready advantage of having the wt protein can be taken, as well as several dimerization mutants (C53A, C57A, H67A), and presence/absence of cognate ligands (NO, Cu). For necessary reference standards, a dilution series should be able to reveal the elution position for wt monomers (and if this approach reveals to be difficult, mild chaotropic conditions can always be attempted, often times also pH shifts can do the job). Other techniques can point in the same direction as SEC, such as SAXS (best coupled to a SEC, or SEC-SAXS), native polyacrylamide gel electrophoresis, and/or dynamic light scattering.
      2. R261A mechanistic inconsistency: The manuscript shows that the R261A mutant has attenuated copper inhibition in vitro, albeit remaining functional in vivo (Figure 7B). While the authors acknowledge this suggests their "interdomain coupling model is incomplete or compensated by other mechanism in vivo," this significant discrepancy undermines confidence in the proposed mechanism and deserves more thorough investigation and/or discussion.
      3. Insufficient evidence about signal integration: While the authors argue this mechanism enables "integration of multiple inputs into the kinase without the constraints of specific ligand recognition" (lines 342-344), this appears conceptually flawed to me. The ligands (Cu and NO) must still be specifically sensed and bound somewhere on the protein to trigger dimerization disruption - the mechanism simply uses dimerization modulation as the output rather than the more typical allosteric conformational changes. The conservation pattern (interface > binding sites) may reflect selective pressure to maintain dimerization capability across the family, while individual species evolved different ligand specificities. The authors should clarify that their mechanism represents a novel output mode for ligand sensing rather than an alternative to specific ligand recognition, and discuss how this distinction affects their evolutionary interpretation.

      Minor Comments

      1. The manuscript could better explain why PdtaS is described as "constitutively active" - the distinction between showing autophosphorylation activity in vitro versus true constitutive activity could be clearer. Can the authors show or refer to evidence of live constitutive PdtaR phosphorylation by PdtaS? (e.g. PhosTag electrophoresis gels of whole protein extracts and Western blotting revealed by anti-PdtaR; the use of NO and Cu can easily be used as inhibitors in such experimental setup).
      2. Figure 5D shows some gel quality issues and also limited detail in the legend to know what exactly each panel represents and labels' definitions (e.g. "Ca" on the first lane, etc). The difference between wt and mutant is not clearcut to me, difficult from these data alone to derive a reliable Ki. Furthermore, the control gel on the bottom for the wt (I believe this is a cold control gel to see loaded quantities of protein on each lane?), seems to have less protein in the higher Cu concentrations.
      3. Enough experimental detail should be included on figure legends so that the experiments are self-explanatory.
      4. Lines 184-185 : they only refer to the fact that c-di-GMP binds to the GAF domain of PdtaS, yet the paper by Hariharan et al 2021 also shows that it activates PdtaS's autokinase activity. This should be doube-checked and taken into account for the discussion of cogante ligands' effects.
      5. Line 233: "Dimerization separation of function mutations" title is unclear
      6. The structural model source (AlphaFold3) should be mentioned in the main text, not just figure legends. AF3-predicted models should be illustrated according per-residue pLDDT reliability indices (typically with a color ramp).
      7. Ensure consistent reporting of replicate numbers across all experiments.

      Significance

      General assessment:

      The study provides elegant trans-autophosphorylation experiments and strong phylogenetic support for dimerization interface conservation. However, it relies heavily on MST as the sole method for measuring dimerization -the main finding in terms of novelty- and shows mechanistic inconsistencies (R261A functional in vivo despite attenuated inhibition in vitro).

      Scientific Advance:

      While the authors overstate novelty by claiming to bypass "specific ligand recognition" (ligands must still bind specifically to trigger dimerization disruption), the identification of dimerization modulation as the inhibitory output mechanism represents a meaningful advance. The work establishes an important framework for understanding how M. tuberculosis senses multiple host-derived stresses and may inform studies of other inhibitory sensor kinases.

      Target Audience:

      I believe this work will be of great interest to bacterial signaling researchers and M. tuberculosis pathogenesis specialists, with broader appeal to the microbiology community studying two-component systems and host-pathogen interactions. The dimerization-based mechanism may also attract structural biologists studying multi-domain sensor architectures.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This study explores the activation mechanism of a two-component system in M. tuberculosis, PdtaS/R. PdtaS/R is known to sense Cu and NO. Here, the authors show that PdtaS autophosphorylates in trans upon dimerization. PdtaS is constitutively active, and Cu and NO binding inactivate the kinase by preventing dimerization. The dimerization interface, but not the ligand binding domain, is conserved in PdtaS orthologs, and disruption of the dimer interface also disrupts ligand sensing in vitro and in live M. tuberculosis.

      Major comments

      This is an interesting and thorough analysis of the (in)activation mechanism of a TCS. Although much work has been done on such systems, this TCS is quite interesting as it is a rarer cytosolic, soluble system, because it has been shown to sense two chemically very different ligands- Cu and NO (and apparently also cdi-GMP)- and because it is constitutively active and inactivated by ligands, which is more unusual. A main strength of the paper is the identification of a range of mutants with specific effects on dimerization, activity, auto and substrate phosphorylation and GAF-PAS interactions to probe and parse the contribution of different aspects of the mechanism. The flow of the experiments is logical, and the data are generally clear, even though TCS phosphorylation is short-lived and can be tricky to capture. The heterodimer mixing experiments using WT, phosphoreceptor His-, and ATP binding mutants are clear and conclusively show trans phosphorylation. The control ruling out dimerization defects of the mutants is useful, and bioinformatic analysis of the GAF and PAS domains shows surprisingly clearly the conservation of the dimer interface, not the ligand binding site. The experiments showing the effects of dimerization on the activity of PdtaS are conclusive, with mutations showing stronger (Cys) or weaker (His67, already shown in a previous paper) dimerization and the expected effects on PdtaS activity. Testing some key mutants in live bacteria is another nice feature of the study that shows that in vitro findings (mostly) carry over to live bacteria, which is not always the case, and often just not tested. In sum, this is a solid, straightforward study on the activation mechanism of a more unusual M. tuberculosis TCS.

      Minor comments

      The dimer model is consistent with trans phosphorylation, but I did not see model quality described, especially in the H303-ATP binding interface. Can the authors provide AlphaFold PAE and pLDDT scores?

      Although the effect of Cu and NO on the two mutant PdtaS is clear, why the WT activity in Fig. 2A is not also inhibited is not obvious to me, especially since WT dimerization is affected by Cu and NO (Fig. 2B, C). Is there also cis-autophosphorylation that masks reductions in trans phosphorylation? Is the WT signal saturated on this autorad?

      Related: Although competition data and structural model clearly indicate trans phosphorylation, some cis-phosphorylation can probably not be ruled out, especially since the dimer mutant H67A shows some activity. Although that mutation does not seem to fully disrupt the dimer, the H67A activity could be indicative of some cis-phosphorylation.

      Some kinetic experiments would have been useful to gauge the timescale of these mechanisms (but not critical).

      The two Cys residues in PdtaS were previously found to affect kinase activity. Here, the authors show they also modestly affect dimerization. Since ~1/3 of mycobacteriales have both Cys, a double mutant would have been interesting for the in vitro characterization (it is used in live bacteria in Fig. 7A) and might show a more pronounced effect (not critical).

      The Cys residues destabilize the dimer, and mutating the Cys stabilizes it, even canceling out the effect of the chemical destabilizers Cu and NO. In Fig. 4A, it looks like all Cys are too far apart to form disulfides, but Cu2+ can cause formation of disulfides. Can the authors comment on the distance of the Cys and the likelihood that disulfides have a role in this mechanism? If this were plausible, thiol-to-disulfide ratios with and without Cu could be directly measured. Although a bit more of a stretch, NO could also contribute to disulfide formation through ROS, and disulfides could be a way by which these two disparate ligands have the shared effect on activation shown here.

      The interdomain mutation Arg261Ala is quite nice and shows a specific effect on activity, but not dimerization, indicating that this interdomain ion bond somehow transfers the dimerization signal from the GAF to the PAS domain. Were there any other interdomain bonds? For completeness, was the basal autophosphorylating or phosphotransfer activity to PdtaR affected by the 261 mutation?

      First sentence in the Discussion, wording: The study investigates kinase activation, not signal sensing in a strict sense.

      Although this is primarily a biochemical, mechanistic study, one or two sentences on the biological significance of PdtaS/R in M. tuberculosis in the Introduction would be nice

      Christoph Grundner

      Significance

      This is a mechanistic study on the (in)activation mechanism of an M. tuberculosis TCS with some unusual features: The kinase, PdtaS, is constitutively active and ligand binding inactivates it. It is a soluble system that binds multiple ligands, fewer of which have been described to date. While trans-phosphorylation and regulation by dimerization are not conceptually new, they were also not a given in this more unusual TCS. The authors identified relevant mutants at several steps of the activation mechanism to specifically probe the effect of dimerization, interdomain communication etc., and test their relevance in live bacteria, which goes beyond what comparable biochemical studies typically do. The authors have previously published some aspects of the PdtaS/R activation mechanism (inhibition by Cu and NO, Cys mutants). The conserved dimer interface suggests that many of the PdtaS orthologs are similarly regulated, and that different ligands can converge on this dimerization-dependent activation. Thus, the study could be relevant for the whole family and the range of ligands they likely sense. The study summarizes the findings in the idea that PdtaS activity relies on dimerization to react to divergent ligands rather than specific ligand binding to the GAF and PAS domains. This statement is perhaps too strong and that rather than either/or, it is likely both. Overall, this is a thorough biochemical study that reveals aspects of a more non-typical TCS activation mechanism that are of high interest to the Mtb and bacterial signaling field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors aim to characterize the mechanism regulating PdtaS activity, a histidine kinase of Mycobacterium tuberculosis responding to nitric oxide (NO) and copper ions (Cu). The PdtaS/PdtaR two-component system is atypical, with PdtaS being cytoplasmic and phosphorylating PdtaR in the absence of signal(s). Here, the authors characterize key residues involved in PdtaS dimerization necessary for PdtaS activity on PdtaR, in vitro and in M. tuberculosis. They show that both NO and Cu inhibit dimerization and thus PdtaS autokinase activity. They propose a model where changes in dimer affinity serve as the sensing mechanism allowing to integrate multiple signals without relying on specific ligand binding.

      Major comments:

      • The primary concern pertains to ligand recognition by PdtaS. While PdtaS constitutive autophosphorylation is shown to be dependent on dimerization, there is no direct evidence of ligand binding. How Cu and NO inhibits PdtaS activity remains uncharacterized. Is it unclear if there are specific binding pockets inducing PdtaS conformational switch, if both substrates compete for a single binding pocket, or if Cu and NO inhibit dimerization by binding to the dimer interface. Similarly, it is unclear if NO does not covalently modify the key cysteine residues by S-nitrosylation, nor if Cu induces a distinct and reversible thiol-switch by site-specific oxidation that regulates PdtaS dimerization and activity.
      • Since there is currently no direct evidence of ligand binding or residue modification, the conclusions drawn-particularly in the title and discussion-should be presented more cautiously, unless further structural, biochemical, or genetic data substantiate ligand binding.
      • Given the focus on ligand effects on PdtaS dimerization and activity, zinc and c-di-GMP should also be considered, as prior studies have suggested they may be sensed by PdtaS. Similarly, given the claim of multiligand sensing, it would be valuable to examine the combined effects of NO and Cu. Do they act additively, synergistically, or interfere with each other?
      • OPTIONAL. A significant limitation is the exclusive consideration of a single PdtaS conformation-the autophosphorylation-competent state. Histidine kinases typically cycle through at least three distinct enzymatic activities: phosphatase, autokinase, and phosphotransfer. Each of these functions relies on specific conformational states, which are often modulated by ligand binding. It is therefore important to investigate whether PdtaS also possesses phosphatase activity. Do ligands such as NO and Cu influence this activity-increasing phosphatase function, or simply inhibiting autophosphorylation and/or phosphotransfer? Moreover, does the monomeric or dimeric form of PdtaS exhibit phosphatase activity? In addition, the stability of phosphorylated PdtaR should be addressed, as it is crucial for understanding the overall dynamics and output of the signaling cascade.

      Minor comments:

      • PdtaS variants and mutants are neither introduced nor adequately described. For example, in lines 144-150, PdtaS-H303Q and G443 are mentioned without citation, and their construction is not described in the Materials and Methods section. As a result, it is difficult to determine which experiments and constructs are specific to this manuscript. Please provide a detailed Materials and Methods section, and include as supplementary material a complete list of all strains, primers, and constructs used in this study, along with their origins.
      • References: Xing J et al 2023 is duplicated. Please correct in the text and in the references list.
      • Please provide molecular weights on gels (fig. 1C, D, E, 2A, 5C, D, 7A).
      • Please provide incubation time for kinase reactions in figure legends (e.g. Fig 1C, D, E, ...).
      • Please indicate whether representative experiments are shown, and specify the number of replicates performed for each assay (e.g. Fig 1C, D, E, ...). This information is essential for assessing the reproducibility and robustness of the findings.
      • Please clarify the discrepancy in Figure 2A regarding the calcium concentration used. The results section (line 163) refers to 10 µM, whereas the figure legend (line 393) states 1 mM.
      • Figure 2A should include zinc, as previous work by the authors has shown that zinc directly inhibits the kinase activity of PdtaS. It would also be informative to test c-di-GMP in Fig. 2, given that c-di-GMP has been described to binds PdtaS (PMID: 33772870), and that c-di-GMP binding at dimer interfaces has been demonstrated in transcription factors (e.g., PMID: 25171413).
      • I am not convinced by the interpretation line 206-207. PdtaS homologs can have different ligand specificity, impling the conservation of a ligand cavity in the GAF/PAS domains.
      • The interpretation in lines 206-207 is not convincing. PdtaS homologs may differ in ligand specificity, precluding the presence of a conserved ligand-binding cavity but not of a specific ligand binding cavity in the GAF/PAS domains. Functional divergence of the binding site can occur, and this possibility should be acknowledged.

      Significance

      The manuscript demonstrates that PdtaS autokinase activity occurs in trans and that homodimerization is critical for its constitutive activity. While these findings extend previous work by the authors (PMID: 34003742)-which had already identified several key residues in PdtaS, including cysteines essential for NO and Cu sensing-they represent incremental advances. The mechanistic model proposed remains speculative without substantial additional experimental validation. The conclusions rely heavily on structural predictions from AlphaFold, representing only a single conformation corresponding to the autokinase-competent state. Crucially, the manuscript does not provide direct evidence for the mechanism of NO and Cu sensing. It also excludes the possibility of direct ligand binding-including untested candidates such as zinc and c-di-GMP-to a specific pocket, without experimentally addressing the hypothesis. These gaps significantly limit the mechanistic insight and overall impact of the study.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Velázquez et al. investigate the transcriptomic and epigenetic consequences of exogenous expression of the Ewing sarcoma fusion oncogene EWSR1∷FLI1 (EF1) in yeast. The study provides compelling evidence that EF1 can bind ETS transcription factor motifs, as well as a single 4xGGAA repeat within the S. cerevisiae genome, despite the absence of several canonical cofactors often implicated in EF1 biology, including CBP/p300 and Polycomb group (PcG) proteins. The authors further show that EF1 expression redistributes RNA Polymerase II toward EF1-bound ETS sites, yet. Strikingly, this relocalization is accompanied by only modest global transcriptional effects relative to those reported in human or insect systems. In addition, EF1 expression reverses repeat-associated epigenetic silencing of synthetic GGAA microsatellites in engineered reporter strains. Taken together, the data support the conclusion that GGAA microsatellite-mediated transcriptional rewiring is a comparatively conserved EF1 property, whereas broader transcriptomic changes arising from individual EF1-bound ETS sites appear to be more context-dependent. Overall, the manuscript is clearly written and logically organized, and the methodological descriptions and data-analysis details appear sufficient to enable reproducibility.

      Major comments

      None.

      Minor comments

      1. Additional citations in the Introduction. A small number of additional references would further support specific statements.
        • End of paragraph 3: consider adding Boulay et al. (2017).
        • Paragraph 4, after "proximal and distal genes": consider adding Tomasou et al. (2015) and Orth et al. (2022).
      2. Potential antibody cross-reactivity in the CoIP experiment. Is there an EWSR1 homolog in S. cerevisiae? To rule out unintended interactions, it would be helpful to exclude binding of endogenous yeast proteins by the anti-EWS antibody used for co-immunoprecipitation, for example via a homology search and/or appropriate specificity controls.
      3. Clarification of EF1-associated toxicity. The authors suggest that part of the observed transcriptional signal may reflect EF1 toxicity. It would strengthen the interpretation to characterize this phenotype more explicitly (e.g., growth rate over time, viability/cell death, or longer-term fitness effects). In addition, it would be informative to test whether toxicity depends on EF1 DNA-binding activity and whether truncation mitigates toxicity, as reported in Drosophila (Mahnoor et al., 2024).
      4. RNA Polymerase II relocalization versus limited transcriptional output. The apparent recruitment/repositioning of RNA Pol II in the absence of substantial transcriptional change is particularly interesting. This point could be strengthened by assessing RNA Pol II "states," for example using phosphorylation-state specific antibodies to distinguish stalled/paused from actively elongating polymerase.
      5. Promoter choice in the GGAA microsatellite reporter. The use of a modified constitutive promoter to monitor GGAA microsatellite-dependent activation is somewhat unconventional. Many studies (e.g., Gangwal et al., Hölting et al.) use minimal promoters to demonstrate EF1-dependent upregulation rather than reversal of heterochromatin-associated silencing. A brief rationale for the chosen design or discussion of how it relates to prior reporter paradigms would help readers contextualize the approach.
      6. Reconstituting minimal cofactor requirements for ETS-site activity. The finding that EF1 binding at single ETS sites does not translate into strong transcriptomic remodeling in yeast is an intriguing aspect of the work. The manuscript could be further enriched by systematic attempts to reconstitute candidate cofactors in this minimal system to define the minimal requirements for ETS-site-dependent gene activation.

      Significance

      This study will be of broad interest because it convincingly separates EF1-driven GGAA microsatellite-dependent activation from the canonical regulatory functions of FLI1, reinforcing the concept that fusion transcription factors can acquire genuinely neomorphic activities-activities that may ultimately be therapeutically actionable. Beyond the biological insight, the successful establishment of exogenous EF1 expression in yeast is itself a notable technical achievement, given the longstanding challenges associated with EF1 expression in heterologous settings. As presented here, yeast offers a valuable platform to interrogate EF1 function in a simplified and more controlled context.