7,132 Matching Annotations
  1. Jun 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Firstly, we would like to thank the reviewers for their time and efforts in critiquing this paper. The reviewers addressed our study to be significant, but also presented great suggestions to improve our manuscript, mainly the comparison of mRNA and eRNA for predicting subtype specificity and prognosis, the integration with independent validation datasets, etc. Our preliminary analyses showed that our classified mRNAs can predict subtypes better which is not surprising, as these subtypes were initially discovered using mRNA differences. Hence, we employed a novel approach of associating these classified mRNA and eRNA with distance and identified 71% classified eRNAs are associated with classified mRNAs. We also propose to integrate the datasets with PEGS (Briggs et al 2021) to achieve better mRNA-eRNA association and Perturb-seq validated regions to achieve functional validation of the eRNA loci. We believe that our potential improved integrative analyses will improve the novelty and power of our findings, as this is an unique approach which is employed in patient samples-based high resolution eRNA atlas for the first time. We have addressed most of the other major and minor comments of the reviewers and have provided the preliminary revised manuscript.

      Reviewer #1

      Evidence, reproducibility and clarity

      Summary<br /> This study assesses eRNA activity as a classifier of different subtypes of breast cancer and as a prognosis tool. The authors take advantage of previously published RNA-seq data from human breast cancer samples and assess it more deeply, considering the cancer subtype of the patient. They then apply two machine learning approaches to find which eRNAs can classify the different breast cancer subtypes. While they do not find any eRNA that helps distinguish ductal vs. lobular breast cancers, their approach helps identify eRNAs that distinguish luminal A, B, basal and Her2+ cancers. They also use motif enrichment analysis and ChIP-seq datasets to characterize the eRNA regions further. Through this analysis, they observe that those eRNAs where ER binds strongest are associated with a poor patient prognosis.

      Major comments:

      Part of the rationale for this study is the previous observation that eRNAs are less associated with the prognosis of breast cancer patients in comparison to mRNAs and they claim that the high heterogeneity between breast cancer subtypes would mask the importance of eRNAs. In this study, the authors solely focus on eRNAs as a classification of breast cancer subtypes and prognostic tool and do not answer whether eRNAs or mRNAs are a better predictor of cancer subtypes and of prognosis. Since the answer and the tools are already in their hands, it would be important to also see a comparative analysis where they assess which of the two (mRNAs or eRNAs) is a better predictor.

      Response: We appreciate the reviewer for this valid point about comparing the prognostic eRNAs vs mRNAs. Our study doesn’t imply that eRNA markers are better than mRNAs in predicting subtype specificity and/or prognosis, but our motivation for working with eRNAs is that they can be used to define relevant transcriptional regulators and prognosis generally if they are subtyped. As the molecular subtypes in breast cancers were established using gene expression datasets, mRNAs would perform better as predictors of subtypes and or prognosis. However, identifying regulatory networks with emphasis on transcription factor binding motif analyses is not achievable using mRNA datasets. Analysing the active enhancer regions with eRNA transcription will provide high resolution landscape of TF and epigenetic networks. These sorts of analyses usually require ATAC-seq or H3K27ac datasets, but these assays need fresh frozen tissue material and laborious experimental designs compared to RNA-seq datasets. Furthermore, eRNA-transcribing enhancers represent highly active enhancers, while ATAC and H3K27ac datasets can identify all enhancers, which can be inactive or poised, but captured due to the dynamic nature of enhancers. We demonstrate that traditional RNA-seq datasets mapped on active enhancer regions showing eRNA transcription would be sufficient to identify the highly active TF network and gene-enhancer regulatory frameworks in a subtype-specific manner, hence emphasising the potential of eRNA studies.

      Hence, the scope of our study is not to establish which RNA can predict subtype and survival, but to demonstrate the potential of studying eRNAs in patient samples using traditional RNA-seq assays. This study would be beneficial for epigenetics biologists of how enhancer transcription can be associated with gene regulation through deregulated transcription factor networks in patients. The above section had been included in the discussion in the revised manuscript.

      As the comparative analyses suggested by the reviewer will substantiate the potential of eRNAs being studied as cancer prognostic markers, we performed identical methodologies with our machine learning approaches on the published TCGA mRNA-seq datasets, identify the subtype-specific mRNAs as well as prognostic mRNAs and perform the comparative analyses of eRNAs and mRNAs. As we expected, mRNAs indeed perform better in associating with subtype specificity than eRNAs as we could identify more subtype-specific mRNAs with better statistics metrics. The results exhibit great separation across subtypes (Basal, Her2, LumA/B) as well as Ductal vs Lobular.

      We believe that eRNA and mRNA are complementary but not comparative to predict subtype-specific survival. To address this in the revised manuscript, we performed an initial selection of the eRNAs associated with their corresponding subtype-specific mRNAs within 50 kb distance which can be integrated with the above analyses, based on the suggestion from reviewer 3. In our preliminary analysis, around 71% of eRNAs are associated with the subtype-specific mRNAs and we also observed an observable separation of ductal and lobular subtypes using this method.

      Furthermore, we integrated our enhancer RNAs with the key enhancer regions which show significant impact on gene transcription, as shown in single cell CRISPRi screens (Perturb-seq) datasets derived from ATAC-matched H3K27ac datasets verified on one ER+ and one ER- breast cancer cell lines (Wang et al., Genome Biology 2025, https://genomebiology.biomedcentral.com/articles/10.1186/s13059-025-03474-0) . Our initial analyses identified at least 29 regions from the Perturb-seq datasets overlapping with 72 and 5 eRNAs of subtype classification and Her2 survival respectively.

      For the revised manuscript, we will perform the mRNA-eRNA association in a detailed manner and include the data. We will also employ our well-established tool for associating mRNAs and noncoding elements, Peak set Enrichment in Gene Sets (PEGS, Briggs et al., F1000 research, 2021 https://f1000research.com/articles/10-570/v2 ). We hypothesise that this will improve the power of the classification models used in the study and will also provide gene-enhancer RNA interaction landscape in patient samples for the first time. Furthermore, we will integrate the activity of these eRNA-mRNA pairs with chromatin accessibility and enhancer activity using ATAC-seq and H3K27ac ChIP-seq datasets to establish more robust active regulatory networks in patient samples. We will also perform motif analyses on the published ATAC-seq peaks (performed on TCGA-BRCA patient samples, Corces et al., 2018) close to the eRNA loci to identify the TF networks with better precision, hopefully unravelling novel and relevant subtype-specific TFs in an efficient manner, better than our original work. Furthermore, as an experimental functional validation of our classified eRNAs, we will investigate the regulatory effect of 29 Perturb-seq overlapped regions. Hence, our revised manuscript will potentially provide a comprehensive validated list of enhancer RNA regions which are highly active, actively transcribing, subtype and survival specific regulatory networks in breast cancer patients for the first time.

      The authors run the umaps of Fig. 1C only taking the predictor eRNAs. It is then somewhat expected to observe a separation. Coming from a single-cell omics field, what I would suggest is to take the eRNA loci and compute a umap with the highly variable regions, perform clustering on it and assess how the cancer subtypes are structured within the data. This would give a first overview of how much segregation and structure one can have with this data. Having a first step of data exploration would also strengthen the paper. If the authors have tried it, could the authors comment on it?

      Response: We appreciate the reviewer for sharing their experience from single cell omics analysis. In our case, following the scRNA like pipeline is not appropriate, given the focus of our study on identifying markers on the already annotated subtypes. Basically, we aim to assess the quality of the identified markers (the quality is quantified by the statistics provided for random forest classification), and we see that the data is well-separated in PCA using only PC1 and PC2. We showed the umap (using PC1 and PC2) for better visualization in the original manuscript and we included the PCA plots in the revised manuscript.

      'neither measures could classify any distinct eRNAs for invasive ductal vs lobular cancer samples' S1B. Just by eye, I can see a potential enrichment of ductal on the left and on the right while lobular stays in the center. This suggests to me that, while perhaps each eRNA alone does not have the power to classify the lobular vs ductal subtype, perhaps there is a difference - which could result from a cooperative model of eRNA influence - that would need further exploration. Would a PCA also show enrichments of ductal vs. lobular in specific parts of the plot? It may be worth exploring the PC loadings to see which eRNAs could play an influence. In this regard, a more unbiased visual examination, as suggested in my previous point, could help clarify whether there could be an association of certain eRNAs that cannot be captured by ML.

      Response: The subtypes of cancer patients (Basal, Her2, LumA/B) possess clear differences in mRNA expression in breast cancer studies. Given the fixed annotations of the subtypes in the patient datasets, we applied our methodologies on mRNA datasets, and the results exhibited great separation across subtypes (Basal, Her2, LumA/B) as well as Ductal vs Lobular. In addition, 70% of subtype-specific eRNAs are located next to mRNA. This ensures that we detected proper eRNA markers. Furthermore, Random Forest is the standard and powerful non-linear classifier for these types of classifying questions. Therefore, we hypothesized that the data which can distinguish Ductal vs Lobular does not exist in the used eRNA dataset. We only detected 38 subtype-specific mRNAs using information gain with standard cutoff 0.05 which they have classifying power across ductal-lobular. With this standard cutoff only one eRNA-associated gene was detected. To explore more, we used low cutoff for information gain (0.01) and then took only the eRNAs which are located near classified mRNAs (up to 50KB). In this way, we detected 96 eRNA candidates linked to 8 classified mRNAs. These 96 eRNAs could, to some extent, classify ductal vs lobular (PCA plots attached above). This observation can further verify that if a more comprehensive eRNA dataset exists, we could detect better eRNA markers and cover more (probably all) mRNA markers. Hence, cooperative model of eRNA as suggested by the reviewer can't be achieved and random forest is one of the efficient tools to decipher the cooperation if it exists. Besides, as we demonstrated in this paper that eRNA is a complementary dataset to mRNA which can assist in the identification of regulatory networks. For the revision, we will provide more detailed eRNA-mRNA associations using integration with PEGS and Perturb-seq validated regions, in both subtype classification and survival and will motivate the potential similar studies for ductal vs lobular in the discussion.

      "we employed machine learning approaches on 302,951 eRNA loci identified from RNA-seq datasets from 1,095 breast cancer patient samples from previous studies" - the previous studies from which the authors take the data [11,12] highlight the presence of ~60K enhancers in the human genome and they use less than that in their analysis. Could the authors please clarify the differences in numbers with previous studies and give a reasoning?

      Response: ~300K enhancers are derived from ENCODE H3K27ac datasets which represents all active enhancer regions marked by H3K27ac (Hnisz et al., 2013). This is a high-resolution map of eRNA loci ever presented. In Chen et al 2020, 1,531 superenhancers representing 30K eRNA loci was utilised for exploratory analysis, and the findings were generalised back to the 300K set. 65K enhancer loci covers tissue-specific enhancers initially identified by FANTOM CAGE datasets and this subset provide limited regions of eRNA expression. Hence, our analyses on ~300K eRNA loci provide unbiased information on subtype specificity and gene-TF regulatory networks. The differences had been highlighted in the methods and results in the revised manuscript.

      Also, from the methods section, they discard many patient samples due to low QC, so, from what I understand, the number of samples analyzed in the end is 975 and not 1,095.

      Response: We thank the reviewer for pointing this out and we have updated the numbers in the revised manuscript.

      Minor comments:

      Can the authors please state the parameters of the umap in methods? Although it could be intrinsic to the dataset, data points are grouped in a way that makes me think that the granularity is too forced. Could the authors please show how the umap would behave with more lenient parameters? Or even with PCA?

      Response: We used ‘umap’ function from umap package (with default parameters) in R using only PC1 and PC2, hence the granularity is not forced. As suggested by the reviewer, we have now added PCA plots in the main figures (Fig. 1E) and moved all the umap plots to the Supplementary figures (Fig.S1B) in the revised manuscript.

      'Majority of the basal' -> The majority of the basal.

      Response: We thank the reviewers for noticing the typo and we corrected this in the revised manuscript.

      Significance

      This is a paper relevant in the cancer field, particularly for breast cancer research. The significance of the paper lies in digging into the breast cancer samples, taking the different existing subtypes into account to assess the contribution of eRNAs as a classifier and as a prognostic tool. The data is already available but it has not been studied to this degree of detail. It highlights the importance of characterizing cancer samples in more depth, considering its intrinsic heterogeneity, as averaging across different subtypes would mask biology. My expertise lies in gene regulation and single-cell omics. My contribution will therefore be more focused on the analysis and extraction of biological information. The extent of its specific relevance in cancer research falls beyond my expertise.

      Response: We appreciate the reviewer for understanding our efforts to bring out the importance of subtyping and to explore the association of eRNA in breast cancer transcriptional gene regulatory networks.

      Reviewer #2

      Evidence, reproducibility and clarity

      Summary<br /> Enhancer RNAs (eRNAs) are early indicators of transcription factor (TF) activity and can identify distinct molecular subtypes and pathological outcomes in breast cancer. In this study, Patel et al. analysed 302,951 polyadenylated eRNA loci from 1,095 breast cancer patients using RNA-seq data, applying machine learning (ML) to classify eRNAs associated with specific molecular subtypes and survival. They discovered subtype-specific eRNAs that implicate both established and novel regulatory pathways and TFs, as well as prognostic eRNAs -specifically, LumA and HER2-survival- that distinguish favorable from poor survival outcomes. Overall, this ML-based approach illustrates how eRNAs reveal the molecular grammar and pathological implications underlying breast cancer heterogeneity.

      Major comments

      1. The authors define 302,951 eRNA loci based on RNA-seq data, yet it is widely known that many enhancers reside in proximity to promoters or within intronic regions (examples presented in Fig. 3B and S3). Consequently, it seems likely that reads mapped to these regions might not truly represent eRNA signals but include mRNA contamination. Could the authors clarify how they ensured that the identified eRNAs were not confounded by mRNA reads? What fraction of these enhancer loci is promoter proximal or intronic? How does H3K4me3, a well-established and standardized active promoter histone mark, behave on these loci? The reviewer considers it important to confirm that the identified eRNAs are indeed of enhancer origin rather than promoter transcripts.

      Response: For this study, we utilised pan cancer atlas-based published work (Chen et al 2018 and 2020) where the abundant RNA signals on intronic and intergenic regions are included, and promoter-based signals are excluded. These studies utilise the advantage of identifying eRNAs on large sample size and the possibility of mRNA being on introns in 1000s of patient samples is very low. A clarification of this concern had been discussed in the Introduction of these studies as follows: “because eRNA reads associated with real enhancer activity recurrently accumulate, whereas background transcription noise tends to occur stochastically. The large number of RNA-seq reads obtained would compensate for the statistical power compromised by the low eRNA expression level typically observed in a single sample.” We included clarification of this concern in the discussion. Furthermore, as per the reviewer’s suggestion, we examined the distribution of the eRNA loci across the genome and found that majority of eRNA regions are located on introns and intergenic regions. This figure had been included in the Supplementary Fig. S6A.

      2. In Fig. 1B, the F measure (0.540) of the Basal subtype using the Logmc method contradicts its extremely high precision (1.000) and sensitivity (0.890). The authors need to clarify the exact formula or method used to compute F1 and the discrepancy in the reported metrics for this subtype and perhaps other subtypes as well.

      Response: We apologise for the mistake in this section and thank the reviewer for pointing this out. We included the formulas for each statistical metric in the method section of the manuscript. The F-measure was mentioned wrong which led to the confusion here. The figure had been corrected with the F-measure of 0.94 in the revised manuscript.

      3. As shown in Fig. 4C, S4B, and most, if not all, tracks of Fig. S3, ER binding regions are not annotated as eRNA loci. It seems, in this reviewer's opinion, very unlikely that this is because they generally lack eRNA expression, but rather they do not express polyadenylated eRNA (typically 1D eRNA), which is captured in this dataset. The reviewer posits that these enhancers produce more transient, non-polyadenylated 2D eRNA. It has been widely documented in prior studies that ER-bound enhancers exhibit bimodal eRNA expression patterns [e.g., Li, W. et al. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 498, 516-520 (2013)]. Could the authors address this opinion and elaborate on how the restriction to polyadenylated transcripts might underrepresent enhancers regulated by ER and other TFs and whether this bias impacts the overall findings?

      Response: The authors appreciate the reviewer’s suggestion to address the caveats of using polyadenylated eRNAs to identify the ER binding patterns. TCGA eRNA atlas with polyadenylated eRNAs indeed possesses this disadvantage of using polyadenylated eRNAs for this study, however currently there are no data available with bidirectional transcripts in any breast cancer patient samples. The tools to profile these RNAs are not robust enough to be performed on frozen cancer tissue samples which are extremely limited in their size and availability. By utilising the polyadenylated eRNA-seq datasets, we might not only lose the accuracy of ER binding patterns, but also for other transcription factors which activate/associate with bimodal expression around enhancers. However, our integrative analysis on stable polyadenylated eRNA loci can still identify the most-relevant TF networks of each subtype.

      Furthermore, we validated this finding by analysing our own datasets of KAS-seq which represents any active transcribing bidirectional enhancers from MCF7 cell line. Independently, we also incorporated ATAC-seq, H3K27ac ChIP-seq, CAGE and GRO-seq data on the gene profiles in Fig. S3 to associate the eRNA regions identified in polyadenylated RNA datasets with ER binding sites in patients and published bidirectional transcripts in the preliminarily revised manuscript. We observed that all the ER binding sites are accompanied by open and active enhancer marks with bidirectional transcription (either GRO- or CAGE positive) but they are not on the exact location of eRNA regions. Subtype-specific eRNA regions close to genes like MLPH and XBP1 possess both active bidirectional transcribing ER bound sites far away (around 1.5 kb) from subtype-specific eRNA loci and bidirectional transcribing ER unbound sites. However, these distal ER binding sites are close to the regions from the list of 300K eRNA loci and they were simply not identified as subtype-specific regions. Hence, it can be true that the occupancy of ER might not be present on all subtype-specific eRNA loci, but our subtype-specific eRNA sites are representative of bidirectional transcription.

      Upon the suggestion from the reviewer, we discussed the potential of identifying TF networks by analysing the 1D eRNAs, in the revised manuscript.

      4. Despite the unsatisfied performance of the ML approach on classifying Her2 subtypes, the hierarchical clustering performed in Fig. 2A and S2A appears to show a reasonable separation of Her2 subtypes, showing as a clustered green band. Could the authors quantitatively assess how effective this clustering results and compare that to the ML outcome? (OPTIONAL)

      Response: The authors acknowledge this interpretation from the reviewers. Using both the measures, our ML platform can identify markers for Her2 subtype but some of the statistical metrics are poor. As the heatmaps were performed based on these identified Her2 markers, a separate analysis on this cluster would not be much informative. The poor metrics for Her2 classification was already justified, partly due to the low number of Her2+ patients in the cohort.

      5. In Fig. 4 and S4, the authors reported to have enriched binding or motif of TFs, e.g., FOXA1, AP-2, and E2A, specifically at enhancer loci with low eRNA level, which conflicts with their established roles as transcriptional activators. The reviewer asks for an address as to why these factors would be associated with basal low-eRNA regions and whether any additional data might clarify their functional role in these contexts.

      Response: The authors appreciate the reviewer’s concern, but we would like to clarify that eRNAs which are less expressed in basal subtype are classified as basal low. These regions show high expression in luminal patients. Hence, there is a strong overlap of basal low and luminal high regions. FOXA1 and AP2 factors are strongly established coactivators in luminal ER+ transcriptional signaling, hence they are associated with basal low eRNA regions. We clarified this in the discussion and provided more literature evidence in the revised manuscript to demonstrate the strong role of FOXA1 and AP2 factors in ER+ luminal breast cancer transcriptional response.

      6. Regarding Fig. 4B, the authors state that "ER binding occupies only the strongest ssDNA and GRO-seq-positive sites". Firstly, the GRO-seq data quality is poor with indiscernible peaks. This may be insufficient for a qualified representation of nascent eRNA expression. More importantly, it appears each heatmap is ranked independently, so top loci for ssDNA are not necessarily top loci for GRO-seq, ER, Pol-II, or H3K27ac. The reviewer requests clarification on how the authors plot these heatmaps and questions whether the statement is supported by the analysis as presented.

      Response: We acknowledge the reviewer’s concern and based on their suggestion, we utilised another set of GRO-seq datasets which is more deeply sequenced and published by the same lab. The average plot from these new datasets showed better profile. We also apologize for not providing enough details of how we generated the heatmaps in Fig. 4B. The heatmaps were made separately for each profile to auto scale with their own intensity levels but the order of the regions is based on KAS-seq intensity. The order of these regions was kept the same between each profile. Hence, top loci of ssDNA are not exact top loci of GRO, ER, H3K27ac and Polymerase but top loci of ssDNA also show similar high intensity in GRO, ER, H3K27ac and Polymerase, hence correlated. We also removed regions which belong to blacklisted regions of hg38 and the regions which were over-sequenced due to amplifications and showed weird signals. We provided the new heatmaps and profile plots in the revised manuscript with different clusters of KAS-seq intensity. We also updated the methods section to clarify how these heatmaps were made.

      7. In Fig. S4B and the third plot of 4C, the averaged histogram of ER binding appears in multiple sharp peaks with drastic asymmetric positioning around the enhancer centre, which is highly atypical of most published ER ChIP-seq profiles. Could the authors discuss possible "spatial syntax" or directional patterns of ER binding in relation to eRNA loci and cite any literature showing a similar pattern? Further evidence is required to substantiate these observations, as they are remarkably unique.

      Response: The authors agree with the reviewer’s point about asymmetric peaks of ER on the luminal specific eRNA regions. Due to the nature of the average profile plots and the number of regions explored here are so low, the profiles look asymmetrical and different than the published literature. Heatmaps lose their resolution when made on a very low number of regions. The focus of this analysis is to highlight that the ER is not binding to the centre of eRNA loci which is contradictory to the published findings from in vitro studies, but further away on these subtype-specific regions. We don’t have any solid evidence to demonstrate the directional patterns of ER binding related to this data. To avoid any confusion, we removed these average plots but focused on the already existing single gene profiles in Fig. S3 and discussed our interpretations in detail.

      Minor comments<br /> 1. When introducing eRNAs, the reviewer recommends mentioning that 1) eRNA levels correlate with enhancer activity and 2) eRNA expression precedes target gene transcription, thus reflecting upstream regulatory events. Relevant references include: Arner, E. et al. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science 347, 1010-1014 (2015); Carullo, N. V. N. et al. Enhancer RNAs predict enhancer-gene regulatory links and are critical for enhancer function in neuronal systems. Nucleic Acids Res. 48, 9550-9570 (2020); Kaikkonen, Minna U. et al. Remodeling of the Enhancer Landscape during Macrophage Activation Is Coupled to Enhancer Transcription. Mol. Cell 51, 310-325 (2013).

      Response: These are great recommendations from the reviewer, and we included the suggested publications in the Introduction section of the revised manuscript.

      2. H3K27ac is used initially to define these regulatory loci, and like eRNAs, H3K27ac also varies among patients. Which H3K27ac dataset(s) were used initially, and could this approach potentially overlook patient-specific enhancers? (OPTIONAL)

      Response: This is a totally valid point from the reviewer. The idea of this project is to define common subtype-specific enhancers which can be regulatory and prognostic, hence can be developed further as biomarkers providing benefit for more patients in the future. Hence, investigating the common enhancers which are activated in multiple normal and cancer cell lines defined by ENCODE is more valid than patient-specific enhancers whose activity might be influenced by specific genetic alterations. There is very limited availability of H3K27ac ChIP-seq datasets from cancer patients to explore the patient-specific enhancers, and our analyses were totally based on the published work, hence not possible to fully address this concern. The source of the H3K27ac ENCODE datasets (from 86 human cell lines and tissue samples) is clarified in the revised manuscript.

      3. In addition to the overall metrics displayed in Fig. 2B, could the authors provide precision and sensitivity values for LumA and LumB separately under the Logmc method, given the observation in Fig. 2E that LumA and LumB are not well separated in the UMAP projection?

      Response: The authors appreciate the suggestion from the reviewer. We have included the metrics separately for LumA and LumB in the revised manuscript in Fig. S1D.

      4. Could the author elaborate, in the discussion section, on why there is a substantial difference in ML performance depending on whether InfoGain or Logmc is used?

      Response: We have included the following text in the discussion to explain the differences between these two measures.

      “InfoGain measure work with the approach of binarization with k-means (k=2). It has the potential to capture both strongly expressed eRNAs which are differential between subtypes as well as low expressed sparser on and off eRNAs. In the first case, although eRNA is highly expressed in all patients, the higher expression mode becomes 1 and the lower expressed mode become 0. However, in case of low expression, more on and off expression, recentered logmc would not generate a striking high value. Furthermore, binarization is also a strong process to perform better clustering and classification, as distinguishing between data points gets better and clearer. “

      5. How does the expression pattern of Basal high, Basal low, Her2, and Lum eRNA clusters behave differentially in Basal, Her2, and LumA/B subtypes? Are Basal high eRNAs downregulated in Her2 or Lum subtypes, and vice versa? Since many downstream analyses rely on these eRNA clusters, it is suggested to include a heatmap and/or boxplot that displays how each eRNA category is expressed in each subtype to confirm that these definitions are consistent.

      Response: We thank the reviewers for this suggestion and apologise for not providing enough clarification on the expression of eRNAs in other subtypes. Indeed, Basal high expressed eRNA are expressed low in LumA and LumB and Basal low expressed eRNAs are expressed higher in lumA and lumB. Her2 subtype-specific eRNAs has a trend of expression between Basal and Lum, as it can be seen in the umap and PCA. Basically, the Basal high expressed eRNAs are Lum lower expressed eRNAs, and the Basal low expressed markers are Lum higher expressed markers. As per the suggestion from the reviewer, we provided heatmaps on eRNA expression of each subtype-specific with regulation in other subtype patients in figure S2F-K.

      Referee cross-commenting

      I share Reviewer #1's opinion that the manuscript should assess whether mRNA or eRNA is the stronger predictor of breast cancer subtypes and clinical outcomes. It will greatly improve the novelty if eRNA is shown to be a better indicator for cancer characterization.

      Also, I strongly concur with Reviewer #3 that the current informatics approach is superficial and that several conclusions are contentious. The authors need to resolve the inconsistencies in their ML statistics and the potentially misleading interpretations of the ChIPseq and motif enrichment results.

      It is further recommended that, building Reviewer #3's comment, the study integrate eRNA signatures with their proximal genes to address 1) whether genes located near these enhancers are differentially expressed-and correlated with enhancer activity-across cancer subtypes, and 2) whether it provides insights into understanding the enhancer-gene regulatory architecture in a subtype-specific context.

      Response: We thank reviewer 2 for cross-commenting on reviewer 1 and 3’s suggestions. Indeed, these are interesting points to cover and will increase the novelty of the study. Based upon these suggestions and discussed earlier for reviewer 1’s comments, we will explore the comparison of mRNAs vs eRNAs as predictor of cancer subtypes and prognosis and the association of genes-eRNAs in cis as discussed in other reviewer’s comments. Our preliminary analyses show a strong association of eRNA and mRNA specific to subtypes and an observable separation on subtypes which were harder to classify markers using eRNAs alone. Hence, we will improve these analyses, and the manuscript further as discussed above in the final revision.

      Significance

      General Assessment

      This study provides insights into the potential use of eRNA to classify breast cancer subtypes and refine prognostic markers. A strength is the integration of large-scale RNA-seq data with machine learning to identify eRNA signatures in biologically-meaningful patient samples, revealing both established and novel TF networks. The study also discovered eRNA clusters that correlate with the survival of patients, thus providing strong clinical implications. However, the ML approach yields several inconsistencies-for instance, unsatisfactory classification results for the Her2 subtype as well as the confused statistical metrics in the results. Furthermore, the ML model struggles to differentiate more nuanced molecular classes (e.g., LumA vs. LumB) and higher-level histological subtypes (e.g., lobular vs. ductal), thus limiting its power to dissect more delicate pathological and molecular mechanisms. Another limitation worth noting of this ML approach is the exclusive use of only polyadenylated eRNAs via RNA-seq, which excludes perhaps the more prominent 2D eRNA expressed in regulatory enhancers. Moreover, certain datasets appear to be of suboptimal quality, leading to assertions that would benefit from additional supporting evidence. Altogether, while the study offers a promising angle on eRNA-based tumor stratification, more robust experimental validations are needed to resolve inconsistencies and clarify the mechanistic underpinnings.

      Advance<br /> Conceptually, the study highlights the potential for eRNA-based signatures to capture regulatory variation beyond classical markers. However, the utility of these signatures is constrained by the focus on polyadenylated transcripts alone, likely underrepresenting key enhancer regions, and certain evidence presented in this study is not substantial enough to support some statements. While the work adds an important dimension to the understanding of enhancer biology in breast cancer, the resulting insights are partly hampered by limitations in data coverage and quality.

      Audience<br /> The primary audience includes cancer epigenetics, functional genomics, and bioinformatics researchers who are interested in leveraging eRNAs as biomarkers and dissecting complex regulatory networks in breast cancer. Clinically oriented scientists focusing on molecular diagnostics may also find relevance in the authors' approach to stratify subtypes and outcomes. The research is most relevant to a specialized audience within basic and translational cancer genomics, as well as computational biology groups interested in eRNA analysis.

      Field of Expertise

      I evaluate this manuscript as a researcher specializing in cancer epigenetics, functional genomics, and NGS-based data analysis. Parts of the manuscript touching on clinical outcome measures may require additional review from practicing oncologists.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This study aims to classify prognostic and subtype-specific eRNAs in breast cancer, highlighting their potential as biomarkers.<br /> Data was analysed using existing machine learning algorithms,<br /> Data analysis is superficial and it is hard to understand the key significant findings.

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers.<br /> They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

      Weaknesses include: Most of the findings are purely correlation-based and also based on a reanalysis of published datasets; it would benefit from experimental validation to support their findings. Differential expression analysis of large datasets likely yields some differences in the transcriptome. How significant are these changes?<br /> Does the expression of eRNAs affect the expression of genes in cis? Although this analysis would provide some associated gene expression differences, it can also provide some insights into subtype-specific differences in gene expression programs.<br /> If the authors find experimental validations are not feasible, I recommend validating the eRNA signature in an independent dataset.

      Response: We acknowledge the weaknesses noticed by the reviewer from this study about the correlation-based analyses of published datasets. While the TCGA eRNA atlas datasets are reanalysed, these are the high-resolution maps ever published on eRNA expression on cancer patient samples, and our study is the first to establish the subtype specific classification of eRNAs. We believe that the eRNAs are biologically relevant, as they are strongly associated with the subtype-specific pathways and epigenetic regulators. Upon suggestion from the reviewers, we will explore the association of mRNAs and eRNAs in cis to establish further significance and relevance of the eRNAs we identified (discussed earlier in reviewer 1 comments).

      We would like to focus on studying the functional relevance of eRNAs as a separate project. In vitro studies to establish the knockdown of eRNAs are not straightforward due to the toxicity and non-specific targeting of the locked nucleic acids approach or Cas13-based RNA targeting. siRNA-based approaches don't target the nuclear eRNAs effectively, even though they were widely used by other labs to target eRNAs. Hence, a lot of effort on optimisations are needed to establish functional validation of our eRNAs, hence not under the scope and time frame of this study/revision. To provide validation and significance using independent datasets, we will explore the association of these factors with the expression of subtype-specific eRNAs further in our final revised manuscript using the tools explained above for reviewer 1 (PEGS and Perturb-seq integration). Integration of our classified eRNAs with the published Perturb-seq validated regions from ER+ and ER- breast cancer cell lines will provide the functional validation of patient-associated classified enhancer/eRNAs. Hence, our study would be the first to demonstrate the validated gene-enhancer regulatory networks from breast cancer patient datasets.

      Furthermore, we included the single gene visualisation profiles of independent datasets of ER ChIP-seq from different patients (Ross-Innes et al., 2012), ATAC-seq from TCGA patients (Corces et al., 2018), H3K27ac ChIP-seq datasets from cell lines (Theodorou et al., 2013 and Hickey et al., 2021) and GRO-seq and CAGE data published in MCF7 cells close to the eRNA regions and discussed their overlap with the eRNA regions in the revised manuscript. In the final revision, we will perform further detailed integration of all these profiles. Overall, our study will provide the integratory analysis of various independent epigenetic and functional profiles to validate our classified subtype and survival-specific eRNA regions.

      Here are major points; addressing these points in the revised version is important.

      From Figure 1B, what eRNAs were identified for LumB using log2MC?

      Response: The authors acknowledge the lack of analyses on LumB eRNAs in the original version of the manuscript. In the final revised manuscript after associating with mRNAs, we will provide the heatmaps, pathway analyses and other functional annotations for LumB specific eRNAs.

      Page 8 However, sensitivity and F-measure .... It would help to include the metrics for the number of patients in each subtype. The ratio of eRNAs/number of cases in each subtype would inform if the number of eRNAs is an outcome of no. of cases or subgroup-specific.

      Response: This is a great suggestion from the reviewer, and we included the number of patients for each subtype in the table in Fig. 1D. We observed that the basal patients are low in number, but we identified more basal eRNAs. Hence, the number of eRNAs identified in subtype-specific manner is not correlated to the number of patients in the cohort.

      Page 9 "Altogether, both measurements classify eRNAs efficiently based on subtypes, InfoGain allowed us to distinguish further samples based on high and low expression of eRNAs for basal subtype and performed better in statistical metrics" Based on statistical metrics, both models seem to be performing similarly except for Her2.

      Response: We apologise for this wrong interpretation. We corrected this in the revised manuscript at page 9.

      In Fig. 1B, the F-measure metrics are wrong for basal LogMC, as it is 0.94 rather than 0.54, which could lead to a misinterpretation of the model.

      Response: We apologise for the mistake in this figure, and we included the corrected heatmap in the revised manuscript.

      Many genome browser figures, including Figure S3. TFBS is not at the same site as eRNAs detected. Is there CAGE data to show that binding these TFs at these sites leads to the expression of eRNAs? That will give direct evidence that the eRNAs are transcribed due to these TFs

      Response: This is a great suggestion from the reviewer. We incorporated ATAC-seq, H3K27ac ChIP-seq, CAGE and GRO-seq data on the gene profiles in Fig. S3 to validate the activity of these ER binding sites in the preliminarily revised manuscript. We observed that all the ER binding sites are accompanied by open and active enhancer marks with bidirectional transcription (either GRO- or CAGE positive) but they are not on the exact location of eRNA regions (250-1000 bps away from the centre of ER binding site). Subtype-specific eRNA regions close to genes like MLPH and XBP1 possess active bidirectional transcribing ER binding sites far away from subtype-specific eRNA loci and also ER unbound sites. However, these distal ER binding sites are close to the regions from the list of 300K eRNA loci and they were simply not identified as subtype-specific regions.

      Page 10, There were 30 Her2-specific eRNA regions.... Do the same enhancers also regulate these genes as those from which eRNAs are transcribed? Is it cis-effect, or could these affect the trans-regulating of other genes?

      Response: We acknowledge the concern from the reviewer, however this is hard to be validated, as functional experiments to explore the 3D interactions of enhancers and gene promoters are not robust enough to be performed in patient samples and can't be performed within the revision time frame. In the final revised manuscript, we will explore the association of enhancers and promoters of ERBB2 with PEGS association as discussed above and with available HiC datasets in Her2+ cell lines (HCC1954, GSE167150, Kim et al., 2022 https://pubmed.ncbi.nlm.nih.gov/35513575/ )

      Minor comments:

      Page 8 "InfoGain meausure..." Fig. S2A also shows high and low expressed eRNAs for the basal group

      Response: We apologise for the lack of clarity here. InfoGain measure identifies both high and low expressed eRNAs in all patients showing similar pattern of regulation among patients. However, logmc derived eRNAs are highly expressed in most patients. Low expressed eRNAs could not be identified in logmc measure as strong as InfoGain regions. The text in the results had been edited in the revised manuscript to reflect better clarity on this point.

      Page 11, Our analyses also identified the role of another..... The statement is misleading as it is the enrichment of these TFs with the eRNAs<br /> Response: We included the word “enrichment” to clarify this statement.

      Page 13, "Around 90% of eRNAs are bidirectional and non-polyadenylated [53]. TCGA expression datasets are based on RNA-seq assays, which capture only non-polyadenylated RNAs. Thus, analysing the expression of eRNAs on mRNA-seq datasets might not be adequate". It is very confusing, please check<br /> Response: We apologise for the mistake, and this has been corrected in the revised manuscript.

      Reviewer #3 (Significance (Required)):

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers.<br /> They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03004

      Corresponding author(s): Kentaro Furukawa and Tomotake Kanki

      1. General Statements [optional]

      We would like to thank the reviewers for their constructive and positive feedback. We are encouraged that all three reviewers consider the identification of Mfi2 as an outer mitochondrial membrane fission factor required for mitophagy to be a significant and important contribution to the research field. We acknowledge the concerns raised and propose the following plan to address them through additional experiments and clarifications. We believe that these revisions will further strengthen the manuscript and enhance its impact.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Furukawa and colleagues identified Mfi2 as novel factor that promotes fragmentation and removal of damaged mitochondria by mitophagy. They report that parallel loss of Dnm1 and Mfi2 blocks mitophagy. Mfi2 acts on the outer membrane, while the previous found Atg44 functions in the intermembrane space. How the proteins cooperate remains unknown. This is an elegant study with high-quality data. The findings are interesting for a broad readership. There are some issues as outline below that should be solved.

      Response:

      We would like to thank Reviewer #1 for their thoughtful evaluation of our manuscript and for recognizing the interest and quality of the study.

      1. It remains unclear how Mfi2 is anchored into the outer mitochondrial membrane. Does it contain a transmembrane domain? The carbonate resistance indicates the presence of such transmembrane domain. However, the presented structures lack such membrane-spanning segment. This point should be clarified.

      Response:

      We performed an in silico topology prediction of Atg44 and Mfi2 using TMHMM. This tool identified a weakly hydrophobic region of Mfi2 near the N-terminus but did not predict a definitive transmembrane domain (see new Fig. EV1E) (Page 6, lines 8-9). This result implies that Mfi2 interacts with the outer membrane in a monotopic or peripheral manner, rather than as a classical transmembrane protein. Such proteins may remain in the membrane pellet after carbonate treatment due to their strong hydrophobic insertion into the lipid bilayer (e.g., yeast tafazzin/Taz1; Brandner et al., Mol. Biol. Cell, 2005; DOI: 10.1091/mbc.E05-03-0256). We will incorporate this interpretation in the revised manuscript.

      How does Mfi2 cooperate with Dnm1? Is there any interaction between these proteins? Some further information could provide mechanistic insights into the function of Mfi2.

      Response:

      While our study does not explicitly suggest that Mfi2 cooperates with Dnm1, we plan to investigate whether these proteins physically associate. We will perform co-immunoprecipitation experiments under growing and mitophagy-inducing conditions to examine potential interactions between Mfi2 and Dnm1. Further insights into their interaction could help clarify the mechanistic role of Mfi2 in mitochondrial fission and mitophagy.

      The authors report a CL-dependent binding of Mfi2 to liposomes. Is the recruitment of Mfi2 to mitochondria impaired when CL-synthesis is blocked, e.g. in crd1delta mitochondria?

      Response:

      To assess the role of cardiolipin in Mfi2 localization, we will compare the efficiency of mitochondrial targeting of endogenous Mfi2 in WT and crd1Δ cells. Additionally, as mentioned in Reviewer #3's comment, we plan to perform coarse-grained molecular dynamics simulations to further investigate the interaction between Mfi2 and cardiolipin. The results of these simulations will be incorporated into the discussion to provide deeper mechanistic insights.

      Figure 4B: a wild-type control should be added.

      Response:

      We appreciate Reviewer #1’s suggestion to include a WT control in Figure 4B. However, given the focus of this figure on the rescue of mitophagy defects in the mfi2Δ dnm1Δ strain, we believe that adding a WT control is not essential for the analysis. The key comparison here is between the mfi2Δ dnm1Δ strain and the rescue conditions, and statistical analysis was performed to support the conclusions. We hope this clarifies our approach, but we will make adjustments if necessary.

      Reviewer #1 (Significance (Required)):

      The reported findings are interesting for a broad readership.

      Response:

      We appreciate Reviewer #1’s recognition of the relevance of our findings to a broad readership.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors discover a mitochondrial fission factor, termed Mfi2, that promotes mitophagy efficiency and that functions in a partially redundant way with Dnm1 for the fission of mitochondrial outer membranes during mitophagy. The discovery helps to clarify why Dnm1 does not appear to be essential for fission mediated mitophagy by Dnm1. Mfi2 is structurally similar to the inner membrane fission factor Atg44 which is consistent with Mfi2's fission activity. The authors show that Mfi2 has membrane fission activity towards nanotubes in vitro, and that membrane binding is dependent of high levels of cardiolipin, a mitochondrially enriched lipid. In summary, the authors show that Mfi2 mediates mitochondrial outer membrane fission together with Drp1, whereas Atg44 mediates inner membrane fission, that together are necessary for mitophagy.

      Response:

      We thank Reviewer #2 for the positive assessment and for clearly summarizing the main contributions of our work.

      Major: 1. Figure 2: How do the expression levels of the Mfi2 constructs compare to the endogenous levels of the protein? This will help to gauge to what degree Mfi2 N66 overexpression is needed to achieve mitochondrial fragmentation in Atg44 delta cells and also the low level of mitophagy rescue that was observed.

      Response:

      We used the TDH3 promoter for the expression of Mfi2 in Figures 2D and 2E. Unfortunately, our Mfi2 antibody only detects full-length Mfi2, as it recognizes a C-terminal region of the protein. This means we cannot directly compare the expression levels of Mfi2(N66) to those of endogenous full-length Mfi2. To clarify the expression levels, we will provide the following data:

      (1) Mfi2 antibody: Endogenous Mfi2(Full) and overexpressed Mfi2(Full)

      (2) FLAG antibody: Overexpressed Mfi2(Full)-FLAG and overexpressed Mfi2(N66)-FLAG

      Figure 3A-B: The cardiolipin binding results in vitro are interesting but the concentration of cardiolipin is much lower on the outer membrane versus the inner membrane. Can the authors comment on whether the cardiolipin levels used on the nanotubes are relevant to that of the mitochondrial outer membrane where Mfi2 is located? Can the authors provide quantitative data for these experiments to help strengthen their conclusions?

      Can the authors also use purified MBP alone or a form of Mfi2 that cannot bind to membrane e.g. Mfi2-C33) as a control?

      Response:

      We thank the reviewer for raising this important point regarding our cardiolipin-dependent in vitro data. In our experiments, we used 20 mol% cardiolipin (CL), a concentration higher than the typical levels in the mitochondrial outer membrane, which contains less than 5% CL. However, it is known that CL translocates to the outer membrane under mitophagy-inducing conditions (e.g., Chu et al., Nat Cell Biol, 2013; Kagan et al., Cell Death Differ, 2016). Our use of elevated CL levels aligns with standard practices in in vitro reconstitution assays to ensure adequate membrane curvature and charge density, which are necessary for robust and reproducible protein-membrane interaction assessments.

      To strengthen our conclusions, we will provide a quantitative analysis of the nanotube fission experiments. This will include the percentage of severed tubes under each condition, the total number of tubes analyzed (n), and the relationship between tube diameter and fission efficiency. These additional data will allow for a more thorough evaluation of the membrane fission activity of Mfi2.

      Furthermore, we will include control experiments using purified MBP alone and a membrane-binding-deficient mutant of Mfi2 (C33), as suggested by the reviewer.

      Figure 4D: The protrusions are very difficult to visualize. Can the authors also provide zoomed in regions. Is the data representative from 3 or more independent experiments? Can the authors provide a graph of the quantitation to aid readers with analysis of the data?

      Response:

      We thank the reviewer for this helpful suggestion. In the revised manuscript, we will provide higher magnification images to improve the visibility of mitochondrial protrusions. We confirm that the presented images are representative of results obtained from three independent experiments. Additionally, as requested, we will include a graph quantifying the frequency and morphology of protrusions to facilitate data interpretation.

      Figure 4D: It is fascinating to see the mitochondrial protrusion formation being dependent on autophagy factors but not mitochondrial fission factors. To help visualize this, can the authors image one of either Atg1, Atg8 to address whether phagophores are forming on the protrusions and if so where they are positionally located on the protrusion in control and/or mfi2,dnm1,atg44 triple mutant cells?

      Response:

      We thank the reviewer for this insightful comment. In our previous study (Fukuda et al., Mol Cell, 2023), we demonstrated that Atg proteins, such as Atg8, accumulate at mitochondrial protrusions formed in atg44Δ cells, suggesting that these structures can serve as sites for phagophore assembly. However, as in our previous microscopy analysis, the resolution limitations of our imaging system make it difficult to precisely determine the exact location of phagophores on the protrusions.

      Whether similar recruitment occurs in the absence of both Mfi2 and Dnm1 remains untested. To address this, we will perform fluorescence imaging of fluorescent protein tagged Atg proteins, such as GFP-Atg8, in mfi2Δ dnm1Δ atg44Δ triple mutant cells to examine whether phagophores form on the mitochondrial protrusions under these conditions. This will help us determine whether phagophore formation requires mitochondrial fission or occurs independently of it.

      Minor: 1. Is it possible to target Atg44 to the mitochondrial outer membrane, either by attaching an OM anchor or using part of the N-terminus of Mfi2? This will help elucidate how Mfi2 reaches the outer membrane and whether Atg44 can be just as active on the outer membrane as long as it can access it.

      Response:

      We thank the reviewer for this suggestion. We will construct chimeric proteins between Atg44 and Mfi2 and examine where such proteins are localized. Additionally, we will assess whether these chimeric proteins have the functional activity of Mfi2, as this will help determine if Atg44 can be active on the mitochondrial outer membrane when properly targeted.

      Are microtubules or actin required for the protrusions to form? Using the triple mutant cells that have a high proportion of protrusions, it could be tried to add cytoskeletal depolymerizing drugs such as nocodazole for microtubules or Latrunculin A or Latrunculin B for actin.

      Response:

      We thank the reviewer for this suggestion. We will test the effect of cytoskeletal depolymerizing drugs on protrusion formation in the mfi2Δ dnm1Δ atg44Δ triple mutant cells.

      Reviewer #2 (Significance (Required)):

      Significance: The discovery of Mfi2 as an outer membrane mitophagy fission factor is an exciting, and very important and significant contribution to the field. The data are in this study are clear and the conclusions are generally well supported by the experiments. This study appears to be suitable as a report style manuscript given that there is limited mechanistic analysis of Mfi2 activity. This does not affect the importance of the work, it just means that it is suited as a report of a significant discovery. Overall, this fills an important knowledge gap in solving the mystery behind which factors are involved in mitochondrial outer membrane fission during mitophagy, and provides a clarification why Dnm1 loss alone minimally affects mitophagy. This work will appeal to researchers interested in mitochondrial biology, the autophagy field, and cell biologists interested in organelle membrane dynamics, and is also broadly important and interesting to all cell biologists.

      Reviewer expertise: mitophagy mechanisms, cell biology of mitophagy, autophagy and autophagosome formation, mitochondrial biology including OXPHOS and mitochondrial dynamics

      Response:

      We appreciate Reviewer #2’s comments on the importance and potential impact of our discovery for the mitophagy and cell biology fields.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Furukawa et al. presents a well-structured and thorough study identifying Mfi2 as a novel mitochondrial outer membrane-resident fission factor required for mitophagy in Saccharomyces cerevisiae. The authors demonstrate that Mfi2, together with the inner membrane mitofissin Atg44 and the dynamin-related GTPase Dnm1, contributes to mitochondrial fragmentation during mitophagy. Importantly, they show that while Dnm1 is dispensable on its own, Mfi2 and Dnm1 act redundantly from the outer membrane to support Atg44-mediated fission. The data are robust, the figures are clear, and the mechanistic insight into how mitophagy-specific fission is achieved is of high relevance to the field of mitochondrial quality control.

      Overall, this is a logically constructed and convincing study with important implications for understanding compartment-specific mechanisms of mitochondrial fission during selective autophagy. The conclusions are largely well supported by the data. However, a few issues and points of clarification should be addressed before publication.

      Response:

      We thank Reviewer #3 for the careful and constructive review and for acknowledging the logical structure and robustness of our data.

      Major Comments

      1. The observation that both Mfi2 and Atg44 require high cardiolipin (CL) content for membrane binding and fission in vitro is intriguing, especially given that CL is enriched in the inner membrane. The authors mention CL externalisation during mitophagy, but this connection could be made more explicit earlier in the manuscript. Furthermore, since the molecular mechanism of membrane interaction remains unresolved, I would strongly encourage the authors to undertake coarse-grained molecular dynamics simulations to explore how Mfi2 might interact with lipid bilayers of differing composition. This could clarify the role of CL and the potential structural contribution of the disordered C-terminal region. Response:

      We thank the reviewer for highlighting the need to clarify the connection between CL externalization and the observed CL-dependent membrane binding and fission activity of Mfi2 and Atg44. While we briefly mentioned CL externalization during mitophagy in the Discussion, we agree that this connection should be made more explicit earlier in the manuscript. In the revised version, we will incorporate a brief rationale in the Results section to clarify that CL translocates to the mitochondrial outer membrane under mitophagy-inducing conditions (e.g., Chu et al., Nat Cell Biol 2013). This will provide a physiological basis for our in vitro reconstitution assays using CL-containing liposomes.

      We also appreciate the reviewer’s suggestion to explore the molecular basis of Mfi2-lipid interaction through coarse-grained molecular dynamics (CGMD) simulations. In collaboration with Dr. Yuji Sakai, we will perform coarse-grained molecular dynamics (CGMD) simulations to investigate how Mfi2 interacts with lipid bilayers of varying compositions, focusing particularly on the role of cardiolipin and the structural contribution of the disordered C-terminal region. If successful, we will include the results in the revised manuscript.

      While the genetic and phenotypic data indicate that Mfi2 and Dnm1 act independently to support mitochondrial fission, the spatial and temporal organisation of their activity during mitophagy remains unclear. Do Mfi2 and Dnm1 colocalise at fission sites, or do they act at separate subdomains of the outer membrane? Live-cell imaging with fluorescently tagged Mfi2 and Dnm1, particularly during mitophagy induction, could help clarify whether these factors act in concert or at distinct locations and time points. This would also help determine whether their apparent redundancy reflects parallel mechanisms or functional compensation at shared sites. It would also be interesting to combine this with Atg44.

      Response:

      We thank the reviewer for this insightful comment. We plan to perform co-localization analysis of Mfi2 and Dnm1 during mitophagy induction to clarify whether these proteins colocalize at fission sites or act at separate subdomains of the outer membrane. Additionally, we will conduct co-immunoprecipitation experiments of Mfi2 and Dnm1 (see also Response to Reviewer #1’s major comment 2) to further investigate their potential interaction. It is challenging to analyze Mfi2, Dnm1, Atg44, and mitochondrial fission sites simultaneously, as fluorescence-tagged Atg44 has been shown to lose its function (Fukuda et al., Mol Cell, 2023).

      Minor Comments

      1. The sodium carbonate extraction and proteinase K assays (Figure 1E-F) are standard but may not be familiar to all readers. A brief explanatory sentence clarifying what these methods reveal about membrane topology would improve accessibility. Response:

      We thank the reviewer for this helpful comment. We have added a brief explanatory sentence in the revised manuscript to clarify the principles and interpretation of the sodium carbonate extraction and proteinase K assays (Page 5, lines 23-25; Page 6, lines 1-3).

      While immunoblot quantifications are shown throughout, it would be helpful to include statistical analysis where appropriate, especially in cases where differences between genotypes or constructs are modest.

      Response:

      Statistical analyses have been added for immunoblot quantifications where appropriate, particularly in cases where differences between genotypes or constructs are modest.

      The naming of Mfi2 as a mitofissin is consistent with previous terminology introduced for Atg44, but the term remains relatively new. A brief clarification distinguishing "mitofissin" from the better-known "mitofusin" family in mammals would help avoid confusion for readers less familiar with yeast-specific nomenclature.

      Response:

      We have added a brief explanation of the term "mitofissin" to distinguish it from the mammalian "mitofusin" family in Introduction (Page 3, line 26-Page 4 line 1).

      Reviewer #3 (Significance (Required)):

      This is a strong and well-executed study that provides mechanistic insight into how mitochondrial fission is coordinated during mitophagy in yeast. A major strength is the identification and characterisation of Mfi2 as a previously unrecognised outer membrane fission factor acting in parallel with Dnm1 and in coordination with the intermembrane space protein Atg44. The genetic, imaging, and in vitro biochemical data are carefully integrated, and the authors are transparent about limitations, including open questions around the C-terminal domain of Mfi2, CL dependence, and the evolutionary conservation of mitofissins.

      The work makes a conceptual advance by showing that mitophagy-specific mitochondrial fission requires the cooperation of spatially separated factors acting from both the inside and outside of mitochondria, a mechanism that had not been fully appreciated. This study helps resolve previous contradictions regarding the dispensability of Dnm1 in mitophagy, thereby filling a gap in our understanding of organelle-specific fission. While the findings are focused on yeast, they raise broader questions about whether similar principles apply to higher eukaryotes (historically yeast research was always at the forefront of autophagy field).

      The study will be of interest to specialists in autophagy, mitochondrial dynamics, and yeast cell biology, as well as researchers working on membrane remodelling and organelle quality control. While the audience is primarily specialised, the conceptual insights will resonate more broadly in the cell biology community.

      I am an expert in mitophagy mechanisms in mammalian cells, and while not a specialist in yeast models, I found the study logical, rigorous, and of clear relevance to the broader autophagy field.

      Response:

      We are grateful for Reviewer #3’s recognition of the conceptual advance provided by our study and its relevance beyond yeast biology.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Responses to Reviewer #1:

      ・We performed in silico topology prediction of Atg44 and Mfi2 using TMHMM. This tool identified a weakly hydrophobic region of Mfi2 near the N-terminus but did not predict a definitive transmembrane domain (new Fig. EV1E) (Page 6, lines 8-9).

      Responses to Reviewer #3:

      ・We have added a brief explanatory sentence in the revised manuscript to clarify the methods and interpretation of the sodium carbonate extraction and proteinase K assays (Page 5, lines 23-25; Page 6, lines 1-3).

      ・Statistical analyses have been added for immunoblot quantifications where appropriate, particularly in cases where differences between genotypes or constructs are modest.

      ・We have added a brief explanation of the term "mitofissin" to distinguish it from the mammalian "mitofusin" family in Introduction (Page 3, line 26-Page 4, line 1).

      4. Description of analyses that authors prefer not to carry out

      Response to Reviewer #1 (Major 4):

      We will not include the WT strain as a control. See our response.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Furukawa et al. presents a well-structured and thorough study identifying Mfi2 as a novel mitochondrial outer membrane-resident fission factor required for mitophagy in Saccharomyces cerevisiae. The authors demonstrate that Mfi2, together with the inner membrane mitofissin Atg44 and the dynamin-related GTPase Dnm1, contributes to mitochondrial fragmentation during mitophagy. Importantly, they show that while Dnm1 is dispensable on its own, Mfi2 and Dnm1 act redundantly from the outer membrane to support Atg44-mediated fission. The data are robust, the figures are clear, and the mechanistic insight into how mitophagy-specific fission is achieved is of high relevance to the field of mitochondrial quality control. Overall, this is a logically constructed and convincing study with important implications for understanding compartment-specific mechanisms of mitochondrial fission during selective autophagy. The conclusions are largely well supported by the data. However, a few issues and points of clarification should be addressed before publication.

      Major Comments

      1. The observation that both Mfi2 and Atg44 require high cardiolipin (CL) content for membrane binding and fission in vitro is intriguing, especially given that CL is enriched in the inner membrane. The authors mention CL externalisation during mitophagy, but this connection could be made more explicit earlier in the manuscript. Furthermore, since the molecular mechanism of membrane interaction remains unresolved, I would strongly encourage the authors to undertake coarse-grained molecular dynamics simulations to explore how Mfi2 might interact with lipid bilayers of differing composition. This could clarify the role of CL and the potential structural contribution of the disordered C-terminal region.
      2. While the genetic and phenotypic data indicate that Mfi2 and Dnm1 act independently to support mitochondrial fission, the spatial and temporal organisation of their activity during mitophagy remains unclear. Do Mfi2 and Dnm1 colocalise at fission sites, or do they act at separate subdomains of the outer membrane? Live-cell imaging with fluorescently tagged Mfi2 and Dnm1, particularly during mitophagy induction, could help clarify whether these factors act in concert or at distinct locations and time points. This would also help determine whether their apparent redundancy reflects parallel mechanisms or functional compensation at shared sites. It would also be interesting to combine this with Atg44.

      Minor Comments

      1. The sodium carbonate extraction and proteinase K assays (Figure 1E-F) are standard but may not be familiar to all readers. A brief explanatory sentence clarifying what these methods reveal about membrane topology would improve accessibility.
      2. While immunoblot quantifications are shown throughout, it would be helpful to include statistical analysis where appropriate, especially in cases where differences between genotypes or constructs are modest.
      3. The naming of Mfi2 as a mitofissin is consistent with previous terminology introduced for Atg44, but the term remains relatively new. A brief clarification distinguishing "mitofissin" from the better-known "mitofusin" family in mammals would help avoid confusion for readers less familiar with yeast-specific nomenclature.

      Significance

      This is a strong and well-executed study that provides mechanistic insight into how mitochondrial fission is coordinated during mitophagy in yeast. A major strength is the identification and characterisation of Mfi2 as a previously unrecognised outer membrane fission factor acting in parallel with Dnm1 and in coordination with the intermembrane space protein Atg44. The genetic, imaging, and in vitro biochemical data are carefully integrated, and the authors are transparent about limitations, including open questions around the C-terminal domain of Mfi2, CL dependence, and the evolutionary conservation of mitofissins.

      The work makes a conceptual advance by showing that mitophagy-specific mitochondrial fission requires the cooperation of spatially separated factors acting from both the inside and outside of mitochondria, a mechanism that had not been fully appreciated. This study helps resolve previous contradictions regarding the dispensability of Dnm1 in mitophagy, thereby filling a gap in our understanding of organelle-specific fission. While the findings are focused on yeast, they raise broader questions about whether similar principles apply to higher eukaryotes (historically yeast research was always at the forefront of autophagy field).

      The study will be of interest to specialists in autophagy, mitochondrial dynamics, and yeast cell biology, as well as researchers working on membrane remodelling and organelle quality control. While the audience is primarily specialised, the conceptual insights will resonate more broadly in the cell biology community.

      I am an expert in mitophagy mechanisms in mammalian cells, and while not a specialist in yeast models, I found the study logical, rigorous, and of clear relevance to the broader autophagy field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors discover a mitochondrial fission factor, termed Mfi2, that promotes mitophagy efficiency and that functions in a partially redundant way with Dnm1 for the fission of mitochondrial outer membranes during mitophagy. The discovery helps to clarify why Dnm1 does not appear to be essential for fission mediated mitophagy by Dnm1. Mfi2 is structurally similar to the inner membrane fission factor Atg44 which is consistent with Mfi2's fission activity. The authors show that Mfi2 has membrane fission activity towards nanotubes in vitro, and that membrane binding is dependent of high levels of cardiolipin, a mitochondrially enriched lipid. In summary, the authors show that Mfi2 mediates mitochondrial outer membrane fission together with Drp1, whereas Atg44 mediates inner membrane fission, that together are necessary for mitophagy.

      Major:

      1. Figure 2: How do the expression levels of the Mfi2 constructs compare to the endogenous levels of the protein? This will help to gauge to what degree Mfi2 N66 overexpression is needed to achieve mitochondrial fragmentation in Atg44 delta cells and also the low level of mitophagy rescue that was observed.
      2. Figure 3A-B: The cardiolipin binding results in vitro are interesting but the concentration of cardiolipin is much lower on the outer membrane versus the inner membrane. Can the authors comment on whether the cardiolipin levels used on the nanotubes are relevant to that of the mitochondrial outer membrane where Mfi2 is located? Can the authors provide quantitative data for these experiments to help strengthen their conclusions? Can the authors also use purified MBP alone or a form of Mfi2 that cannot bind to membrane e.g. Mfi2-C33) as a control?
      3. Figure 4D: The protrusions are very difficult to visualize. Can the authors also provide zoomed in regions. Is the data representative from 3 or more independent experiments? Can the authors provide a graph of the quantitation to aid readers with analysis of the data?
      4. Figure 4D: It is fascinating to see the mitochondrial protrusion formation being dependent on autophagy factors but not mitochondrial fission factors. To help visualize this, can the authors image one of either Atg1, Atg8 to address whether phagophores are forming on the protrusions and if so where they are positionally located on the protrusion in control and/or mfi2,dnm1,atg44 triple mutant cells?

      Minor:

      1. Is it possible to target Atg44 to the mitochondrial outer membrane, either by attaching an OM anchor or using part of the N-terminus of Mfi2? This will help elucidate how Mfi2 reaches the outer membrane and whether Atg44 can be just as active on the outer membrane as long as it can access it.
      2. Are microtubules or actin required for the protrusions to form? Using the triple mutant cells that have a high proportion of protrusions, it could be tried to add cytoskeletal depolymerizing drugs such as nocodazole for microtubules or Latrunculin A or Latrunculin B for actin.

      Significance

      The discovery of Mfi2 as an outer membrane mitophagy fission factor is an exciting, and very important and significant contribution to the field. The data are in this study are clear and the conclusions are generally well supported by the experiments. This study appears to be suitable as a report style manuscript given that there is limited mechanistic analysis of Mfi2 activity. This does not affect the importance of the work, it just means that it is suited as a report of a significant discovery. Overall, this fills an important knowledge gap in solving the mystery behind which factors are involved in mitochondrial outer membrane fission during mitophagy, and provides a clarification why Dnm1 loss alone minimally affects mitophagy. This work will appeal to researchers interested in mitochondrial biology, the autophagy field, and cell biologists interested in organelle membrane dynamics, and is also broadly important and interesting to all cell biologists.

      Reviewer expertise: mitophagy mechanisms, cell biology of mitophagy, autophagy and autophagosome formation, mitochondrial biology including OXPHOS and mitochondrial dynamics

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Furukawa and colleagues identified Mfi2 as novel factor that promotes fragmentation and removal of damaged mitochondria by mitophagy. They report that parallel loss of Dnm1 and Mfi2 blocks mitophagy. Mfi2 acts on the outer membrane, while the previous found Atg44 functions in the intermembrane space. How the proteins cooperate remains unknown. This is an elegant study with high-quality data. The findings are interesting for a broad readership. There are some issues as outline below that should be solved.

      1. It remains unclear how Mfi2 is anchored into the outer mitochondrial membrane. Does it contain a transmembrane domain? The carbonate resistance indicates the presence of such transmembrane domain. However, the presented structures lack such membrane-spanning segment. This point should be clarified.
      2. How does Mfi2 cooperate with Dnm1? Is there any interaction between these proteins? Some further information could provide mechanistic insights into the function of Mfi2.
      3. The authors report a CL-dependent binding of Mfi2 to liposomes. Is the recruitment of Mfi2 to mitochondria impaired when CL-synthesis is blocked, e.g. in crd1delta mitochondria?
      4. Figure 4B: a wild-type control should be added.

      Significance

      The reported findings are interesting for a broad readership.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers and Revision Plan

      We thank all three reviewers for their thoughtful and constructive comments. We are pleased that the reviewers found our work to be "very interesting," "well written," with "high quality" data that is "convincing" and will be "of broad interest for the community of axon guidance, circuit formation and brain development." We particularly appreciate the recognition that our study provides "novel functions for Cas family genes in forebrain axon organization" and uses "state-of-the art mouse genetics" with "quantitative and statistical rigor." Below are our detailed responses to each reviewer's comments, including extensive additional experiments and analyses that we will perform to significantly strengthen the manuscript.

      Reviewer #1

      We thank this reviewer for recognizing that our experiments are "carefully done and quantified" with "clear and striking" phenotypes that "support most of the conclusions in the manuscript." We appreciate their acknowledgment that this work will be "of interest to developmental neurobiologists and the axon guidance and adhesion fields."

      Major Comments:

      __ Authors clearly show that misplaced TCA axons are coordinate with cortical layer defects, with misplaced tbr1+ neurons, in EMX-Cre cas and integrin knockouts, suggesting these axons are following misplaced cells. These results are described as 100% coordinate, but since there is no figure of quantification, authors need to clarify how many embryos were examined for each genotype, as this was not described in results or legends.__ We apologize for this oversight and will provide detailed quantification of this important finding. We examined a total of 11 Emx1Cre;TcKO embryos with 13 controls, and 14 Emx1Cre;Itgb1 embryos with 13 littermate controls at two developmental stages (E16.5 and P0) to quantify the coordination between misplaced Tbr1+ neurons and cortical bundle formation. This quantification will be presented in the main text and figure legend.

      Here's a more detailed breakdown of those numbers: For Emx1Cre;TcKO knockouts, we examined 7 controls and 5 mutants at P0, and 6 controls and 6 mutant embryos at E16.5. For the Emx1Cre;Itgb1 knockouts, we examined 5 controls and 6 mutant neonates at P0, and 8 controls and 8 mutant embryos at E16.5.

      __ Are the neurons not misplaced in Nex cre cas or integrin knockouts? One would think presumably not, but then what are the tbr1+ cell migration defect caused by? I struggle with the semantics of non-neuronal autonomous role of cas in cortex, since tbr1+ neurons are misplaced, and this is what the axons are mistargeting too. So yes, potentially cas or b1 is not needed in those neurons, but those misplaced neurons are presumably driving the phenotype.__

      We agree that this important point requires better explanation. You are absolutely correct that Tbr1+ neurons are not misplaced in NexCre;TcKO mutants (Wong et al., 2023), which is precisely why these animals do not exhibit cortical bundle formation. In addition to our previously published data showing normal location of Tbr1+ neurons in those mutants, we can also provide similar analysis at E16.5 and P0 as a supplemental figure. The model we propose is that Cas genes are required in radial glial cells for proper positioning of deep layer cortical neurons. These correctly positioned neurons, in turn, provide appropriate guidance cues for TCA projections. Hence, our model is that while the role of Cas genes is non-neuronal-autonomous (acting in radial glia rather than in the neurons themselves), the mispositioned Tbr1+ neurons in Emx1Cre;TcKO mutants drive the TCA misprojection phenotype. We will clarify this mechanism in the discussion and provide a new graphical model as a supplemental figure to facilitate conceptualization of our conclusions.

      __ Authors need to clarify in the discussion that they can't rule out the cas not also needed in tca neurons, Since neither emx or nex cre would hit those cells.__

      We will add the following clarification to the discussion: The analysis of cortical bundle formation in Emx1Cre;TcKOrevealed a comparable phenotype to that observed in NestinCre;TcKO, strongly suggesting a cortical-autonomous role for Cas genes in CB formation. "However, we cannot formally exclude a thalamus-autonomous role for Itgb1 or Cas genes in TCA pathfinding, as we did not ablate these genes exclusively in the thalamus. Future studies using thalamus-specific Cre drivers would be needed to definitively address this question."

      __ Could authors add boxes in zoomed out brain images to denote zoom regions. And potentially a schematic demonstrating placement of DiI for lipophilic tracing experiments.__

      We will add boxes to denote zoom regions where possible throughout the manuscript. For some high magnification panels, we selected the best representative images, which don't necessarily correspond to specific regions of the lower magnification panels, but we will note this in the figure legends. We will also add a schematic diagram to a supplemental figure illustrating DiI placement for all lipophilic tracing experiments.

      Reviewer #2

      We thank this reviewer for describing our study as "very interesting," "well written," with data that are "of high quality" and findings that are "convincing." We appreciate their recognition that we used "state-of-the art mouse genetics" and that our work will be "of broad interest for the community of axon guidance, circuit formation and brain development."

      Major Comments:

      __ Immunofluorescence labeling for other β-integrin family members to examine expression in AC axons may provide insights into why β1-integrin deficiency does not replicate the Cas TcKO phenotype.__ This is an excellent suggestion that we will address experimentally. We will perform RNAscope analysis for integrin β5, β6, and β8 in developing piriform and S1 cortex at E14.5, E16.5, and E18.5, as these are the only other β-integrins expressed during cortical development. We anticipate that this analysis may reveal expression of alternative β-integrins in the neurons that extend axons along the developing anterior commissure, which would provide a potential explanation for why β1-integrin deficiency does not replicate the AC phenotype observed in Cas TcKO animals. These new data will be presented as part of a new figure.

      __ Is there any evidence that β1-integrin in developing cortical axons is colocalized with Cas proteins (in vivo or in vitro)?__

      We have tested multiple antibodies for p130Cas and CasL without success in cortical tissue. However, we will test two new integrin β1 antibodies and a new p130Cas antibody. While direct colocalization may be challenging due to species restrictions and tissue-specific antibody performance, we will attempt to show regional co-expression in consecutive sections. If the integrin antibodies work, we will present data as a supplemental figure demonstrating that p130Cas (using our BAC-EGFP reporter) and β1-integrin show overlapping expression patterns in developing cortical white matter tracts and neurons, supporting their potential functional interaction. In the end, while we will try to address this critique, we will be limited by the reagents that are available to us.

      Minor Comments:

      __ How long do the Cas TcKO with the various cre driver survive?__

      We have not systematically quantified survival beyond 6 months, but surprisingly, survival up to 6 months of age appears normal for all genotypes examined. This information will be included in the Methods section.

      Reviewer #3

      We thank this reviewer for acknowledging that our "main claims and conclusions are solidly supported by the data" with "good overall data quality" and "high quantitative and statistical rigor." We appreciate their recognition that we "uncover novel functions for Cas family genes in forebrain axon organization" and that our "overall reporting and discussion of findings is data-driven and refrains from excessive speculation."

      Addressing Concerns About Novelty and Impact:

      We respectfully disagree with the characterization of our findings as "somewhat incremental." While we acknowledge that similar axonal defects have been described in other lamination mutants, our study makes several novel and significant contributions:

      First demonstration of Cas family requirement in forebrain axon tract development: This is the first study to establish roles for Cas proteins in axon guidance, representing a completely new function for these well-studied signaling molecules. Novel β1-integrin-independent role for Cas proteins: Our finding that AC defects occur in Cas mutants but not β1-integrin mutants reveals a previously unknown signaling pathway and challenges the assumption that Cas proteins always function downstream of β1-integrin. Mechanistic insights into cortical-TCA interactions: While the general principle that cortical lamination affects TCA projections has been established, our work provides the first demonstration of how specific adhesion signaling molecules (Cas proteins) control this process through radial glial function. Cell-type specific requirements: Our systematic analysis using multiple Cre drivers provides unprecedented detail about where and when Cas proteins function during brain development, revealing both neuronal-autonomous (AC) and non-neuronal autonomous (TCA) roles.

      As Reviewer #2 noted, "The main advancement is a more nuanced understanding of where and when these molecules function during brain development and insights into the origin of the defects observed." This represents significant mechanistic progress in understanding forebrain circuit assembly.

      Specific Comments:

      Suggestion about cell autonomy testing: We appreciate the optional suggestion to test strict cell autonomy using sparse deletion approaches. While this would indeed be interesting, it would represent a substantial undertaking beyond the scope of the current study. However, we believe our current data using NexCre (which hits early postmitotic neurons) versus NestinCre (CNS-wide deletion) and Emx1Cre (cortical progenitors) provides supportive evidence for neuronal autonomy of the AC phenotype, as mentioned by the reviewer.

      In vitro axon guidance assays: This is an excellent suggestion for future molecular studies. In the discussion we identify specific candidate guidance molecules (e.g. Ephrins) that would be prime targets for such experiments.

      Cross-Reviewer Comments:

      We appreciate Reviewer #3's agreement with the other reviewers' suggestions and will address the quantification of neuronal mispositioning/axon bundle correlation as requested by Reviewer #1.

      Additional Improvements:

      Beyond addressing the specific reviewer comments, we will make several additional improvements to strengthen the manuscript:

      Enhanced statistical analysis: All quantifications will include appropriate statistical tests with clearly stated n values and multiple litters represented. Expanded discussion: We will better contextualize our findings within the broader axon guidance literature and discuss future directions (e.g. TCAs). New data: Additional controls, expression analysis, and quantifications will strengthen our conclusions.

      We believe these revisions, particularly the new experimental data addressing integrin family expression and the detailed quantification of phenotype coordination, will significantly strengthen our conclusions and demonstrate the novelty and impact of our findings. We hope the reviewers will find these improvements satisfactory and agree that our work makes important contributions to understanding axon guidance mechanisms in the developing forebrain.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript by Estep et al., the authors use conditional in vivo mouse genetics to study roles for Cas family intracellular adaptor proteins in forebrain axon tract development. They report two phenotypes after simultaneous nervous system-wide deletion of three Cas family genes - (1) defasciculation and misprojection of anterior commissure axons and (2) ectopic formation of thalamocortical axon bundles that penetrate the cortex. Further investigation using specific Cre lines and other conditional knockout alleles demonstrates that the anterior commissure defect results from a requirement for Cas genes in cortical projection neurons, whereas thalamocortical axons are misguided due to Cas functional requirements in cortical lamination, as ectopic axon bundles are confined to sites of disrupted cortical layer formation. Overall, this study uncovers novel functions for Cas family genes in forebrain axon organization, one of which likely reflects a direct role in axon guidance and/or fasciculation, while another one is indirect and based in the previously established role of Integrin-Cas signaling in radial glia organization and cortical neuron migration.

      The main claims and conclusions of the paper are solidly supported by the data. The study is fairly descriptive in nature, being limited to in vivo analyses of Cas expression patterns and characterization of the knockout phenotypes, and does not uncover novel molecular mechanisms for axon guidance, but it also does not attempt to make any claims to that effect. The overall reporting and discussion of findings is data-driven and refrains from excessive speculation, which is commendable. The overall data quality is good, and data organization and presentation are clear. Quantitative and statistical rigor are high.

      The characterization of Itgb1 knockout animals and various conditional Cas knockouts provides strong evidence that the thalamocortical axon phenotypes are simply a secondary consequence of cortical disorganization, as they strictly segregate with defects in cortical lamination.

      The requirement for Cas genes in anterior commissure axon organization is accurately reported as "neuronal-autonomous", but not as cell-autonomous. It would be interesting, yet not essential (i.e. this suggestion is optional), to test for strict cell autonomy by sparsely deleting Cas family genes in a subset of the neurons that project axons through the anterior commissure and analyzing the projection patterns of Cas mutant and control neurons in such a genetic mosaic side by side.

      In the discussion, the authors highlight a few of the axon guidance signaling pathways that would be strong candidates for requiring Cas in the context of the anterior commissure. If the authors wanted to develop this idea further, they should consider using in vitro axon guidance assays to study the requirement for Cas function in the axonal response to these candidate guidance molecules.

      Referee Cross-commenting

      I generally agree to the comments by reviewers 2 and 3. I especially like reviewer 1's suggestion to provide quantitative support for the correlation between sites of neuronal mispositioning and sites of ectopic axon bundle emergence in the cortex. I also agree with that reviewer's idea to box regions in micrographs that are shown in high-magnification panels.

      Significance

      The strengths of the study lie in its simplicity and limited scope, yet so do its weaknesses. The authors uncover requirements for Cas genes in axon tract organization, but mechanistic insights are extremely limited. On the plus side, the authors refrain from excessive speculation and stay very close to the data in the interpretation and discussion of their findings.

      The reported findings are novel, at least to some extent. The same group had previously established the Itgb1-Cas signaling axis as an important regulator of cortical architecture, and results presented here document a thalamocortical axon guidance phenotype that results from defective cortical lamination. Similar axonal defects have been described in other mouse models with lamination phenotypes, and these studies are cited in the manuscript at hand. So while the study is not first to show this interplay between cortical neuronal positioning and thalamocortical axon organization, it does add to the growing body of evidence for this phenomenon. As for the anterior commissure defect, the study is first to establish a role for Cas family genes in development of this axon tract, but beyond evidence that this might be a neuronal-autonomous requirement (but see earlier comment), it does not provide any mechanistic insights into this Cas function. Had the authors identified an actual signaling pathway for axon guidance or bundling that is mediated by Cas proteins and explains their requirement for anterior commissure formation, this study would be a lot more impactful. In its current form, however, the limited genetic and functional insights from this manuscript will largely be of interest to a specialized audience. The overall advance provided by this work is somewhat incremental.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this very interesting study, Estep and colleagues investigate the role of Cas family members (p130Cas, CasL/Nedd9 and Sin/Efs) by generating triple conditional mutant (Cas TcKO) mice to investigate to role in the developing brain (E14.5 - P0), focusing on thalamocortical axons (TCA) and the anterior commissure (AC). For visualization of p130Cas expressing neurons, the p130Cas-EGFP-BAC allele was used. This revealed EGFP (p130Cas) expression in all major cortical tracts and overlap with L1 distribution. Conditional ablation using Nestin-cre (Nes-cre;TcKO) revealed defects in the AC and the external capsule (EC). In addition, these mice show pathfinding defects resulting cortical bundles (CBs) in white matter within the cortical plate. Evidence is provided that these CBs originate from TCA afferents. To assess the cell autonomy of these phenotypes, Emx1-cre;TcKO and Nex-Cre;TcKO mice were generated. Analysis of these mice revealed that Cas genes function in cortical neurons is required for proper TCA development. Nex-cre;TcKO mice only replicated the AC phenotypes observed in Nestin-cre;TcKO mice. Moreover, evidence is provided that proper development of TCA afferents requires non-neuronal functions of Cas genes. Because Cas proteins function downstream of integrins, including beta1-integrin, Itgb1 cKO mice (using Emx-cre or Nex-cre) to examine similarities to Cas TcKO mice. Indeed, Emx-cre;Itgb1 cKO mice phenocopy CB defects observed in the Emx-cre; CasTcKO, while Nex-cre;Itgb1 mutants do not, and neither of the Itgb1 mutants phenocopied the Cas TcKO defects in the AC. Correlative evidence is provided that CBs observed in Cas TcKO mutants originate from disorganization of the subplate.

      Overall, this manuscript is well written, and most of the data presented are of high quality. It is also clear that a great deal of effort was put into the experiments, however some issues were identified, and the authors should address them to further clarify and strengthen the work.

      Major comments:

      1. Immunofluorescence labeling for other b-integrin family members to examine expression in AC axons may provide insights into why b1-integrin deficiency does not replicate the Cas TcKO phenotype.
      2. Is there any evidence that b1-integrin in developing cortical axons is colocalized with Cas proteins (in vivo or in vitro)

      Minor comments:

      1. How long do the Cas TcKO with the various cre driver survive?

      Significance

      Elucidation of molecular mechanisms of axon pathfinding and brain wiring in vivo. Using state-of-the art mouse genetics; this includes genetic labeling of specific axon tracts, generation of compound mutants in a cell type specific manner and gene products that are thought to function in the same pathway. This was confirmed for some fiber systems, but not for others. The findings presented are convincing and the manuscript is well written. The guidance molecules investigated are not novel and have been analyzed previously, however not with the same rigor or the use of compound mutants. The main advancement is a more nuanced understanding of where and when these molecules function during brain development and insights into the origin of the defects observed. Of broad interest for the community of axon guidance, circuit formation and brain development. I have been studying molecules that regulate axon guidance, growth and regeneration for the past 20+ years.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript by Estep et al., the authors studied Cas proteins expressed during brain development, particularly during the formation of the anterior commissure (AC) and thalamocortical (TCA) projection, using conditional alleles in mice, immunohistochemistry, and a combination of lipophilic axon tracers or genetically encoded fluorophores that mark cells that have expressed Cre. They found that Cas proteins were required for proper guidance of TCA projections and fasciculation of posterior AC axons using broad deletion of Cas gene function by crossing Cas TcKO animals with Nestin-Cre mice- CNS-wide deletions in both neuronal and glial populations, results in axon misprojection and aberrant cortical bundles, of both AC and TCA. With a time course, they find these axon misguidance phenotypes appear at different developmental time points, with AC defasciulation not apparent until e18.5, whereas aberrent TCA Cortical bundles were detected already at e14.5, and increasing over developmental time.

      They then go on to use more specific Cre drivers, EMX cre (RGCs, excitatory neurons, mqcroglia in cortex), vs Nex Cre early postmitotic neurons in cortex. EMX cre, still shows TCA defects, even though cas not knocked out of tca axons, suggesting cortical autonomous expression of cas, affects these axons. Because Nex Cre mice didn't show this phenotype, this suggested that the TCA phenotype was cortical-autonomous but not neuronal-autonomous, with mis-projecting TCA processes (cortical bundles) closely associating with misplaced subplate and deep layer neurons. cortical- and neuronal autonomous role for Cas genes during AC fasciculation showed that defasciculating AC axons originated from the dorsolateral cortex. Defects in AC fasciculation were dissimilar to β1-integrin mutants, suggesting that Cas proteins can act independently of β1-integrin during AC formation.

      Overall, these data indicate a requirement for Cas family genes during cortical white matter tract formation. The experiments are carefully done and quantified, and the phenotypes are clear and striking, and support most of the conclusions in the manuscript. I only suggest a few points for clarification.

      Authors clearly show that misplaced TCA axons are coordinate with cortical layer defects, with misplaced tbr1 + neurons, in EMX-Cre cas and integrin knockouts, suggesting these axons are following misplaced cells. These results are described as 100% coordinate, but since there is no figure of quantification, authors need to clarify how many embryos were examined for each genotype, as this was not described in results or legends.

      Are the neurons not misplaced in Nex cre cas or integrin knockouts? One would think presumably not, but then what are the tbr1+ cell migration defect caused by? I struggle a with the semantics of non-neuronal autonomous role of cas in cortex, since tbr1+ neurons are misplaced, and this is what the axons are mistargeting too. So yes, potentially cas or b1 is not needed in those neurons, but those misplaced neurons are presumably driving the phenotype.

      Authors need to clarify in the discussion that they can't rule out the cas not also needed in tca neurons, Since neither emx or nex cre would hit those cells.

      Could authors add boxes in zoomed out brain images to to denote zoom regions. And potentially a schematic demonstrating placement of DiI for lipophilic tracing experiments.

      Significance

      The study demonstrates the different requirement for Cas proteins and b1 integrin in different cell populations for appropriate white matter tract formation, and further supports the that cortical layering helps direct TCA projections. It also provides evidence for a b1 integrin independent role for cas proteins. The authors nicely discuss this with several alternative upstream receptors that may be involved detailed, that set the stage for future work, but this would be quite a large endeavor. I would add that it is unclear if cas proteins are needed in the TCA neurons, as they did not use a cre driver that would clarify this. The final limitation I will mention is that EM would likely be required to demonstrate a role for fasiculation, but this also seems beyond this original manuscript. This study will be of interest to developmental neurobiologists and the axon guidance and adhesion fields.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      In their manuscript de las Mercedes Carro et al investigated the role of Ago proteins during spermatogenesis by producing a triple knockout of Ago 1, 3 and 4. They first describe the pattern of expression of each protein and of Ago2 during the differentiation of male germ cells, then they describe the spermatogenesis phenotype of triple knockout males, study gene deregulation by scRNA seq and identify novel interacting proteins by co-IP mass spectrometry, in particular BRG1/SMARCA4, a chromatin remodeling factor and ATF2 a transcription factor. The main message is that Ago3 and 4 are involved in the regulation of XY gene silencing during meiosis, and also in the control of autosomal gene expression during meiosis. Overall the manuscript is well written, the topic, very interesting and the experiments, well-executed. However, there are some parts of the methodology and data interpretation that are unclear (see below).

      Major comments

      1= Please clarify how the triple KO was obtained, and if it is constitutive or specific to the male germline. In the result section a Cre (which cre?) is mentioned but it is not mentioned in the M&M. On Figure S1, a MICER VECTOR is shown instead of a deletion, but nothing is explained in the text nor legend. Could the authors provide more details in the results section as well as in the M&M ? This is essential to fully interpret the results obtained for this KO line, and to compare its phenotype to other lines (such as lines 184-9 Comparison of triple KO phenotype with that of Ago4 KO). Also, if it is a constitutive KO, the authors should mention if they observed other phenotypes in triple KO mice since AGO proteins are not only expressed in the male germline.

      Response: We apologize for omitting this vital information. We have now incorporated a more detailed description of how the Ago413 mutant was created in the results and M&M sections (line 120 and 686 respectively).

      As mentioned in the manuscript, Ago4, Ago1 and Ago3 are widely expressed in mammalian somatic tissues. Mutations or deletions of these genes does not disrupt development; however, there is limited research on the impact of these mutations in mammalian models in vivo. In humans, mutations in Ago1 and Ago3 genes are associated with neurological disorders, autism and intellectual disability (Tokita, M.J.,et al. 2015- doi: 10.1038/ejhg.2014.202., Sakaguchi et al. 2019- doi: 10.1016/j.ejmg.2018.09.004, Schalk et al 2021- doi: 10.1136/jmedgenet-2021-107751). In mouse, global deletion of Ago1 and Ago3 simultaneously was shown to increase mice susceptibility to influenza virus through impaired inflammation responses (Van Stry et al 2012- doi.org/10.1128/jvi.05303-11). Studies performed in female Ago413 mutants (the same mutant line used herein) have shown that knockout mice present postnatal growth retardation with elevated circulating leukocytes (Guidi et al 2023- doi: 10.1016/j.celrep.2023.113515). Other studies of double conditional knockout of Ago1 and Ago3 in the skin associated the loss of these Argonautes with decreased weight of the offspring and severe skin morphogenesis defects (Wang et al 2012- doi: 10.1101/gad.182758.111). In our study, we did not observe major somatic or overt behavioral phenotypes, and we did not observe statistical differences in body weights of null males compared to WT as shown in figure below.

      2= The paragraph corresponding to G2/M analysis is unclear to me. Why was this analysis performed? What does the heatmap show in Figure S4? What is G2/M score? (Fig 2D). Lines 219-220, do the authors mean that Pachytene cells are in a cell phase equivalent to G2/M? All this paragraph and associated figures require more explanation to clarify the method and interpretation.

      __Response: __We have modified the methods to include more information about how the cell cycle scoring used in Figures 2D and S4 were calculated and will add more information regarding the interpretation of these figures.

      3= I have concerns regarding Fig2G: to be convincing the analysis needs to be performed on several replicates, and, it is essential to compare tubules of the same stage - which does not seem to be the case. This does not appear to be the case. Besides, co (immunofluorescent) staining with markers of different cell types should be shown to demonstrate the earlier expression of some markers and their colocalization with markers of the earlier stages.

      __Response: __We agree with the Reviewer. New images with staged tubules will be added to the analysis of Figure 2G.

      4= one important question that I think the authors should discuss regarding their scRNAseq: clusters are defined using well characterized markers. But Ago triple KO appears to alter the timing of expression of genes... could this deregulation affects the interperetation of scRNAseq clusters and results?

      __Response: __We thank the reviewer for this suggestion and agree that including this information is important. We expect that, at most, this dysregulation impacts the edges of these clusters slightly. Given that marker genes that have been used to define cell types in these data are consistently expressed between the knockout and wildtype mice (see Figure S4A), we do not think that the cells in these clusters have different identities, just dysregulated expression programs. We have added the relevant sentence to the discussion, and will include additional supplemental figure panels to document this point more comprehensively.

      5= XY gene deregulation is mentioned throughout the result section but only X chromosome genes seem to have been investigated.... Even the gene content of the Y is highly repetitive, it would be very interesting to show the level of expression of Y single copy and Y multicopy genes in a figure 3 panel.

      __Response: __We agree with the reviewer that including analysis of Y-linked genes is important. We will add a supplemental figure which includes the Y:Autosome ratio and differential expression analysis.

      6= Can the authors elaborate on the observation that X gene upregulation is visible in the KO before MSCI; that is in lept/zygotene clusters (and in spermatogonia, if the difference visible in 3A is significant?)

      Response: We do see that X gene expression is upregulated before pachynema. Previous scRNA-seq studies that have looked at MCSI have seen that silencing of genes on the X and Y chromosomes starts before the cell clusters that are defined as pachynema, though silencing is not fully completed until pachynema. We have clarified this point in the manuscript.

      7 = miRNA analysis: could the authors indicate if X encoded miRNA were identified and found deregulated? Because Ago4 has been shown to lead to a downregulation of miRNA, among which many X encoded. It is therefore puzzling to see that the triple KO does not recapitulate this observation. Were the analyses performed differently in the present study and in Ago4 KO study?

      __Response: __The analysis identifying downregulation of miRNA in the original Ago4 mutant analysis was conducted relative to total small RNA expression. Amongst those altered miRNA families in the Ago4 mutants, we demonstrated both upregulation and downregulation of miRNA. We agree that confirming a similar global downregulation of miRNA counts compared to other small RNAs is important. Therefore, in a revised manuscript, we will add this information to the miRNA analysis section, especially highlighting the X chromosome-associated miRNAs, as well as whether the ratios between other small RNA classes change.

      8 = The last results paragraph would also benefit from some additional information. It is not clear why the authors focused on enhancers and did not investigate promoters (or maybe they were but it's unclear). Which regions (size and location from TSS) were investigated for motif enrichment analyses? To what correspond the "transcriptional regulatory regions previously identified using dREG" mentioned in the M&M? I understand it's based on a previous article, but more info in the present manuscript would be useful.

      Response: We thank the reviewer for this suggestion. The regions that were used for motif enrichment will be included as a supplementary information in the fully revised manuscript. We have also clarified in the methods that these transcriptional regulatory regions were downloaded from GEO and obtained from previous ChRO-seq data (from GEO) analysis. These data are run through the dREG pipeline that identifies regions predicted to contain transcription start sites, which include promoters and enhancers.

      Minor comments

      1) In the introduction: The sentence "Ago1 is not expressed in the germline from the spermatogonia stage onwards allowing us to use this model to study the roles of Ago4 and Ago3 in spermatogenesis." is misleading because Ago1 is expressed at least in spermatogonia; It would be more precise to write "after spermatogonia stage" and rephrase the sentence. Otherwise it is surprising to see AGO1 protein in testis lysate and it is not in line with the scRNA seq shown in figure 2.

      __Response: __We agree with the Reviewers suggestion and have edited the sentence on line 100. This sentence now reads "Ago1 is not expressed in the germline after the spermatogonia stage allowing us to use this model to study the roles of Ago4 and Ago3 in spermatogenesis".

      2) Could the authors precise if AGO proteins are expressed in other tissues? In somatic testicular cells?

      __Response: __Expression patterns of mammalian AGOs have been described in somatic and testicular tissues for the mouse by Gonzales-Gonzales et al (2008) by qPCR. They found that Ago2 is expressed in all the somatic tissues analyzed (brain, spleen, heart, muscle and lung) as well as the testis, with the highest expression in brain and lowest in heart. Ago1 is highly expressed in spleen compared to all the tissues analyzed, while Ago3 and Ago4 showed highest expression in testis and brain. Within somatic tissues of the testis, the four argonautes are expressed in Sertoli cells, however, Ago1,3 and 4 expression is very low compared to Ago2, with the latter showing a 10-fold higher transcript level. We have included a sentence with this information in the introduction in line 89.

      3) Pattern of expression: How do the authors explain that AGO3 disappears at the diplotene stage and reappears in spermatids?

      __Response: __ Single cell RNAseq data in the germline shows reduced transcript for Ago3 from the Pachytene stage onwards, suggesting minimal if any new transcription in round spermatids. We hypothesize that the AGO3 protein present in the round spermatid stage is cytoplasmic, presumably coming from the pool of AGO3 in the chromatoid body, a cytoplasmic structure with functional association with the nucleus in round spermatids (Kotaja et al, 2003 doi: 10.1073/pnas.05093331).

      4) It would be useful to show the timing of expression of AGO 1 to 4 throughout spermatogenesis in the first paragraph of the article. Maybe the authors could present data from fig2B earlier?

      Response: We understand the Reviewers concern, however, given that Ago expression throughout spermatogenesis was obtained from scRNA seq, we consider that this data should be presented after introducing the Ago413 knockout and the scRNA seq experiment. As Ago1-4 expression was also described in an earlier manuscript by Gonzales-Gonzales et al in the mouse male germline, and our data aligns with this report, we included a sentence about these previous findings in the earlier results section.

      5) Line 190: please modify the sentence "reveal no differences in cellular architecture of the seminiferous tubules when compared to wild-type males" to " reveal no gross differences..." since even without quantification of the different cell types it is visible that KO seminiferous tubules are different from WT tubules.

      __Response: __We agree with the reviewer, and we modified line 190 (now 173) as suggested. Grossly, seminiferous tubules from Ago413 null males contain the same cell types as in wild type tubules, including spermatozoa. However, our studies show that the number and quality of germ cells is compromised in knockouts, as shown by sperm counts and TUNEL staining.

      6) TUNEL analysis: please stage the tubules to determine the stage(s) at which apoptosis is the most predominant.

      __Response: __We have complied with the reviewer suggestion. Figure 1G now shows staged seminiferous tubules, and we have replaced the wild type image for one where the staged tubules match the knockout image.

      7) Figure S4B does not show an increase of cells at Pachytene stage but at Lepto/zygotene stage (as well as an increase of spermatogonia). Please comment this discrepancy with results shown in Fig2.

      __Response: __Figures 2 and S4 show distribution of cells in different substages of spermatogenesis and prophase I measured with very different methods: a cytological approach using chromosome spreads cells vs a transcriptomic approach that involves clustering of cells. We attribute the differences in cell type distribution to differences in the sensitivity of the methods to identify each cell type and therefore identify differences between the number of cells for each group. Moreover, our scRNA-seq data groups the leptotene and zygotene stages together, while the cytological approach allows for separation of these two sub-stages. Importantly, both results show that Ago413 spermatocytes are progressing slower from pachynema into diplonema and/or are dying after pachynema, as stated in line 194 in our manuscript.

      8) Fig5H and 5I are not mentioned in the result section. Also, it would be useful to label them with "all chromosomes" and "XY" to differentiate them easily

      __Response: __We apologize for the omission and have now cited Figures 5H and 5I in the manuscript (line 453). We have added the suggested labels.

      9) Line 530 "data provide further evidence for a functional association between AGO-dependent small RNAs and heterochromatin formation, maintenance and/or silencing." Please rephrase, the present article does not really show that AGO nuclear role depends on small RNAs.

      __Response____: __We agree with the reviewer that these data do not directly show a dependence on small RNAs. As our identified localization of AGO proteins to the pericentric heterochromatin coincides with localization of DICER shown previously by Yadav and collaborators (2020, doi: 10.1093/nar/gkaa460), we do believe that our data further implicates small RNAs in the silencing of heterochromatin. Yadav et al shows that DICER localizes to pericentromeric heterochromatin and processes major satellite transcripts into small RNAs in mouse spermatocytes, and cKO germ cells have reduced localization of SUV39H2 and H3K9me3 to the pericentromeric heterochromatin. Given the colocalization of both small RNA producing machinery and AGOs at pericentromeric heterochromatin, the AGOs may bind these small RNAs, and the statement in line 530 refers to how our results provide evidence for the involvement of other RNAi machinery in the silencing of pericentromeric heterochromatin investigated by Yadav et al which likely includes small RNAs.

      To clarify this point, we have modified the text accordingly.

      10) Line 1256: replace "cite here " by appropriate reference

      __Response: __The reference was added to line 1256.

      11) Please use SMARCA4 instead of BRG1 name as it is its official name.

      __Response: __We have replaced BRG1 with SMARCA4 in the text and figures.

      Figures:

      Figure 1: Are the pictures shown for Ago3-tagged and floxed from the same stages ? The leptotene stage in 1A looks like a zygotene, while some pachytene/diplotene stage pictures do not look alike.

      __Response: __New representative images have been added to figure 1 to match the same substages across the figure.

      Figure 1D, please label the Y scale properly (testis weight related to body weight)

      __Response: __We have fixed this.

      FigS1: Please comment the presence of non-specific bands in the figure legend

      __Response: __We have added a sentence in Figure S1 Legend.

      Fig 2E and F, please indicate on the figure (in addition to its legend), what are the X and Y axes respectively to facilitate its reading.

      __Response: __X and Y axes are now labelled in Figure 2E and F.

      2F: please use an easier abbreviation for Spermatocyte than Sp (which could spermatogonia, sperm etc..) such as Scyte I ? (same comment for Fig 3C)

      Response: The abbreviation for spermatocyte was changed from Sp to Scyte I in Figures 2 and 3.

      Overall, for all figures showing GSEA analyses, could the authors explain what a High positive NES and a High negative NES mean in the results section?

      Response: Thank you for this suggestion. We have added this information where the GSEA score of the cell markers is initially introduced.

      Significance

      Ago proteins are known for their roles in post transcriptional gene regulation via small RNA mediated cleavage of mRNA, which takes places in the cytoplasm. Some Ago proteins have been shown to be also located in the nucleus suggesting other non-canonical roles. It is the case of Ago4 which has been shown to localize to the transcriptionally silenced sex chromosomes (called sex body) of the spermatocyte nucleus, where it contributes to regulate their silencing (Modzelewski et al 2012). Interestingly, Ago4 knockout leads to Ago3 upregulation, including on the sex body indicating that Ago3 and Ago4 are involved in the same nuclear process. In their manuscript, de las Mercedes Carro et al., investigate the consequences of loss of both Ago3 and Ago4 in the male germline by the production of a triple knockout of Ago1, 3 and 4 in the mouse. With this model, the authors describe the role of Ago3 and Ago4 during spermatogenesis and show that they are involved in sex chromosome gene repression in spermatocytes and in round spermatids, as well as in the control of autosomal meiotic gene expression. Triple KO males have impaired meiosis and spermiogenesis, with fewer and abnormal spermatozoa resulting in reduced fertility. Since Ago1 male germline expression is restricted to pre-meiotic germ cells, it is not expected to contribute to the meiotic and postmeiotic phenotypes observed in the triple KO. The strengths of the study are i) the thorough analyses of mRNA expression at the single cell level, and in purified spermatocytes and spermatids (bulk RNAseq), ii) the identification of novel nuclear partners of AGO3/4 relevant for their described nuclear role: ATF2, which they show to also co-localize with the sex body, and BRG1/SMARCA4, a SWI/SNF chromatin remodeler. The main limitation of the study is the lack of information in the method regarding the production of the triple KO, as well as some aspects of the transcriptome and motif analyses. It is also surprising to see that the triple KO does not recapitulate the miRNA deregulation observed in Ago4 KO. The characterization of a non-canonical role of AGO3/4 in male germ cells will certainly influence researchers of the field, and also interest a broader audience studying Argonaute proteins and gene regulation at transcriptional and posttranscriptional levels.

      Reviewer #2

      Evidence, reproducibility and clarity

      In the manuscript titled "Argonaute proteins regulate the timing of the spermatogenic transcriptional program" by Carro et al., the authors present their findings on how Argonaute proteins regulate spermatogenic development. They utilize a mouse model featuring a deletion of the gene cluster on chromosome 4 that contains Ago1, Ago3, and Ago4 to investigate the cumulative roles of AGO3 and AGO4 in spermatogenic cells. The authors characterize the distribution of AGO proteins and their effects on key meiotic milestones such as synapsis, recombination, meiotic transcriptional regulation, and meiotic sex chromosome inactivation (MSCI). They analyze stage-specific transcriptomes in spermatogenic cells using single-cell and bulk RNA sequencing and determine the interactome of AGO3 and AGO4 through mass spectrometry to examine how AGO proteins may regulate gene expression in these cells during meiotic and post-meiotic development. The authors conclude that both AGO3 and AGO4 are essential for regulating the overall gene expression program in spermatogenic cells and specifically modulate MSCI to repress sex-linked genes in pachytene spermatocytes, which may be partially mediated by the proper distribution of DNA damage repair factors. Additionally, AGO3 is suggested to interact with the chromatin remodeler SWI/SNF factor BRG1, facilitating its removal from the sex-chromatin to enable the repression of sex-linked genes during MSCI.

      Major Comments: 1. The study utilized a triple knockout mouse model to determine the effect of AGO3 on spermatogenesis, following up on their previous report about the role of AGO4 in spermatogenesis, which resulted from an upregulation of AGO3 in Ago4-/- spermatocytes. However, the results are more difficult to interpret and ascertain the role of AGO3 in these cells, given the absence of any observable phenotype from Ago3 interruption. AGO4 regulates sex body formation, meiotic sex chromosome inactivation (MSCI), and miRNA production in spermatocytes, all of which were noted in the absence of both AGO3 and AGO4, with only an increased incidence of cells containing abnormal RNAPII at the sex chromosomes. It will be necessary to characterize how AGO3 regulates spermatogenic development, including meiotic progression and the regulation of the meiotic transcriptome, and compare these findings with the current observations to determine if the proposed mechanism involving AGO3, BRG1, and possibly AP2 is relevant in this context.

      __Response: __While we agree with Reviewer that a single Ago3 knockout will help understand distinct roles of AGO3 and AGO4 in spermatogenesis, the time and resources required to generate a new mouse model are substantial. The analysis included in this current manuscript has already taken over seven years, and with the lengthy production of a new single mutant mouse, validation of the new mouse, and then final analysis, we would be looking at another 3-5 years of analysis. In the current funding climate, and with strong concerns over ensuring reduction in utilization of laboratory mice, we consider this request to be far in excess of what is required to move this important story forward.

      The Ago413-/- mouse model has allowed us to associate a nuclear role of Argonaute proteins with a strong reproductive phenotype in the mouse germline. Given the redundancy between Ago3 and Ago4, it is likely that a single Ago3 knockout would have a mild phenotype just like the Ago4 KO. All this said, we agree with the reviewer that analysis of an Ago3 knockout mouse is a valuable next step, just not within this chapter of the story.

      1. Does Ago413-/- mice recapitulate the early meiotic entry phenotype observed in Ago4-/- mice? If not, could it be possible that AGO3 promotes meiotic entry, given its strong mRNA expression in spermatogonia according to the scRNAseq data (Fig. 2B)

      Response: Our scRNA-seq data shows strong expression of Ago3 in spermatogonia, as mentioned by the Reviewer. Analysis of cell cycle marker expression also shows that the transcriptomic profile of spermatogonia is altered, with higher levels of transcripts corresponding to the later G2/M stages (Figure 2D). Moreover, Ago413 knockouts present an increase in the number of spermatogonial stem cells (Supplementary Figure S4B). However, this cluster represents a pool of quiescent and mitotically active cells entering meiosis, therefore interpretation of these data might be challenging. While specific experiments could be conducted to answer this question, this is outside of the scope of our manuscript. The manuscript as it stands is already rather large, and a full analysis of meiotic entry dynamics would dilute the core message relating to chromatin regulation in the sex body.

      1. The authors suggested that the removal of BRG1 by AGO3 is necessary during sex body formation and the eventual establishment of MSCI. However, the BAF complex subunit ARID1A has been shown to facilitate MSCI by regulating promoter accessibility. It will be interesting to determine how BRG1 distribution changes across the genome in the absence of AGO proteins and how that correlates with alterations in sex-linked gene expression.

      __Response: __We agree that changes in BRG1 distribution across the genome would be very interesting to identify. However, in this work we show that BRG1/SMARCA4 protein changes its localization in the sex body very rapidly between early to late pachynema. These two substages are only discernable by immunofluorescence using synaptonemal complex markers, as there are currently no available techniques to enrich for these subfractions. Therefore, study of genome occupancy of BRG1 in these specific substages by techniques such as CUT&Tag are not currently possible. However, we are currently working on new methods to distinguish these cell populations and hope eventually to use these purification strategies to perform the studies suggested by this reviewer. Alternatively, the hope is that single cell CUT&Tag methods will become more reliable, and will enable us to address these questions. Both of these options are not currently available to us. The studies by Menon et al (2024-doi:10.7554/eLife.88024.5) provide strong evidence to support that ARID1A is needed to reduce promoter accessibility of XY silenced genes in prophase I through modulation of H3.3 distribution. However, this mechanism and our identification of the removal of BRG1 between early and late pachytema are not inconsistent with one another, as either SMARCA4 or SMARCA2 can associate with ARID1A as part of the cBAF complex, and ARID1A is also not in all forms of the BAF complex which BRG1 are in. The difference between our results and those seen in Menon et al likely indicate that there are multiple forms of the BAF complex which are differentially regulated during MSCI and play different roles in silencing transcription. Further studies of specific BAF subunits are needed to elucidate how different flavors of the BAF complex act at specific genomic locations and meiotic time points.

      1. The observations presented in this manuscript (Fig. 1D, 2C, 3D, and 4) suggest a haploinsufficiency of the deleted locus in spermatogenic development. How does this compare with the ablation of either Ago3 or Ago4? Please explain.

      Response: Our previous studies in single Ago4 knockouts did not present a heterozygous phenotype (Modzelewski et al 2012, doi: 10.1016/j.devcel.2012.07.003, data not shown). Triple Ago413 knockouts show a much stronger fertility phenotype than single Ago4 knockout. Testis weight of Ago413 homozygous null present a 30% reduction while heterozygous mice show a 15% reduction (Figure 1D), comparable to the 13% reduction previously observed in Ago4-/- males. Sperm counts of Ago413 null and heterozygous males are reduced by 60% and 39% compared to wild type (Figure 1E), respectively, whereas Ago4 null mice have a milder phenotype, with only a 22% reduction in sperm counts. At the MSCI level, both homozygous and heterozygous Ago413 mutant spermatocytes show a similar increase in pachytene spermatocytes with increased RNA pol II ingression into the sex body with respect to wild-type of 35% and 30%, respectively. Ago4 single knockouts show an almost 18% increase in Pol II ingression when compared to wild type. These comparisons are now included in our manuscript in lines 170, 172 and 288. A milder phenotype of the Ago4 knockout and haploinsufficiency in triple Ago413 knockouts but not in Ago4 single knockouts is likely a consequence of the overlapping functions of Ago3 and Ago4 in mammals (and/or overexpression of Ago3 in Ago4 knockouts). In the context of their role in RISC, Wang et al (doi: 10.1101/gad.182758.111) studied the effects of single and double conditional knockouts for Ago1 and Ago2 in miRNA-mediated silencing. They discovered that the interaction between miRNAs and AGOs is highly correlated with the abundance of each AGO protein, and only double knockouts presented an observable phenotype.

      Minor Comments: Based on the interactome analysis, it was argued that AGO3 and AGO4 may function separately. Please discuss how AGO3 might compensate for AGO4 (Line 109).

      Response: We hypothesize that the combined function of AGO3 and AGO4 is needed for proper sex chromosome inactivation during meiosis. We base this hypothesis on the facts that (i) both proteins localize to the sex body in pachytene spermatocytes, (ii) loss of Ago4 leads to upregulation of Ago3, and (iii) the MSCI phenotype of Ago413 knockout mice is much stronger than the single Ago4 knockout (see above). However, AGO3 and AGO4 might not induce silencing through the same mechanism or pathway. In this work, we observed that their temporal expression in prophase I is different; while AGO3 protein seems to disappear by the diplotene stage, AGO4 is present in the sex body of these cells. Moreover, the proteomic analysis revealed a very low number of common interactors, an observation which could support the idea of AGO3 and AGO4 acting by different (albeit perhaps related) mechanisms to achieve MSCI. It is also possible that common interactors were not identified in our proteomic analysis due to the low abundance of AGO3 and AGO4 in the germ cells, limiting the resolution of the proteomics analysis (note that in order to visualize AGO proteins in WB experiments, at least 60 μg of enriched germ cell lysate must be loaded per lane). Moreover, given the difficulty in obtaining enough isolated pachytene and diplotene spermatocytes to perform immunoprecipitation experiments, we performed IP experiments in whole germ cell lysates, which limits the interpretation of our analysis. If AGO3 and AGO4 protein interactors overlap, then AGO3 would directly substitute for AGO4 leading to silencing in single Ago4 knockouts. However, if AGO3 and AGO4 work together through different, complementary mechanisms, then Ago4 mutant mice likely compensates loss of Ago4 by upregulation of Ago3along with specific interactors of the given pathway. We have added a sentence addressing this matter in line 411 of the results section and lines 506 and 513 of the discussion in the revised manuscript.

      In Line 221, it is unclear what is meant by 'cell cycle transcripts'. Does this refer to meiotic transcripts? It is also important to discuss the relevance of the G2/M cell cycle marker genes at later stages of meiotic prophase.

      Response: Thank you for this suggestion. We have changed the relevant text to remove redundancies and include more information. We agree that considering the importance of these genes across meiotic prophase is needed, as cells which are in the dividing stage will already have produced the proteins necessary for division. These cells likely correspond to the diplotene/M cluster cells that have a lower G2/M score, potentially causing the bimodal distribution seen in Figure 2D. We have added a sentence addressing this to the manuscript.

      While identified as a common interactor of both AGO3 and AGO4 in lines 440-445, HNRNPD is not listed among AGO4 interactors in Table S6. Please correct or explain this discrepancy.

      Response: HNRPD was originally identified as an AGO4 interactor using a less strict criteria than the one used in our manuscript: we required consistent enrichment in at least two rounds of IP MS experiments. This reference to HNRNPD was a mistake, given that HNRPD was only enriched in one of our three replicates. Thus, we apologize and have removed the sentence in lines 440-445.

      It is unclear whether wild-type cell lysate or lysate containing FLAG-tagged AGO3 was used for BRG1 immunoprecipitation, and which antibody was used to detect AGO3 in the BRG1 IP sample. A co-IP experiment demonstrating interaction between BRG1 and wild-type AGO3 would be ideal in this context. Furthermore, co-localization by IF would be beneficial to determine the subcellular localization and the cell stages the interaction may be occurring. Additionally, co-IP and Western blot methodologies should be included in the methods section.

      __Response: __MYC-FLAG tagged AGO3 protein lysates were used for BRG1 Co-Immunoprecipitation, along with an anti MYC antibody to detect AGO3. This is now detailed in the Methods section of our revised manuscript (line 1133).

      Regarding BRG1 and AGO3 colocalization by IF, we can confidently show that both AGO3 and BRG1 localize to the sex chromosomes in early pachynema by comparing BRG1/SYCP3 and FLAG-AGO3/SYCP3 stained spreads. We were not able to show colocalization simultaneously on the same cells, given the lack of appropriate antibodies. Our anti FLAG antibody is raised in mouse, while anti BRG1 is raised in rabbit, therefore a non-rabbit, non-mouse anti SYCP3 would be needed to identify prophase I substages, and our lab does not possess such a validated antibody. However, we now have access to a multiplexing kit that allows to use same-species antibodies for immunofluorescence and we can perform these experiments for a revised manuscript.

      __Response: __The methods section now includes description of co-IP methodologies (line 1132). Western Blot methodologies are explained in lane 718, under the "Immunoblotting" title.

      In line 599, it is unclear what is meant by 'persistence of sex chromosome de-repression'. Please correct or clarify this.

      Response: This sentence has been changed and reads: "The persistence of sex chromosome gene expression".

      If possible, please add an illustration to summarize the findings together.

      Response: We thank the reviewer for this suggestion, and have now added this in Figure 6

      Significance

      Overall, this study enhances the understanding of gene expression regulation by AGO proteins during spermatogenesis. Several approaches, including functional, histological, and molecular characterization of the triple knockout phenotype, were instrumental in elucidating the role of AGO proteins in MSCI and meiotic as well as postmeiotic gene regulation. The main limitation of the study is that it is challenging to appreciate the role of AGO3 in addition to the previously published role of AGO4 without the inclusion of necessary control groups. Furthermore, the mechanism of action for AGO proteins in meiotic gene regulation was left relatively unexplored. This study presents new findings that will be significant for the research community interested in gene regulation, chromatin biology, and reproductive biology with the above suggestions considered.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      The authors characterize a CRISPR-Cas9 mouse mutant that targets 3 genes that encode AGO family proteins, 2 of which are expressed during spermatogenesis (AGO3 and AGO4) and one that is said is not expressed, AGO1. This mouse mutant showed that AGO3 and AGO4 both contribute to spermatogenesis success as the "Ago413" mutation gave rise to an additive reduction in testis weight, due to spermatocyte apoptosis, and reduction in sperm count. Furthermore, they use insertion mouse mutants for Ago3 and Ago2 that express tagged versions of their corresponding proteins, which they use in combination with pan-AGO antibodies and Ago mutants to show differential expression and localization properties of AGO2, AGO3, and AGO4 (and the absence of AGO1) during spermatogenesis with a particular focus on meiotic prophase. They perform single-cell RNAseq and intricate analyses to demonstrate a change in distribution of meiotic stages in Ago413 mutants, and the overall cell cycle in spermatogonia and spermatocytes is altered. This analysis shows that the mutation leads to an inability to downregulate prior spermatogonia/spermatocyte stage transcripts in a timely manner. On the other hand, later-stage spermatocytes are abnormally expressing spermiogenesis genes. Similar to the Ago4 mutant previously characterized MSCI is disrupted. The authors also show that AGO3 has different interaction partners compared to AGO4 and focus their final assessment on a novel interaction partner of AGO3, BRG1. They show that this factor, which is involved in chromatin remodeling, is aberrantly localized to the sex body during meiotic prophase and diplonema. As BRG1 is involved in open chromatin, it is proposed that AGO3 restricts BRG1 (and related proteins) from the XY chromosome to ensure MSCI. Overall, this paper is very well constructed with mechanistic insights that make this a very impactful contribution to the research community. Major Comments:

      1. The abstract contains "Ago413-/- mouse" without any explanation of what that is. The abstract needs to be a stand-alone document that does not require any referencing for context.

      Response: We have included a sentence describing Ago413 in line 27

      Figure 2C. - The significance bars are confusing as they appear to overlap strangely.

      Response: We have modified this figure and now present the significance bars are on top of the data points.

      On line 235, the authors state that "we first identified the top non-overlapping upregulated genes for Ago413+/+ germ cells in each cluster. Why did the authors not also select down-regulated genes in each cluster to perform a similar analysis?

      __Response: __Thank you for this question. As our goal was to identify genes that are markers of the transcriptional program in each cell type, we used only uniquely upregulated genes for each cluster. Genes that are downregulated for a cluster may be indicative of the transcription in several other cell types, which is not easily interpretable. For a revised manuscript, we will perform this analysis to determine if there is any specific alterations in these downregulated genes.

      Their Ago413 mutant characterization does a good job of assessing meiotic prophase and spermatozoa. However, their assessment of the stages in between these is lacking (meiotic divisions and spermiogenesis).

      Response: We understand the reviewer's concern, however, it is not usual to study stages between the first meiotic division and spermiogenesis because meiosis II is so rapid and thus we lack tools to dissect it. In general, any defect that impacts meiosis I (and particularly prophase I) leads to cell death during prophase I or at metaphase I due to strictly adhered checkpoints that eradicate defective cells. Thus, the increased TUNEL staining in prophase I indicates to us that defective cells are cleared before exit from meiosis I, and those cells progressing to the spermatid stage are "normal" for meiosis II progression. For these cells that did complete meiosis I and progressed normally through meiosis II, we analyzed their spermiogenic outcome extensively (see section entitled "Post-meiotic spermatids from Ago413-/- males exhibit defective spermiogenesis and poor spermatozoa function"). This section included extensive sperm morphology, sperm motility and sperm fertility through in vitro fertilization assays. That said, we have added a sentence on line 268 to explain the transit through meiosis II.

      The discovery of the interaction between BRG1 and AGO3 is exciting. They should assess BRG1 localization in later sub-stages, including late diplonema and diakinesis.

      __Response: __BRG1(SMARCA4) was analyzed throughout prophase I, as shown in image 5G, including quantification of fluorescence intensity included the analysis of diplonema (5H-I). However, diakinesis was not included here since there was no observable signal of BRG1 in these cells. We have explained this in lines 459.

      ATF2 should have been assessed in more detail, as was done for BRG1 in Figure 5.

      __Response: __We agree with the Reviewer, however, staining of chromosome spreads with the anti ATF2 antibody was not possible in our hands after several attempts and changes in staining conditions. However, as staining of sections was successful, we showed localization of ATF2 on spermatocytes by co staining sections with SYCP3 and ATF2.

      Reviewer #3 (Significance (Required)): Overall, this paper is very well constructed with mechanistic insights, as described in my reviewer comments, that make this a very impactful contribution to the research community.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors characterize a CRISPR-Cas9 mouse mutant that targets 3 genes that encode AGO family proteins, 2 of which are expressed during spermatogenesis (AGO3 and AGO4) and one that is said is not expressed, AGO1. This mouse mutant showed that AGO3 and AGO4 both contribute to spermatogenesis success as the "Ago413" mutation gave rise to an additive reduction in testis weight, due to spermatocyte apoptosis, and reduction in sperm count. Furthermore, they use insertion mouse mutants for Ago3 and Ago2 that express tagged versions of their corresponding proteins, which they use in combination with pan-AGO antibodies and Ago mutants to show differential expression and localization properties of AGO2, AGO3, and AGO4 (and the absence of AGO1) during spermatogenesis with a particular focus on meiotic prophase. They perform single-cell RNAseq and intricate analyses to demonstrate a change in distribution of meiotic stages in Ago413 mutants, and the overall cell cycle in spermatogonia and spermatocytes is altered. This analysis shows that the mutation leads to an inability to downregulate prior spermatogonia/spermatocyte stage transcripts in a timely manner. On the other hand, later-stage spermatocytes are abnormally expressing spermiogenesis genes. Similar to the Ago4 mutant previously characterized MSCI is disrupted. The authors also show that AGO3 has different interaction partners compared to AGO4 and focus their final assessment on a novel interaction partner of AGO3, BRG1. They show that this factor, which is involved in chromatin remodeling, is aberrantly localized to the sex body during meiotic prophase and diplonema. As BRG1 is involved in open chromatin, it is proposed that AGO3 restricts BRG1 (and related proteins) from the XY chromosome to ensure MSCI. Overall, this paper is very well constructed with mechanistic insights that make this a very impactful contribution to the research community.

      Major Comments:

      1. The abstract contains "Ago413-/- mouse" without any explanation of what that is. The abstract needs to be a stand-alone document that does not require any referencing for context.
      2. Figure 2C. - The significance bars are confusing as they appear to overlap strangely.
      3. On line 235, the authors state that "we first identified the top non-overlapping upregulated genes for Ago413+/+ germ cells in each cluster. Why did the authors not also select down-regulated genes in each cluster to perform a similar analysis?
      4. Their Ago413 mutant characterization does a good job of assessing meiotic prophase and spermatozoa. However, their assessment of the stages in between these is lacking (meiotic divisions and spermiogenesis).
      5. The discovery of the interaction between BRG1 and AGO3 is exciting. They should assess BRG1 localization in later sub-stages, including late diplonema and diakinesis.
      6. ATF2 should have been assessed in more detail, as was done for BRG1 in Figure 5.

      Significance

      Overall, this paper is very well constructed with mechanistic insights, as described in my reviewer comments, that make this a very impactful contribution to the research community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript titled "Argonaute proteins regulate the timing of the spermatogenic transcriptional program" by Carro et al., the authors present their findings on how Argonaute proteins regulate spermatogenic development. They utilize a mouse model featuring a deletion of the gene cluster on chromosome 4 that contains Ago1, Ago3, and Ago4 to investigate the cumulative roles of AGO3 and AGO4 in spermatogenic cells. The authors characterize the distribution of AGO proteins and their effects on key meiotic milestones such as synapsis, recombination, meiotic transcriptional regulation, and meiotic sex chromosome inactivation (MSCI). They analyze stage-specific transcriptomes in spermatogenic cells using single-cell and bulk RNA sequencing and determine the interactome of AGO3 and AGO4 through mass spectrometry to examine how AGO proteins may regulate gene expression in these cells during meiotic and post-meiotic development. The authors conclude that both AGO3 and AGO4 are essential for regulating the overall gene expression program in spermatogenic cells and specifically modulate MSCI to repress sex-linked genes in pachytene spermatocytes, which may be partially mediated by the proper distribution of DNA damage repair factors. Additionally, AGO3 is suggested to interact with the chromatin remodeler SWI/SNF factor BRG1, facilitating its removal from the sex-chromatin to enable the repression of sex-linked genes during MSCI.

      Major Comments:

      The study utilized a triple knockout mouse model to determine the effect of AGO3 on spermatogenesis, following up on their previous report about the role of AGO4 in spermatogenesis, which resulted from an upregulation of AGO3 in Ago4-/- spermatocytes. However, the results are more difficult to interpret and ascertain the role of AGO3 in these cells, given the absence of any observable phenotype from Ago3 interruption. AGO4 regulates sex body formation, meiotic sex chromosome inactivation (MSCI), and miRNA production in spermatocytes, all of which were noted in the absence of both AGO3 and AGO4, with only an increased incidence of cells containing abnormal RNAPII at the sex chromosomes. It will be necessary to characterize how AGO3 regulates spermatogenic development, including meiotic progression and the regulation of the meiotic transcriptome, and compare these findings with the current observations to determine if the proposed mechanism involving AGO3, BRG1, and possibly AP2 is relevant in this context.

      Does Ago413-/- mice recapitulate the early meiotic entry phenotype observed in Ago4-/- mice? If not, could it be possible that AGO3 promotes meiotic entry, given its strong mRNA expression in spermatogonia according to the scRNAseq data (Fig. 2B) The authors suggested that the removal of BRG1 by AGO3 is necessary during sex body formation and the eventual establishment of MSCI. However, the BAF complex subunit ARID1A has been shown to facilitate MSCI by regulating promoter accessibility. It will be interesting to determine how BRG1 distribution changes across the genome in the absence of AGO proteins and how that correlates with alterations in sex-linked gene expression. The observations presented in this manuscript (Fig. 1D, 2C, 3D, and 4) suggest a haploinsufficiency of the deleted locus in spermatogenic development. How does this compare with the ablation of either Ago3 or Ago4? Please explain.

      Minor Comments:

      Based on the interactome analysis, it was argued that AGO3 and AGO4 may function separately. Please discuss how AGO3 might compensate for AGO4 (Line 109).<br /> In Line 221, it is unclear what is meant by 'cell cycle transcripts'. Does this refer to meiotic transcripts? It is also important to discuss the relevance of the G2/M cell cycle marker genes at later stages of meiotic prophase.<br /> While identified as a common interactor of both AGO3 and AGO4 in lines 440-445, HNRNPD is not listed among AGO4 interactors in Table S6. Please correct or explain this discrepancy. It is unclear whether wild-type cell lysate or lysate containing FLAG-tagged AGO3 was used for BRG1 immunoprecipitation, and which antibody was used to detect AGO3 in the BRG1 IP sample. A co-IP experiment demonstrating interaction between BRG1 and wild-type AGO3 would be ideal in this context. Furthermore, co-localization by IF would be beneficial to determine the subcellular localization and the cell stages the interaction may be occurring. Additionally, co-IP and Western blot methodologies should be included in the methods section. In line 599, it is unclear what is meant by 'persistence of sex chromosome de-repression'. Please correct or clarify this. If possible, please add an illustration to summarize the findings together.

      Significance

      Overall, this study enhances the understanding of gene expression regulation by AGO proteins during spermatogenesis. Several approaches, including functional, histological, and molecular characterization of the triple knockout phenotype, were instrumental in elucidating the role of AGO proteins in MSCI and meiotic as well as postmeiotic gene regulation. The main limitation of the study is that it is challenging to appreciate the role of AGO3 in addition to the previously published role of AGO4 without the inclusion of necessary control groups. Furthermore, the mechanism of action for AGO proteins in meiotic gene regulation was left relatively unexplored. This study presents new findings that will be significant for the research community interested in gene regulation, chromatin biology, and reproductive biology with the above suggestions considered.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their manuscript de las Mercedes Carro et al investigated the role of Ago proteins during spermatogenesis by producing a triple knockout of Ago 1, 3 and 4. They first describe the pattern of expression of each protein and of Ago2 during the differentiation of male germ cells, then they describe the spermatogenesis phenotype of triple knockout males, study gene deregulation by scRNA seq and identify novel interacting proteins by co-IP mass spectrometry, in particular BRG1/SMARCA4, a chromatin remodeling factor and ATF2 a transcription factor. The main message is that Ago3 and 4 are involved in the regulation of XY gene silencing during meiosis, and also in the control of autosomal gene expression during meiosis. Overall the manuscript is well written, the topic, very interesting and the experiments, well-executed. However, there are some parts of the methodology and data interpretation that are unclear (see below).

      Major comments

      1. Please clarify how the triple KO was obtained, and if it is constitutive or specific to the male germline. In the result section a Cre (which cre?) is mentioned but it is not mentioned in the M&M. On Figure S1, a MICER VECTOR is shown instead of a deletion, but nothing is explained in the text nor legend. Could the authors provide more details in the results section as well as in the M&M ? This is essential to fully interpret the results obtained for this KO line, and to compare its phenotype to other lines (such as lines 184-9 Comparison of triple KO phenotype with that of Ago4 KO). Also, if it is a constitutive KO, the authors should mention if they observed other phenotypes in triple KO mice since AGO proteins are not only expressed in the male germline.
      2. The paragraph corresponding to G2/M analysis is unclear to me. Why was this analysis performed? What does the heatmap show in Figure S4? What is G2/M score? (Fig 2D). Lines 219-220, do the authors mean that Pachytene cells are in a cell phase equivalent to G2/M? All this paragraph and associated figures require more explanation to clarify the method and interpretation.
      3. I have concerns regarding Fig2G: to be convincing the analysis needs to be performed on several replicates, and, it is essential to compare tubules of the same stage - which does not seem to be the case. This does not appear to be the case. Besides, co (immunofluorescent) staining with markers of different cell types should be shown to demonstrate the earlier expression of some markers and their colocalization with markers of the earlier stages.
      4. one important question that I think the authors should discuss regarding their scRNAseq: clusters are defined using well characterized markers. But Ago triple KO appears to alter the timing of expression of genes... could this deregulation affects the interperetation of scRNAseq clusters and results?
      5. XY gene deregulation is mentioned throughout the result section but only X chromosome genes seem to have been investigated.... Even the gene content of the Y is highly repetitive, it would be very interesting to show the level of expression of Y single copy and Y multicopy genes in a figure 3 panel.
      6. Can the authors elaborate on the observation that X gene upregulation is visible in the KO before MSCI; that is in lept/zygotene clusters (and in spermatogonia, if the difference visible in 3A is significant?)
      7. miRNA analysis: could the authors indicate if X encoded miRNA were identified and found deregulated? Because Ago4 has been shown to lead to a downregulation of miRNA, among which many X encoded. It is therefore puzzling to see that the triple KO does not recapitulate this observation. Were the analyses performed differently in the present study and in Ago4 KO study?
      8. The last results paragraph would also benefit from some additional information. It is not clear why the authors focused on enhancers and did not investigate promoters (or maybe they were but it's unclear). Which regions (size and location from TSS) were investigated for motif enrichment analyses? To what correspond the "transcriptional regulatory regions previously identified using dREG" mentioned in the M&M? I understand it's based on a previous article, but more info in the present manuscript would be useful.

      Minor comments

      1. In the introduction: The sentence "Ago1 is not expressed in the germline from the spermatogonia stage onwards allowing us to use this model to study the roles of Ago4 and Ago3 in spermatogenesis." is misleading because Ago1 is expressed at least in spermatogonia; It would be more precise to write "after spermatogonia stage" and rephrase the sentence. Otherwise it is surprising to see AGO1 protein in testis lysate and it is not in line with the scRNA seq shown in figure 2.
      2. Could the authors precise if AGO proteins are expressed in other tissues? In somatic testicular cells?
      3. Pattern of expression: How do the authors explain that AGO3 disappears at the diplotene stage and reappears in spermatids?
      4. It would be useful to show the timing of expression of AGO 1 to 4 throughout spermatogenesis in the first paragraph of the article. Maybe the authors could present data from fig2B earlier?
      5. Line 190: please modify the sentence "reveal no differences in cellular architecture of the seminiferous tubules when compared to wild-type males" to " reveal no gross differences..." since even without quantification of the different cell types it is visible that KO seminiferous tubules are different from WT tubules.
      6. TUNEL analysis: please stage the tubules to determine the stage(s) at which apoptosis is the most predominant.
      7. Figure S4B does not show an increase of cells at Pachytene stage but at Lepto/zygotene stage (as well as an increase of spermatogonia). Please comment this discrepancy with results shown in Fig2.
      8. Fig5H and 5I are not mentioned in the result section. Also, it would be useful to label them with "all chromosomes" and "XY" to differentiate them easily
      9. Line 530 "data provide further evidence for a functional association between AGO-dependent small RNAs and heterochromatin formation, maintenance and/or silencing." Please rephrase, the present article does not really show that AGO nuclear role depends on small RNAs.
      10. Line 1256: replace "cite here " by appropriate reference
      11. Please use SMARCA4 instead of BRG1 name as it is its official name.

      Figures:

      Figure 1: Are the pictures shown for Ago3-tagged and floxed from the same stages ? The leptotene stage in 1A looks like a zygotene, while some pachytene/diplotene stage pictures do not look alike.

      Figure 1D, please label the Y scale properly (testis weight related to body weight)

      FigS1: Please comment the presence of non-specific bands in the figure legend

      Fig 2E and F, please indicate on the figure (in addition to its legend), what are the X and Y axes respectively to facilitate its reading.

      2F: please use an easier abbreviation for Spermatocyte than Sp (which could spermatogonia, sperm etc..) such as Scyte I ? (same comment for Fig 3C)

      Overall, for all figures showing GSEA analyses, could the authors explain what a High positive NES and a High negative NES mean in the results section?

      Significance

      Ago proteins are known for their roles in post transcriptional gene regulation via small RNA mediated cleavage of mRNA, which takes places in the cytoplasm. Some Ago proteins have been shown to be also located in the nucleus suggesting other non-canonical roles. It is the case of Ago4 which has been shown to localize to the transcriptionally silenced sex chromosomes (called sex body) of the spermatocyte nucleus, where it contributes to regulate their silencing (Modzelewski et al 2012). Interestingly, Ago4 knockout leads to Ago3 upregulation, including on the sex body indicating that Ago3 and Ago4 are involved in the same nuclear process. In their manuscript, de las Mercedes Carro et al., investigate the consequences of loss of both Ago3 and Ago4 in the male germline by the production of a triple knockout of Ago1, 3 and 4 in the mouse. With this model, the authors describe the role of Ago3 and Ago4 during spermatogenesis and show that they are involved in sex chromosome gene repression in spermatocytes and in round spermatids, as well as in the control of autosomal meiotic gene expression. Triple KO males have impaired meiosis and spermiogenesis, with fewer and abnormal spermatozoa resulting in reduced fertility. Since Ago1 male germline expression is restricted to pre-meiotic germ cells, it is not expected to contribute to the meiotic and postmeiotic phenotypes observed in the triple KO. The strengths of the study are i) the thorough analyses of mRNA expression at the single cell level, and in purified spermatocytes and spermatids (bulk RNAseq), ii) the identification of novel nuclear partners of AGO3/4 relevant for their described nuclear role: ATF2, which they show to also co-localize with the sex body, and BRG1/SMARCA4, a SWI/SNF chromatin remodeler. The main limitation of the study is the lack of information in the method regarding the production of the triple KO, as well as some aspects of the transcriptome and motif analyses. It is also surprising to see that the triple KO does not recapitulate the miRNA deregulation observed in Ago4 KO. The characterization of a non-canonical role of AGO3/4 in male germ cells will certainly influence researchers of the field, and also interest a broader audience studying Argonaute proteins and gene regulation at transcriptional and posttranscriptional levels.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We would like to thank the reviewers for their comments, we see great value in the suggestions they made to strengthen our work. We are glad to see that they are in general positive about the manuscript. In the following, we include a point-by-point response to their comments, which are in general consistent with each other.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Sanchez-Cisneros and colleagues, examine how tracheal cell adhesion to the ECM underneath the epidermis helps shape the tracheal system. They show that if cell-ECM adhesion is perturbed the development of the tracheal system and the epidermis is disrupted. They also detect protrusions extending from the dorsal trunk cells towards the ECM. The work is novel, the figures are clear, and the questions are well addressed. However, I find that some of the claims are not completely supported by the data presented. I have some suggestions that will, I believe, clarify certain points.

      Major comments

      At the beginning of the results section as in the introduction the authors claim that "It is generally assumed that trunk displacement occurs due to tip cells pulling on the trunks so that they follow their path dorsally." This sentence is not referenced, and I do not know where it has been shown or proposed to be like this. In addition, the comparison with the ventral branches is also not referenced and the movie does not really show this. Forces generated by tracheal branch migration have been shown to drive intercalation (Caussinus E, Colombelli J, Affolter M. Tip-cell migration controls stalk-cell intercalation during Drosophila tracheal tube elongation. Curr Biol. 2008;18(22):1727-1734. doi:10.1016/j.cub.2008.10.062), but not dorsal trunk (DT) displacement.

      • *

      We agree that dorsal trunk displacement has not been discussed in previous works, just the fact that tip-cell migration influences stalk cell intercalation. We will rephrase this sentence, stating that dorsal trunk displacement has not been studied.

      However, to rule out the possibility that DT displacement and the phenotype observed in XXX is due to dorsal branch pulling forces, the authors should analyze what happens in the absence of dorsal branches (in condition of Dpp signalling inhibition as in punt mutants or Dad overexpression conditions).

      This is a great idea, and we thank the reviewer for suggesting this. We tried to achieve a similar goal by expressing a Dominant Negative FGFR (Breathless-DN) in the tracheal system, since its expression under btl-gal4 affects tip cell migration. However, the phenotype arises too late to have an effect in dorsal branch migration during the stages we were interested in analyzing. The alternative proposed by the reviewer should be more efficient, as blocking Dpp signalling prevents the formation of dorsal branches completely. We have just received flies carrying the UAS-Dad construct. We will express Dad under btl-gal4 and see how this affects dorsal trunk displacement.

      I am concerned about the TEM observations. The authors claim they can identify tracheal cells by their lumen (Fig. 2 C'). However, at stage 15, the tracheal lumen should be clearly identifiable, and the interluminal DT space should be wider relative to the size of the cells. In this case, there is nothing telling us that we are not looking at a dorsal branch or lateral trunk cell. Furthermore, at embryonic stage 15, the tracheal lumen is filled with a chitin filament, which is not visible in these micrographs. Also, there is quite a lot of tissue detachment and empty spaces between cells, which might be a sign of problems in sample fixing. Better images and more accurate identification of dorsal trunk cells is necessary to support the claim that "These experiments revealed a novel anatomical contact between the epidermis and tracheal trunks".

      The protocol that we use for TEM involves performing 1-μm sections that allow us to stage embryos and to identify the anatomical regions using light microscopy and then switch to ultra-thin sections for electron microscopy once we have found the right position within the sample. This approach also allows us to determine the integrity of the sample. We attach here a micrograph of the last section we analyzed before we decided to do the EM analysis. The asterisk (*) points to a region where the multicellular lumen of the trunk is visible. Due to its proximity to the posterior spiracles, we are confident this is the dorsal trunk and not the lateral trunk. We realize now, after comparing this image with an atlas of development (Campos-Ortega and Hartenstein, 2013), that the stage we chose to illustrate the interaction is a stage 14 embryo instead of the stage 15 we indicated in the manuscript. We will change the stage but given that dorsal closure has already started by stage 14, this does not affect our analysis. Still, we apologize for the mis-staging of the embryo.

      In the light-microscopy image, we have overlaid the EM section to the corresponding region of interest. We agree that the lumen should be thicker compared to the length of the cells, if the section would be cutting the trunk through its largest diameter. However, the protrusions we see do not emerge from the middle part of the trunk where the lumen is found but are seen towards the dorsal side of the trunk, where the lumen will no longer be visible in a longitudinal section as the ones we present. In the embryo shown in Figure 2A-C, our interpretation is that the section was done through a very shallow section of the lumen (represented below). We interpret this from the fact that we see abundant electron-dense areas which we think are adherens junctions from multiple cells. These junctions are visible in Figure 2C but are currently not labelled. We will add arrows to increase their visibility.

      Given that protruding cells lie at the base of dorsal branches, it would be expected that in some sections we would find the protrusions close to the dorsal branches. This is in fact what we show in the micrograph shown in Figure 2D, with a lower magnification overview image shown in Figure S2D. In this case, we see a cell in close proximity to the tendon cells on one side (Figure 2D), which is connected to a dorsal branch on the opposite side (shown in Figure S2D). This dorsal branch is clearly autocellular and chitin deposition is visible as expected for the developmental stage. Again, in Figure S2E we see an electron-dense patch near the lumen that corresponds to the adherens junctions that seal the lumen. We see that all this needs to be better explained in the manuscript, so we will elaborate on the descriptions, and incorporate the light microscopy micrograph to the supplemental figures. This should also aid with the anatomical descriptions requested by Reviewer #3. Nevertheless, we think these observations confirm that what we are describing are the contact points between the dorsal trunk and tendon cells.

      Timelapse imaging of the protrusions in DT cells is done with frames every 4 minutes (Video S3). This is not enough to properly show cellular protrusions and the images do not really show interaction with the epidermis. Video S4 has a better time resolution but it is very short and only shows the cut moment. Video S4, shows the cut, but the reported (and quantified recoil) is not clear. Nevertheless, the results are noteworthy and should be further analysed.

      We will acquire high temporal resolution time-lapse images using E-Cadherin::GFP and btl-gal4, UAS-PH::mCherry to show the behaviour of the protrusions on a short time scale.

      • *

      Provided these embryos survive, would it be possible to check if embryos after laser cutting will develop wavy DTs?

      We think it would be interesting to carry out this experiment, but the laser cut experiments were done under a collaborative visit and we would not be able to repeat it in a short-term period.

      What happens to the larvae under the genetic conditions presented in Fig.S3? Do they reach pupal stages? Do these animals reach adult stages?

      We have seen escapers out of these crosses, but we have not quantified the lethality of the experiment. We will analyse this and include it in the manuscript.

      The kayak phenotypes are very interesting and perhaps the authors could explore them more. As in inhibition of adhesion to the ECM, kay mutants display wavy dorsal trunks. Do they have defective adhesion? Fos being a transcription factor, this is a possibility. The authors should at least discuss the kay phenotypes more extensively and present a suitable hypothesis for the phenotype.

      We agree that the kayak experiments might bring more consequences than just preventing dorsal closure. We will complement this approach by blocking dorsal closure by other independent means. We will use pannier-gal4 (a lateral epidermis driver), engrailed-gal4 (a driver for epidermal posterior compartment), and 332-gal4 (an amnioserosa driver) to express dominant-negative Moesin. In our experience, this also delays dorsal closure and it should result in a similar tracheal phenotype as the one we see in kayak embryos.

      Minor comments

      Page 2 Line 9/10 The sentence "tracheal tubes branch and migrate over neighbouring tissues of different biochemical and mechanical properties to ventilate them." should be rewritten. Tracheal cells do not migrate over other tissues to ventilate them.

      We meant to say that tracheal cells migrate over other tissues at the same time as they branch and interconnect to allow gas exchange in their surroundings after tracheal morphogenesis is completed. Ventilation is used here as a synonym for gas exchange or breathing. We will rephrase this if the reviewer considers it confusing.

      Page 2 Line 24/25 The sentence "It has been generally assumed that trunks reach the dorsal side of the embryo because of the pulling forces of dorsal branch migration." needs to be backed up by a reference.

      As explained above, we will rephrase this sentence.

      Page 7 Line 32/23 In this sentence, the references are not related to dorsal closure "Similarly, the signals that regulate epidermal dorsal closure do not participate in tracheal development, or vice versa (Letizia et al., 2023; Reichman-Fried et al., 1994)."

      Our goal in this sentence was to explain that while JNK is required for proper epidermal dorsal closure, loss of JNK signaling in the trachea does not affect tracheal development (as shown by Letizia et al., 2023). At the same time, Reichman-Fried et al., 1994 described the phenotypes of loss of breathless (btl). We will remove this last reference as the work does not study the epidermis. We will rephrase the sentence as: “Similarly, the signals that regulate epidermal dorsal closure do not participate in tracheal development; namely, JNK signaling (Letizia et al., 2023).”

      Page 12 Line 1 "Muscles attach to epidermal tendon cells through a dense meshwork of ECM" this sentence must be referenced.

      We will add the corresponding references for this statement: (Fogerty et al., 1994; Prokop et al., 1998; Urbano et al., 2009). We will change “dense” for “specialized”.

      Fig. S1- Single channel images (A'-C' and A'-C') should be presented in grayscale.

      Fig. S4- Single channel images (A'-D' and A'-D') should be presented in grayscale.

      We will add the grayscale, single-channel images for these figures.

      Reviewer #1 (Significance (Required)):

      The findings shown in this manuscript shed light on the interactions and cooperation between two organs, the tracheal system and the epidermis. These interactions are mediated by cell-ECM contacts which are important for the correct morphogenesis of both systems. The strengths of the work lie on its novelty and live analysis of these interactions. However, its weaknesses are related to some claims not completely backed by the data, some technical issues regarding imaging and some over-interpreted conclusions.

      This basic research work will be of interest to a broad cell and developmental biology community as they provide a functional advance on the importance of cell-ECM interactions for the morphogenesis of a tubular organ. It is of specific interest to the specialized field of tubulogenesis and tracheal morphogenesis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In this paper, the authors explore the relationships between two Drosophila tissues - the epidermis and tracheal dorsal trunk (DT) - that get dorsally displaced during mid-late embryogenesis. The show a nice temporal correlation between the movements of the epithelia during dorsal closure and DT displacement. They also show a correlation between the movement of an endogenously tagged version of collagen and the DT, suggesting that the ECM may contribute to this coordinated movement. Through high magnification TEM, they show that tracheal cells make direct contact with the subset of epithelial cells, known as tendon cells, that also serve as muscle attachment sites. In between these contact sites, tracheae are separated from the epithelia by the muscles. Furthermore, the TEMs and confocal imaging of tracheal cells expressing a membrane marker at these contact sites show that the tracheal cells are extending filopodia toward the tendon cells. The authors then explore how a variety of perturbations to the ECM produced by the tendon and DT cells affect DT and epithelial movement. They find that expressing membrane-associated matrix metalloproteases (MMP1 or MMP2) in tendon cells as well as perturbations in integrin or integrin signaling components leads to delays in dorsal displacement as well as defective lengthening of the tracheal DT tubes. They find that defects in the association between the tracheal and epidermal ECM attachments affect dorsal displacement of the epidermis, disrupting dorsal closure.

      Major comments: I like the goals of this paper testing the idea that the ECM plays important roles in the coordination of tissue placement, and I think they have good evidence of that from this study. However, I disagree with the conclusions of the authors that disrupting contact between DT and the tendon cells has no effect on DT dorsal displacement. DT tracheal positioning is clearly delayed; the fact that it takes a lot longer indicates that the ECM does affect the process. It's just that there are likely backup systems in place - clearly not as good since the tracheal tubes end up being the wrong length.

      We agree with this view; in our deGradFP experiments we see a delayed DT displacement. We focused our analyses on the coordination with epidermal remodelling, which remained unaltered, but we in fact see a delayed progression in dorsal displacement of both tissues (Figure 5I-J). We will emphasize this in the corresponding section of the Results.

      It also seems important that the parts of the DT where the dorsal branches (DB) emanate are moving dorsally ahead of the intervening portions of the trachea. This suggests to me that the DB normally does contribute to DT dorsal displacement and that this activity may be what helps the DT eventually get into its final position. The authors should test whether the portions of the DT that contact the DB are under tension. If the DB migration is providing some dorsal pulling force on the DT, this may also contribute to the observed increases in DT length observed with the perturbations of the ECM between the tendon cells and the trachea - if tube lengthening is a consequence of the pulling forces that would be created by parts of the trachea moving dorsally ahead of the other parts. Here again, it would be good to test if the DT itself is under additional tension when the ECM is disrupted.

      • *

      We thank the reviewer for the suggested experiments. We agree with the fact that the dorsal branches should pull on the dorsal trunk and that this interaction should generate tension. Unfortunately, we are unable to test this with the experiments proposed by the reviewer, but we propose an alternative strategy to overcome this. We understand that the reviewer suggests we do laser cut experiments in dorsal branches to see if there is a recoil in the opposite direction of dorsal branch migration. We carried out our laser cut experiments using a 2-photon laser through a visit to the EMBL imaging facility, using funds from a collaborative grant. Funding a second visit would require us to apply for extra funding, which would delay the preparation of the experiments. We are aware of UV-laser setups within our university, however, UV-laser cuts would also affect the epidermis above the dorsal branches, which we think might contribute to recoil we would expect to see.

      Instead of doing laser cuts, we have designed an experiment based on the suggestion of reviewer #1 of blocking Dpp signaling (with UAS-Dad), which would prevent the formation of dorsal branches. We expect that in this experimental setup, the trunk will bend ventrally in response to thepulling forces of the ventral branches. We will also co-express UAS-Dad (to prevent dorsal branch formation) and UAS-Mmp2 (to ‘detach’ the dorsal trunk from the epidermis), and we would expect to at least partially rescue the wavy trunk phenotype.

      Minor comments: The authors need to do a much better job in the intro and in the discussion of citing the work of the people who made many of the original findings that are relevant to this study. Many citations are missing (especially in the introduction) or the authors cite their own review (which most people will not have read) for almost everything (especially in the discussion). This fails to give credit to decades of work by many other groups and makes it necessary for someone who would want to see the original work to first consult the review before they can find the appropriate reference. I know it saves space (and effort) but I think citing the original work is important.

      • *

      The reviewer is right; we apologize for falling into this practice. We will reference the original works wherever it is needed.

      Figure 7 is not a model. It is a cartoon depicting what they see with confocal and TEM images.

      We will change the figure; we will include our interpretations of the phenotypes we observed under different experimental manipulations.

      Reviewer #2 (Significance (Required)):

      Overall, this study is one of the first to focus on how the ECM affects coordination of tissue placement. The coordination of tracheal movement with that of the epidermis is very nicely documented here and the observation that the trachea make direct contact with the tendon cells/muscle attachment sites is quite convincing. It is less clear from the data how exactly the cells of the trachea and the ECM are affected by the different perturbations of the ECM. It seems like this could be better done with immunostaining of ECM proteins (collagen-GFP?), cell type markers, and super resolution confocal imaging with combinations of these markers. What happens right at the contact site between the tendon cell and the trachea with the perturbation? I think that at the level of analysis presented here, this study would be most appropriate for a specialized audience working in the ECM or fly embryo development field.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary The manuscript by Sanchez-Cisneros et al provides a detailed description of the cellular interactions between cells of the Drosophila embryonic trachea and nearby tendon and epidermal cells. The researchers use a combination of genetic experiments, light sheet style live imaging and transmission electron microscopy. The live imaging is particularly clear and detailed, and reveals protruding cells. The results overall suggest that interactions mediated through the ECM contribute to development of trachea and dorsal closure of epidermis. One new aspect is the existence of dorsal trunk filipodia that are under tension and may impact tracheal morphogenesis through required integrin/ECM interactions.

      Major comments: - Are the key conclusions convincing? Generally, the key conclusions are well supported by the data, and the movies are very impressive. Interactions between the cell types are clearly shown, as is the correlations in their development. However, some of the images are challenging to decipher for a non-expert in Drosophila trachea, especially the EM images, and some of the data is indirect or a bit weak.

      We thank the reviewer for their observations. As mentioned above in response to Reviewer #1, we will add an overview image of the embryo we processed for TEM that is presented in Figure 2.

      The data related to failure of dorsal closure affecting trachea relies on one homozygous allele of one gene (kayak), and so this is somewhat weak evidence. Even though kay is not detected in trachea, there could be secondary effects of the mutation or another lesion on the mutant chromosome. The segments look a bit uneven in the mutant examples.

      • *

      The reviewer is right; as we proposed before, we will complement the kayak experiments with independent approaches that will delay dorsal closure.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Some of the experiments have low n values, especially in imaging experiments, so these may be more preliminary, but they are in concordance with other data.

      The problem we face in our live-imaging experiments is related to the probability of finding the experimental embryos. In most of our experiments we combine double-tissue labelling plus the expression of genetic tools. This generally corresponds to a very small proportion of the progeny. We will aim to have at least 4 embryos per condition.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Higher n-values would substantiate the claims. To strengthen the argument that dorsal closure affects trachea morphogenesis mechanically, the authors might consider using of a combination of kay mutant alleles or other mutant genes in this pathway to provide stronger evidence. Or they could try a rescue experiment in epidermis and trachea separately for the kay mutants.

      We think our experiments delaying dorsal closure using the Gal4/UAS system and a variety of drivers should address the point of the possible indirect effects of kay in tracheal development.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Imaging data can take awhile to obtain, but the genetic experiments could be done in a couple of months, and the authors should be able to obtain any needed lines within a few weeks.

      The reviewer is correct, we will be able to plan our crosses for the proposed experiments within a couple of months.

      • Are the data and the methods presented in such a way that they can be reproduced? Generally, yes. For the deGrad experiments, it is not clear how the fluorescent intensity was normalized - was this against a reference marker?

      Briefly, we used signals from within the embryo as internal controls. In the case of en-gal4, we normalized the signal to the sections of the embryo where en is not expressed and therefore, beta-integrin levels should not be affected. In the case of btl-gal4, we normalized against the signal surrounding the trunks which should also not be affected by the deGradFP system. We will elaborate on these analyses in the methods section.

      Are the experiments adequately replicated and statistical analysis adequate? There are several experiments with low n values, so this could fall below statistical significance. For example, data shown in Fig 1G: n=3; Fig 4D n=4, n=3; Fig 6J n=4

      As mentioned above, we will increase our sample sizes.

      Minor comments: - Specific experimental issues that are easily addressable. To make the TEM images more easily interpreted, it would be helpful to provide a fluorescent image of all the relevant cell types (especially trachea, epidermis, muscle, and tendon cells, plus segmental boundaries) labelled accordingly, so that reader can correlate them more easily with the TEM images. They might also include a schematic of an embryo to show where the TEM field of view is.

      We believe this should be addressed by adding the light microscopy section of the embryo with the TEM image overlaid as illustrated above.

      It is hard to be confident that the EM images reflect the cells they claim and that the filopodia are in fact that, at least for people not used to looking at these types of images.

      As we explained in the response to Reviewer #1, we will elaborate on the descriptions of our TEM data. We think that adding the reference micrograph will aid with the interpretations of the TEM images.

      • Are prior studies referenced appropriately? yes
      • Are the text and figures clear and accurate? yes

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? The writing could be revised to be a bit clearer. Since the results of the experiments do not support the initial hypothesis, I found it a bit confusing as I read along. It may help to introduce an alterative hypothesis earlier to make the paper more logical and easy to follow. To be more specific, On page 3, the authors say they "show that dorsal trunk displacement is mechanically coupled to the remodelling of the epidermis" and also in the results comment that "With two opposing forces pulling the trunks other factors likely participate in their dorsal displacement, but so far these have remained unstudied." But that doesn't end up being what they find. The results from figure 5 and related interpretation on page 17 says "cell-ECM interactions are important for proper trunk morphology, but not for its displacement." So this was confusing to read and I would encourage the authors to frame the issues a bit differently in terms of tube morphogenesis.

      We see how this might be confusing. We will rewrite the introduction so that the work is easier to follow. To achieve this, we will state from the beginning the mechanisms we anticipate that regulate trunk displacement: 1) adhesion to the epidermis, 2) pulling forces from the dorsal branches and 3) a combination of both.

      Some minor presentation issues: What orientation is the cross-sectional view in figure 1C and movie 1?

      We will add a dotted box that indicates the region that we turned 90° to show the cross-section.

      On page 12, the authors say the "Electron micrographs also suggested high filopodial activity" but activity suggests dynamics that are not clear from EM. This could be re-phrased.

      As the reviewer indicates, we cannot conclude dynamics from a static image. We will replace “suggested high filopodial activity” with “revealed filopodial abundance”.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. The results of the paper are significant in that they characterize a mechanical interaction between two tissue types in development, which are linked by the extracellular matrix that sits between them. It is not clear to me that this describes a "novel mechanism for tissue coordination" as stated in the abstract, but it does characterize this type of interaction in a detailed cellular way.

      • Place the work in the context of the existing literature (provide references, where appropriate). For specialists, the work identifies a novel protruding cell type in the fly embryonic trachea, and provides beautiful and detailed imaging data on tracheal development. The "wavy" trachea phenotype is also uncommon and very interesting, so this result could be linked to the few papers that also describe this phenotype and be built up.

      • State what audience might be interested in and influenced by the reported findings. As it stands, this is most interesting for a specialized audience because it requires some understanding of the development of this system in particular. As it characterizes this to a new level of detail, it could be influential to those in the field. Some addition clarification of the results and re-framing could make the manuscript more clear and interesting for non-specialists.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. I work with Drosophila and have studied embryonic and adult cell types, although not trachea specifically. I am familiar with all the genetic techniques and imaging techniques used here.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Sanchez-Cisneros et al provides a detailed description of the cellular interactions between cells of the Drosophila embryonic trachea and nearby tendon and epidermal cells. The researchers use a combination of genetic experiments, light sheet style live imaging and transmission electron microscopy. The live imaging is particularly clear and detailed, and reveals protruding cells. The results overall suggest that interactions mediated through the ECM contribute to development of trachea and dorsal closure of epidermis. One new aspect is the existence of dorsal trunk filipodia that are under tension and may impact tracheal morphogenesis through required integrin/ECM interactions.

      Major comments:

      • Are the key conclusions convincing?

      Generally, the key conclusions are well supported by the data, and the movies are very impressive. Interactions between the cell types are clearly shown, as is the correlations in their development. However, some of the images are challenging to decipher for a non-expert in Drosophila trachea, especially the EM images, and some of the data is indirect or a bit weak.

      The data related to failure of dorsal closure affecting trachea relies on one homozygous allele of one gene (kayak), and so this is somewhat weak evidence. Even though kay is not detected in trachea, there could be secondary effects of the mutation or another lesion on the mutant chromosome. The segments look a bit uneven in the mutant examples. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Some of the experiments have low n values, especially in imaging experiments, so these may be more preliminary, but they are in concordance with other data. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Higher n-values would substantiate the claims. To strengthen the argument that dorsal closure affects trachea morphogenesis mechanically, the authors might consider using of a combination of kay mutant alleles or other mutant genes in this pathway to provide stronger evidence. Or they could try a rescue experiment in epidermis and trachea separately for the kay mutants. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Imaging data can take awhile to obtain, but the genetic experiments could be done in a couple of months, and the authors should be able to obtain any needed lines within a few weeks. - Are the data and the methods presented in such a way that they can be reproduced?

      Generally, yes. For the deGrad experiments, it is not clear how the fluorescent intensity was normalized - was this against a reference marker? - Are the experiments adequately replicated and statistical analysis adequate?

      There are several experiments with low n values, so this could fall below statistical significance. For example, data shown in Fig 1G: n=3; Fig 4D n=4, n=3; Fig 6J n=4

      Minor comments:

      • Specific experimental issues that are easily addressable.

      To make the TEM images more easily interpreted, it would be helpful to provide a fluorescent image of all the relevant cell types (especially trachea, epidermis, muscle, and tendon cells, plus segmental boundaries) labelled accordingly, so that reader can correlate them more easily with the TEM images. They might also include a schematic of an embryo to show where the TEM field of view is.

      It is hard to be confident that the EM images reflect the cells they claim and that the filopodia are in fact that, at least for people not used to looking at these types of images. - Are prior studies referenced appropriately?

      yes - Are the text and figures clear and accurate?

      yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The writing could be revised to be a bit clearer. Since the results of the experiments do not support the initial hypothesis, I found it a bit confusing as I read along. It may help to introduce an alterative hypothesis earlier to make the paper more logical and easy to follow. To be more specific, On page 3, the authors say they "show that dorsal trunk displacement is mechanically coupled to the remodelling of the epidermis" and also in the results comment that "With two opposing forces pulling the trunks other factors likely participate in their dorsal displacement, but so far these have remained unstudied." But that doesn't end up being what they find. The results from figure 5 and related interpretation on page 17 says "cell-ECM interactions are important for proper trunk morphology, but not for its displacement." So this was confusing to read and I would encourage the authors to frame the issues a bit differently in terms of tube morphogenesis.

      Some minor presentation issues:

      What orientation is the cross-sectional view in figure 1C and movie 1? On page 12, the authors say the "Electron micrographs also suggested high filopodial activity" but activity suggests dynamics that are not clear from EM. This could be re-phrased.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The results of the paper are significant in that they characterize a mechanical interaction between two tissue types in development, which are linked by the extracellular matrix that sits between them. It is not clear to me that this describes a "novel mechanism for tissue coordination" as stated in the abstract, but it does characterize this type of interaction in a detailed cellular way. - Place the work in the context of the existing literature (provide references, where appropriate).

      For specialists, the work identifies a novel protruding cell type in the fly embryonic trachea, and provides beautiful and detailed imaging data on tracheal development. The "wavy" trachea phenotype is also uncommon and very interesting, so this result could be linked to the few papers that also describe this phenotype and be built up. - State what audience might be interested in and influenced by the reported findings.

      As it stands, this is most interesting for a specialized audience because it requires some understanding of the development of this system in particular. As it characterizes this to a new level of detail, it could be influential to those in the field. Some addition clarification of the results and re-framing could make the manuscript more clear and interesting for non-specialists. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I work with Drosophila and have studied embryonic and adult cell types, although not trachea specifically. I am familiar with all the genetic techniques and imaging techniques used here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this paper, the authors explore the relationships between two Drosophila tissues - the epidermis and tracheal dorsal trunk (DT) - that get dorsally displaced during mid-late embryogenesis. The show a nice temporal correlation between the movements of the epithelia during dorsal closure and DT displacement. They also show a correlation between the movement of an endogenously tagged version of collagen and the DT, suggesting that the ECM may contribute to this coordinated movement. Through high magnification TEM, they show that tracheal cells make direct contact with the subset of epithelial cells, known as tendon cells, that also serve as muscle attachment sites. In between these contact sites, tracheae are separated from the epithelia by the muscles. Furthermore, the TEMs and confocal imaging of tracheal cells expressing a membrane marker at these contact sites show that the tracheal cells are extending filopodia toward the tendon cells. The authors then explore how a variety of perturbations to the ECM produced by the tendon and DT cells affect DT and epithelial movement. They find that expressing membrane-associated matrix metalloproteases (MMP1 or MMP2) in tendon cells as well as perturbations in integrin or integrin signaling components leads to delays in dorsal displacement as well as defective lengthening of the tracheal DT tubes. They find that defects in the association between the tracheal and epidermal ECM attachments affect dorsal displacement of the epidermis, disrupting dorsal closure.

      Major comments: I like the goals of this paper testing the idea that the ECM plays important roles in the coordination of tissue placement, and I think they have good evidence of that from this study. However, I disagree with the conclusions of the authors that disrupting contact between DT and the tendon cells has no effect on DT dorsal displacement. DT tracheal positioning is clearly delayed; the fact that it takes a lot longer indicates that the ECM does affect the process. It's just that there are likely backup systems in place - clearly not as good since the tracheal tubes end up being the wrong length. It also seems important that the parts of the DT where the dorsal branches (DB) emanate are moving dorsally ahead of the intervening portions of the trachea. This suggests to me that the DB normally does contribute to DT dorsal displacement and that this activity may be what helps the DT eventually get into its final position. The authors should test whether the portions of the DT that contact the DB are under tension. If the DB migration is providing some dorsal pulling force on the DT, this may also contribute to the observed increases in DT length observed with the perturbations of the ECM between the tendon cells and the trachea - if tube lengthening is a consequence of the pulling forces that would be created by parts of the trachea moving dorsally ahead of the other parts. Here again, it would be good to test if the DT itself is under additional tension when the ECM is disrupted.

      Minor comments: The authors need to do a much better job in the intro and in the discussion of citing the work of the people who made many of the original findings that are relevant to this study. Many citations are missing (especially in the introduction) or the authors cite their own review (which most people will not have read) for almost everything (especially in the discussion). This fails to give credit to decades of work by many other groups and makes it necessary for someone who would want to see the original work to first consult the review before they can find the appropriate reference. I know it saves space (and effort) but I think citing the original work is important.

      Figure 7 is not a model. It is a cartoon depicting what they see with confocal and TEM images.

      Significance

      Overall, this study is one of the first to focus on how the ECM affects coordination of tissue placement. The coordination of tracheal movement with that of the epidermis is very nicely documented here and the observation that the trachea make direct contact with the tendon cells/muscle attachment sites is quite convincing. It is less clear from the data how exactly the cells of the trachea and the ECM are affected by the different perturbations of the ECM. It seems like this could be better done with immunostaining of ECM proteins (collagen-GFP?), cell type markers, and super resolution confocal imaging with combinations of these markers. What happens right at the contact site between the tendon cell and the trachea with the perturbation? I think that at the level of analysis presented here, this study would be most appropriate for a specialized audience working in the ECM or fly embryo development field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Sanchez-Cisneros and colleagues, examine how tracheal cell adhesion to the ECM underneath the epidermis helps shape the tracheal system. They show that if cell-ECM adhesion is perturbed the development of the tracheal system and the epidermis is disrupted. They also detect protrusions extending from the dorsal trunk cells towards the ECM.

      The work is novel, the figures are clear, and the questions are well addressed. However, I find that some of the claims are not completely supported by the data presented. I have some suggestions that will, I believe, clarify certain points.

      Major comments

      At the beginning of the results section as in the introduction the authors claim that "It is generally assumed that trunk displacement occurs due to tip cells pulling on the trunks so that they follow their path dorsally." This sentence is not referenced, and I do not know where it has been shown or proposed to be like this. In addition, the comparison with the ventral branches is also not referenced and the movie does not really show this. Forces generated by tracheal branch migration have been shown to drive intercalation (Caussinus E, Colombelli J, Affolter M. Tip-cell migration controls stalk-cell intercalation during Drosophila tracheal tube elongation. Curr Biol. 2008;18(22):1727-1734. doi:10.1016/j.cub.2008.10.062), but not dorsal trunk (DT) displacement. However, to rule out the possibility that DT displacement and the phenotype observed in XXX is due to dorsal branch pulling forces, the authors should analyze what happens in the absence of dorsal branches (in condition of Dpp signalling inhibition as in punt mutants or Dad overexpression conditions).

      I am concerned about the TEM observations. The authors claim they can identify tracheal cells by their lumen (Fig. 2 C'). However, at stage 15, the tracheal lumen should be clearly identifiable, and the interluminal DT space should be wider relative to the size of the cells. In this case, there is nothing telling us that we are not looking at a dorsal branch or lateral trunk cell. Furthermore, at embryonic stage 15, the tracheal lumen is filled with a chitin filament, which is not visible in these micrographs. Also, there is quite a lot of tissue detachment and empty spaces between cells, which might be a sign of problems in sample fixing. Better images and more accurate identification of dorsal trunk cells is necessary to support the claim that "These experiments revealed a novel anatomical contact between the epidermis and tracheal trunks".

      Timelapse imaging of the protrusions in DT cells is done with frames every 4 minutes (Video S3). This is not enough to properly show cellular protrusions and the images do not really show interaction with the epidermis. Video S4 has a better time resolution but it is very short and only shows the cut moment. Video S4, shows the cut, but the reported (and quantified recoil) is not clear. Nevertheless, the results are noteworthy and should be further analysed. Provided these embryos survive, would it be possible to check if embryos after laser cutting will develop wavy DTs?

      What happens to the larvae under the genetic conditions presented in Fig.S3? Do they reach pupal stages? Do these animals reach adult stages?

      The kayak phenotypes are very interesting and perhaps the authors could explore them more. As in inhibition of adhesion to the ECM, kay mutants display wavy dorsal trunks. Do they have defective adhesion? Fos being a transcription factor, this is a possibility. The authors should at least discuss the kay phenotypes more extensively and present a suitable hypothesis for the phenotype.

      Minor comments

      Page 2 Line 9/10 The sentence "tracheal tubes branch and migrate over neighbouring tissues of different biochemical and mechanical properties to ventilate them." should be rewritten. Tracheal cells do not migrate over other tissues to ventilate them.

      Page 2 Line 24/25 The sentence "It has been generally assumed that trunks reach the dorsal side of the embryo because of the pulling forces of dorsal branch migration." needs to be backed up by a reference.

      Page 7 Line 32/23 In this sentence, the references are not related to dorsal closure "Similarly, the signals that regulate epidermal dorsal closure do not participate in tracheal development, or vice versa (Letizia et al., 2023; Reichman-Fried et al., 1994)."

      Page 12 Line 1 "Muscles attach to epidermal tendon cells through a dense meshwork of ECM" this sentence must be referenced.

      Fig. S1- Single channel images (A'-C' and A'-C') should be presented in grayscale.

      Fig. S4- Single channel images (A'-D' and A'-D') should be presented in grayscale.

      Significance

      The findings shown in this manuscript shed light on the interactions and cooperation between two organs, the tracheal system and the epidermis. These interactions are mediated by cell-ECM contacts which are important for the correct morphogenesis of both systems. The strengths of the work lie on its novelty and live analysis of these interactions. However, its weaknesses are related to some claims not completely backed by the data, some technical issues regarding imaging and some over-interpreted conclusions.

      This basic research work will be of interest to a broad cell and developmental biology community as they provide a functional advance on the importance of cell-ECM interactions for the morphogenesis of a tubular organ. It is of specific interest to the specialized field of tubulogenesis and tracheal morphogenesis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their careful and thoughtful read of our work.

      Reviewer 1 helpfully suggested that the audience might not know what fuzzy matching is. The manuscript contains the following explanation of fuzzy matching:

      "That subjects had almost no exact matches to SARS-CoV-2-specific IGH sequences did not exclude the possibility that they have sequences that are functionally similar to these reference sequences. The same possibility exists for TRBs. A standard method for finding similar sequences is using the Levenshtein (edit) distance. Sequences with a distance of less than or equal to a tolerance t are considered similar (for example, sequences that differ by no more than t=1 amino acid). This is known as “fuzzy matching” with tolerance t. (Note that exact matches are just fuzzy matches with tolerance 0.)"

      We now also add the word "approximate" in conjunction with earlier uses of the word "fuzzy."

      Reviewer 2 asked whether we "focused on potential contributions [to CDR3 length variations] based on germline gene usage, rather than directly observed contributions from the V and J segments within the CDR3 regions;" the answer is, the latter. Reviewer 2 also pointed out that it would be valuable to have HLA typing for a more comprehensive analysis. We wholeheartedly agree and have added a sentence to this effect in the discussion.

      Reviewer 3 had several specific comments. The first was regarding the overall implication of the study. There are several:

      • binding capacity is as predictive as, and more robust than, prior approaches. As we write: "We found that repertoires’ binding capacity to known SARS-CoV-2-specific CD4+ TRBs performs as well as the best hand-tuned approximate or “fuzzy” matching at predicting a protective level of NAbs, while also being more robust to repertoire sample size and not requiring hand-tuning."

      • the importance of looking for unexpected patterns, for example in non-productive joins as was done here, and for global small-scale perturbations that together result in unavoidable signals. As we write, "B- and T-cell adaptive responses to SARS-CoV-2 infection and vaccination are surprising, subtle, and diffuse," and "One open question is to what extent infection affects antibody and TCR repertoires as a whole vs. enriching specific clones within it. One can refer to these ends of the continuum of possible effects as “diffuse” vs. “precise.”"

      • caution against over-interpreting correlations with specific gene segments and. As we write: "With these caveats in mind, to our knowledge previous studies have identified 20 IGH V genes to be enriched in sequences produced during various immune responses to SARS-CoV-2.8–16 Given that human genomes encode 54 IGH V genes,17 collectively these studies implicate 37% of V genes in the response to this single viral exposure, indicating that the SARS-CoV-2 response is either quite broad within individuals, quite heterogeneous among individuals, or both."

      Each of these challenges prevailing approaches, understanding, and conclusions about patterns and signatures in repertoire sequence. We should hope this would be of some benefit.

      Reviewer 3 also asked what type of vaccines the participants received. We have now clarified that they all received an mRNA vaccine: 80% receiving Pfizer Comirnaty and the rest receiving Moderna Spikevax.

      We looked at anti-spike neutralizing antibodies because this is where the evidence for neutralization is strongest. It would have been great to have diagnostics for every protein as well as Fc function, but these were not available and therefore not possible to study.

      Reviewer 3 noted that we mention that it is impossible to know a priori what study size would be adequate to identify public sequences comprehensively in COVID-19 and asks if 251 individuals are enough. Assuming this is in reference to the size of our study, we would like to point out that this study does not claim to identify public sequences comprehensively. The rationale is more is better. The statistics tell the reader the extent to which to reject the null hypotheses put forth.

      Regarding comorbidities: we probably could perform an analysis on their impact in a future study. We thank the reviewer for this idea.

      Regarding timing: samples were collected from the vaccinee cohort 4 to 84 days (mean = 44.3 days, standard deviation = 15.3 days) after administration of the initial vaccine dose. Supplementary Figure S13 shows sampling times vs. NAb titers.

      We feel the length of the introduction is required to contextualize the implications and benefits of this study.

      We were unable to find the typos referred to but did run the manuscript through spelling and grammar checks again. We thank the Reviewer for the thoughtful attentiveness.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study by Braun et al. looked at B and T cell receptor repertoires in SARS-CoV-2 infected and vaccinated individuals in comparison to healthy controls to evaluate the impact on neutralizing antibody titers. The results are clearly presented. As expected, vaccination and infection have differential effects on the repertoires. The major finding is that vaccinated and infected individuals with more SARS-CoV-2-specific TRBs have higher neutralizing antibody titers.

      Major comments:

      • It is not clear what the overall implication of this study is? What are the benefits?

      • The authors did not specify what type of vaccines the participants received. If different vaccines were used, a comparison is necessary.

      • The authors only looked at anti-spike neutralizing antibodies while other SARS-CoV-2 proteins could have a different impact. It is also known that Fc-functions participate to protection and should be studied.

      • Line 55 the authors mentioned it's difficult to know what study size is enough to represent the population. What is the rationale for including 251 individuals? Is this enough to represent the population?

      • With so many comorbidities in the different cohorts, could the authors perform an analysis of the impact of such comorbidities?

      • The timing of the SARS-CoV-2 infection or vaccination of the individuals is missing. Were all samples collected at the same time? As we know neutralizing antibodies are waning over time, it is important to include this data.

      Minor comments:

      • Introduction is quite long and needs to be summarized better.

      • Include a table with all the abbreviations

      • Missing data in the individuals demographics table

      • Typo line 65

      • Typo in the methods section

      Significance

      This study highlights the effects of SARS-CoV-2 vaccination versus infection on antibody and T cell repertoires. Increasing evidence shows the differential effects of vaccination compared to infection. This substantial study provides new data to the field. However, for a virus for which many vaccines and treatments have already been developed and proven effective, the implications and benefits to the field should be more clearly explained.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      SUMMARY

      The paper focuses on identifying signatures in specific antibody and T-cell receptor repertoires related to SARS-CoV-2 infection and vaccination within a cohort of 250 patients. This study addresses the limitations of prior research, which often relied on small sample sizes, possibly leading to an underestimation of the variability in adaptive immune responses to SARS-CoV-2 infection and vaccination. Researchers analyzed functional features by sequencing IGH, TRD, and TRB, aiming to identify signatures that correlate antibody and T-cell responses with SARS-CoV-2 exposure.

      MAJOR COMMENTS

      • The study's major limitations, as acknowledged by the authors, primarily relate to the timing of sample collections during the pandemic and current methodological constraints, such as the challenge of reliably predicting receptor-antigen binding using a unified approach. Despite these challenges, the methodologies and statistical approaches employed were carefully designed to minimize potential biases. The tools and findings from this study could prove valuable for future research, particularly in this rapidly evolving field.

      • One intriguing finding was the observed pattern of IGH CDR3 lengths among vaccinated individuals. When investigating the contributions of germline genes to CDR3 regions as a potential explanation for length variations, did I understand correctly that authors focused on potential contributions based on germline gene usage, rather than directly observed contributions from the V and J segments within the CDR3 regions? This discovery highlights a potential impact of vaccination on V-D-J recombination machinery.

      • OPTIONAL For future studies, it may be valuable to have HLA typing for a more comprehensive analysis.

      Significance

      Overall, this manuscript uncovers previously undescribed patterns of immune responses to SARS-CoV-2 infection and vaccination. It is supported by a statistically robust methodological approach to effectively interpret the complex features resulting from exposure to a specific immunogen.

      The manuscript could be of broad interest for immunologists, clinicians and bioinformaticians.

      My area of expertise lies in the molecular biology of T cells, with a focus on applying multi-omics approaches (e.g., transcriptomics, epigenomics) to elucidate the molecular mechanisms governing T cell function and their role in the immune responses.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors sequenced B- and T-cell receptor repertoires (recombined immunoglobulin heavy-chain, TCRβ, and TCRδ genes) from the blood of infected, vaccinated, and control subjects (tested for negative). They focused on their hypervariable CDR3 regions and correlated this AIRRseq data with demographics and clinical findings from subject data. They investigated whether features of these repertoires could predict subjects' SARS-CoV-2 neutralizing antibody titer. They discovered that age affected NAb levels in vaccinated subjects but not infectees. Furthermore, they found that vaccination, but not infection, substantially affects non-productively recombined IGHs, and that repertoires' binding capacity to known SARS-CoV-2-specific CD4+ TCRβ performs as well as the best matching at predicting a protective level of NAbs. The overall conclusion from this dataset is that B- and T-cell adaptive responses to SARS-CoV-2 infection and vaccination are subtle and diffuse.

      The data support the claims and the conclusions and do not require additional analyses.

      The study is robust and large, with over 250 subjects, and involved sequencing IGH and TCRdelta as well as TCRbeta, to a depth of over 100000 cells/subject.

      Significance

      The study is very specific and sectorial, and I do not think it is easily accessible to a broad audience of immunologists; despite this, however, the authors have managed to explain quite understandably the results achieved, the challenges faced, and the conclusions obtained. If the aim is to inform the scientific community that deals with immunology, I suggest not assuming that the audience knows what fuzzy is. So, I would recommend explaining the statistical tools used in a few words.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1.1. It would be helpful if the authors could discuss whether there is any correlation between cryptic sites and the extent of experimental validation in the Phosphosite database (e.g. those that were only identified in one or a few MS experiments). It is difficult to determine stoichiometry of phosphorylation experimentally, but can any inference be made on the extent of phosphorylation of cryptic sites vs. more conventional sites located in IDRs or on the surface of globular domains?

      We thank the reviewer for this valuable suggestion. To investigate the extent of the experimental validation of phosphosites, we examined the number of supporting studies for each site reported in the PhosphoSitePlus database. Specifically, we summed the values of the LT_LIT (literature-based experiments), MS_LIT (mass spectrometry literature), and MS_CST (Cell Signaling Technology mass spectrometry) fields to count the number of independent studies supporting each phosphorylation site, either cryptic or non-cryptic. To visualize the results, we plotted the number of supporting references vs the relative solvent accessibility (RSA) distribution of phosphosites (Figure R1). The analysis revealed a direct correlation between the RSA of phosphosites and the number of studies supporting their phosphorylation. This observation may arise from an intrinsic difficulty in studying cryptic phosphosites due to their destabilizing effects on native proteins. Notably, no differences were observed in the number of supporting studies within cryptic phosphosites (Figure R1B). We have not mentioned these analyses in the new version of the manuscript. However, we would gladly add it if the editor or the reviewer advises accordingly.

      1.2. The authors note that a larger percentage of tyrosine phosphorylation sites are cryptic compared with serine/threonine sites. I assume that tyrosine itself is more highly enriched in the hydrophobic cores of proteins relative to serine or threonine, due to its bulky hydrophobic side chain. Is the increased proportion of cryptic tyrosine phosphorylation sites more, less, or the same as the proportion of tyrosine in hydrophobic cores relative to serine and threonine?

      We thank the reviewer for this insightful comment. As correctly noted, tyrosine residues tend to be enriched in the hydrophobic cores of proteins, as reflected by their generally lower relative solvent accessibility (RSA) values, regardless of phosphorylation state. This enrichment is likely due to the tyrosine side chain's bulky and partially hydrophobic nature. To address the reviewer's question, we compared the RSA distributions of phosphorylated tyrosine, serine, and threonine residues with that of the same residues non-phosphorylated in the human proteome (Figure R2). In order to statistically compare the two distributions, we employed the Mann-Whitney test. The large sample size inevitably yields very low p-values, even when the distributions differ mildly (pThr, pSer vs non-p Thr, Ser, p 1.3. Fig. 5D and E: I had some trouble interpreting these figures. Indicating where the native state is in the plots would be helpful (stated in text as lower right, but a rectangle on the plot would make this more obvious). The text discusses three metastable intermediates, but what is the fourth one shown on the figures (well A, close to the native state)? This could be more explicitly explained.

      We added the missing rectangles into the original Fig. 5D and E (see below Figure R3 and R4). The three metastable intermediates discussed in the original text reflect protein conformers in which the cryptic site is exposed to the solvent. Conversely, the fourth state, and the final native state, are conformations in which the site is already partially or fully cryptic. The observation that the masking of cryptic sites coincides with the latest folding steps allows us to hypothesize a mechanism by which cryptic phosphorylation may regulate protein folding. Following the reviewer's suggestion, we now specify more explicitly each conformation in the new version of the legends of the relative figures (text file with track changes, lines 950 and 1017).

      1.4. The fact that phosphomimetic mutations of cyptic sites in SMAD2 and CHK1 lead to lower expression levels and shorter half-lives is not surprising, given the expected disruption of the hydrophobic core by introduction of a charged residue. The results certainly show that if phosphorylated, these sites would decrease expression and half-life. With respect to half-life, however, if the authors are correct and cryptic sites are predominately phosphorylated co-translationally, one would expect that the half-life curves for the wt protein would not be a simple exponential, but would instead reflect two distinct populations: those that are phosphorylated during translation, and are almost immediately degraded, and those that escape phosphorylation and have the same half-life as the non-phosphorylatable mutant. Are the actual experimental results consistent with this two-population model? If not, this would be evidence that some of these cryptic sites can be exposed post-translation, either by thermal fluctuation or biological interactions.

      We thank the reviewer for this insightful point. The readout employed in our study (i.e., western blotting) measures the aggregate signal from the total protein population in the cell culture. It thus reflects average protein levels rather than the dynamics of individual molecules. As such, it is not well-suited to resolving coexisting populations with distinct half-lives. We agree that if phosphorylation of cryptic sites occurs strictly co-translationally, one might expect a biphasic decay curve. However, due to methodological constraints, our assay provides only a single exponential fit to the global turnover kinetics. While we cannot entirely exclude the possibility that cryptic sites may become exposed post-translationally (e.g., due to thermal fluctuations or interactions), our molecular dynamics simulations did not reveal such exposure events within the simulated timescales. Therefore, while the two-population model remains plausible in principle, our results are consistent with a co-translational phosphorylation and degradation model. Forthcoming experiments aimed at characterizing the phosphorylation of ribosome-associated nascent chains in the human proteome may further validate this conclusion.

      1.5. The authors make a point that cryptic phosphosites are more highly conserved than non-cryptic phosphosites, but it is not clear to me whether it is the side chain itself or its ability to be phosphorylated that is conserved. Supplemental Fig. 9, if I am interpreting it correctly, would suggest it is the residue itself and not its phosphorylation that is conserved. If so, wouldn't this suggest that phosphorylation of these cryptic sites is just an inevitable consequence of the conservation of serine, threonine, and tyrosine residues in hydrophobic core regions? If the authors have evidence that argues against this simple hypothesis, they should discuss it (e.g., cryptic phosphosites are more highly conserved in some cases than non-phosphorylated tyrosine, serine, and threonine residues that are not solvent accessible).

      We agree with the reviewer's interpretation. The higher conservation of cryptic phosphosites likely reflects the evolutionary constraint on hydrophobic core residues, which tend to be more conserved due to their role in structural stability. This conservation does not imply phosphorylation at those sites is functionally selected across species. Instead, when such residues are phosphorylated, as we observe in the human proteome, the effect is often destabilizing and associated with protein degradation. Our analysis does not establish that the phosphorylation of cryptic residues is conserved across species, only that the residues themselves are. We appreciate the reviewer's suggestion and now explicitly discuss this point in the revised manuscript to clarify the distinction between residue conservation and phosphorylation conservation (text file with track changes, line 618)

      1.6. Regarding the evolutionary conservation of cryptic sites, have the authors taken into consideration that tyrosine-specific kinases, phosphatases, and reader domains first appeared in the first metazoans, and are for the most part not seen in non-metazoan eukaryotes? I notice some of the proteomes used for the conservation analysis include plants and yeast, which lack most tyrosine phosphorylation.

      We thank the reviewer for this insightful comment. In response to the suggestion, we have recalculated the entropic conservation score by restricting the analysis to metazoan species. This analysis ensures that the evolutionary context more accurately reflects the presence and functional relevance of tyrosine-specific kinases, phosphatases, and reader domains. The comparison between the entropic score distribution calculated by including or not non-metazoan orthologues show statistically significant differences for both serine and threonine, and tyrosine. However, the large sample sizes translate inevitably into statistically significant p-values, even when the differences in mean are minimal and the standard deviations relatively small. To better assess the practical relevance of these differences, we calculated Cohen's d as a measure of effect size (Table R1). The coefficient helps assess the size and biological significance of a difference (>0.2 = small effect; >0.5 = medium effect; >0.8 = large effect). The analysis indicates a very modest deviation in entropic scores by including or not non-metazoan orthologues.

      1.7. I find the argument that phosphorylation of exposed core residues is part of normal protein quality control/proteostasis to be convincing. Can the authors provide any experimental evidence to support this model (for example, greater phosphorylation of cryptic sites under stress conditions)? I don't think these experiments are necessary, but would seem to be a logical next step and could be done quite easily through collaboration.

      We appreciate the reviewer's suggestion and fully agree that showing more significant phosphorylation of cryptic sites under stress conditions could represent an exciting future direction. We are conducting experiments on individual tumor suppressors such as p53 and PTEN, which harbor cryptic phosphosites, to test whether cellular stress conditions enhance phosphorylation at these positions. These studies assess whether such modifications contribute to altered protein stability or function in stress or disease contexts, particularly cancer. We plan to communicate these results in forthcoming publications and are currently open to collaborations to broaden this line of investigation.

      1.8. The authors note at the end of the discussion that targeting cryptic phosphosites might be a strategy to selectively degrade some proteins in cancer. Practically, how would this work? I can't think of how, but perhaps the authors can provide more specific suggestions.

      We thank the reviewer for raising this important point. One promising approach to therapeutically exploit cryptic phosphosites builds on the PPI-FIT principles (Pharmacological Protein Inactivation by Folding Intermediate Targeting). This strategy targets transient structural pockets appearing only in folding intermediates (Spagnolli et al., Comm Biology 2021). In this context, kinases that phosphorylate cryptic sites could be modulated, either inhibited or redirected, so that misfolded or oncogenic proteins are selectively marked for degradation. For example, selectively enhancing the phosphorylation of a cryptic site on an oncogenic protein could destabilize it and promote its degradation via the proteasome. Conversely, preventing phosphorylation at a cryptic site on a tumor suppressor (e.g., by inhibiting the specific kinase) could enhance protein stability and restore function. While this concept is still emerging, it offers an exciting therapeutic avenue that complements our findings. We added a paragraph addressing this point in the discussion section of the new version of the manuscript (text file with track changes, line 716).

      1.9. Introduction: "It involves the addition of a phosphate to an hydroxyl group found in the side chain of specific amino acids, typically serine, threonine or tyrosine residues." Of course serine, threonine, and tyrosine are the only standard amino acids with a simple hydroxyl group, so "typically" is not needed here.

      We have removed the word "typically" to reflect the accurate chemical specificity of phosphorylation events (text file with track changes, line 82).

      1.10. In my view this is an important study, bringing rigor and a broad proteomic perspective to a phenomenon that (to my knowledge) had not been carefully examined previously. In terms of the big picture, I am of two minds. On the one hand, showing that phosphorylation of hydrophobic core residues exposed during translation or the early stages of folding can regulate steady state levels of some proteins provides an intriguing new mechanism to control the complement of proteins in the cell, and is potentially an area of regulation in normal physiology or in disease. On the other hand, if this is just part of the normal proteostatic mechanisms (hydrophobic core residues exposed for too long consign the protein to degradation, before it can lead to aggregation and other problems), that is a little less interesting to me. I think future work to tease out whether this mechanism is actually regulated and used by the cell to transmit information will be key. But the first step is showing that the phenomenon is real and widespread, and in my view this preprint accomplishes that goal very well.

      We appreciate the reviewer's thoughtful summary and agree that distinguishing between passive proteostatic clearance and active regulatory function is essential. Toward this goal, we plan to carry out a phosphoproteomic analysis of ribosome-associated nascent chains. By mapping phosphorylation events during translation, we aim to validate our cryptic phosphosite dataset in a co-translational context and potentially identify novel regulatory modifications. This approach will also help us assess whether phosphorylation at cryptic sites is modulated context-dependently, thereby supporting a role in regulated protein expression rather than solely quality control.

      2.1. Evolutionary comparison whether cryptic and non-cryptic sites are differently conserved. Two distinct distributions for cryptic and non-cryptic phospho-sites are observed and Figure 6 shows two entropy distributions of cryptic v non-cryptic. Here it is unclear whether this is significant given the different distributions of the two types when non modified.

      We thank the reviewer for raising this critical point. Due to the large sample sizes in our analysis, statistical tests inevitably yield very low p-values, even when differences in mean are minimal and the standard deviations relatively small. To better assess the practical relevance of these differences, we calculated Cohen's d as a measure of effect size (Table R2). The comparison between cryptic and non-cryptic phosphosites yielded an effect size (Cohen's d = 0.4028) slightly lower than the one obtained for residues lying within protein cores or exposed on protein surfaces (Cohen's d = 0.5126), both indicating a modest but meaningful shift in entropic scores. In contrast, the comparisons between cryptic phosphosites and all core residues, as well as non-cryptic phosphosites and all surface residues, showed negligible effect sizes (Cohen's d = 0.0245 and 0.1326, respectively). These findings suggest that while statistical significance is achieved in all cases, only the difference between cryptic and non-cryptic phosphosites, or core and surface residues, reflects a meaningful biological signal. We have now included these data in the new version of the manuscript (text file with track changes, line 544).

      2.2. The identification of buried modification sites and what the biological meaning / implications are is a very interesting topic. However PTM distribution on proteins is very skewed (many papers have identified ____clusters, hot spots, structural dependencies etc...) and therefore comparing modified sites on different residues and in different protein regions and with non-modified residues has to be very stringently controlled.

      We fully agree with the reviewer that PTM distribution is non-random and influenced by structural and functional constraints, making comparative analyses challenging. To ensure rigor, we implemented a robust computational pipeline. Unlike other PTMs found almost exclusively on solvent-exposed residues, phosphorylation uniquely showed a distinct subset of sites with extremely low solvent accessibility. This pattern held even after applying stringent structural and dynamical filters. Specifically, we excluded low-confidence residues, small or unstructured domains, and sites that become exposed due to thermal fluctuations, using the SPECTRUS-based dynamic analysis. While we cannot entirely rule out context-specific exposure in fully folded proteins (e.g., during protein-protein interactions), we validated selected cryptic sites experimentally, and our findings were consistent with the computational predictions. We believe this multilayered approach strengthens the reliability of our classification and distinguishes cryptic phosphosites from the broader PTM landscape.

      2.3. Very basic question: How do you assessed the RSA value of the residues from the alphafold structure. If it is sequence based, then it is unclear what the alpha fold structure actually contributes in this step? Although I assume it is structure based, it is not well described, only a reference.

      We calculated the RSA values using the Shrake-Rupley algorithm implemented in the MDTraj Python library. This is a structure-based metric: for each PTM-carrying residue, we evaluated the absolute SASA from the 3D AlphaFold structure and normalized it against the theoretical maximum exposure for that residue in a Gly-X-Gly tripeptide, as defined in Tien et al. (2013). Thus, AlphaFold structures directly provide the atomic coordinates necessary for solvent accessibility estimation. We have now revised the Methods section to describe this process more explicitly (text file with track changes, lines 110 and 113).

      2.4. Given that the different residues S,T,Y but also K for glycosylations etc. have a very different baseline RSA distribution, the distributions of modified residues as such are not so informative. Are the distributions of residues with the alpha fold LOD 0.65 different between modified and non-modified?

      2.5. Same point: it is very clear that "tyrosine presenting a larger proportion of cryptic phosphor-sites", as they mainly are within folded domains to begin with. The pattern of phosphorylation and clustering is very different between the modified amino acid residue T,S,Y and needs consideration, given the large number of PTMs, a simple distribution is not sufficient to argue.

      As already discussed in point 1.2 above, and correctly noted also by this reviewer, tyrosine residues are generally enriched in the hydrophobic cores of proteins, which is reflected by their typically low RSA, regardless of phosphorylation status. This tendency likely arises from the bulky and partially hydrophobic nature of the tyrosine side chain. To address the reviewer's question, we compared the RSA distributions of phosphorylated tyrosine, serine, and threonine residues with those of all these amino acids in the human proteome. We found that phosphorylated residues consistently exhibit higher RSA values than the overall averages for their respective amino acids. This is expected, as phosphorylation within protein cores would likely be destabilizing. Indeed, the existence of low-RSA phosphorylated residues, represents a significant deviation from the intrinsic tendency of tyrosine, serine, and threonine residues and suggests that cryptic sites may become accessible only transiently along protein folding pathways.

      2.6. Figure 3E (proteins need names in the figure ): the cryptic site T222 (Chk1) is not in the quasi ridged domain, it is in a light color region. What is actually the SPECTRUS cutoff? The Pidc is only one sentence in the main text? It says fewer than 80% intradomain contacts in rigid domains i.e. >0.8, right, but is the domain rigid?

      We have revised the original figure in the new version of the manuscript to include protein names, and clarified the domain assignments. The cryptic phosphosite T222 in Chk1 lies within a quasi-rigid domain, as identified by SPECTRUS. The color of the image does not reflect any structural property but instead it is used to distinguish different quasi-rigid domains. In particular, black regions identify unstructured domains, whereas shadows from dark grey to white identify quasi rigid domains. We apologize for the lack of clarity. We have corrected the figure legend accordingly (text file with track changes, line 912).

      There is no cutoff in SPECTRUS' identification of quasi-rigid domain. Non quasi-rigid domains are simply regions of the protein that SPECTRUS cannot process properly. Meaning regions that, due to the large degree of intrinsic fluctuations, cannot be modelled as quasi-rigid.

      We also expanded the description of Pidc in the main text to clarify that it quantifies the proportion of intra-domain contacts made by the phosphosite's side chain, and that a cutoff of {greater than or equal to}0.8 was used to retain only residues well-integrated within rigid domains (text file with track changes, line 243).

      We hope these updates will resolve the ambiguities noted and more clearly define the criteria used in our filtering pipeline.

      2.7. The evolutionary comparison (which is not my core expertise), seems again like comparing different things. Why not comparing cryptic and non-cryptic sites in the same protein regions? Also p-Y are, evolutionarily speaking, very different to p-S and p-T. How is this possibly considered in one distribution. p-Y analysis needs to be separated from the p-T and p-S analyses here.

      We want to clarify that our evolutionary analyses compare residues at the aligned positions in orthologous proteins across multiple species. This approach ensures that each cryptic or non-cryptic phosphosites is assessed in its native structural and sequence context. Therefore, the comparison is not between different regions but evaluates the evolutionary conservation of specific sites across species, allowing for a direct and meaningful comparison of cryptic and non-cryptic phosphosites. In order to address the second point, we report below the entropic score distributions for serine/threonine and tyrosine, separately (Figure R5).

      2.8. Have the authors thought of randomization of their data to see whether the distributions are significant?

      We are unsure we fully understand what the referee means by randomizing the data in this case.

      However, according to the mathematical definition of entropic score, the limit case in which, within each orthogroup, the phosphorylated amino acid is replaced by a completely random residue yields an entropic score of 1. The opposite limit, in which all members of the orthogroups have the same amino acid in the position of the phosphorylated amino acid, yields an ES of 0. We have added a paragraph in the methods to stress this point (text file with track changes, line 354).

      2.9. Labeling in Suppl Figures is insufficient. E.g. In S6 what are the various WT, A and D numbering, are this independent stable transfections/clones? Figure S7 what is R? Thank you for pointing this out. We have now corrected the missing information in the revised version of the manuscript (text file with track changes, from line 992 to 1008)

      2.10. Whether or not findings are "impressive" should be up to the reader, please remove these attributes in the text.

      We agree with the reviewer's suggestion. We have removed subjective language such as "impressive" from the revised manuscript to ensure an objective and neutral tone, allowing readers to independently evaluate the significance of our findings (text file with track changes, line 454).

      3.1. Residues with pLDDT scores below 65 were excluded from the analysis. The high-confidence measure applies to individual residues, regardless of whether the domains they belong to are also predicted with high confidence. Identifying the number of domains containing PTMs with overall high-confidence predictions could provide better insights into the orientation of modified residues within domain structures. To assess the relationship between residue-specific confidence and domain stability, we can analyze the correlation between high-confidence modified residues and the overall prediction accuracy of their domains. This could be quantified using the average error scores of domain residues. Additionally, using the average pLDDT score would indicate how many individual residues were predicted with high local structural confidence. In contrast, the average PAE (Predicted Aligned Error) score would provide insights into how well each residue's position is predicted relative to others within the domain, reflecting overall domain structural confidence.

      Our analysis excluded residues with pLDDT scores below 65 to ensure high local confidence. While pLDDT provides residue-level structural confidence, assessing domain-wide prediction quality offers additional insights into modified residues' spatial organization and exposure. However, a domain-level interpretation is currently limited by the format of AlphaFold structural predictions. Specifically, AlphaFold does not provide Predicted Aligned Error (PAE) matrices for sequences split into overlapping fragments, a method used for proteins longer than 2,700 amino acids. These fragment predictions are only available in the downloadable AlphaFold proteome archives, not through the web interface, and lack the global alignment metrics (such as PAE) necessary for analyzing domain stability or inter-residue confidence within the domain context.

      3.2. "Approximately 65% of proteins with cryptic phosphosites contained only one or two such residues, while less than 10% had five or more sites (Supp. Figure 3)." To better interpret this trend, it would be useful to analyze the total number of cryptic PTMs on proteins part of this study, including all modification types-not just phosphorylation. This would help determine whether the observed pattern is specific to phosphorylation or if it extends to other post-translational modifications as well.

      To compare the occurrence of different cryptic PTMs, we extended our analysis to include all cryptic post-translational modifications annotated in PhosphoSitePlus, including phosphorylation, glycosylation, methylation, sumoylation, and ubiquitination. The approach allowed us to assess whether the observed distribution of cryptic phosphosites is unique or represents a more general feature of all cryptic PTMs. We observed extensive variation among the different PTMs in the proportion of proteins carrying 1, 2, or more of the same cryptic PTM (see Table R3). However, it must be noted that the relatively low number of cryptic PTMs, excluding phosphorylation, could make it difficult to determine whether these patterns reflect actual biological trends or are simply influenced by the sample size. We have not included these data in the new version of the manuscript, but we would be willing to add them if the editor or the reviewer advises us accordingly.

      3.3. For the validation of cryptic sites, selecting domains under 200 amino acids was mentioned. However, was there also a minimum length threshold applied, similar to the filtering criteria used for false positives (less than 40 ignored)?

      The 40-residue threshold was applied because protein domains that are too small cannot be reliably subdivided into quasi-rigid domains. Trying to run SPECTRUS on structures with fewer than 40 residues inevitably returns a warning, reflecting the intrinsic cooperative nature of quasi-rigid domains. In fact, entities composed of too few amino acids cannot properly arrange themselves into 3D structures and tend to be disordered. The same reasoning was applied when choosing the proteins to simulate. In particular, for the refolding simulations, we selected protein domains possessing the following properties:

      1. Shorter than 200 amino acids to limit the computational demands.
      2. Long enough to fold into an ordered 3-dimensional conformation reliably.
      3. Have an experimentally determined NMR or X-ray crystal structure 3.4. To test their hypothesis that phosphorylation affects protein expression, they selected candidates for serine and threonine but excluded tyrosine. What were the reasons for not including tyrosine-related PTMs in their analysis?

      Our experimental assays relied on phosphomimetic substitutions to mimic the effect of phosphorylation. While serine/threonine phosphorylation can be reasonably mimicked by E or D substitutions, there is no reliable single-residue mimic for phosphotyrosine. Indeed, E or D substitutions do not recapitulate the structural or electronic features of pTyr. Given these limitations, we excluded tyrosine phosphosites from experimental validation to avoid generating inconclusive or misleading data.

      3.5. Do we know that the regulatory role of S300 on PYST1 is associated with the dual specificity of the phosphatase, and is this why it was selected as a negative regulator? While the regulatory roles of the other analyzed phosphosites on SMAD and CHK1 are discussed, there is limited mention of the specific role of S300 on PYST1 within the scope of the study.

      S300 of PYST1 was selected not due to known regulatory relevance, but for technical convenience. PYST1 is a relatively small protein, facilitating computational simulations. We also had suitable reagents for detection (i.e., expression vector), and importantly, S300 was identified as a false-positive cryptic phosphosite removed by our dynamic filtering. It was a practical and structurally matched negative control for validating our computational pipeline.

      3.6. When comparing the entropic scores between cryptic and non-cryptic residues, the medians are 0.43 and 0.52, respectively. Although this difference is not very high, they do observe that cryptic residues have lower scores than non-cryptic ones. The distributions also show greater overlap (Figure 6). I'm wondering if any statistical testing would help assess how distinct these two groups really are.

      We thank the reviewer for the comment raised by reviewer #2, for which we provide an answer above. Briefly, given our large sample sizes, statistical tests often yield very low p-values even for minor differences. To assess the biological significance, we calculated Cohen's d (Table R2 above). The effect size between cryptic and non-cryptic phosphosites (d = 0.4028) was modest but meaningful, and slightly lower than between core and surface residues (d = 0.5126).

      3.7. Why did the authors choose to rely on AlphaFold data instead of examining PDB structures? I didn't see any explanation or rationale provided for preferring AlphaFold predictions over experimentally determined structures from the PDB.

      We appreciate the value of this comment. We focused on AlphaFold to maximize proteome-wide coverage. Indeed, although PDB structures offer experimentally validated conformations, their sparse and uneven proteome coverage (particularly for membrane proteins, low-abundance factors, and intrinsically disordered regions) precludes a truly global analysis. AlphaFold2 models, by contrast, deliver accurate, full-length structures for nearly the entire human proteome, enabling unbiased, large-scale mapping of cryptic phosphosites. Nonetheless, we performed the same analysis using high-resolution structures from the Protein Data Bank (PDB). The results were fully consistent with those based on AlphaFold predictions, indicating that our findings are consistent across the two databases (see Figure R6 below).

      3.8. Novelty - The concept that cryptic site modifications can dysregulate signaling in cancer and other diseases is known, but systematically categorizing PTM sites into cryptic and non-cryptic to generate hypotheses for a wide range of identified PTMs remains an underdeveloped approach. This study establishes a framework for classifying PTMs based on their structural accessibility, integrating AlphaFold predictions, molecular dynamics simulations, solvent accessibility analysis, and phylogenetic conservation metrics. This approach not only enhances our understanding of PTM-mediated regulatory mechanisms but also provides a foundation for exploring how cryptic modifications contribute to protein function, stability, and disease progression.

      We appreciate the reviewer's comment. To our knowledge, this is the first study to introduce and define "cryptic phosphosites" as a structurally distinct and functionally relevant subset of phosphorylation sites. While some individual cases of buried amino acids influencing cancer-related proteins have been reported, no previous study has systematically mapped, filtered, and analyzed these sites across the human proteome using integrated structural, dynamical, evolutionary, and experimental criteria.

      3.9. The study relies primarily on predicted protein structures (e.g., AlphaFold), without exploring experimentally derived structures, which could provide more accurate and physiologically relevant insights.

      We have addressed this point above (see reply to #3.7).

      3.10. While the research demonstrates the impact of cryptic PTMs on protein function, it would be valuable to also investigate non-cryptic sites from their annotated data. By examining the effects of modifications on these non-cryptic sites, the study could further validate the importance of the cryptic versus non-cryptic classifications and help clarify the functional relevance of both types of sites.

      We thank the referee for this thoughtful suggestion. We compared the proportion of cryptic or non-cryptic phosphosites associated with cancer- and disease-related mutations in each group from the COSMIC and PTMVar datasets. The percentage of phosphosites associated with the two repositories is essentially the same for cryptic and non-cryptic sites. This observation suggests that, despite their different structural and regulatory features, both site types occur similarly in disease contexts (see Table R4). We have included these data in the new version of the manuscript (text file with track changes, line 1067; and new Supp. Table 3).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The methods applied in this study were thoughtfully designed. The study's goals and the experiments performed to test several of their hypotheses were meticulously planned, ensuring that the research approach was robust and aligned with the objectives. The experimental design effectively addressed the key questions and provided reliable insights into the role of cryptic PTMs in protein function and disease mechanisms.

      This study investigates cryptic post-translational modification (PTM) sites in the human proteome and their role in protein folding and expression, with significant implications for disease mechanisms. This work seeks to bridge the gap between the abundance of identified PTM sites and their regulatory roles in signaling pathways. A key focus of the study is on intermediate protein conformations-states that exist between fully folded and unfolded structures to determine whether these transient states contribute to disease by affecting protein synthesis, activity, stability, and degradation. To classify PTM sites as cryptic or non-cryptic, the authors used AlphaFold-predicted structures and relative solvent accessibility (RSA) scores, excluding those within quasi-rigid domain interfaces. This enabled them to create a database of mapped PTM sites, distinguishing based on their cryptic nature. Their analysis revealed that most PTMs occur at solvent-exposed residues, but unexpectedly, one-third of tyrosine phosphosites were cryptic. To assess the impact of cryptic phosphorylation on protein expression, they performed molecular dynamics (MD) simulations on SMAD and CHK1 phosphsites, showing that cryptic sites can become transiently exposed during protein folding. Their computational simulations further supported the finding that this exposure enhancing the chances of being modified and ultimately a potential mechanism for destabilization of its structure (due to that modification) to trigger degradation in physiological conditions. Experimentally, western blotting and protein half-life measurements confirmed that phosphomimetic substitutions affected protein expression, supporting their hypothesis that cryptic phosphorylation can influence protein stability and function. From an evolutionary and functional perspective, their phylogenetic analysis using entropy scores indicates that cryptic sites are more conserved. They also show that the cryptic PTM sites identified in this study were found to be substituted by phosphomimetic mutations in tumor-suppressor proteins, leading to dysregulation of their function and suppression of downstream signaling essential for tumor cell death. This study provides a framework for mapping cryptic PTM sites and understanding their role within intermediate protein folding states. By linking cryptic PTMs to their effects on protein stability, signaling pathways, and disease progression, the findings highlight a potential regulatory mechanism through which cryptic modifications contribute to cancer and other diseases.

      Minor revisions

      1. Result 1 - Residues with pLDDT scores below 65 were excluded from the analysis. The high-confidence measure applies to individual residues, regardless of whether the domains they belong to are also predicted with high confidence. Identifying the number of domains containing PTMs with overall high-confidence predictions could provide better insights into the orientation of modified residues within domain structures. To assess the relationship between residue-specific confidence and domain stability, we can analyze the correlation between high-confidence modified residues and the overall prediction accuracy of their domains. This could be quantified using the average error scores of domain residues. Additionally, using the average pLDDT score would indicate how many individual residues were predicted with high local structural confidence. In contrast, the average PAE (Predicted Aligned Error) score would provide insights into how well each residue's position is predicted relative to others within the domain, reflecting overall domain structural confidence.
      2. "Approximately 65% of proteins with cryptic phosphosites contained only one or two such residues, while less than 10% had five or more sites (Supp. Figure 3)." To better interpret this trend, it would be useful to analyze the total number of cryptic PTMs on proteins part of this study, including all modification types-not just phosphorylation. This would help determine whether the observed pattern is specific to phosphorylation or if it extends to other post-translational modifications as well.
      3. For the validation of cryptic sites, selecting domains under 200 amino acids was mentioned. However, was there also a minimum length threshold applied, similar to the filtering criteria used for false positives (less than 40 ignored)?
      4. To test their hypothesis that phosphorylation affects protein expression, they selected candidates for serine and threonine but excluded tyrosine. What were the reasons for not including tyrosine-related PTMs in their analysis?
      5. Do we know that the regulatory role of S300 on PYST1 is associated with the dual specificity of the phosphatase, and is this why it was selected as a negative regulator? While the regulatory roles of the other analyzed phosphosites on SMAD and CHK1 are discussed, there is limited mention of the specific role of S300 on PYST1 within the scope of the study.
      6. When comparing the entropic scores between cryptic and non-cryptic residues, the medians are 0.43 and 0.52, respectively. Although this difference is not very high, they do observe that cryptic residues have lower scores than non-cryptic ones. The distributions also show greater overlap (Figure 6). I'm wondering if any statistical testing would help assess how distinct these two groups really are.
      7. Why did the authors choose to rely on AlphaFold data instead of examining PDB structures? I didn't see any explanation or rationale provided for preferring AlphaFold predictions over experimentally determined structures from the PDB.

      Significance

      Novelty - The concept that cryptic site modifications can dysregulate signaling in cancer and other diseases is known, but systematically categorizing PTM sites into cryptic and non-cryptic to generate hypotheses for a wide range of identified PTMs remains an underdeveloped approach. This study establishes a framework for classifying PTMs based on their structural accessibility, integrating AlphaFold predictions, molecular dynamics simulations, solvent accessibility analysis, and phylogenetic conservation metrics. This approach not only enhances our understanding of PTM-mediated regulatory mechanisms but also provides a foundation for exploring how cryptic modifications contribute to protein function, stability, and disease progression.

      Strengths - This study benefits from its use of multiple validation methods and false-positive filtering, resulting in a high-confidence dataset of annotated PTM sites. The combination of computational predictions and experimental analyses strengthens the validity of their findings. This integrative approach enhances the reliability of the data and provides a comprehensive understanding of cryptic versus non-cryptic PTMs in protein regulation.

      Limitations

      1. The study relies primarily on predicted protein structures (e.g., AlphaFold), without exploring experimentally derived structures, which could provide more accurate and physiologically relevant insights.
      2. While the research demonstrates the impact of cryptic PTMs on protein function, it would be valuable to also investigate non-cryptic sites from their annotated data. By examining the effects of modifications on these non-cryptic sites, the study could further validate the importance of the cryptic versus non-cryptic classifications and help clarify the functional relevance of both types of sites.

      Audience - The broader implications of this work extend to biomedical research, drug discovery, and therapeutic development. Researchers in cell signaling and systems biology who aim to understand which modification sites are crucial for evaluating the outcomes of signaling pathways can benefit from the insights generated by this study. It provides a pathway for identifying novel drug targets and enhances our understanding of disease mechanisms, particularly in cancer and other diseases. Additionally, this work encourages and motivates computational biologists to develop more efficient methods for capturing protein folding dynamics, enabling more accurate hypotheses regarding the effects of specific PTM sites and how they influence protein function and disease progression.

      My expertise lies primarily in structural biology, with a strong background in developing and utilizing bioinformatics and computational tools. While I currently have less hands-on experience with experimental techniques, my comprehensive understanding of experimental methodologies, combined with an awareness of the expected outcomes, has enabled me to effectively evaluate and interpret experimental results.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review on Gasparotto et al "Mapping Cryptic Phosphorylation Sites in the Human Proteome"

      Gasparotte et al assess the solvent accessibility of 87,138 post-translationally modified amino acids in the human proteome (from phosphosite plus). There initial observation is that a large fraction of modified sites are buried, a finding that is pronounced for phosphorylation but not other modifications. Their approach is using alpha fold 3D structures (0.65 cut off) and RSA prediction to get a set of buried sites. Further refinement includes the removing of low-confidence segments (such as loops, linkers, or short disordered regions) and to use SPECTRUS to identified quasi-rigid domains. The idea is that quasi rigid domains may not breathe and thus will be modified during the synthesis or folding.

      They generated a final dataset of 10,606 cryptic T, S and Y phosphor-sites in 5,496 proteins and state that: "These data indicate that ~5% of all known phospho-sites are cryptic. Impressively, the number translates to ~33% of phosphorylated proteins in the human proteome presenting at least one cryptic phospho-site." They focus on S417 of the SMAD2, T382 of Chk1, known to be associated with loss of function effects or proteasomal degradation and S300 of PYST1 negative control. They stably express these proteins as phospho-mimicry or alanine substitution in HEK293. Expression levels were reduced in the phosphor-D- mutant versions and upon cycloheximide treatment a reduction of the turnover time for the phospho-D CHK1 was observed. I think we are looking a large clonal difference in the supplemental figures.

      The examples are supported by MD simulations that suggest that cryptic phospho-sites can occur during the folding process and affect protein homeostasis by drastically increasing degradation rate and leading to rapid turnover; Essentially the phospho-versions show a solvent exposure. Evolutionary comparison whether cryptic and non-cryptic sites are differently conserved. Two distinct distributions for cryptic and non-cryptic phospho-sites are observed and Figure 6 shows two entropy distributions of cryptic v non-cryptic. Here it is unclear whether this is significant given the different distributions of the two types when non modified. Finally, overlay of the sites with cancer mutations lists 221 mutations in COSMIC associated with cryptic phosphosites that have been annotated as cancer-related and 138 mutations in PTMVar linked to cancer and other human pathologies. The identification of buried modification sites and what the biological meaning / implications are is a very interesting topic. However PTM distribution on proteins is very skewed (many papers have identified cluster, hot spots, structural dependencies etc...) and therefore comparing modified sites on different residues and in different protein regions and with non-modified residues has to be very stringently controlled.

      Points for consideration

      • Very basic question: How do you assessed the RSA value of the residues from the alphafold structure. If it is sequence based, then it is unclear what the alpha fold structure actually contributes in this step? Although I assume it is structure based, it is not well described, only a reference.
      • Given that the different residues S,T,Y but also K for glycosylations etc. have a very different baseline RSA distribution, the distributions of modified residues as such are not so informative. Are the distributions of residues with the alpha fold LOD 0.65 different between modified and non-modified?
      • Same point: it is very clear that "tyrosine presenting a larger proportion of cryptic phosphor-sites", as they mainly are within folded domains to begin with. The pattern of phosphorylation and clustering is very different between the modified amino acid residue T,S,Y and needs consideration, given the large number of PTMs, a simple distribution is not sufficient to argue.
      • Figure 3 E (proteins need names in the figure ): the cryptic site T222 (Chk1) is not in the quasi ridged domain, it is in a light color region. What is actually the SPECTRUS cutoff? The Pidc is only one sentence in the main text? It says fewer than 80% intradomain contacts in rigid domains i.e. >0.8, right, but is the domain rigid?
      • The evolutionary comparison (which is not my core expertise), seems again like comparing different things. Why not comparing cryptic and non-cryptic sites in the same protein regions? Also p-Y are, evolutionarily speaking, very different to p-S and p-T. How is this possibly considered in one distribution. p-Y analysis needs to be separated from the p-T and p-S analyses here.
      • Have the authors thought of randomization of their data to see whether the distributions are significant?
      • Labeling in Suppl Figures is insufficient. E.g. In S6 what are the various WT, A and D numbering, are this independent stable transfections/clones? Figure S7 what is R?
      • Whether or not findings are "impressive" should be up to the reader, please remove these attributes in the text.

      Significance

      The identification of buried modification sites and what the biological meaning / implications are is a very interesting topic. However PTM distribution on proteins is very skewed (many papers have identified cluster, hot spots, structural dependencies etc...) and therefore comparing modified sites on different residues and in different protein regions and with non-modified residues has to be very stringently controlled.

      main conclusion: 5% of all known phospho-sites are cryptic, at least one in 1/3 of structured protein regions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This preprint uses bioinformatic and experimental approaches to explore the prevalence and consequences of the phosphorylation of residues normally buried in the hydrophobic core of proteins. By cross-referencing validated human phosphosites (PhosphositePlus) with the predicted 3D structures of the human proteome (from the AlphaFold predicted protein structure database), they identified potential "cryptic" phosphosites not expected to be solvent-accessible. They further refined the list using a variety of tools and conclude that a significant percentage (roughly 25%) of known phosphosites in folded domains are cryptic. They go on to experimentally test the consequences of mutating several of these sites in known proteins either to non-phosphorylateable or phospho-mimetic residues, and found that the phosphomimetic mutants had lower half-lives and average expression levels than either the wt or non-phosphorylatable versions. Finally, they show that putative cryptic phosphorylation sites are more highly conserved that those that are surface-accessible, and that some of these cryptic sites are found in tumor suppressor genes and that phosphomimetic mutations at these sites can be found in tumor mutation databases.

      Major comments:

      Overall the experimental approach is relatively straightforward, and in general the authors' interpretation of the results seems reasonable. There were several areas where I believe additional analysis or discussion might clarify the interpretation, however.

      1. It would be helpful if the authors could discuss whether there is any correlation between cryptic sites and the extent of experimental validation in the Phosphosite database (e.g. those that were only identified in one or a few MS experiments). It is difficult to determine stoichiometry of phosphorylation experimentally, but can any inference be made on the extent of phosphorylation of cryptic sites vs. more conventional sites located in IDRs or on the surface of globular domains?
      2. The authors note that a larger percentage of tyrosine phosphorylation sites are cryptic compared with serine/threonine sites. I assume that tyrosine itself is more highly enriched in the hydrophobic cores of proteins relative to serine or threonine, due to its bulky hydrophobic side chain. Is the increased proportion of cryptic tyrosine phosphorylation sites more, less, or the same as the proportion of tyrosine in hydrophobic cores relative to serine and threonine?
      3. Fig. 5D and E: I had some trouble interpreting these figures. Indicating where the native state is in the plots would be helpful (stated in text as lower right, but a rectangle on the plot would make this more obvious). The text discusses three metastable intermediates, but what is the fourth one shown on the figures (well A, close to the native state)? This could be more explicitly explained.
      4. The fact that phosphomimetic mutations of cyptic sites in SMAD2 and CHK1 lead to lower expression levels and shorter half-lives is not surprising, given the expected disruption of the hydrophobic core by introduction of a charged residue. The results certainly show that if phosphorylated, these sites would decrease expression and half-life. With respect to half-life, however, if the authors are correct and cryptic sites are predominately phosphorylated co-translationally, one would expect that the half-life curves for the wt protein would not be a simple exponential, but would instead reflect two distinct populations: those that are phosphorylated during translation, and are almost immediately degraded, and those that escape phosphorylation and have the same half-life as the non-phosphorylatable mutant. Are the actual experimental results consistent with this two-population model? If not, this would be evidence that some of these cryptic sites can be exposed post-translation, either by thermal fluctuation or biological interactions.
      5. The authors make a point that cryptic phosphosites are more highly conserved than non-cryptic phosphosites, but it is not clear to me whether it is the side chain itself or its ability to be phosphorylated that is conserved. Supplemental Fig. 9, if I am interpreting it correctly, would suggest it is the residue itself and not its phosphorylation that is conserved. If so, wouldn't this suggest that phosphorylation of these cryptic sites is just an inevitable consequence of the conservation of serine, threonine, and tyrosine residues in hydrophobic core regions? If the authors have evidence that argues against this simple hypothesis, they should discuss it (e.g., cryptic phosphosites are more highly conserved in some cases than non-phosphorylated tyrosine, serine, and threonine residues that are not solvent accessible).
      6. Regarding the evolutionary conservation of cryptic sites, have the authors taken into consideration that tyrosine-specific kinases, phosphatases, and reader domains first appeared in the first metazoans, and are for the most part not seen in non-metazoan eukaryotes? I notice some of the proteomes used for the conservation analysis include plants and yeast, which lack most tyrosine phosphorylation.
      7. I find the argument that phosphorylation of exposed core residues is part of normal protein quality control/proteostasis to be convincing. Can the authors provide any experimental evidence to support this model (for example, greater phosphorylation of cryptic sites under stress conditions)? I don't think these experiments are necessary, but would seem to be a logical next step and could be done quite easily through collaboration.
      8. The authors note at the end of the discussion that targeting cryptic phosphosites might be a strategy to selectively degrade some proteins in cancer. Practically, how would this work? I can't think of how, but perhaps the authors can provide more specific suggestions.

      Minor comment:

      1. Introduction: "It involves the addition of a phosphate to an hydroxyl group found in the side chain of specific amino acids, typically serine, threonine or tyrosine residues." Of course serine, threonine, and tyrosine are the only standard amino acids with a simple hydroxyl group, so "typically" is not needed here.

      Significance

      In my view this is an important study, bringing rigor and a broad proteomic perspective to a phenomenon that (to my knowledge) had not been carefully examined previously. In terms of the big picture, I am of two minds. On the one hand, showing that phosphorylation of hydrophobic core residues exposed during translation or the early stages of folding can regulate steady state levels of some proteins provides an intriguing new mechanism to control the complement of proteins in the cell, and is potentially an area of regulation in normal physiology or in disease. On the other hand, if this is just part of the normal proteostatic mechanisms (hydrophobic core residues exposed for too long consign the protein to degradation, before it can lead to aggregation and other problems), that is a little less interesting to me. I think future work to tease out whether this mechanism is actually regulated and used by the cell to transmit information will be key. But the first step is showing that the phenomenon is real and widespread, and in my view this preprint accomplishes that goal very well.

      I come from a background of studying post-translational modifications in signaling, hence my hope that a regulatory role can be found. But even if cryptic phosphorylation turns out to be unregulated, the work provides important new insight into normal proteostasis, and therefore is a valuable contribution. I should note that I don't have extensive expertise in bioinformatic methods or the computational tools to study protein dynamics, but I assume other reviewers will critically evaluate these methods.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements

      We sincerely thank all three reviewers for their thoughtful and constructive feedback. Your comments were invaluable in improving the clarity and quality of our work.

      In this study, we revisit a previously overlooked lipophilic dye, demonstrating its utility for live-cell imaging that transport in a non-vesicular pathway and label autophagy related structures. Against the backdrop of increasing attention to membrane contact sites (MCSs), bridge-like lipid transfer proteins (BLTPs), and organelle biogenesis, we aim to propose the possibility of a reversible one-way phospholipid transfer activity that really takes place in living cells.

      As Reviewer #1 noted, recent cryo-EM studies (e.g., Oikawa et al.) have highlighted the importance of lipids in autophagosome formation. And there are some existed in vitro studies. However, we believe that we have to think about the consistence of simplified in vitro reconstitution and the complex real cellular environment. In addition, to our knowledge, no studies have directly tracked lipid flow dynamics over time in living cells. We believe our work contributes to this gap by combining three interesting technical approaches: (a) R18 as a lipid-tracing dye, (b) FRAP analysis on the isolation membrane, and (c) the use of Ape1 overexpression to stall autophagosome closure, enabling us to visualize reversible lipid flow in vivo. While these techniques may not appear "fancy," we hope they offer new insights that can inspire further exploration in lipid dynamics story in a real cellular environment.

      We appreciate Reviewer #2's comments on our high imaging quality and Reviewer #3's recognition of our approach as an elegant way to study lipid transfer. We have revised the manuscript accordingly and included additional explanations, figure clarifications, and planned experiments to address remaining concerns.

      As two key concerns were raised repeatedly by all reviewers, we would like to address them here:

      1. Regarding the concern that the evidence for reversible lipid transfer from the IM to the ER is not sufficiently strong:

      We are deeply grateful to Reviewer #2 for the insightful suggestion to compare the fluorescence recovery of the adjacent bleached ER to that of the ER-IM MCS, to exclude the possibility that recovery at the ER-IM MCS originates from nearby ER rather than from the IM. Following this suggestion, we performed a quantitative analysis using unbleached ER as a background. Interestingly, in every sample, the adjacent bleached ER consistently showed a significantly lower fluorescence recovery than the ER-IM MCS. We also used the IM as a background for normalization, the difference became even more pronounced, further supporting the idea that the adjacent ER could not be the source of the recovery signal at the ER-IM MCS. These findings strengthen our conclusion that phospholipid recovery at the MCS could be derived from the IM. The updated analysis and corresponding figure panels (Figure 5K, 5L, and 5M), along with the relevant text (lines 384-396), have been revised accordingly.

      Regarding the concern that the evidence for R18 transfer via Atg2 as a bridge-like lipid transfer protein is not sufficiently direct:

      In addition to the evidence presented in this manuscript, we have now cited our parallel study currently under revision (Sakai et al., bioRxiv 2025.05.24.655882v1), where we provide direct evidence that Atg2 indeed functions as a bridge-like lipid transfer protein, rather than a shuttle. Importantly, we also show in that study that R18 transfer requires the bridge-like structure of Atg2. This new reference has been cited in the revised manuscript, and relevant textual explanations have been added to provide further support.

      We hope that the revisions and our revision plan can address the reviewers key concerns. Please find our detailed point-by-point responses below.

      Response to the Reviewer ____#____1

      In their study, Hao and colleagues exploited the fluorescent fatty acid R18 to follow phospholipid (PL) transfer in vivo from the endoplasmic reticulum to the IM during autophagosome formation. Although the results are interesting, especially the retrograde transport of PLs, based on the provided data, additional control experiments are needed to firmly support the conclusions.

      We sincerely thank the reviewer for the positive assessment and agree that additional controls are necessary to support our conclusion. Detailed responses and corresponding revisions are provided below.

      An additional point is that the authors also study the internalization of R18 into cells and found a role of lipid flippases and oxysterol binding proteins. While this information could be useful for researchers using this dye, these analyses/findings have no specific connection with the topic of the manuscript, i.e. the PL transfer during autophagosome formation. Therefore, they must be removed.

      We thank the reviewer for the thoughtful comment. We understand the concern that the R18 internalization analysis may appear peripheral to the manuscript's main focus on phospholipid transfer during autophagosome formation. However, we respectfully believe that this section is critical for establishing the mechanistic basis as this study represents the first detailed in vivo application of R18 for tracing lipid dynamics. We believe it is interesting that R18 entry is not due to chemically passive diffusion or non-specific adsorption, but occurs through a biologically regulated, non-vesicular lipid transport pathway. This mechanistic context underpins the reliability of using R18 to monitor ER-to-IM lipid transport in the autophagy pathway.

      To improve clarity and coherence, we have added explanatory text in the Introduction and at the start of the Results section to explicitly link the internalization assay to the subsequent autophagy-related experiments (line 94-98, 185-187). We hope this helps guide the reader through the rationale and relevance of this part of the study.

      Major points:

      1) In general, the quality of the microscopy images are quite poor and this make it difficult to assert some of the authors' conclusions.

      We thank the reviewer for the feedback. To better address this concern, we would appreciate clarification regarding which specific images or figure panels were found to be of low quality. Overall, we believe the microscopy data presented are of sufficient resolution and clarity to support our main conclusions, as also noted by Reviewer #2 ("the high-quality images and FRAP experiments").

      We acknowledge that certain phenomena-such as occasional R18 labeling of the vacuole-were not clearly explained in the original manuscript. We have now included additional clarification in the results section and mentioned this limitation in the discussion (lines 170-171, 436-438), along with a note on ongoing experiments to further investigate this point.

      2) It would be important to perform some lipidomics analysis to determine in which PLs and other lipids or lipid intermediates R18 is incorporated. First, it will be important to know which the major PL species are are labelled under the conditions of the experiments done in this study. Second, the authors assume that all the R18 is exclusively incorporated into PLs and this is what they follow in their in vivo experiments. What about acyl-CoA, which has been shown to be a key player in the IM elongation (Graef lab, Cell)?

      We thank the reviewer for raising this point. However, we believe this is based on a misunderstanding of the chemical nature of R18. R18 is not a free fatty acid analog and cannot be incorporated into phospholipids or acyl-CoA via metabolic pathways. Due to its chemical structure-a bulky rhodamine headgroup attached to a long alkyl chain-it cannot undergo enzymatic conjugation or incorporation into membrane lipids. This is why we did not pursue lipidomics analysis. Instead, we focused on characterizing the biological behavior of R18 through a range of live-cell assays, including temperature and ATP dependency, involvement of flippases, OSBP proteins, and Atg2, all of which support a regulated, non-vesicular lipid transport pathway. Additionally, the AF3 structural model presented in this study is consistent with this interpretation, showing no evidence of R18 forming chemical bonds with phospholipids.

      3) Figure 1A and 1B. The authors conclude that Atg2 is involved in the lipid transfer since R18 does not localize to the PAS/ARS in the atg2KO cells. However, another possible explanation is that in those cells the IM is not formed and does not expand, and con sequetly R18 is present in low amounts not detectable by fluorescence microscopy. To support their conclusion, the authors must assess PAS-labelling with R18 in cells lacking another ATG gene in which Atg2 is still recruited to the PAS.

      We thank the reviewer for this important suggestion. As noted, the absence of R18 at the PAS in atg2Δ cells may reflect a lack of membrane formation rather than impaired lipid transfer. However, in support of our interpretation, our previous work (Hirata E, Ohya Y, Suzuki K, 2017) has shown that R18 accumulates at PAS-like structures in delipidation mutants, where the IM fails to expand but Atg2 is still recruited (please refer to the attached revision plan for further details). This suggests that the presence of Atg2, rather than the mere existence of a mature IM, contributes to R18 localization.

      To address this, we revised our statement to the more cautious: "R18 was undetectable at the PAS in atg2Δ cells," to avoid overinterpretation (lines 119-120). 4)

      4) Figure 2. As written, the paragraph this figure seems to indicate that flippases are directly involved in the translocation of R18 from the PM to the ER. As correctly indicated by the authors, flippases flip PLs, not fatty acids. Moreover, there are no PL synthesizing at the PM and thus probably R18 is not flipped upon incorporation into PL. As a result, the relevance of flippase in R18 internalization is probably indirect. This must be explained clearly to avoid confusion/misunderstandings.

      We thank the reviewer for this important clarification. We fully agree that flippases act on phospholipids, not fatty acids, and that R18 is not metabolically incorporated into phospholipids at the plasma membrane. However, our ongoing work (Rev. Figure 1) shows that R18 preferential labeling affinity for PS and PE in vivo (yeast phospholipid synthesis mutants), consistent with its flippase-dependent localization. Flippases are known to specifically flip PS and PE. While R18 itself is not enzymatically modified or incorporated into phospholipids, its membrane distribution may thus depend on the lipid environment and the activity of lipid-translocating proteins.

      Preliminary data supporting this observation are included in the "Supplementary Figures for reviewer reference only" and are not part of the public submission.

      5) A couple of manuscript has shown a (partial) role of Drs2 in autophagy. The authors must explain the discrepancy between their own results and what published, especially because they use the GFP-Atg8 processing assay, which is less sensitive than the Pho8delta60 used in the other studies.

      We thank the reviewer for raising this important point. We are aware of prior reports implicating Drs2 in autophagy and in fact discussed this work directly with the authors during the course of our experiments, who kindly provided helpful suggestions. While our GFP-Atg8 processing assay did not show significant defects upon Drs2 deletion, strain background differences may explain this discrepancy. We also appreciate the suggestion to use the Pho8Δ60 assay and plan to include it in future experiments.

      Additionally, authors should check whether the Atg2 and Atg18 proteins are present at the IM-ER membrane contact sites in the same rates after nutrient replenished than when cells are nitrogen-starved, since this complex would determine the lipid transfer dynamics at this membrane contact site.

      We thank the reviewer for the helpful suggestion. We plan to perform additional experiments to monitor Atg18 localization during the nutrient replenishment assay.

      6) Authors used a predicted Atg2 lipid-transfer mutant (Srinivasan et al, J Cel Biol, 2024), but not direct prove that this mutant is defective for this activity. As previously done for other Atg2/ATG2-related manuscripts (Osawa et al, Nat Struct Mol Biol, 2019; Valverde et al, J Cel Biol, 2019), this must be measure in vitro. Moreover, they do not show whether other known functions of Atg2 are unaffected when expressing this Atg2 mutant, e.g. formation of the IM-ER MCSs, Atg2 interaction with Atg9 and localization at the extremity of the IM...

      We thank the reviewer for this concern. The lipid-transfer-deficient Atg2 mutant used here is based on the same structural rationale as in our recent parallel study (Sakai et al., bioRxiv 2025; https://www.biorxiv.org/content/10.1101/2025.05.24.655882v1, currently under revision). In that study, we addressed whether Atg2 indeed functions as a bridge-like lipid transfer protein, and also used R18 to directly demonstrate the lipid transfer defect of this Atg2 mutant in vivo.

      We therefore believe that referencing this study provides mechanistic support for the use of this Atg2 mutant in the current manuscript. A citation and brief explanation have now been added to the revised text (line 315-316, 439-441). We also plan to perform the lipid transfer assay in vitro.

      7) The mNG-Atg8 signal is not recovered in the fluorescent recovery assays. Based on the observation that R18 signal comes back after photobleaching, authors suggest that the supply of Atg8 is not required for IM expansion. This idea is opposite to data where the levels of Atg8 and deconjugation of lipidated Atg8 determines the size of the forming autophagosomes (e.g., Xie et al, Mol Biol Cell, 2008; Nair et al, Autophagy, 2012). Similar results have also been obtained in mammalian cells (Lazarou and Mizushima results in cell lacking components of the two ubiquitin-like conjugation systems). This discrepancy requires an explanation.

      We thank the reviewer for pointing out this imprecise interpretation, and we sincerely apologize for the confusion it may have caused. We fully agree that Atg8 is essential for the expansion of the isolation membrane (IM), as supported by previous studies. In our FRAP data, mNG-Atg8 showed gradual recovery at the later timepoints, indicating that Atg8 can be replenished over time. The reason why R18 recovery appears much more rapid is likely due to the inherently fast lipid transfer activity of Atg2, the bridge-like lipid transport protein. In contrast, Atg8 signal recovery may have been delayed for two reasons: (1) slower recruitment kinetics to the IM, and (2) partial depletion of the available mNG-Atg8 protein pool due to photobleaching during the experiment.

      We have revised the relevant paragraph in the manuscript (line 326-330) to clarify these points and avoid potential misinterpretation.

      8) Although authors claim that there is a retrograde lipid transfer from the IM to the ER, based on the data, it quite difficult to extract these conclusions as they show a decrease in the lipid flow dynamics rather to an inversion of the lipid flow per se. Can the authors exclude that ER microdomains are formed at the ERES in contact with the IM, and consequently what they measure is a slow diffusion of R18-labeled lipid from other part of the ER to these ERES?

      We appreciate the reviewer's insightful comment. Indeed, we are also considering the possibility that lipid-enriched microdomains may form in the ER and contribute to complex lipid dynamics at contact sites. However, direct visualization of such domains in cells remains technically challenging, this remains one of the important directions we aim to pursue in future studies. While our current data do not allow us to definitively state that all recovered lipids originate from the IM, our FRAP experiments provide indirect yet strong support for the possibility that at least a substantial portion of the recovered lipid signal in the ER derives from the IM. Moreover, following Reviewer 2's major point No.4, we performed a direct comparison of R18 fluorescence recovery between the photobleached ER-IM MCS region and the adjacent bleachedER region (Figure 5K and 5M). Interestingly, each sample consistently showed lower fluorescence recovery in the adjacent bleached ER near the ER-IM MCS (mean = 0.20), compared to the ER-IM MCS region (mean = 0.28). To further validate this observation, we also used the IM as a background reference for normalization. This analysis revealed a more significant difference, with the adjacent bleached ER near the ER-IM MCS showing a lower recovery (mean = 0.47) than the ER-IM MCS (mean = 0.80).

      As the Reviewer2 pointed out, these results support our reversible lipid transfer model by demonstrating that fluorescence recovery at the ER-IM MCS is due to the signal coming from the IM, rather than from the adjacent bleached ER, which recovers more slowly and less efficiently. We have incorporated this new analysis into Figure 5, and accordingly revised the figure legend and main text (lines 384-396).

      9) The retrograde PL transfer is studied in cells overexpressing Ape1, in which IM elongation is stalled. This is a non-physiological experimental setup and consequently it is unclear whether what observed applies to normal IM/autophagosomes. This event should be shown to occur in WT cells as well.

      We thank the reviewer for this point. Indeed, it remains technically difficult to visualize lipid flow during normal IM expansion in vivo, as this process is rapid and transient. And to date, there are no reports directly addressing lipid flow in this process.

      But the Ape1 overexpression system provides a strategic advantage by temporally extending the IM elongation phase and spatially enlarging the IM, thus offering a unique opportunity to capture membrane behavior that would otherwise be transient and difficult to resolve. Importantly, this system arrests autophagosome closure, which we leveraged to investigate the potential reversibility of phospholipid transfer in a controlled and prolonged context. Without this system, it would be exceedingly difficult for reaserchers to examine the lipid flow directionality in living cells.

      Furthermore, the use of Ape1 overexpression has been widely employed in previous high-impact autophagy studies. We emphasize that our aim is to understand Atg2-mediated lipid transfer, and in this context, the Ape1 system provides a valuable and informative tool without compromising the validity of our conclusions.

      10) From the images provided, it appears that R18 also labels the vacuole. The vacuole form MCSs with the IM. Can the author exclude a passage of R18 from the vacuole to the IM?

      We thank the reviewer for the insightful comment. Our data suggest that R18 traffics from the plasma membrane to the ER, then to autophagy-related structures. Actually, following that, as we kown, autophagosomes will eventually reaches and fused with the vacuole. This explains the occasional weak R18 signals at the vacuole membrane, particularly in late-stage cells. We have revised the figure and clarified this point in the text to avoid oversimplification of R18 localization (lines 169-171, 426-428)

      Here we also added the results of our onging work (in preparation). R18 tends to accumulate in a dot-like compartment after prolonged rapamycin treatment and incubation (Rev. Figure 2). And the vacuolar labeling of R18 correlates with the degradation status of autophagosomes, rather than reverse lipid transport from the vacuole to the IM (Rev. Figure 2). Taken together, we believe that R18 transport from the vacuole back to the IM is unlikely.

      Preliminary data supporting this response are included in the "Supplementary Figures for reviewer reference only" and are not part of the public submission.

      Minor points:

      1) L66. One report has indicated that Vps13 may also play a role in the transfer of lipids from the ER to the IM (Graef lab, J. Cell Biol).

      Thank you for pointing this out. Their excellent work also suggested that the inherent lipid transfer activity of Atg2 is required for IM expansion. We have revised the sentence (lines 67-68, 312-314) and included the appropriate citation at these two places.

      2) L70. It must be indicated that IM is also called phagophore.

      We have revised the sentence (line 70-71). Thank you for pointing this out.

      3) L74. It is mentioned "Additionally, a hydrophobic cavity in the N-terminal region of Atg2 directly tethers Atg2 to the ER, particularly the ER exit site (ERES), which is considered a key hub for autophagosome biogenesis", but there is no experimental evidence supporting that Atg2 is involved in the tethering with the ERES.

      Thank you for pointing this out. We have removed the N-terminal region part and revised the sentence accordingly (line 79-81) to avoid overstatement.

      4) L90. PAS must be listed between the ARS.

      We have revised the sentence (line 97-98). Thank you for pointing this out.

      5) Upon deletion of ATG39 and ATG40, there is a pronounced reduction of mNG-Atg8 labelled with R18. This would suggest that these two ER-phagy receptors are required for the PL transfer from the ER to the IM, which is not the case as autophagy is mildly affected by the absence of them (e.g., Zhang et al, Autophagy, 2020).

      We thank the reviewer for the important comment and agree that Atg39 and Atg40 are not required for phospholipid transfer from the ER to the IM. We have revised the text (lines 155-157). We appreciate if the reviewer could provide the DOI or PubMed ID for this paper.

      6) Authors referred that "no direct evidence has been found to confirm lipid transfer at the ER-IM MCS in living cells" (lines 282-283). However, a recent paper has shown that de novo-synthesized phosphatidylcholine is incorporated from the ER to the autophagosomes and autophagic bodies (Orii et al, J Cel Biol, 2021). This reference should be mentioned in the manuscript.

      Thank you for your insightful reminder. This paper beautifully demonstrated the importance of de novo-synthesized phosphatidylcholine in autophagy using electron microscopy. We have now included its citation and brief discussion in the revised manuscript (lines 74-76, 297-298). However, we respectfully note that direct observation of lipid transfer at the ER-IM MCS in living cells still remains unproven.

      7) In lines 252-253, the sentence "R18 transport from the PM to the ER was partially impaired in osh1Δ osh2Δ, osh6Δ osh7Δ, and oshΔ osh4-1 cells (Figure S3). These results suggest that Osh proteins participate in transferring R18 from the PM to the ER" does not recapitulate what is observed in Fig. S3. Moreover, the Emr lab has generate a tertadeletion mutant in which the PM-ER MCSs are abolished. The authors could examine this mutant.

      We thank the reviewer for this helpful comment and sincerely apologize for the lack of clarity in our original description. Our conclusion was primarily based on the partial PM accumulation of R18 observed in some osh mutant strains shown in Figure S3, which motivated us to further investigate this pathway using the OSW-1 inhibitor. We have revised the corresponding text to improve the logic and clarity of this section.

      We appreciate the recommendation of the tether∆ mutant. Our preliminary tests indicate that R18 still properly labels the ER in tether∆ cells, suggesting that its localization is not due to passive diffusion at membrane contact sites, but rather involves specific transport mechanisms. As this is an initial observation, we plan to confirm the result and include it in a future revision.

      Reviewer #1 (Significance (Required)):

      General assistent: Strength: potential new system to monitor lipid flow Limitations: Indirect evidences and in the case of the retrograde transport of phospholipids, it could be an artefact of the employed experimental approach. Advance: Little advances because something in part already shown in vitro. No new mechanisms uncovered. Audience: Autophagy and membrane contact site fields.

      We sincerely thank the reviewer for the overall evaluation. We agree that our current system offers indirect but promising evidence for lipid transfer events at ER-IM contact sites in vivo. While Atg2-mediated lipid transport has been proposed in vitro, our study adds value by (1) establishing a live-cell imaging way to monitor lipid flow in a non-vesicular transport pathway, (2) proposing a model of reversible one-way lipid transfer activity, and (3) addressing whether findings from simplified in vitro reconstitution accurately reflect the dynamics in the more complex real cellular environment.

      We recognize the limitations of our current approach and plan to include additional analyses to more cautiously interpret the observed retrograde movement. Although we do not claim to identify a new mechanism, we believe our work provides an interesting framework to inspire future efforts aimed at directly probing lipid flow at membrane contact sites in vivo.

      We also sincerely appreciate the reviewer's recognition of the potential value of this system for the autophagy and membrane contact site communities.

      Response to the Reviewer ____#2

      Non-vesicular lipid transfer plays an essential role in organelle biogenesis. Compared to vesicular lipid transfer, it is faster and more efficient to maintain proper lipid levels in organelles. In this study, Hao et al. introduced a high lipophilic dye octadecyl rhodamine B (R18), which specifically labels the ER structures and autophagy-related structures in yeast and mammalian cells. They characterised its distinct lipid entry into yeast cells via lipid flippase Neo1 and Drs2 on the plasma membrane, rather than through the endocytic pathway. They then demonstrated that R18 intracellular trafficking through plasma membrane to ER depends on "box-like" lipid transfer Osh proteins. They further looked into the "bridge-like" lipid transfer protein Atg2, using R18 as a lipid probe to track lipid transfer from ER to the isolation membrane (IM) during membrane expansion and reversible lipid transfer through IM to the ER-IM membrane contact sites (MCS) when autophagy is terminated by nutrient replenishment. The authors provide an interesting model of reversible directionality of Atg2 lipid transfer during autophagy induction and termination.

      We sincerely thank the reviewer for the thoughtful and constructive summary of our work. We are grateful for the recognition of the novelty of using R18 to visualize non-vesicular lipid transfer in vivo and for highlighting the conceptual contribution of our proposed model of reversible Atg2-mediated transport during autophagy.

      In response to the reviewer's valuable suggestions, we have revised key parts of the manuscript and prepared a detailed revision plan to address the specific concerns. We truly appreciate the reviewer's insights, which have been instrumental in improving the clarity of our study.

      Major points:

      1. Line 299-309: The FRAP assays were interesting and well performed. The authors photobleached R18 and Atg8 signal, and found R18 fluorescence recovery but not Atg8, which suggests lipid transfer occurs between ER and the IM and faster than Atg8 lipidation process during IM expansion. These results gave clear evidence that R18 can be transferred during IM expansion. The supply of Atg8 may not be not able to track within this time frame or the recovered amount of Atg8 may not be able to visualized due to the threshold limitation with confocal microcopy. This does not imply the supply of Atg8 to the IM is not required during IM expansion. This should be clarified.

      We thank the reviewer for this valuable comment and fully agree that Atg8 is essential for IM expansion. We apologize for any ambiguity that may have suggested otherwise.

      As pointed out, the lack of mNG-Atg8 recovery in our FRAP assay likely reflects the slower turnover of lipidated Atg8, limited observation time, and photobleaching of the existing protein pool. Notably, we observed a weak but gradual signal recovery at later time points, supporting this view. We have revised the relevant paragraph in the manuscript (line 326-330) to clarify these points and avoid potential misinterpretation.

      Please clarify how the length of the IM is measured and determined in Figure 4H and Figure 5D.

      We thank the reviewer for the vaulable comment. We have now clarified the method for quantifying IM length in the revised manuscript. Specifically, we modified the Statistical Analysis section of the Methods (line 642-643).

      Line 336-342: The description of the results should be clarified. Based on Figure 5H, the authors observed a significant decrease in the mNG-Atg8 signal during photobleaching of the R18 signal.

      We thank the reviewer for pointing out the ambiguity. We have now clarified the description in the revised manuscript. The sentence has been modified (line 360-362) as follows: "To determine whether nutrient replenishment terminates autophagy, we selectively photobleached the R18 signal and monitored the R18 (photobleached) and mNG-Atg8 (without photobleaching) signal following nutrient replenishment."

      The authors photobleached ER-IM MCS and the ER region (boxed region in Figure 5J) and quantified fluorescence recovery, normalized to the IM region and an ER control. The ER control was taken from the other cell. It would be helpful to compare and analyse the fluorescence recovery of R18 in the bleached ER region near the ER-IM MCS to that in the ER-IM MCS. This would help to confirm the ER-IM MCS fluorescence recovery is due to signal coming from the IM.

      We sincerely thank the reviewer for this insightful suggestion. We have now performed the suggested comparison. Interestingly, each sample consistently showed lower fluorescence recovery in the adjacent bleached ER near the ER-IM MCS (mean = 0.20), compared to the ER-IM MCS region (mean = 0.28). To further validate this observation, we also used the IM as a background reference for normalization. This analysis revealed a more significant difference, with the adjacent bleached ER near the ER-IM MCS showing a lower recovery (mean = 0.47) than the ER-IM MCS (mean = 0.80).

      As the reviewer pointed out, these results support our reversible lipid transfer model by demonstrating that fluorescence recovery at the ER-IM MCS is due to the signal coming from the IM, rather than from the adjacent bleached ER, which recovers more slowly and less efficiently. We have incorporated this new analysis into Figure 5, and accordingly revised the figure legend and main text (lines 384-396). Again, we appreciate this constructive and helpful suggestion.

      In figure 5K, the autophagic structure or IM labelled by R18 seems to be maintained when the mNG-Atg8 signal decreases or dissociates from the IM. Could the authors comment on that how they interpret the termination of the prolonged IM structure and IM shrinkage?

      We thank the reviewer for this insightful observation. Based on our live-cell imaging, we speculate that following the initial dissociation of Atg8, the IM membrane undergoes a relatively slow disassembly process, potentially retracting toward the ER-IM MCS, which often localizes near ER exit sites (ERES). This suggests that IM shrinkage may proceed via Atg8-independent mechanisms. Although the precise pathway remains unclear, we occasionally observed vesiculation events during this phase, supporting the idea that membrane remodeling continues even in the absence of Atg8. In response to this comment, we have revised our manuscript to reflect these interpretations (line 494-496).

      The author has shown that Atg2Δ and Atg2LT lipid transfer mutant impair R18 labelling of autophagic structures in Figure 4C. However, the evidence supporting that R18 fluorescence recovery at ER-IM MCS is mediated by reversible Atg2 lipid transfer is not direct. It would be helpful to clarify whether Atg2 stays on the enlarged autophagic membranes when the membrane has reached to its maximum length and no longer grows.

      We thank the reviewer for this important suggestion. As noted in our response to Reviewer 1 (Major Point 8-2), clarifying whether Atg2/Atg18 remains at the ER-IM contact sites after IM expansion is indeed important for supporting the reversible lipid transfer model. We plan to monitor the localization of Atg18 during the nutrient replenishment assay.

      Minor points:

      1. Figure 2A "Dpm-GFP" is missing. The experiment replicates in Figure 2M should be indicated.

      We thank the reviewer for pointing out these issues. The label for "Dpm-GFP" has been added in Figure 2A, and the number of experimental replicates for Figure 2M is now indicated in the figure legend.

      Figure S2, the magenta panel should be "R18".

      We thank the reviewer for catching this labeling error. We have corrected the magenta panel label in Figure S2 to "R18" in the revised version of the figure.

      Line 341-342: "Figure 5H and 5J" should be "Figure 5H and 5I"

      We thank the reviewer for pointing out this error. The citation has been corrected from "Figure 5H and 5J" to "Figure 5H and 5I" in the revised manuscript.

      Please describe how the lipid docking model of Atg2 is generated.

      We thank the reviewer for this question. We have added a description of the modeling approach in the Methods section of the revised manuscript (lines 640-646). We also added the configuration files of AlphaFold3 to the supplementary information.

      Reviewer #2 (Significance (Required)):

      Currently, lipid probes are emerging as powerful tools to understand membrane dynamics, integrity, and the lipid-mediated cellular functions. In this manuscript, the authors performed a detailed characterisation of octadecyl rhodamine B (R18) as a potential lipid probe, which specifically labels ER and autophagic membranes. They present high quality imaging data and performed FRAP experiments to monitor the membrane dynamics and investigate the lipid transfer directionality between the ER and autophagic structure. However, the evidence of Atg2-mediated reversible lipid transfer may not be direct and sufficient. The proposed reversible lipid transfer model is interesting and provides an explanation of lipid level regulation during autophagosome formation.

      We sincerely thank the reviewer for the positive assessment of our work and for acknowledging the potential of R18 as a lipid probe, as well as the quality of our imaging and FRAP experiments. We are particularly grateful that the reviewer found the proposed model of reversible lipid transfer both interesting and relevant to the broader question of lipid regulation during autophagosome formation.

      Regarding the reviewer's concern that the evidence for Atg2-mediated reversible lipid transfer may not be sufficiently direct, we agree this is a critical point. While technical limitations currently prevent direct visualization of lipid flow reversal at single-molecule resolution in vivo, we hope our revision plan strengthen the proposed model and better convey its biological relevance, while also acknowledging the current limitations and the need for further mechanistic work.

      Response to the ____Reviewer #3

      The authors address the question of how autophagic membrane seeds expand into autophagosomes. After nucleation, IMs expand in dependence of the bridge-like lipid transfer protein Atg2, which has been shown to tether the IM to the ER. Several studies have shown in vitro evidence for direct lipid transfer by Atg2 between tethered membranes, and previous evidence has shown that the hydrophobic groove of Atg2 implicated in lipid transfer is required for autophagosome biogenesis in vivo in yeast and mammalian cells.

      In this manuscript, the authors take advantage of the dye R18, which they show accumulates mainly in the ER after a few minutes. They show specifically that the import of R18 into cells and transfer to the ER depends on the activity of flippases in the plasma membrane and OSPB-related lipid transporter. Using different sets of FRAT experiments, the authors track the fluorescence recovery of R18 in the IM, the IM-ER membrane contact site and the neighboring ER. From these experiments the authors conclude that (a) R18 is transferred to IM from the ER when IMs expand and (b) can be transferred from IMs back to the ER when autophagy is deactivated.

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

      We sincerely thank the reviewer for this thoughtful and encouraging summary. We appreciate the recognition of our approach using R18 to visualize lipid dynamics at ER-IM contact sites, and agree that in vivo quantitative data are critically needed to advance our understanding of autophagic membrane expansion.

      We also fully agree with the reviewer that our current study provides indirect-but conceptually informative-support for Atg2-mediated reversible one way lipid transfer. While prior in vitro studies have demonstrated the lipid transfer capability of Atg2, our goal here was to develop a live-cell system that allows the dynamic tracking of lipid flow in vivo, and to explore the possibility of reversible transport during autophagy termination. We hope our story will offer unique insights for future studies aiming to directly probe lipid transfer mechanisms in live cells.

      Regarding the reviewer's concern about the lack of direct evidence that Atg2's lipid transfer activity underlies the observed R18 dynamics, we fully acknowledge this limitation. To address this point, we would like to cite our parallel study currently under revision (Sakai et al., bioRxiv 2025.05.24.655882v1), which provides additional mechanistic evidence linking R18 dynamics to the lipid transfer function of Atg2. Further details and planned revisions are described in the responses below.

      Major points:

      (1) The authors use R18in FRAP experiments to follow its transfer from the ER into IMs. However, whether this transfer is mediated by Atg2 via its inherent lipid transfer activity remains indirect. The only evidence that implicates Atg2 directly is the observation that a lipid transfer deficient Atg2 variant fails to support IM expansion and autophagosome biogenesis. A similar full-length Atg2 mutant has previously been shown to block autophagosome formation in Dabrowski et al. 2023 in yeast, which the authors do not cite or discuss, suggesting the inherent lipid transfer activity of Atg2 is required for IM expansion. However, aside from this experiment, the mechanisms underlying R18 transfer remain unclear and, while they likely depend on or are at least partially mediated by Atg2, they may involve alternative mechanisms including vesicle transport or continuous membrane contacts. Moreover, for the assays with stalled or dissolving IM, it is essential for the authors to test whether Atg2 is still associated with these IMs. It is quite possible that Atg2 dissociates from maximally expanded or dissolving IMs, which would make their interpretation of the data very unlikely. Thus, it will be critical to provide consistent evidence that lipid transfer from the IM to the ER is mediated by Atg2. Ideally, the authors would label IM with BFP-Atg8, R18, and Atg2-GFP and perform their in vivo analysis.

      We sincerely thank the reviewer for the critical comments and valuable suggestions. To further support the link between R18 transfer and Atg2, we would like to highlight two complementary findings. As noted in our response to Reviewer 1 (Major Point 3), R18 can still label the PAS even when Atg2 is recruited but IM expansion is impaired, suggesting that R18 trafficking occurs in an Atg2-dependent manner. In addition, in our parallel study (bioRxiv, 2025.05.24.655882v1), we demonstrated that Atg2 acts as a bridge-like lipid transfer protein. Notably, when we mutated the bridge-forming region of Atg2, R18 transport to the IM was also disrupted.

      We greatly appreciate the reviewer's reminder regarding the study by Dabrowski et al., 2023, which we have now cited and discussed in the revised manuscript (lines 66-68, 312-314). Their findings that the inherent lipid transfer activity of Atg2 is required for autophagosome formation in vivo strongly reinforce our model.

      Regarding the possibility of vesicle transport, we consider this contribution minimal based on R18's preferential labeling of continuous membranes and its divergence from FM4-64 staining. As for the role of continuous membrane contacts, as also mentioned in our response to Reviewer 1, our preliminary tests indicate that R18 still properly labels the ER in tether∆ cells, suggesting that its localization is not due to passive diffusion at membrane contact sites, but rather involves specific transport mechanisms. As this is an initial observation, we plan to confirm the result and include it in a future revision.

      We also thank the reviewer for the suggestion to monitor Atg2 localization at the dissolving IM. As similarly pointed out by two other reviewers, we plan to track Atg18 during the nutrient replenishment assay.

      Finally, we appreciate the idea of triple-labeling with BFP-Atg8, R18, and Atg2-GFP. While our preliminary attempts encountered technical difficulties such as abnormal BFP-Atg8 localization and severe bleaching during long-term imaging in yeast, we plan to optimize this approach in future experiments.

      (2) Given the ER forms contact sites with many organelles using bridge-like lipid transfer proteins, how do the authors explain the preferential accumulation of R18 in ARS and not in for example PM (Fmp27), mitochondria, endosomes or vacuole (Vps13)? Why should R18 specifically transferred by Atg2 and not or to a much lower rate by Fmp27 or Vps13?

      We sincerely thank the reviewer for raising this insightful question. Indeed, we have carefully considered this point. Our data indicate that R18 labeling of autophagy-related structures (ARS) depends on Atg2, as demonstrated in the present manuscript and supported by our parallel study currently under revision (bioRxiv, 2025.05.24.655882v1).

      We speculate that the preferential accumulation of R18 in ARS may arise from structural and contextual differences among bridge-like LTPs, such as Atg2, Vps13, and Fmp27. Although all are capable of mediating lipid transfer, these proteins differ in their membrane tethering modes, cargo specificity, and spatial regulation. For example, Atg2 localizes specifically to ER-IM contact sites during autophagosome formation, where membrane expansion requires rapid lipid supply. In contrast, Vps13 and Fmp27 may function at more stable or less dynamic contacts, where lipid turnover or probe accessibility is more limited. We have added a brief discussion of this point in the revised manuscript to reflect this important consideration (lines 439-444).

      (3) Does R18 label autophagic bodies after they are formed. Could the authors add R18 after autophagic bodies have formed in atg15 or pep4 cells?

      We thank the reviewer for this excellent suggestion. To address whether R18 can label autophagic bodies post-formation, we plan to perform additional experiments by adding R18 after autophagic bodies have accumulated in atg15Δ or pep4Δ cells. This will help clarify whether R18 incorporates into pre-formed autophagic bodies or requires earlier membrane dynamics for its labeling.

      (4) Since Neo1- or OSBP-defective cells do not transfer R18 from the PM to the ER or other membranes, the authors should include these strains as controls for ER-dependent R18 transfer to ARSs.

      We thank the reviewer for this insightful suggestion. To further validate the ER-dependency of R18 transfer to autophagy-related structures, we plan to include Neo1- and OSBP-deficient strains as additional controls.

      Comments:

      The authors neglect to mention or discuss important recent literature directly related to their study:

      Schutter et al., Cell (2020); Orii et al., JCB (2021); Polyansky et al., EMBOJ (2022); Dabrowski et al., JCB (2023); Shatz et al., Dev Cell (2024)

      We sincerely thank the reviewer for pointing out these important and highly relevant studies. We apologize for our oversight in not citing them earlier. Each of these works has provided valuable insights that are directly related to and have greatly informed our current study. We have now cited and discussed these references in appropriate sections of the revised manuscript.

      Figure 1A and B: The authors need to describe how these cells were stained with R18 in the figure legend or text to help the reader to understand how these experiments were performed. Figure legends need to indicate at which time point after rapamycin treatment cells were analyzed.

      Thank you for the helpful suggestion. We have now added the corresponding information to the figure legends to clarify the staining procedure and time points.

      The authors need to clarify whether mNG-Atg8 colocalization with R18 was included for dot- and ring-like structures for WT cells as shown separately in 1A but not in 1B.

      Thank you for the comment. The quantification in Figure 1B includes both dot- and ring-like structures of mNG-Atg8 colocalized with R18 in WT cells, as shown in Figure 1A. We have now clarified this point in the revised figure legend.

      Figure 1C: The figure legend needs to describe the conditions cells were treated with and when cells were analyzed after rapamycin treatment (presumably).

      Thank you for the helpful suggestion. We have now added the corresponding information to the figure legends.

      Figure 1C: The authors should combine atg15 and pep4 deletions with atg2 or atg7 as controls in which autophagic bodies are not formed.

      Thank you for the valuable suggestion. We plan to perform these experiments that combine atg15 and pep4 deletions with atg2 or atg7 as controls.

      Figure 1E and F: R18 stains more than just the ER in the cells shown. In addition to atg39 and atg40, authors should include atg11 to inhibit all forms of selective autophagy.

      Thank you very much for the insightful comment. We agree and plan to include the atg11Δ mutant to inhibit all forms of selective autophagy.

      Figure S2A and B: The figures are mislabeled. Instead of FM4-64 it should say R18. In addition to the ER, in several images it is obvious to see R18 staining the vacuole membrane (for example Figure 2A 30 degrees) and others. Thus, the strong thresholding in S2 may give the reader an oversimplified view on R18 localization. This needs to be corrected.

      Thank you very much for pointing this out. We have corrected the labeling error in Figure S2A and B. Regarding the observation that R18 occasionally labels the vacuole membrane, we agree with the reviewer's comment. Based on our data, we believe that this signal likely reflects autophagosomes that have reached and fused with the vacuole, as expected in the later stages of autophagy. We have clarified this point in the text to avoid oversimplification of R18 localization (lines 169-171, 426-428).

      Figure 1G and H: In 1G, there are number of R18-stained patches not co-labeled by GFP-ER. What are these patches and which organelles to they represent? In 1H, given the tight association of the ER (omegasome) with forming IMs, it is difficult to discern whether R18 labels surrounding ER membrane or the IM itself. This needs to be more closely analyzed. The authors need to quantify these data similar to the yeast data.

      Thank you for the suggestion. We plan to perform additional quantification and colocalization analysis to clarify the identity of R18-positive signals in 1G and 1H.

      Figure 4A-C: A full-length PLT-deficient variant of Atg2 has been analyzed by Dabrowski et al, JCB 2023 in vivo. This work needs to be cited and discussed. The analysis needs to include punctate Atg8 structures for WT cells to exclude effects due to expansion defects.

      Thank you for the suggestion. We have now cited and discussed the work by Dabrowski et al., JCB 2023 in the revised manuscript (lines 67-68, 312-314). In addition, we have included an analysis of punctate Atg8 structures in WT cells to address the concern regarding potential expansion defects.

      Figure 4F-H: To measure the size changes in IMs, the authors would need to perform these experiments without bleaching the mNG-Atg8 signals.

      We apologize for the lack of clarity. The method for measuring IM size has now been added to the revised manuscript. In Figure 4, we note that mNG-Atg8 fluorescence actually shows a slow recovery over time. This limited recovery likely reflects both the slower turnover of Atg8 and the fact that the pre-existing Atg8 pool at the IM was partially photobleached. We have now revised the main text to clarify this point and included additional explanation (line 326-330).

      Figure 5C: The authors need to indicate the bleached areas in the mNG-Atg8 image for easier orientation. It looks to me that the area that the authors mark as IM-ER MCS is really the IM in proximity to the ER. Thus, if lipid transfer to the IM has ceased, I would not expect recovery here. If the IM-ER MCS area includes IM and the ER to similar extent, I would expect exactly what the authors show: IM does not recover while ER quickly recovers. On average, we would observe reduced recovery as shown in 5D.

      Thank you for the helpful suggestion, and we apologize for the oversight during figure preparation. We have now clearly indicated the bleached areas in the merged image in Figure 5C for better orientation. Additionally, we have carefully re-examined the defined ER-IM MCS region and confirm that the quantified area indeed corresponds to the contact site between the ER and the IM. And double checked the measurements shown in the figure remain correct.

      Figure 5L: Since mNG-Atg8 signal homogenously disappears from the IM, it is meaningless to measure size. How do the authors measure the size of something they cannot detect?

      Thank you for pointing this out. We agree with the reviewer's comment and have removed the panel from the revised version accordingly.

      Figure 5K: The authors need to show the whole bleached area overtime for the reader to be able to see where the recovered R18 signal might be coming from. Currently, it is impossible to discern whether the signal comes from the IM or from slow recovery from neighboring ER.

      We appreciate this insightful comment. To address the concern and following the suggestion from Reviewer 2 (Major Point No.4), we have now revised the figure to include an additional measurement of fluorescence recovery in the adjacent bleached ER (Figure 5K and 5M) (lines 384-396). These results further support our reversible lipid transfer model by demonstrating that fluorescence recovery at the ER-IM MCS originates from the IM, rather than from the adjacent bleached ER, which shows slower and less efficient recovery.

      We have also added time-lapse videos to the supplementary information due to space limitations in the main figure.

      Reviewer #3 (Significance (Required)):

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

      We sincerely thank the reviewer for this encouraging and thoughtful comment. We appreciate the recognition that our live-cell approach using a lipophilic dye provides a valuable framework to visualize lipid dynamics during autophagosome biogenesis. As the reviewer pointed out, quantitative in vivo evidence is critically needed in this field, and we hope our study contributes meaningfully toward that goal.

      We also fully acknowledge the limitation. While our current data offer indirect evidence for Atg2-mediated lipid transfer, we would like to support this by our revision plan and also our parallel study (bioRxiv, 2025.05.24.655882v1) that shows Atg2 is indeed a bridge-like LTP and R18 transfer is lost in the bridge-structure defective strain. Together, we hope these can suggest that the lipid transfer activity of Atg2 underlies the observed R18 dynamics in vivo.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors address the question of how autophagic membrane seeds expand into autophagosomes. After nucleation, IMs expand in dependence of the bridge-like lipid transfer protein Atg2, which has been shown to tether the IM to the ER. Several studies have shown in vitro evidence for direct lipid transfer by Atg2 between tethered membranes, and previous evidence has shown that the hydrophobic groove of Atg2 implicated in lipid transfer is required for autophagosome biogenesis in vivo in yeast and mammalian cells.

      In this manuscript, the authors take advantage of the dye R18, which they show accumulates mainly in the ER after a few minutes. They show specifically that the import of R18 into cells and transfer to the ER depends on the activity of flippases in the plasma membrane and OSPB-related lipid transporter. Using different sets of FRAT experiments, the authors track the fluorescence recovery of R18 in the IM, the IM-ER membrane contact site and the neighboring ER. From these experiments the authors conclude that (a) R18 is transferred to IM from the ER when IMs expand and (b) can be transferred from IMs back to the ER when autophagy is deactivated.

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

      Major points:

      1. The authors use R18in FRAP experiments to follow its transfer from the ER into IMs. However, whether this transfer is mediated by Atg2 via its inherent lipid transfer activity remains indirect. The only evidence that implicates Atg2 directly is the observation that a lipid transfer deficient Atg2 variant fails to support IM expansion and autophagosome biogenesis. A similar full-length Atg2 mutant has previously been shown to block autophagosome formation in Dabrowski et al. 2023 in yeast, which the authors do not cite or discuss, suggesting the inherent lipid transfer activity of Atg2 is required for IM expansion. However, aside from this experiment, the mechanisms underlying R18 transfer remain unclear and, while they likely depend on or are at least partially mediated by Atg2, they may involve alternative mechanisms including vesicle transport or continuous membrane contacts. Moreover, for the assays with stalled or dissolving IM, it is essential for the authors to test whether Atg2 is still associated with these IMs. It is quite possible that Atg2 dissociates from maximally expanded or dissolving IMs, which would make their interpretation of the data very unlikely. Thus, it will be critical to provide consistent evidence that lipid transfer from the IM to the ER is mediated by Atg2. Ideally, the authors would label IM with BFP-Atg8, R18, and Atg2-GFP and perform their in vivo analysis.
      2. Given the ER forms contact sites with many organelles using bridge-like lipid transfer proteins, how do the authors explain the preferential accumulation of R18 in ARS and not in for example PM (Fmp27), mitochondria, endosomes or vacuole (Vps13)? Why should R18 specifically transferred by Atg2 and not or to a much lower rate by Fmp27 or Vps13?
      3. Does R18 label autophagic bodies after they are formed. Could the authors add R18 after autophagic bodies have formed in atg15 or pep4 cells?
      4. Since Neo1- or OSBP-defective cells do not transfer R18 from the PM to the ER or other membranes, the authors should include these strains as controls for ER-dependent R18 transfer to ARSs.

      Comments:

      The authors neglect to mention or discuss important recent literature directly related to their study:

      Schutter et al., Cell (2020); Orii et al., JCB (2021); Polyansky et al., EMBOJ (2022); Dabrowski et al., JCB (2023); Shatz et al., Dev Cell (2024)

      Figure 1A and B: The authors need to describe how these cells were stained with R18 in the figure legend or text to help the reader to understand how these experiments were performed. Figure legends need to indicate at which time point after rapamycin treatment cells were analyzed.

      The authors need to clarify whether mNG-Atg8 colocalization with R18 was included for dot- and ring-like structures for WT cells as shown separately in 1A but not in 1B.

      Figure 1C: The figure legend needs to describe the conditions cells were treated with and when cells were analyzed after rapamycin treatment (presumably).

      The authors should combine atg15 and pep4 deletions with atg2 or atg7 as controls in which autophagic bodies are not formed.

      Figure 1E and F: R18 stains more than just the ER in the cells shown. In addition to atg39 and atg40, authors should include atg11 to inhibit all forms of selective autophagy.

      Figure S2A and B: The figures are mislabeled. Instead of FM4-64 it should say R18. In addition to the ER, in several images it is obvious to see R18 staining the vacuole membrane (for example Figure 2A 30 degrees) and others. Thus, the strong thresholding in S2 may give the reader an oversimplified view on R18 localization. This needs to be corrected.

      Figure 1G and H: In 1G, there are number of R18-stained patches not co-labeled by GFP-ER. What are these patches and which organelles to they represent? In 1H, given the tight association of the ER (omegasome) with forming IMs, it is difficult to discern whether R18 labels surrounding ER membrane or the IM itself. This needs to be more closely analyzed. The authors need to quantify these data similar to the yeast data.

      Figure 4A-C: A full-length PLT-deficient variant of Atg2 has been analyzed by Dabrowski et al, JCB 2023 in vivo. This work needs to be cited and discussed. The analysis needs to include punctate Atg8 structures for WT cells to exclude effects due to expansion defects.

      Figure 4F-H: To measure the size changes in IMs, the authors would need to perform these experiments without bleaching the mNG-Atg8 signals.

      Figure 5C: The authors need to indicate the bleached areas in the mNG-Atg8 image for easier orientation. It looks to me that the area that the authors mark as IM-ER MCS is really the IM in proximity to the ER. Thus, if lipid transfer to the IM has ceased, I would not expect recovery here. If the IM-ER MCS area includes IM and the ER to similar extent, I would expect exactly what the authors show: IM does not recover while ER quickly recovers. On average, we would observe reduced recovery as shown in 5D.

      Figure 5L: Since mNG-Atg8 signal homogenously disappears from the IM, it is meaningless to measure size. How do the authors measure the size of something they cannot detect?

      Figure 5K: The authors need to show the whole bleached area overtime for the reader to be able to see where the recovered R18 signal might be coming from. Currently, it is impossible to discern whether the signal comes from the IM or from slow recovery from neighboring ER.

      Significance

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Non-vesicular lipid transfer plays an essential role in organelle biogenesis. Compared to vesicular lipid transfer, it is faster and more efficient to maintain proper lipid levels in organelles. In this study, Hao et al. introduced a high lipophilic dye octadecyl rhodamine B (R18), which specifically labels the ER structures and autophagy-related structures in yeast and mammalian cells. They characterised its distinct lipid entry into yeast cells via lipid flippase Neo1 and Drs2 on the plasma membrane, rather than through the endocytic pathway. They then demonstrated that R18 intracellular trafficking through plasma membrane to ER depends on "box-like" lipid transfer Osh proteins. They further looked into the "bridge-like" lipid transfer protein Atg2, using R18 as a lipid probe to track lipid transfer from ER to the isolation membrane (IM) during membrane expansion and reversible lipid transfer through IM to the ER-IM membrane contact sites (MCS) when autophagy is terminated by nutrient replenishment. The authors provide an interesting model of reversible directionality of Atg2 lipid transfer during autophagy induction and termination.

      Major points:

      1. Line 299-309: The FRAP assays were interesting and well performed. The authors photobleached R18 and Atg8 signal, and found R18 fluorescence recovery but not Atg8, which suggests lipid transfer occurs between ER and the IM and faster than Atg8 lipidation process during IM expansion. These results gave clear evidence that R18 can be transferred during IM expansion. The supply of Atg8 may not be not able to track within this time frame or the recovered amount of Atg8 may not be able to visualized due to the threshold limitation with confocal microcopy. This does not imply the supply of Atg8 to the IM is not required during IM expansion. This should be clarified.
      2. Please clarify how the length of the IM is measured and determined in Figure 4H and Figure 5D.
      3. Line 336-342: The description of the results should be clarified. Based on Figure 5H, the authors observed a significant decrease in the mNG-Atg8 signal during photobleaching of the R18 signal.
      4. The authors photobleached ER-IM MCS and the ER region (boxed region in Figure 5J) and quantified fluorescence recovery, normalized to the IM region and an ER control. The ER control was taken from the other cell. It would be helpful to compare and analyse the fluorescence recovery of R18 in the bleached ER region near the ER-IM MCS to that in the ER-IM MCS. This would help to confirm the ER-IM MCS fluorescence recovery is due to signal coming from the IM.
      5. In figure 5K, the autophagic structure or IM labelled by R18 seems to be maintained when the mNG-Atg8 signal decreases or dissociates from the IM. Could the authors comment on that how they interpret the termination of the prolonged IM structure and IM shrinkage?
      6. The author has shown that Atg2Δ and Atg2LT lipid transfer mutant impair R18 labelling of autophagic structures in Figure 4C. However, the evidence supporting that R18 fluorescence recovery at ER-IM MCS is mediated by reversible Atg2 lipid transfer is not direct. It would be helpful to clarify whether Atg2 stays on the enlarged autophagic membranes when the membrane has reached to its maximum length and no longer grows.

      Minor points:

      1. Figure 2A "Dpm-GFP" is missing. The experiment replicates in Figure 2M should be indicated.
      2. Figure S2, the magenta panel should be "R18".
      3. Line 341-342: "Figure 5H and 5J" should be "Figure 5H and 5I"
      4. Please describe how the lipid docking model of Atg2 is generated.

      Significance

      Currently, lipid probes are emerging as powerful tools to understand membrane dynamics, integrity, and the lipid-mediated cellular functions. In this manuscript, the authors performed a detailed characterisation of octadecyl rhodamine B (R18) as a potential lipid probe, which specifically labels ER and autophagic membranes. They present high quality imaging data and performed FRAP experiments to monitor the membrane dynamics and investigate the lipid transfer directionality between the ER and autophagic structure. However, the evidence of Atg2-mediated reversible lipid transfer may not be direct and sufficient. The proposed reversible lipid transfer model is interesting and provides an explanation of lipid level regulation during autophagosome formation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their study, Hao and colleagues exploited the fluorescent fatty acid R18 to follow phospholipid (PL) transfer in vivo from the endoplasmic reticulum to the IM during autophagosome formation. Although the results are interesting, especially the retrograde transport of PLs, based on the provided data, additional control experiments are needed to firmly support the conclusions. An additional point is that the authors also study the internalization of R18 into cells and found a role of lipid flippases and oxysterol binding proteins. While this information could be useful for researchers using this dye, these analyses/findings have no specific connection with the topic of the manuscript, i.e. the PL transfer during autophagosome formation. Therefore, they must be removed.

      Major points:

      1. In general, the quality of the microscopy images are quite poor and this make it difficult to assert some of the authors' conclusions.
      2. It would be important to perform some lipidomics analysis to determine in which PLs and other lipids or lipid intermediates R18 is incorporated. First, it will be important to know which the major PL species are are labelled under the conditions of the experiments done in this study. Second, the authors assume that all the R18 is exclusively incorporated into PLs and this is what they follow in their in vivo experiments. What about acyl-CoA, which has been shown to be a key player in the IM elongation (Graef lab, Cell)?
      3. Figure 1A and 1B. The authors conclude that Atg2 is involved in the lipid transfer since R18 does not localize to the PAS/ARS in the atg2KO cells. However, another possible explanation is that in those cells the IM is not formed and does not expand, and con sequetly R18 is present in low amounts not detectable by fluorescence microscopy. To support their conclusion, the authors must assess PAS-labelling with R18 in cells lacking another ATG gene in which Atg2 is still recruited to the PAS.
      4. Figure 2. As written, the paragraph this figure seems to indicate that flippases are directly involved in the translocation of R18 from the PM to the ER. As correctly indicated by the authors, flippases flip PLs, not fatty acids. Moreover, there are no PL synthesizing at the PM and thus probably R18 is not flipped upon incorporation into PL. As a result, the relevance of flippase in R18 internalization is probably indirect. This must be explained clearly to avoid confusion/misunderstandings.
      5. A couple of manuscript has shown a (partial) role of Drs2 in autophagy. The authors must explain the discrepancy between their own results and what published, especially because they use the GFP-Atg8 processing assay, which is less sensitive than the Pho8delta60 used in the other studies.
      6. Authors used a predicted Atg2 lipid-transfer mutant (Srinivasan et al, J Cel Biol, 2024), but not direct prove that this mutant is defective for this activity. As previously done for other Atg2/ATG2-related manuscripts (Osawa et al, Nat Struct Mol Biol, 2019; Valverde et al, J Cel Biol, 2019), this must be measure in vitro. Moreover, they do not show whether other known functions of Atg2 are unaffected when expressing this Atg2 mutant, e.g. formation of the IM-ER MCSs, Atg2 interaction with Atg9 and localization at the extremity of the IM...
      7. The mNG-Atg8 signal is not recovered in the fluorescent recovery assays. Based on the observation that R18 signal comes back after photobleaching, authors suggest that the supply of Atg8 is not required for IM expansion. This idea is opposite to data where the levels of Atg8 and deconjugation of lipidated Atg8 determines the size of the forming autophagosomes (e.g., Xie et al, Mol Biol Cell, 2008; Nair et al, Autophagy, 2012). Similar results have also been obtained in mammalian cells (Lazarou and Mizushima results in cell lacking components of the two ubiquitin-like conjugation systems). This discrepancy requires an explanation.
      8. Although authors claim that there is a retrograde lipid transfer from the IM to the ER, based on the data, it quite difficult to extract these conclusions as they show a decrease in the lipid flow dynamics rather to an inversion of the lipid flow per se. Can the authors exclude that ER microdomains are formed at the ERES in contact with the IM, and consequently what they measure is a slow diffusion of R18-labeled lipid from other part of the ER to these ERES? Additionally, authors should check whether the Atg2 and Atg18 proteins are present at the IM-ER membrane contact sites in the same rates after nutrient replenished than when cells are nitrogen-starved, since this complex would determine the lipid transfer dynamics at this membrane contact site.
      9. The retrograde PL transfer is studied in cells overexpressing Ape1, in which IM elongation is stalled. This is a non-physiological experimental setup and consequently it is unclear whether what observed applies to normal IM/autophagosomes. This event should be shown to occur in WT cells as well.
      10. From the images provided, it appears that R18 also labels the vacuole. The vacuole form MCSs with the IM. Can the author exclude a passage of R18 from the vacuole to the IM?

      Minor points:

      1. L66. One report has indicated that Vps13 may also play a role in the transfer of lipids from the ER to the IM (Graef lab, J. Cell Biol).
      2. L70. It must be indicated that IM is also called phagophore.
      3. L74. It is mentioned "Additionally, a hydrophobic cavity in the N-terminal region of Atg2 directly tethers Atg2 to the ER, particularly the ER exit site (ERES), which is considered a key hub for autophagosome biogenesis", but there is no experimental evidence supporting that Atg2 is involved in the tethering with the ERES.
      4. L90. PAS must be listed between the ARS.
      5. Upon deletion of ATG39 and ATG40, there is a pronounced reduction of mNG-Atg8 labelled with R18. This would suggest that these two ER-phagy receptors are required for the PL transfer from the ER to the IM, which is not the case as autophagy is mildly affected by the absence of them (e.g., Zhang et al, Autophagy, 2020).
      6. Authors referred that "no direct evidence has been found to confirm lipid transfer at the ER-IM MCS in living cells" (lines 282-283). However, a recent paper has shown that de novo-synthesized phosphatidylcholine is incorporated from the ER to the autophagosomes and autophagic bodies (Orii et al, J Cel Biol, 2021). This reference should be mentioned in the manuscript.
      7. In lines 252-253, the sentence "R18 transport from the PM to the ER was partially impaired in osh1Δ osh2Δ, osh6Δ osh7Δ, and oshΔ osh4-1 cells (Figure S3). These results suggest that Osh proteins participate in transferring R18 from the PM to the ER" does not recapitulate what is observed in Fig. S3. Moreover, the Emr lab has generate a tertadeletion mutant in which the PM-ER MCSs are abolished. The authors could examine this mutant.

      Significance

      General assessment:

      Strength: potential new system to monitor lipid flow Limitations: Indirect evidences and in the case of the retrograde transport of phospholipids, it could be an artefact of the employed experimental approach.

      Advance: Little advances because something in part already shown in vitro. No ne mechanisms uncovered.

      Audience: Autophagy and membrane contact site fields.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      *We would like to thank all the reviewers for their positive comments and valuable feedback. In addition, we would like to address reviewer 1 query on novelty, which was not questioned by the other 2 reviewers. Our study uncovered two main aspects of hypoxia biology: first we addressed the role of NF-kappaB contribution towards the transcriptome changes in hypoxia, and second, this revealed a previously unknown aspect, that NF-kappaB is required for gene repression in hypoxia. While we know a lot about gene induction in hypoxia, much less is known about repression of genes. In times of energy preservation, gene repression is as important as gene induction. *

      .

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The work from Shakir et al uses different cell line models to investigate the role of NF-kB in the transcriptional adaptation of cells to hypoxia, which is relevant. In addition, the manuscript contains a large amount of data that could be of interest and even useful for researchers in the field of hypoxia and NF-kB. However, in my opinion, there are several concerns that should be revised and additional experiments that could be included to strengthen the relevance of the work.

      We thank this reviewer for their positive comments.

      Specific issues: In Figure 1A, the authors examine which of the genes induced by hypoxia require NF-kB by RNA sequencing analysis of cells knocked down for specific NF-kB subunits and exposed to hypoxia for 24 hours. The knockdown is about 40-60% at the RNA level, but it would be helpful to show the effect of knockdown at the protein level.

      We agree with this and have added Western blot data (Sup. Figure S1F), which shows the effects of the siRNA are much more pronounced at the protein level.

      All the data regarding genes induced by hypoxia in control or NF-kB siRNA-treated cells are somewhat confusing. If I understand correctly, when the data from the three different siRNAs are crossed, only 1070 genes are upregulated and 295 are downregulated in an NF-kB-independent manner. If this is the case, I think it would be easier to use this information in Figure 2 to define the hypoxia-induced genes that are NF-kB-dependent by simply considering those induced in the control that are not in the NF-kB-independent subset (rather than repeating the integration of the data without additional explanation). If the authors do this simple analysis, are the resulting genes the same or similar? In any case, the way these numbers are obtained should be shown more clearly (i.e., a new Venn diagram showing genes up- or down-regulated in the siRNA control that are not up- or down-regulated in any of the siRNA-NF-kB treatments).

      Figure 1 shows the effects on gene expression of hypoxia in control and NF-____k____B ____subunit____-depleted cells compared to normoxia control cells. Figures 1F/1G compares genes up/downregulated in hypoxia when RelA, RelB, and cRel are depleted, compared to normoxia control. Figure 1 does not display N____F-____k____B____-dependent/independent hypoxia-responsive genes____, but rather the overall effect of siRNA control and siNF-____k____B treatments in hypoxia, compared to siRNA control in normoxia. Figure 2 then defines NF-____k____B-dependent ____and independent hypoxia-responsive genes. We actually define these exactly as the reviewer suggested and agree that we should show the way these numbers are obtained more clearly. We have added the suggested Venn diagrams (Sup. Figure S2) and added extra information to the methods section (page 5 of revised manuscript). We felt it was important to show all the data upfront in Figure 1 and then integrate and focus on NF-____k____B-dependent ____hypoxia-induced genes in Figure 2.

      Figure 2H shows that approximately 80% of the NF-kB-dependent genes up- or down-regulated in hypoxia were identified as RelA targets, which is statistically significant compared to RelB or cRel targets. However, what is the proportion of genes identified as RelA targets in the subset of NF-kB-independent hypoxia-induced genes? And in a randomly selected set of 500-600 genes? In my opinion, this statistical analysis should be included to demonstrate a relationship between NF-kB recruitment and hypoxia-induced upregulation (expected) and downregulation (unexpected). In this context, it is surprising that HIF consensus sites are preferentially detected in the genes that are supposed to be NF-kB dependent instead of RelA consensus.

      We thank the reviewer for this question, which is really helpful. The way we have displayed the stars on the graph for Figure 2H was slightly misleading we realize now. As such, we have amended the graph. RelA, RelB, and cRel bound genes (from the ChIP atlas) are all significantly enriched within our N____F-____k____B-dependent hypoxia-responsive genes, there is no statistical testing between RelA bound vs RelB bound or cRel bound. We have also performed this analysis on the NF-____k____B____-independent hypoxia-responsive genes ____and see the same trend (Sup. Figure S5B). This indicates that the enrichment of Rel binding sites from the ChIP atlas is not specific to NF-____k____B____-dependent hypoxia-responsive genes____. We have moved Figure 2H to (Sup. Figure S5A) and amended our description of the finding. This showcases how DNA binding does not necessarily mean functionality. We have amended our description of this result and limitation of the study.

      Figure 3 is just a confirmation by qPCR of the data obtained in the RNA-seq analysis, which in my opinion should be included as supplementary information. Moreover, both the effects of hypoxia and reversion by RelB siRNA are modest in several of the genes tested. The same is true for Figures 4 and 5 with very modest and variable results across cell types and genes.

      We appreciate this comment; we would like to keep this as a main figure for full transparency and show validation of our RNA-sequencing results.

      Figure 6 shows the effect of NF-kB knockdown on the induction of ROS in response to hypoxia. In the images provided, the effect of hypoxia is minimal in control cells, with the only clear differences shown in RelA-depleted cells.

      The quantification of the IF data (Figure 6B) shows ROS induction in hypoxia which is reduced in Rel-depleted cells, with RelA depletion having the strongest effect. ROS generation in hypoxia, although counterintuitive, is well documented and used for important signalling events. We believe our data supports the previously reported levels of ROS induction (reviewed in {Alva, 2024}) in hypoxia and importantly, that NF-____k____B depletion can at least partially____ reverse this.

      In 6B it is not clear what the three asterisks in the normoxia control represent (compared to the hypoxia siRNA control?). This should be clarified in the figure legend or text.

      We apologize for the lack of clarity we have now added this information to the figure legend.

      In the Western blot of 6C, there are no differences in the levels of SOD1 after RelA depletion. Again, there is no reason not to include the NF-kB subunits in the Western blot analysis.

      We have added the Western blot analysis to this figure. We were trying to simplify it. Although depletion of RelA does not rescue the hypoxia-induced repression of SOD1, depletion of RelB does. Furthermore, cRel although not statistically significant, has a trend for the rescue of this effect, see Figure 6C-D.

      Finally, regarding Figure 7, the authors mention that "we confirmed that hypoxia led to a reduction in several proteins represented in this panel (of proteins involved in oxidative phosphorylation), such as UQCRC2 and IDH1 (Figure 7A-B)". The authors cannot say this because it is not seen in the Western blot in 7A or in the quantification shown in 7B. In my personal opinion, stating something that is not even suggested in the experiments is very negative for the credibility of the whole message.

      We really do not agree with this comment. We do see reductions in the levels of the proteins we mentioned. We have made the figure less complex given that some proteins are very abundant while others are not. We hope the changes are now clear and apparent. We have changed the quantification normalisation to reflect this as well and modified our description of the results, see Figure 7 and Sup. Figure S18.

      In conclusion, this paper contains a large amount of relevant information, but i) non-essential data should be moved to Supplementary, ii) protein levels of relevant players need to be shown in addition to RNA, iii) minimal or undetectable differences need to be considered as no-differences, and iv) a model showing what is the interpretation of the data provided is needed to better understand the message of the paper. I mean, is it p65 or RelB binding to some of these genes leading to their activation or repression, or is it RelA or RelB inducing HIF1beta leading to NF-kB-dependent gene activation by hypoxia? If this were the case, experimental evidence that NF-kB regulates a subset of hypoxia genes through HIF1beta would make the story more understandable.

      We apologise but we do not know why the reviewer mentions HIF1beta. For gene induction, there is cooperation with the HIF system in some genes but not all. The most interesting and unexpected finding is that NF-kappaB is required for gene repression in hypoxia. We have added a new figure, investigating how HDAC inhibition could reverse the repression. A mechanism known to be employed by NF-kappaB when repressing genes. We have added all the blots for NF-kB, clarified the quantification and included other approaches including a CRISPR KO cell lines for both IKKs. We hope this is now clear.

      Reviewer #1 (Significance (Required)):

      The work presented here is interesting but does not provide a major advance over previous publications, the main message being that a subset of hypoxia-regulated genes are NF-kB dependent. However, there is no mechanistic explanation of how this regulation is achieved and several data that are not clearly connected. A more comprehensive analysis of the data and additional experimental validation would greatly enhance the significance of the work.

      We politely disagree with the reviewer. Our main finding is that NF-____k____B____ does play an important role in gene regulation in hypoxia but unexpectedly, this occurs mostly via gene repression. While there is vast knowledge on gene induction in hypoxia, gene repression, which typically does not occur directly via HIF, is virtually unknown. A previous study had identified Rest as a transcriptional repressor {PMID: 27531581} but this could only account for 20% of gene repression. Our findings reveal up to 60% of genes repressed in hypoxia require NF-____k____B____, hence this is a significant finding and a major advance over previous knowledge. Furthermore, we feel this paper is an excellent data resource for the field, as it is, to our knowledge, the first study characterising the extent to which NF-____k____B is required for hypoxia-induced gene changes, on a transcriptome-wide scale. Furthermore, we have validated this across multiple cell types and also used different approaches to investigate the role of NF-kB in the hypoxia transcriptional response. We are happy that the other reviewers agree with our novel findings.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors have interrogated the role of NF-kappaB in the cellular transcriptional response to hypoxia. While HIF is considered the master regulator of the cellular response to hypoxia, it has long been known that mutliple transcription factors also play a role both independently of HIF and through the regulation of HIF-1alpha levels. Chief amongst these is NF-kappaB, a regulator of cell death and inflammation amongst other things. While NF-kappB has been known to be activated in hypoxia through altered PhD activity, the impact of this on global gene expression has remained unclear and this study addresses this important question. Of particular interest, genes downregulated in hypoxia appear to be repressed in a NF-kappaB-dependent manner. Overall, this nice study reveals an important role for NF-kappaB in the control of the global cellular transcriptional response to hypoxia.

      We thank this reviewer for their positive comments.

      Reviewer #2 (Significance (Required)):

      Some questions for the authors to consider with experiments or discussion: -One caveat of the current study which should be discussed is that while interesting and extensive, the analysis is restricted to cancer cell lines which have dysfunctional gene expression systems which may differ from "normal" cells. This should be discussed.

      We thank the reviewer for these comments. This is indeed an important aspect, which we now expand on in the discussion section. We also took advantage of RNA-seq datasets for HUVECs (a non-transformed cell lines) in response to hypoxia (Sup. Figure S15), TNF-alpha with and without RelA depletion (Sup. Figure S16). These data support our findings that in hypoxia NF-kB is important for transcriptional repression, with some contributions to gene induction, even in a non-transformed cell system.

      In the publicly available data sets analyzed, were the same hypoxic conditions used as in this study. This information should be included.

      We apologize if this was not clear, the hypoxia RNA-seq studies are the same oxygen level and time (1%, 24 hours), this is in the legend of Figure 4A and Sup. Figure S9 and in Sup. Table S2. We have added this information to the main text also.

      • What is known about NF-kappaB as a transcriptional repressor in other systems such as the control of cytokine or infection driven inflammation? This is briefly discussed but should be expanded. This is important as a key question in the study of hypoxia is what regulates gene repression.

      We have included this in the discussion and also analysed available data in HUVECs in response to cytokine stimulation with and without RelA depletion (Sup. Figure S16). This analysis revealed equal importance of RelA for activation and repression of genes upon TNF-alpha stimulation. Around 40% of genes require RelA for their induction or repression in response to TNF-a. In the discussion we have also included other references where NF-kappaB has been found to repress genes.

      NF-kappaB has previously been shown to regulate HIF-1alpha transcription. What are the effects of NF-kappaB subunit siRNAs on basal HIF-1alpha transcription? In figure 7, it appears that NF-kappaB subunit siRNA is without effect on hypoxia-induced HIF protein expression. Could this account for some of the effects of NF-kappaB depletion on the hypoxic gene signature? This point needs to be clarified in light of the data presented.

      We have included data for HIF-1α RNA levels in HeLa cells with/without NF-____k____B____ depletion followed by 24 hours of hypoxia (Sup. Figure S20) and we see a small reduction (~10-20%). The reviewer is correct, there was not much effect of NF-____k____B____ depletion on HIF-1α protein levels following 24 hours hypoxia in HeLa cells. Effects of NF-kappaB depletion can be found usually with lower times of hypoxia exposure or when more than one subunit is depleted at the same time. We have added this as a discussion point in the revised manuscript.

      NRF-2 is a key cellular sensor of oxidative stress in a similar way to HIF being a hypoxia sensor. The authors demonstrate using a dye that ROS are paradoxically increased in hypoxia (a more controversial finding than the authors present). It would be of interest to know if NFR-2 is induced in hypoxia as a marker of cellular oxidative stress. Similarly, it would be interesting to determine by metabolic analysis whether oxidative phosphorylation (O2 consumption) is decreased as the transcriptional signature would suggest (although the difficulty of performing metabolic analysis in hypoxia is acknowledged).

      To investigate if NRF2 is induced, we performed a western blot at 0, 1, and 24 hours 1% oxygen, but didn’t see any induction of NRF2 protein levels (____Sup. Figure S17A). We also overlapped our hypoxia upregulated genes with NRF2 target genes from {PMID:24647116 and PMID: 38643749} (Sup. Figure S17B) and found limited evidence of NRF2 target genes being induced. Based on these findings, it seems that NRF2 is not being induced in hypoxia, at least not at the hypoxia level/time point we have analysed. We also agree it would be ideal to measure oxygen consumption in hypoxia, but unfortunately, we do not have the technical ability to do this at present.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Strengths This manuscript attempts to integrate multiple strands of data to determine the role of NFkB in hypoxia -induced gene expression. This analysis looks at multiple NFkB subunits in multiple cell lines to convincingly demonstrate that NFkB does indeed play a central role in the regulation of hypoxia-induced gene expression. This broad approach integrates new experimental data with findings from the published literature.

      A significant amount of work has been performed both experimentally and bioinformatically to test experimental hypotheses.

      We thank this reviewer for their positive comments.

      Limitations

      The main analysis in the paper involves comparing the impact of knocking down different NFkB family members in hypoxia and comparing transcriptional responses. I am surprised that the authors did not include the impact of knockdown of the NFkB family members in normoxia too. The absence of these control experiments allows us to understand the role of NFkB in hypoxia, but does not give us information as to how many of those impacts are specific/ induced in hypoxic conditions. i.e. many of the observed effects of NFkB knockdown could be due to basal suppression of NFkB target genes that happen to be hypoxia sensitive. This finding is obviously important, but it would be nice to know how many of those genes are only / preferentially regulated by NFkB in hypoxia. This would give a much deeper insight into the role of NFkB in hypoxia induced gene expression.

      We agree this would have been ideal. For financial reasons we limited our analysis to hypoxia samples. We have performed qPCR analysis depleting RelA, RelB and cRel under normal oxygen conditions in HeLa (Sup. Figure S8). We find that the majority of the validated genes in HeLa cells which require____ NF-____k____B for gene changes in hypoxia, are not regulated by N____F-____k____B under normal oxygen conditions____. We have also added this limitation into our discussion section.

      The broad experimental approach while a strength of the paper in many ways also has its limitations e.g. Motif analysis revealing e.g. HIF-1a binding site enrichment in RelA and RelB-dependent DEGs is correlative observation and does not prove HIF involvement in NFkB-dependent hypoxia induced gene activation. Comparing responses with responses seen in one cell type with responses that have been described in a database comprised of many studies in a variety of different cells also has some limitations. These points can be described more fully in the discussion

      We agree these are mere correlations and hence a limitation and we have not formerly tested the involvement of HIF. We have included this in the discussion as suggested. For HIF binding site correlation, we do also compare to HIF ChIP-seq in HeLa cells exposed to 1% oxygen, albeit at 8 hours and not 24 hours (Sup. Figure S4).

      For siRNA transfections, single oligonucleotide sequences were used for RelA, RelB and cRel. This increases the potential likelihood of 'off targets' compared to pooled oligos delivered at lower concentrations. This limitation should at least be mentioned.

      We agree and have now included this as a limitation in the discussion section. We have now also included analysis using wild type and 2 different IKK____________ double KO CRISPR cell lines generated in the following publication {PMID: 35029639}. Out of the 9 genes we identified as NF-____k____B-dependent hypoxia upregulated genes from HeLa cell RNA-seq and validated by qPCR, which are also hypoxia-responsive in HCT116 cells (Sup. Figure S11D), 6 displayed ____NF-____k____B dependence in HCT116 cells (Sup. Figure S14). We also provide new protein data in this cell system for oxidative phosphorylation markers, which show as with the siRNA depletion, rescue of repression of these proteins when NF-____k____B is inactivated.

      RNA-seq experiments are performed on n=2 data which means relatively low statistical power. How has the statistical analysis been performed on normalised counts (corresponding to 2 n- numbers) to yield statistical significance? I am not familiar with hypergeometric tests - please justify their use here.

      __*We use DESeq2 for differential expression analysis and filter for effect size (> -/+ 0.58 log2 fold change) and statistical significance (FDR I am not familiar with hypergeometric tests - please justify their use here.

      The hypergeometric test (equivalent to a one-sided Fisher's exact test) is routinely used to determine whether the observed overlap between two gene lists is statistically significant compared to what would be expected by chance. It is also the statistical test of choice for popular bioinformatics tools which perform over representation analysis (ORA) to see which gene sets/groups/pathways/ontologies are over-represented in a gene list, examples include Metascape, clusterProfiler, WebGestalt (used in this study), and gProfiler.

      P14 RelB is described as having the most widespread impact of hypoxia dependent gene changes across all cell systems tested. Could this be due to a more potent silencing of RelB and / or due to particularly high/ low expression of RelB in these cells in general?

      This is an excellent point, at the RNA level the RelB depletion is slightly more efficient (Sup. Figure S1), at the protein level, silencing is highly potent with all 3 siRNAs (Sup. Figure S1). We looked at the RNA levels of RelA, RelB and cRel in HeLa cells at basal conditions, and RelA shows the highest abundance compared to RelB and cRel, while RelB and cRel have similar expression levels (see below). However, RelB is very dynamic in response to hypoxia, something we have observed but have not published yet.

      P18 For western blot analysis best practise is to have 2 MW markers per blot presented

      We have and have added the second MW markers suggested.

      For quantification, I suggest avoiding performing statistical analysis on semi-quantitative data unless a dynamic range of detection (with standards) has been fully established.

      We agree this has many limitations, we will keep the quantification but moved into supplementary information.

      P19 There is clearly an effect of reciprocal silencing with the NFkB knockdown experiments ie. siRelA affects RelB levels in hypoxia and vice versa. The implications of this for data interpretation should be discussed.

      Indeed, it is well known that RelB and cRel are RelA targets. Less is known about RelA as it is not a known NF-____k____B____ target. We have added a discussion in the revised manuscript.

      P20 The literature can be better cited in relation RelB and hypoxia A brief search reveals a few papers that should be mentioned/ discussed. Oliver et al. 2009 Patel et al. 2017 Riedl et al. 2021

      We have looked into these suggestions. Oliver et al, refer to hypercapnia, not hypoxia and the other two only briefly mentioned RelB with no effects toward the goals of their studies. We have tried to incorporate what is currently known as much as possible.

      I suggest leaving out mention of IkBa sumoylation and supplementary figure 10. I'm not sure the data in the paper as a whole merits focus on this very specific point.

      We thank the reviewer for this suggestion and we have removed this aspect from the manuscript.

      There is a very strong reliance on mRNA and TPM data. Some additional protein data in support of key findings will enhance

      We have added additional protein level analysis where we could obtain antibodies, see Figures 6, 7 and Sup. Figures S17, S18, and S19 for our protein level analysis.

      A graphical abstract summarising key findings with exemplar genes highlighted will enhance.

      We have added a model to summarise our findings as suggested.

      Both HIF and NFKB are ancient evolutionarily conserved pathways. Can lessons be learned from evolutionary biology as to how NFkB regulation of hypoxia induced genes occured. Does the HIF pathway pre-date the NFkB pathway or vice versa. This approach could be valuable in supporting the findings from this study.

      We have investigated this. Unfortunately, there are very little available data on hypoxia gene expression in lower organisms. However, we have added a few sentences on the evolution of NF-____k____B____ and HIF.

      Minor comments P2 please briefly explain how 5 genes give rise to 7 proteins

      We have added this to the introduction as requested.

      P2 there seems to be some recency bias in the studies cited as being associated with NFkB activation in response to hypoxia. Mention of Koong et al (1994) and Taylor et al (1999) and other early papers in the field will enhance

      We have added these as suggested.

      P3 The role of PHD enzymes in the regulation of NFkB in hypoxia can be introduced and / or discussed

      We have added a reference to this aspect as suggested.

      P8 I suggest use of proportional Venn diagrams to demonstrate the patterns more clearly

      We have added these as suggested.

      P11 To what extent might NFkB and Rest co-operate/ co-regulate gene repression in hypoxia?

      This is a good question. We have overlapped our datasets with Rest-dependent hypoxia-regulated genes identified by Cavadas et al., (Figure below), and find that these appear to act independently of each other for the most part, with very few genes co-regulated by both.

      Reviewer #3 (Significance (Required)):

      Shakir et al. present a manuscript titled 'NFkB is a central regulator of hypoxia-induced gene expression'.

      The research group are experts in both NFkB and hypoxia signaling and are the ideal group to perform these studies.

      Hypoxia and inflammation are co-incident in many physiological and pathophysiological conditions, where the microenvironment affects disease severity and patient outcome. The cross talk between inflammatory and hypoxia signaling pathways is not fully described. Thus, this manuscript takes a novel approach to an established question and concludes clearly that NFkB is a central regulator of hypoxia-induced gene expression.

      We thank the reviewer for these positive comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Strengths

      This manuscript attempts to integrate multiple strands of data to determine the role of NFkB in hypoxia -induced gene expression. This analysis looks at multiple NFkB subunits in multiple cell lines to convincingly demonstrate that NFkB does indeed play a central role in the regulation of hypoxia-induced gene expression. This broad approach integrates new experimental data with findings from the published literature.

      A significant amount of work has been performed both experimentally and bioinformatically to test experimental hypotheses.

      Limitations

      The main analysis in the paper involves comparing the impact of knocking down different NFkB family members in hypoxia and comparing transcriptional responses. I am surprised that the authors did not include the impact of knockdown of the NFkB family members in normoxia too. The absence of these control experiments allows us to understand the role of NFkB in hypoxia, but does not give us information as to how many of those impacts are specific/ induced in hypoxic conditions. i.e. many of the observed effects of NFkB knockdown could be due to basal suppression of NFkB target genes that happen to be hypoxia sensitive. This finding is obviously important, but it would be nice to know how many of those genes are only / preferentially regulated by NFkB in hypoxia. This would give a much deeper insight into the role of NFkB in hypoxia induced gene expression.

      The broad experimental approach while a strength of the paper in many ways also has its limitations e.g. Motif analysis revealing e.g. HIF-1a binding site enrichment in RelA and RelB-dependent DEGs is correlative observation and does not prove HIF involvement in NFkB-dependent hypoxia induced gene activation. Comparing responses with responses seen in one cell type with responses that have been described in a database comprised of many studies in a variety of different cells also has some limitations. These points can be described more fully in the discussion

      For siRNA transfections, single oligonucleotide sequences were used for RelA, RelB and cRel. This increases the potential likelihood of 'off targets' compared to pooled oligos delivered at lower concentrations. This limitation should at least be mentioned.

      RNA-seq experiments are performed on n=2 data which means relatively low statistical power. How has the statistical analysis been performed on normalised counts (corresponding to 2 n- numbers) to yield statistical significance? I am not familiar with hypergeometric tests - please justify their use here.

      P14 RelB is described as having the most widespread impact of hypoxia dependent gene changes across all cell systems tested. Could this be due to a more potent silencing of RelB and / or due to particularly high/ low expression of RelB in these cells in general?

      P18 For western blot analysis best practise is to have 2 MW markers per blot presented

      For quantification, I suggest avoiding performing statistical analysis on semi-quantitative data unless a dynamic range of detection (with standards) has been fully established.

      P19 There is clearly an effect of reciprocal silencing with the NFkB knockdown experiments ie. siRelA affects RelB levels in hypoxia and vice versa. The implications of this for data interpretation should be discussed.

      P20 The literature can be better cited in relation RelB and hypoxia A brief search reveals a few papers that should be mentioned/ discussed. Oliver et al. 2009 Patel et al. 2017 <br /> Riedl et al. 2021

      I suggest leaving out mention of IkBa sumoylation and supplementary figure 10. I'm not sure the data in the paper as a whole merits focus on this very specific point.

      There is a very strong reliance on mRNA and TPM data. Some additional protein data in support of key findings will enhance

      A graphical abstract summarising key findings with exemplar genes highlighted will enhance.

      Both HIF and NFKB are ancient evolutionarily conserved pathways. Can lessons be learned from evolutionary biology as to how NFkB regulation of hypoxia induced genes occured. Does the HIF pathway pre-date the NFkB pathway or vice versa. This approach could be valuable in supporting the findings from this study.

      Minor comments

      P2 please briefly explain how 5 genes give rise to 7 proteins

      P2 there seems to be some recency bias in the studies cited as being associated with NFkB activation in response to hypoxia. Mention of Koong et al (1994) and Taylor et al (1999) and other early papers in the field will enhance

      P3 The role of PHD enzymes in the regulation of NFkB in hypoxia can be introduced and / or discussed

      P8 I suggest use of proportional Venn diagrams to demonstrate the patterns more clearly

      P11 To what extent might NFkB and Rest co-operate/ co-regulate gene repression in hypoxia?

      Significance

      Shakir et al. present a manuscript titled 'NFkB is a central regulator of hypoxia-induced gene expression'.

      The research group are experts in both NFkB and hypoxia signaling and are the ideal group to perform these studies.

      Hypoxia and inflammation are co-incident in many physiological and pathophysiological conditions, where the microenvironment affects disease severity and patient outcome. The cross talk between inflammatory and hypoxia signaling pathways is not fully described. Thus, this manuscript takes a novel approach to an established question and concludes clearly that NFkB is a central regulator of hypoxia-induced gene expression.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors have interrogated the role of NF-kappaB in the cellular transcriptional response to hypoxia. While HIF is considered the master regulator of the cellular response to hypoxia, it has long been known that mutliple transcription factors also play a role both independently of HIF and through the regulation of HIF-1alpha levels. Chief amongst these is NF-kappaB, a regulator of cell death and inflammation amongst other things. While NF-kappB has been known to be activated in hypoxia through altered PhD activity, the impact of this on global gene expression has remained unclear and this study addresses this important question. Of particular interest, genes downregulated in hypoxia appear to be repressed in a NF-kappaB-dependent manner. Overall, this nice study is reveals an important role for NF-kappaB in the control of the global cellular transcriptional response to hypoxia.

      Significance

      Some questions for the authors to consider with experiments or discussion:

      • One caveat of the current study which should be discussed is that while interesting and extensive, the analysis is restricted to cancer cell lines which have dysfunctional gene expression systems which may differ from "normal" cells. This should be discussed.
      • In the publicly available data sets analysed, were the same hypoxic conditions used as in this study. This information should be included.
      • What is known about NF-kappaB as a transcriptional repressor in other systems such as the control of cytokine or infection driven inflammation? This is briefly discussed but should be expanded. This is important as a key question in the study of hypoxia is what regulates gene repression.
      • NF-kappaB has previously been shown to regulate HIF-1alpha transcription. What are the effects of NF-kappaB subunit siRNAs on basal HIF-1alpha transcription? In figure 7, it appears that NF-kappaB subunit siRNA is without effect on hypoxia-induced HIF protein expression. Could this account for some of the effects of NF-kappaB depletion on the hypoxic gene signature? This point needs to be clarified in light of the data presented.
      • NRF-2 is a key cellular sensor of oxidative stress in a similar way to HIF being a hypoxia sensor. The authors demonstrate using a dye that ROS are paradoxically increased in hypoxia (a more controversial finding than the authors present). It would be of interest to know if NFR-2 is induced in hypoxia as a marker of cellular oxidative stress. Similarly it would be interesting to determine by metabolic analysis whether oxidative phosphorylation (O2 consumption) is decreased as the transcriptional signature would suggest (although the difficulty of performing metabolic analysis in hypoxia is acknowleged).
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The work from Shakir et al uses different cell line models to investigate the role of NF-kB in the transcriptional adaptation of cells to hypoxia, which is relevant. In addition, the manuscript contains a large amount of data that could be of interest and even useful for researchers in the field of hypoxia and NF-kB. However, in my opinion, there are several concerns that should be revised and additional experiments that could be included to strengthen the relevance of the work.

      Specific issues:

      In Figure 1A, the authors examine which of the genes induced by hypoxia require NF-kB by RNA sequencing analysis of cells knocked down for specific NF-kB subunits and exposed to hypoxia for 24 hours. The knockdown is about 40-60% at the RNA level, but it would be helpful to show the effect of knockdown at the protein level. All the data regarding genes induced by hypoxia in control or NF-kB siRNA-treated cells are somewhat confusing. If I understand correctly, when the data from the three different siRNAs are crossed, only 1070 genes are upregulated and 295 are downregulated in an NF-kB-independent manner. If this is the case, I think it would be easier to use this information in Figure 2 to define the hypoxia-induced genes that are NF-kB-dependent by simply considering those induced in the control that are not in the NF-kB-independent subset (rather than repeating the integration of the data without additional explanation). If the authors do this simple analysis, are the resulting genes the same or similar? In any case, the way these numbers are obtained should be shown more clearly (i.e., a new Venn diagram showing genes up- or down-regulated in the siRNA control that are not up- or down-regulated in any of the siRNA-NF-kB treatments). Figure 2H shows that approximately 80% of the NF-kB-dependent genes up- or down-regulated in hypoxia were identified as RelA targets, which is statistically significant compared to RelB or cRel targets. However, what is the proportion of genes identified as RelA targets in the subset of NF-kB-independent hypoxia-induced genes? And in a randomly selected set of 500-600 genes? In my opinion, this statistical analysis should be included to demonstrate a relationship between NF-kB recruitment and hypoxia-induced upregulation (expected) and downregulation (unexpected). In this context, it is surprising that HIF consensus sites are preferentially detected in the genes that are supposed to be NF-kB dependent instead of RelA consensus. Figure 3 is just a confirmation by qPCR of the data obtained in the RNA-seq analysis, which in my opinion should be included as supplementary information. Moreover, both the effects of hypoxia and reversion by RelB siRNA are modest in several of the genes tested. The same is true for Figures 4 and 5 with very modest and variable results across cell types and genes. Figure 6 shows the effect of NF-kB knockdown on the induction of ROS in response to hypoxia. In the images provided, the effect of hypoxia is minimal in control cells, with the only clear differences shown in RelA-depleted cells. In 6B it is not clear what the three asterisks in the normoxia control represent (compared to the hypoxia siRNA control?). This should be clarified in the figure legend or text. In the Western blot of 6C, there are no differences in the levels of SOD1 after RelA depletion. Again, there is no reason not to include the NF-kB subunits in the Western blot analysis. Finally, regarding Figure 7, the authors mention that "we confirmed that hypoxia led to a reduction in several proteins represented in this panel (of proteins involved in oxidative phosphorylation), such as UQCRC2 and IDH1 (Figure 7A-B)". The authors cannot say this because it is not seen in the Western blot in 7A or in the quantification shown in 7B. In my personal opinion, stating something that is not even suggested in the experiments is very negative for the credibility of the whole message. In conclusion, this paper contains a large amount of relevant information, but i) non-essential data should be moved to Supplementary, ii) protein levels of relevant players need to be shown in addition to RNA, iii) minimal or undetectable differences need to be considered as no-differences, and iv) a model showing what is the interpretation of the data provided is needed to better understand the message of the paper. I mean, is it p65 or RelB binding to some of these genes leading to their activation or repression, or is it RelA or RelB inducing HIF1beta leading to NF-kB-dependent gene activation by hypoxia? If this were the case, experimental evidence that NF-kB regulates a subset of hypoxia genes through HIF1beta would make the story more understandable.

      Significance

      The work presented here is interesting but does not provide a major advance over previous publications, the main message being that a subset of hypoxia-regulated genes are NF-kB dependent. However, there is no mechanistic explanation of how this regulation is achieved and several data that are not clearly connected. A more comprehensive analysis of the data and additional experimental validation would greatly enhance the significance of the work.

  2. May 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Review

      Manuscript number: RC-2024-02391

      Corresponding author(s): John Varga

      Dibyendu Bhattacharyya

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      Dear editor,

      We are pleased to submit a full revised version of the manuscript that addresses all the points raised by the reviewers. We have included new experiments and modified the text and figures based on the reviewers’ suggestions. We thank all the reviewers for their insightful feedback, which has significantly enhanced the quality of the manuscript. We are confident and optimistic that our improved manuscript will be accepted by the journal of our choice.

      This document is supposed to contain a few images, which were somehow missing after the processing through the manuscript submission path. For convenience we also included a PDF version of the response to reviewers.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1

      • To reliably quantify the ciliary length in different cell types, and in independent ciliary marker needs to be included for comparison and the ciliary base needs to be labeled (e.g., g-TUBULIN). This needs to combined with a non-biased, high-throughput analysis, e.g., CiliaQ, Response: As suggested, we compared primary cilia length measurements using antibodies against Arl13b and γ-tubulin. The comparison between healthy controls (HC) and systemic sclerosis (SSc) is presented in Supplementary Figure S1. No significant differences in primary cilia length were observed compared to our previous measurements. Cilia length was quantified using ImageJ version 1.48v (http://imagej.nih.gov/ij) with the maximum intensity projection (MIP) method and visualized through 3D reconstruction using the ImageJ 3D Viewer.

      • As mentioned in the study, TGFbhas been implicated to drive myofibroblast transition. Thus TGFb stimulate ciliary signaling in the presented primary cells? The authors should provide a read-out for TGFb signaling in the cilium (ICC for protein phosphorylation etc.). Furthermore, canonical ciliary signaling pathways have been suggested to act as fibrotic drivers, such as Hedgehog and Wnt signaling - does stimulation of these pathways evoke a similar effect? Response: Yes, TGF-β1 stimulates ciliary signaling in growth-arrested foreskin fibroblasts. Clement et al. (2013) showed that TGF-β1 induces p-SMAD2/3 at the ciliary base, followed by the nuclear translocation of p-SMAD2/3 after 90 minutes. To assess whether canonical ciliary signaling pathways influence primary cilia length, we treated foreskin fibroblasts with Wnt (#908-SH, R&D) and a Shh agonist (#5036-WN, R&D) at 100 ng/mL each for 24 hours. We did not observe any changes in primary cilia length under either condition. These data are shown here for reference but are not included in the manuscript.

      Clement, Christian Alexandro, et al. "TGF-β signaling is associated with endocytosis at the pocket region of the primary cilium." Cell reports 3.6 (2013): 1806-1814.

      • Does TGFbinduce cell proliferation? If yes, this would force cilium disassembly and, thereby, reduce ciliary length, which is independent of a "shortening" mechanism proposed by the authors. Response: Yes, TGF-β induces cell proliferation in fibroblasts (Lee et al., 2013; Liu et al., 2016). However, we did serum starvation to stop proliferation. In our study, we observed a few percentage of Ki67-positive cells under TGF-β treatment at 24 hours (Supplementary Figure S2C). However, cell proliferation mainly stopped after 48 hours. Typically, proliferating cells rarely display any PC or show very small puncta. In our case, we observe a significantly elongated PC structure (although shorter than that of untreated cells) under TGF-beta-treated conditions. Our results display that a majority of cells are not proliferating but still display PC shortening under TGF-β treatment, suggesting that PC shortening is not due to cell division-induced PC disassembly. TGF beta-induced PC shortening is also reported in another fibroblast type previously (Kawasaki et al., 2024).

      Kawasaki, Makiri, et al. "Primary cilia suppress the fibrotic activity of atrial fibroblasts from patients with atrial fibrillation in vitro." Scientific Reports 14.1 (2024): 12470.

      Lee, J., Choi, JH. & Joo, CK. TGF-β1 regulates cell fate during epithelial–mesenchymal transition by upregulating survivin. Cell Death Dis 4, e714 (2013). https://doi.org/10.1038/cddis.2013.244.

      Liu, Y. et al. TGF-β1 promotes scar fibroblasts proliferation and transdifferentiation via up-regulating MicroRNA-21. Sci. Rep. 6, 32231; doi: 10.1038/srep32231 (2016).

      • As PGE2 has been shown to signal through EP4 receptors in the cilium, is the restoration of primary cilia length due to ciliary signaling? Response: As per your suggestion, we measured cilia length in the presence and absence of the EP4 receptor antagonist (#EP4 Receptor Antagonist 1; #32722; Cayman Chemicals; 500 nM) with PGE2. Interestingly, we did not observe a change in cilia length between the PGE2 and TGFβ (with EP4 receptor antagonist) treatment groups, as shown in supplementary figure S3. We believe that PGE2 works with the EP2 receptor under our experimental conditions. Kolodsick et al., 2003, also observed that PGE2 inhibits myofibroblast differentiation via activation of EP2 receptors and elevations in cAMP levels in healthy lung fibroblasts.

      Kolodsick, Jill E., et al. "Prostaglandin E2 inhibits fibroblast to myofibroblast transition via E. prostanoid receptor 2 signaling and cyclic adenosine monophosphate elevation." American journal of respiratory cell and molecular biology 29.5 (2003): 537-544.

      • Primary cilia length is regulated by cAMP signaling in the cilium vs. cytoplasm - does cAMP signaling play a role in this context? PGE2 is potent stimulator of cAMP synthesis - does this underlie the rescue of primary cilia length? Response: Yes, cAMP levels are important for both myofibroblast dedifferentiation and cilia length elongation. Kolodsick et al., 2003 observed that PGE2 inhibits myofibroblast differentiation via activation of EP2 receptors and elevations in cAMP levels in healthy lung fibroblasts. In a parallel set of experiments, treatment with forskolin (a cAMP activator) also reduced α-SMA protein levels by 40%. Forskolin is also known to increase PC length.

      Kolodsick, Jill E., et al. "Prostaglandin E2 inhibits fibroblast to myofibroblast transition via E. prostanoid receptor 2 signaling and cyclic adenosine monophosphate elevation." American journal of respiratory cell and molecular biology 29.5 (2003): 537-544.

      • The authors describe that they wanted to investigate how aSMA impacted primary cilia length. They only provide a knock-down experiment and measured ciliary length, but the mechanistic insight is missing. How does loss of aSMA expression control ciliary length? Response: We measured acetylated α-tubulin levels in ACTA2 siRNA-treated cells compared to control-treated cells. Acetylated α-tubulin levels increased under ACTA2 siRNA-treated conditions, as shown in Figure 4D, and TPPP3 levels were also elevated (Figure S8A). Interestingly, TPPP3 levels negatively correlated with disease severity in SSc fibroblasts (r = -0.2701, p = 0.0183), and TPPP3 expression significantly reduced in SSc skin biopsies, as shown in Figures 6C and 6D. These results strengthen our hypothesis that microtubule polymerization and actin polymerization, while they counterbalance each other, also contrarily affect PC length. We agree that a much more detailed study is needed to extensively delineate the intricate homeostasis of the actin network and microtubule network in conjunction with fibrosis and primary cilia length. We have mentioned this in the discussion.

      • The authors used LiCl in their experiments, which supposedly control Hh signaling. Coming back to my second questions, is this Hh-dependent? And what is the common denominator with respect to TGFbsignaling? And how is this mechanistically connected to actin and microtubule polymerization? Response: We used Shh inhibitor (Cyclopamine hydrate #C4116 Sigma-Aldrich) in both SSc and foreskin fibroblasts (with and without TGFβ). We found that PC length is significantly increased and αSMA intensity is reduced in the Shh inhibitor treated group (data not included in the Manuscript)

      • How was the aSMA Mean intensity determined? Response: We quantified aSMA mean intensity using ImageJ, and the procedure has been added to the respective figure legend and materials and methods section under ‘Quantification of immunofluorescence’ (each point represents mean intensity from three randomly selected hpf/slide was performed using ImageJ).

      • Fig: 1D: Statistical test is missing in Figure Legend and presentation of the p-values for the left graph is confusing Response: We added statistical test information in Figure Legend.

      • Some graphs are presented {plus minus} SD and some {plus minus} SEM, but this is not correctly stated in the Material & Methods Part __Response: __We added information to the figure legend as well as in the Material & Methods section.

        • 4D&E: Statistical test is missing in Figure Legend* Response: We added it now.
      • In general, text should be checked again for spelling mistakes and sentences may be re-written to promote readability. In particular, this applies to the discussion. __Response: __We checked and corrected.

      • Figure Legends are not written consistently, information is missing (e.g., statistical tests, see above). __Response: __We carefully checked and added information accordingly.

      • Figures should be checked again, and all text should be the same size and alignment of images should be improved. __Response: __We checked and corrected.

      Significance

      The authors present a novel connection between the regulation of primary cilia length and fibrogenesis. However, the study generally lacks mechanistic insight, in particular on how TGFb signaling, aSMA expression, and ciliary length control are connected. The spatial organization of the proposed signaling components is also not clear - is this a ciliary signaling pathway? If so, how does it interact with cytoplasmic signaling and vice versa?

      Response: Thank you for your thoughtful and constructive feedback. We appreciate your recognition of the novelty of our study linking primary cilia length regulation to fibrogenesis. In our revised manuscript, we did provide a mechanistic insight, though. Our results suggest that during the fibrotic response, higher-order actin polymerization, along with microtubule destabilization resulting from tubulin deacetylation, drives the shortening of PC length. In contrast, PC length elongation via stabilization of microtubule polymerization mitigates the fibrotic phenotype in fibrotic fibroblasts. We agree that a deeper mechanistic understanding particularly regarding how TGFβ signaling, αSMA expression, and ciliary length control intersect is essential for fully elucidating the pathway. We also acknowledge the importance of clarifying the spatial organization of the signaling components and plan to incorporate such analyses in future studies.

      Reviewer #2

      *I found the paper to be rather muddled and its presentation made if somewhat difficult to follow. For example, the Figures are disorganised (Fig 1 is a great example of this) and there was reference to Sup data that appeared out of order (eg Sup Fig 2 appeared before Sup Fig 1 in the text). *

      Response: We carefully revised the manuscript and arranged the figures.

      *Images in a single figure should be the same size. Currently they are almost random and us different magnifications. Overall, the paper needs to be better organized. *

      Response: We carefully revised the manuscript and figures provided with same magnification.

      *I have some significant concerns about how the PC length data was generated. To my mind the length may be hard to determine from the type of images shown in the paper (which may represent the best images?). Some of the images presented appear to show shorter, fatter PCs in the cells from fibrosis cases. Is this real or is it some kind of artefact? Would a shorter, fatter PCs have a similar or larger surface area? What would be the consequence of this? *

      Response: Primary cilia length was measured with ImageJ1.48v (using maximum intensity projection (MIP) method and visualized by 3D reconstruction with the ImageJ 3D viewer. Each small dot represents the PC length from an individual cell, and each large dot represents the average of the small dots for one cell line.

      *I am confused as to exactly what is meant by matched healthy controls. Age, sex and ethnicity, where stated seem to be very variable? What are CCL210 fibroblasts? *

      Response: We appreciate this comment. This is correct. The age, sex, and ethnicity are not matched for the available healthy controls. We have corrected that in the text. CCL210 is a commercially available fibroblast cell line that was isolated from the lung of a normal White, 20-year-old, female patient.

      *What does a change in PC length signify? DO shot PC foe a cellular transition or are they a consequence of it? What would happen is you targeted PCs with a drug and that influenced the length on all cell types? Is the effect on PC fibroblast specific? *

      __Response: __Significance and regulation of PC length are greatly debated and investigated still. It appears that PC length signify different features in different cell types. Although these are very interesting questions but such experiments are beyond the scope of our present work.

      Minor concerns

      *Page 4 second paragraph. I think it should be clarified that it is this group who have suggested a link between PCs and myofibroblast transition? *

      __Response: __We agree with the reviewer and clarified it.

      *Page 4 second paragraph. The use of the word "remarkably' is a bit subjective. *

      __Response: __We agree with the reviewer and have removed it.

      *Reference 27 is a paper on multiciliogenesis rather than primary ciliogenesis. *

      __Response: __We agree with the reviewer and have removed it.

      Figure 1 panel D. Make the image with the same sized vertical scale

      __Response: __We have replaced it with a new Figure 1.

      Significance

      Reviewer #2 (Significance (Required)):

      To my mind this is a novel paper and the data presented in it may be of interest to the cilia community as well as to the fibrosis field. This could be considered to be a significant advance and I am unaware that other groups are actively working in this area.

      Presentation of the data in the current form does not instil confidence in the work.

      Response: ____Thank you for recognizing the novelty and potential significance of our work. We appreciate your comments and fully acknowledge the concern regarding the presentation of the data. We have carefully revised the manuscript and reorganized the figures to improve clarity and overall presentation.

      Reviewer #3

      Major comments:

      • Need to demonstrate if the fibrotic phenotypes seen are produced through a ciliary-dependent mechanism. For example, to see if LiCl effects on Cgn1 are through ciliary expression or by other mechanisms. To achieve that objective, The authors should repeat the experiments in cells with a knockdown or knockout of ciliary proteins such as IFT20, IFT88, etc. The same approach should be applied to the tubacin experiments. Response: We silenced foreskin fibroblasts with IFT88/IFT20, both in the presence and absence of TGF-β1, followed by treatment with LiCl and Tubacin. Both LiCl and Tubacin can rescue cilia length and mitigate the myofibroblast phenotype in the presence of silenced IFT88/IFT20 gene, as shown in supplementary figure S9. Our result suggests that LiCl and Tubacin functions are both independent of the IFT-mediated ciliary mechanism. Regulation of PC length is still an enigma and highly debated. Moreover, PC length can be affected in multiple ways and is not solely dependent on IFTs (Avasthi and Marshall, 2012). One such method is the direct modification of the axoneme by altering microtubule stability through the acetylation state (Avasthi and Marshall, 2012), a pathway most likely the case for Tubacin. Another mode of PC length regulation is through a change in Actin polymerization. The remodeling of actin between contractile stress fibers and a cortical network alters conditions that are hospitable to basal body docking and maintenance at the cell surface (Avasthi and Marshall, 2012), causing PC length variation. Our results suggest that PC length functions as a sensor of the status of the fibrotic condition, as evidenced by the aSMA levels of the cells.

      Avasthi, P., and W.F. Marshall. 2012. Stages of ciliogenesis and regulation of ciliary length. Differentiation. 83:S30-42.

      • The use of LiCl to increase ciliary length is complicated. What are the molecular mechanisms underlying this effect? It is known that it may be affecting GSK-3b, which can have other ciliary-independent effects. Therefore, using ciliary KO/KD cells (IFT88 or IFT20) as controls may help assess the specificity of the proposed treatments. Response: As explained in the previous paragraph, PC length regulations are dependent on multiple factors and many of them are not IFT dependent. One such method is directly modifying the axoneme by altering microtubule stability/polymerization through the acetylation state(Avasthi and Marshall, 2012), a pathway most likely the case for Tubacin. Another mode of PC length regulation is through a change in Actin polymerization. The remodeling of actin between contractile stress fibers and a cortical network alters conditions that are hospitable to basal body docking and maintenance at the cell surface (Avasthi and Marshall, 2012), causing PC length variation. Higher order microtubule polymerization inhibit actin polymerization. By interrogating RNA-seq data we determined that several PC-disassembly related genes (KIF4A, KIF26A, KIF26B, KIF18A), as well as microtubule polymerization protein genes (TPPP, TPPP3, TUBB, TUBB2A etc), were differentially expressed in LiCl-treated SSc fibroblasts (Suppl. Fig. S6D). Altogether, these findings suggest that microtubule polymerization/depolymerization mechanisms may regulate PC elongation and attenuation of fibrotic responses after either LiCl or Tubacin treatment.

      • Also, assessing the frequency of ciliary-expressing cells is important. That may give another variable important to predict fibrotic phenotypes. Or do 100% of the cultured cells express cilia in those conditions? Response: We carefully checked and observed almost 95% cells express cilia in cultured conditions.

      • Have the authors evaluated if TGF-b1 treatments induce cell cycle re-entry and proliferation in these experimental conditions? This is important to exclude ciliary resorption due to cell cycle re-entry instead of the myofibroblast activation process. __Response:__Yes, TGF-β induces cell proliferation in fibroblasts (Lee et al., 2013; Liu et al., 2016). However, we did serum starvation to stop proliferation. In our study, we observed a few percentage of Ki67-positive cells under TGF-β treatment at 24 hours (Supplementary Figure S2C). However, cell proliferation mainly stopped after 48 hours. Typically, proliferating cells rarely display any PC or show very small puncta. In our case, we observe a significantly elongated PC structure (although shorter than that of untreated cells) under TGF-beta-treated conditions. Our results display that a majority of cells are not proliferating but still display PC shortening under TGF-β treatment, suggesting that PC shortening is not due to cell division-induced PC disassembly. TGF beta-induced PC shortening is also reported in another fibroblast type previously (Kawasaki et al., 2024).

      Kawasaki, Makiri, et al. "Primary cilia suppress the fibrotic activity of atrial fibroblasts from patients with atrial fibrillation in vitro." Scientific Reports 14.1 (2024): 12470.

      Lee, J., Choi, JH. & Joo, CK. TGF-β1 regulates cell fate during epithelial–mesenchymal transition by upregulating survivin. Cell Death Dis 4, e714 (2013). https://doi.org/10.1038/cddis.2013.244.

      Liu, Y. et al. TGF-β1 promotes scar fibroblasts proliferation and transdifferentiation via up-regulating MicroRNA-21. Sci. Rep. 6, 32231; doi: 10.1038/srep32231 (2016).

      • The authors described that they focused on the genes that are affected in opposite ways (supp table 4), but TEAD2, MICALL1, and HDAC6 are not listed in that table. Response: The list in Supplementary Table S3 includes common genes defined as differentially expressed based on a fold change >1 or Minor comments:

      • Figure 1A,B,C should also show lower magnification images where several cells/field are visualized. Response: We have replaced it with a new Figure 1.

      • The number of patients analyzed is not clear. For example, M&M describes 5 healthy and 8 SSc, but only 3 and 4 are shown in the figure. Furthermore, for orbital fibrosis, 2 healthy vs. 2 TAO are mentioned in the figure legend, but only one of each showed. Finally, the healthy control for lung fibroblast seems to be 3 independent experiments of the CCL210 cell line; please show the three independent controls and clarify on the X-axis and in the figure legend that these are CCL210 cells. Response: A total of 5 healthy and 8 SSc skin explanted fibroblast cell lines were used, as described in the Materials and Methods. Since these are patient-derived skin fibroblasts, maintaining equal numbers in each experiment is challenging. Revised graphs for orbital fibroblasts and CCL210 have been added in the new Figures 1B and 1C.

      • For the same set of experiments, please clarify and consistently describe the conditions that promote PC: 12hs serum starvation as described in M&M? Or 24hs as described in the text? Or 16 as described in figure legend 1? Or 24hs as described in supp figure 2? Response: We serum-starved the cells overnight, and this is also mentioned in the manuscript.

      • Please confirm in figure legends and M&M that 100 cells per group were counted. Response: We measured only 100 cells per cell line in Supplementary Figure S1B. To eliminate any confusion, we have now created a superplot for cilia analysis. Each small dot represents the PC length from an individual cell, and each large dot represents the average of the small dots for one cell line. An unpaired two-tailed t-test was performed on the small dots (mean ± SD).

      • Figure 2 should also provide lower magnification to show several cells per field. Response: Foreskin fibroblasts treated with TGF-β1 are added in S2A.

      • How do you explain that the increase in length of primary cilia after siACTA2 doesn't change COL1A1? Wouldn't it be a good approach to also check by Western Blot? Response: We believe that depletion of aSMA was sufficient to reduce the PC length for the reason described earlier (Avasthi and Marshall, 2012), but was not sufficient enough to change COL1A1 level. We added the western blot in Supplementary Figure S8B.

      • Once more, figure 5 will benefit from low mag images. How consistent is the effect of LiCl in the cultured cells? What is the percentage of rescued cells? Response: LiCl treatment was consistent for almost all the cells (~95%) as shown below and added in S4A.

      • Figure 5, panels F and G need better explanation in the results text as well as in the figure legend. Response: We added now.

      • 9) Some figures/supp figures are wrongly referenced in the text. *

      __ Response:__ We carefully revised the manuscript and corrected the references.

      10) Figure 6, panel A is confusing. Is it a comparison between SSC skin fibroblasts and foreskin fibroblasts? Maybe show labels on the panel.

      __ Response:__ We updated the figure legend for Panel A in Figure 6.

      11) Where is Figure 8 mentioned in the text?

      __ Response:__ In the discussion section.

      12) The work will benefit from an initial paragraph in the discussion enumerating the findings and a summary of the conclusion at the end.

      Response: We agree and modified the discussion accordingly.

      13) The nintedanib experiments are not described in the results section at all.

      Response: All nintedanib experiments are now included in Figure S5C-F and are described in the Results section.

      Significance

      Reviewer #3 (Significance (Required)): Beyond the lack of in situ ciliary expression assessment, the work is exciting, and the potential implications of treating/preventing fibrosis with small molecules to modulate ciliary length could be transformative in the field. Furthermore, there are a few HDAC6 inhibitors already in clinical trials for different tumors, which increases the significance of the work.

      Response: Thank you for your encouraging comments regarding the potential impact of our findings. We agree that the therapeutic implications of modulating ciliary length, particularly using small molecules such as HDAC6 inhibitors already in clinical trials, could be transformative in the context of fibrosis. We also acknowledge the importance of in situ assessment of ciliary expression and plan to incorporate such analyses in future studies to further strengthen our findings.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The author's main research topic in this work is the relationship between ciliary length and the level of fibrosis. Fibrotic samples show shorter primary cilia, profibrotic treatment with TGFB decreases the ciliary length, and posterior dedifferentiation of fibroblasts shows longer cilia. Cells with a decrease of αSMA by using siACTA2 siRNA, also show increased ciliary length. Most importantly, inducing the increase of ciliary length with LiCl or Tubacin has an inverse association with fibrosis phenotypes. The modulation of primary cilia length may represent a potential therapeutic strategy for fibrosis-associated diseases.

      The premise is relevant and exciting, and the methods are appropriate. The experiments partially sustain the conclusion. The results open a new potential area for studying fibrosis. The tables and figures aid in understanding the paper. The paper is clear and easy to read for a basic research specialized audience.

      Major comments:

      1. Need to demonstrate if the fibrotic phenotypes seen are produced through a ciliary-dependent mechanism. For example, to see if LiCl effects on Cgn1 are through ciliary expression or by other mechanisms. To achieve that objective, The authors should repeat the experiments in cells with a knockdown or knockout of ciliary proteins such as IFT20, IFT88, etc. The same approach should be applied to the tubacin experiments.
      2. The use of LiCl to increase ciliary length is complicated. What are the molecular mechanisms underlying this effect? It is known that it may be affecting GSK-3b, which can have other ciliary-independent effects. Therefore, using ciliary KO/KD cells (IFT88 or IFT20) as controls may help assess the specificity of the proposed treatments.
      3. Also, assessing the frequency of ciliary-expressing cells is important. That may give another variable important to predict fibrotic phenotypes. Or do 100% of the cultured cells express cilia in those conditions?
      4. Have the authors evaluated if TGF-b1 treatments induce cell cycle re-entry and proliferation in these experimental conditions? This is important to exclude ciliary resorption due to cell cycle re-entry instead of the myofibroblast activation process.
      5. The authors described that they focused on the genes that are affected in opposite ways (supp table 4), but TEAD2, MICALL1, and HDAC6 are not listed in that table.

      Minor comments:

      1. Figure 1A,B,C should also show lower magnification images where several cells/field are visualized.
      2. The number of patients analyzed is not clear. For example, M&M describes 5 healthy and 8 SSc, but only 3 and 4 are shown in the figure. Furthermore, for orbital fibrosis, 2 healthy vs. 2 TAO are mentioned in the figure legend, but only one of each showed. Finally, the healthy control for lung fibroblast seems to be 3 independent experiments of the CCL210 cell line; please show the three independent controls and clarify on the X-axis and in the figure legend that these are CCL210 cells.
      3. For the same set of experiments, please clarify and consistently describe the conditions that promote PC: 12hs serum starvation as described in M&M? Or 24hs as described in the text? Or 16 as described in figure legend 1? Or 24hs as described in supp figure 2?
      4. Please confirm in figure legends and M&M that 100 cells per group were counted.
      5. Figure 2 should also provide lower magnification to show several cells per field.
      6. How do you explain that the increase in length of primary cilia after siACTA2 doesn't change COL1A1? Wouldn't it be a good approach to also check by Western Blot?
      7. Once more, figure 5 will benefit from low mag images. How consistent is the effect of LiCl in the cultured cells? What is the percentage of rescued cells?
      8. Figure 5, panels F and G need better explanation in the results text as well as in the figure legend.
      9. Some figures/supp figures are wrongly referenced in the text.
      10. Figure 6, panel A is confusing. Is it a comparison between SSC skin fibroblasts and foreskin fibroblasts? Maybe show labels on the panel.
      11. Where is Figure 8 mentioned in the text?
      12. The work will benefit from an initial paragraph in the discussion enumerating the findings and a summary of the conclusion at the end.
      13. The nintedanib experiments are not described in the results section at all.

      Significance

      Beyond the lack of in situ ciliary expression assessment, the work is exciting, and the potential implications of treating/preventing fibrosis with small molecules to modulate ciliary length could be transformative in the field. Furthermore, there are a few HDAC6 inhibitors already in clinical trials for different tumors, which increases the significance of the work.

      Expertise: primary cilium functions, cell biology, cancer biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is an interesting paper that appears to show that explanted fibroblasts from a range of fibrotic conditions exhibit a reduction in the length of their primary cilia (PC). The paper employs a number of different experimental approaches that appear to show that the modulation of fibroblast/myofibroblast differentiation is associated with alterations in PC length. The rational for the study is that actin polymerization has previously been associated with PC length. The authors suggest that modulation of PC dynamics may represent a potential theraputic strategy for fibrotic disease. To me that seems like a big jump.

      Major concerns.

      I found the paper to be rather muddled and its presentation made if somewhat difficult to follow. For example, the Figures are disorganised (Fig 1 is a great example of this) and there was reference to Sup data that appeared out of order (eg Sup Fig 2 appeared before Sup Fig 1 in the text). Images in a single figure should be the same size. currently they are almost random and us different magnifications. Overall, the paper needs to be better organised.

      I have some significant concerns about how the PC length data was generated. To my mind the length may be hard to determine from the type of images shown in the paper (which may represent the best images?). Some of the images presented appear to show shorter, fatter PCs in the cells from fibrosis cases. Is this real or is it some kind of artefact? Would a shorter, fatter PCs have a similar or larger surface area? What would be the consequence of this?

      I am confused as to exactly what is meant by matched healthy controls. Age, sex and ethnicity, where stated seem to be very variable? What are CCL210 fibroblasts?

      What does a change in PC length signify? DO shot PC foe a cellular transition or are they a consequence of it? What would happen is you targeted PCs with a drug and that influenced the length on all cell types? Is the effec on PC fibroblast specific?

      Minor concerns

      Page 4 second paragraph. I think it should be clarified that it is this group who have suggested a link between PCs and myofibroblast transition?

      Page 4 second paragraph. The use of the word "remarkably' is a bit subjective.

      Reference 27 is a paper on multiciliogenesis rather than primary ciliogenesis.

      Figure 1 panel D. Make the image with the same sized vertical scale

      Significance

      To my mind this is a novel paper and the date presented in it may be of interest to the cilia community as well as to the fibrosis field. This could be considered to be a significant advance and I am unaware that other groups are actively working in this area.

      Presentation of the data in the current form does not instil confidence in the work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Verma et al. describes the involvement of primary cilia length control in driving pro-fibrotic progression of fibroblasts in fibrotic diseases. This is shown in primary cells from several organs from patients, suffering from different fibrotic diseases. They demonstrate that primary cilia are shorter in fibroblasts from different fibrotic conditions and that pro-fibrotic signaling, as exemplified by TGFb stimulation, causes shortening of the cilium. Vice versa, elongation of the cilium via different pharmacological substances reverses the pro-fibrotic phenotype.

      Major comments

      1. To reliably quantify the ciliary length in different cell types, and in independent ciliary marker needs to be included for comparison and the ciliary base needs to be labeled (e.g., -TUBULIN). This needs to combined with a non-biased, high-throughput analysis, e.g., CiliaQ,
      2. As mentioned in the study, TGF has been implicated to drive myofibroblast transition. Thus TGF stimulate ciliary signaling in the presented primary cells? The authors should provide a read-out for TGF signaling in the cilium (ICC fro protein phosphorylation etc.). Furthermore, canonical ciliary signaling pathways have been suggested to act as fibrotic drivers, such as Hedgehog and Wnt signaling - does stimulation of these pathways evoke a similar effect?
      3. Does TGF induce cell proliferation? If yes, this would force cilium disassembly and, thereby, reduce ciliary length, which is independent of a "shortening" mechanism proposed by the authors.
      4. As PGE2 has been shown to signal through EP4 receptors in the cilium, is the restoration of primary cilia length due to ciliary signaling?
      5. Primary cilia length is regulated by cAMP signaling in the cilium vs. cytoplasm - does cAMP signaling play a role in this context? PGE2 is potent stimulator of cAMP synthesis - does this underlie the rescue of primary cilia length?
      6. The authors describe that they wanted to investigate how aSMA impacted primary cilia length. They only provide a knock-down experiment and measured ciliary length, but the mechanistic insight is missing. How does loss of aSMA expression control ciliary length?
      7. The authors used LiCl in their experiments, which supposedly control Hh signaling. Coming back to my second questions, is this Hh-dependent? And what is the common denominator with respect to TGF signaling? And how is this mechanistically connected to actin and microtubule polymerization?
      8. How was the SMA Mean intensity determined?
      9. Fig: 1D: Statistical test is missing in Figure Legend and presentation of the p-values for the left graph is confusing.
      10. Some graphs are presented {plus minus} SD and some {plus minus} SEM, but this is not correctly stated in the Material & Methods Part.
      11. Fig. 4D&E: Statistical test is missing in Figure Legend.

      Minor comments

      • In general, text should be checked again for spelling mistakes and sentences may be re-written to promote readability. In particular, this applies to the discussion.
      • Figure Legends are not written consistently, information is missing (e.g., statistical tests, see above).
      • Figures should be checked again and all text should be the same size and alignment of images should be improved.

      Significance

      The authors present a novel connection between the regulation of primary cilia length and fibrogenesis. However, the study generally lacks mechanistic insight, in particular on how TGF signaling, SMA expression, and ciliary length control are connected. The spatial organization of the proposed signaling components is also not clear - is this a ciliary signaling pathway? If so, how does it interact with cytoplasmic signaling and vice versa?

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors describe a novel pattern of ncRNA processing by Pac1. Pac1 is a RNase III family member in S. pombe that has previously been shown to process pre-snoRNAs. Other RNase III family members, such as Rnt1 in S. cerevisiae and Dosha in human, have similar roles in cleaving precursors to ncRNAs (including miRNA, snRNA, snoRNA, rRNA). All RNAse III family members share that they recognize and cleave dsRNA regions, but differ in their exact sequence and structure requirement. snoRNAs can be processed from their own precursor, a polycistronic pre-cursor, or the intron of a snoRNA host gene. After the intron is spliced out, the snoRNA host gene can either encode an protein or be a non-functional by product.

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. snR107 is encoded within an intron and processed from the spliced out intron, similar to a typical intron-encoded snoRNA. What is different is that upon splicing, the host gene can adopt a new secondary structure that requires base-pairing between exon 1 and exon2, generating a Pac1 recognition site. This site is recognized, resulting in cleaving of the RNA and further processing of the 3' cleavage product into U14 snoRNA. In addition, the 5' cleavage product is processed into a ncRNA named mamRNA. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA. They conclude that making the formation of the Pac1 binding site dependent on previous splicing ensures that both snoRNAs are produced in the correct order and amount. Some of the authors findings are further confirmed by a different pre-print (reference 19), but the other preprint did not reveal the involvement of Pac1.

      While the analysis on the mamRNA/snR107/U14 precursor is convincing, as a single example the impact of these findings is uncertain. In Figure 4 and supplemental table 1, the authors use bioinformatic searches and identify other candidate loci in plans and animals that may be processed similarly. Each of these loci encode a putative precursor that results in one snoRNA processed from an intron, a different snoRNA processed from an exon, and a double stranded structure that can only form after splicing. While is potentially interesting, it is also the least developed and could be discussed and developed further as detailed below.

      Major comments:

      1. The proposal that plant and animal pre-snoRNA clusters are processed similarly is speculative. the authors provide no evidence that these precursors are processed by an RNase III enzyme cutting at the proposed splicing-dependent structure. This should not be expected for publication, but would greatly increase the interest.

      All three reviewers expressed a similar concern, and we now provide additional evidence supporting the conservation of the proposed mechanism. Specifically, we focused on the SNHG25 gene in H. sapiens, which hosts two snoRNAs—one intronic, as previously shown in Figure 4B, and one non-intronic. We substantiated our predictions through the re-analysis of multiple sequencing datasets in human cell lines, as outlined below:

      I. Analysis of CAGE-seq and nano-COP datasets indicates a single major transcription initiation site at the SNHG25 locus. Both the intronic and non-intronic snoRNAs are present within the same nascent precursor transcripts (Supplementary Figure 4D).

      II. Degradome-seq experiments in human cell lines reveal that the predicted splicing-dependent stem-loop structure within the SNHG25 gene is subject to endonucleolytic cleavage (Supplementary Figure 4D). The cleavage sites are located at the apical loop and flanking the stem, displaying a staggered symmetry characteristic of RNase III activity (Figure 4C). Importantly, the nucleotide sequence surrounding the 3' cleavage site and the 3' splice-site are conserved in other vertebrates (Supplementary Figure 4.D).

      III. fCLIP experiments demonstrate that DROSHA associates with the spliced SNHG25 transcript (Supplementary Figure 4D).

      Together, these analyses support the generalizability of our model beyond fission yeast. They confirm the structure of the SNHG25 gene as a single non-coding RNA precursor hosting two snoRNAs, one of which is intronic. Importantly, these findings show that the predicted stem-loop structure contains conserved elements and is subject to endonucleolytic cleavage. Human DROSHA, an RNase III enzyme, could be responsible for this processing step.

      The authors provide examples of similarly organized snoRNA clusters from human, mouse and rat, but the examples are not homologous to each other. Does this mean these snoRNA clusters are not conserved, even between mammals? Are the examples identified in Arabidopsis conserved in other plants? If there is no conservation, wouldn't that indicate that this snoRNA cluster organization offers no benefit?

      We noticed during this revision that the human SNHG25 locus is actually very well conserved in mice at the GM36220 locus, where both snoRNAs (SNORD104 and SNORA50C/GM221711) are similarly arranged. Although the murine host gene, GM36220, also contains an intron in the UCSC annotation, it is intronless in the Ensembl annotation we used to screen for mixed snoRNA clusters, which explains why it was not part of our initial list of candidates (Supplementary Table 1). Importantly, sequence elements in SNHG25, close to the splice sites and cleavage sites in exon 2, are also well conserved in mice and other vertebrates (Supplementary Figure 4D). Therefore, it is reasonable to think that the mechanism described for SNHG25 in humans may also apply in mice and other vertebrates.

      That being said, snoRNAs are highly mobile genetic elements. For example, it is well established that even between relatively closely related species (e.g., mouse and human), the positions of intronic snoRNAs within their host genes are not strictly conserved, even when both the snoRNAs and their host genes are. In the constrained drift model of snoRNA evolution (Hoeppner et al., BMC Evolutionary Biology, 2012; doi: 10.1186/1471-2148-12-183), it is proposed that snoRNAs are mobile and “may occupy any genomic location from which expression satisfies phenotype.”

      Therefore, a low level of conservation in mixed snoRNA clusters is generally expected and does not necessarily imply that is offers no benefit. Despite the limited conservation of snoRNA identity across species, mixed snoRNA clusters consistently display two recurring features: (1) non-intronic snoRNAs often follow intronic snoRNAs, and (2) the predicted secondary structure tends to span the last exon–exon junction. These enriched features support the idea that enforcing sequential processing of mixed snoRNA clusters may confer a selective advantage. We now explicitly discuss these points in the revised manuscript.

      Supplemental Figure 4 shows some evidence that the S. pombe gene organization is conserved within the Schizosaccharomyces genus, but could be enhanced further by showing what sequences/features are conserved. Presumably the U14 sequence is conserved, but snR107 is not indicated. Is it not conserved? Is the stem-loop more conserved than neighboring sequences? Are there any compensatory mutations that change the sequence but maintain the structure? Is there evidence for conservation outside the Schizosaccharomyces genus?

      We thank the reviewer for these excellent suggestions, which helped us significantly improve Supplementary Figure 4. In the revised version, we now include an additional species—S. japonicus, which is more evolutionarily distant—and show that the intronic snR107 is conserved across the Schizosaccharomyces genus (Supplementary Figure 4A). The distance between conserved elements (splice sites, snoRNAs, and RNA structures) varies, indicating that surrounding sequences are less conserved compared to these functionally constrained features

      We also performed a detailed alignment of the sequences corresponding to the predicted RNA secondary structures. This revealed that the apical regions are less conserved than the base, particularly near the splice and cleavage sites. In these regions, we observe compensatory or base-pair-neutral mutations (e.g., U-to-C or C-to-U, which both pair with G), suggesting structural conservation through evolutionary constraint (Supplementary Figures 4B–C). These observations are now described in greater detail in the revised manuscript, along with a discussion of the specific features likely to be under selective pressure at this locus.

      Conservation outside the Schizosaccharomyces genus is less clear. As already noted in the manuscript, the S. cerevisiae locus retains synteny between snR107 and snoU14, but the polycistronic precursor encompassing both is intronless and processed by RNase III (Rnt1) between the cistrons. Similarly, in Ashbya gossypii and a few other fungal species, synteny is preserved, but no intron appears to be present in the presumed common precursor. Notably, secondary structure predictions for the A. gossypii locus (not shown) suggest the formation of a stable stem-loop encompassing the first snoRNA in a large apical loop. This could reflect a distinct mode of snoRNA maturation, possibly analogous to pri-miRNA processing, where cleavage by an RNase III enzyme contributes to both 5′ and 3′ end formation. In Candida albicans, snoU14 is annotated within an intron of a host gene, but no homolog of snR107 is annotated. Other cases either resemble one of the above scenarios or are inconclusive due to the lack of a clearly conserved snoRNA (or possibly due to incomplete annotation). Although these examples are potentially interesting, we have chosen not to elaborate on them in the manuscript in order to maintain focus and avoid speculative interpretation in the absence of stronger evidence.

      The authors suggest that snoRNAs can be processed from the exons of protein coding genes, but snoRNA processing would destroy the mRNA. Thus snoRNAs processing and mRNA function seem to be alternative outcomes that are mutually exclusive. Can the authors comment?

      In theory, we agree with reviewer on the mutually exclusive nature of mRNA and snoRNA expression for putative snoRNA hosted in the exon of protein coding genes. However, we want to clarify that the specific examples of snoRNA precursor (or host) developed in the manuscript (mamRNA-snoU14 in S.pombe and, in this resubmission, SNHG25 in H. sapiens) are non-coding. So although we do not exclude that our model of sequential processing through splicing and endonucleolytic cleavage could apply to coding snoRNA precursors, it is not something we want to insist on, especially given the lack of experimental evidence for these cases.

      It is possible that the use of the term "exonic snoRNA" in the first version of the manuscript lead to the reviewer's impression that we explicitly meant that snoRNA processing can be processed from the exon of protein coding genes, which was not what we meant (although we do not exclude it). If that was the case, we apologize for the confusion. We have now clarified the issue (see next point).

      Minor comments:

      The term "exonic snoRNA" is confusing. Isn't any snoRNA by definition an exon?

      We agree that this term can be confusing, a sentiment that was also shared by reviewer 3. We replaced the problematic term by either "non-intronic snoRNA", "snoRNA" or "snoRNA gene located in exon" depending on the context, which are more unambiguous in conveying our intended meaning.

      The methods section does not include how similar snoRNA clusters were identified in other species

      We have now corrected this omission in the method section ('Identification of mixed snoRNA clusters' subsection): "To identify mixed snoRNA clusters, we downloaded the latest genome annotation from Ensembl and selected snoRNAs co-hosted within the same precursor, with at least one being intronic and at least one being non-intronic. We filtered out ambiguous cases where snoRNAs overlapped exons defined as 'retained introns', reasoning that in these cases the snoRNA is more likely to be intronic than not."

      In the discussion the authors argue that a previously published observation that S. pombe U14 does not complement a S. cerevisiae mutation can be explained because "was promoter elements... were simply not included in the transgene sequence". However, even if promoter elements were included, the dsRNA structure of S. pombe would not be cleaved by the S. cerevisiae RNase III. I doubt that missing promoter elements are the full explanation, and the authors provide insufficient data to support this conclusion.

      We agree with the reviewer that, given the substantial divergence in substrate specificity between Pac1 and Rnt1, it is unlikely that S. pombe snoU14 would be efficiently processed from its precursor in S. cerevisiae. We did not intend to suggest otherwise, and we have now removed this part of the discussion. As the experiment reported by Samarsky et al. did not detect expression of the S. pombe snoU14 precursor (even its unprocessed form), it remains inconclusive with respect to the conservation (or lack thereof) of snoU14 processing mechanisms.

      For the record, we had originally included this discussion to point out that the lack of cryptic promoter activity (or at least none that S. cerevisiae can use) within the S. pombe snoU14 precursor supports the idea that transcription initiates solely upstream of the mamRNA precursor. However, we recognize that this argument is speculative and potentially confusing. We have therefore removed it from the revised manuscript to maintain clarity and focus.

      **Referees cross-commenting**

      I agree with the other 2 reviewers but think the thiouracil pulse labeling reviewer 2 suggests would take considerable work and if snoRNA processing is very fast might not be as conclusive as the reviewer suggests.

      We are grateful to the reviewer for this comment, which helped us perform this reviewing in a timely manner.

      Reviewer #1 (Significance (Required)):

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      __ __The manuscript presents a novel mode of processing for polycistronic snoRNAs in the yeast Saccharomyces pombe. The authors demonstrate that the processing sequence of a transcription unit containing U14, intronic snR107, and an overlapping non-coding mamRNA is determined by secondary structures recognized by RNase III (Pac1). Specifically, the formation of a stem structure over the mamRNA exon-exon junction facilitates the processing of terminal exonic-encoded U14. Consequently, U14 maturation occurs only after the mamRNA intron (containing snR107) is spliced out. This mechanism prevents the accumulation of unspliced, truncated mamRNA.

      1.The first section describing the processing steps is challenging to follow due to the unusual organization of the locus and maturation pathway. If the manuscript is intended for a broad audience, I recommend simplifying this section and presenting it in a more accessible manner. A larger diagram illustrating the transcription unit and processing intermediates would be beneficial. Additionally, introducing snR107 earlier in the text would improve clarity.

      We thank the reviewer for these excellent suggestions. In the previous version of the manuscript, we were cautious in how we introduced the locus, as snR107 and the associated intron had not yet been published. This is no longer the case, as the locus is now described in Leroy et al. (2025). Accordingly, we now introduce the complete locus at the beginning of the manuscript and have improved the corresponding diagram (new Figure 1A). We believe these changes enhance clarity and make the section more accessible to a broader audience.

      2.Evaluation of some results is difficult due to the overexposure of Northern blot signals in Figures 1 and 2. The unspliced and spliced precursors appear as a single band, making it hard to distinguish processing intermediates. Would the authors consider presenting these results similarly to Figure 3, where bands are more clearly resolved? Or presenting both overexposed and underexposed blots?

      For all blots (probes A, B, and C), we selected an exposure level that allows detection of precursor forms under wild-type (WT) conditions. This necessarily results in some overexposure of the accumulating precursors in mutant conditions, due to their broad dynamic range of accumulation. To address this, we now provide an additional supplementary "source data" file containing all uncropped blots with both low and high exposures.

      For example, a lower exposure version of the blot in new Figure 1.B (included in the source data file) confirms the consistent accumulation of the spliced precursor when Pac1 activity is compromised. The unspliced precursor also shows slight accumulation in the Pac1-ts mutant, although to a much lesser extent than the spliced precursor. This observation is consistent with our qPCR results (new Figure 1.C).

      Importantly, because this effect is not observed in neither the Pac1-AA or the steam-dead (SD) mutants, we interpret it as an indirect effect—possibly reflecting a mild growth defect in the Pac1-ts strain, even under growth-permissive conditions. We now explicitly address this point in the revised manuscript.

      3.Additionally, I noticed a discrepancy in U14 detection: Probe B gives a strong signal for U14 in Figure 3B, whereas in Figures 1 and 2, U14 appears as faint bands. Could the authors clarify this inconsistency?

      We thank the reviewer for pointing out this discrepancy. The variation in U14 signal intensity is most likely due to technical differences in UV crosslinking efficiency during the Northern blot procedure. This step can differentially affect the membrane retention of RNA species depending on their length, as previously reported (PMID: 17405769). Because U14 is a relatively abundant snoRNA, the fainter signal observed in Figure 1 (relative to the accumulating precursor) likely reflects suboptimal crosslinking of shorter RNAs in that particular blot.

      Importantly, this technical variability does not impact the conclusions of our study, as we do not compare RNA species of different lengths directly. To increase transparency, we now provide a supplementary "source data" file that includes all uncropped blots from our Northern blot experiments. These include examples—such as the uncropped blot for Figure 1B—where U14 retention is more consistent.

      4.Furthermore, ethidium bromide (EtBr) staining of rRNA is used as a loading control, but overexposed signals from the gel may not accurately reflect RNA amounts on the membrane. This could affect the interpretation of mature RNA species' relative abundance.

      We thank the reviewer for pointing this out and have now measured rRNAs loading on the same northern blot membrane from probes complementary to mature rRNA. We updated new Figures 1B, 2B, 3B, S1B, and S3A accordingly.

      5.To further support the sequential processing model, the authors could use pulse-labeling thiouracil to test the accumulation of newly transcribed RNAs and accumulation of individual species. Additionally, it could help determine whether U14 can be processed through alternative, less efficient pathways. Would the authors consider incorporating this approach?

      We thank the reviewer for this pertinent suggestion. We actually plan to investigate the putative alternative U14 maturation pathway in future work, and the suggested approach will definitely be instrumental for that. However, to keep the present manuscript focused, and also to keep the review timely (successful pulse-chase experiments are likely to take time to optimize – as also suggested by the other reviewers in their cross-commenting section), we prefer not to perform this experiment for this reviewing.

      7.In the final section, the authors propose that this processing mechanism is conserved across species, identifying 12 similar genetic loci in different organisms. This is very interesting finding. In my opinion, providing any experimental evidence would greatly strengthen this claim and the manuscript's significance. Even preliminary validation would add substantial value!

      We thank the reviewer for his/her enthusiasm and are glad to provide some preliminary validation to the final section of our manuscript. Specifically, we focused on the SNHG25 gene in H. sapiens, which hosts two snoRNAs—one intronic, as previously shown in Figure 4B, and one non-intronic. We substantiated our predictions through the re-analysis of multiple sequencing datasets in human cell lines, as outlined below:

      I.Analysis of CAGE-seq and nano-COP datasets indicates a single major transcription initiation site at the SNHG25 locus. Both the intronic and non-intronic snoRNAs are present within the same nascent precursor transcripts (Supplementary Figure 4D).

      II.Degradome-seq experiments in human cell lines reveal that the predicted splicing-dependent stem-loop structure within the SNHG25 gene is subject to endonucleolytic cleavage (Supplementary Figure 4D). The cleavage sites are located at the apical loop and flanking the stem, displaying a staggered symmetry characteristic of RNase III activity (Figure 4C). Importantly, the nucleotide sequence surrounding the 3' cleavage site and the 3' splice-site are conserved in other vertebrates (Supplementary Figure 4.D).

      III. fCLIP experiments demonstrate that DROSHA associates with the spliced SNHG25 transcript (Supplementary Figure 4D).

      Together, these analyses support the generalizability of our model beyond fission yeast. They confirm the structure of the SNHG25 gene as a single non-coding RNA precursor hosting two snoRNAs, one of which is intronic. Importantly, these findings unambiguously show that the predicted stem-loop structure is subject to endonucleolytic cleavage, and they are consistent with DROSHA, an RNase III enzyme, being responsible for this processing step.

      **Referees cross-commenting**

      The other two reviewers' comments are justified.

      Reviewer #2 (Significance (Required)):

      The authors describe an interesting novel mode of snoRNA procseeimg form the host transcript. The results appear sound and intriguing, especially if the proposed mechanism can be confirmed across different organisms. Including such validation would significantly enhance the impact and make this work of broad audience interest.

      My expertise: transcription, non-coding RNAs

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      The manuscript by Migeot et al., focuses on a new Pac1-mediated snoRNA processing pathway for intron-encoded snoRNA pairs in yeast Schizosaccharomyces pombe. The novelty of the findings described in MS is the report of an unusual and relatively rare genomic organization and sequential processing of a few snoRNA genes in S. pombe and other eukaryotic organisms. It appears that in the case of snoRNA pairs, hosted in pre-mRNA in the intron and exon, respectively, the release of separate pre-snoRNAs from the host gene relies first on splicing to free the intron-encoded snoRNA, followed by endonucleolytic cleavage by RNase III (Pac1 in S. pombe) to produce snoRNA present in the mRNA exon. The sequential processing pathway, ensuring proper maturation of two snoRNAs, was demonstrated and argued in an elegant and clear way. The main message of the MS is straightforward, most experiments are properly conducted and specific conclusions based on the data are justified and valid. The text is clearly written and well-presentded.

      But there are some shortcomings.

      1.First of all, the title of the MS and general conclusions regarding the Pac1-mediated sequential release of snoRNA pairs hosted within the intron are definitely an overstatement. Especially the title suggests that this genomic organization and unusual processing mode of these snoRNAs is widespread. Later in the discussion the authors themselves admit that such mixed exonic-intronic snoRNAs are rare, although their presence may be underestimated due to annotation problems. It is likely that such snoRNA arrangement and processing is conserved, but the evidence is missing and only unique cases were identified based on bioinformatics mining and their processing has not been assayed. This makes the generalization impossible based on a single documented mamRNA/snoU14 example, no matter how carefully examined.

      We thank the reviewer for clearly articulating this concern. In response, we now provide additional evidence supporting conservation of the proposed mechanism in other species:

      • Conservation within the Schizosaccharomyces genus (Figures S4A–C) has been further analyzed, as suggested by Reviewer 1. This expanded analysis highlights conserved features—such as splice sites and cleavage sites within the predicted stem-loop structure—indicating that these elements are under selective constraint.

      • Conservation in mammals is now supported by experimental data, as detailed in our responses to point #7 of Reviewer 2 and major comment #1 of Reviewer 1. Specifically, we show that for the SNHG25 gene in H. sapiens (Figure S4D):

      (1) nascent transcription give rise to a single non-coding RNA precursor that hosts two snoRNAs, one of which is intronic;

      (2) the predicted stem-loop structure contains conserved elements and is subject to endonucleolytic cleavage;

      (3) the RNase III enzyme DROSHA associates with the spliced SNHG25 precursor.

      Together, these analyses strengthen the evidence for the evolutionary conservation of the mechanism and support the general conclusions and title of the manuscript.

      Another interesting observation is that, similarly to other intron-encoded snoRNA in other species, there is a redundant pathway to produce mature U14 in addition to Pac1-mediated cleavage. In the case of intronic snoRNAs in S. cerevisiae, their release could be performed either by splicing/debranching or Rnt1 cleavage, but there is also a third alternative option, that is processing following transcription termination downstream of the snoRNA gene, which at the same time interferes with the expression of the host gene. Is such a scenario possible as an alternative pathway for U14? Are there any putative, or even cryptic, terminators downstream of the U14 gene? The authors did not consider or attempt to inspect this possibility.

      We thank the reviewer for this interesting and thoughtful comment. First, we would like to clarify that snoU14 is not intron-encoded; rather, it is located on the exon downstream of the intron-encoded snR107.

      Regarding the possibility of transcription termination-based processing: downstream of snoU14, we identified a non-consensus polyadenylation signal (AUUAAA) preceded by a U-rich tract, followed by three consensus polyadenylation signals (AAUAAA) within a 500-nt window. These elements likely contribute to robust and redundant transcription termination at this highly expressed locus. However, since all these sites are located downstream of snoU14, they do not provide an alternative 5′-end processing mechanism for this snoRNA –they reflect normal termination.

      If we correctly understood the reviewer’s suggestion (apologies if not), they may have been referring to the possibility of a cryptic or alternative polyadenylation site between snR107 and snoU14 instead. If cleavage were to occur in this inter-snoRNA region while transcription continued past snoU14, it could, in principle, allow for alternative processing of snoU14. We have indeed considered this scenario. However, we currently do not find strong support for it: there are no identifiable polyadenylation signals motifs between the two snoRNAs, aside from a weakly conserved and questionable AAUAAU hexamer that does not appear to be used as polyA site at least in WT conditions (DOI: 10.4161/rna.25758). Given the lack of evidence, we chose not to explore this hypothesis further in the present manuscript, though it remains an interesting possibility for future investigation.

      I also have some concerns or comments related to the presented research, which are no major, but are mainly related to data quatification, but have to be addressed.

      • In Pac1-ts and Pac1-AA strains the level of mature U14 seems upregulated compared to respective WT (Figure 1A). At the same time mature 25S and 18S rRNAs are less abundant. But there is no quantification and it is not mentioned in the text. What could be the reason for these effects?

      We thank the reviewer for this observation. As reviewer 2 also noted, ethidium bromide staining of mature rRNAs is not a reliable quantitative loading control. In response to this concern, we have now reprobed all northern blots with radiolabeled rRNA probes. These provide a more accurate and consistent loading for our blots (new Figures 1B, 2B, 3B, S1B, S3A).

      Using these improved loading controls, it is evident that snoU14, snR107, and the unspliced precursor are all slightly upregulated in the Pac1-ts strain, although to a much lesser extent than the spliced precursor, which accumulates dramatically. We do not observe this effect in either the Pac1-AA or stem-dead (SD) mutants. We therefore interpret the modest upregulation as an indirect effect, possibly linked to the physiological state of the Pac1-ts mutant, which exhibits slower growth even at growth-permissive temperatures. We now explicitly discuss this in the revised manuscript.

      Regarding the suggestion to include quantification of the northern blot signal: we opted not to include this in the figures for the following reasons. First, the accumulation of the spliced precursor—the central focus of our analysis—is large and highly reproducible across all replicates and conditions. Second, northern blot quantification by pixel intensity remains semi-quantitative, particularly for comparisons across RNAs of highly different abundance. Finally, we support our conclusions with additional quantitative data from RT-qPCR and RNA-seq, which provide more robust measures of RNA accumulation.

      • Processing of the other snoRNA from the mamRNA/snoU14 precursor is largely overlooked in the MS. It is commented on only in the context of mutants expressing constitutive mamRNA-CS constructs (Figure 3B). Its level was checked in Pac1-ts and Pac1-AA (Supplementary Figure 1), but the authors conclude that "its expression remained largely unaffected by Pac1 inactivation", which is clearly not true. Similarly to U14, also snR170 is increased in Pac1-ts and Pac1-AA strains, at least judged "by eye" because the loading control or quantification is not provided. This matter should be clarified.

      We thank the reviewer for pointing this out. We have now included appropriate loading controls for Supplementary Figure 1 to clarify the interpretation. As discussed in our response to the previous comment, we observe a general upregulation of the mamRNA locus in the Pac1-ts strain, which likely contributes to the increased levels of both snR107 and snoU14. However, because this upregulation is not observed in the Pac1-AA or stem-dead (SD) mutants, we interpret it as an indirect effect, possibly related to the altered physiological state of the Pac1-ts strain (e.g., slightly reduced growth rate even at the permissive temperature). This interpretation has now been clearly explained in the revised manuscript.

      We also identified and corrected a labeling error in the previous version of Supplementary Figure 1, where the Pac1-ts and Pac1-AA strains were inadvertently swapped. We sincerely apologize for the confusion this may have caused and have now ensured that all figure panels are correctly labeled and consistent with the text.

      Other minor comments:

      Minor points:

      1. Page 1, Abstract. The sentence "The hairpin recruits the RNase III Pac1 that cleaves and destabilizes the precursor transcript while participating in the maturation of the downstream exonic snoRNA, but only after splicing and release of the intronic snoRNA" is not entirely clear and should be simplified, maybe split into two sentences. This message is clear after reading the MS and learning the data, but not in the abstract.

      We thank the reviewer for pointing this out and have now clarified the abstract following the suggestion to split and simplify the problematic sentence : "... the sequence surrounding an exon-exon junction within their precursor transcript folds into a hairpin after splicing of the intron. This hairpin recruits the RNase III ortholog Pac1, which participates in the maturation of the downstream snoRNA by cleaving the precursor."

      Page 1, Introduction. I am not convinced by the need to use the term "exonic snoRNA" for all snoRNA that are not intronic, which is misleading, and is rather associated per se with snoRNA encoded in the mRNA exon. It has been used before in the review about snoRNAs by Michelle Scott published in RNA Biol (2024), but it does not justify its common use.

      We thank the reviewer for raising this important point. We agree that the term “exonic snoRNA” can be misleading, as it was previously used to specifically refer to snoRNAs embedded within exons of mRNA transcripts—an rare and potentially artifactual scenario, as very cautiously discussed by Michelle Scott and colleagues in their review published in RNA Biol (2024).

      In the previous version of our manuscript, we actually used “exonic snoRNA” in a broader sense to denote any snoRNA not encoded within an intron, primarily for convenience in contrasting the processing of intronic snR107 with that of non-intronic/exonic snoU14. However, we recognize that this usage is non-standard and risks confusion due to the ambiguity surrounding the term’s definition in the literature.

      In light of this, and in agreement with reviewer 1 who raised a similar concern, we have revised the manuscript to remove the term “exonic snoRNA” entirely. Depending on the context, we now refer more precisely to “non-intronic snoRNA,” “snoRNA gene located in exon,” or simply “snoRNA.”

      Supplementary Figure 3. It is difficult to assess whether the level of mature rRNAs is unchanged in the mutants based on EtBr staining and without calculations. Northern blotting should be performed and the levels properly calculated.

      As suggested, we performed northern blotting on mature 18S and 25S, quantified the signal and observed no significant differences (new Supplementary Figure 3).

      **Referees cross-commenting**

      I also agree that 4sU labeling may require too much work with a questionable result.

      We are grateful to the reviewer for this comment, which helped us perform this reviewing in a timely manner.

      Reviewer #3 (Significance (Required)):

      Strengths: 1. Novelty of the described genomic arrangement of snoRNA/ncRNA genes and their processing in a sequential and regulated manner.

      Potential conservation of this pathways across eukaryotic organisms. Well designed and performed experiments followed by proper conclusions.

      Limitations: 1. Insufficient evidence to support generalization of the study results.

      Moderate overall impact of the study

      Advance: This research can be placed within publications describing specific processing pathways for various non-coding RNAs, including for example unusual chimeric species such as sno-lncRNAs. In this context, the presented results do advance the knowledge in the field by providing mechanistic evidence for a tightly controlled and coordinated maturation of selected ncRNAs.

      Audience: Basic research and specialized. The interest in this research will rather be limited to a specific field.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Migeot et al., focuses on a new Pac1-mediated snoRNA processing pathway for intron-encoded snoRNA pairs in yeast Schizosaccharomyces pombe.

      The novelty of the findings described in MS is the report of an unusual and relatively rare genomic organization and sequential processing of a few snoRNA genes in S. pombe and other eukaryotic organisms. It appears that in the case of snoRNA pairs, hosted in pre-mRNA in the intron and exon, respectively, the release of separate pre-snoRNAs from the host gene relies first on splicing to free the intron-encoded snoRNA, followed by endonucleolytic cleavage by RNase III (Pac1 in S. pombe) to produce snoRNA present in the mRNA exon. The sequential processing pathway, ensuring proper maturation of two snoRNAs, was demonstrated and argued in an elegant and clear way. The main message of the MS is straightforward, most experiments are properly conducted and specific conclusions based on the data are justified and valid. The text is clearly written and well-presentded.

      But there are some shortcomings. First of all, the title of the MS and general conclusions regarding the Pac1-mediated sequential release of snoRNA pairs hosted within the intron are definitely an overstatement. Especially the title suggests that this genomic organization and unusual processing mode of these snoRNAs is widespread. Later in the discussion the authors themselves admit that such mixed exonic-intronic snoRNAs are rare, although their presence may be underestimated due to annotation problems. It is likely that such snoRNA arrangement and processing is conserved, but the evidence is missing and only unique cases were identified based on bioinformatics mining and their processing has not been assayed. This makes the generalization impossible based on a single documented mamRNA/snoU14 example, no matter how carefully examined. Another interesting observation is that, similarly to other intron-encoded snoRNA in other species, there is a redundant pathway to produce mature U14 in addition to Pac1-mediated cleavage. In the case of intronic snoRNAs in S. cerevisiae, their release could be performed either by splicing/debranching or Rnt1 cleavage, but there is also a third alternative option, that is processing following transcription termination downstream of the snoRNA gene, which at the same time interferes with the expression of the host gene. Is such a scenario possible as an alternative pathway for U14? Are there any putative, or even cryptic, terminators downstream of the U14 gene? The authors did not consider or attempt to inspect this possibility.

      I also have some concerns or comments related to the presented research, which are no major, but are mainly related to data quatification, but have to be addressed. In Pac1-ts and Pac1-AA strains the level of mature U14 seems upregulated compared to respective WT (Figure 1A). At the same time mature 25S and 18S rRNAs are less abundant. But there is no quantification and it is not mentioned in the text. What could be the reason for these effects? Processing of the other snoRNA from the mamRNA/snoU14 precursor is largely overlooked in the MS. It is commented on only in the context of mutants expressing constitutive mamRNA-CS constructs (Figure 3B). Its level was checked in Pac1-ts and Pac1-AA (Supplementary Figure 1), but the authors conclude that "its expression remained largely unaffected by Pac1 inactivation", which is clearly not true. Similarly to U14, also snR170 is increased in Pac1-ts and Pac1-AA strains, at least judged "by eye" because the loading control or quantification is not provided. This matter should be clarified.

      Other minor comments:

      Minor points:

      1. Page 1, Abstract. The sentence "The hairpin recruits the RNase III Pac1 that cleaves and destabilizes the precursor transcript while participating in the maturation of the downstream exonic snoRNA, but only after splicing and release of the intronic snoRNA" is not entirely clear and should be simplified, maybe split into two sentences. This message is clear after reading the MS and learning the data, but not in the abstract.
      2. Page 1, Introduction. I am not convinced by the need to use the term "exonic snoRNA" for all snoRNA that are not intronic, which is misleading, and is rather associated per se with snoRNA encoded in the mRNA exon. It has been used before in the review about snoRNAs by Michelle Scott published in RNA Biol (2024), but it does not justify its common use.
      3. Supplementary Figure 3. It is difficult to assess whether the level of mature rRNAs is unchanged in the mutants based on EtBr staining and without calculations. Northern blotting should be performed and the levels properly calculated.

      Referees cross-commenting

      I also agree that 4sU labeling may require too much work with a questionable result.

      Significance

      Strengths:

      1. Novelty of the described genomic arrangement of snoRNA/ncRNA genes and their processing in a sequential and regulated manner.
      2. Potential conservation of this pathways across eukaryotic organisms.
      3. Well designed and performed experiments followed by proper conclusions.

      Limitations:

      1. Insufficient evidence to support generalization of the study results.
      2. Moderate overall impact of the study

      Advance:

      This research can be placed within publications describing specific processing pathways for various non-coding RNAs, including for example unusual chimeric species such as sno-lncRNAs. In this context, the presented results do advance the knowledge in the field by providing mechanistic evidence for a tightly controlled and coordinated maturation of selected ncRNAs.

      Audience:

      Basic research and specialized. The interest in this research will rather be limited to a specific field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript presents a novel mode of processing for polycistronic snoRNAs in the yeast Saccharomyces pombe. The authors demonstrate that the processing sequence of a transcription unit containing U14, intronic snR107, and an overlapping non-coding mamRNA is determined by secondary structures recognized by RNase III (Pac1). Specifically, the formation of a stem structure over the mamRNA exon-exon junction facilitates the processing of terminal exonic-encoded U14. Consequently, U14 maturation occurs only after the mamRNA intron (containing snR107) is spliced out. This mechanism prevents the accumulation of unspliced, truncated mamRNA.

      The first section describing the processing steps is challenging to follow due to the unusual organization of the locus and maturation pathway. If the manuscript is intended for a broad audience, I recommend simplifying this section and presenting it in a more accessible manner. A larger diagram illustrating the transcription unit and processing intermediates would be beneficial. Additionally, introducing snR107 earlier in the text would improve clarity.

      Evaluation of some results is difficult due to the overexposure of Northern blot signals in Figures 1 and 2. The unspliced and spliced precursors appear as a single band, making it hard to distinguish processing intermediates. Would the authors consider presenting these results similarly to Figure 3, where bands are more clearly resolved? Or presenting both overexposed and underexposed blots?

      Additionally, I noticed a discrepancy in U14 detection: Probe B gives a strong signal for U14 in Figure 3B, whereas in Figures 1 and 2, U14 appears as faint bands. Could the authors clarify this inconsistency? Furthermore, ethidium bromide (EtBr) staining of rRNA is used as a loading control, but overexposed signals from the gel may not accurately reflect RNA amounts on the membrane. This could affect the interpretation of mature RNA species' relative abundance.

      To further support the sequential processing model, the authors could use pulse-labeling thiouracil to test the accumulation of newly transcribed RNAs and accumulation of individual sopecies. Additionally, it could help determine whether U14 can be processed through alternative, less efficient pathways. Would the authors consider incorporating this approach?

      In the final section, the authors propose that this processing mechanism is conserved across species, identifying 12 similar genetic loci in different organisms. This is very interesting finding. In my opinion, providing any experimental evidence would greatly strengthen this claim and the manuscript's significance. Even preliminary validation would add substantial value!

      Referees cross-commenting

      The other two reviewers' comments are justified.

      Significance

      The authors describe an interesting novel mode of snoRNA procseeimg form the host transcript. The results appear sound and intriguing, especially if the proposed mechanism can be confirmed across different organisms. Including such validation would significantly enhance the impact and make this work of broad audience interest.

      My expertise: transcription, non-coding RNAs

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors describe a novel pattern of ncRNA processing by Pac1. Pac1 is a RNase III family member in S. pombe that has previously been shown to process pre-snoRNAs. Other RNase III family members, such as Rnt1 in S. cerevisiae and Dosha in human, have similar roles in cleaving precursors to ncRNAs (including miRNA, snRNA, snoRNA, rRNA). All RNAse III family members share that they recognize and cleave dsRNA regions, but differ in their exact sequence and structure requirement. snoRNAs can be processed from their own precursor, a polycistronic pre-cursor, or the intron of a snoRNA host gene. After the intron is spliced out, the snoRNA host gene can either encode an protein or be a non-functional by product.

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. snR107 is encoded within an intron and processed from the spliced out intron, similar to a typical intron-encoded snoRNA. What is different is that upon splicing, the host gene can adopt a new secondary structure that requires base-pairing between exon 1 and exon2, generating a Pac1 recognition site. This site is recognized, resulting in cleaving of the RNA and further processing of the 3' cleavage product into U14 snoRNA. In addition, the 5' cleavage product is processed into a ncRNA named mamRNA. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA. They conclude that making the formation of the Pac1 binding site dependent on previous splicing ensures that both snoRNAs are produced in the correct order and amount. Some of the authors findings are further confirmed by a different pre-print (reference 19), but the other reprint did not reveal the involvement of Pac1.

      While the analysis on the mamRNA/snR107/U14 precursor is convincing, as a single example the impact of these findings is uncertain. In Figure 4 and supplemental table 1, the authors use bioinformatic searches and identify other candidate loci in plans and animals that may be processed similarly. Each of these loci encode a putative precursor that results in one snoRNA processed from an intron, a different snoRNA processed from an exon, and a double stranded structure that can only form after splicing. While is potentially interesting, it is also the least developed and could be discussed and developed further as detailed below.

      Major comments:

      1. The proposal that plant and animal pre-snoRNA clusters are processed similarly is speculative. the authors provide no evidence that these precursors are processed by an RNase III enzyme cutting at the proposed splicing-dependent structure. This should not be expected for publication, but would greatly increase the interest.
      2. The authors provide examples of similarly organized snoRNA clusters from human, mouse and rat, but the examples are not homologous to each other. Does this mean these snoRNA clusters are not conserved, even between mammals? Are the examples identified in Arabidopsis conserved in other plants? If there is no conservation, wouldn't that indicate that this snoRNA cluster organization offers no benefit?
      3. Supplemental Figure 4 shows some evidence that the S. pombe gene organization is conserved within the Schizosaccharomyces genus, but could be enhanced further by showing what sequences/features are conserved. Presumably the U14 sequence is conserved, but snR107 is not indicated. Is it not conserved? Is the stem-loop more conserved than neighboring sequences? Are there any compensatory mutations that change the sequence but maintain the structure? Is there evidence for conservation outside the Schizosaccharomyces genus?
      4. The authors suggest that snoRNAs can be processed from the exons of protein coding genes, but snoRNA processing would destroy the mRNA. Thus snoRNAs processing and mRNA function seem to be alternative outcomes that are mutually exclusive. Can the authors comment?

      Minor comments:

      1. The term "exonic snoRNA" is confusing. Isn't any snoRNA by definition an exon?
      2. The methods section does not include how similar snoRNA clusters were identified in other species
      3. In the discussion the authors argue that a previously published observation that S. pombe U14 does not complement a S. cerevisiae mutation can be explained because "was promoter elements... were simply not included in the transgene sequence". However, even if promoter elements were included, the dsRNA structure of S. pombe would not be cleaved by the S. cerevisiae RNase III. I doubt that missing promoter elements are the full explanation, and the authors provide insufficient data to support this conclusion.

      Referees cross-commenting

      I agree with the other 2 reviewers but think the thiouracil pulse labeling reviewer 2 suggests would take considerable work and if snoRNA processing is very fast might not be as conclusive as the reviewer suggests.

      Significance

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Response to reviewers

      We would like to thank the reviewers for carefully reading our manuscript and for their valuable comments in support for the publication of our investigation of rapid promoter evolution of accessory gland genes between Drosophila species and hybrids. We are glad to read that the reviewers find our work interesting and that it provides valuable insights into the regulation and divergence of genes through their promoters. We are encouraged by their acknowledgement of the overall quality of the work and the importance of our analyses in advancing the understanding of cis-regulatory changes in species divergence.

      2. Point-by-point description of the revisions

      Reviewer #

      Reviewer Comment

      Author Response/Revision

      Reviewer 1

      The authors test the hypothesis that promoters of genes involved in insect accessory glands evolved more rapidly than other genes in the genome. They test this using a number of computational and experimental approaches, looking at different species within the Drosophila melanogaster complex. The authors find an increased amount of sequence divergence in promoters of accessory gland proteins. They show that the expression levels of these proteins are more variable among species than randomly selected proteins. Finally, they show that within interspecific hybrids, each copy of the gene maintains its species-specific expression level.

      We thank Reviewer 1 for their detailed review and positive feedback on our manuscript, and for their helpful suggestions. We have now fully addressed the points raised by Reviewer 1 and have provided the suggested clarifications and revisions to improve the flow, readability, and presentation of the data, which we believe have improved the manuscript significantly.

      The work is done with expected standards of controls and analyses. The claims are supported by the analysis. My main criticism of the manuscript has to do not with the experiments or conclusion themselves but with the presentation. The manuscript is just not very well written, and following the logic of the arguments and results is challenging.

      The problem begins with the Abstract, which is representative of the general problems with the manuscript. The Abstract begins with general statements about the evolution of seminal fluid proteins, but then jumps to accessory glands and hybrids, without clarifying what taxon is being studied, and what hybrids they are talking about. Then, the acronym Acp is introduced without explanation. The last two sentences of the Abstract are very cumbersome and one has to reread them to understand how they link to the beginning of the Abstract.

      More generally, if this reviewer is to be seen as an "average reader" of the paper, I really struggled through reading it, and did not understand many of the arguments or rationale until the second read-through, after I had already read the bottom line. The paragraph spanning lines 71-83 is another case in point. It is composed of a series of very strongly worded sentences, almost all starting with a modifier (unexpectedly, interestingly, moreover), and supported by citations, but the logical flow doesn't work. Again, reading the paragraph after I knew where the paper was going was clearer, but on a first read, it was just a list of disjointed statements.

      Since most of the citations are from the authors' own work, I suspect they are assuming too much prior understanding on the part of the reader. I am sure that if the authors read through the manuscript again, trying to look through the eyes of an external reader, they will easily be able to improve the flow and readability of the text.

      We thank the reviewer for their detailed feedback and are glad that they acknowledge our work fully supports the claims of our manuscript. We also appreciate their helpful suggestions for improving the readability of the manuscript and have done our best to re-write the abstract and main text where indicated. In particular, the paragraph between lines 71-83 have been rewritten and we have taken care to write to non-expert readers.

      1) In the analysis of expression level differences, it is not clear what specific stage / tissue the levels taken from the literature refer to. Could it be that the source of the data is from a stage or tissue where seminar fluid proteins will be expressed with higher variability in general (not just inter-specifically) and this could be skewing the results? Please add more information on the original source of the data and provide support for their validity for this type of comparison.

      These were taken from publicly available adult male Drosophila datasets, listed in the data availability statement and throughout the manuscript. We have provided more detail on the tissue used for analysis of Acp gene expression levels.

      2) The sentence spanning lines 155-157 needs more context.

      We have added more context to lines 155-157.

      3) Line 203-204: What are multi-choice enhancers?

      We replaced the sentence with "... such as rapidly evolving enhancers or nested epistasis enhancer networks"

      4) Figure 1: The terminology the authors use, comparing the gene of interest to "Genome" is very confusing. They are not comparing to the entire genome but to all genes in the genome, which is not the same.

      We have changed the word "genome" to "all genes in the genome" on the reviewer's suggestion.

      5) Figure 2: Changes between X vs. Y is redundant (either changes between X and Y or changes in X vs. Y).

      We assume that the reviewer is referring to Fig. 2B, which does not measure changes between X and Y, but changes in distribution between Acps and the control group. We have explained this in the figure legend.

      The manuscript addresses a general question in evolutionary biology - do control regions diverge more quickly protein coding regions. The answer is that yes, they do, but this is actually not very surprising. The work is probably thus of more interest to people interested in the copulatory proteins or in the evolution of mating systems, than to people interested in broader evolutionary questions.

      We appreciate this reviewer's recognition of the significance of our work and would like to point out that there are very few studies looking at promoter evolution as detailed in the introduction. Of particular relevance, our study using Acp genes allows us to directly test the impact of promoter mutations on the expression by comparing two alleles in male accessory glands of Drosophila hybrids. Male accessory glands consist of only two secretory cell types allowing us to study evolution of gene expression in a single cell type (Acps are either expressed in main cells or secondary cells). Amid this unique experimental set up we can conclude that promoter mutations can act dominant, in contrast to mutations in protein coding regions, which are generally recessive. Thus, our study is unique in pointing out a largely overseen aspect of gene evolution.

      Reviewer 2

      This manuscript explores promoter evolution of genes encoding seminal fluid proteins expressed in the male accessory gland of Drosophila and finds cis-regulatory changes underlie expression differences between species. Although these genes evolve rapidly it appears that the coding regions rarely show signs of positive selection inferring that changes in their expression and hence promoter sequences can underlie the evolution of their roles within and among species.

      We thank Reviewer 2 for their thorough review, positive feedback on the importance of our work, and suggestions for improving the manuscript. We have addressed all points raised by the reviewer, including analysis of Acp coding region evolution, additional analyses of hybrid expression data, and improved the clarity of the text.

      Figure 1 illustrates evidence that the promoter regions of these gene have accumulated more changes than other sampled genes from the Drosophila genome. While this convinces that the region upstream of the transcription start site has diverged considerably in sequence (grey line compared to black line), Figure 1A also suggests the "Genespan" region which includes the 5'UTR but presumably also part of the coding region is also highly diverged. It would be useful to see how the pattern extends into the coding region further to compare further to the promoter region (although Fig 1H does illustrate this more convincingly).

      The reviewer raises an interesting point, and certainly all parts of genes evolve. Fig. 1A shows the evolutionary rates of Acps compared to the genome average from phyloP27way scores calculated from 27 insect species. Since these species are quite distant it is unsurprising that they show divergence in coding regions as well as promoter regions. In fact, we addressed whether promoter regions evolve fast in closely related Drosophila species in Fig. 1H compared to coding regions. We have included an additional analysis of coding region evolution in Figure 1B.

      Figure 2 presents evidence for significant changes in (presumably levels of) expression of male accessory gland protein (AcP) genes and ribosomal proteins genes between pairs of species, which is reflected in the skew of expression compared to randomly selected genes.

      Correct, we have rephrased the statement for clarity.

      Figure 3 shows detailed analysis for 3 selected AcP genes with significantly diverged expression. The authors claim this shows 'substitution' hotspots in the promoter regions of all 3 genes but this could be better illustrated by extending the plots in B-D further upstream and downstream to compare to these regions.

      We picked the 300-nucleotide promoter region for this analysis as it accumulated significant changes as shown in Fig. 1E-H, and extending the G plots (Fig. 3B-D) to regions with lower numbers of sequence changes would not substantially change the conclusion. Specifically, this analysis identifies sequence change hotspots within fast-evolving promoter regions, rather than comparing promoter regions to other genomic regions, as we previously addressed. The plot is based on a cumulative distribution function and the significant positive slope in the upstream region where promoters are located identifies a hotspot for accumulation of substitutions. There could be other hotspots, but the point being made is that significant hotspots consistently appear in the promoter region of these three genes.

      Figure 4 shows the results of expression analysis in parental lines of each pair of species and F1 hybrids. However the results are very difficult to follow in the figure and in the relevant text. While the schemes in A, C. E and G are helpful, the gel images are not the best quality and interpretations confusing. An additional scheme is needed to illustrate hypothetical outcomes of trans change, cis change and transvection to help interpret the gels. On line 169 (presumably referring to panels D and F although C and D are cited on the next line) the authors claim that Obp56f and CG11598 'were more expressed in D. melanogaster compared to D. simulans' but in the gel image the D. sim band is stronger for both genes (like D. sechellia) compared to the D. mel band. The authors also claim that the patterns of expression seen in the F1s are dominant for one allele and that this must be because of transvection. I agree this experiment is evidence for cis-regulatory change. However the interpretation that it is caused by transvection needs more explanation/justification and how do the authors rule out that it is not a cis X trans interaction between the species promoter differences and differences in the transcription factors of each species in the F1? Also my understanding is that transvection is relatively rare and yet the authors claim this is the explanation for 2/4 genes tested.

      We appreciate the reviewer's comments on Figure 4 and the opportunity to improve its clarity. To address these concerns, we have carefully checked the figure citations and corrected any inconsistencies.

      The reviewer raises an important point about our interpretation of transvection. We have expanded our discussion of this result to consider why transvection is a plausible explanation for the observed dominance patterns and also consider cis x trans interactions between species-specific promoters and transcription factor binding. While rare, transvection likely has more relevance in hybrid regulatory contexts involving homologous chromosome pairing which we discuss this in the revised text.

      Line 112 states that the melanogaster subgroup contains 5 species - this is incorrect - while this study looked at 5 species there are more species in this subgroup such as mauritiana and santomea.

      We have corrected the statement about the number of species in the melanogaster subgroup.

      Lines 131-134 could explain better what the conservation scores and their groupings mean and the rationale for this approach.

      We have clarified what the conservation scores and their groupings mean and the rationale for this approach.

      Line 162 - the meaning of the sentence starting on this line is unclear - it sounds very circular.

      We have rephrased the statement for more clarity.

      Line 168 should cite Fig 4 H instead of F.

      We have amended citation of Fig 4F to H.

      Reviewer 3

      In this study, McQuarrie et al. investigate the evolution of promoters of genes encoding accessory gland proteins (Acps) in species within the D. melanogaster subgroup. Using computational analyses and available genomic and transcriptomic datasets, they demonstrate that promoter regions of Acp genes are highly diverse compared to the promoters of other genes in the genome. They further show that this diversification correlates with changes in gene expression levels between closely related species. Complementing these computational analyses, the authors conduct experiments to test whether differences in expression levels of four Acp genes with highly diverged promoter regions are maintained in hybrids of closely related species. They find that while two Acp genes maintain their expression level differences in hybrids, the other two exhibit dominance of one allele. The authors attribute these findings to transvection. Based on their data, they conclude that rapid evolution of Acp gene promoters, rather than changes in trans, drives changes in Acp gene expression that contribute to speciation.

      We thank Reviewer 3 for their thorough review and suggestions. We further thank the reviewer for acknowledging the importance of our findings and for pointing out that it contributes to our understanding of speciation. We have thoroughly addressed all comments from the reviewer and significantly revised the manuscript. We believe that this has greatly improved the manuscript.

      Unfortunately, the presented data are not sufficient to fully support the conclusions. While many of the concerns can be addressed by revising the text to moderate the claims and acknowledge the methodological limitations, some key experiments require repetition with more controls, biological replicates, and statistical analyses to validate the findings.

      Specifically, some of the main conclusions heavily rely on the RT-PCR experiments presented in Figure 4, which analyze the expression of four Acp genes in hybrid flies. The authors use PCR and RFLP to distinguish species-specific alleles but draw quantitative conclusions from what is essentially a qualitative experiment. There are several issues with this approach. First, the experiment includes only two biological replicates per sample, which is inadequate for robust statistical analysis. Second, the authors did not measure the intensity of the gel fragments, making it impossible to quantify allele-specific expression accurately. Third, no control genes were used as standards to ensure the comparability of samples.

      The gold standard for quantifying allele-specific expression is using real-time PCR methods such as TaqMan assays, which allow precise SNP genotyping. To address this major limitation, the authors should ideally repeat the experiments using allele-specific real-time PCR assays. This would provide a reliable and quantitative measurement of allele-specific expression.

      If the authors cannot implement real-time PCR, an alternative (though less rigorous) approach would be to continue using their current method with the following adjustments:

      • Include a housekeeping gene in the analysis as an internal control (this would require identifying a region distinguishable by RFLP in the control).

      • Quantify the intensity of the PCR products on the gel relative to the internal standard, ensuring proper normalization.

      • Increase the sample size to allow for robust statistical analysis.

      These experiments could be conducted relatively quickly and would significantly enhance the validity of the study's conclusions.

      We thank the reviewer for their detailed suggestions for improving the conclusions in Fig. 4. Indeed, incorporating a housekeeping gene as a control supports our results for qualitative analysis of gene expression in hybrids assessing each allele individually (Fig 4), and improves interpretation for non-experts. We have also included an additional analysis in the new Fig. 5 which analyses RNA-seq expression changes in D. melanogaster x D. simulans hybrid male accessory glands. We believe these additions have significantly improved the manuscript and its conclusions.

      While the following comments are not necessarily minor, they can be addressed through revisions to the text without requiring additional experimental work. Some comments are more conceptual in nature, while others concern the interpretation and presentation of the experimental results. They are provided in no particular order.

      1. A key limitation of this study is the use of RNA-seq datasets from whole adult flies for interspecies gene expression comparisons. Whole-body RNA-seq inherently averages gene expression across all tissues, potentially masking tissue-specific expression differences. While Acp genes are likely restricted to accessory glands, the non-Acp genes and the random gene sets used in the analysis may have broader expression profiles. As a result, their expression might be conserved in certain tissues while diverging in others- an aspect that whole-body RNA-seq cannot capture. The authors should acknowledge that tissue-specific RNA-seq analyses could provide a more precise understanding of expression divergence and potentially reveal reduced conservation when considering specific tissues independently.

      We have added a section discussing the limitations in gene expression analysis in the discussion. In addition, we have included an additional Figure analysing gene expression in hybrid male accessory glands (Fig. 5).

      1. The statement in line 128, "Consistent with this model," does not accurately reflect the findings presented in Figures 2A and B. Specifically, the data in Figure 2A show that Acp gene expression divergence is significantly different from the divergence of non-Acp genes or a random sample only in the comparison between D. melanogaster and D. simulans. However, when these species are compared to D. yakuba, Acp gene expression divergence aligns with the divergence patterns of non-Acp genes or random samples. In contrast, Figure 2B shows that the distribution of expression changes is skewed for Acp genes compared to random control samples when D. melanogaster or D. simulans are compared to D. yakuba. However, this skew is absent when the two D. melanogaster and D. simulans are compared. Therefore, the statement in line 128 should be revised to accurately reflect these nuanced results and the trends shown in Figure 2A and B.

      We have updated the statement for clarity. Here, the percentage of Acps showing significant gene expression changes is greater between more closely related species, but the distribution of expression changes increases between more distantly related species.

      1. The statement in lines 136-138, "Acps were enriched for significant expression changes in the faster evolving group across all species," while accurate, overlooks a key observation. This trend was also observed in other groups, including those with slower evolving promoters, in some of the species' comparisons. Therefore, the enrichment is not unique to Acps with rapidly evolving promoters, and this should be explicitly acknowledged in the text.

      This is a valid point, and we have updated this statement as suggested.

      1. It would be helpful for the authors to explain the meaning of the d score at the beginning of the paragraph starting in line 131, to ensure clarity for readers unfamiliar with this metric.

      This scoring method is described in the methods sections, and we have now included reference to thorough explanation of how d was calculated at the indicated section.

      1. In Figure 2C-E - the title of the Y-axis does not match the text. If it represents the percentage of genes with significant expression changes, as in Figure 2A, the discrepancies between the percentages in this figure and those in Figure 2A need to be addressed.

      We have updated the method used to categorise significant changes in gene expression in the text and the figure legend for clarity.

      1. The experiment in Figure 3 needs a better explanation in the text. What is the analysis presented in Figure 3B-D. How many species were compared?

      We have added additional details in the results section and an explanation of how sequence change hotspots were calculated in the results section is available.

      1. The concept of transvection should be omitted from this manuscript. First, the definition provided by the authors is inaccurate. Second, even if additional experiments were to convincingly show that one allele in hybrid animals is dominant over the other, there are alternative explanations for this phenomenon that do not involve transvection. The authors may propose transvection as a potential model in the discussion, but they should do so cautiously and explicitly acknowledge the possibility of other mechanisms.

      We have updated the text to more conservatively discuss transvection, moving this to the discussion section with additional possibilities discussed.

      1. The statement at the end of the introduction is overly strong and would benefit from more cautious phrasing. For instance, it could be reworded as: "These findings suggest that promoter changes, rather than genomic background, play a significant role in driving expression changes, indicating that promoter evolution may contribute to the rise of new species."

      We have reworded this line following the reviewer's suggestion.

      1. Line 32 of the abstract: The term "Acp" is introduced without explaining what it stands for. Please define it as "Accessory gland proteins (Acp)" when it first appears.

      We have updated the manuscript to define Acp where it is first mentioned.

      1. Line 61: The phrase "...through relaxed,..." is unclear. Specify what is relaxed (e.g., "relaxed selective pressures").

      We have included description of relaxed selective pressures.

      1. The sentence in lines 74-76, starting in "Interestingly,...." Needs revision for clarity.

      We have removed the word interestingly.

      1. Line 112: Revise "we focused on the melanogaster subgroup which is made up of five species" to: "we focused on the melanogaster subgroup, which includes five species."

      We have made this change in the text.

      1. In line 144 use the phrase "promoter conservation" instead of "promoter evolution"

      We have updated the phrasing.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, McQuarrie et al. investigate the evolution of promoters of genes encoding accessory gland proteins (Acps) in species within the D. melanogaster subgroup. Using computational analyses and available genomic and transcriptomic datasets, they demonstrate that promoter regions of Acp genes are highly diverse compared to the promoters of other genes in the genome. They further show that this diversification correlates with changes in gene expression levels between closely related species. Complementing these computational analyses, the authors conduct experiments to test whether differences in expression levels of four Acp genes with highly diverged promoter regions are maintained in hybrids of closely related species. They find that while two Acp genes maintain their expression level differences in hybrids, the other two exhibit dominance of one allele. The authors attribute these findings to transvection. Based on their data, they conclude that rapid evolution of Acp gene promoters, rather than changes in trans, drives changes in Acp gene expression that contribute to speciation.

      Major comments:

      Unfortunately, the presented data are not sufficient to fully support the conclusions. While many of the concerns can be addressed by revising the text to moderate the claims and acknowledge the methodological limitations, some key experiments require repetition with more controls, biological replicates, and statistical analyses to validate the findings.

      Specifically, some of the main conclusions heavily rely on the RT-PCR experiments presented in Figure 4, which analyze the expression of four Acp genes in hybrid flies. The authors use PCR and RFLP to distinguish species-specific alleles but draw quantitative conclusions from what is essentially a qualitative experiment. There are several issues with this approach. First, the experiment includes only two biological replicates per sample, which is inadequate for robust statistical analysis. Second, the authors did not measure the intensity of the gel fragments, making it impossible to quantify allele-specific expression accurately. Third, no control genes were used as standards to ensure the comparability of samples.

      The gold standard for quantifying allele-specific expression is using real-time PCR methods such as TaqMan assays, which allow precise SNP genotyping. To address this major limitation, the authors should ideally repeat the experiments using allele-specific real-time PCR assays. This would provide a reliable and quantitative measurement of allele-specific expression.

      If the authors cannot implement real-time PCR, an alternative (though less rigorous) approach would be to continue using their current method with the following adjustments:

      • Include a housekeeping gene in the analysis as an internal control (this would require identifying a region distinguishable by RFLP in the control).
      • Quantify the intensity of the PCR products on the gel relative to the internal standard, ensuring proper normalization.
      • Increase the sample size to allow for robust statistical analysis. These experiments could be conducted relatively quickly and would significantly enhance the validity of the study's conclusions.

      Minor comments

      While the following comments are not necessarily minor, they can be addressed through revisions to the text without requiring additional experimental work. Some comments are more conceptual in nature, while others concern the interpretation and presentation of the experimental results. They are provided in no particular order. 1. A key limitation of this study is the use of RNA-seq datasets from whole adult flies for interspecies gene expression comparisons. Whole-body RNA-seq inherently averages gene expression across all tissues, potentially masking tissue-specific expression differences. While Acp genes are likely restricted to accessory glands, the non-Acp genes and the random gene sets used in the analysis may have broader expression profiles. As a result, their expression might be conserved in certain tissues while diverging in others- an aspect that whole-body RNA-seq cannot capture. The authors should acknowledge that tissue-specific RNA-seq analyses could provide a more precise understanding of expression divergence and potentially reveal reduced conservation when considering specific tissues independently. 2. The statement in line 128, "Consistent with this model," does not accurately reflect the findings presented in Figures 2A and B. Specifically, the data in Figure 2A show that Acp gene expression divergence is significantly different from the divergence of non-Acp genes or a random sample only in the comparison between D. melanogaster and D. simulans. However, when these species are compared to D. yakuba, Acp gene expression divergence aligns with the divergence patterns of non-Acp genes or random samples. In contrast, Figure 2B shows that the distribution of expression changes is skewed for Acp genes compared to random control samples when D. melanogaster or D. simulans are compared to D. yakuba. However, this skew is absent when the two D. melanogaster and D. simulans are compared. Therefore, the statement in line 128 should be revised to accurately reflect these nuanced results and the trends shown in Figure 2A and B. 3. The statement in lines 136-138, "Acps were enriched for significant expression changes in the faster evolving group across all species," while accurate, overlooks a key observation. This trend was also observed in other groups, including those with slower evolving promoters, in some of the species' comparisons. Therefore, the enrichment is not unique to Acps with rapidly evolving promoters, and this should be explicitly acknowledged in the text. 4. It would be helpful for the authors to explain the meaning of the d score at the beginning of the paragraph starting in line 131, to ensure clarity for readers unfamiliar with this metric. 5. In Figure 2C-E - the title of the Y-axis does not match the text. If it represents the percentage of genes with significant expression changes, as in Figure 2A, the discrepancies between the percentages in this figure and those in Figure 2A need to be addressed. 6. The experiment in Figure 3 needs a better explanation in the text. What is the analysis presented in Figure 3B-D. How many species were compared? 7. The concept of transvection should be omitted from this manuscript. First, the definition provided by the authors is inaccurate. Second, even if additional experiments were to convincingly show that one allele in hybrid animals is dominant over the other, there are alternative explanations for this phenomenon that do not involve transvection. The authors may propose transvection as a potential model in the discussion, but they should do so cautiously and explicitly acknowledge the possibility of other mechanisms. 8. The statement at the end of the introduction is overly strong and would benefit from more cautious phrasing. For instance, it could be reworded as: "These findings suggest that promoter changes, rather than genomic background, play a significant role in driving expression changes, indicating that promoter evolution may contribute to the rise of new species."

      Text edits:

      Throughout the manuscripts there are incomplete sentences and sentences that are not clear. Below is a list of corrections:

      1. Line 32 of the abstract: The term "Acp" is introduced without explaining what it stands for. Please define it as "Accessory gland proteins (Acp)" when it first appears.
      2. Line 61: The phrase "...through relaxed,..." is unclear. Specify what is relaxed (e.g., "relaxed selective pressures").
      3. The sentence in lines 74-76, starting in "Interestingly,...." Needs revision for clarity.
      4. Line 112: Revise "we focused on the melanogaster subgroup which is made up of five species" to: "we focused on the melanogaster subgroup, which includes five species."
      5. In line 144 use the phrase "promoter conservation" instead of "promoter evolution"

      Significance

      This study addresses an important question in evolutionary biology: how seminal fluid proteins achieve rapid evolution despite showing limited adaptive changes in their coding regions. By focusing on accessory gland proteins (Acps) and examining their promoter regions, the authors suggest promoter-driven evolution as a potential mechanism for rapid seminal fluid protein diversification. While this hypothesis is intriguing and can contribute to our understanding of speciation, more rigorous analysis and experimental validation would be needed to support the conclusions. The revised manuscript can be of interest to fly geneticists and to scientists in the fields of gene regulation and evolution.

      Keywords for my expertise: Enhancers, transcriptional regulation, development, evolution, Drosophila.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This manuscript explores promoter evolution of genes encoding seminal fluid proteins expressed in the male accessory gland of Drosophila and finds cis-regulatory changes underlie expression differences between species. Although these genes evolve rapidly it appears that the coding regions rarely show signs of positive selection inferring that changes in their expression and hence promoter sequences can underlie the evolution of their roles within and among species.

      Major comments

      Figure 1 illustrates evidence that the promoter regions of these gene have accumulated more changes than other sampled genes from the Drosophila genome. While this convinces that the region upstream of the transcription start site has diverged considerably in sequence (grey line compared to black line), Figure 1A also suggests the "Genespan" region which includes the 5'UTR but presumably also part of the coding region is also highly diverged. It would be useful to see how the pattern extends into the coding region further to compare further to the promoter region (although Fig 1H does illustrate this more convincingly).

      Figure 2 presents evidence for significant changes in (presumably levels of) expression of male accessory gland protein (AcP) genes and ribosomal proteins genes between pairs of species, which is reflected in the skew of expression compared to randomly selected genes.

      Figure 3 shows detailed analysis for 3 selected AcP genes with significantly diverged expression. The authors claim this shows 'substitution' hotspots in the promoter regions of all 3 genes but this could be better illustrated by extending the plots in B-D further upstream and downstream to compare to these regions.

      Figure 4 shows the results of expression analysis in parental lines of each pair of species and F1 hybrids. However the results are very difficult to follow in the figure and in the relevant text. While the schemes in A, C. E and G are helpful, the gel images are not the best quality and interpretations confusing. An additional scheme is needed to illustrate hypothetical outcomes of trans change, cis change and transvection to help interpret the gels. On line 169 (presumably referring to panels D and F although C and D are cited on the next line) the authors claim that Obp56f and CG11598 'were more expressed in D. melanogaster compared to D. simulans' but in the gel image the D. sim band is stronger for both genes (like D. sechellia) compared to the D. mel band. The authors also claim that the patterns of expression seen in the F1s are dominant for one allele and that this must be because of transvection. I agree this experiment is evidence for cis-regulatory change. However the interpretation that it is caused by transvection needs more explanation/justification and how do the authors rule out that it is not a cis X trans interaction between the species promoter differences and differences in the transcription factors of each species in the F1? Also my understanding is that transvection is relatively rare and yet the authors claim this is the explanation for 2/4 genes tested.

      Minor comments

      Line 112 states that the melanogaster subgroup contains 5 species - this is incorrect - while this study looked at 5 species there are more species in this subgroup such as mauritiana and santomea.

      Lines 131-134 could explain better what the conservation scores and their groupings mean and the rationale for this approach.

      Line 162 - the meaning of the sentence starting on this line is unclear - it sounds very circular.

      Line 168 should cite Fig 4 H instead of F.

      Significance

      This paper is generally well written although some sections would benefit from more explanation. The paper demonstrates cis-regulatory changes between the promoters of orthologs of male accessory gland genes underlie expression differences but that the species differences are not always reflected in hybrids, which the authors interpret as being caused by transvection although there could be other explanations. Overall this provides new insights into the regulation and divergence of these interesting genes. The paper does not explore the consequences of these changes in gene expression although this is discussed to some extent in the Discussion section.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors test the hypothesis that promoters of genes involved in insect accessory glands evolved more rapidly than other genes in the genome. They test this using a number of computational and experimental approaches, looking at different species within the Drosophila melanogaster complex. The authors find an increased amount of sequence divergence in promoters of accessory gland proteins. They show that the expression levels of these proteins are more variable among species than randomly selected proteins. Finally, they show that within interspecific hybrids, each copy of the gene maintains its species-specific expression level.

      The work is done with expected standards of controls and analyses. The claims are supported by the analysis. My main criticism of the manuscript has to do not with the experiments or conclusion themselves but with the presentation. The manuscript is just not very well written, and following the logic of the arguments and results is challenging. The problem begins with the Abstract, which is representative of the general problems with the manuscript. The Abstract begins with general statements about the evolution of seminal fluid proteins, but then jumps to accessory glands and hybrids, without clarifying what taxon is being studied, and what hybrids they are talking about. Then, the acronym Acp is introduced without explanation. The last two sentences of the Abstract are very cumbersome and one has to reread them to understand how they link to the beginning of the Abstract.

      More generally, if this reviewer is to be seen as an "average reader" of the paper, I really struggled through reading it, and did not understand many of the arguments or rationale until the second read-through, after I had already read the bottom line. The paragraph spanning lines 71-83 is another case in point. It is composed of a series of very strongly worded sentences, almost all starting with a modifier (unexpectedly, interestingly, moreover), and supported by citations, but the logical flow doesn't work. Again, reading the paragraph after I knew where the paper was going was clearer, but on a first read, it was just a list of disjointed statements.

      Since most of the citations are from the authors' own work, I suspect they are assuming too much prior understanding on the part of the reader. I am sure that if the authors read through the manuscript again, trying to look through the eyes of an external reader, they will easily be able to improve the flow and readability of the text.

      More specific comments:

      1. In the analysis of expression level differences, it is not clear what specific stage / tissue the levels taken from the literature refer to. Could it be that the source of the data is from a stage or tissue where seminar fluid proteins will be expressed with higher variability in general (not just inter-specifically) and this could be skewing the results? Please add more information on the original source of the data and provide support for their validity for this type of comparison.
      2. The sentence spanning lines 155-157 needs more context.
      3. Line 203-204: What are multi-choice enhancers?
      4. Figure 1: The terminology the authors use, comparing the gene of interest to "Genome" is very confusing. They are not comparing to the entire genome but to all genes in the genome, which is not the same.
      5. Figure 2: Changes between X vs. Y is redundant (either changes between X and Y or changes in X vs. Y).

      Significance

      The manuscript addresses a general question in evolutionary biology - do control regions diverge more quickly protein coding regions. The answer is that yes, they do, but this is actually not very surprising. The work is probably thus of more interest to people interested in the copulatory proteins or in the evolution of mating systems, than to people interested in broader evolutionary questions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Reviewer #1 Evidence, reproducibility and clarity:

        Summary:

      In this manuscript, authors demonstrated the role of ECM-dependent MEK/ERK/MITF signaling pathway that influences the plasticity of MCs (melanocytes) through their interactions with the environment. The findings emphasize the essential role of the extracellular matrix (ECM) in controlling MC function and differentiation, highlighting a critical need for further research to understand the complex interactions between mechanical factors and ECM components in the cellular microenvironment. Overall, the manuscript is concise, written well and shed light on a complex relationship between ECM protein types and substrate stiffness that affects MC mechanosensation. However, understanding detailed molecular mechanisms involved, especially the roles of MITF and other key regulators, is crucial for comprehending MC function and related pathologies. Authors need to clarify some minor queries to be considered for publication.

      We thank this reviewer for the time and caution taken to assess our work. To provide a better understanding of the molecular mechanisms involved in MITF modulation and MC function in response to ECM proteins, we substantially revised the manuscript and now included e.g. bulk RNA sequencing, pharmacological inhibition of FAK and ERK (in addition to MEK inhibition), and MITF depletion.

      Major comments to the Authors:

        • Authors have chosen ERK signaling pathways to test and draw their conclusion based on existing knowledge in the field, as several studies previously reported the role of ECM to modulate the ERK signaling pathway but it would be interesting to test other signaling pathways unbiasedly; e.g. ECM can also regulate Wnt signaling (PMID: 29454361) and connection of MITF and its target gene TYR expression is also regulated by Wnt in context of melanocyte. (PMID: 29454361, PMID: 34878101, PMID: 38020918).*

      The new transcriptome analysis (line 258 ff., revised fig. 5, new fig. 6, new suppl. fig. S5) indeed showed that some components of the Wnt signaling pathway are differentially expressed in response to ECM proteins (new fig. 6B). In comparison, however, the expression of genes involved in MAPK/ERK signaling was more prominently affected by the specific ECM types (new fig. 6C, D), congruent with the biochemical results we presented in the original manuscript. We therefore focused our mechanistic analyses on this pathway, and we consolidated our initial findings with additional pharmacological inhibition experiments. Specifically, like MEK inhibition, ERK inhibition (new fig. 6J-L) increased both MITF nuclear localization and melanin production in MCs exposed to FN, reinforcing the relevance of this pathway in control of MC functions in the model used.

      We agree that an additional contribution of Wnt signaling to ECM-dependent regulation of MC phenotypes is possible, including Mitf and Tyr expression. Next to the new Wnt-related transcriptome data (line 323 ff., new fig. 6B), we therefore now included a short discussion on that aspect (line 478 ff.). However, we feel that a comprehensive comparison of the individual contributions of Wnt vs. ERK signaling is beyond the scope of the current manuscript.

      • Discussion line 340-344. Please provide the data as it is directly connected to the study, and it would be crucial to interpret data better. As FAK is upregulated and FAK inhibitor did not reduce pERK, is there any possibility that other kinases might involve. Please discuss. Again, authors should check Wnt activation as FAK can activate Wnt signaling in response to matrix stiffness as well. (PMID 29454361).*

      We agree with the reviewer that the FAK data required further investigation. In the revised version, we re-examined the potential role of FAK as an upstream regulator of ERK activation using the FAK inhibitor Ifebemtinib, rather than Defactinib as used in our original experiments. Our previous conclusion-that ERK activation was independent of FAK-was likely influenced by limitations associated with Defactinib, which did not properly reduce p-FAK levels despite lowering focal adhesion numbers, accompanied with an increase of ERK phosphorylation alongside a decrease of nuclear MITF levels. In contrast, Ifebemtinib treatment led to a more effective inhibition of FAK, as evidenced by a marked reduction in both p-FAK levels and focal adhesion number (new suppl. fig. 6B,C). Importantly, this was accompanied by a significant decrease in p-ERK levels (new fig. 6M,N), suggesting that FAK contributes to ERK activation in response to ECM molecules in our model. Furthermore, FAK inhibition similar to MEK and ERK inhibition, led to increased melanin production in MCs cultured on FN (new fig. 6O). These new data are now included in the revised version of the manuscript (line 360 ff., new fig. 6M-O, new suppl. fig. 6).

      Nonetheless, this does not exclude the possibility that additional kinases and pathways, including Wnt signaling, may also be involved. We acknowledge this possibility in the revised discussion (lines 478-488).

      • Rationale for selecting MITF for the study is very weak. Please justify in the discussion why authors have chosen to study MITF/ERK axis with a more logistic approach.*

      We have focused central aspects of our analyses on MITF because it is a central regulator of MC differentiation, pigmentation, and survival, and its activity has previously been reported to be modulated by ERK. Considering the observed changes in pigmentation, proliferation, and gene expression in response to distinct ECM molecules, we hypothesized that MITF acts as a key integrator of these ECM-dependent signals. Our data indeed support this rationale: we detected ECM-type-dependent MITF levels and localization, and manipulating the ERK pathway altered MITF activity and associated functional outputs. Moreover, siRNA-mediated downregulation of MITF in MCs cultured on COL I led to a marked reduction in melanin content (revised fig. 4D). Together, these data emphasize that the ERK/MITF axis serves as a pathway that responds to extracellular cues and links these to MC behavior. For clarity, we have included an additional explanation on our rationale in the revised manuscript (lines 146-152).

      • It is suggested to check for the changes in the transcriptomic profile of melanocytes upon culturing on different matrix to get a more comprehensive view associated with the molecular mechanisms involved.*

      We fully agree with the reviewer on the importance of assessing the ECM-dependent transcriptomes of MCs. Therefore, we have now performed RNA sequencing to compare the transcriptomic profiles of MCs cultured on COL IV-, COL I- and FN-coated stiff substrates (line 258 ff. and revised fig. 5, new fig. 6, new suppl. fig. S5). This analysis provided a broader view of the molecular responses of MCs to ECM molecules and complemented our previous molecular and phenotypes analyses. The obtained transcriptomes confirmed significant modulation of genes associated with MC differentiation and pigmentation, as well as genes involved in signaling pathways such as MAPK/ERK and Wnt (see also answers to points 1-3). These findings help contextualize the ECM-dependent phenotypic changes and strengthen the mechanistic insights presented in the study.

      • Please provide the protein expression of genes involved in cell cycle progression and/or apoptosis to support the data in Fig. 3D-E.*

      To support the observations presented in original fig. 3, we employed immunostaining to assess the protein expression of Ki67, which is both a well-established marker and a protein involved in cell cycle progression (PMID: 28630280). In revised figure 3E, a significant reduction in the proportion of Ki67-positive cells on FN compared to COL I was observed, reinforcing our initial findings derived from BrdU incorporation assays and direct microscopic monitoring of cell division (revised fig. 3D,F).

      In addition, global gene expression analysis revealed differentially expressed genes related to cell cycle regulation and apoptosis (revised fig. 5C,D), in line with the reduced proliferation observed. Notably, FN also triggered the differential expression of genes associated with cellular senescence (revised fig. 5E). Together, these data suggest that adhesion to FN induces a transcriptional and phenotypic shift in MCs toward a less-proliferative state that is associated with differential cell cycle modulation and signs of senescence.

      Minor comment to the Authors:

        • Discussion line 358-359, using term synergy is an overstatement as the collective data do not support the claim. Very little role of matrix stiffness is demonstrated by experimental data.*

      We thank the reviewer for this comment and agree that the term "synergy" may overstate the conclusions drawn from the current dataset. We have therefore removed this term from the revised version of the manuscript to more accurately reflect the data.

      • Method section, BrdU assay and BrdU assay-cell proliferation can be combined in method section.*

      We have combined the descriptions of the BrdU assay and BrdU-based cell proliferation assay into a single, unified section in the Methods.

      • What trigger melanocytes to respond to different microenvironment. Please discuss.*

      To address this question, we have added the following paragraph to the Discussion (lines 377-380): "Our study identifies ECM components as critical environmental triggers that instruct MC behavior. Through dynamic interactions with the ECM, MCs engage adhesion-dependent signaling pathways, such as FAK activation, enabling them to decode contextual ECM inputs and adapt their phenotype accordingly."

      • Fig 3C and 5D Tyr mRNA expression is tested. Authors should also test for the protein expression in the similar set of studies.*

      We thank the reviewer for this suggestion and agree that assessing TYR protein expression would be valuable. However, we have encountered difficulties with the currently available antibodies and detection methods, which in our hands appeared unreliable for consistently detecting endogenous TYR protein levels in MCs under these conditions. For this reason, we relied on Tyr mRNA expression as a robust and reproducible readout and complemented this with functional assays such as melanin content measurement as a read-out that indirectly reflects TYR enzymatic activity. Of note, our transcriptomic analysis also revealed Tyr and other melanogenesis genes as differentially expressed genes when comparing MCs grown on COLI vs FN (revised fig. 5A,B).

      • Line 217-218, Authors claim stiffness mediated increase of MITF nuclear localization in Col I, however Fig. 4A-B does not represent that claim. Please justify.*

      Fig. 4A shows representative images of MCs cultured on stiff substrates coated with different ECM types, while the original figure 4B included the comparison across substrate stiffnesses for each ECM condition. We have now generated additional datasets to assess global MITF levels as well as nuclear localization across stiffness conditions in the presence of the different ECM types, demonstrating that nuclear MITF is significantly higher in cells cultured on stiff vs. soft or intermediate stiffness (revised fig. 4B,C). Of note, we do not detect a significant difference between soft and intermediate substrate stiffness, which could hint to a threshold of MITF dynamics in stiffness sensitivity. We have updated the figure legend and corresponding text to ensure the data presentation accurately supports our conclusions.

      Significance:

      Overall, the study is well-planned, the experiments are well-designed and executed with appropriate use of statistical analysis. However, a more in-depth analysis of the molecular mechanisms is necessary to clarify how the extracellular matrix (ECM) regulates ERK or MITF nuclear translocation.

      We agree and feel that the additional data in the revised manuscript that explored transcriptional changes and the FAK/MEK/ERK/MITF axis in response to ECM proteins provide improved insights into ECM-mediated regulation of ERK and MC pigmentation.

      This study enhances our existing knowledge by linking the well-established role of the extracellular matrix (ECM) in regulating ERK signaling to ERK's involvement in controlling MITF, a key regulator of melanocyte differentiation. It further establishes the ECM's role in controlling melanocyte function and differentiation.

      This study will interest readers working in the field of the tumor microenvironment, as it explores the role of the extracellular matrix and its complexity and stiffness in disease progression, not only in melanoma but also in other types of cancer.

      1. Reviewer #2 Evidence, reproducibility and clarity:

      Summary:

      In their manuscript, Luthold et al describe the behaviour of immortalized mouse melanocytes cultured on various extracellular matrix (ECM) proteins and substrates of different stiffness. They found that fibronectin, collagen IV and collagen I have different effects on melanocyte morphology, migration, and proliferation. They further link these differential effects to MITF localization and MEK/ERK signalling. This work shows that fibronectin supports melanocyte migration, which was associated with a dendritic morphology and correlated with increased MEK/ERK signalling and decreased MITF nuclear localization. In contrast, collagen I promoted melanocyte proliferation with low MEK/ERK signalling, enhanced MITF nuclear localization and high melanin production.

      While this study is well designed and the data adequately presented and interpreted, the impact of its conclusions is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions. To strengthen their manuscript, the authors should consider the following comments:

      We also wish to thank this reviewer for the efforts made to assess our work and help us improve the study. We substantially revised the manuscript and now included e.g. bulk RNA sequencing and various loss-of-function approaches to better delineate the signaling pathways involved in ECM-dependent control of MCs.

      Major comments to the Authors:

        • Characterization of observed phenotypes:*
      • *The link between matrix-sensing and intracellular signalling is missing. Which types of integrins are expressed by iMCs? *

      This is indeed an interesting point. Our RNA sequencing analysis indicates that MCs express integrins known to mediate adhesion to COL I and FN, including Itga2, Itga3, Itga5, Itgav, Itgb1, and Itgb3 (revised fig. 5K). Importantly, the expression of these integrins remains relatively consistent across ECM conditions (COL I, COL IV, and FN), suggesting that the phenotypic differences observed may not be directly explained by variations in integrin expression.

      • Are any of these integrins required for the observed phenotypes?

      To assess a functional involvement, we conducted a pilot experiment blocking β1-integrin in MCs seeded on COL I and observed a marked reduction in MC adhesion (see associated graph 1, provided to this reviewer). However, the compromised cell spreading and resulting widespread detachment introduced confounding effects, making it difficult to interpret downstream events such as MITF nuclear localization. Since such readouts can be indirectly influenced by the overall adhesion state and associated signaling pathways such as FAK, we chose not to pursue further mechanistic analysis using this approach. Targeted strategies (e.g., inducible knockdown, acute protein degradation) will be needed in the future to dissect the precise role of individual integrins in mediating ECM-specific signaling responses in MCs.

      Graph 1: Effect of β1-integrin blocking on MC adhesion. iMCs were detached using PBS-EDTA (10 min, 37 {degree sign}C) and incubated for 15 min on ice with either 10 μg/mL β1-integrin-blocking antibody (CD29, clone TS2/16; Invitrogen, #AB_1210468) or 10 μg/mL IgG isotype control. Cells (5,000 per well) were then seeded on COL I-coated substrates. After 1 h, non-adherent cells were gently washed off with PBS, and adherent cells were fixed with 4% PFA. Cell adhesion was quantified by counting the number of attached cells per µm² under a microscope.

      • The phenotypic changes described here are interesting but only partially analysed. Transcriptomic studies would yield a more complete view of cell state transitions (optional). At a minimum, could the authors detect any changes in cadherin expression, or in other genes classically involved in phenotype switching, such as twist1, snail or zeb1?

      We thank the reviewer for this important suggestion, which helped to improve this manuscript. We have now performed bulk RNA sequencing to analyze global gene expression changes in MCs cultured on different ECM substrates (revised fig. 5, new suppl. fig. 5). Among these, we explored gene expression programs associated with MC plasticity and differentiation (revised fig. 5F-H): MCs cultured on FN exhibited reduced expression of melanocytic differentiation markers and upregulation of genes linked to plasticity, dedifferentiation, and neural crest-like features, suggesting a shift toward a less differentiated state, reflecting aspects of a phenotypic switch.

      Nonetheless, as part of this analysis (but not included in the manuscript), we found that Zeb2, Snai2, and Zeb1 were expressed at similar levels across ECM conditions. Similarly, among the cadherins, Cdh1 and Cdh2 were not differentially expressed, albeit the overall low expression of Cdh1 showed a trend towards a reduction on COL I. Finally, Snai1, Twist1, and Twist2 were detected at very low levels and not significantly regulated as well. These data suggest that, at the chosen experimental conditions, while a clear adaptive phenotypic cell plasticity is observed, classical EMT-like programs are not prominently activated. However, we cannot exclude the possibility that longer culture durations or additional cues could induce such transitions.

      • Lines 235-236, the authors write that ECM proteins regulate melanocyte behaviour "likely through modulation of MITF localization and activity". Could the authors support the role of MITF experimentally? Genetic experiments using different MITF mutants could address this question.

      To experimentally support the role of MITF, we now performed melanin assays following siRNA-mediated knockdown of MITF in MCs grown on COL I or FN. On COL I-coated substrates, MITF depletion led to a marked reduction in melanin content, supporting the conclusion that ECM-dependent regulation of pigmentation in our culture model involves MITF activity. These findings are now included in the revised manuscript (lines 244-245, revised fig. 4D, new fig. S4B).

      • *Additionally, how does MEK/ERK signalling control MITF activity in these melanocytes? The trametinib experiment should be consolidated with other inhibitors (including ERK inhibitors) and/or genetic manipulation. *

      To address this comment, we complemented our former Trametinib experiments with ERK inhibition using Ravoxertinib (new fig. 6J-L). ERK inhibition led to increased nuclear localization of MITF and elevated melanin production, supporting the involvement of MEK/ERK in restraining MITF activity in MCs in response to ECM molecules. These new data are now included in the revised manuscript (line 354 ff. and new fig. 6J-L).

      • Did the authors also measure the effect of trametinib on cell proliferation in Figure 5?

      Overall, compared to the observed pronounced phenotypes like ECM-dependent cell morphology, melanin production and others, the differences in cell proliferation of MCs grown at different ECM conditions were statistically significant but not very large. We therefore refrained from additionally assessing the effect of trametinib on the observed ECM-dependent MC behaviour. Given the well-established role of ERK signaling in promoting cell proliferation, we indeed expect that MEK inhibition can reduce MC proliferation in our system, though it remains open whether there is an ECM-specific aspect to this.

      • Parallels with physiological conditions:*
      • *Most experiments shown were performed with immortalized melanocytes even though authors mention the use of primary cells (pMCs, line 148). Were similar results obtained in primary melanocytes? Do human melanocytes in culture behave similarly? *

      While we have not assessed human MCs, original fig. S2 (__revised fig. S3) __provides data using primary murine MCs (freshly isolated from newborn mice), confirming a similar behavior of primary cells compared to immortalized MCs in terms of cell area, p-FAK levels, number of FAs, melanin production, and MITF nuclear localization.

      • Are some of these observations also true in vivo, for example in mouse skin (optional)?

      The current manuscript focuses on the behavior of MCs in culture, as it was important to use a reductionist model system that can uncouple the effect of distinct ECM types as well as substrate stiffness. However, as a perspective and beyond the scope of this manuscript, we indeed plan to translate our in vitro findings to mouse skin, taking different biophysical and biochemical cues into account. Data from the present in vitro study provides valuable insights into which parameters and which anatomical areas to study in vivo.

      • How do the authors reconciliate their findings that collagen IV induced melanocyte migration and decreased proliferation and melanin production with the fact that melanocytes in human skin are generally in contact with the collagen IV-rich basement membrane?

      We indeed regarded the use of collagen IV (COL IV) as a physiological reference condition, and considered MC migration, proliferation, and melanin production on COL as baseline levels. Relative to COL IV, COL I reduced migration and increased melanin production, while FN led to increased migration, and a decrease of proliferation and melanin production. This suggests that ECM composition can selectively modulate distinct aspects of MC behavior compared to attachment to COL IV. The intermediate state observed on COL IV would be in line with a model in which this abundant basement membrane molecule enables MCs to maintain high flexibility in their phenotype, e.g. to further increase melanin production upon external stimuli other than ECM (UV, inflammation etc.). The perhaps unexpected, opposing response of MCs to FN and COL I, respectively, opens the possibility that under specific (patho)physiological conditions, the then abundant ECM can direct MC behaviour. Both plasma- and cellular-derived FN is deposited upon skin injury and instructs various cell types to promote skin repair. Taking our observations in vitro into account, it is tempting to speculate that this FN-enriched tissue enables MCs to quickly migrate into wound sites to re-establish protection to UV. Conversely, increased COL I levels-as observed in fibrotic conditions such as scleroderma-might favor a more differentiated, pigment-producing phenotype. Interestingly, cases of localized hyperpigmentation have been reported in scleroderma patients, possibly reflecting such matrix-driven MC reprogramming. Though requiring further investigation, these observations open new avenues to explore how dynamic changes in ECM composition contribute to MC behavior in tissue homeostasis and repair.

      We now extended our original discussion to better emphasize the physiological relevance of our findings (lines 383-391) and hypothesize how ECM remodeling may contribute to the dynamic regulation of MC plasticity-not only during tissue homeostasis, but also in response to injury and in fibrotic conditions such as scleroderma (lines 393-406).

      Minor comments to the Authors:

      The evidence that FAK is not responsible for MEK/ERK activation could be presented in the main text rather than in the discussion.

      We thank the reviewer for highlighting this important point. Our initial conclusion-that ERK activation was independent of FAK-likely stemmed from limitations of the previously used FAK inhibitor (Defactinib). In those earlier experiments, while FAK inhibition reduced focal adhesion numbers, p-FAK levels were not properly decreased, and paradoxically, ERK phosphorylation increased alongside decreased nuclear MITF levels. Based on this initial discrepancy and because of this reviewer's comment, we performed additional experiments using another selective FAK inhibitor, Ifebemtinib, which achieved an effective reduction in both p-FAK levels and focal adhesion number (new suppl. fig. S6B, C). In the revised version, we present new experiments using Ifebemtinib, demonstrating that FAK inhibition in fact does reduce p-ERK levels (new fig. 6M-N), thus supporting the notion that FAK contributes to ECM-dependent ERK pathway activation in our model. These findings are now shown in the results section (lines 357-364).

      Significance:

      General assessment: This study establishes the cellular impact of different types of extracellular matrix proteins and stiffness conditions relevant to skin biology on the behaviour of untransformed mouse melanocytes. In particular, it shows opposite effects of fibronectin and collagen I on cell proliferation and migration, which could prove relevant to certain skin conditions in human. However, the scope of these results is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions.

      Advance: The systematic comparison of different microenvironmental conditions on normal melanocyte behaviour is novel and opens perspectives to understand the role of melanocytes in some human skin pathologies.

      Audience: The comparison of different environmental conditions on melanocyte behaviour is of interest to the melanocyte biology community and could have implications for basic and clinical understanding of some skin diseases.

      My expertise is in melanoma biology, including the impact of the microenvironment on tumour cell behaviours.

      1. Reviewer #3 Evidence, reproducibility and clarity:

      In this manuscript Luthold et al. describe how extracelluar matrix proteins and mechanosensation affect melanocyte differentiation. In particular, they show that ECM proteins and surface stiffness lead to effects on the MEK/ERK pathway, thus affecting the MITF transcription factor. The manuscript is interesting, well written and the data presented in a clear and easy-to-follow manner. The data are nicely quantitated and largely convincing.

        • However, the discussion of the nuclear location of MITF (Figure 4A) is not convincing. The images presented show that upon exposure to ColI, there is a lot of MITF in the nucleus, a lot less so upon ColIV and none upon FN exposure. However, we only see a snapshot of the cells and thus we do not know if we are witnessing effects on MITF protein synthesis, degradation or nuclear localization (the least likely scenario since M-MITF, the isoform present in melanocytes is predominantly nuclear anyway). Was there a cytoplasmic signal detected? Upon FN treatment, there is no MITF protein visible in the cells. Does this mean that the protein is not made, that it is degraded or present at such low levels that the antibody does not detect it? The claim of the authors that this affects nuclear localization of MITF needs more corroboration. *

      We thank the reviewer for raising this important point regarding the interpretation of MITF localization. We agree that the data as represented in the original figure 4 cannot distinguish whether changes reflect differences in MITF expression, stability, or subcellular distribution.

      To better address this, we now included a quantitative analysis of both nuclear and cytoplasmic MITF signals (revised fig. 4B). These data show that MITF is detectable in both compartments at all conditions tested. While total MITF levels were not reduced on FN, nuclear MITF was markedly decreased and cytoplasmic MITF was even increased compared to COL I. This indicates that the reduced nuclear signal on FN compared to COL I is not due to an overall loss of MITF protein but rather reflects a shift in its subcellular distribution. These findings support the idea that ECM composition influences MITF localization, consistent with functional changes in its activity and with the observed phenotypic changes.

      • Also, the authors need to show immunocytochemical images for the effects on MITF nuclear localization for the images presented in Figure 5C. *

      As requested, we now provide representative micrographs illustrating the effects on MITF nuclear localization corresponding to the conditions shown in Fig. 5C. These images have been included in the revised version of the manuscript (new fig. 6G), further supporting the quantitative data presented.

      • It seems that the authors quantitated immune-reactivity for both MITF and YAP. What was the control and how was the data normalized? *

      MITF and YAP immunodetection were performed in separate experiments and were not analyzed in the same cells. For both stainings, secondary antibody controls were included (secondary antibody alone without primary antibody), which showed no detectable signal. For MITF and YAP quantifications (revised fig. 4B,F), nuclear (for both) and cytoplasmic (for MITF) intensity values were normalized within each independent experiment by dividing each individual measurement by the mean nuclear intensity across all conditions. This approach allowed us to deal with total signal variability between experiments while preserving relative differences between ECM conditions. For the percentage of nuclear MITF no normalization was applied. We have added this description to the revised methods section.

      • Similarly, the blots and data shown in Figure 5 are not consistent with the text as described in the results section. The differences observed are minor and the only set that is likely to be significant is the FN-set; the differences between soft, intermediate and stiff of the FN-set do not look significantly different. The description of this in the results section should be toned down accordingly.*

      To strengthen the conclusions drawn from the original Fig. 5 (now fig. 6), we performed additional immunoblot experiments to increase the number of replicates. These extended results now show a statistically significant increase in pERK levels in MCs cultured on FN compared to COL I. However, consistent with the reviewer's observation, no significant differences were detected across the stiffness conditions within FN. We have revised the Results section accordingly to tone down the interpretation and to better reflect the revised data (revised fig. 6E, lines 339-355).

      Significance:

      Upon improvement, this paper will provide an early characterization of the effects of the ECM on melanocyte differentiation. If the link to MITF holds, this will be the first time that mechanosensation has been shown to mediate effects on this transcription factor.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript Luthold et al. describe how extracelluar matrix proteins and mechanosensation affect melanocyte differentiation. In particular, they show that ECM proteins and surface stiffness lead to effects on the MEK/ERK pathway, thus affecting the MITF transcription factor. The manuscript is interesting, well written and the data presented in a clear and easy-to-follow manner. The data are nicely quantitated and largely convincing. However, the discussion of the nuclear location of MITF (Figure 4A) is not convincing. The images presented show that upon exposure to ColI, there is a lot of MITF in the nucleus, a lot less so upon ColIV and none upon FN exposure. However, we only see a snapshot of the cells and thus we do not know if we are witnessing effects on MITF protein synthesis, degradation or nuclear localization (the least likely scenario since M-MITF, the isoform present in melanocytes is predominantly nuclear anyway). Was there a cytoplasmic signal detected? Upon FN treatment, there is no MITF protein visible in the cells. Does this mean that the protein is not made, that it is degraded or present at such low levels that the antibody does not detect it? The claim of the authors that this affects nuclear localization of MITF needs more corroboration. Also, the authors need to show immunocytochemical images for the effects on MITF nuclear localization for the images presented in Figure 5C. It seems that the authors quantitated immune-reactivity for both MITF and YAP. What was the control and how was the data normalized? Similarly, the blots and data shown in Figure 5 are not consistent with the text as described in the results section. The differences observed are minor and the only set that is likely to be significant is the FN-set; the differences between soft, intermediate and stiff of the FN-set do not look significantly different. The description of this in the results section should be toned down accordingly.

      Significance

      Upon improvement, this paper will provide an early characterization of the effects of the ECM on melanocyte differentiation. If the link to MITF holds, this will be the first time that mechanosensation has been shown to mediate effects on this transcription factor.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In their manuscript, Luthold et al describe the behaviour of immortalized mouse melanocytes cultured on various extracellular matrix (ECM) proteins and substrates of different stiffness. They found that fibronectin, collagen IV and collagen I have different effects on melanocyte morphology, migration, and proliferation. They further link these differential effects to MITF localization and MEK/ERK signalling. This work shows that fibronectin supports melanocyte migration, which was associated with a dendritic morphology and correlated with increased MEK/ERK signalling and decreased MITF nuclear localization. In contrast, collagen I promoted melanocyte proliferation with low MEK/ERK signalling, enhanced MITF nuclear localization and high melanin production.

      While this study is well designed and the data adequately presented and interpreted, the impact of its conclusions is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions. To strengthen their manuscript, the authors should consider the following comments:

      Major comments

      1. Characterization of observed phenotypes: The link between matrix-sensing and intracellular signalling is missing. Which types of integrins are expressed by iMCs? Are any of these integrins required for the observed phenotypes? The phenotypic changes described here are interesting but only partially analysed. Transcriptomic studies would yield a more complete view of cell state transitions (optional). At a minimum, could the authors detect any changes in cadherin expression, or in other genes classically involved in phenotype switching, such as twist1, snail or zeb1? Lines 235-236, the authors write that ECM proteins regulate melanocyte behaviour "likely through modulation of MITF localization and activity". Could the authors support the role of MITF experimentally? Genetic experiments using different MITF mutants could address this question. Additionally, how does MEK/ERK signalling control MITF activity in these melanocytes? The trametinib experiment should be consolidated with other inhibitors (including ERK inhibitors) and/or genetic manipulation. Did the authors also measure the effect of trametinib on cell proliferation in Figure 5?
      2. Parallels with physiological conditions: Most experiments shown were performed with immortalized melanocytes even though authors mention the use of primary cells (pMCs, line 148). Were similar results obtained in primary melanocytes? Do human melanocytes in culture behave similarly? Are some of these observations also true in vivo, for example in mouse skin (optional)? How do the authors reconciliate their findings that collagen IV induced melanocyte migration and decreased proliferation and melanin production with the fact that melanocytes in human skin are generally in contact with the collagen IV-rich basement membrane?

      Minor comment

      The evidence that FAK is not responsible for MEK/ERK activation could be presented in the main text rather than in the discussion.

      Significance

      General assessment: This study establishes the cellular impact of different types of extracellular matrix proteins and stiffness conditions relevant to skin biology on the behaviour of untransformed mouse melanocytes. In particular, it shows opposite effects of fibronectin and collagen I on cell proliferation and migration, which could prove relevant to certain skin conditions in human. However, the scope of these results is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions.

      Advance: The systematic comparison of different microenvironmental conditions on normal melanocyte behaviour is novel and opens perspectives to understand the role of melanocytes in some human skin pathologies.

      Audience: The comparison of different environmental conditions on melanocyte behaviour is of interest to the melanocyte biology community and could have implications for basic and clinical understanding of some skin diseases.

      My expertise is in melanoma biology, including the impact of the microenvironment on tumour cell behaviours.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, authors demonstrated the role of ECM-dependent MEK/ERK/MITF signaling pathway that influences the plasticity of MCs (melanocytes) through their interactions with the environment. The findings emphasize the essential role of the extracellular matrix (ECM) in controlling MC function and differentiation, highlighting a critical need for further research to understand the complex interactions between mechanical factors and ECM components in the cellular microenvironment. Overall, the manuscript is concise, written well and shed light on a complex relationship between ECM protein types and substrate stiffness that affects MC mechanosensation. However, understanding detailed molecular mechanisms involved, especially the roles of MITF and other key regulators, is crucial for comprehending MC function and related pathologies. Authors needs to clarify some minor queries to be considered for publication.

      Major comment:

      1. Authors have chosen ERK signaling pathways to test and draw their conclusion based on existing knowledge in the field, as several studies previously reported the role of ECM to modulate the ERK signaling pathway but it would be interesting to test other signaling pathways unbiasedly; e.g. ECM can also regulate Wnt signaling (PMID: 29454361) and connection of MITF and its target gene TYR expression is also regulated by Wnt in context of melanocyte. (PMID: 29454361, PMID: 34878101, PMID: 38020918).
      2. Discussion line 340-344. Please provide the data as it is directly connected to the study, and it would be crucial to interpret data better. As FAK is upregulated and FAK inhibitor did not reduce pERK, is there any possibility that other kinases might involve. Please discuss. Again, authors should check Wnt activation as FAK can activate Wnt signaling in response to matrix stiffness as well. (PMID 29454361).
      3. Rationale for selecting MITF for the study is very weak. Please justify in the discussion why authors have chosen to study MITF/ERK axis with a more logistic approach.
      4. It is suggested to check for the changes in the transcriptomic profile of melanocytes upon culturing on different matrix to get a more comprehensive view associated with the molecular mechanisms involved.
      5. Please provide the protein expression of genes involved in cell cycle progression and/or apoptosis to support the data in Fig. 3D-E.

      Minor comment:

      1. Discussion line 358-359, using term synergy is an overstatement as the collective data do not support the claim. Very little role of matrix stiffness is demonstrated by experimental data.
      2. Method section, BrdU assay and BrdU assay-cell proliferation can be combined in method section.
      3. What trigger melanocytes to respond to different microenvironment. Please discuss.
      4. Fig 3C and 5D Tyr mRNA expression is tested. Authors should also test for the protein expression in the similar set of studies.
      5. Line 217-218, Authors claim stiffness mediated increase of MITF nuclear localization in Col I, however Fig. 4A-B does not represent that claim. Please justify.

      Significance

      Overall, the study is well-planned, the experiments are well-designed and executed with appropriate use of statistical analysis. However, a more in-depth analysis of the molecular mechanisms is necessary to clarify how the extracellular matrix (ECM) regulates ERK or MITF nuclear translocation.

      This study enhances our existing knowledge by linking the well-established role of the extracellular matrix (ECM) in regulating ERK signaling to ERK's involvement in controlling MITF, a key regulator of melanocyte differentiation. It further establishes the ECM's role in controlling melanocyte function and differentiation.

      This study will interest readers working in the field of the tumor microenvironment, as it explores the role of the extracellular matrix and its complexity and stiffness in disease progression, not only in melanoma but also in other types of cancer.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study aims to classify prognostic and subtype-specific eRNAs in breast cancer, highlighting their potential as biomarkers. Data was analysed using existing machine learning algorithms, Data analysis is superficial and it is hard to understand the key significant findings

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers. They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

      Weaknesses include: Most of the findings are purely correlation-based and also based on a reanalysis of published datasets; it would benefit from experimental validation to support their findings. Differential expression analysis of large datasets likely yields some differences in the transcriptome. How significant are these changes? Does the expression of eRNAs affect the expression of genes in cis? Although this analysis would provide some associated gene expression differences, it can also provide some insights into subtype-specific differences in gene expression programs. If the authors find experimental validations are not feasible, I recommend validating the eRNA signature in an independent dataset.

      Here are major points; addressing these points in the revised version is important.

      From Figure 1B, what eRNAs were identified for LumB using log2MC?

      Page 8 However, sensitivity and F-measure .... It would help to include the metrics for the number of patients in each subtype. The ratio of eRNAs/number of cases in each subtype would inform if the number of eRNAs is an outcome of no. of cases or subgroup-specific.

      Page 9 "Altogether, both measurements classify eRNAs efficiently based on subtypes, InfoGain allowed us to distinguish further samples based on high and low expression of eRNAs for basal subtype and performed better in statistical metrics" Based on statistical metrics, both models seem to be performing similarly except for Her2. In Fig. 1B, the F-measure metrics are wrong for basal LogMC, as it is 0.94 rather than 0.54, which could lead to a misinterpretation of the model.

      Many genome browser figures, including Figure S3. TFBS is not at the same site as eRNAs detected. Is there CAGE data to show that binding these TFs at these sites leads to the expression of eRNAs? That will give direct evidence that the eRNAs are transcribed due to these TFs

      Page 10, There were 30 Her2-specific eRNA regions.... Do the same enhancers also regulate these genes as those from which eRNAs are transcribed? Is it cis-effect, or could these affect the trans-regulating of other genes?

      Minor comments:

      Page 8 "InfoGain meausure..." Fig. S2A also shows high and low expressed eRNAs for the basal group

      Page 11, Our analyses also identified the role of another..... The statement is misleading as it is the enrichment of these TFs with the eRNAs

      Page 13, "Around 90% of eRNAs are bidirectional and non-polyadenylated [53]. TCGA expression datasets are based on RNA-seq assays, which capture only non-polyadenylated RNAs. Thus, analysing the expression of eRNAs on mRNA-seq datasets might not be adequate". It is very confusing, please check

      Significance

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers. They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Enhancer RNAs (eRNAs) are early indicators of transcription factor (TF) activity and can identify distinct molecular subtypes and pathological outcomes in breast cancer. In this study, Patel et al. analysed 302,951 polyadenylated eRNA loci from 1,095 breast cancer patients using RNA-seq data, applying machine learning (ML) to classify eRNAs associated with specific molecular subtypes and survival. They discovered subtype-specific eRNAs that implicate both established and novel regulatory pathways and TFs, as well as prognostic eRNAs -specifically, LumA and HER2-survival- that distinguish favorable from poor survival outcomes. Overall, this ML-based approach illustrates how eRNAs reveal the molecular grammar and pathological implications underlying breast cancer heterogeneity.

      Major comments

      1. The authors define 302,951 eRNA loci based on RNA-seq data, yet it is widely known that many enhancers reside in proximity to promoters or within intronic regions (examples presented in Fig. 3B and S3). Consequently, it seems likely that reads mapped to these regions might not truly represent eRNA signals but include mRNA contamination. Could the authors clarify how they ensured that the identified eRNAs were not confounded by mRNA reads? What fraction of these enhancer loci is promoter proximal or intronic? How does H3K4me3, a well-established and standardized active promoter histone mark, behave on these loci? The reviewer considers it important to confirm that the identified eRNAs are indeed of enhancer origin rather than promoter transcripts.
      2. In Fig. 1B, the F measure (0.540) of the Basal subtype using the Logmc method contradicts its extremely high precision (1.000) and sensitivity (0.890). The authors need to clarify the exact formula or method used to compute F1 and the discrepancy in the reported metrics for this subtype and perhaps other subtypes as well.
      3. As shown in Fig. 4C, S4B, and most, if not all, tracks of Fig. S3, ER binding regions are not annotated as eRNA loci. It seems, in this reviewer's opinion, very unlikely that this is because they generally lack eRNA expression, but rather they do not express polyadenylated eRNA (typically 1D eRNA), which is captured in this dataset. The reviewer posits that these enhancers produce more transient, non-polyadenylated 2D eRNA. It has been widely documented in prior studies that ER-bound enhancers exhibit bimodal eRNA expression patterns [e.g., Li, W. et al. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 498, 516-520 (2013)]. Could the authors address this opinion and elaborate on how the restriction to polyadenylated transcripts might underrepresent enhancers regulated by ER and other TFs and whether this bias impacts the overall findings?
      4. Despite the unsatisfied performance of the ML approach on classifying Her2 subtypes, the hierarchical clustering performed in Fig. 2A and S2A appears to show a reasonable separation of Her2 subtypes, showing as a clustered green band. Could the authors quantitatively assess how effective this clustering results and compare that to the ML outcome? (OPTIONAL)
      5. In Fig. 4 and S4, the authors reported to have enriched binding or motif of TFs, e.g., FOXA1, AP-2, and E2A, specifically at enhancer loci with low eRNA level, which conflicts with their established roles as transcriptional activators. The reviewer asks for an address as to why these factors would be associated with basal low-eRNA regions and whether any additional data might clarify their functional role in these contexts.
      6. Regarding Fig. 4B, the authors state that "ER binding occupies only the strongest ssDNA and GRO-seq-positive sites". Firstly, the GRO-seq data quality is poor with indiscernible peaks. This may be insufficient for a qualified representation of nascent eRNA expression. More importantly, it appears each heatmap is ranked independently, so top loci for ssDNA are not necessarily top loci for GRO-seq, ER, Pol-II, or H3K27ac. The reviewer requests clarification on how the authors plot these heatmaps and questions whether the statement is supported by the analysis as presented.
      7. In Fig. S4B and the third plot of 4C, the averaged histogram of ER binding appears in multiple sharp peaks with drastic asymmetric positioning around the enhancer centre, which is highly atypical of most published ER ChIP-seq profiles. Could the authors discuss possible "spatial syntax" or directional patterns of ER binding in relation to eRNA loci and cite any literature showing a similar pattern? Further evidence is required to substantiate these observations, as they are remarkably unique.

      Minor comments

      1. When introducing eRNAs, the reviewer recommends mentioning that 1) eRNA levels correlate with enhancer activity and 2) eRNA expression precedes target gene transcription, thus reflecting upstream regulatory events. Relevant references include: Arner, E. et al. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science 347, 1010-1014 (2015); Carullo, N. V. N. et al. Enhancer RNAs predict enhancer-gene regulatory links and are critical for enhancer function in neuronal systems. Nucleic Acids Res. 48, 9550-9570 (2020); Kaikkonen, Minna U. et al. Remodeling of the Enhancer Landscape during Macrophage Activation Is Coupled to Enhancer Transcription. Mol. Cell 51, 310-325 (2013).
      2. H3K27ac is used initially to define these regulatory loci, and like eRNAs, H3K27ac also varies among patients. Which H3K27ac dataset(s) were used initially, and could this approach potentially overlook patient-specific enhancers? (OPTIONAL)
      3. In addition to the overall metrics displayed in Fig. 2B, could the authors provide precision and sensitivity values for LumA and LumB separately under the Logmc method, given the observation in Fig. 2E that LumA and LumB are not well separated in the UMAP projection?
      4. Could the author elaborate, in the discussion session, on why there is a substantial difference in ML performance depending on whether InfoGain or Logmc is used?
      5. How does the expression pattern of Basal high, Basal low, Her2, and Lum eRNA clusters behave differentially in Basal, Her2, and LumA/B subtypes? Are Basal high eRNAs downregulated in Her2 or Lum subtypes, and vice versa? Since many downstream analyses rely on these eRNA clusters, it is suggested to include a heatmap and/or boxplot that displays how each eRNA category is expressed in each subtype to confirm that these definitions are consistent.

      Referee cross-commenting

      I share Reviewer #1's opinion that the manuscript should assess whether mRNA or eRNA is the stronger predictor of breast cancer subtypes and clinical outcomes. It will greatly improve the novelty if eRNA is shown to be a better indicator for cancer characterization.

      Also, I strongly concur with Reviewer #3 that the current informatics approach is superficial and that several conclusions are contentious. The authors need to resolve the inconsistencies in their ML statistics and the potentially misleading interpretations of the ChIP‑seq and motif‑enrichment results.

      It is further recommended that, building Reviewer #3's comment, the study integrate eRNA signatures with their proximal genes to address 1) whether genes located near these enhancers are differentially expressed-and correlated with enhancer activity-across cancer subtypes, and 2) whether it provides insights into understanding the enhancer-gene regulatory architecture in a subtype-specific context.

      Significance

      General Assessment

      This study provides insights into the potential use of eRNA to classify breast cancer subtypes and refine prognostic markers. A strength is the integration of large-scale RNA-seq data with machine learning to identify eRNA signatures in biologically-meaningful patient samples, revealing both established and novel TF networks. The study also discovered eRNA clusters that correlate with the survival of patients, thus providing strong clinical implications. However, the ML approach yields several inconsistencies-for instance, unsatisfactory classification results for the Her2 subtype as well as the confused statistical metrics in the results. Furthermore, the ML model struggles to differentiate more nuanced molecular classes (e.g., LumA vs. LumB) and higher-level histological subtypes (e.g., lobular vs. ductal), thus limiting its power to dissect more delicate pathological and molecular mechanisms. Another limitation worth noting of this ML approach is the exclusive use of only polyadenylated eRNAs via RNA-seq, which excludes perhaps the more prominent 2D eRNA expressed in regulatory enhancers. Moreover, certain datasets appear to be of suboptimal quality, leading to assertions that would benefit from additional supporting evidence. Altogether, while the study offers a promising angle on eRNA-based tumor stratification, more robust experimental validations are needed to resolve inconsistencies and clarify the mechanistic underpinnings.

      Advance

      Conceptually, the study highlights the potential for eRNA-based signatures to capture regulatory variation beyond classical markers. However, the utility of these signatures is constrained by the focus on polyadenylated transcripts alone, likely underrepresenting key enhancer regions, and certain evidence presented in this study is not substantial enough to support some statements. While the work adds an important dimension to the understanding of enhancer biology in breast cancer, the resulting insights are partly hampered by limitations in data coverage and quality.

      Audience

      The primary audience includes cancer epigenetics, functional genomics, and bioinformatics researchers who are interested in leveraging eRNAs as biomarkers and dissecting complex regulatory networks in breast cancer. Clinically oriented scientists focusing on molecular diagnostics may also find relevance in the authors' approach to stratify subtypes and outcomes. The research is most relevant to a specialized audience within basic and translational cancer genomics, as well as computational biology groups interested in eRNA analysis.

      Field of Expertise

      I evaluate this manuscript as a researcher specializing in cancer epigenetics, functional genomics, and NGS-based data analysis. Parts of the manuscript touching on clinical outcome measures may require additional review from practicing oncologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study assesses eRNA activity as a classifier of different subtypes of breast cancer and as a prognosis tool. The authors take advantage of previously published RNA-seq data from human breast cancer samples and assess it more deeply, considering the cancer subtype of the patient. They then apply two machine learning approaches to find which eRNAs can classify the different breast cancer subtypes. While they do not find any eRNA that helps distinguish ductal vs. lobular breast cancers, their approach helps identify eRNAs that distinguish luminal A, B, basal and Her2+ cancers. They also use motif enrichment analysis and ChIP-seq datasets to characterize the eRNA regions further. Through this analysis, they observe that those eRNAs where ER binds strongest are associated with a poor patient prognosis.

      Major comments

      • Part of the rationale for this study is the previous observation that eRNAs are less associated with the prognosis of breast cancer patients in comparison to mRNAs and they claim that the high heterogeneity between breast cancer subtypes would mask the importance of eRNAs. In this study, the authors solely focus on eRNAs as a classification of breast cancer subtypes and prognostic tool and do not answer whether eRNAs or mRNAs are a better predictor of cancer subtypes and of prognosis. Since the answer and the tools are already in their hands, it would be important to also see a comparative analysis where they assess which of the two (mRNAs or eRNAs) is a better predictor.
      • The authors run the umaps of Fig. 1C only taking the predictor eRNAs. It is then somewhat expected to observe a separation. Coming from a single-cell omics field, what I would suggest is to take the eRNA loci and compute a umap with the highly variable regions, perform clustering on it and assess how the cancer subtypes are structured within the data. This would give a first overview of how much segregation and structure one can have with this data. Having a first step of data exploration would also strengthen the paper. If the authors have tried it, could the authors comment on it?
      • 'neither measures could classify any distinct eRNAs for invasive ductal vs lobular cancer samples' S1B. Just by eye, I can see a potential enrichment of ductal on the left and on the right while lobular stays in the center. This suggests to me that, while perhaps each eRNA alone does not have the power to classify the lobular vs ductal subtype, perhaps there is a difference - which could result from a cooperative model of eRNA influence - that would need further exploration. Would a PCA also show enrichments of ductal vs. lobular in specific parts of the plot? It may be worth exploring the PC loadings to see which eRNAs could play an influence. In this regard, a more unbiased visual examination, as suggested in my previous point, could help clarify whether there could be an association of certain eRNAs that cannot be captured by ML.
      • "we employed machine learning approaches on 302,951 eRNA loci identified from RNA-seq datasets from 1,095 breast cancer patient samples from previous studies" - the previous studies from which the authors take the data [11,12] highlight the presence of ~60K enhancers in the human genome and they use less than that in their analysis. Could the authors please clarify the differences in numbers with previous studies and give a reasoning? Also, from the methods section, they discard many patient samples due to low QC, so, from what I understand, the number of samples analyzed in the end is 975 and not 1,095.

      Minor comments

      • Can the authors please state the parameters of the umap in methods? Although it could be intrinsic to the dataset, data points are grouped in a way that makes me think that the granularity is too forced. Could the authors please show how the umap would behave with more lenient parameters? Or even with PCA?
      • 'Majority of the basal' -> The majority of the basal.

      Significance

      This is a paper relevant in the cancer field, particularly for breast cancer research. The significance of the paper lies in digging into the breast cancer samples, taking the different existing subtypes into account to assess the contribution of eRNAs as a classifier and as a prognostic tool. The data is already available but it has not been studied to this degree of detail. It highlights the importance of characterizing cancer samples in more depth, considering its intrinsic heterogeneity, as averaging across different subtypes would mask biology. My expertise lies in gene regulation and single-cell omics. My contribution will therefore be more focused on the analysis and extraction of biological information. The extent of its specific relevance in cancer research falls beyond my expertise.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors have done cryo-electron tomography of the manchette, a microtubule-based structure important for proper sperm head formation during spermatogenesis. They also did mass-spectrometry of the isolated structures. Vesicles, actin and their linkers to microtubules within the structure are shown.

      __We thank the reviewer for the critical reading of our manuscript; we have implemented the suggestions as detailed below, which we believe indeed improved the manuscript. __

      Major:

      The data the conclusions are based on seem very limited and sometimes overinterpreted. For example, only one connection between actin and microtubules was observed, and this is thought to be MACF1 simply based on its presence in the MS.

      __We regret giving the impression that the data is limited. We in fact collected >100 tilt series from 3 biological replicas for the isolated manchette. __

      __In the revised version, we added data from in-situ studies showing vesicles interacting with the manchette (as requested below, new Fig. 1). __

      Specifically, for the interaction of actin with microtubule we added more examples (Revised Fig. 6) and we toned down the discussion related to the relevance of this interaction (lines 193-194, 253-255). MACF1 is mentioned only as a possible candidate in the discussion (line 254).

      Another, and larger concern, is that the authors do a structural study on something that has been purified out of the cell, a process which is extremely disruptive. Vesicles, actin and other cellular components could easily be trapped in this cytoskeletal sieve during the purification process and as such, not be bona fide manchette components. This could create both misleading proteomics and imaging. Therefore, an approach not requiring extraction such as high-pressure freezing, sectioning and room-temperature electron tomography and/or immunoEM on sections to set aside this concern is strongly recommended. As an additional bonus, it would show if the vesicles containing ATP synthase are deformed mitochondria.

      __We recognise the concern raised by the reviewer. __

      __To alleviate this concern, we added imaging data of manchettes in-situ that show vesicles, mitochondria and filaments interacting with the manchette (new Fig. 1), essentially confirming the observations that were made on the isolated manchette. __

      __The benefits of imaging the isolated manchette were better throughput (being able to collect more data) and reaching higher resolution allowing to resolve unequivocally the dynein/dynactin and actin filaments. __

      Minor: Line 99: "to study IMT with cryo-ET, manchettes were isolated ...(insert from which organism)..."

      __Added in line 102 in the revised version. __

      Line 102 "...demonstrating that they can be used to study IMT".. can the authors please clarify?

      This paragraph was revised (lines 131-137), we hope it is now more clear.

      Line 111 "densities face towards the MT plus-end" How can a density "face" anywhere? For this, it needs to have a defined front and back.

      Microtubule motor proteins (kinesin and dynein) are often attached to the microtubules with an angle and dynactin and cargo on one side (plus end). We rephrased this part and removed the word “face” in the revised version to make it more clear (lines 161-162).

      Line 137: is the "perinuclear ring" the same as the manchette?

      The perinuclear ring is the apical part of the manchette that connects it to the nucleus. We added to the revised version imaging of the perinuclear ring with observations on how it changes when the manchette elongates (new Fig. 2).

      Figure 2B: How did the authors decide not to model the electron density found between the vesicle and the MT at 3 O'clock? Is there no other proteins with a similar lollipop structure as ATP synthase, so that this can be said to be this protein with such certainty?

      __The densities connecting the vesicles to the microtubules shown in (now) Fig. 4D are not consistent enough to be averaged. __

      __The densities resembling ATP synthase are inside the vesicles. Nevertheless, we have decided to remove the averaging of the ATP synthases from the revised manuscipt as they are not of great importance for this manuscript. Instead, the new in-situ data clearly show mitochondria (with their characteristic double membrane and cristae) interacting with manchette microtubule (new Fig 1C). __

      Line 189: "F-actin formed organized bundles running parallel to mMTs" - this observation needs confirming in a less disrupted sample.

      __Phalloidin (actin marker) was shown before to stain the manchette (PMID: 36734600). As actin filaments are very thin (7 nm) they are very hard to observe in plastic embedded EM. __

      In the in-situ data we added to the revised manuscipt (new Fig 1D), we observe filaments with a diameter corresponding to actin. In addition, we added more examples of microtubules interacting with actin in isolated manchette (new Fig. 6 E-K).

      Line 242 remove first comma sign.

      Removed.

      Line 363 "a total of 2 datasets" - is this manuscript based on only two tilt-series? Or two datasets from each of the 4 grids? In any case, this is very limited data.

      We apologise for not clearly providing the information about the data size in the original manuscipt. The data is based on three biological replicas (3 animals). We collected more than 100 tomograms of different regions of the manchettes. As such, we would argue that the data is not limited per se.

      Reviewer #1 (Significance (Required)):

      The article is very interesting, and if presented together with the suggested controls, would be informative to both microtubule/motorprotein researchers as well as those trying studying spermatogenesis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manchette appears as a shield-like structure surrounding the flagellar basal body upon spermiogenesis. It consists of a number of microtubules like a comb, but actin (Mochida et al. 1998 Dev. Biol. 200, 46) and myosin (Hayasaka et al. 2008 Asian J. Androl. 10, 561) were found, suggesting transportation inside the manchette. Detailed structural information and functional insight into the manchette was still awaited. There is a hypothesis called IMT (intra-machette transport) based on the fact that machette and IFT (intraflagellar transport) share common components (or homologues) and on their transition along the stages of spermiogenesis. While IMT is considered as a potential hypothesis to explain delivery of centrosomal and flagellar components, no one has witnessed IMT at the same level as IFT. IMT has never been purified, visualized in motion or at high resolution. This study for the first time visualized manchette using high-end cryo-electron tomography of isolated manchettes, addressing structural characterization of IMT. The authors successfully microtubular bundles, vesicles located between microtubules and a linker-like structure connecting the vesicle and the microtubule. On multilamellar membranes in the vesicles they found particles and assigned them to ATPase complexes, based on intermediate (~60A) resolution structure. They further identified interesting structures, such as (1) particles on microtubules, which resemble dynein and (2) filaments which shows symmetry of F-actin. All the molecular assignments are consistent with their proteomics of manchettes.

      __We thank the reviewer for highlighting the novelty of our study.____ __

      Their assignment of ATPase will be strengthened by MS data, if it proves absence of other possible proteins forming such a membrane protein complex.

      All the ATPase components were indeed found in our proteomics data. Nevertheless, we have decided to remove the averaging of the ATPase as it does not directly relate to IMT, the focus of this manuscript.

      They discussed possible role of various motor proteins based on their abundance (Line 134-151, Line 200). This makes sense only with a control. Absolute abundance of proteins would not necessarily present their local importance or roles. This reviewer would suggest quantitative proteomics of other organelles, or whole cells, or other fractions obtained during manchette isolation, to demonstrate unique abundance of KIF27 and other proteins of their interest.

      We agree with the reviewer that absolute abundance does not necessarily indicate importance or a role. As such, we removed this part of the discussion from the revised manuscript.

      A single image from a tomogram, Fig.6B, is not enough to prove actin-MT interaction. A gallery and a number (how many such junctions were found from how many MTs) will be necessary.

      We agree that one example is not enough. In the new Fig. 6E-K, we provide a gallery of more examples. We have revised the text to reflect the point that these observations are still rare and more data will be needed to quantify this interaction (Lines 253-254).

      Minor points: Their manchette purification is based on Mochida et al., which showed (their Fig.2) similarity to the in vivo structure (for example, Fig.1 of Kierszenbaum 2001 Mol. Reproduc. Dev. 59, 347). Nevertheless, since this is not a very common prep, it is helpful to show the isolated manchette’s wide view (low mag cryo-EM or ET) to prove its intactness.

      We thank the reviewer for this suggestion, in the revised version, new Fig. 2 provides a cryo-EM overview of purified manchette from different developmental stages.

      Line 81: Myosin -> myosin (to be consistent with other protein names)

      Corrected.

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

      Reviewer #2 (Significance (Required)):

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      ->Summary:

      The manchette is a temporary microtubule (MT)-based structure essential for the development of the highly polarised sperm cell. In this study, the authors employed cryo-electron tomography (cryo-ET) and proteomics to investigate the intra-manchette transport system. Cryo-EM analysis of purified rat manchette revealed a high density of MTs interspersed with actin filaments, which appeared either bundled or as single filaments. Vesicles were observed among the MTs, connected by stick-like densities that, based on their orientation relative to MT polarity, were inferred to be kinesins. Subtomogram averaging (STA) confirmed the presence of dynein motor proteins. Proteomic analysis further validated the presence of dynein and kinesins and showed the presence of actin crosslinkers that could bundle actin filaments. Proteomics data also indicated the involvement of actin-based transport mediated by myosin. Importantly, the data indicated that the intraflagellar transport (IFT) system is not part of the intra-manchette transport mechanism. The visualisation of motor proteins directly from a biological sample represents a notable technical advancement, providing new insights into the organisation of the intra-manchette transport system in developing sperm.

      We thank the reviewer for summarising the novelty of our observations.

      -> Are the key conclusions convincing? Below we comment on three main conclusions. MT and F-actin bundles are both constituents of the manchette While the data convincingly shows that MT and F-actin are part of the manchette, one cannot conclude from it that F-actin is an integral part of the manchette. The authors would need to rephrase so that it is clear that they are speculating.

      We have rephrased our statements and replaced “integral” with ‘actin filaments are associated’. Of note previous studies suggested actin are part of the manchette including staining with phalloidin (PMID: 36734600, PMID: 9698455, PMID: 18478159) and we here visualised the actin in high resolution.

      The transport system employs different transport machinery on these MTs Proteomics data indicates the presence of multiple motor proteins in the manchette, while cryo-EM data corroborates this by revealing morphologically distinct densities associated with the MTs. However, the nature of only one of these MT-associated densities has been confirmed-specifically, dynein, as identified through STA. The presence of kinesin or myosin in the EM data remains unconfirmed based on just the cryo-ET density, and therefore it is unclear whether these proteins are actively involved in cargo transport, as this cannot be supported by just the proteomics data. In summary, we recommend that the authors rephrase this conclusion and avoid using the term "employ".

      We agree that our cryo-ET only confirmed the motor protein dynein. As such, we removed the term employ and rephrased our claims regarding the active transport and accordingly changed the title.

      Dynein mediated transport (Line 225-227) The data shows that dynein is present in the manchette; however, whether it plays and active role in transport cannot be determined from the cryo-ET data provided in the manuscript, as it does not clearly display a dynein-dynactin complex attached to cargo. The attachment to cargo is also not revealed via proteomics as no adaptor proteins that link dynein-dynactin to its cargo have been shown.

      A list of cargo adaptor proteins were found in our proteomics data but we agree that cryo-ET and proteomics alone cannot prove active transport. As such we toned down the discussion about active transport (lines 212-220).

      -> Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      F-actin • In the abstract, the authors state that F-actin provides tracks for transport as well as having structural and mechanical roles. However, the manuscript does not include experiments demonstrating a mechanical role. The authors appear to base this statement on literature where actin bundles have been shown to play a mechanical role in other model systems. We suggest they clarify that the mechanical role the authors suggest is speculative and add references if appropriate.

      __ ____We removed the claim about the mechanical role of the actin from the abstract and rephrased this in the discussion to suggest this role for the F-actin (lines 242-243).__

      • Lines 15,92, 180 and 255: The statement "Filamentous actin is an integral part of the manchette" is misleading. While the authors show that F-actin is present in their purified manchette structures, whether it is integral has not been tested. Authors should rephrase the sentence.

      We removed the word integral.

      • To support the claim that F-actin plays a role in transport within the manchette, the authors present only one instance where an unidentified density is attached to an actin filament. This is insufficient evidence to claim that it is myosin actively transporting cargo. Although the proteomics data show the presence of myosin, we suggest the authors exercise more caution with this claim.

      We agree that our data do not demonstrate active transport as such we removed that claim. We mention the possibility of cargo transport in the discussion (lines 250-255).

      • The authors mention the presence of F-actin bundles but do not show direct crosslinking between the F-actin filaments. They could in principle just be closely packed F-actin filaments that are not necessarily linked, so the term "bundle" should be used more cautiously.

      We do not assume that a bundle means that the F-actin filaments are crosslinked. A bundle simply indicates the presence of multiple F-actin filaments together. We rephrased it to call them actin clusters.

      Observations of dynein • Relating to Figure 2B: From the provided image it is not clear whether the density corresponds to a dynein complex, as it does not exhibit the characteristic morphological features of dynein or dynactin molecules.

      We indeed do not claim that the densities in this figure are dynein or dynactin. __We revised this paragraph and hope that it is now more clear (lines 135-137). __

      • Lines 171-172 and Figure 4: It is well established that dynein is a dimer and should always possess two motor domains. The authors have incorrectly assumed they observed single motor heads, except possibly in Figure 4A (marked by an arrow). In all other instances, the dynein complexes show two motor domains in proximity, but these have not been segmented accurately. Furthermore, the "cargos" shown in grey are more likely to represent dynein tails or the dynactin molecule, based on comparisons with in vitro structures of these complexes (see references 1-3).

      We thank the reviewer for this correction. We improved the annotations in the figure and revised the text to clarify that we identified dimers of dynein motor heads (lines 140-144). We further added a projection of a dynein dynactin complex to compare to the observation on the manchette (new Fig. 5E). We further changed claims on the presence of protein cargo to the presence of dynein/dynactin that allows cargo tethering based on the presence of cargo adaptors in the proteomics data.

      • Lines 21, 173, and 233 mention cargos, but as noted above, it seems to be parts of the dynein complex the authors are referring to.

      This was corrected as mentioned above.

      • Panel 4B appears to show a dynein-dynactin complex, but whether there is a cargo is unclear and if there is it should be labelled accordingly. To assessment of whether there is any cargo bound to the dynein-dynactin complex a larger crop of the panel would be helpful In summary, we recommend that the authors revisit their segmentations in Figures 2B and 4, revise their text based on these observations, and perform quantification of the data (as suggested in the next section).

      We thank the reviewers for sharing their expertise on dynein-dynactin complexes. We have revised the text as detailed above and excluded the assignment of any cargo, as we cannot (even from larger panels) see a clear association of cargo. We have made clear that we only refer to dynein dynactin with the capability of linking cargo based on the presence of proteomics data. We have removed claims on active transport with dynein.

      Dynein versus kinesin-based transport The calculation presented in lines 147-151 does not account for the fact that both the dynein-dynactin complex and kinesin proteins require cargo adaptors to transport cargo. Additionally, the authors overlook the possibility that multiple motors could be attached to a single cargo. If the authors did not observe this, they should explicitly mention it to support their argument. In short, the calculations are based on an incorrect premise, rendering the comparison inaccurate. Unless the authors have identified any dynein-dynactin or kinesin cargo adaptors in their proteomics data which could be used for such a comparison, we believe the authors lack sufficient data to accurately estimate the "active transport ratio" between dynein and kinesin.

      Even though we detect cargo adaptors in our proteomics, we agree that calculating relative transport based only on the proteomics can be inaccurate as such we removed absolute quantification and comparison between dynein and kinesin-based IMT.

      • Would additional experiments be essential to support the claims of the paper?

      F-actin distance and length distribution • To support the claim that F-actin is bundled (line 189), could the authors provide the distance between each F-actin filament and its neighbours? Additionally, could they compare the average distance to the length of actin crosslinkers found in their proteomics data, or compare it to the distances between crosslinked F-actin observed in other research studies?

      We measured distances between the actin filaments and added a plot to new Fig 6.

      • While showing that F-actin is important for the manchette would require cellular experiments, authors could provide quantification of how frequently these actin structures are observed in comparison to MTs to support their claims that these actin filaments could be important for the manchette structure.

      We agree that claims on the role and function of actin in the manchette require cellular experiments that are beyond the scope of this study. Absolute quantification of the ratio between MTs and actin from cryoET is very hard and will be inaccurate as the manchette cannot be imaged as a whole due to its size and thickness. The ratio we have is based on the relative abundance provided by the proteomics (Fig. 5F).

      • In line 193, the authors claim that the F-actin in bundles appears too short for transport. Could they provide length distributions for these filaments? This might provide further support to their claim that individual F-actin filaments can serve as transport tracks (line 266).

      __In addition to the limitation mentioned in the previous point, quantification of length from high magnification imaging will likely be inaccurate as the length of the actin in most cases is bigger than the field of view that is captured. Nevertheless, we removed the claim about the actin being too short for transport. __

      • Could the authors also quantify the abundance of individual F-actin filaments observed, compared to MTs and F-actin bundles, to support the idea that they could play a role in transport?

      As explained for the above points absolute quantification of the ratio between MTs and actin is not feasible from cryoET data that cannot capture all of the manchette in high enough resolution to resolve the actin.

      • In the discussion, the authors mention "interactions between F-actin singlets and mMTs" (line 269), yet they report observing only one instance of this interaction (lines 210 and 211). Given the limited data, they should refer to this as a single interaction in the discussion. The scarcity of data raises questions about how representative this event truly is.

      We agree that one example is not enough. In the new Fig. 6E-K, we provide a gallery of more examples as also requested by reviewers 1 and 2. We have also revised the text to reflect the point that these observations are still rare (Lines 190-194).

      Quantifications for judgement of representativity The authors should quantify how often they observed vesicles with a stick-like connection to MTs (lines 106-107); this would strengthen the interpretation of the density, as currently only one example is shown in the manuscript (Figure 4A). If possible, they could show how many of them are facing towards the MT plus end.

      __As mentioned in the text (lines 135-137), the linkers connecting vesicles to MTs were irregular and so we could not interpret them further this is in contrast to dynein that were easily recognisable but were not associated with vesicles. __

      Dynein quantifications • The authors are recommended to quantify how many dynein molecules per micron of MT they observe and how often they are angled with their MT binding domain towards the minus-end.

      As the manchette is large and highly dense any quantification will likely be biased towards parts of the manchette that are easier to image, for example the periphery. As such we do not think quantifying the dynein density will yield meaningful insight.

      • Could the authors quantify how many dynein densities they found to be attached to a (vesicle) cargo, if any (line 175)? They could show these observations in a supplementary figure.

      We did not observe any case of a connection between a vesicle and dynein motors, we edited this sentence to be more clear on that.

      • For densities that match the size and location of dynein but lack clear dynein morphology (as seen in Figure 2B), could the authors quantify how many are oriented towards the MT minus end?

      We had many cases where the connection did not have a clear dynein morphology, and as the morphology is not clear, it is impossible to make a claim about whether they are oriented towards the minus end.

      Artefacts due to purification: Authors should discuss if the purification could have effects on visualizing components of the manchette. For example, if it has effect on the MTs and actin structure or the abundance/structure of the motor protein complexes (bound to cargo or isolated).

      We have followed a protocol that was published before and showed the overall integrity of the manchette. Nevertheless, losing connections between manchette and other cellular organelles are expected. To address this point, we added in-situ data (new Fig 1) showing manchette in intact spermatids interacting with vesicles and mitochondria, as well as overviews of manchettes (new Fig 2), the text was revised accordingly.

      • Are the experiments adequately replicated and statistical analysis adequate? The cryo-ET data presented in the manuscript is collected using two separate sample preparations. Along with the quantifications of the different observations suggested above which will help the reader assess how abundant and representative these observations are, the authors could further strengthen their claims by acquiring data from a third sample preparation and then analysing how consistent their observations are between different purifications. This however could be time consuming so it is not a major requirement but recommended if possible within a short time frame.

      We regret not explicitly mentioning our data set size, it was added now to the revised version. In essence, the data is based on three biological replicas (3 animals). We collected more than 100 tomograms of different regions of the manchettes. We provided in the revised version more observations (new Fig 1, 2, 4B-C and 6E-K).

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Most of the comments deal with either modifying the text or analysing the data already presented, so the revision could be done with 1-3 months.


      Minor comments: - Specific experimental issues that are easily addressable. 1) Could the authors state how many tilt series were collected for each dataset/independent sample preparation? We recommend that they upload their raw data or tomograms to EMPAIR.

      We added this information in the material and methods.

      2) It is not clear to me if the same sample was used for cryo-ET and proteomics. Could the authors clarify how comparable the sample preparation for the cryo-ET and proteomics data is or if the same sample was used for both. If there is a discrepancy between these preparations, they would need to discuss how this can affect comparing observations from cryo-ET and mass spectrometry. Ideally both samples should be the same.

      After sample preparation the manchettes were directly frozen on grids. The rest of the samples was used for proteomics. Consequently, EM and MS data were acquired on the same samples. We clarified this in the text (lines 327-328).

      • Are prior studies referenced appropriately? We recommend including additional references to support the claim that F-actin has a mechanical role (line 242). Could the authors compare their proteomics data to other mass spectrometry studies conducted on the Manchette (for example, see reference 4)?

      We added the comparison but it is important to point out that in reference 4 the manchettes were isolated from mice testes.

      • Are the text and figures clear and accurate? Text: We do not see the necessity of specifying the microtubules (MTs) in the data as "manchette MTs" or "mMTs" rather than simply "MTs". However, we recommend that the authors use either "MT" or "mMT" consistently throughout the manuscript.

      We changed to only MTs.

      The authors appear to refer to both dynein-1 (cytoplasmic dynein) and dynein-2 (axonemal dynein or IFT dynein). To avoid confusion, it is important that the authors clearly specify which dynein they are referring to throughout the text. This is particularly relevant as the study aims to demonstrate that IFT is not part of the manchette transport system.

      • Introduction: In the third paragraph (lines 59-75), the authors should specify that they are referring to dynein-2, which is distinct from cytoplasmic dynein discussed in the previous paragraph (lines 44-58).

      We specify the respective dyneins in the text (line 66,140-141,145).

      • Figure 4D: The authors could fit a dynein-1 motor domain instead of a dynein-2 into the density to stay consistent with the fact that the density belongs to cytoplasmic dynein-1.

      __We changed the figure and fitted a cytosolic dynein-1 structure (5nvu) instead. __

      Figures: • Figure 2B: The legend mentions a large linker complex; however, this may correspond to two or three separate densities.

      We have addressed this and changed the wording.

      • Figure 4: please revisit the segmentation of this whole figure based on previous comments.

      __We revised as suggested. __

      • Figures 1, 2, 4, 5, and 6: It would be helpful to state in the legends that the tomograms are denoised. There are stripe-like densities visible in the images (e.g., in the vesicle in Figure 2B). Do these artefacts also appear in the raw data?

      As stated in the Methods section, tomograms were generally denoised with CryoCare for visualisation purposes. The “stripe-like densities” are artefacts of the gold fiducials used for tomogram alignment and appear in the raw data (before denoising).

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? We suggest revising the paragraph title "Dynein-mediated cargo along the manchette" (line 165) to "Dynein-mediated cargo transport along the manchette".

      __We have changed this in the revised version. __

      We recommend that the authors provide additional evidence to support the interpretation that the observed EM densities correspond to motor proteins. Specifically: • Include scale bars or reference lines indicating the known dimensions of motor proteins, based on previous data, to demonstrate that the observed densities match the expected size.

      The dynein structure is provided for reference. We also added the cytosolic dynein–dynactin as a reference (Fig 5E).

      • Make direct comparisons to existing EM data and highlight morphological similarities.

      We have added a comparison to existing data (Fig 5E).

      In the discussion (lines 249-254), the authors could speculate on alternative roles for the IFT components in the manchette, particularly if they are not part of the IFT trains. We also suggest rephrasing the claim in line 266 to make it more speculative in tone.

      __We have addressed this in the revised version (lines 221-230). __

      Finally, a schematic overview of the manchette ultrastructure in a spermatid would greatly aid the reader in understanding the material presented.

      We now include a graphical abstract and overviews of isolated manchettes on cryo-EM grids.

      References: 1. Chowdhury, S., Ketcham, S., Schroer, T. et al. Structural organization of the dynein-dynactin complex bound to microtubules. Nat Struct Mol Biol 22, 345-347 (2015). https://doi.org/10.1038/nsmb.2996

      1. Grotjahn, D.A., Chowdhury, S., Xu, Y. et al. Cryo-electron tomography reveals that dynactin recruits a team of dyneins for processive motility. Nat Struct Mol Biol 25, 203-207 (2018). https://doi.org/10.1038/s41594-018-0027-7

      2. Chaaban, S., Carter, A.P. Structure of dynein-dynactin on microtubules shows tandem adaptor binding. Nature 610, 212-216 (2022).https://doi.org/10.1038/s41586-022-05186-y

      3. W. Hu, R. Zhang, H. Xu, Y. Li, X. Yang, Z. Zhou, X. Huang, Y. Wang, W. Ji, F. Gao, W. Meng, CAMSAP1 role in orchestrating structure and dynamics of manchette microtubule minus-ends impacts male fertility during spermiogenesis, Proc. Natl. Acad. Sci. U.S.A. 120 (45) e2313787120, https://doi.org/10.1073/pnas.2313787120 (2023).

      Reviewer #3 (Significance (Required)):

      This study employs cryo-electron tomography (cryo-ET) and proteomics to elucidate the architecture of the manchette. It advances our understanding of the components involved in intracellular transport within the manchette and introduces the following technical and conceptual innovations:

      a) Technical Advances: The authors have visualized the manchette at high resolution using cryo-ET. They optimized a purification pipeline capable of retaining, at least partially, the transport machinery of the manchette. Notably, they observed dynein and putative kinesin motors attached to microtubules-a significant achievement that, to our knowledge, has not been reported previously.

      b) Conceptual Advances: This study provides novel insights into spermatogenesis. The findings suggest that intraflagellar transport (IFT) is unlikely to play a role at this stage of sperm development while shedding light on alternative transport systems. Importantly, the authors demonstrate that actin filaments organize in two distinct ways: clustering parallel to microtubules or forming single filaments.

      This work is likely to be of considerable interest to researchers in sperm development and structural biology. Additionally, it may appeal to scientists studying motor proteins and the cytoskeleton.

      We thank the reviewers for appreciating the significance and novelty of our study.

      The reviewers possess extensive expertise in in situ cryo-electron tomography and single-particle microscopy, including work on dynein-based complexes. Collectively, they have significant experience in the field of cytoskeleton-based transport.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manchette is a temporary microtubule (MT)-based structure essential for the development of the highly polarised sperm cell. In this study, the authors employed cryo-electron tomography (cryo-ET) and proteomics to investigate the intra-manchette transport system. Cryo-EM analysis of purified rat manchette revealed a high density of MTs interspersed with actin filaments, which appeared either bundled or as single filaments. Vesicles were observed among the MTs, connected by stick-like densities that, based on their orientation relative to MT polarity, were inferred to be kinesins. Subtomogram averaging (STA) confirmed the presence of dynein motor proteins. Proteomic analysis further validated the presence of dynein and kinesins and showed the presence of actin crosslinkers that could bundle actin filaments. Proteomics data also indicated the involvement of actin-based transport mediated by myosin. Importantly, the data indicated that the intraflagellar transport (IFT) system is not part of the intra-manchette transport mechanism. The visualisation of motor proteins directly from a biological sample represents a notable technical advancement, providing new insights into the organisation of the intra-manchette transport system in developing sperm.

      Are the key conclusions convincing?

      Below we comment on three main conclusions.

      MT and F-actin bundles are both constituents of the manchette While the data convincingly shows that MT and F-actin are part of the manchette, one cannot conclude from it that F-actin is an integral part of the manchette. The authors would need to rephrase so that it is clear that they are speculating.

      The transport system employs different transport machinery on these MTs Proteomics data indicates the presence of multiple motor proteins in the manchette, while cryo-EM data corroborates this by revealing morphologically distinct densities associated with the MTs. However, the nature of only one of these MT-associated densities has been confirmed-specifically, dynein, as identified through STA. The presence of kinesin or myosin in the EM data remains unconfirmed based on just the cryo-ET density, and therefore it is unclear whether these proteins are actively involved in cargo transport, as this cannot be supported by just the proteomics data. In summary, we recommend that the authors rephrase this conclusion and avoid using the term "employ".

      Dynein mediated transport (Line 225-227) The data shows that dynein is present in the manchette; however, whether it plays and active role in transport cannot be determined from the cryo-ET data provided in the manuscript, as it does not clearly display a dynein-dynactin complex attached to cargo. The attachment to cargo is also not revealed via proteomics as no adaptor proteins that link dynein-dynactin to its cargo have been shown.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      F-actin

      • In the abstract, the authors state that F-actin provides tracks for transport as well as having structural and mechanical roles. However, the manuscript does not include experiments demonstrating a mechanical role. The authors appear to base this statement on literature where actin bundles have been shown to play a mechanical role in other model systems. We suggest they clarify that the mechanical role the authors suggest is speculative and add references if appropriate.
      • Lines 15,92, 180 and 255: The statement "Filamentous actin is an integral part of the manchette" is misleading. While the authors show that F-actin is present in their purified manchette structures, whether it is integral has not been tested. Authors should rephrase the sentence.
      • To support the claim that F-actin plays a role in transport within the manchette, the authors present only one instance where an unidentified density is attached to an actin filament. This is insufficient evidence to claim that it is myosin actively transporting cargo. Although the proteomics data show the presence of myosin, we suggest the authors exercise more caution with this claim.
      • The authors mention the presence of F-actin bundles but do not show direct crosslinking between the F-actin filaments. They could in principle just be closely packed F-actin filaments that are not necessarily linked, so the term "bundle" should be used more cautiously.

      Observations of dynein

      • Relating to Figure 2B: From the provided image it is not clear whether the density corresponds to a dynein complex, as it does not exhibit the characteristic morphological features of dynein or dynactin molecules.
      • Lines 171-172 and Figure 4: It is well established that dynein is a dimer and should always possess two motor domains. The authors have incorrectly assumed they observed single motor heads, except possibly in Figure 4A (marked by an arrow). In all other instances, the dynein complexes show two motor domains in proximity, but these have not been segmented accurately. Furthermore, the "cargos" shown in grey are more likely to represent dynein tails or the dynactin molecule, based on comparisons with in vitro structures of these complexes (see references 1-3).
      • Lines 21, 173, and 233 mention cargos, but as noted above, it seems to be parts of the dynein complex the authors are referring to.
      • Panel 4B appears to show a dynein-dynactin complex, but whether there is a cargo is unclear and if there is it should be labelled accordingly. To assessment of whether there is any cargo bound to the dynein-dynactin complex a larger crop of the panel would be helpful In summary, we recommend that the authors revisit their segmentations in Figures 2B and 4, revise their text based on these observations, and perform quantification of the data (as suggested in the next section).

      Dynein versus kinesin-based transport

      The calculation presented in lines 147-151 does not account for the fact that both the dynein-dynactin complex and kinesin proteins require cargo adaptors to transport cargo. Additionally, the authors overlook the possibility that multiple motors could be attached to a single cargo. If the authors did not observe this, they should explicitly mention it to support their argument. In short, the calculations are based on an incorrect premise, rendering the comparison inaccurate. Unless the authors have identified any dynein-dynactin or kinesin cargo adaptors in their proteomics data which could be used for such a comparison, we believe the authors lack sufficient data to accurately estimate the "active transport ratio" between dynein and kinesin.

      Would additional experiments be essential to support the claims of the paper?

      F-actin distance and length distribution

      • To support the claim that F-actin is bundled (line 189), could the authors provide the distance between each F-actin filament and its neighbours? Additionally, could they compare the average distance to the length of actin crosslinkers found in their proteomics data, or compare it to the distances between crosslinked F-actin observed in other research studies?
      • While showing that F-actin is important for the manchette would require cellular experiments, authors could provide quantification of how frequently these actin structures are observed in comparison to MTs to support their claims that these actin filaments could be important for the manchette structure.
      • In line 193, the authors claim that the F-actin in bundles appears too short for transport. Could they provide length distributions for these filaments? This might provide further support to their claim that individual F-actin filaments can serve as transport tracks (line 266).
      • Could the authors also quantify the abundance of individual F-actin filaments observed, compared to MTs and F-actin bundles, to support the idea that they could play a role in transport?
      • In the discussion, the authors mention "interactions between F-actin singlets and mMTs" (line 269), yet they report observing only one instance of this interaction (lines 210 and 211). Given the limited data, they should refer to this as a single interaction in the discussion. The scarcity of data raises questions about how representative this event truly is.

      Quantifications for judgement of representativity

      The authors should quantify how often they observed vesicles with a stick-like connection to MTs (lines 106-107); this would strengthen the interpretation of the density, as currently only one example is shown in the manuscript (Figure 4A). If possible, they could show how many of them are facing towards the MT plus end.

      Dynein quantifications

      • The authors are recommended to quantify how many dynein molecules per micron of MT they observe and how often they are angled with their MT binding domain towards the minus-end.
      • Could the authors quantify how many dynein densities they found to be attached to a (vesicle) cargo, if any (line 175)? They could show these observations in a supplementary figure.
      • For densities that match the size and location of dynein but lack clear dynein morphology (as seen in Figure 2B), could the authors quantify how many are oriented towards the MT minus end?

      Artefacts due to purification: Authors should discuss if the purification could have effects on visualizing components of the manchette. For example, if it has effect on the MTs and actin structure or the abundance/structure of the motor protein complexes (bound to cargo or isolated).

      Are the experiments adequately replicated and statistical analysis adequate?

      The cryo-ET data presented in the manuscript is collected using two separate sample preparations. Along with the quantifications of the different observations suggested above which will help the reader assess how abundant and representative these observations are, the authors could further strengthen their claims by acquiring data from a third sample preparation and then analysing how consistent their observations are between different purifications. This however could be time consuming so it is not a major requirement but recommended if possible within a short time frame.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Most of the comments deal with either modifying the text or analysing the data already presented, so the revision could be done with 1-3 months.

      Minor comments:

      Specific experimental issues that are easily addressable.

      1. Could the authors state how many tilt series were collected for each dataset/independent sample preparation? We recommend that they upload their raw data or tomograms to EMPAIR.
      2. It is not clear to me if the same sample was used for cryo-ET and proteomics. Could the authors clarify how comparable the sample preparation for the cryo-ET and proteomics data is or if the same sample was used for both. If there is a discrepancy between these preparations, they would need to discuss how this can affect comparing observations from cryo-ET and mass spectrometry. Ideally both samples should be the same.

      Are prior studies referenced appropriately?

      We recommend including additional references to support the claim that F-actin has a mechanical role (line 242). Could the authors compare their proteomics data to other mass spectrometry studies conducted on the Manchette (for example see reference 4)?

      Are the text and figures clear and accurate?

      Text: We do not see the necessity of specifying the microtubules (MTs) in the data as "manchette MTs" or "mMTs" rather than simply "MTs". However, we recommend that the authors use either "MT" or "mMT" consistently throughout the manuscript.

      The authors appear to refer to both dynein-1 (cytoplasmic dynein) and dynein-2 (axonemal dynein or IFT dynein). To avoid confusion, it is important that the authors clearly specify which dynein they are referring to throughout the text. This is particularly relevant as the study aims to demonstrate that IFT is not part of the manchette transport system.

      • Introduction: In the third paragraph (lines 59-75), the authors should specify that they are referring to dynein-2, which is distinct from cytoplasmic dynein discussed in the previous paragraph (lines 44-58).
      • Figure 4D: The authors could fit a dynein-1 motor domain instead of a dynein-2 into the density to stay consistent with the fact that the density belongs to cytoplasmic dynein-1. Figures:
      • Figure 2B: The legend mentions a large linker complex; however, this may correspond to two or three separate densities.
      • Figure 4: please revisit the segmentation of this whole figure based on previous comments.
      • Figures 1, 2, 4, 5, and 6: It would be helpful to state in the legends that the tomograms are denoised. There are stripe-like densities visible in the images (e.g., in the vesicle in Figure 2B). Do these artefacts also appear in the raw data?

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      We suggest revising the paragraph title "Dynein-mediated cargo along the manchette" (line 165) to "Dynein-mediated cargo transport along the manchette".

      We recommend that the authors provide additional evidence to support the interpretation that the observed EM densities correspond to motor proteins. Specifically:

      • Include scale bars or reference lines indicating the known dimensions of motor proteins, based on previous data, to demonstrate that the observed densities match the expected size.
      • Make direct comparisons to existing EM data and highlight morphological similarities. In the discussion (lines 249-254), the authors could speculate on alternative roles for the IFT components in the manchette, particularly if they are not part of the IFT trains. We also suggest rephrasing the claim in line 266 to make it more speculative in tone. Finally, a schematic overview of the manchette ultrastructure in a spermatid would greatly aid the reader in understanding the material presented.

      References:

      1. Chowdhury, S., Ketcham, S., Schroer, T. et al. Structural organization of the dynein-dynactin complex bound to microtubules. Nat Struct Mol Biol 22, 345-347 (2015). https://doi.org/10.1038/nsmb.2996
      2. Grotjahn, D.A., Chowdhury, S., Xu, Y. et al. Cryo-electron tomography reveals that dynactin recruits a team of dyneins for processive motility. Nat Struct Mol Biol 25, 203-207 (2018). https://doi.org/10.1038/s41594-018-0027-7
      3. Chaaban, S., Carter, A.P. Structure of dynein-dynactin on microtubules shows tandem adaptor binding. Nature 610, 212-216 (2022). https://doi.org/10.1038/s41586-022-05186-y
      4. W. Hu, R. Zhang, H. Xu, Y. Li, X. Yang, Z. Zhou, X. Huang, Y. Wang, W. Ji, F. Gao, W. Meng, CAMSAP1 role in orchestrating structure and dynamics of manchette microtubule minus-ends impacts male fertility during spermiogenesis, Proc. Natl. Acad. Sci. U.S.A. 120 (45) e2313787120, https://doi.org/10.1073/pnas.2313787120 (2023).

      Significance

      This study employs cryo-electron tomography (cryo-ET) and proteomics to elucidate the architecture of the manchette. It advances our understanding of the components involved in intracellular transport within the manchette and introduces the following technical and conceptual innovations:

      a) Technical Advances:

      The authors have visualized the manchette at high resolution using cryo-ET. They optimized a purification pipeline capable of retaining, at least partially, the transport machinery of the manchette. Notably, they observed dynein and putative kinesin motors attached to microtubules-a significant achievement that, to our knowledge, has not been reported previously.

      b) Conceptual Advances:

      This study provides novel insights into spermatogenesis. The findings suggest that intraflagellar transport (IFT) is unlikely to play a role at this stage of sperm development while shedding light on alternative transport systems. Importantly, the authors demonstrate that actin filaments organize in two distinct ways: clustering parallel to microtubules or forming single filaments.

      This work is likely to be of considerable interest to researchers in sperm development and structural biology. Additionally, it may appeal to scientists studying motor proteins and the cytoskeleton.

      The reviewers possess extensive expertise in in situ cryo-electron tomography and single-particle microscopy, including work on dynein-based complexes. Collectively, they have significant experience in the field of cytoskeleton-based transport.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manchette appears as a shield-like structure surrounding the flagellar basal body upon spermiogenesis. It consists of a number of microtubules like a comb, but actin (Mochida et al. 1998 Dev. Biol. 200, 46) and myosin (Hayasaka et al. 2008 Asian J. Androl. 10, 561) were found, suggesting transportation inside the manchette. Detailed structural information and functional insight into the manchette was still awaited. There is a hypothesis called IMT (intra machette transport) based on the fact that machette and IFT (intraflagellar transport) share common components (or homologues) and on their transition along the stages of spermiogenesis. While IMT is considered as a potential hypothesis to explain delivery of centrosomal and flagellar components, no one has witnessed IMT at the same level as IFT. IMT has never been purified, visualized in motion or at high resolution.

      This study for the first time visualized manchette using high-end cryo-electron tomography of isolated manchettes, addressing structural characterization of IMT. The authors successfully microtubular bundles, vesicles located between microtubules and a linker-like structure connecting the vesicle and the microtubule. On multilamellar membranes in the vesicles they found particles and assigned them to ATPase complexes, based on intermediate (~60A) resolution structure. They further identified interesting structures, such as (1) particles on microtubules, which resemble dynein and (2) filaments which shows symmetry of F-actin. All the molecular assignments are consistent with their proteomics of manchettes.

      Their assignment of ATPase will be strengthened by MS data, if it proves absence of other possible proteins forming such a membrane protein complex.

      They discussed possible role of various motor proteins based on their abundance (Line 134-151, Line 200). This makes sense only with a control. Absolute abundance of proteins would not necessarily present their local importance or roles. This reviewer would suggest quantitative proteomics of other organelles, or whole cells, or other fractions obtained during manchette isolation, to demonstrate unique abundance of KIF27 and other proteins of their interest.<br /> A single image from a tomogram, Fig.6B, is not enough to prove actin-MT interaction. A gallery and a number (how many such junctions were found from how many MTs) will be necessary.

      Minor points:

      Their manchette purification is based on Mochida et al., which showed (their Fig.2) similarity to the in vivo structure (for example, Fig.1 of Kierszenbaum 2001 Mol. Reproduc. Dev. 59, 347). Nevertheless, since this is not a very common prep, it is helpful to show the wide view (low mag cryo-EM or ET) of the isolated manchette to prove its intactness. Line 81: Myosin -> myosin (to be consistent with other protein names)

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

      Significance

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript the authors have done cryo-electron tomography of the manchette, a microtubule-based structure important for proper sperm head formation during spermatogenesis. They also did mass-spectrometry of the isolated structures. Vesicles, actin and their linkers to microtubules within the structure are shown.

      Major:

      The data the conclusions are based on seem very limited and sometimes overinterpreted. For example, only one connection between actin and microtubules was observed, and this is thought to be MACF1 simply based on its presence in the MS.

      Another, and larger concern, is that the authors do a structural study on something that has been purified out of the cell, a process which is extremely disruptive. Vesicles, actin and other cellular components could easily be trapped in this cytoskeletal sieve during the purification process and as such, not be bona fide manchette components. This could create both misleading proteomics and imaging. Therefore, an approach not requiring extraction such as high-pressure freezing, sectioning and room-temperature electron tomography and/or immunoEM on sections to set aside this concern is strongly recommended. As an additional bonus, it would show if the vesicles containing ATP synthase are deformed mitochondria.

      Minor:

      Line 99: "to study IMT with cryo-ET, manchettes were isolated ...(insert from which organism)..."

      Line 102 "...demonstrating that they can be used to study IMT".. can the authors please clarify?

      Line 111 "densities face towards the MT plus-end" How can a density "face" anywhere? For this, it needs to have a defined front and back.

      Line 137: is the "perinuclear ring" the same as the manchette?

      Figure 2B: How did the authors decide to not model the electron density found between the vesicle and the MT at 3 O'clock? Is there no other proteins with a similar lollipop structure as ATP synthase, so that this can be said to be this protein with such certainty?

      Line 189: "F-actin formed organized bundles running parallel to mMTs" - this observation needs confirming in a less disrupted sample.

      Line 242 remove first comma sign

      Line 363 "a total of 2 datasets" - is this manuscript based on only two tilt-series? Or two datasets from each of the 4 grids? In any case, this is very limited data.

      Significance

      The article is very interesting, and if presented together with the suggested controls, would be informative to both microtubule/motorprotein researchers as well as those trying studying spermatogenesis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility, and clarity

      The work by Pinon et al describes the generation of a microvascular model to study Neisseria meningitidis interactions with blood vessels. The model uses a novel and relatively high throughput fabrication method that allows full control over the geometry of the vessels. The model is well characterized. The authors then study different aspects of Neisseria-endothelial interactions and benchmark the bacterial infection model against the best disease model available, a human skin xenograft mouse model, which is one of the great strengths of the paper. The authors show that Neisseria binds to the 3D model in a similar geometry that in the animal xenograft model, induces an increase in permeability short after bacterial perfusion, and induces endothelial cytoskeleton rearrangements. Finally, the authors show neutrophil recruitment to bacterial microcolonies and phagocytosis of Neisseria. The article is overall well written, and it is a great advancement in the bioengineering and sepsis infection field, and I only have a few major comments and some minor.

      Major comments:

      Infection-on-chip. I would recommend the authors to change the terminology of "infection on chip" to better reflect their work. The term is vague and it decreases novelty, as there are multiple infection on chips models that recapitulate other infections (recently reviewed in https://doi.org/10.1038/s41564-024-01645-6) including Ebola, SARS-CoV-2, Plasmodium and Candida. Maybe the term "sepsis on chip" would be more specific and exemplify better the work and novelty. Also, I would suggest that the authors carefully take a look at the text and consider when they use VoC or to current term IoC, as of now sometimes they are used interchangeably, with VoC being used occasionally in bacteria perfused experiments.

      We thank Reviewer #1 for this suggestion. Indeed, we have chosen to replace the term "Infection-on-Chip" by "infected Vessel-on-chip" to avoid any confusion in the title and the text. Also, we have removed all the terms "IoC" which referred to "Infection-on-Chip" and replaced with "VoC" for "Vessel-on-Chip". We think these terms will improve the clarity of the main text.

      Fig 3 and Suppmentary 3: Permeability. The authors suggest that early 3h infection with Neisseria do not show increase in vascular permeability in the animal model, contrary to their findings in the 3D in vitro model. However, they show a non-significant increase in permeability of 70 KDa Dextran in the animal xenograft early infection. This seems to point that if the experiment would have been done with a lower molecular weight tracer, significant increases in permeability could have been detected. I would suggest to do this experiment that could capture early events in vascular disruption.

      Comparing permeability under healthy and infected conditions using Dextran smaller than 70 kDa is challenging. Previous research [1] has shown that molecules below 70 kDa already diffuse freely in healthy tissue. Given this high baseline diffusion, we believe that no significant difference would be observed before and after N. meningitidis infection and these experiments were not carried out. As discussed in the manuscript, bacteria induced permeability in mouse occurs at later time points, 16h post infection as shown previoulsy [2]. As discussed in the manuscript, this difference between the xenograft model and the chip likely reflect the absence in the chip of various cell types present in the tissue parenchyma.

      The authors show the formation of actin of a honeycomb structure beneath the bacterial microcolonies. This only occurred in 65\% of the microcolonies. Is this result similar to in vitro 2D endothelial cultures in static and under flow? Also, the group has shown in the past positive staining of other cytoskeletal proteins, such as ezrin in the ERM complex. Does this also occur in the 3D system?

      We thank the Reviewer #1 for this suggestion. - According to this recommendation, we imaged monolayers of endothelial cells in the flat regions of the chip (the two lateral channels) using the same microscopy conditions (i.e., Obj. 40X N.A. 1.05) that have been used to detect honeycomb structures in the 3D vessels in vitro. We showed that more than 56% of infected cells present these honeycomb structures in 2D, which is 13% less than in 3D, and is not significant due to the distributions of both populations. Thus, we conclude that under both in vitro conditions, 2D and 3D, the amount of infected cells exhibiting cortical plaques is similar. We have added the graph and the confocal images in Figure S4B and lines 418-419 of the revised manuscript. - We recently performed staining of ezrin in the chip and imaged both the 3D and 2D regions. Although ezrin staining was visible in 3D (Fig. 1 of this response), it was not as obvious as other markers under these infected conditions and we did not include it in the main text. Interpretation of this result is not straight forward as for instance the substrate of the cells is different and it would require further studies on the behaviour of ERM proteins in these different contexts.

      One of the most novel things of the manuscript is the use of a relatively quick photoablation system. I would suggest that the authors add a more extensive description of the protocol in methods. Could this technique be applied in other laboratories? If this is a major limitation, it should be listed in the discussion.

      Following the Reviewer's comment, we introduced more detailed explanations regarding the photoablation: - L157-163 (Results): "Briefly, the chosen design is digitalized into a list of positions to ablate. A pulsed UV-LASER beam is injected into the microscope and shaped to cover the back aperture of the objective. The laser is then focused on each position that needs ablation. After introducing endothelial cells (HUVEC) in the carved regions,.." - L512-516 (Discussion): "The speed capabilities drastically improve with the pulsing repetition rate. Given that our laser source emits pulses at 10kHz, as compared to other photoablation lasers with repetitions around 100 Hz, our solution could potentially gain a factor of 100. Also,..." - L1082-1087 (Materials and Methods): "…, and imported in a python code. The control of the various elements is embedded and checked for this specific set of hardware. The code is available upon request."

      Adding these three paragraphs gives more details on how photoablation works thus improving the manuscript.

      Minor comments:

      Supplementary Fig 2. The reference to subpanels H and I is swapped.

      The references to subpanels H and I have been correctly swapped back in the reviewed version.

      Line 203: I would suggest to delete this sentence. Although a strength of the submitted paper is the direct comparison of the VoC model with the animal model to better replicate Neisseria infection, a direct comparison with animal permeability is not needed in all vascular engineering papers, as vascular permeability measurements in animals have been well established in the past.

      The sentence "While previously developed VoC platforms aimed at replicating physiological permeability properties, they often lack direct comparisons with in vivo values." has been removed from the revised text.

      Fig 3: Bacteria binding experiments. I would suggest the addition of more methodological information in the main results text to guarantee a good interpretation of the experiment. First, it would be better that wall shear stress rather than flow rate is described in the main text, as flow rate is dependent on the geometry of the vessel being used. Second, how long was the perfusion of Neisseria in the binding experiment performed to quantify colony doubling or elongation? As per figure 1C, I would guess than 100 min, but it would be better if this information is directly given to the readers.

      We thank Reviewer #1 for these two suggestions that will improve the text clarity (e.g., L316). (i) Indeed, we have changed the flow rate in terms of shear stress. (ii) Also, we have normalized the quantification of the colony doubling time according to the first time-point where a single bacteria is attached to the vessel wall. Thus, early adhesion bacteria will be defined by a longer curve while late adhesion bacteria by a shorter curve. In total, the experiment lasted for 3 hours (modifications appear in L318 and L321-326).}

      Fig 4: The honeycomb structure is not visible in the 3D rendering of panel D. I would recommend to show the actin staining in the absence of Neisseria staining as well.

      According to this suggestion, a zoom of the 3D rendering of the cortical plaque without colony had been added to the figure 4 of the revised manuscript.

      Line 421: E-selectin is referred as CD62E in this sentence. I would suggest to use the same terminology everywhere.

      We have replaced the "CD62E" term with "E-selectin" to improve clarity.}

      Line 508: "This difference is most likely associated with the presence of other cell types in the in vivo tissues and the onset of intravascular coagulation". Do the authors refer to the presence of perivascular cells, pericytes or fibroblasts? If so, it could be good to mention them, as well as those future iterations of the model could include the presence of these cell types.

      By "other cell types", we refer to pericytes [3], fibroblasts [4], and perivascular macrophages [5], which surround endothelial cells and contribute to vessel stability. The main text was modified to include this information (Lines 548 and 555-570) and their potential roles during infection disussed.

      Discussion: The discussion covers very well the advantages of the model over in vitro 2D endothelial models and the animal xenograft but fails to include limitations. This would include the choice of HUVEC cells, an umbilical vein cell line to study microcirculation, the lack of perivascular cells or limitations on the fabrication technique regarding application in other labs (if any).

      We thank Reviewer #1 for this suggestion. Indeed, our manuscript may lack explaining limitations, and adding them to the text will help improve it: - The perspectives of our model include introducing perivascular cells surrounding the vessel and fibroblasts into the collagen gel as discussed previously and added in the discussion part (L555-570). - Our choice for HUVEC cells focused on recapitulating the characteristics of venules that respect key features such as the overexpression of CD62E and adhesion of neutrophils during inflammation. Using microvascular endothelial cells originating from different tissues would be very interesting. This possibility is now mentioned in the discussion lines 567-568. - Photoablation is a homemade fabrication technique that can be implemented in any lab harboring an epifluorescence microscope. This method has been more detailed in the revised manuscript (L1085-1087).

      Line 576: The authors state that the model could be applied to other systemic infections but failed to mention that some infections have already been modelled in 3D bioengineered vascular models (examples found in https://doi.org/10.1038/s41564-024-01645-6). This includes a capillary photoablated vascular model to study malaria (DOI: 10.1126/sciadv.aay724).

      Thes two important references have been introduced in the main text (L84, 647, 648).}

      Line 1213: Are the 6M neutrophil solution in 10ul under flow. Also, I would suggest to rewrite this sentence in the following line "After, the flow has been then added to the system at 0.7-1 μl/min."

      We now specified that neutrophils are circulated in the chip under flow conditions, lines 1321-1322.

      Significance

      The manuscript is comprehensive, complete and represents the first bioengineered model of sepsis. One of the major strengths is the carful characterization and benchmarking against the animal xenograft model. Its main limitations is the brief description of the photoablation methodology and more clarity is needed in the description of bacteria perfusion experiments, given their complexity. The manuscript will be of interest for the general infection community and to the tissue engineering community if more details on fabrication methods are included. My expertise is on infection bioengineered models.

      Reviewer #2

      Evidence, reproducibility, and clarity

      Summary The authors develop a Vessel-on-Chip model, which has geometrical and physical properties similar to the murine vessels used in the study of systemic infections. The vessel was created via highly controllable laser photoablation in a collagen matrix, subsequent seeding of human endothelial cells and flow perfusion to induce mechanical cues. This vessel could be infected with Neisseria meningitidis, as a model of systemic infection. In this model, microcolony formation and dynamics, and effects on the host were very similar to those described for the human skin xenograft mouse, which is the current gold standard for these studies, and were consistent with observations made in patients. The model could also recapitulate the neutrophil response upon N. meningitidis systemic infection.

      Major comments:

      I have no major comments. The claims and the conclusions are supported by the data, the methods are properly presented and the data is analyzed adequately. Furthermore, I would like to propose an optional experiment could improve the manuscript. In the discussion it is stated that the vascular geometry might contribute to bacterial colonization in areas of lower velocity. It would be interesting to recapitulate this experimentally. It is of course optional but it would be of great interest, since this is something that can only be proven in the organ-on-chip (where flow speed can be tuned) and not as much in animal models. Besides, it would increase impact, demonstrating the superiority of the chip in this area rather than proving to be equal to current models.

      We have conducted additional experiments on infection in different vascular geometries now added these results figure 3/S3 and lines 288-305. We compared sheared stress levels as determined by Comsol simulation and experimentally determined bacterial adhesion sites. In the conditions used, the range of shear generated by the tested geometries do not appear to change the efficiency of bacterial adhesion. These results are consistent with a previous study from our group which show that in this range of shear stresses the effect on adhesion is limited [6] . Furthermore, qualitative observations in the animal model indicate that bacteria do not have an obvious preference in terms of binding site.

      Minor comments:

      I have a series of suggestions which, in my opinion, would improve the discussion. They are further elaborated in the following section, in the context of the limitations.

      • How to recapitulate the vessels in the context of a specific organ or tissue? If the pathogen is often found in the luminal space of other organs after disseminating from the blood, how can this process be recapitulated with this mode, if at all?

      • For reasons that are not fully understood, postmortem histological studies reveal bacteria only inside blood vessels but rarely if ever in the organ parenchyma. The presence of intravascular bacteria could nevertheless alter cells in the tissue parenchyma. The notable exception is the brain where bacteria exit the bacterial lumen to access the cerebrospinal fluid. The chip we describe is fully adapted to develop a blood brain barrier model and more specific organ environments. This implies the addition of more cell types in the hydrogel. A paragraph on this topic has been added (Lines 548 and 552-570).

      • Similarly, could other immune responses related to systemic infection be recapitulated? The authors could discuss the potential of including other immune cells that might be found in the interstitial space, for example.

      • This important discussion point has been added to the manuscript (L623-636). As suggested by Reviewer #2, other immune cells respond to N. meningitis and can be explored using our model. For instance, macrophages and dendritic cells are activated upon N. meningitis infection, eliminate the bacteria through phagocytosis, produce pro-inflammatory cytokines and chemokines potentially activating lymphocytes [7]. Such an immune response, yet complex, would be interesting to study in our model as skin-xenograft mice are deprived of B and T lymphocytes to ensure acceptance of human skin grafts.

      • A minor correction: in line 467 it should probably be "aspects" instead of "aspect", and the authors could consider rephrasing that sentence slightly for increased clarity.

      • We have corrected the sentence with "we demonstrated that our VoC strongly replicates key aspects of the in vivo human skin xenograft mouse model, the gold standard for studying meningococcal disease under physiological conditions." in lines 499-503.

        Strengths and limitations

      The most important strength of this manuscript is the technology they developed to build this model, which is impressive and very innovative. The Vessel-on-Chip can be tuned to acquire complex shapes and, according to the authors, the process has been optimized to produce models very quickly. This is a great advancement compared with the technologies used to produce other equivalent models. This model proves to be equivalent to the most advanced model used to date, but allows to perform microscopy with higher resolution and ease, which can in turn allow more complex and precise image-based analysis. However, the authors do not seem to present any new mechanistic insights obtained using this model. All the findings obtained in the infection-on-chip demonstrate that the model is equivalent to the human skin xenograft mouse model, and can offer superior resolution for microscopy. However, the advantages of the model do not seem to be exploited to obtain more insights on the pathogenicity mechanisms of N. meningitidis, host-pathogen interactions or potential applications in the discovery of potential treatments. For example, experiments to elucidate the role of certain N. meningiditis genes on infection could enrich the manuscript and prove the superiority of the model. However, I understand these experiments are time-consuming and out of the scope of the current manuscript. In addition, the model lacks the multicellularity that characterizes other similar models. The authors mention that the pathogen can be found in the luminal space of several organs, however, this luminal space has not been recapitulated in the model. Even though this would be a new project, it would be interesting that the authors hypothesize about the possibilities of combining this model with other organ models. The inclusion of circulating neutrophils is a great asset; however it would also be interesting to hypothesize about how to recapitulate other immune responses related to systemic infection.

      We thank Reviewer #2 for his/her comment on the strengths and limitations of our work. The difficulty is that our study opens many futur research directions and applications and we hope that the work serves as the basis for many future studies but one can only address a limited set of experiments in a single manuscript. - Experiments investigating the role of N. meningitidis genes require significant optimization of the system. Multiplexing is a potential avenue for future development, which would allow the testing of many mutants. The fast photoablation approach is particularly amenable to such adaptation. - Cells and bacteria inside the chambers could be isolated and analyzed at the transcriptomic level or by flow cytometry. This would imply optimizing a protocol for collecting cells from the device via collagenase digestion, for instance. This type of approach would also benefit from multiplexing to enhance the number of cells. - As mentioned above, the revised manuscript discusses the multicellular capabilities of our model, including the integration of additional immune cells and potential connections to other organ systems. We believe that these approaches are feasible and valuable for studying various aspects of N. meningitidis infection.

      Advance

      The most important advance of this manuscript is technical: the development of a model that proves to be equivalent to the most complex model used to date to study meningococcal systemic infections. The human skin xenograft mouse model requires complex surgical techniques and has the practical and ethical limitations associated with the use of animals. However, the Infection-on-chip model is completely in vitro, can be produced quickly, and allows to precisely tune the vessel's geometry and to perform higher resolution microscopy. Both models were comparable in terms of the hallmarks defining the disease, suggesting that the presented model can be an effective replacement of the animal use in this area.

      Other vessel-on-chip models can recapitulate an endothelial barrier in a tube-like morphology, but do not recapitulate other complex geometries, that are more physiologically relevant and could impact infection (in addition to other non-infectious diseases). However, in the manuscript it is not clear whether the different morphologies are necessary to study or recapitulate N. meningitidis infection, or if the tubular morphologies achieved in other similar models would suffice.

      We thank Reviewer #2 for his/her comment, also raised by reviewer 1. To answer this question, we have now infected vessel-on-chips of different geometries, to dissect the impact of flow distribution in N. meningitidis infection (Figures 3 and S3, explained in lines 288-307). In this range of shear stress, we show that bacterial infection is not strongly affected by geometry-induced shear stress variation. These observations are constistent with observations in flow chambers and qualitative observations of human cases and in the xenograft model [6].

      Audience

      This manuscript might be of interest for a specialized audience focusing on the development of microphysiological models. The technology presented here can be of great interest to researchers whose main area of interest is the endothelium and the blood vessels, for example, researchers on the study of systemic infections, atherosclerosis, angiogenesis, etc. Thus, the tool presented (vessel-on-chip) can have great applications for a broad audience. However, even when the method might be faster and easier to use than other equivalent methods, it could still be difficult to implement in another laboratory, especially if it lacks expertise in bioengineering. Therefore, the method could be more of interest for laboratories with expertise in bioengineering looking to expand or optimize their toolbox. Alternatively, this paper present itself as an opportunity to begin collaborations, since the model could be used to test other pathogen or conditions.

      Field of expertise: Infection biology, organ-on-chip, fungal pathogens.

      I lack the expertise to evaluate the image-based analysis.

      References:

      1. Gyohei Egawa, Satoshi Nakamizo, Yohei Natsuaki, Hiromi Doi, Yoshiki Miyachi, and Kenji Kabashima. Intravital analysis of vascular permeability in mice using two-photon microscopy. Scientific Reports, 3(1):1932, Jun 2013. ISSN 2045-2322. doi: 10.1038/srep01932.

      2. Valeria Manriquez, Pierre Nivoit, Tomas Urbina, Hebert Echenique-Rivera, Keira Melican, Marie-Paule Fernandez-Gerlinger, Patricia Flamant, Taliah Schmitt, Patrick Bruneval, Dorian Obino, and Guillaume Duménil. Colonization of dermal arterioles by neisseria meningitidis provides a safe haven from neutrophils. Nature Communications, 12(1):4547, Jul 2021. ISSN 2041-1723. doi:10.1038/s41467-021-24797-z.

      3. Mats Hellström, Holger Gerhardt, Mattias Kalén, Xuri Li, Ulf Eriksson, Hartwig Wolburg, and Christer Betsholtz. Lack of pericytes leads to endothelial hyperplasia and abnormal vascular morphogenesis. Journal of Cell Biology, 153(3):543–554, Apr 2001. ISSN 0021-9525. doi: 10.1083/jcb.153.3.543.

      4. Arsheen M. Rajan, Roger C. Ma, Katrinka M. Kocha, Dan J. Zhang, and Peng Huang. Dual function of perivascular fibroblasts in vascular stabilization in zebrafish. PLOS Genetics, 16(10):1–31, 10 2020. doi: 10.1371/journal.pgen.1008800.

      5. Huanhuan He, Julia J. Mack, Esra Güç, Carmen M. Warren, Mario Leonardo Squadrito, Witold W. Kilarski, Caroline Baer, Ryan D. Freshman, Austin I. McDonald, Safiyyah Ziyad, Melody A. Swartz, Michele De Palma, and M. Luisa Iruela-Arispe. Perivascular macrophages limit permeability. Arteriosclerosis, Thrombosis, and Vascular Biology, 36(11):2203–2212, 2016. doi: 10.1161/ATVBAHA. 116.307592.

      6. Emilie Mairey, Auguste Genovesio, Emmanuel Donnadieu, Christine Bernard, Francis Jaubert, Elisabeth Pinard, Jacques Seylaz, Jean-Christophe Olivo-Marin, Xavier Nassif, and Guillaume Dumenil. Cerebral microcirculation shear stress levels determine Neisseria meningitidis attachment sites along the blood–brain barrier . Journal of Experimental Medicine, 203(8):1939–1950, 07 2006. ISSN 0022-1007. doi: 10.1084/jem.20060482.

      7. Riya Joshi and Sunil D. Saroj. Survival and evasion of neisseria meningitidis from macrophages. Medicine in Microecology, 17:100087, 2023. ISSN 2590-0978. doi: https://doi.org/10.1016/j.medmic.2023.100087.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors develop a Vessel-on-Chip model, which has geometrical and physical properties similar to the murine vessels used in the study of systemic infections. The vessel was created via highly controllable laser photoablation in a collagen matrix, subsequent seeding of human endothelial cells and flow perfusion to induce mechanical cues. This vessel could be infected with Neisseria meningitidis, as a model of systemic infection. In this model, microcolony formation and dynamics, and effects on the host were very similar to those described for the human skin xenograft mouse, which is the current gold standard for these studies, and were consistent with observations made in patients. The model could also recapitulate the neutrophil response upon N. meningitidis systemic infection.

      Major comments

      I have no major comments. The claims and the conclusions are supported by the data, the methods are properly presented and the data is analyzed adequately. Furthermore, I would like to propose an optional experiment could improve the manuscript. In the discussion it is stated that the vascular geometry might contribute to bacterial colonization in areas of lower velocity. It would be interesting to recapitulate this experimentally. It is of course optional but it would be of great interest, since this is something that can only be proven in the organ-on-chip (where flow speed can be tuned) and not as much in animal models. Besides, it would increase impact, demonstrating the superiority of the chip in this area rather than proving to be equal to current models.

      Minor comments

      I have a series of suggestions which, in my opinion, would improve the discussion. They are further elaborated in the following section, in the context of the limitations. - How to recapitulate the vessels in the context of a specific organ or tissue? If the pathogen is often found in the luminal space of other organs after disseminating from the blood, how can this process be recapitulated with this mode, if at all? - Similarly, could other immune responses related to systemic infection be recapitulated? The authors could discuss the potential of including other immune cells that might be found in the interstitial space, for example. A minor correction: in line 467 it should probably be "aspects" instead of "aspect", and the authors could consider rephrasing that sentence slightly for increased clarity.

      Referee cross-commenting

      I agree with the rest of the comments, and also agree that the manuscript is already complete and could be published as it is.

      Significance

      Strengths and limitations

      The most important strength of this manuscript is the technology they developed to build this model, which is impressive and very innovative. The Vessel-on-Chip can be tuned to acquire complex shapes and, according to the authors, the process has been optimized to produce models very quickly. This is a great advancement compared with the technologies used to produce other equivalent models. This model proves to be equivalent to the most advanced model used to date, but allows to perform microscopy with higher resolution and ease, which can in turn allow more complex and precise image-based analysis. However, the authors do not seem to present any new mechanistic insights obtained using this model. All the findings obtained in the infection-on-chip demonstrate that the model is equivalent to the human skin xenograft mouse model, and can offer superior resolution for microscopy. However, the advantages of the model do not seem to be exploited to obtain more insights on the pathogenicity mechanisms of N. meningitidis, host-pathogen interactions or potential applications in the discovery of potential treatments. For example, experiments to elucidate the role of certain N. meningiditis genes on infection could enrich the manuscript and prove the superiority of the model. However, I understand these experiments are time consuming and out of the scope of the current manuscript. In addition, the model lacks the multicellularity that characterizes other similar models. The authors mention that the pathogen can be found in the luminal space of several organs, however, this luminal space has not been recapitulated in the model. Even though this would be a new project, it would be interesting that the authors hypothesize about the possibilities of combining this model with other organ models. The inclusion of circulating neutrophils is a great asset; however it would also be interesting to hypothesize about how to recapitulate other immune responses related to systemic infection.

      Advance

      The most important advance of this manuscript is technical: the development of a model that proves to be equivalent to the most complex model used to date to study meningococcal systemic infections. The human skin xenograft mouse model requires complex surgical techniques and has the practical and ethical limitations associated with the use of animals. However, the Infection-on-chip model is completely in vitro, can be produced quickly, and allows to precisely tune the vessel's geometry and to perform higher resolution microscopy. Both models were comparable in terms of the hallmarks defining the disease, suggesting that the presented model can be an effective replacement of the animal use in this area. Other vessel-on-chip models can recapitulate an endothelial barrier in a tube-like morphology, but do not recapitulate other complex geometries, that are more physiologically relevant and could impact infection (in addition to other non-infectious diseases). However, in the manuscript it is not clear whether the different morphologies are necessary to study or recapitulate N. meningitidis infection, or if the tubular morphologies achieved in other similar models would suffice.

      Audience

      This manuscript might be of interest for a specialized audience focusing on the development of microphysiological models. The technology presented here can be of great interest to researchers whose main area of interest is the endothelium and the blood vessels, for example, researchers on the study of systemic infections, atherosclerosis, angiogenesis, etc. Thus, the tool presented (vessel-on-chip) can have great applications for a broad audience. However, even when the method might be faster and easier to use than other equivalent methods, it could still be difficult to implement in another laboratory, especially if it lacks expertise in bioengineering. Therefore, the method could be more of interest for laboratories with expertise in bioengineering looking to expand or optimize their toolbox. Alternatively, this paper present itself as an opportunity to begin collaborations, since the model could be used to test other pathogen or conditions.

      Field of expertise: infection biology, organ-on-chip, fungal pathogens

      I lack the expertise to evaluate the image-based analysis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The work by Pinon et al describes the generation of a microvascular model to study Neisseria meningitidis interactions with blood vessels. The model uses a novel and relatively high throughput fabrication method that allows full control over the geometry of the vessels. The model is well characterized. The authors then study different aspects of Neisseria-endothelial interactions and benchmark the bacterial infection model against the best disease model available, a human skin xenograft mouse model, which is one of the great strengths of the paper. The authors show that Neisseria binds to the 3D model in a similar geometry that in the animal xenograft model, induces an increase in permeability short after bacterial perfusion, and induces endothelial cytoskeleton rearrangements. Finally, the authors show neutrophil recruitment to bacterial microcolonies and phagocytosis of Neisseria. The article is overall well written, and it is a great advancement in the bioengineering and sepsis infection field, and I only have a few major comments and some minor.

      Major comments:

      Infection-on-chip. I would recommend the authors to change the terminology of "infection on chip" to better reflect their work. The term is vague and it decreases novelty, as there are multiple infection on chips models that recapitulate other infections (recently reviewed in https://doi.org/10.1038/s41564-024-01645-6) including Ebola, SARS-CoV-2, Plasmodium and Candida. Maybe the term "sepsis on chip" would be more specific and exemplify better the work and novelty. Also, I would suggest that the authors carefully take a look at the text and consider when they use VoC or to current term IoC, as of now sometimes they are used interchangeably, with VoC being used occasionally in bacteria perfused experiments.

      Fig 3 and Suppmentary 3: Permeability. The authors suggest that early 3h infection with Neisseria do not show increase in vascular permeability in the animal model, contrary to their findings in the 3D in vitro model. However, they show a non-significant increase in permeability of 70 KDa Dextran in the animal xenograft early infection. This seems to point that if the experiment would have been done with a lower molecular weight tracer, significant increases in permeability could have been detected. I would suggest to do this experiment that could capture early events in vascular disruption.

      The authors show the formation of actin of a honeycomb structure beneath the bacterial microcolonies. This only occurred in 65% of the microcolonies. Is this result similar to in vitro 2D endothelial cultures in static and under flow? Also, the group has shown in the past positive staining of other cytoskeletal proteins, such as ezrin in the ERM complex. Does this also occur in the 3D system?

      One of the most novel things of the manuscript is the use of a relatively quick photoablation system. I would suggest that the authors add a more extensive description of the protocol in methods. Could this technique be applied in other laboratories? If this is a major limitation, it should be listed in the discussion.

      Minor comments:

      Supplementary Fig 2. The reference to subpanels H and I is swapped.

      Line 203: I would suggest to delete this sentence. Although a strength of the submitted paper is the direct comparison of the VoC model with the animal model to better replicate Neisseria infection, a direct comparison with animal permeability is not needed in all vascular engineering papers, as vascular permeability measurements in animals have been well established in the past.

      Fig 3: Bacteria binding experiments. I would suggest the addition of more methodological information in the main results text to guarantee a good interpretation of the experiment. First, it would be better that wall shear stress rather than flow rate is described in the main text, as flow rate is dependent on the geometry of the vessel being used. Second, how long was the perfusion of Neisseria in the binding experiment performed to quantify colony doubling or elongation? As per figure 1C, I would guess than 100 min, but it would be better if this information is directly given to the readers.

      Fig 4: The honeycomb structure is not visible in the 3D rendering of panel D. I would recommend to show the actin staining in the absence of Neisseria staining as well.

      Line 421: E-selectin is referred as CD62E in this sentence. I would suggest to use the same terminology everywhere.

      Line 508: "This difference is most likely associated with the presence of other cell types in the in vivo tissues and the onset of intravascular coagulation". Do the authors refer to the presence of perivascular cells, pericytes or fibroblasts? If so, it could be good to mention them, as well as those future iterations of the model could include the presence of these cell types.

      Discussion: The discussion covers very well the advantages of the model over in vitro 2D endothelial models and the animal xenograft but fails to include limitations. This would include the choice of HUVEC cells, an umbilical vein cell line to study microcirculation, the lack of perivascular cells or limitations on the fabrication technique regarding application in other labs (if any).

      Line 576: The authors state that the model could be applied to other systemic infections but failed to mention that some infections have already been modelled in 3D bioengineered vascular models (examples found in https://doi.org/10.1038/s41564-024-01645-6). This includes a capillary photoablated vascular model to study malaria ( DOI: 10.1126/sciadv.aay724).

      Line 1213: Are the 6M neutrophil solution in 10ul under flow. Also, I would suggest to rewrite this sentence in the following line "After, the flow has been then added to the system at 0.7-1 μl/min."

      Referee cross-commenting

      I agree with the other reviewer's comments. The manuscript is already very complete could be published without the addition of other experiments, but the ones I proposed could validate even more the in vitro model. For example the permeability with lower molecular weight tracers, could show that the changes in vessel permeability might already exist at early timepoints in the xenograft model, similarly than in the in vitro model.

      Significance

      The manuscript is comprehensive, complete and represents the first bioengineered model of sepsis. One of the major strengths is the carful characterization and benchmarking against the animal xenograft model. Its main limitations is the brief description of the photoablation methodology and more clarity is needed in the description of bacteria perfusion experiments, given their complexity. The manuscript will be of interest for the general infection community and to the tissue engineering community if more details on fabrication methods are included.

      My expertise is on infection bioengineered models.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their very thoughtful and insightful reviews. We have performed several new experiments and addressed their points in a revised manuscript, which has significantly improved the manuscript. Our detailed responses follow.


      Responses to Reviewer 1

      Major comments

      1. __ In the supplementary figures when looking at the KC vs KPC mice and the trichrome staining (S2) or looking at the muc5a and mucin levels in figure 2, the KPC mice appear to have a larger amount of PANIN formation than the KC mice which is usually indicative of further tumour progression that can occur in p53 null tumours. Due to the further progression of the tumour in the KPC mice, drugs such as nutlin-3a may have inhibitory effect on PANIN formation and nutlin results might therefore not be indicative of p53 dependence. This should at least be mentioned and discussed.__

      A: We appreciate the reviewer’s insightful comment. Indeed, in the KPC model, where PDAC progression is accelerated, Nutlin-3a may exhibit p53-independent effects. We now address this consideration in the revised manuscript on page 6, when describing our PanIN and Alcian blue results in KPC mice.

      __ When looking at how p53 controls the expression of acinar cell identity genes the authors look at MEFs when performing their GSEA (Figure 4a, b). The MEFs are used as a model for neoplastic cells but it would also be beneficial to test this in pancreatic cell lines. When looking at Bhlha15 expression (figure 4c) there is a decrease observed in the p53 containing vs knockout mice, this is from a data set GSE94566, it would be beneficial to test this in the KC and KPC mice the authors generated to see if the results can be validated. Results in 4d re mist expression should also be evaluated in KPC mice to prove p53 dependence. Finally, murine and human fibroblasts are treated with doxorubicin (figure 4e-f; figure s4b) to show Bhlha15 is upregulated, it would also be useful to show this either in pancreatic cell lines with/without p53 or from the murine tissue of the KC and KPC mice.__

      A: We appreciate the reviewer’s insightful comment. We recognize that the way we originally described the GSEA analysis may have inadvertently suggested that it was performed in RNA-seq from MEFs. To clarify, the GSEA analysis in Figure 4a is derived from RNA-seq of sorted precursor lesions from p53-proficient (KC) and p53-deficient (KPC) mice, not MEFs. We have revised the Results section that describes Figure 4 __to more clearly reflect this distinction. Additionally, we acknowledge the importance of confirming p53’s regulatory role over Bhlha15 expression in the pancreas. To support the findings from the GSE94566 dataset, which was generated using sorted precursor lesions from KC and KPC mice (Mello et al., 2017), we present a boxplot of Bhlha15 expression (__Figure 4c).In response to the reviewer’s suggestion, we incorporated Mist1 immunohistochemistry and quantification in KPC mice treated with Nutlin-3a or vehicle control (Figure 4b) to further validate the p53-dependent regulation of Mist1 expression. To strengthen the conclusions from Figures 4c–d, we also conducted complementary experiments in mouse-derived pancreatic cancer cell lines either proficient (KIC1 and KIC2, derived from Kras+/G12D; Pdx1-Cre; Cdkn2afl/fl mice) or deficient (KPC, derived from Kras+/G12D; Pdx1-Cre; Trp53fl/fl mice) for p53. These experiments aim to further substantiate the regulatory role of p53 in controlling Mist1 expression.

      __ It is unclear why varying numbers of mice have been used. For the majority of experiments, the authors use n=6 mock treated mice and n=3 or 4 nutlin-3a treated mice. For KPC mice n=4 and n=4 was used. N=3 for mock and nutlin-3a treated mice were used. Did some mice die unexpectedly during the experiment? It would be good to report this. Also, the smaller amount of animal models used even though it was n=3/the disparity between the control and nutlin treated may raise some question. Usually, 6 vs 3. Possibly testing this with some lesson common Kras mutants and increasing the time in which nutlin-3a is studied in the pancreatic tumours, can it constantly prevent tumour formation.__

      A: We appreciate the reviewer’s concern regarding the variation in cohort sizes across experiments. Several factors contributed to these differences. In some cases, mice were excluded due to health issues such as malocclusion and poor general condition. Additionally, a subset of animals was misgenotyped and later confirmed to lack the KrasG12D allele, necessitating their exclusion from the study. The KPC model in particular is challenging to breed due to the requirement for four specific alleles and its rapid progression toward pancreatic cancer, which can limit survival and experimental flexibility. Despite these limitations, our key experimental groups, such as those evaluating Amylase rescue upon Nutlin-3a treatment, Mist1 induction in ADM, and lineage tracing studies, maintained a statistical power of at least 80% based on our cohort sizes. We have now clarified these details in the Supplemental Materials and Methods to ensure transparency regarding animal exclusions and sample size variability. We also agree with the reviewer that assessing Nutlin-3a at later stages of tumorigenesis would strengthen our findings. To this end, we treated aging KC mice (6 months old), which accumulate ADM and PanINs due to chronic Kras activation (rather than pancreatitis), with Nutlin-3a for a week and analyzed them at 8 months. Treated mice showed increased normal acinar tissue and reduced high-grade PanINs. This new data, presented in Figure 5, highlights the sustained tumor-suppressive effect of p53 activation and suggests that it could delay or prevent PDAC onset.

      Minor comments:

      1. __ The text is mostly clear, apart from the results section around figure 4, where it is not always clear which material has been used for analysis when referring to a previous paper. The quantification graphs should be wider as they seem squished sometimes. Changing the colours to be darker would make these more easily identifiable as the pale blue/red are sometimes difficult to see.__

      A: We appreciate the reviewer’s feedback and agree with the reviewer that the results section for Figure 4 leaves some margin for misinterpretation. To address this, we have revised the text to improve clarity and ensure that the source of each dataset, condition, and cell line is explicitly stated. Additionally, we have added a Supplementary Materials and Methods section that provides detailed information about the datasets and experimental conditions used in Figure 4. We have also adjusted the quantification graphs to be wider, preventing them from appearing compressed, and modified the color scheme, using gray and white tones to improve visibility and contrast, making the data easier to interpret.

      __ a) Figure 4a and 4b can be moved to the supplementary figures. b). For the figure legend of S1 on the separate file for the supplemental figures there is no S1e mentioned but it is in the paper. c) Figure S1a needs a scale bar.__

      A: We appreciate the reviewer’s suggestions and have implemented all the requested changes:

      1. a) Figures 4a and 4b have been moved to the Supplementary Figures section.
      2. b) The Figure S1 legend in the supplemental file has been updated to include S1e, ensuring consistency with the main text.
      3. c) A scale bar has been added to Figure S1a for clarity.

      Responses to Reviewer 2

      Major comments

      1. __ The study implies that pharmacologically engaging wild-type p53 (for example, through Nutlin-3a) may serve as a strategy to prevent or significantly delay the onset of PDAC by preserving the normal acinar cell phenotype and blocking early metaplastic changes. Doing a search in Pubmed search, no such findings has been previously published. It is a very important findings as it paves the way to clinical trial. The data are of excellent quality and would support the conclusions but the experiments need additional control experiments to strengthen the conclusions.__

      A: We thank the reviewer for the positive assessment of our work.

      __ For all immunohistochemistry quantification: The authors should explain better how the scoring was performed. - the authors should present a range of positive staining (negative, Weak, medium, high). The authors should state the number of sections analysed and how many cells or nuclei in total were counted per section or ROI to define the percentage of positive cells/nuclei.__

      A: We appreciate the reviewer’s suggestion and have addressed this point by adding a detailed description of our immunohistochemistry quantification methodology in the Supplementary Materials and Methods. This includes a clear explanation of how positive staining was defined. Specifically, we did not use a categorical intensity scoring system (e.g., weak, medium, strong); instead, positive staining was determined based on signal levels clearly distinguishable from background noise, enabling reliable automated detection by the analysis software that we employed. Regarding sample size and quantification scope, we analyzed one representative section per individual in the cohort. For each section, either the entire tissue or specific ROIs, such as ADM or PanIN lesions, were annotated and quantified. The number of nuclei or cells evaluated per section varied depending on tissue size and ROI, and this is now described in the Supplementary Materials and Methods.

      __ In material methods: the antibodies concentration must be indicated in ug/ml.__

      A: We appreciate the reviewer’s suggestion and have updated the Materials and Methods section to include antibody concentrations in µg/ml.

      __ a) Figure1b, 1c must present the following control staining in addition to presented data: i) staining of non-treated pancreas (as negative control); ii) staining of pancreas treated with Nutlin only (not-treated with cerulein) to assess the effect of Nutlin in absence of Cerulein. b) Figure 4d: the authors should repeat the experiment in p53fl/fl mice to assess nutlin off-target effect. c) Figure S1 e) there is no legend for it. d) Figure S4: which p53 exon has been deleted by CRISPr. The sequences of the sgRNA are not indicated.__

      A: We appreciate the reviewer’s suggestions and have addressed all the requested changes. For Figures 1b and 1c, we have added the necessary control stainings, including (a) staining of non-treated pancreas as a negative control and (b) staining of pancreas treated with Nutlin-3a only (without cerulein) to assess the effect of Nutlin-3a in the absence of Cerulein (Figure S1c). For Figure 4d (now Figure 4b), we have included sections from p53-deficient (KPC) mice stained for Mist1 to evaluate potential off-target effects of Nutlin-3a. Our results show no Mist1 expression in the absence of p53, suggesting that Nutlin-3a-mediated upregulation of Mist1 in ADM is p53-dependent. Additionally, we have added a legend for Figure S1e. For Figure S4, we clarified that CRISPR interference (CRISPRi) was used in this experiment rather than gene deletion. As such, the sgRNA is not designed against a specific exon, but instead targets the promoter region of the TP53 gene to suppress its transcription. We have now included the sgRNA sequence used in Figure S4d for clarity.

      Responses to Reviewer 3


      1. __ The authors suggested, based on their data, that Mist1 may be transactivated by p53 "presumably directly, across distinct cell types and in different contexts, such as oncogenic stress and DNA damage." This statement is too speculative and that is noteworthy because the experiments to get at those potential functional details (including, e.g., gene interference, biochemical assays) are not particularly difficult and would significantly improve the manuscript.__

      A: We appreciate the reviewer’s feedback. Our original statement aimed to accurately reflect our findings without overinterpretation, as we identified a conserved p53 binding site in the Bhlha15 locus, observed p53 occupancy in published ChIP-seq datasets, and demonstrated p53-dependent expression of Mist1 at both RNA and protein levels. To further support this relationship, we expanded our analysis to include p53-proficient and p53-deficient mouse PDAC cell lines, confirming the dependency of Mist1 expression on p53 (Figure 4e). Additionally, we now show that Mist1 protein was detected in lesions of Nutlin-3a–treated KC mice, but not in KPC mice, further indicating that Mist1 induction is p53-dependent in vivo (Figure 4b). While we acknowledge that direct functional testing of the p53 binding site would further strengthen the mechanistic insight, the Bhlha15 locus contains multiple p53 ChIP-seq peaks, making it difficult to isolate the contribution of individual sites. For this reason, we believe that dissecting the precise binding events underlying p53-mediated regulation of Bhlha15 goes beyond the scope of the current study, but we agree it is a valuable direction for future work.

      __ The study did not explore a novel concept beside showing that ADM can be reversed by inhibiting p53, which though may sound novel is intuitive (the focus on Mist1 alone appear narrow too).__

      A: We respectfully disagree with the reviewer’s assessment. The prevailing view in the literature is that p53 suppresses pancreatic cancer primarily by preventing the progression from PanINs to PDAC, largely through the induction of senescence in precursor lesions (Caldwell et al. Oncogene 2012; Morton et al. PNAS 2010). However, whether p53 also plays a tumor-suppressive role at earlier stages, particularly in ADM, remains unclear. Our study provides evidence that p53 regulates ADM plasticity and acinar cell identity, expanding its known functions beyond senescence induction. Additionally, the role of p53 in maintaining tissue homeostasis through the regulation of differentiation programs is an emerging and underexplored concept. Our focus on Mist1 is well justified, as we observed a significant overlap between gene expression changes in p53-proficient (KC) and p53-deficient (KPC) precursor lesions and known Mist1-regulated genes (Fig. 4a), highlighting its potential as a key mediator of p53-dependent acinar cell identity maintenance. While Mist1 is a focal point of our study, the broader implication is that p53 plays an active role in controlling acinar cell fate, which challenges the conventional view of its function solely in later-stage tumor suppression.

      __ The RNA-seq and ChIP data may provide several opportunities to get at how p53 mediates the proposed effect on ADM and it would be worthwhile to leverage those data.__

      A: We appreciate the reviewer’s suggestion. Like the reviewer, we recognize that p53 has a vast downstream network, and while additional pathways may contribute to p53-mediated cell differentiation, we believe that investigations of other mechanisms involved in this process extends beyond the scope of this manuscript and would dilute the central message rather than strengthen it.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Twardowski et al. showed that the treatment of pancreatic cancer mice models with Nutlin-3a, a drug that allows p53 stabilization, limits tumor initiation by reversing acinar-to-ductal metaplasia (ADM), which is an early event in pancreatic cancer. This study is well done, and the figures are quite convincing. It also uses a combination of state-of-the-art mouse models, most notably the inducible model that allowed cell lineage tracing. However, it is generally descriptive and provides no mechanistic detail beside suggesting that p53 potentially drives the upregulation of the transcription factor Mist1 (Bhlha15) as part of the ADM to acinar differentiation process. The authors also suggested, based on their data, that Mist1 may be transactivated by p53 "presumably directly, across distinct cell types and in different contexts, such as oncogenic stress and DNA damage." This statement is too speculative and that is noteworthy because the experiments to get at those potential functional details (including, e.g., gene interference, biochemical assays) are not particularly difficult and would significantly improve the manuscript. Besides the above comments, the study did not explore a novel concept beside showing that ADM can be reversed by inhibiting p53, which though may sound novel is intuitive (the focus on Mist1 alone appear narrow too). The RNA-seq and ChIP data may provide several opportunities to get at how p53 mediates the proposed effect on ADM and it would be worthwhile to leverage those data. The study in its current state is descriptive and appears too preliminary.

      Significance

      Besides the above comments, the study did not explore a novel concept beside showing that ADM can be reversed by inhibiting p53, which though may sound novel is intuitive (the focus on Mist1 alone appear narrow too). The RNA-seq and ChIP data may provide several opportunities to get at how p53 mediates the proposed effect on ADM and it would be worthwhile to leverage those data. The study in its current state is descriptive and appears too preliminary.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript entitled:" Drug-Induced p53 Activation Limits Pancreatic Cancer Initiation. " Twardowski et al., investigate using mouse animal models the impact of pharmacological stabilization of the wild-type p53 protein on the formation of acinar-to-ductal metaplasia (ADM) in a KrasG12D-driven mouse model of Pancreatic ductal adenocarcinoma (PDAC). The authors mostly performed immunohistochemistry to assess the differentiation status in response to treatment. The authors claims that they demonstrate that p53 stabilisation via Nutlin-3a treatment, an inhibitor of its ubiquitin ligase MDM2, significantly reduces both ADM and the formation of precursor lesions, such as pancreatic intraepithelial neoplasia (PanIN) by promoting the differentiation of ADM into acinar cells. The authors claim that the differentiation is concomitant with p53-dependent induction of the transcription factor Mist1 (also named Bhlha15), a critical inducer of acinar cell formation. The authors conclude that their data reveal a role for p53 in promoting the re-differentiation of ductal metaplasia in healthy acinar cells, preventing ductal-metaplasia to progress to Pancreatic ductal adenocarcinoma.

      Significance

      The study implies that pharmacologically engaging wild-type p53 (for example, through Nutlin-3a) may serve as a strategy to prevent or significantly delay the onset of PDAC by preserving the normal acinar cell phenotype and blocking early metaplastic changes. Doing a search in Pubmed search, no such findings has been previously published. It is a very important findings as it paves the way to clinical trial.

      The data are of excellent quality and would support the conclusions but the experiments need additional control experiments to strengthen the conclusions.

      Here are the major points that preclude publication as it is

      • For all immunohistochemistry quantification: The authors should explain better how the scoring was performed. - the authors should present a range of positive staining (negative, Weak, medium, high). The authors should state the number of sections analysed and how many cells or nuclei in total were counted per section or ROI to define the percentage of positive cells/nuclei.
      • In material methods: the antibodies concentration must be indicated in ug/ml.
      • Figure1b, 1c must present the following control staining in addition to presented data

      a. staining of non-treated pancreas (as negative control)

      b. staining of pancreas treated with Nutlin only (not-treated with cerulein) to assess the effect of Nutlin in absence of Cerulein - Figure 4d: the authors should repeat the experiment in p53fl/fl mice to assess nutlin off-target effect - Figure S1 e) there is no legend for it - Figure S4: which p53 exon has been deleted by CRISPr. The sequences of the sgRNA are not indicated.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The key findings in this paper are that using nutlin-3a to stabilise p53 a reduction of the formation of ADM and PanIN in the KrasG12D driven mouse model of PDAC are observed. They show that as p53 is stabilised by nutlin-3a ADM cells are differentiated into acinar cells, corresponding with a p53 dependent upregulation of Mist1. To show these results the authors utilised multiple mouse models and induced pancreatic damage/oncogenic stress via the injection of cerulein. Histological sections of the pancreas in the various mouse models were stained and quantified to allow the authors to come to their conclusions. The methods are presented sufficiently. The statistical analysis was adequate for this work. References are appropriate.

      Major comments:

      1. In the supplementary figures when looking at the KC vs KPC mice and the trichrome staining (S2) or looking at the muc5a and mucin levels in figure 2, the KPC mice appear to have a larger amount of PANIN formation than the KC mice which is usually indicative of further tumour progression that can occur in p53 null tumours. Due to the further progression of the tumour in the KPC mice, drugs such as nutlin-3a may have inhibitory effect on PANIN formation and nutlin results might therefore not be indicative of p53 dependence. This should at least be mentioned and discussed.
      2. When looking at how p53 controls the expression of acinar cell identity genes the authors look at MEFs when performing their GSEA (Figure 4a, b). The MEFs are used as a model for neoplastic cells but it would also be beneficial to test this in pancreatic cell lines. When looking at Bhlha15 expression (figure 4c) there is a decrease observed in the p53 containing vs knockout mice, this is from a data set GSE94566, it would be beneficial to test this in the KC and KPC mice the authors generated to see if the results can be validated. Results in 4d re mist expression should also be evaluated in KPC mice to prove p53 dependence. Finally, murine and human fibroblasts are treated with doxorubicin (figure 4e-f; figure s4b) to show Bhlha15 is upregulated, it would also be useful to show this either in pancreatic cell lines with/without p53 or from the murine tissue of the KC and KPC mice.
      3. It is unclear why varying numbers of mice have been used. For the majority of experiments, the authors use n=6 mock treated mice and n=3 or 4 nutlin-3a treated mice. For KPC mice n=4 and n=4 was used. N=3 for mock and nutlin-3a treated mice were used. Did some mice die unexpectedly during the experiment? It would be good to report this.

      Minor comments:

      1. The text is mostly clear, apart from the results section around figure 4, where it is not always clear which material has been used for analysis when referring to a previous paper. The quantification graphs should be wider as they seem squished sometimes. Changing the colours to be darker would make these more easily identifiable as the pale blue/red are sometimes difficult to see.
      2. Figure 4a and 4b can be moved to the supplementary figures.
      3. For the figure legend of S1 on the separate file for the supplemental figures there is no S1e mentioned but it is in the paper.
      4. Figure S1a needs a scale bar.

      Significance

      The study is a conscience investigation into how p53 is involved in pancreatic cancer initiation and how this can be reduced by over activation of p53. A strong point of the study is the generation of the genetic lineage mouse model. This allowed the authors to persistently label ADM cells and trace their progeny. This experiment provided strong evidence that nutlin-3a treatment can indeed reverse acini to ADM formation and prevent PanIN formation. Some of the limitations of the study involve relying on mainly immunohistochemistry to show changes in protein level in mouse tissue, western blotting could be used in conjunction with this to further validate the claims put forward in the paper. Also, the smaller amount of animal models used even though it was n=3/the disparity between the control and nutlin treated may raise some question. Usually, 6 vs 3. Possibly testing this with some lesson common Kras mutants and increasing the time in which nutlin-3a is studied in the pancreatic tumours, can it constantly prevent tumour formation.

      Other work in this area has looked at nutlin-3a and its effect on NSCLC with a Kras mutant (https://pubmed.ncbi.nlm.nih.gov/38093368/) and has shown that nutlin-3a is able to induce cell death in Kras mutant NSCLC cells. This paper also builds on work by (https://pmc.ncbi.nlm.nih.gov/articles/PMC5730340/#S9) who looks at NRF2-mediated induction of MDM2 and accumulation of p62 leading to PDAC and how inhibition of MDM2 by nutlin-3a may reduce this progression and this is shown in the present paper. The study for review advances using MDM2 inhibitors such at nutlin-3a in a clinical manner by looking at how it affects the progression of PDAC in mice and starts to elucidate the interactions which cause this to happen.

      The research present is specialised research that will hopefully be able to be translated to the clinic, if the use of nutlin-3a is able to prevent progression to PDAC in a mouse it would be useful to see if this also possible in patient derived primary cell lines to further elucidate the mechanism of this work.

      My expertise: P53, mouse work, lung cancer.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewer for their constructive comments and the fair and interesting discussion between reviewers.

      __Reviewer #1 __

      We are delighted to read that the reviewer finds the manuscript “very clear and of immediate impact […] and ready for publication” regarding this aspect. We have toned down the conclusion, proposing rather than concluding that “the incapacitation of Cmg2[KO] intestinal stem cells to function properly […] is due to their inability to transduce Wnt signals”.

      We have addressed the 3 points that were raised as well as the minor comments.

      Point #1

      The mouse mutant is just described as 'KO', referring to the previous work by the authors. The cited work simply states that this is a zygotic deletion of exon 3, which somehow leads to a decrease in protein abundance that is almost total in the lung but not so clear in the uterus. Exon 3 happens to be 72 bp long [https://www.ncbi.nlm.nih.gov/nuccore/NM_133738], so its deletion (assuming there are no cryptic splicing sites used) leads to an internal in-frame deletion of 24 amino acids. So, at best, this 'KO' is not a null, but a hypomorphic allele of context-dependent strength.

      Unfortunately, neither the previous work nor this paper (unless I have missed it!) contains information provided about the expression levels of Cmg2 in the intestine of KO mice - nor which cell types usually express it (see below). I think that using anti Cmg2 in WB and immunohistofluorescence of with ISC markers with intestine homogenate/sections of wild-type and mutant mice would be necessary to set the stage for the rest of the work.

      We now provide and explanation and characterization the Cmg2KO mice. Exon 3 indeed only encodes a short 24 amino acid sequence. This exon however encodes a ß-strand that is central to the vWA domain of CMG2, and therefore critical for the folding of this domain. As now shown in Fig. S1c, CMG2Dexon3 is produced in cells but cleared by the ER associated degradation pathway, therefore it is only detectable in cells treated with the proteasome inhibitor MG132, at a slightly lower molecular weight than the full-length protein. This is consistent, and was inspired by the fact that multiple Hyaline Fibromatosis missense mutations that map to the vWA domain lead to defective folding of CMG2, further illustrating that this domain is very vulnerable to modifications. In Fig. S1c, we moreover now show immunoprecipitation of Cmg2 from colonic tissue of wild-type (WT) and knockout (KO) mice, which confirm the absence of Cmg2 protein in Cmg2KO samples.

      Point #2

      Connected to the previous point, the expression pattern of Cmg2 in the intestine is not described. Maybe this is already established in the literature, but the authors do not refer to the data. This is important when considering that the previous work of the authors suggests that Cmg2 might contribute to Wnt signalling transduction through physical, cis interactions with the Wnt co-receptor LRP6. Therefore, one would expect that Cmg2 would be cell-autonomously required in the intestinal stem cells.

      The expression pattern of Cmg2 in the gut has not been characterized and is indeed essential to understanding its function. To address this gap, we now added a figure (Fig. 1) providing data from publicly available RNA-seq datasets and from our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      In the RESULTS section we now mention:

      To investigate Cmg2 expression in the gut, we first analyzed publicly available spatial and scRNA-seq datasets to identify which cell types express Cmg2 across different gut regions. Spatial transcriptomic data from the mouse small intestine and colon revealed that Cmg2 is broadly expressed throughout the gut, including in the muscular, crypt, and epithelial layers (Fig. 1A–C). To validate these findings, we performed RNAscope in situ hybridization targeting Cmg2 in the duodenum and colon of wild-type mice. The expression pattern observed was consistent with the spatial transcriptomics data (Fig. 1D–E). We then analyzed scRNA-seq data from the same dataset to assess cell-type-specific expression in the mouse colon. Cmg2 was detected at varying levels across multiple cell types, including enterocytes and intestinal stem cells, as well as mesenchymal cells, notably fibroblasts.

      Of note for the reviewer, not mentioned in the manuscript, this wide-spread distribution of Cmg2 across the different cell types is not true for all organs. We have recently investigated the expression of Cmg2 in muscle and found that it is almost exclusively expressed in fibroblasts (so-called fibro-adipocyte progenitors) and very little in any other muscle cells, in particular fibers.

      Interestingly also, as now mentioned in the manuscript and shown in Fig. S1,the ANTXR1 protein, which is highly homologous to Cmg2 at the protein level and share its function of anthrax toxin receptor, displayed a much more restricted expression pattern, being confined primarily to fibroblasts and mural cells, and notably absent from epithelial cells. This differential expression highlights a potentially unique and epithelial-specific role for Cmg2 in maintaining intestinal homeostasis.

      Point #3

      The authors establish that the regenerating crypts of Cmg2[KO] mice are unable to transduce Wnt signalling, but it is not clear whether this situation is provoked by the DSS-induce injury or existed all along. Can Cmg2[KO] intestinal stem cells transduce Wnt signalling before the DSS challenge? If they were, it might suggest that the 'context-dependence' of the Cmg2 role in Wnt signalling is contextual not only because of the tissue, but because of the history of the tissue or its present structure. It would also suggest that Cmg2 mutant mice, unless reared in a germ-free facility for life, would eventually lose intestinal homeostasis, and maybe suggest the level of intervention/monitoring that HFS patients would require. It might also provide an explanation in case Cmg2 was not expressed in ISCs - if the state of the tissue was as important as the presence of the protein, then the effect on Wnt transduction could be indirect and therefore it might not be required cell-autonomously.

      We agree that understanding whether Cmg2KO intestinal stem cells are intrinsically unable to transduce Wnt signals, or whether this defect is contextually induced following injury (such as DSS treatment), is a critical point.

      As a first line of evidence, we show than under homeostatic condition, Wnt signaling appears largely intact in Cmg2KO crypts, with comparable levels of ß-catenin and expression levels of canonical Wnt target genes (e.g., Axin2, Lgr5) to those observed in WT animals (Figs. S1j-l and S3d-e). This indicates that Cmg2 is not essential for basal Wnt signaling under steady-state conditions.

      These findings thus support the idea that the requirement for Cmg2 in Wnt signal transduction is context-dependent—not only at the tissue level but also temporally, being specifically required during regenerative processes or in altered microenvironments such as during inflammation or epithelial damage. This context-dependence may reflect changes in the composition or accessibility of Wnt ligands, receptors, or matrix components during repair, where Cmg2 could play a scaffolding or stabilizing role.

      These aspects are now discussed in the text.

      I think points 1 and 2 are absolutely fundamental in a reverse genetics investigation. Point 3 would be nice to know but the outcome would not change the tenet of the paper. I believe that the work needed to deal these points can be performed on archival material. I do not think the mechanism proposed can be taken from 'plausible' to 'proven' without proposing substantial additional investigation, so I will not suggest any of it, as it could well be another paper.

      We have addressed points 1 and 2, and provided evidence and discussion for Point 3.

      __Minor points __

      1- Figure 1 legend says "In (c), results are mean {plus minus} SEM" - this seems applicable to (d) as (c) does not show error whiskers.

      We thank the reviewer for picking up this error. We modified : “In (c), results are median” and “In (d, f and g) Results are mean ± SEM.”

      2- Figure 1 legend says "(d) Body weight loss, (f) the aspect of the feces and presence of occult blood were monitored and used for the (e) DAI. Results are mean {plus minus} SEM. Each dot represents the mean of n = 12 mice per genotype". This part looks like has suffered some rearrangement of words. The first instance of (f) should be (e), I guess, and I am not sure what "(e) DAI" means. And for (e), "mean {plus minus} SEM" does not seem applicable. This needs some light revision.

      The legend was clarified as followed : “(d) __Body weight loss, and (e) aspect of the feces and presence of occult blood were monitored and used to evaluate Disease activity index in (f).__

      3 - Figure 1H legend does not say which statistical test was made in the survival experiment in (h) - presumably log-rank? A further comment on the survival statistics: euthanised animals should not be counted towards true mortality when that is what is recorded as an 'event'. They should be right-censored. However, in this case, reaching the euthanasia criterion is just as good an indicator of health as mortality itself. So, simply by changing the Y axis from 'survival' to 'event-free survival' (or something to that effect), where 'events' are either death or reaching the euthanasia criterion, leaves the analysis as it is, and authors do not need to clarify that figure 1H shows "apparent mortality", as it is straightforward "complication-free survival" (just not entirely orthogonal to weight loss).

      The Y axis was changed from 'survival' to “percentage of mice not reaching the euthanasia criterion”.

      4 - Some density measurements are made unnecessarily on arbitrary units (per field of view) - this should be simple to report in absolute measures (i.e. area of tissue screened or, better still, length of epithelium screened).

      Because the aera of tissue can vary significantly between damages, regenerating and undamaged tissue, we reported the length of epithelium screened as suggested : “per 800um tissue screened” in Fig S1c and Fig 2b.

      5 - Figure 2E should read "percent involvement"

      This has been corrected.

      6 - Figure 2J should read "lipocalin..."

      This has been corrected.

      7 - In section "CMG2 Is Dispensable for YAP/TAZ-Mediated Reprogramming to Fetal-Like Stem Cells", the authors write ""We measured the mRNA levels of two additional YAP target genes, Cyr61 and CTGF...". I presume the "additional" is because Ly6a is also a target of YAP/TAZ, but if the reader does not know, it is puzzling. I would suggest to make this link explicit.

      We added : “In addition to the fetal-like stem cell marker Ly6a, which is a YAP/TAZ target gene, we measured the mRNA levels of two others YAP target genes, Cyr61 and CTGF”

      8 - In Figures S2, 3 and S3, I think that the measures expressed as "% of homeostatic X in WT" really mean "% of average homeostatic X in WT". This should be made clear somewhere.

      We added: “Dotted line represents the average homeostatic levels of Cmg2 WT” in figure legends

      9 - In panel C, the nature of the data is not entirely clear. First, the corresponding part of the legend says "Representative images of n=4 mice per genotype" which I presume should refer to panel B. Then, the graph plots 4 data points, which suggests that they correspond to 4 mice - but how many fields of view? Also, the violin plot outline is not described - I presume it captures all the data points from the coarse-grained pixel analysis, but it should be clarified.

      It was modified as suggested : “(c) Results are presented as violin plot of the Ly6a mean intensity of all data points from the coarse-grain analysis. Each symbol represents the mean per mice of n=4 mice per condition. Results are mean ± SEM. Dotted line represents the average homeostatic levels of Cmg2WT. P values obtained by two-tailed unpaired t test.”

      10 - In Figure 3H and 3I, I would suggest to add the 7+3 timepoint where the data come from.

      We unfortunately do not understand the suggestion of the reviewer, given that these panels show the 7+3 time point.

      11 - In section "CMG2 Is Critical for Restoring the Lgr5+ Intestinal Stem Cell Pool", the authors say "...The mRNA levels of ... LRP6, β-catenin (Fig. S3a-b), and Wnt ligands (Wnt5a, 5b, and 2b) were comparable between the colons of Cmg2WT and Cmg2KO mice (Fig. S3c)..." without clarifying in which context - one needs to read the figure legend to realise this is "timepoint 7+3". I suggest to add "in the recovery phase" or "in regenerating colons" or something shorter, just to guide the reader.

      We added : “Initially, we quantified the expression of key molecular components involved in Wnt signaling in mice colon 3 days after DSS withdrawal using qPCR.”

      12 - Like with the previous point, it is not clear when the immunohistofluorescence of B-catenin is made - not even in the legend, as far as I could see. The only hint is that authors say "the nuclei of cells in the atrophic crypts of Cmg2KO..." with 'atrophic' probably indicating again the 7+3 timepoint.

      We have changed the text and now mention “Next, we analyzed β-catenin activation in the colon of Cmg2WT and Cmg2KO mice during the recovery phase.”

      13 - A typo in the discussion: tunning for tuning.

      This has been corrected.

      14 - In the discussion, the authors talk about the 'CMG2' protein (all caps - formatting convention for human proteins) but before they were referring to 'Cmg2' (formatting convention for mouse proteins). That is fine but some of the statements where "CMG2" is used clearly refer to observations made in the mouse.

      We have now used Cmg2, whenever referring to the mouse protein.

      15 - Typos in methods: "antigen retrieval by treating [with] Proteinase K"; "Image acquisition and analyze [analysis]"; "All details regarding code used for immunofluorescence analysis”.

      This has been corrected.

      __Reviewer #2 __

      We are very pleased to read that the reviewer found the study “overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted”.

      For this reader, one additional thought comes to mind. If I understand the field correctly it would be informative to know with greater confidence where - in what cell type, epithelial or mesenchymal - the CMG2-LRP6-WNT interaction occurs.

      This point was also raised by Reviewer I, and we have now added a new Figure 1, that describes Cmg2 expression in the gut, based both on from publicly available RNA-seq datasets and our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      After injury the CMG2-KO mouse epithelium exhibits defective WNT signal transduction - as evidenced by failure of b-catenin to translocate into the nucleus. At first glance, this result is a disconnect with the paper by van Rijin that claims the defect in Hyaline Fibromatosis Syndrome cannot be due to loss of CMG2 expression/function in the barrier epithelial cell - a claim based on the mostly normal phenotypes of human CMG2 KO duodenal organoids. But the human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated.

      We thank the reviewer for raising this point and we fully agree. We now specify in the text that the human CMG2-KO duodenoids showed blisters, indeed consistent with a defect in epithelial structure/function, and that they were grown on high Wnt media which likely obscure the CMG2 requirement.

      I think it would be informative to prepare colon organoids (and duodenoids) from WT and CMG2-KO mice to quantify their WNT dependency during establishment and maintenance of the stem-like (and WNT-dependent) state. If CMG2 acts within the epithelial cell to affect WNT signaling (regardless of WNT source), organoids prepared from colons of CMG2-KO mice would require more WNT in culture media to establish and maintain the stem cell proliferative state - when compared to organoids prepared from WT mice. This can be quantified (and confirmed molecularly by transgene expression if successful). Enhanced dependency of high concentrations of exogenous WT would be evidence for a primary defect in WNT-(LRP2)-CMG2 signal transduction localized to the epithelial barrier cell - thus addressing the apparent discrepancy with the van Rijin paper - and for my part, advancing the field. And the discovery of a defect in the epithelium itself for WNT signal transduction would implicate a biologically most plausible mechanism for development of protein losing enteropathy.

      By no means do I consider these experiments to be required for publication (especially if considered to be incremental or already defined - WNT-CMG2 is not my field of research). This study already makes a meaningful contribution to the field as I state above. But in the absence of new experimentation, the issue should probably be discussed in greater depth.

      We are working out conditions to grow colon organoids that from WT and Cmg2 KO mice, indeed playing around with the concentrations of Wnt in the various media to identify those that would best mimic the regeneration conditions. This is indeed a study in itself. We have however included a discussion on this point in the manuscript as suggested.

      __Reviewer #3: __

      We thank the reviewer for her/his insightful comments.

      The premise is that the causative germline mutated gene, CMG2/ANTRX2, may have a functional role in colonic epithelium in addition to controlling the ECM composition. There is little background information but one study has shown no primary defect in epithelial organoids grown from patients with the syndrome. This leads the authors to wonder if non-homeostatic, conditions might reveal a function role for the gene in regeneration.

      Reviewer 2 commented on the fact that “human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated”.

      We have now added a discussion on this point in the manuscript.

      The authors' approach to test the hypothesis is to use a mouse germline knockout model and to induce colitis and regeneration by the established protocol of introducing dextran sodium sulfate (DSS) into the drinking water for five days. In brief there is no phenotype apparent in the untreated knockout (KO) but these animals show a more severe response to DSS that requires them to be killed by 10 days after the start of treatment. This effect following phenotypic characterisation of the colonic epithelium is interpreted as showing the CMG2 is a Wnt modifier required for the restoration of the intestinal stem cell population in the final stages of repair.

      The experiment and analysis seem reasonably well executed - although a few specific comments follow below. The narrative is simple and easy to understand. However, there are significant caveats that cast doubts on the interpretation made that loss of CMG2 impairs the transition of colonic epithelial cells from a fetal like state to adult ISCs.

      First there is only a single approach and single type of experiment performed. There is a lack of independent validation of the phenotype and how it is mediated.

      We do not fully understand what type of independent validation of the phenotype the reviewer would have liked to see. Is it the induction of intestinal damage using a stress other than DSS?

      The DSS dose in this kind of experiment is often determined empirically in individual units. Here the 3% used is within published range but at upper end. The control animals show a typical response with symptoms of colitis worsening for 2-3 days after the removal of DSS and then recovery commonly over another 5-7 days. Here the CMG2 KO mice fail to recover and are killed by 9 or 10 days. The authors attempt to exploit the time course by identifying normal initial (7days) and defective late (10days) repair phases in KO animals when compared to controls. It is from this comparison that conclusions are drawn. However, the alternative interpretation might be that the epithelium of KO animals is so badly damaged, and indeed non-existent (from viewing Fig2a), that it is incapable of mounting any other response other than death and that the profiling shown is of an epithelium in extremis. The repair capability and dynamics of the KO would have been better tested under more moderate DSS challenge, if this experiment had been regarded as a pilot rather than as definitive.

      The choice of 3% DSS was in fact based on a pilot experiment. As now shown in Fig. S4, we tested different concentrations and found that 3% DSS was the lowest concentration that reliably induced the full spectrum of colitis-associated symptoms, including significant body weight loss, diarrhea, rectal bleeding (summarized in the Disease Activity Index), as well as macroscopic signs such as colon shortening and spleen enlargement. Based on these criteria, we selected 3% DSS for the study described in the manuscript.

      In this model, WT mice showed a typical progression: body weight stabilized rapidly after DSS withdrawal, with resolution of diarrhea and rectal bleeding. Histological analysis at day 9 revealed signs of epithelial regeneration, including hypertrophic crypts and increased epithelial proliferation.

      In contrast, Cmg2KO mice failed to initiate this recovery phase. Clinical signs such as weight loss, diarrhea, and bleeding persisted after DSS withdrawal, ultimately necessitating euthanasia at day 9–10 due to humane endpoint criteria. Unfortunately, this prevented us from exploring later timepoints to determine whether regeneration was delayed or completely abrogated in the absence of Cmg2.

      Regarding the severity of epithelial damage, as raised by Reviewer 1, we now provide detailed histological scoring in the supplementary data. This analysis shows that the severity of inflammation and crypt damage was similar between WT and KO animals, as were inflammatory markers such as Lipocalin-2. The key difference lies in the extent of tissue involvement. While the lesions in WT mice were more localized, Cmg2KO mice displayed widespread and diffuse damage with no sign of regeneration as shown by the absence of hypertrophic crypts and a marked reduction in both epithelial coverage and proliferative cells. Importantly, at day 7, the percentage of epithelial and proliferating cells was comparable between genotypes, further supporting the idea that Cmg2KO mice failed to initiate this recovery phase and present a defective repair response.

      The animals used were young (8 weeks) and lacked any obvious defect in collagen deposition. Does this change with treatment? Even if not, is it possible that there is a defect in peristalsis or transit time of gut contents, resulting in longer dwell times and higher effective dose of DSS to the KO epithelium?

      Collagen deposition, particularly of collagen VI, is known to increase in response to intestinal injury and plays a critical role in promoting tissue repair following DSS-induced damage (Molon et al., PMID: 37272555). As suggested, we investigated whether Cmg2KO mice exhibit abnormal collagen VI accumulation following DSS treatment.

      Our results show that, consistent with published data, WT mice exhibit a marked increase in collagen VI expression during the acute phase of colitis, with levels returning toward baseline following DSS withdrawal. A similar expression pattern was observed in Cmg2KO mice, with no significant differences in Col6a1 mRNA levels between WT and KO animals throughout the entire time course of the experiment. This observation was further confirmed at the protein level by western blot and immunohistochemistry analyses, suggesting that the impaired regenerative capacity observed in Cmg2KO mice is independent of Collagen VI.

      Regarding the possibility of altered peristalsis or intestinal transit time contributing to increased DSS exposure in KO mice, this is indeed a possibility. Although we did not directly measure gut motility in this study, we did not observe any signs of intestinal obstruction or fecal retention in Cmg2KO mice. Indeed, during the experiment, animals were single caged for 30min in order to collect feces and no difference in the amount of feces collected was observed between WT and KO mice, arguing against a substantial difference in transit time (see figure below). The possible altered peristalsis and these observations are now mentioned in the discussion.

      Is CMG2 RNA and protein expressed in the colonic epithelium? It is not indicated or tested in the submitted manuscript. This reviewer struggled to find evidence, notably it did not seem to be referenced in the organoid paper they reference in introduction (ref 13).

      This very valid point was also raised by Reviewers 1 and 2. The expression pattern of Cmg2 in the gut has indeed not been characterized and is essential to understanding its function. To address this gap, we added a figure (Fig. 1) providing data from publicly available RNA-seq datasets and from our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      __Specific comments: __

      Figure 3 c-e and associated text are confusing. In c the Y scale seems inappropriate to show percentages up to 15,000%.

      In this graph values are normalized to homeostatic level of WT mice which represent 100%

      In d and e the use of percentages may by correct. However, it is claimed in text that Cty61 and CTFG are upregulated in the KO. That is not what the plots appear to show as the compare to WT untreated cells, in which case the KO have not downregulated these genes in the way the controls have.

      As clarified in the text, under regenerative conditions, a transient activation of YAP signaling is crucial to induce a fetal-like reversion of intestinal stem cells. However, in a subsequent phase, the downregulation of YAP and the reactivation of Wnt signaling are necessary to complete intestinal regeneration. Several studies have highlighted a strong interplay between the Wnt and YAP pathways, suggesting that their coordinated regulation is essential for effective gut repair. Nevertheless, the precise mechanisms governing this interaction remain incompletely understood.

      In our model, this critical transition—YAP downregulation and Wnt reactivation—appears to be impaired. CMG2 may either hinder Wnt reactivation directly, or lead to sustained YAP signaling, which in turn suppresses activation of the Wnt pathway. Further studies, using in-vivo model and organoid models, will be necessary to understand the mechanistic role of Cmg2 in this regulatory process.

      A precision of the figure has been updated as followed: both of which were significantly upregulated in the injured colons of Cmg2KO mice compared to DSS-injured Cmg2WT mice

      __**Referees cross-commenting** __

      Rev2 Points 1 and 2 made by Referee 1 (and point 4 of Referee 3) appear most reasonable, and if not already done should be.

      We have indeed addressed these 2 points.

      I also noted the more severe morphology of DSS damaged epithelium shown in Fig 2a noted by Referee 3 - and this I agree is a confounding factor. […] For my part, the concern is understandable but likely not operating in a confounding way. And the evidence for the reprogramming of the damaged epithelium into "fetal-like stem cells" (the 1st step in restitution of lost stem cells) occurs in both WT and KO mice - and these data are strong. For this reader, the block convincingly shows up for KO mouse at the WNT dependent step

      The representative image has been updated, and a transverse section has been added to better illustrate that, although both epithelium and crypt structures can be present, the epithelial morphology differs significantly. Indeed, the regenerating epithelium of Cmg2WT mice displays a thick epithelial layer with well-polarized epithelial cells, whereas in cmg2KO mice, the epithelium appears atrophic, characterized by a thinner epithelial layer and elongated epithelial cells.

      __Rev 3 __

      This reviewer remains sceptical. I agree the authors performed the experiment well to confirm that DSS dosing was as equivalent as possible across the study. But DSS acts to induce colitis because it is concentrated in the colonic lumen as water is absorbed. Also ECM responses and remodelling are a central part of colitis models. And my concern is that the actual exposure in the KO group is influenced by transit of faeces/DSS is secondary to the known action of CMG2 on collagen deposition. The consequence of this being a protracted damage phase in which a restoration of adult stem cells would not be expected and leading to epithelial failure.

      However, we differ. I might propose that the authors are asked to investigate and confirm expression of CMG2 in the epithelium and to repeat the analysis of collagen levels they performed on untreated CMG2 KO mice on colons from CMG2 KO mice having received DSS to see if these differ from controls.

      This has now been done.

      __Rev 1 __

      Both reviewer #2 and reviewer #3 make relevant points, from the point of view of extracting as much biological knowledge as we can from the observations reported in the manuscript.

      Reviewer #2 suggestion to use Cmg2[KO] organoids to investigate the dependence of Wnt transduction on Cmg2 is the type of experiments I refrained to propose. However, I think the "skeleton" of the mechanism is there and is reasonably solid. Fleshing it out may well be another paper.

      I agree with Reviewer #3 objections to the timing and severity of the DSS damage. However, I am not sure how much they invalidate the main tenet of the paper:

      • DSS may affect Cmg2[KO] more severely, but the overall disease score is comparable during the DSS treatment. If this severity was enough to be the main driver of the phenotype, it should have left a mark in the Histological and Disease activity scores. In this regard, I think it would be helpful if the authors provided an expanded version of Figure 2A with examples of the different levels of "Crypt damage" scored, and the proportions for each. This could be in the supplementary material and would balance the impressions induced by a single image.

      As suggested, we included a detail of histological score including the crypt damage score in Supplementary Fig 3i showing no significant differences in crypt damage between Cmg2WT and Cmg2KO mice.

      • If DSS affected the recovery, this would also be compatible with having a more severe histological phenotype (which is not shown overall, just in Fig 2A) because one would also expect the tissue to attempt regeneration during the 7 days of DSS treatment.

      This is an interesting point, and we now allude to this aspect in the manuscript.

      • The only objection that I find difficult to argue is the effective duration of the treatment. If indeed peristalsis is affected, it may be that during the 'recovery' phase there is still DSS in the intestine. This could be perhaps verified using a DS detection assay (e.g. https://arxiv.org/pdf/1703.08663) on the intestinal contents or the faeces of the mice during the 3-day recovery period.

      We have attempted to obtain and purchase Heparin Red to perform this assay. Unfortunately, we have not obtained the reagent, which has never been delivered. We now also mention the following in the Discussion:

      One could envision that Cmg2KO mice have a defect in peristalsis resulting in longer dwell times and possibly higher effective dose of DSS to the KO epithelium. We however did not observe any signs of intestinal obstruction or fecal retention in Cmg2KO mice. Animals were single-caged for 30 min to collect feces. We did not observe any difference in amounts collected from WT and KO mice, arguing against a substantial difference in transit time of gut contents. Moreover, if DSS affected the recovery, one would have expected a more severe histological phenotype in the colon of Cmg2KO since the tissue likely already attempts regeneration during the 7 days of DSS treatment. But this was not the case. Therefore, while we cannot formally rule out the presence of residual DSS in Cmg2KO mice during the DSS withdrawal phase, there is currently no indication that this was the case.

      I think of what the aim of scholarly publication is, with this paper, and I find myself going back to a statement of the authors' discussion - that this work suggests that infants risking death may be offered (compassionate, I guess) IBD treatment. What does this hinge upon? I think, on the basic observation that diarrhoea (in the mouse model) is not intrinsic but caused by an inflammation-promoting insult. Is this substantiated? I think it is. Could we learn more biology from this disease model, about Wnt and about how ECM affects tissue regeneration? Certainly. Can this learning wait? I believe it can.

      We thank the reviewer for this statement.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript has a good rationale in trying to understand why infants with an inherited condition, Hyaline Fibromatosis Syndrome, that is primarily associated with turnover and deposition of extracellular collagen also develop severe diarrhoea that can contribute to their premature death. The premise is that the causative germline mutated gene, CMG2/ANTRX2, may have a functional role in colonic epithelium in addition to controlling the ECM composition. There is little background information but one study has shown no primary defect in epithelial organoids grown from patients with the syndrome. This leads the authors to wonder if non-homeostatic, conditions might reveal a function role for the gene in regeneration.

      The authors' approach to test the hypothesis is to use a mouse germline knockout model and to induce colitis and regeneration by the established protocol of introducing dextran sodium sulfate (DSS) into the drinking water for five days. In brief there is no phenotype apparent in the untreated knockout (KO) but these animals show a more severe response to DSS that requires them to be killed by 10 days after the start of treatment. This effect following phenotypic characterisation of the colonic epithelium is interpreted as showing the CMG2 is a Wnt modifier required for the restoration of the intestinal stem cell population in the final stages of repair.

      The experiment and analysis seem reasonably well executed - although a few specific comments follow below. The narrative is simple and easy to understand. However, there are significant caveats that cast doubts on the interpretation made that loss of CMG2 impairs the transition of colonic epithelial cells from a fetal like state to adult ISCs.

      Significance

      1. First there is only a single approach and single type of experiment performed. There is a lack of independent validation of the phenotype and how it is mediated.
      2. The DSS dose in this kind of experiment is often determined empirically in individual units. Here the 3% used is within published range but at upper end. The control animals show a typical response with symptoms of colitis worsening for 2-3 days after the removal of DSS and then recovery commonly over another 5-7 days.

      Here the CMG2 KO mice fail to recover and are killed by 9 or 10 days. The authors attempt to exploit the time course by identifying normal initial (7days) and defective late (10days) repair phases in KO animals when compared to controls. It is from this comparison that conclusions are drawn.

      However, the alternative interpretation might be that the epithelium of KO animals is so badly damaged, and indeed non-existent (from viewing Fig2a), that it is incapable of mounting any other response other than death and that the profiling shown is of an epithelium in extremis. The repair capability and dynamics of the KO would have been better tested under more moderate DSS challenge, if this experiment had been regarded as a pilot rather than as definitive. 3. The animals used were young (8 weeks) and lacked any obvious defect in collagen deposition. Does this change with treatment? Even if not, is it possible that there is a defect in peristalsis or transit time of gut contents, resulting in longer dwell times and higher effective dose of DSS to the KO epithelium? 4. Is CMG2 RNA and protein expressed in the colonic epithelium? It is not indicated or tested in the submitted manuscript. This reviewer struggled to find evidence, notably it did not seem to be referenced in the organoid paper they reference in introduction (ref 13).

      Specific comments:

      Figure 3 c-e and associated text are confusing. In c the Y scale seems inappropriate to show percentages up to 15,000%. In d and e the use of percentages may by correct. However, it is claimed in text that Cty61 and CTFG are upregulated in the KO. That is not what the plots appear to show as the compare to WT untreated cells, in which case the KO have not downregulated these genes in the way the controls have.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper uses mice lacking Capillary Morphogenesis Gene 2 (CMG2- KO) mice to investigate the pathogenic mechanism underlying the protein losing enteropathy seen in children with severe Hyaline Fibromatosis Syndrome. Significance of the work is further enhanced as the intestinal phenotype induced by CMG2-KO provided a model system (with robust validated tools) for testing newly emerging (and paradigm shifting) ideas in mechanisms of tissue regeneration after injury - generalizable to tissue restitution beyond the intestine.

      The study shows that in the mouse colon CMG2 plays a critical role in recovery from mucosal/epithelial damage chemically induced by dextran-sulfate-sodium (DSS). Mice lacking CMG2 failed to recover from DSS colitis with no evidence for restitution of the DSS-damaged epithelium. WT mice recovered after DSS removal.

      The first step in restitution of epithelial damage in the intestine, when the epithelial stem-cell populations are depleted as in this model of DSS colitis, occurs by the transformation of surviving differentiating/differentiated epithelial cells back into a stem-cell-like (fetal-cell-like) state. This step in the process was found to occur normally in the CMG2 KO mouse. The block in restitution was located to the step where de-differentiated (fetal-cell-like) colonocytes are induced back into their WNT-dependent proliferative state - thus replenishing the normally proliferating stem (LGR5+) cells of the colonic crypt. The reason for this failure is explained by a defect in WNT signaling in the injured colons of CMG2 KO mice, as assessed by failure of -catenin translocation into the nucleus of barrier epithelial cells - a down-stream effect of WNT signaling and consistent with the dependence on CMG2 for WNT signaling in other experimental systems.

      The study is overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted. The paper makes a meaningful contribution to the field. It models an experiment of nature to test, delineate, and verify disease pathogenesis and a newly revised mechanism for mucosal tissue repair.

      For this reader, one additional thought comes to mind. If I understand the field correctly it would be informative to know with greater confidence where - in what cell type, epithelial or mesenchymal - the CMG2-LRP6-WNT interaction occurs.

      After injury the CMG2-KO mouse epithelium exhibits defective WNT signal transduction - as evidenced by failure of -catenin to translocate into the nucleus. At first glance, this result is a disconnect with the paper by van Rijin that claims the defect in Hyaline Fibromatosis Syndrome cannot be due to loss of CMG2 expression/function in the barrier epithelial cell - a claim based on the mostly normal phenotypes of human CMG2 KO duodenal organoids. But the human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated.

      I think it would be informative to prepare colon organoids (and duodenoids) from WT and CMG2-KO mice to quantify their WNT dependency during establishment and maintenance of the stem-like (and WNT-dependent) state. If CMG2 acts within the epithelial cell to affect WNT signaling (regardless of WNT source), organoids prepared from colons of CMG2-KO mice would require more WNT in culture media to establish and maintain the stem cell proliferative state - when compared to organoids prepared from WT mice. This can be quantified (and confirmed molecularly by transgene expression if successful). Enhanced dependency of high concentrations of exogenous WT would be evidence for a primary defect in WNT-(LRP2)-CMG2 signal transduction localized to the epithelial barrier cell - thus addressing the apparent discrepancy with the van Rijin paper - and for my part, advancing the field. And the discovery of a defect in the epithelium itself for WNT signal transduction would implicate a biologically most plausible mechanism for development of protein losing enteropathy.

      By no means do I consider these experiments to be required for publication (especially if considered to be incremental or already defined - WNT-CMG2 is not my field of research). This study already makes a meaningful contribution to the field as I state above.

      But in the absence of new experimentation, the issue should probably be discussed in greater depth.

      Significance

      The study is overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted. The paper makes a meaningful contribution to the field. It models an experiment of nature to test, delineate, and verify disease pathogenesis and a newly revised mechanism for mucosal tissue repair.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this work, Bracq and colleagues provide clear evidence that the persistent diarrhoea seen in a mouse model of Hyaline Fibromatosis Syndrome is related to the inability of their intestinal epithelium to properly regenerate. This is very clear and of immediate impact. This aspect of the paper, I think, is ready for publication, and would merit immediate dissemination on its own. It is great that the manuscript is in bioRxiv already.

      I am not so thoroughly convinced about the mechanism that the author propose to explain the incapacitation of Cmg2[KO] intestinal stem cells to function properly. The authors propose that it is due to their inability to transduce Wnt signals, and while this is plausible, I think there are few things that the paper should contain before this can be proposed firmly:

      Point #1

      The mouse mutant is just described as 'KO', referring to the previous work by the authors. The cited work simply states that this is a zygotic deletion of exon 3, which somehow leads to a decrease in protein abundance that is almost total in the lung but not so clear in the uterus. Exon 3 happens to be 72 bp long [https://www.ncbi.nlm.nih.gov/nuccore/NM_133738], so its deletion (assuming there are no cryptic splicing sites used) leads to an internal in-frame deletion of 24 amino acids. So, at best, this 'KO' is not a null, but a hypomorphic allele of context-dependent strength. Unfortunately, neither the previous work nor this paper (unless I have missed it!) contains information provided about the expression levels of Cmg2 in the intestine of KO mice - nor which cell types usually express it (see below). I think that using anti Cmg2 in WB and immunohistofluorescence of with ISC markers with intestine homogenate/sections of wild-type and mutant mice would be necessary to set the stage for the rest of the work.

      Point #2

      Connected to the previous point, the expression pattern of Cmg2 in the intestine is not described. Maybe this is already established in the literature, but the authors do not refer to the data. This is important when considering that the previous work of the authors suggests that Cmg2 might contribute to Wnt signalling transduction through physical, cis interactions with the Wnt co-receptor LRP6. Therefore, one would expect that Cmg2 would be cell-autonomously required in the intestinal stem cells.

      Point #3

      The authors establish that the regenerating crypts of Cmg2[KO] mice are unable to transduce Wnt signalling, but it is not clear whether this situation is provoked by the DSS-induce injury or existed all along. Can Cmg2[KO] intestinal stem cells transduce Wnt signalling before the DSS challenge? If they were, it might suggest that the 'context-dependence' of the Cmg2 role in Wnt signalling is contextual not only because of the tissue, but because of the history of the tissue or its present structure. It would also suggest that Cmg2 mutant mice, unless reared in a germ-free facility for life, would eventually lose intestinal homeostasis, and maybe suggest the level of intervention/monitoring that HFS patients would require. It might also provide an explanation in case Cmg2 was not expressed in ISCs - if the state of the tissue was as important as the presence of the protein, then the effect on Wnt transduction could be indirect and therefore it might not be required cell-autonomously.

      I think points 1 and 2 are absolutely fundamental in a reverse genetics investigation. Point 3 would be nice to know but the outcome would not change the tenet of the paper. I believe that the work needed to deal these points can be performed on archival material. I do not think the mechanism proposed can be taken from 'plausible' to 'proven' without proposing substantial additional investigation, so I will not suggest any of it, as it could well be another paper.

      A few minor points picked along the way:

      1. Figure 1 legend says "In (c), results are mean {plus minus} SEM" - this seems applicable to (d) as (c) does not show error whiskers.
      2. Figure 1 legend says "(d) Body weight loss, (f) the aspect of the feces and presence of occult blood were monitored and used for the (e) DAI. Results are mean {plus minus} SEM. Each dot represents the mean of n = 12 mice per genotype". This part looks like has suffered some rearrangement of words. The first instance of (f) should be (e), I guess, and I am not sure what "(e) DAI" means. And for (e), "mean {plus minus} SEM" does not seem applicable. This needs some light revision.
      3. Figure 1H legend does not say which statistical test was made in the survival experiment in (h) - presumably log-rank? A further comment on the survival statistics: euthanised animals should not be counted towards true mortality when that is what is recorded as an 'event'. They should be right-censored. However, in this case, reaching the euthanasia criterion is just as good an indicator of health as mortality itself. So, simply by changing the Y axis from 'survival' to 'event-free survival' (or something to that effect), where 'events' are either death or reaching the euthanasia criterion, leaves the analysis as it is, and authors do not need to clarify that figure 1H shows "apparent mortality", as it is straightforward "complication-free survival" (just not entirely orthogonal to weight loss).
      4. Some density measurements are made unnecessarily on arbitrary units (per field of view) - this should be simple to report in absolute measures (i.e. area of tissue screened or, better still, length of epithelium screened).
      5. Figure 2E should read "percent involvement"
      6. Figure 2J should read "lipocalin..."
      7. In section "CMG2 Is Dispensable for YAP/TAZ-Mediated Reprogramming to Fetal-Like Stem Cells", the authors write ""We measured the mRNA levels of two additional YAP target genes, Cyr61 and CTGF...". I presume the "additional" is because Ly6a is also a target of YAP/TAZ, but if the reader does not know, it is puzzling. I would suggest to make this link explicit.
      8. In Figures S2, 3 and S3, I think that the measures expressed as "% of homeostatic X in WT" really mean "% of average homeostatic X in WT". This should be made clear somewhere.
      9. In panel C, the nature of the data is not entirely clear. First, the corresponding part of the legend says "Representative images of n=4 mice per genotype" which I presume should refer to panel B. Then, the graph plots 4 data points, which suggests that they correspond to 4 mice - but how many fields of view? Also, the violin plot outline is not described - I presume it captures all the data points from the coarse-grained pixel analysis, but it should be clarified.
      10. In Figure 3H and 3I, I would suggest to add the 7+3 timepoint where the data come from.
      11. In section "CMG2 Is Critical for Restoring the Lgr5+ Intestinal Stem Cell Pool", the authors say "...The mRNA levels of ... LRP6, β-catenin (Fig. S3a-b), and Wnt ligands (Wnt5a, 5b, and 2b) were comparable between the colons of Cmg2WT and Cmg2KO mice (Fig. S3c)..." without clarifying in which context - one needs to read the figure legend to realise this is "timepoint 7+3". I suggest to add "in the recovery phase" or "in regenerating colons" or something shorter, just to guide the reader.
      12. Like with the previous point, it is not clear when the immunohistofluorescence of B-catenin is made - not even in the legend, as far as I could see. The only hint is that authors say "the nuclei of cells in the atrophic crypts of Cmg2KO..." with 'atrophic' probably indicating again the 7+3 timepoint.
      13. A typo in the discussion: tunning for tuning.
      14. In the discussion, the authors talk about the 'CMG2' protein (all caps - formatting convention for human proteins) but before they were referring to 'Cmg2' (formatting convention for mouse proteins). That is fine but some of the statements where "CMG2" is used clearly refer to observations made in the mouse.
      15. Typos in methods: "antigen retrieval by treating [with] Proteinase K"; "Image acquisition and analyze [analysis]"; "All details regarding code[s] used for immunofluorescence analysis"

      Referees cross-commenting

      *this session contains comments from ALL the reviewers"

      Rev2

      Points 1 and 2 made by Referee 1 (and point 4 of Referee 3) appear most reasonable, and if not already done should be.

      I also noted the more severe morphology of DSS damaged epithelium shown in Fig 2a noted by Referee 3 - and this I agree is a confounding factor. But overall, multiple lines of evidence were assembled to show that the KO mice and WT mice suffered DSS-induced colitis with equal severity - and with closely equal severity of damage to the intestinal epithelium (though the image in Fig 2a is disturbing). For my part, the concern is understandable but likely not operating in a confounding way. And the evidence for the reprogramming of the damaged epithelium into "fetal-like stem cells" (the 1st step in restitution of lost stem cells) occurs in both WT and KO mice - and these data are strong. For this reader, the block convincingly shows up for KO mouse at the WNT dependent step

      Rev 3 This reviewer remains sceptical. I agree the authors performed the experiment well to confirm that DSS dosing was as equivalent as possible across the study. But DSS acts to induce colitis because it is concentrated in the colonic lumen as water is absorbed. Also ECM responses and remodelling are a central part of colitis models. And my concern is that the actual exposure in the KO group is influenced by transit of faeces/DSS is secondary to the known action of CMG2 on collagen deposition. The consequence of this being a protracted damage phase in which a restoration of adult stem cells would not be expected and leading to epithelial failure.

      However, we differ. I might propose that the authors are asked to investigate and confirm expression of CMG2 in the epithelium and to repeat the analysis of collagen levels they performed on untreated CMG2 KO mice on colons from CMG2 KO mice having received DSS to see if these differ from controls.

      Rev 1 Both reviewer #2 and reviewer #3 make relevant points, from the point of view of extracting as much biological knowledge as we can from the observations reported in the manuscript.

      Reviewer #2 suggestion to use Cmg2[KO] organoids to investigate the dependence of Wnt transduction on Cmg2 is the type of experiments I refrained to propose. However, I think the "skeleton" of the mechanism is there and is reasonably solid. Fleshing it out may well be another paper.

      I agree with Reviewer #3 objections to the timing and severity of the DSS damage. However, I am not sure how much they invalidate the main tenet of the paper:

      • DSS may affect Cmg2[KO] more severely, but the overall disease score is comparable during the DSS treatment. If this severity was enough to be the main driver of the phenotype, it should have left a mark in the Histological and Disease activity scores. In this regard, I think it would be helpful if the authors provided an expanded version of Figure 2A with examples of the different levels of "Crypt damage" scored, and the proportions for each. This could be in the supplementary material and would balance the impressions induced by a single image.

      • If DSS affected the recovery, this would also be compatible with having a more severe histological phenotype (which is not shown overall, just in Fig 2A) because one would also expect the tissue to attempt regeneration during the 7 days of DSS treatment.

      • The only objection that I find difficult to argue is the effective duration of the treatment. If indeed peristalsis is affected, it may be that during the 'recovery' phase there is still DSS in the intestine. This could be perhaps verified using a DS detection assay (e.g. https://arxiv.org/pdf/1703.08663) on the intestinal contents or the faeces of the mice during the 3-day recovery period.

      I think of what the aim of scholarly publication is, with this paper, and I find myself going back to a statement of the authors' discussion - that this work suggests that infants risking death may be offered (compassionate, I guess) IBD treatment. What does this hinge upon? I think, on the basic observation that diarrhoea (in the mouse model) is not intrinsic but caused by an inflammation-promoting insult. Is this substantiated? I think it is. Could we learn more biology from this disease model, about Wnt and about how ECM affects tissue regeneration? Certainly. Can this learning wait? I believe it can.

      Significance

      In this work, Bracq and colleagues provide clear evidence that the persistent diarrhoea seen in a mouse model of Hyaline Fibromatosis Syndrome is related to the inability of their intestinal epithelium to properly regenerate. This is very clear and of immediate impact. For instance, the authors themselves point at the possibility of applying treatments for Inflammatory Bowel Disease to HFS patients. While what happens in a mouse model is not necessarily the same as in human patients, the fact that persistent diarrhoea is a life-threatening symptom in HFS make this proposal, at least in compassionate use of the therapies and until its efficacy is disproven, very plausible. This is a clear gap of knowledge that addresses an unmet medical need.

      I find that the work shows clearly that HFS mouse model subjects have normal intestinal function until challenged with a standard chemically-induced colitis. Then, the histological and health deterioration of the HFS mouse model is clear in comparison with normal mice, which can regenerate appropriately. This is shown with a multiplicity of orthogonal techniques spanning molecular, histological and organismal, which are standard and very well reported in the paper.

      The authors propose a specific cellular and molecular mechanism to explain the incapacity of the intestinal epithelium in the mouse model of HFS to regenerate. According to this mechanism, the protein Cmg2, whose mutation causes HFS in humans, would be necessary for intestinal stem cells to transduce the signal of Wnt ligands and therefore support their behaviour as regenerative cells. This mechanism is plausible, but more basic and advanced work would be needed to take it as proven.

      This work would be of interest to both the clinical, biomedical, and basic research communities interested in rare diseases, the gastrointestinal system, collagen and extracellular matrix, and Wnt signalling.

      My general expertise is in developmental and stem cell biology using reverse genetics, transgenesis and immunohistological and molecular methods of data production, and lineage tracing, digital imaging and bioinformatic analytical methods; I work with Drosophila melanogaster and its adult gastrointestinal system.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript entitled " The hepatitis E virus capsid protein ORF2 counteracts cell-intrinsic antiviral responses to enable persistence in hepatocytes ", Ann-Kathrin Mehnert interrogated that HEV pORF2 can inhibit host antiviral response. They found interaction of HEV ORF2 and TBK1. The finding is interesting and echoed with some previous studies that ORF2 can inhibit innate immunity.

      The study could benefit from the consideration of some major and specific points, as indicated below:

      Major issues:

      1. The researchers used p6, a cell-adapted clone, which was isolated form a chronic HEV patient. As previous studies suggested, p6 may behave differently than wild-type strains. Did the authors tried other HEV strains, as they used ips-induced model that was reported supportive to wild-type HEV?
      2. Figure 1F, ORF2 can interact with TBK1 as showed. But the prediction from Alphafold is weak. Also, could the author more evidence than the co-IP?
      3. Figure 2C and 2D, at 5 dpi, one can observed a stronger antiviral response, but at 7 dpi, no obvious difference was observed. Could the authors comment on this? 4.Figure 2H and 2I, detailed description of how the authors measured the positive cells should be provided. Did the authors selected whole plate of cells for counting? As showed in Figure 2H, the signals of IF were stronger at 5 and 7 dpi when compared at 3 dpi, but why the proportion of positive cells was reduced in Figure 2I?
      4. The study emphasized the function of ORF2 on HEV "persistence". However, this cannot be fully supported by cell models. In future, study on chronic HEV infection animal models may be conducted.
      5. The authors study ORF2 in whole. It will be of benefit to the readers that the authors could specified the function of secreted ORF2 and ORF2 capsid in the current study.

      Minor issues:

      1. Figure 3A, this is an elegant design. More data may provide for the validation of the formation of the virions.
      2. Figure 1, data should be provided for the successful expression of HEV-1 or HEV-3 ORF2, and ORF3.
      3. line 219, the current evidence that supported this statement is weak, especially for ORF2.
      4. Suppl Figure 3F-3H, statistical analysis is needed
      5. Suppl Figure 3F-3H, it seems that when no treatment was admistrated, the level of ISG15 in ΔORF2 group was higher than those of the WT and ΔORF3 group. Could the authors comment?
      6. Figure 3D and 3E, the starting time of the detection is not aligned.
      7. Figure 3F, scale bar is missing.
      8. In M&M, statistical method should be provided with more details and cover all the experiments used.

      Significance

      In the manuscript entitled " The hepatitis E virus capsid protein ORF2 counteracts cell-intrinsic antiviral responses to enable persistence in hepatocytes ", Ann-Kathrin Mehnert interrogated that HEV pORF2 can inhibit host antiviral response. They found interaction of HEV ORF2 and TBK1. The finding is interesting and echoed with some previous studies that ORF2 can inhibit innate immunity.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: Authors described the protective mechanism mediated by ORF2 that protects viral replication from the antiviral responses. They have utilized the advanced single-cell RNA sequencing to decipher the dampened antiviral responses in the presence of ORF2 HEV. I believe the study is important for the HEV literature and believe that the manuscript can be considered for publication after authors (1) rewrites the results and discussions separately until the journal wants it to be together. (2) answer the below questions.

      Minor comments:

      Line 69, 71 - I have never seen in any paper including reference in this way!

      Line 72 and 73 - missing reference!

      Line 92, 93 - missing reference!

      Line 95 to 99 - missing references!

      Major comments:

      I would like the authors to answer few questions: 1. Did the authors study only the P6 HEV genome? Have they done anything comparative with the other strain to understand if the proposed mechanism is not the strain specific? 2. Can the authors explain why we do not see any band in the Fig. 1F B-actin?

      Significance

      The paper uses advanced technique as single cell RNA seq to understand the mechanism of ORF2 assisting in the HEV replication.

      The study is well designed.

      This study will add up to understand some of the persistence infection seen in solid organ transplant patients. This study gives a mechanistic overview of HEV avoidance of antiviral response.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors investigated how HEV ORF2 interferes with host antiviral responses to sustain viral infection. They employed several models, including pattern recognition receptors (PRRs) KO cell lines, immunodeficient cells and stem cell-derived models, to prove that: 1) ORF2 is essential for viral replication and 2) ORF2 dampens the interferon and inflammatory signaling pathways. They confirmed the interaction between ORF2 and TBK1, a central mediator of innate immune responses and identified residues in ORF2 that affect its interaction with TBK1. Finally, through single cell RNAseq, they demonstrated that ORF2 is a viral antagonist that inhibits host ISG expression in both infected and bystander cells. Interestingly, the sets of genes that are upregulated in WT vs ORF2-deficient virus infected cells are not entirely identical, suggesting that ORF2 may also modulate host gene expression in addition to suppressing the immune response. This research confers new immune antagonism mechanisms mediated by HEV capsid for sustainable HEV replication in host cells and provides potential therapeutic targets for HEV treatments.

      Major comments:

      1. The authors conclude from Figure 1 that the HEV ORF2 protein antagonizes both antiviral and inflammatory signaling pathways. The authors comprehensively investigated PRRs-mediated activation of type I interferon by viruses or poly(I:C) through overexpression of MDA5, RIG-I and TLR3. However, they only investigated the impact of ORF2 on host inflammatory response through evaluating the levels of TNFAIP3 RNA in the presence of MDA5 overexpression. It would be informative if the authors also check for NFkB activation/phosphorylation and expression of classical pro-inflammatory cytokines such as IL-1b and IL6. Interestingly, changes in IFNB secretion after ORF2 overexpression appear more dramatic compared to changes in IFNB1 RNA levels (compare Figure 1A-C with Supplementary Figure 1A and C). Are the IFN-beta protein expression changes statistically significant in Supplementary Figure 1?
      2. Changes in the IFN response do not always translate into changes in the viral RNA levels. In Figure 2B-D, the authors attributed the higher induction of IFNL1 and ISG15 on day 5 to the absence of ORF2 inhibition. However, the expression of these two genes drops to the same levels as the ones in WT viral RNA-electroporated cells on day 7, which is strange as ORF2-deficient viral RNA levels continue to be inhibited on day 7. This is different from the stem cell derived hepatocytes infected with the trans-complementation viruses in Figure 3G-H where there are significant differences in ISG15 levels between WT and ORF2-deficient virus infected cells on both days 5 and 7. To support their hypothesis, the authors need to further confirm the sudden upregulated antiviral activity on day 5 in electroporated HepG2/C3A cells by testing JAK/STAT phosphorylation and type I interferon secretion.
      3. The authors used different hepatocyte systems coupled with viral RNA electroporation or trans-complementation virus infection to investigate ORF2-mediated interference of the IFN pathway, which is highly complementary. However, while the electroporation of viral RNA into HepG2/C3A (Figure 2B-D) and infection of stem cell-derived hepatocytes with trans-complementation viruses (Figure 3F-H) result in similar upregulation of ISG expression on day 5, that wasn't observed in HepG2/C3A cells infected with trans-complementation viruses (Figure 3C-E) on day 5. The authors need to discuss the discrepancy among these different systems. Since the ORF2-deficient trans-complementation virus still brings in ORF2 proteins from the producer cells but cannot generate new ORF2 proteins, do ORF2 proteins from these two different sources have different functions in different hepatocyte systems? In addition, other than the data points that are shown to be not significantly different in Figure 3D-E, are any of the other data points significantly different?
      4. The single cell RNAseq data are very informative and revealed two interesting groups of genes. First, the ISGs that are further induced in the cells infected with ORF2-deficient HEV compared to cells infected with WT HEV (Figure 4N) are likely suppressed by ORF2. Second, the ISGs that are uniquely induced in the absence of ORF2 are different from the genes that are uniquely induced by WT HEV (Supplementary Table 2), suggesting that ORF2 may also modulate host gene expression. The authors can further characterize these two groups of ISGs by performing gene knockdown or knockout and investigating whether ORF2 directly interacts with these ISG products to determine the functional consequences of their upregulation. Related to that, are there other gene expression changes beyond ISG signatures which would suggest that ORF2 can regulate host gene expression? Figure 4A-C only shows comparisons for WT or ORF2-deficient vs. uninfected cells. The authors can perform GO and KEGG analyses to see if certain biological processes/pathways are enriched among the WT vs ORF2-deficient HEV induced genes. Further characterization of these genes (ISGs or not) would shed light on the novel roles of ORF2 in both immune antagonism and gene regulation and greatly increase the significance of the study.
      5. In Supplementary Figure 3F-H, the authors used BX795 to inhibit TBK1 (a target of ORF2) and found decreases in IFNL1 and ISG15 expression whether cells are electroporated with WT, ORF2-deficient, or ORF3-deficient viral RNA. However, this does not correlate with the data in Figure 2E-G where TBK1 inhibition results in significant differences in viral RNA levels only in the absence of ORF2 or ORF3. These results would suggest that the effects of TBK1 inhibition on viral RNA levels is independent of changes in the IFN/ISG expression levels.

      Significance

      The study addresses a long-standing question in the field about the immune antagonism activities of HEV ORF2 and ORF3 which previous studies have conflicting results on. The strength of this study is the use of complementary approaches such as ORF2 trans complementation system and single cell sequencing, and more relevant models such as stem cell derived hepatocytes to rigorously dissect the role of newly synthesized ORF2 protein in immunocompetent cell context. The manuscript is well written and would appeal to researchers in the HEV and innate immunity fields. However, the significance of the study is dampened by changes in the IFN response not always correlate with the inconsistency of ORF2-mediated inhibitory effects in different models and the still poorly defined mechanism of ORF2 suppression of the IFN pathway. The study would make conceptual advance if the authors can address the discrepancies in their findings and perform additional characterization to determine the functional consequences of ORF2-mediated immune suppression and gene regulation.

      My expertise is in innate immunity and host-virus interactions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __ Summary In this work, the authors present a careful study of the lattice of the indirect flight muscle (IFM) in Drosophila using data from a morphometric analysis. To this end, an automated tool is developed for precise, high-throughput measurements of sarcomere length and myofibril width, and various microscopy techniques are used to assess sub-sarcomeric structures. These methods are applied to analyze sarcomere structure at multiple stages in the process of myofibrillogenesis. In addition, the authors present various factors and experimental methods that may affect the accurate measurement of IFM structures. Although the comprehensive structural study is appreciated, there are major issues with the presentation/scope of the work that need to be addressed: Major Comments 1. The main weakness of the paper is in its claim of presenting a model of the sarcomere. Indeed, the paper reports a structural study that is drawn onto a 3D schematic. There is no myofibrillogenesis model that would provide insights into mechanisms. Therefore, the use of the word model is grossly overstated.

      In biology, the term “model” is used in various contexts, but it generally refers to a simplified representation of a biological system, a structure or a process. Accordingly, we consider “model” the most fitting phrase for what we present in Figure 4 (Figure 7 in the revised manuscript). These are not arbitrary 3D schematics; they are scaled representations in which the length, the number and the relative three-dimensional arrangement of thin and thick filaments are based on measurements. These measurements are primarily based on our own data (presented in the main text and provided in the supplementary materials), as published data were either lacking or inconsistent. Moreover, we would like to highlight that we do not claim to present a conceptual or mechanistic model of myofibrillogenesis, but we do present structural reconstructions or models for four developmental time points. Therefore, we disagree with the remark that “the use of the word model is grossly overstated”, as our wording fully corresponds to the common sense.

      In general, the major focus and contribution of the work is unclear. How does the comprehensive nature of the measurements contribute to existing literature?

      We significantly revised the text to highlight the main points more firmly, and added an additional section to help non-specialist readers to better understand our aims and findings.

      Figure labels are often rather confusing - for example it is unclear why there is a B, B', B' etc instead of B,C,D, etc.

      The figure labels have been revised in accordance with the reviewer’s recommendation.

      Some comments in the text are not clearly tied to the figures. For example, in lines 108-109, are the authors referring to the shadow along the edges of the myofibril when saying they are not clearly defined (Figure 1C)?

      The lines refer to the fact that identifying the boundary of an “object” in a fluorescence microscopy image is inherently challenging - even under ideal conditions where the object’s image is not affected by nearby signals or background noise. To improve clarity, we revised this section and now it reads: The other key parameter - myofibril diameter - is typically measured using phalloidin staining. However, accurately delineating their boundaries in micrographs is difficult - even under optimal conditions (high signal‑to‑noise ratio, no overlapping fibers, etc.; Fig. 1C). This limitation arises from the fundamental nature of light microscopy as the image produced is a blurred version of the actual structure, due to convolution with the microscope’s point spread function.

      In line 116, it is unclear what "surrounding structures" the authors are referring to if the myofibrils are isolated.

      We revised the text for clarity. It now states: Once isolated, myofibrils lie flat on the coverslip, aligning with the focal plane of the objective lens. This orientation allows for high-resolution, undistorted imaging and accurate two-dimensional measurements, free from interference by neighboring biological structures (e.g.: other myofibrils).

      In lines 141-142, there is no reference of data to back up the claim of validation.

      We addressed this mistake by including a reference to Fig. S1E (Fig. S1D in the revised manuscript).

      In line 170, the authors mention the mef2-Gal4/+ strain as a Gal4 driver line but do not clearly state how this strain is different from the wildtypes or how this impacts their results.

      Mef2-Gal4 is a muscle-specific Gal4 driver, often used in Drosophila muscle studies. It is a convention between Drosophila geneticists that presence of a transgene (i.e. Mef2-Gal4) changes the genetic background, and although it does not necessariliy cause any phenotypic effect, it is clearly distinguished from the wild type situation, and whenever relevant, Mef2-Gal4/+ is the preferred choice (if not the correct choice) as a control instead of wild type. As clear from our data, presence of the Mef2-Gal4 driver line does not affect the length or width of IFM sarcomeres as compared to wild type.

      In lines 182-185, the authors discuss the effects of tissue embedding on morphometrics. Were factors such as animal sex, age, fiber type, etc. conserved in these experiments? If not, any differences in results may be confounding.

      We fully agree with the reviewer that when testing the effect of a single variable, all other variables should remain constant. This is actually one of the main points emphasized in the results section. Additionally, this information is already provided in the Source Data files for each panel.

      In lines 199-201, the authors discuss results of myofibril diameter using different preparation methods, yet no data is cited to support the claims. In line 220, the phrase "6 independent experiments" is unclear. Is each independent experiment performed using a different animal? Furthermore, are 6 experiments performed for each time point?

      We substantially revised the relevant paragraphs and ensured that the corresponding data (Figure 2A in the revised manuscript) is cited each time when it is discussed. We conducted six independent experiments at each time point. This is consistently indicated in the figures and can be verified in the SourceData files (specifically, Fig3SourceData in this case). To clarify what we mean by "independent experiments," we added the following sentence to the Methods section: Experiments were considered independent when specimens came from different parental crosses, and each experiment included approximately six animals to capture individual variability.

      In line 254, the authors refer to "number of sarcomeres". It must be clearly stated if this refers to sarcomeres per myofibril, image area, etc.

      It is now clearly stated as: "number of sarcomeres per myofibril".

      In line 274, the authors refer to "myofilament number". It must be clearly stated if this refers to myofilaments per myofibril, image area, etc.

      We counted the number of myofilaments in developing myofibrils, and this is now clearly stated in the text and in the legend of Figure 3 (Figure 4 in the revised manuscript).

      In line 299, the authors mention that thin filaments measured less than 560 nm in length, yet no data is cited to support this.

      The previously missing reference to Figure 4 (Figure 7 in the revised manuscript) has now been added in addition to the revised Supplementary Figure 5.

      In the "Quantifying sarcomere growth dynamics" section of the summary (starting from line 402) the authors introduce data that would be more naturally placed in the results and discussion section.

      As suggested by the reviewer, we incorporated the key aspects of sarcomere growth dynamics into the Results and Discussion section.

      In lines 422-423, it is not mentioned what the controls are for.

      This was already explained in the main text between lines 167 and 173.

      In the caption of Figure 1C, it is not mentioned what the red dashed lines in the microscope images represent.

      The caption has been updated to include the following clarification: The red dashed lines border the ROI used for generating the intensity profiles.

      In the caption of Figure 1D, the difference between the lighter and darker grey points is not mentioned.

      This was already explained in each relevant figure legend. In this specific case, it is stated between lines 850 and 852: “Light gray dots represent individual measurements of sarcomere length and myofibril diameter, while the larger dots indicate the mean values from independent experiments.”

      In line 849, the stated p-value (0.003) does not match that mentioned in the figure (0.0003).

      We thank the reviewer for noticing this small mistake; correction was made to display the accurate p-value of 0.0003 at both places.

      In line 874, it is not clear what an "independent experiment" refers to (different animal, etc.?).

      We refer the reviewer to point 9, where this question has already been addressed.

      Figure 2A is hard to read. Using different colored dots for different time points might help.

      As suggested by the reviewer, we generated a plot with the individual points color-coded by time.

      The significant figures presented in Figure 4 give a completely inaccurate representation of the variability of the measurements achieved with these techniques.

      Certainly, each measured parameter exhibits inherent biological and technical variability. We have made all the raw data available to the reader through the SourceData files, and this variability is also evident in Figures 1, 2, 3, Supplementary Figure 1, 3, and 5 (Figure 1, 2, 3, 4, 6, and Supplementary Figure 1 in the revised manuscript). Also we have included an additional plot (Supplementary Figure 5 in the revised manuscript) that presents the calculated thin and thick filament lengths and their uncertainty. However, in Figure 4 (Figure 7 in the revised manuscript), our goal was to present an easily understandable visual representation of the sarcomeric structures for each time point, based on the averages of the relevant measurements.

      In line 877, it should be mentioned that the number of filaments is counted per myofibril. The y-axes in the figure should also be adjusted to clarify this.

      As suggested by the reviewer, both the figure legend and the plot have been updated to clearly indicate that the filament count refers to the number per myofibril.

      In line 883, it is not clear what an "independent experiment" refers to (different animal, etc.?).

      We refer the reviewer to point 9, where this question has already been addressed.

      The statement of sample sizes in all figures is a little confusing.

      Following general guidelines, we used SuperPlots to effectively present the data, as nicely demonstrated in the JCB viewpoint article by Lord et al., 2020 (PMID: 32346721). Individual measurements are shown as pooled data points, allowing readers to appreciate the spread, distribution and number of measurements. Overlaid on these pooled dot plots are the mean values from each independent experiment, with error bars representing variability between independent experiments. Sample sizes are provided for both individual measurements and independent experiments. This is now clearly explained in the Materials and Methods section, and we corrected the legends to improve clarity (“n” indicates the number of independent experiments/individual measurements).

      In lines 1007-1008, the authors imply that the lattice model is needed for calculation of myofilament length. However, from the equations and previous data, it seems that this can be estimated using the confocal and dSTORM images.

      As the reviewer correctly noted, myofilament length can be estimated using measurements from confocal and dSTORM images, following the equations provided. However, constructing even a simplified model requires multiple constraints to be defined and applied in a specific order. In practice, one must first determine the number and arrangement of myofilaments in a cross-sectional view of an “average sarcomere” before attempting to build a longitudinal model, where length calculations become relevant. This is now clarified in the text.

      A more specific discussion of future directions is needed to put this paper in context. For example: Can anything from the overall process be used to better understand sarcomere dynamics in larger animals/humans? Can this be applied to disease modelling?

      To address these questions, we have added a section titled STUDY LIMITATIONS, which states: “Our study is focused on describing the growth of IFM sarcomeres during myofibrillogenesis at the level of individual myofilaments. Additionally, we developed a user-friendly software tool for precise sarcomere size measurements and demonstrate that these measurements are sensitive to varying conditions. Whereas, this tool can be used successfully on whole muscle fiber preparations as well, our pipeline was intentionally optimized for individual IFM myofibrils ensuring higher measurement precision in our hands than other type of preparations. Thus, we predict that future work will be required to extend it to sarcomeres from other muscle tissues or species. Nevertheless, our study exemplifies a workflow how to measure sarcomere dimensions precisely. With some variations, it should be possible to adopt it for other muscles, including vertebrate and human striated muscles. To facilitate this and to enhance the accessibility and usability of this dataset, we welcome any feedback and suggestions from researchers in the field.”

      One of the major claims of the paper is that there is a measurable variability with sex and other parameters. However, this data is never clearly summarized, presented (except for supplement), or discussed for its implications.

      We followed the suggestion of the reviewer, and we moved this supplementary data into a main figure, and thoroughly revised the corresponding paragraphs to present and discuss the findings more clearly.

      Minor Comments: 1. Lines 60-65 seem to break the flow of the introduction. As the authors discuss existing methods in literature for IFM analysis in the previous couple sentences, the following sentences should clearly state the limitations of existing methods/current gap in literature and a general idea of what the current work is contributing.

      We agree with this remark, and we substantially revised the Introduction to clearly define the existing gap in the literature and to articulate how our work addresses this gap.

      In line 104, the acronym for ZASPs is not spelled out.

      The acronym has now been spelled out for clarity.

      **Referee Cross-commenting**

      I agree as well.

      Reviewer #1 (Significance (Required)):

      In summary, this paper provides a multi-scale characterization of Drosophila flight muscle sarcomere structure under a variety of conditions, which is potentially a significant contribution for the field. However, the paper scope is overstated in that it does not provide an actual sarcomere model. Further, there are multiple issues with data presentation that impact the readability of the manuscript.

      Although it is somewhat unclear what would be “an actual sarcomere model” for the reviewer, but we cannot accept that we made on overstatement by using the word “model”, because one of the main outcomes of our work are indeed the myofilament level sarcomere models depicted in Figure 4 (Figure 7 in the revised manuscript). As said above, we do not claim that these would be molecular models, or mechanistic models or developmental models, but it makes absolutely nonsense (even in common terms!) that our scaled graphical representations (based on a wealth of measurements) should not be or cannot be called models.

      As to the comment with data presentation, we thank the reviewer for the numerous suggestions, and we substantially revised the manuscript to increase clarity and overall readability.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __ Summary: In this manuscript titled "A myofilament lattice model of Drosophila flight muscle sarcomeres based on multiscale morphometric analysis during development," Görög et al. perform a detailed analysis of morphological parameters of the indirect flight muscle (IFM) of D. melanogaster. The authors start by illustrating the range of measurements reported in the literature for mature IFM sarcomere length and width, showing a need to revisit and determine a standardized measurement. They develop a new Python-based tool, IMA, to analyze sarcomere lengths from confocal micrographs of isolated myofibrils stained with phalloidin and a z-disc marker. Using this tool, they demonstrate that sample preparation (especially mounting medium), as well as fiber type, sex, and age influence sarcomere measurements. Combining IMA, TEM, and STORM data, they measure sarcomere parameters across development, providing a comprehensive and up-to-date set of "standardized" sarcomere measurements. Using these data, they generate a model integrating all of the parameters to model sarcomeres at four discrete timepoints of development, recapitulating key phases of sarcomere formation and growth.

      Major comments: Line 200 & 901 - Figure S1B - The authors make a strong statement about the use of liquid versus hardening media, and it is clear from the image provided in Figure S1 that there is a difference in the apparent sarcomere width. The identity of the "liquid media" versus the "hardening media" should be clearly identified in the Results, in addition to the legend for Figure S1. The authors show that "glycerol-based solutions" increase sarcomere width, but the Materials only list 90% glycerol and PBS. However, a frequently used liquid mounting media is Vectashield. Based on the literature, measurements in liquid Vectashield show diameters significantly less than 2.2 microns observed here with presumably 90% glycerol or PBS. Can the authors qualify this statement, or provide data that all forms of liquid mounting media cause this effect? Does this also apply to hemi-thorax and sectioned preparations, or just isolated myofibrils?

      We used a PBS-based solution containing 90% glycerol as our liquid medium, as now stated in the main text. In response to the reviewer’s suggestion, we also tested a non-hardening version of Vectashield (H-1000). Myofibrils in Vectashield were significantly thicker than those in ProLong Gold but still thinner than those in the 90% glycerol–PBS solution, shown in Figure 2B. The mechanisms that could potentially explain these observations have been described in several studies (Miller et al., 2008; Tanner et al., 2011, 2012). Briefly, IFM is a densely packed macromolecular assembly. Upon removal of the cell membrane, myofibrillar proteins attract water, leading to overhydration of the myofilament lattice. This increases the spacing between filaments, resulting in an expansion of overall myofibril diameter. The extent of hydration depends on the osmolarity of the surrounding medium, as the system eventually reaches osmotic equilibrium. While both liquid media induced significant swelling, the observed differences likely reflect variations in their osmotic properties. In contrast, dehydration - an essential step in electron microscopy sample preparation - reduces the spacing between filaments, making myofibrils appear thinner. This explains why EM micrographs consistently show significantly smaller myofibril diameters (Chakravorty et al., 2017).

              Hardening media such as ProLong Gold introduce additional artifacts: during polymerization, these media shrink, exerting compressive forces on the tissue (Jonkman et al., 2020). We therefore propose that isolated myofibrils first expand due to overhydration in the dissection solution, and are then compressed back toward their *in vivo* dimensions during incubation in ProLong Gold. The average *in vivo* diameter of IFM myofibrils can be estimated without direct measurements, as it is determined by two key factors: (i) the number of myofilaments, which has been quantified in EM cross-sections in several studies (Fernandes & Schöck, 2014; Shwartz et al., 2016; Chakravorty et al., 2017) including our own, and (ii) the spacing between filaments, which can be measured by X-ray diffraction even in live *Drosophila* or under various experimental conditions (Irving & Maughan, 2000; Miller et al., 2008; Tanner et al., 2011, 2012). Our findings suggest that the effects of lattice overhydration and media-induced shrinkage are most pronounced in isolated myofibrils. In larger tissue preparations, the inter-myofibrillar space likely acts as a mechanical and osmotic buffer, reducing the extent of such distortions
      

      Can the authors comment on whether the length of fixation or fixation buffer solution, in addition to the mounting medium, make a difference on sarcomere length and diameter measurements? This is another source of variation in published protocols.

      The effect of fixation time on sarcomere morphometrics in whole-mount IFM preparations has been previously demonstrated by DeAguero et al. (2019), as briefly noted in our manuscript. To extend these findings, we performed a comparison using isolated myofibrils, assessing morphometric parameters after fixation for 10, 20 (standard) and 60 minutes. We found no difference between the 10- and 20-minute fixation conditions; however, fixation for 60 minutes resulted in significantly increased myofibril diameter (and these data are now shown in Supplementary Figure 1C). A comparable increase in thickness was also observed when using a glutaraldehyde-based fixative. These results suggest that more extensively fixed myofibrils may better resist the compressive forces exerted by hardening media.

      Line 237-238. The authors conclude that premyofibrils are much thinner than previously measured. The use of Airyscan to more accurately measure myofibril width at this timepoint is a good contribution, as indeed diffraction and light scatter likely contribute to increased width measured in light microscopy images. I also wonder, though, how well the IMP software performs in measuring width at 36h APF, given how irregular the isolated myofibrils at this stage look (wide z-lines but thinner and weaker H and I bands as shown in Fig. 2B)?

      The reviewer is correct that measurements during the early stages of myofibrillogenesis require additional effort. However, in addition to its automatic mode, IMA can also operate in semi-automatic or manual modes, ensuring complete control over the measurements. Myofibril width is determined from the phalloidin channel at the Z-line (as described in the software’s User Guide and Supplementary Figure 2), where it is at its thickest.

      Also, how much of the difference in sarcomere width arises due to effects of "stripping" components off of the sarcomere at the earliest timepoint (for example alpha-actinin or Zasp proteins)?

      A comparison between isolated myofibrils and those from microdissected muscles (Supplementary Figure 3B, Figure 3C in the revised manuscript) shows that the isolation process does not alter the morphometric measurements of sarcomeres. Moreover, the measured myofibril width aligns well with what we expect based on the number of myofilaments observed in TEM cross-sections of myofibrils at 36 hours APF (Figure 3A, now Figure 4A in the revised manuscript), supporting the consistency of our model.

      Myofibrils at early timepoints do contain more than 4-12 sarcomeres in a line (they extend the full length of the myofiber), so it is possible they are breaking due to the detergent and mechanical disruption induced by the isolation method.

      The reviewer is correct - myofibrils likely span the full length of the myofiber from the onset of myofibrillogenesis. However, during the isolation of individual myofibrils, they often break, and even mature myofibrils typically fragment into pieces of about 300 µm in length (illustrated in Figure 1E, now Figure 2A in the revised manuscript). Importantly, our measurements show that this fragmentation does not affect the assessed sarcomere length or width (as shown in Supplementary Figure 3B, now Figure 3C in the revised manuscript).

      Line 312 - What does "stable association" mean in this context? The authors mention early timepoints lack stable association of alpha-Actinin or Zasp52, and they reference Fig. S4C, but this figure only shows 72h and 24 AE, not 36h and 48 h APF. Previous reports have seen localization of both alpha-Actinin and Zasp52, so presumably the detergent or mechanical isolation is stripping these components off of the isolated myofibrils up until 72h.

      In agreement with previous reports, we also detected both α-Actinin (as shown in former Supplementary Figure 3B, now Figure 3C) and Zasp52 in microdissected IFM starting from 36 hours APF. However, these markers were largely absent from the isolated myofibrils of young pupae (36 to 60 hours APF). By 60 hours APF, strong α-Actinin and Zasp52 staining became evident in isolated myofibrils, whereas dTitin epitopes were clearly detectable from the earliest time point examined. This indicates that some proteins, such as α-Actinin and Zasp52, can be lost during the isolation process, whereas others like dTitin are retained and this differential sensitivity appears to depend on developmental stage. A likely explanation is that α-Actinin and Zasp52 are recruited early to Z-bodies but are only fully incorporated as more mature Z-disks form between 48 and 60 hours APF. This incomplete incorporation at the earlier stages could account for their loss during the isolation process. This interpretation is supported by our morphological analysis of the Z-discs, as shown in the dSTORM dataset (former Figure 3B, B’’, now Figure 4C, E) and in longitudinal TEM sections (former Supplementary Figure 5B, now in Figure 6B). Because α-Actinin and Zasp52 are not detected in isolated myofibrils at 36 and 48 hours APF, they are not included in Figure S4C (Figure 5C in the revised manuscript). This is explained in the updated figure legend.

      This same type of issue comes up again in Lines 325-334, where the authors talk about 3E8 and MAC147. They state that 3E8 signal significantly declines in later stages and that MAC147 is not suitable to label myofibrils in young pupae, but they only show data from 72 APF and 24 AE (which looks to have decent staining for both 3E8 and MAC147). A clearer explanation here would be helpful.

      To put it simply: we used one myosin antibody to label the A-band in the IFM of 36h APF and 48h APF animals, and a different antibody for the 72h APF and 24h AE stages. In more detail: Myosin 3E8 is a monoclonal antibody targeting the myosin heavy chain and labels the entire length of mature thick filaments except for the bare zone (former Supplementary Figure 4D, now in Figure 5D), suggesting its epitope is near the head domain. As a result, we expect a uniform A-band staining - excluding the bare zone - which is exactly what we observe in the IFM of young pupae (36h APF and 48h APF; formerly Figure 3B, now Figure 4C in the revised manuscript). However, at 72h APF and 24h AE, Myosin 3E8 produces a different staining pattern: two narrow stripes flanking the bare zone and two broader, more diffuse stripes near the A/I band junction (former Supplementary Figure 4D, now Figure 5D). This change is likely due to restricted antigen accessibility at these later developmental stages - a common issue in the densely packed IFM - making this antibody unsuitable for reliably measuring thick filament length in these stages.

      MAC147 is another monoclonal antibody against Mhc that recognizes an epitope near the head domain. However, it only works reliably in more mature myofibrils (72h APF and 24h AE; formerly Figure 3B, now Figure 4C in the revised manuscript), likely due to its specificity for a particular Mhc isoform. This is why we do not include images from earlier developmental stages using this antibody. We added a revised, concise explanation in the main text for general readers, and provided a more detailed description for specialist readers in the legend of Supplementary Figure 4D (updated as Figure 5D in the revised manuscript).

      Figure 3B. The authors show the H, Z, and I lengths in B', B', and B' and discuss these lengths in the text (lines 305-320). It would also be nice to actually have the plots showing the measured/calculated lengths for thin and thick filaments. These are mentioned in the results, but I cannot find the plots in the figures and there is no panel reference.

      A summary table of the measured and calculated parameters is provided in Fig4SourceData (Fig7Source Data in the revised manuscript). However, following the reviewer’s suggestion, we also generated an additional plot (Supplementary Figure 5 in the revised manuscript) that displays the calculated thin and thick filament lengths.

      Line 400. Does the model in Figure 4 actually have molecular resolution as the authors claim? From these views, thick and thin filaments appear to be represented by cylindrical objects. Localization of specific molecules would require further modeling with individual proteins. Or do the authors mean localization from STORM imaging relative to the ends of the thick and/or thin filaments? The model itself is a useful contribution, but based on Figure 4, resolution of individual molecules is not evident.

      The reviewer is correct; and we fully agree that we do not present a molecular model of sarcomeres in this study - nor do we claim to. Instead we present a myofilament level model. Nevertheless, the scaled myofilament lattice model we introduce could serve as a geometric constraint when constructing supramolecular models of sarcomeres. As the reviewer rightly notes, implementing such an approach would require additional effort.

      The main Results section of the text is condensed into 4 figures. However, I found myself flipping back and forth between the main figures and the supplement continuously, especially parts of Supplemental Figures 1, 3, 4, and 5. With such large amounts of detail in the Results relying on the supplement, it may be worth considering reorganizing the main and supplemental figures, and having 7 main figures, to include important panels that are currently in the supplement (esp. Fig S1B, S1C, S1D, S3B, S4, S5).

      We found it a very useful suggestion, and we substantially reorganized the figures in the revised manuscript according to the recommendations of the reviewer.

      Minor comments: On the plots in Fig. S1B, D, and F, it is hard to see the color of the dots because the red error bars are on top of them. Can the other distribution dots be tinted the correct color or the x-axis labels be added, so it is clear which dataset is which?

      We significantly enlarged the dots to enhance visual clarity.

      Line 142 needs a reference to Figure S1, Panel E, which shows the accuracy and precision measurements.

      The requested panel reference has now been included in the revised manuscript.

      Lines 198 - is this range from the above publications? Needs to be clearly cited.

      The range has indeed been estimated using measurements from the aforementioned publications, and this point is now further clarified in the revised text.

      Figure S3B is confusing - why do the blow-ups overlap both the top (presumably microdissected) and the bottom (presumably isolated) images? The identity of microdissected images should be labeled, as they are hard to see underneath of the blown-up images and the identity of individual image planes wasn't immediately obvious.

      We refined the panel structure of Figure S3B (Figure 3C in the revised manuscript) to enhance clarity as the reviewer suggested.

      Line 298. By "misaligned," do the authors mean the pointed ends are not uniformly anchored in the z-disc, leading to the wide z-disc measurements? At this early stage, I'm not sure "misaligned" is the right word - perhaps "were not yet aligned in register at the z-disc" or something similar.

      We revised the text for clarity. It now reads: At 36 hours APF, thin filaments had not yet aligned in perfect register at the Z-disc, with most measuring less than 560 nm in length - and exhibiting considerable variability.

      Figure S6 - spelling mistake in label of panel A, "sarcomer" should be "sarcomere"

      The typo is corrected.

      Line 487. Spelling "Zaps52" should be "Zasp52"

      The typo is corrected.

      Line 887. Spelling "Myofilement" should be "Myofilament"

      The typo is corrected.

      Line 946-947. In the legend for Supp. Fig. 3., the authors should specify which published datasets on sarcomere length are shown in the figure by including the references in the legend. Presumably the "isolated individual myofibrils" are the blue "this study" lines, leaving the "microdissected muscles" as the magenta "previous reports" on the figure. Without the reference, it is not clear if these are microdissected, isolated myofibrils, hemi-thorax sections, cryosections, or another preparation method for the "previous reports" data.

      The references have now been added to both the figure and its legend.

      **Referee Cross-commenting**

      I agree with the comments from the other reviewers. Many of the major themes are consistent across the reviews, including regarding the model, preparation methods, and the software tool.

      Reviewer #2 (Significance (Required)):

      Strengths: This manuscript is an important contribution to the field of sarcomere development. The authors use modern technologies to revisit variation in morphometric measurements in the literature, and they identify parameters that influence this variation. Notably, sex-specific differences, DLM versus DVM measurements, and mouting media are potential contributors to the variability. Combining TEM and STORM with a confocal timecourse of isolated myofibrils, they refine previously published values of sarcomere length and width, and add more comprehensive data for filament length, number and spacing. This highly accurate timecourse demonstrates continual growth of sarcomeres after 48 h APF, and correct some inconsistencies from previous large-scale timecourse datasets. These data are very valuable to the field, especially Drosophila muscle biologists, and will serve as a comparative resource for future studies. Weaknesses: At early timepoints, loss of sarcomere components through mechanical or detergent-mediated artifacts may influence the authors' measurements. In addition, isolating myofibrils is not always the most ideal approach, as it loses information on myofiber structure as well as organization and structure of the myofibrils in vivo.

      We believe that the control experiments we presented here adequately demonstrate that sarcomere measurements are not affected by the myofibril isolation process at early timepoints (Figure 3C). Nevertheless, we certainly agree with the reviewer that isolated myofibrils alone cannot capture the entire complexity of muscle tissues, and additional approaches should also be applied in complex projects. Yet, we are confident that our approach offers the most reliable and efficient method for precise morphometric analysis of the sarcomeres, and although alone it is very unlikely to be sufficient to address all questions of a muscle development project, it can still be applied as a very useful and robust tool.

      The point regarding liquid versus hardening mounting media is valuable, but remains to be tested and validated with the diverse liquid and hardening media used by other labs.

      Whereas it would not be feasible for us to test all possible liquid and hardening media used by others in all possible conditions, we tested the effect of Vectashield (the most commonly used liquid media) according to the suggestion of the reviewer, and the results are now included in the manuscript. We think that this is a valuable extension of the list of the materials and conditions we tested, although we need to point out that our primary goal was not necessarily to test as many conditions as possible (because the number of those conditions is virtually endless), rather to raise awareness among colleagues that these variables can significantly impact the data obtained and affect their comparability.

      The IMA software seems to be designed specifically for analysis of isolated myofibrils, and it is unclear if it would work for other types of IFM preparations.

      As stated in the manuscript, IMA is a specialized tool designed for the analysis of individual myofibrils. While it can also process other types of IFM preparations in semi-automatic or manual modes, we believe these approaches compromise both efficiency and accuracy. This is further clarified in the revised manuscript.

      A last point is that TEM and STORM may not be available on a regular basis to many labs, hindering wide implementation of the approach used in this manuscript to generate very accurate and detailed measurements of sarcomere morphometrics.

      Regarding the availability of TEM and STORM, we acknowledge that these techniques are not universally accessible. However, that is exactly one major value of our work that our open-source software tool now allows researchers to generate valuable data using only a confocal microscope in combination with our published datasets.

      Audience: Scientists who study sarcomerogenesis or Drosophila muscle biology.

      My expertise: I study muscle development in the Drosophila model.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __ Summary: This manuscripts presents a computational tool to quantify sarcomere length and myofibril width of the Drosophila indirect flight muscles, including developmental samples. This tool was applied to confocal and STORM super-resolution images of isolated myofibrils from adult and developing flight muscles. Thick filament numbers per myofibril were counted during development of flight muscles. A myofilament model of developing flight muscle myofibrils is presented that remains speculative for the early developmental stages.

      Major comments: 1. The title of the manuscript appears unclear. What is a lattice model? Lattice is an ordered array. The filament array parameters for mature flight muscles was aready measured. It appears that the authors speculate how this order might be generated during sarcomere assembly, which is not studied in this manuscript as it is limited to periodic arrays after 36h APF.

      As the reviewer correctly points out, a lattice refers to an ordered array - in the case of IFM sarcomeres, this includes both thin and thick filaments. Therefore, the phrase "myofilament lattice model of Drosophila flight muscle sarcomeres" specifically describes a model representing the spatial organization of these filament arrays within the sarcomere. To provide additional clarity for readers, we have revised the title to include more context. It now reads: Developmental Remodeling of Drosophila Flight Muscle Sarcomeres: A Scaled Myofilament Lattice Model Based on Multiscale Morphometrics

      To create a model of these arrays, three essential pieces of information are required:

      1) The length of the filaments,

      2) The number of filaments, and

      3) The relative position of the filaments.

      While some direct measurements are available in the literature, and others can be used to calculate the necessary values, available data is often contradictory or simply different from each other (as described in our ms) making them unsuitable for constructing scaled models of the myofilament arrays. In contrast to that, here we present a comprehensive and consistent set of measurements that enabled us to build models not only of mature sarcomeres but also of sarcomeres at three other significant developmental time points.

      Regarding the mention of "sarcomere assembly" in line 37, we intended it to refer to the growth of the sarcomeres, not their initial formation. We do not speculate about sarcomere assembly anywhere in the text. In fact, we have clearly stated multiple times that our focus is on the growth of the IFM myofilament array during myofibrillogenesis. Nevertheless, to avoid confusion, we revised the phrase in line 37 to "sarcomere growth".

      The authors review the flight muscle sarcomere length literature and conclude it is variable because of imprecise measurements. Likely this is partially true, however, more importantly is that the sarcomere length and width changes during isolation methods of the myofibrils, as well as by various embedding methods, as the authors show here as well in Figure 1B-E.

      We dedicated two sections of the Results - “An automated method to accurately measure sarcomeric parameters” and “IFM sarcomere morphometrics are affected by sex, age, fiber type, and sample preparation” - to exploring potential sources of variability in published IFM sarcomere measurements. Based on these analyses, we conclude that such variability stems from both measurement imprecision and biological or technical factors, including sex, age, fiber type and, of foremost, sample preparation. Because it is difficult to quantify the relative impact of each variable across published studies, we have refrained from speculations about the relative contribution of the different factors in the revised manuscript.

      Hence, I find the strongly claims the authors make here surprising, while they are isolating the myofibrils. Hence, these myofibrils are ruptured at the ends, relaxed or contracted, depending on buffer choice and passive tension is released. On page 8, the authors correctly state that the embedding medium causes shrinkage of the myofibrils. While isolation is state of the art for electron microscopy techniques, other methods including sectioning or even whole mount preparation have been developed for high resolution microscopy of IFMs that avoid these artifacts. Unfortunately, this manuscript only uses isolated myofibrils that were fixed and then mechanically dissociated by pipetting. This method likely induces variations as seen by the large spread of sarcomere length reported in Figure 1C (2.8-3.9µm?) and even bigger spreads for myofibril widths. Are these also seen in tissue without dissections? Unfortunately, no comparision to intact flight muscles are reported with the here presented quantification tool. The sarcomere length spread in the developmental samples is even larger.

      The major issue raised in this paragraph is the use of isolated myofibril versus intact flight muscle preparations. The reviewer claims that the latter might be superior because the isolated myofibrils are ruptured at their ends. Clearly, the intact IFMs cannot be imaged in vivo by light microscopy because the adult fly cuticle is opaque. To visualize these muscles, one must open the thorax, but neither microdissection nor sectioning preserves them perfectly, even the cleanest longitudinal cuts sever some myofibrils, and dissection itself can damage the tissue. Although published images often show only the most pristine regions, the practice of selective cropping cannot be taken as a scientific argument. Here, by comparing sarcomere lengths measured in isolated myofibrils with those from whole-mount longitudinal DLM sections and microdissected IFM myofibers, we demonstrate that isolation does not alter sarcomere length (Figure 1E, now Figure 2A in the revised manuscript). As to myofibril width, it is determined by two parameters: the number of myofilaments and the spacing between them. In vivo filament spacing has been measured directly, and filament counts can be obtained from EM cross-sections of DLM fibers. Combining these values gives an expected in vivo myofibril diameter. While isolated myofibrils measure thinner than those in whole-mount or microdissected samples (Figure 1E, now Figure 2A in the revised manuscript), their diameter closely matches this in vivo estimate (see manuscript, lines 187–198). Therefore, we conclude that isolated myofibrils (even if it seems counterintuitive for this reviewer) are superior for sarcomere measurements than whole-mount preparations - and that is why we primarily rely on them here.

      Despite that, we certainly recognize that isolated myofibrils cannot recapitulate every aspect of an IFM fiber, and the need for whole-mount preparations during our IFM studies is not questioned by us.

              In addition to this general answer to the issues raised in the above paragraph of the reviewer, we would like to specifically reflect for some of the remarks:
      

      „Unfortunately, this manuscript only uses isolated myofibrils that were fixed and then mechanically dissociated by pipetting.”

      This is a false statement that “this manuscript only uses isolated myofibrils” as we used different preparation methods for initial comparisons (see Figure 1E, now Figure 2A in the revised manuscript). Additionally, unlike the reviewer assumed, the myofibrils were first dissociated and then fixed, and not vice versa (as described in the Materials and Methods section).

      „This method likely induces variations as seen by the large spread of sarcomere length reported in Figure 1C (2.8-3.9µm?) and even bigger spreads for myofibril widths. Are these also seen in tissue without dissections?”

      This remark makes absolutely no sense, as we do not report sarcomere length values in Figure 1C at all. By assuming that the reviewer meant to refer to Figure 1B, it still remains a misunderstanding or a false statement, because that panel refers to the variations found in published data (not in our current data), and this is clearly explained both in the figure legend and the main text. Regardless of that, the stated spread does not appear unusual. In the article by Spletter et al. (2018), the authors report a similar spread (2.576–3.542 µm) for sarcomere length in mature IFM using whole-mount DLM cross-sections. As to the second question here, we do observe a comparable spread in other preparations as well (see Figure 1E, now Figure 2A in the revised manuscript), which is again the opposite conclusion as compared to the (clearly false) assumption of the reviewer.

      „Unfortunately, no comparision to intact flight muscles are reported with the here presented quantification tool. „

      This is also a false statement; as we do report comparison to whole mount cross sections which we belive the reviewer considers „intact” in Figure 1E (Figure 2A in the revised manuscript).

      „The sarcomere length spread in the developmental samples is even larger.”

      The spread is not larger at all than in previous reports, as clearly shown in Supplementary Figure 3A.

      The authors suggest that there are sex differences in sarcomere length and pupal development duration. This is potentially interesting, unfortunately they then use mixed sex samples to analyse sarcomeres during flight muscle development.

      In the revised manuscript, we now provide a more detailed description of a subtle post-eclosion difference in IFM sarcomere metrics between male and female Drosophila. We attribute this variation to the well-established observation that female pupae develop slightly faster than males, a property that may last till shortly after eclosion. Confirming this experimentally would require considerable effort with limited scientific benefit. Nonetheless, the subtle nature of this sex-linked variation reinforced our decision to include IFM sarcomeres from both male and female flies in our comprehensive developmental analysis.

      The IMA software tool lacks critical assessment of its performance compared to other tools and the validation presented is too limited. IMA seems to generate systematic errors, based on Fig S1E, as it does not report the ground truth. These have to be discussed and compared to available tools. The principles of fitting used in IMA seem well adapted to IFM myofibrils in low noise conditions, but may not be usable in other situations. This should be assessed and discussed.

      IMA is a specialized software tool developed to address a specific need, notably, to accurately and efficiently measure sarcomere length and myofibril diameter in individual IFM myofibril images labeled with both phalloidin and Z-disc markers. For our purposes, it remains the most suitable and reliable option, and we are confident that IMA outperforms all other available tools. To demonstrate this, we have included a table comparing the few alternatives (MyofibrilJ, SarcGraph, and sarcApp) capable of both measurements, which further supports our conclusion. Given IMA's focused application, extensive validation under artificially low signal-to-noise conditions is unnecessary. While IMA may introduce minor systematic errors (~0.01 µm for sarcomere length and ~0.03 µm for myofibril diameter), these are negligible errors relative to the limitations of the simulated ground truth data used for benchmarking. This point is now addressed in the manuscript.

      It is claimed that validation was achieved on simulated IFM images: do the authors rather mean simulated isolated IFM myofibril images? This is not quite the same in terms of algorithm complexity and this should be corrected if this is the case.

      Indeed, we used simulated individual IFM myofibril images, where both phalloidin labeling and Z-disc labeling are present. This is clearly shown in Supplementary Figure 1A, and stated in the text when first introduced: „we generated artificial images of IFM myofibrils with known dimensions, simulating the image formation process”

      The authors need to revise their comparison to other tools. It is incomplete and seemingly incorrect. It should be clearly stated that IMA is limited to isolated myofibrils, which is a far easier segmentation task than what other tools can do, such as sarcApp (Neininger-Castro et al. 2023, PMID: 37921850). Defining the acronym would be valuable in that sense. The claim line 129-130 "none can adequately measure myofibril diameter from regular side view images" is unclear. What do the authors refer to as "side view images"? Sarc-Graph from Zhao et al 2021, PMID: 34613960, and sarcApp from Neininger-Castro et al. 2023 provide sarcomere width, in conditions that are very similar to what IMA does, e.g. on xy images based on the documentation provided on github. A performance comparison with these tools would be valuable. Does installation and use of IMA require computational skills?

      Motivated by the reviewer’s comments, we revised the section introducing IMA. However, we chose not to include an extensive comparison with other software tools, as this would divert the manuscript’s focus without impacting the main conclusions. Instead, we added a summary table highlighting the key requirements for analyzing IFM sarcomere morphometrics from Z-stacks of phalloidin- and Z-line-labeled individual myofibrils and compared the available tools accordingly. In our experience, most software tools are developed to address very specific problems, even those marketed as general-purpose solutions. Consequently, applying them beyond their intended scope often results in reduced efficiency and suboptimal performance. Although sarcApp was initially available as a free tool, one of its dependencies (PySimpleGUI 5) has since adopted a commercial license model. Using a trial version of PySimpleGUI 5, we evaluated sarcApp on our dataset. The software is limited to single-plane image input, hence raw image stacks must be preprocessed into a suitable format, which is a time consuming step. Furthermore, implementation requires basic programming proficiency, as parameter adjustments must be performed directly within the source code to accommodate dataset-specific configurations. Once appropriately configured, sarcApp reliably quantifies both sarcomere length and myofibril width with accuracy comparable to that of IMA. However, it lacks built-in diagnostic feedback or visualization tools to facilitate measurement verification or troubleshooting during batch processing. SarcGraph also supports only single-plane image inputs and requires prior image preprocessing. Additionally, images must be loaded manually one by one, which further reduces processing efficiency. Parameter optimization relies on direct code modification through a trial-and-error process, demanding a certain level of programming proficiency. Even with these adjustments, the software frequently introduces artifacts - such as Z-line splitting - when applied to our dataset. Even when segmentation is successful, sarcomere length is often overestimated, whereas myofibril diameter is consistently underestimated. As compared to these issues, IMA was designed for ease of use and does not require any programming experience to install or operate. It can automatically handle raw microscopic image formats without the need for preprocessing. Segmentation is fully automated, with no requirement for parameter tuning. The tool provides visual feedback during both the segmentation and fitting steps, allowing users to confidently assess and validate the results. IMA produces accurate and precise measurements of sarcomere length and diameter. Batch processing is enabled by default, significantly improving efficiency when analyzing multiple images. Finally, unlike the reviewer stated, IMA is not limited to isolated myofibrils. It is optimized for isolated myofibrils (i.e. full performance is achieved on these samples), but it can also work on whole-mount preparations in semi-automatic and manual mode, which still allow precise measurements (with some reduction in processing efficiency).

      As to the minor comments, the acronym IMA was already defined in lines 541 and 917–918 of the original submission, as well as on the software’s GitHub page. Additionally, we replaced the phrase "side view images" with "longitudinal myofibril projections" to improve clarity.

      How do the authors know that the bright phallodin signal visible that the Z-disc at 36h and 48h APF is due to actin filament overlap, as suggested? An alternative solution are more short actin filaments at the early Z-discs.

      It is widely accepted that the bright phalloidin signal at the Z-line in mature sarcomeres reflects actin filament overlap (e.g., Littlefield and Fowler, 2002; PMID: 11964243). Accordingly, in slightly stretched myofibrils, this bright signal diminishes, and in more significantly stretched myofibrils, a small gap appears (e.g., Kulke et al., 2001; PMID: 11535621). The width of this bright phalloidin signal corresponds to the electron-dense band seen in longitudinal EM sections (Figure 3B and Supplementary Figure 5B, now Figure 4B and Figure 6B in the revised manuscript) and matches the actin filament overlap observed in Z-disc cryo-EM reconstructions from other species (Yeganeh et al., 2023; Rusu et al., 2017), where individual thin filaments can be resolved. By extension, we interpret the bright phalloidin signals at the Z-discs observed at 36 h and 48 h APF as arising from similar actin filament overlaps, given their comparable width to the electron-dense Z-bodies described both in our study (Supplemantary Figure 5B, now Figure 6B in the revised manuscript) and by Reedy and Beall (1993). While we cannot fully rule out the reviewer’s alternative interpretation, for the time being it remains a bold speculation without supporting evidence, and therefore we prefer to stay with the conventional view.

      The authors seem to doubt their own interpretation that actin filaments shrink when reading line 304 and following. This is obviously critical for the "model" presented.

      Unlike the reviewer implies, we certainly do not doubt our own interpretation, but to avoid confusion we revised the corresponding paragraph in the manuscript and provided more details on our explanation, and we also provide a brief overview of it here. Between 36 h and 48 h APF we observe a pronounced structural transition in the IFM sarcomeres. In EM cross-sections, the previously irregular myofilament lattice becomes organized into a regular hexagonal pattern (Figure 3A, now Figure 4A in the revised manuscript) with filament spacing typical of mature myofibrils (Supplementary Figure 5A, now Figure 6A in the revised manuscript). In longitudinal EM sections, the elongated, amorphous Z-bodies condense along the myofibril axis to form well-defined, adult-like Z-discs (Supplementary Figure 5B, now Figure 6B in the revised manuscript). Similarly, dSTORM imaging shows that the Z-disc associated D-Titin epitopes become more compact and organized during this period (Supplementary Figure 4E, now Figure 5E in the revised manuscript). The edges of the thick filament arrays also become more sharply defined, and the appearance of a distinct bare zone indicates the establishment of a regular register (Figure 3B, now Figure 4B in the revised manuscript). By assuming that a similar reorganization occurs within the thin filament array, the apparent length of the thin filament array would decrease—not due to shortening of individual filaments, rather due to improved alignment. Although we cannot directly resolve single thin filaments, this reorganization offers the most plausible explanation for the observed change.

      Minor comments: 1. Figure S1B is not called out in the text.

      The reviewer might have missed this, but in fact, it is explicitly called out in line 181.

      Fig. 1: Please state whenever images are simulations?

      We appreciate the reviewer’s observation that the simulated IFM myofibril images are indistinguishable from the real ones, as this confirms the adequacy of these images for testing our software tool. However, this is already clearly indicated: Figure 1B features simulated images, as noted in the figure legend (line 824), and Supplementary Figure 1A similarly shows simulated images, as stated both in the legend (line 886) and in the figure.

      Fig. 2: Length-width correlation - please provide individual points color-coded by time point?

      As suggested by the reviewer, we generated a plot with the individual points color-coded by time.

      "newly eclosed males and females, we observed that males have slightly shorter sarcomeres and narrower myofibrils". Please provide a statistical test supporting the difference.

      In the revised manuscript, we compared sarcomere length and myofibril width between males and females from 0 to 96 hours AE using a two-way ANOVA with Sidak’s multiple comparisons test. We expanded our description of these observations in the main text, and details of the statistical analysis are now included in the revised figure legend (Figure 1E). Briefly, newly eclosed males showed slightly shorter sarcomeres than females - a consistent but non-significant trend (p = 0.9846) - which resolved by 12 h AE, with sarcomere lengths remaining similar thereafter (p = 0.1533; Figure 1E). In contrast, myofibril width was significantly narrower in the newly eclosed males (p = 0.0374), but this difference disappeared between 24 and 48 h AE as myofibrils expanded in diameter during post-eclosion development (p

      Were statistical tests performed using animals as sample numbers? Please clarify in the images what are animal and what are sarcomere numbers.

      Following standard guidelines, statistical tests were performed using the means of independent experiments, as noted in the figure legends. For each experiment, we used approximately 6 animals, and this information is now included in the Materials and Methods section.

      mef2-Gal4 should be spelled Mef2-GAL4 according to Flybase.

      This has been corrected in the revised text and figures.

      Are the images shown in Figure 2B representative? 96h AE appears thicker than 24h AE but the graph reports no difference.

      We aimed to show representative images, however, in the case of 96h APF we may have selected a wrong example. We now changed the image for a more appropriate one.

      The authors only found Zasp52 and alpha-Actinin at the Z-discs from 72h APF onwards, which is different to what others have reported.

      Similarly to former reports, we detected both α-Actinin (see Supplementary Figure 3B, now Figure 3C in the revised manuscript) and Zasp52 in microdissected IFMs as early as 36 hours APF. However, these markers were largely absent in isolated myofibrils from the early pupal stages (36–60 hours APF). By 60 hours APF, strong α-Actinin and Zasp52 signals were clearly visible in isolated myofibrils (the closest timepoint captured by dSTORM is 72h APF). As discussed in the manuscript, a likely explanation is that α-Actinin and Zasp52 are recruited to developing Z-bodies early on but are only fully incorporated into mature Z-discs between 48 and 60 hours APF. Their incomplete integration at earlier stages may lead to their loss during the isolation procedure.

      Thick filament length during development has also been estimated by Orfanos and Sparrow, which should be cited (PMID: 23178940)

      Contrary to the reviewer’s claim, the article 'Myosin isoform switching during assembly of the Drosophila flight muscle thick filament lattice' does not provide any measurements or estimates of thick filament length; it only includes a schematic illustration where the length of the thick filaments is not based on empirical data.

      **Referee Cross-commenting**

      I also agree with my colleagues comments, which are largely consistent.

      Reviewer #3 (Significance (Required)):

      This paper introduces a tool to measure sarcomere length. Easy to use tools that do this as well already exist. The tool can also measure sarcomere width, which it claims as unique point, which is not the case, see above comment.

      We are aware that other tools exist to measure sarcomere parameters (and we did not claim the opposite in our ms), nevertheless, we need to emphasize that based on our comparisons, IMA is superior to all three alternatives. Three software tools could, in principle, be used to measure both sarcomere length and myofibril diameter: MyofibrilJ, SarcGraph, and sarcApp. However, two of them - MyofibrilJ and SarcGraph - consistently under- or overestimate these values. The only tool capable of performing these measurements reliably, sarcApp, is no longer freely available, it requires programming expertise, and it does not support raw image file formats, making it difficult to use in practice (see above comments for more details). In contrast, IMA is user-friendly and does not require any programming expertise to install or operate. It can automatically process raw microscopic image formats without the need for preprocessing. Segmentation is fully automated, and no parameter tuning is necessary. The tool offers visual feedback on both the segmentation and fitting processes, enabling users to validate results with confidence. IMA delivers accurate and precise measurements of sarcomere length and diameter. Additionally, batch processing is enabled by default, significantly enhancing workflow efficiency.

      This manuscript shows that depending on the isolation and embedding media sarcomere and myofibrils width changes and hence artifacts can be introduced. While this is not suprising, it has not been well controlled in a number of previous publications.

      Furthermore, this paper measures sarcomere length and width during flight muscle development and consolidates what was already known from previous publications. Sarcomeres are added until 48 h APF, then they grow in diameter. Despite strong claims in the text, I do not see any significant novel findings how sarcomeres grow in length or width or any significant deviations from what has been published before. This is even documented in the supplementary graphs by comparing to published data. It is close to identical.

      The overall process has been quantitatively described in four previous studies (Reedy and Beall, 1993, Orfanos et al., 2015, Spletter et al., 2018, Nikonova et al., 2024). While there is general agreement on the pattern of sarcomere development, significant discrepancies exist among these datasets; differences that become particularly problematic when attempting to build structural models. More specifically: Reedy and Beall (1993) report substantially shorter sarcomeres compared to all other datasets, including ours. This discrepancy likely stems from two factors: (i) their use of longitudinal EM sections, where sample preparation is known to cause considerable tissue shrinkage; and (ii) the maintenance of their flies at 23 °C, a temperature that clearly delays development relative to the more commonly used 25 °C. Interestingly, Spletter et al. (2018) and Nikonova et al. (2024) conducted their experiments at 27 °C, which also deviates from standard conditions and may complicate comparisons. Orfanos et al. (2015) suggested that mature sarcomere length is reached by approximately 88 hours after puparium formation (APF). In contrast, our measurements show that sarcomeres continue to elongate beyond this point, reaching mature length between 12 and 24 hours post-eclosion. All four earlier studies report a mature sarcomere length around 3.2-3.3 µm, only slightly longer than the ~3.2 µm length of thick filaments (Katzemich et al., 2012; Gasek et al., 2016). This would imply an I-band length below ~100 nm, which is an implausibly short distance. In contrast, our data, along with several recent studies (González-Morales et al., 2019; Deng et al., 2021; Dhanyasi et al., 2020; DeAguero et al., 2019), support a mature sarcomere length of approximately 3.45 µm, placing the length of the I-band at around 250 nm. This estimate is more consistent with high-resolution structural observations from longitudinal EM sections and fluorescent nanoscopy (Szikora et al., 2020; Schueder et al., 2023). Although Reedy and Beall (1993) provide limited data on myofibril diameter during myofibrillogenesis, a more detailed quantitative analysis is presented by Spletter et al. (2018) and by Nikonova et al. (2024). Interestingly, Spletter et al. report two separate datasets - one based on longitudinal sections and another on cross-sections of DLM fibers. While the measurements are consistent during early pupal stages, they diverge significantly in mature IFMs (1.116 ± 0.1025 µm vs. 1.428 ± 0.0995 µm), a discrepancy that is not addressed in their publication. Nikonova et al. (2024) report even narrower myofibril widths (0.9887 ± 0.1273 µm). Moreover, the reported diameters of early myofibrils in all three datasets are nearly twice as large as those reported by Reedy and Beall (1993) and in our own measurements, directly contradicting the reviewer's claim that the values are “close to identical.” Finally, our data clearly demonstrate that both the length and diameter of IFM sarcomeres reach a plateau in young adults, which is a key developmental feature not examined in previous studies.

      In summary, we did not and we do not intend to claim that our conclusions are novel as to the general mechanisms of myofibril and sarcomere growth. Rather, our contribution lies in providing a high-precision, robust analysis of the growth process using a state-of-the-art toolkit, resulting in a comprehensive description that aligns with structural data obtained from TEM and dSTORM. We therefore believe that expert readers will recognize numerous valuable aspects of our approaches that will advance research in the field.

      Counting the total number of thick filaments during myofibril development is nice, however, this also has been done (REEDY, M. C. & BEALL, C. 1993, PMID: 8253277). In this old study, the authors reported the amount of filament across one myofibril. How does this compare to the new data here counting all filaments? Unfortunatley, this is not discussed.

      Indeed, the study by Reedy and Beall (1993) was primarily based on longitudinal DLM sections, which were used to estimate myofibril width and count the number of thick filaments on this lateral view images (e.g., ~15 thick filaments wide at 75 hours APF), but total thick filament numbers were not provided. While such data could theoretically be used to estimate the number of myofilaments per myofibril, these estimations would depend on the unverified assumption that the section includes the full width of the myofibril. Additionally, the study did not provide standard deviations or the number of measurements, limiting the interpretability and reproducibility of their findings. These points highlight the need for a more rigorous and quantitative approach. For these reasons, we chose to quantify myofilament number using cross-sections, providing more accurate and reliable assessments.

      Besides the difference between the lateral versus cross sections, a direct comparison of our studies is further complicated by differences in the developmental time points and experimental conditions used. Reedy and Beall (1993) reports data from pupae aged 42, 60, 75 and 100 hours, as well as from adults, whereas we present data from 36, 48, and 72 hours APF, and from 24 hours after eclosion, which corresponds to approximately 124 hours APF. Moreover, their experiments were carried out at 23 °C, a temperature that somewhat slows down pupal development and results in adult eclosion at around 112 hours APF, as stated in their study. In contrast, our experiments were carried out at the more commonly used 25 °C, where adults typically emerge around 100 hours APF.

      Collectively, these differences prevented meaningful comparisons between the two datasets, and therefore we preferred to avoid lengthy discussions on this issue.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscripts presents a computational tool to quantify sarcomere length and myofibril width of the Drosophila indirect flight muscles, including developmental samples. This tool was applied to confocal and STORM super-resolution images of isolated myofibrils from adult and developing flight muscles. Thick filament numbers per myofibril were counted during development of flight muscles. A myofilament model of developing flight muscle myofibrils is presented that remains speculative for the early developmental stages.

      Major comments:

      1. The title of the manuscript appears unclear. What is a lattice model? Lattice is an ordered array. The filament array parameters for mature flight muscles was aready measured. It appears that the authors speculate how this order might be generated during sarcomere assembly, which is not studied in this manuscript as it is limited to periodic arrays after 36h APF.

      2. The authors review the flight muscle sarcomere length literature and conclude it is variable because of imprecise measurements. Likely this is partially true, however, more importantly is that the sarcomere length and width changes during isolation methods of the myofibrils, as well as by various embedding methods, as the authors show here as well in Figure 1B-E.

      Hence, I find the strongly claims the authors make here surprising, while they are isolating the myofibrils. Hence, these myofibrils are ruptured at the ends, relaxed or contracted, depending on buffer choice and passive tension is released. On page 8, the authors correctly state that the embedding medium causes shrinkage of the myofibrils. While isolation is state of the art for electron microscopy techniques, other methods including sectioning or even whole mount preparation have been developed for high resolution microscopy of IFMs that avoid these artifacts. Unfortunately, this manuscript only uses isolated myofibrils that were fixed and then mechanically dissociated by pipetting. This method likely induces variations as seen by the large spread of sarcomere length reported in Figure 1C (2.8-3.9µm?) and even bigger spreads for myofibril widths. Are these also seen in tissue without dissections? Unfortunately, no comparision to intact flight muscles are reported with the here presented quantification tool. The sarcomere length spread in the developmental samples is even larger.

      1. The authors suggest that there are sex differences in sarcomere length and pupal development duration. This is potentially interesting, unfortunately they then use mixed sex samples to analyse sarcomeres during flight muscle development.

      2. The IMA software tool lacks critical assessment of its performance compared to other tools and the validation presented is too limited. IMA seems to generate systematic errors, based on Fig S1E, as it does not report the ground truth. These have to be discussed and compared to available tools. The principles of fitting used in IMA seem well adapted to IFM myofibrils in low noise conditions, but may not be usable in other situations. This should be assessed and discussed.

      It is claimed that validation was achieved on simulated IFM images: do the authors rather mean simulated isolated IFM myofibril images? This is not quite the same in terms of algorithm complexity and this should be corrected if this is the case.

      1. The authors need to revise their comparison to other tools. It is incomplete and seemingly incorrect. It should be clearly stated that IMA is limited to isolated myofibrils, which is a far easier segmentation task than what other tools can do, such as sarcApp (Neininger-Castro et al. 2023, PMID: 37921850). Defining the acronym would be valuable in that sense. The claim line 129-130 "none can adequately measure myofibril diameter from regular side view images" is unclear. What do the authors refer to as "side view images"? Sarc-Graph from Zhao et al 2021, PMID: 34613960, and sarcApp from Neininger-Castro et al. 2023 provide sarcomere width, in conditions that are very similar to what IMA does, e.g. on xy images based on the documentation provided on github. A performance comparison with these tools would be valuable. Does installation and use of IMA require computational skills?

      2. How do the authors know that the bright phallodin signal visible that the Z-disc at 36h and 48h APF is due to actin filament overlap, as suggested? An alternative solution are more short actin filaments at the early Z-discs. The authors seem to doubt their own interpretation that actin filaments shrink when reading line 304 and following. This is obviously critical for the "model" presented.

      Minor comments:

      1. Figure S1B is not called out in the text.

      2. Fig. 1: Please state whenever images are simulations?

      3. Fig. 2: Length-width correlation - please provide individual points color-coded by time point?

      4. "newly eclosed males and females, we observed that males have slightly shorter sarcomeres and narrower myofibrils". Please provide a statistical test supporting the difference.

      5. Were statistical tests performed using animals as sample numbers? Please clarify in the images what are animal and what are sarcomere numbers.

      6. mef2-Gal4 should be spelled Mef2-GAL4 according to Flybase.

      7. Are the images shown in Figure 2B representative? 96h AE appears thicker than 24h AE but the graph reports no difference.

      8. The authors only found Zasp52 and alpha-Actinin at the Z-discs from 72h APF onwards, which is different to what others have reported.

      9. Thick filament length during development has also been estimated by Orfanos and Sparrow, which should be cited (PMID: 23178940)

      Referee Cross-commenting

      I also agree with my colleagues comments, which are largely consistent.

      Significance

      This paper introduces a tool to measure sarcomere length. Easy to use tools that do this as well already exist. The tool can also measure sarcomere width, which it claims as unique point, which is not the case, see above comment.

      This manuscript shows that depending on the isolation and embedding media sarcomere and myofibrils width changes and hence artifacts can be introduced. While this is not suprising, it has not been well controlled in a number of previous publications.

      Furthermore, this paper measures sarcomere length and width during flight muscle development and consolidates what was already known from previous publications. Sarcomeres are added until 48 h APF, then they grow in diameter. Despite strong claims in the text, I do not see any significant novel findings how sarcomeres grow in length or width or any significant deviations from what has been published before. This is even documented in the supplementary graphs by comparing to published data. It is close to identical.

      Counting the total number of thick filaments during myofibril development is nice, however, this also has been done (REEDY, M. C. & BEALL, C. 1993, PMID: 8253277). In this old study, the authors reported the amount of filament across one myofibril. How does this compare to the new data here counting all filaments? Unfortunatley, this is not discussed.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript titled "A myofilament lattice model of Drosophila flight muscle sarcomeres based on multiscale morphometric analysis during development," Görög et al. perform a detailed analysis of morphological parameters of the indirect flight muscle (IFM) of D. melanogaster. The authors start by illustrating the range of measurements reported in the literature for mature IFM sarcomere length and width, showing a need to revisit and determine a standardized measurement. They develop a new Python-based tool, IMA, to analyze sarcomere lengths from confocal micrographs of isolated myofibrils stained with phalloidin and a z-disc marker. Using this tool, they demonstrate that sample preparation (especially mounting medium), as well as fiber type, sex, and age influence sarcomere measurements. Combining IMA, TEM, and STORM data, they measure sarcomere parameters across development, providing a comprehensive and up-to-date set of "standardized" sarcomere measurements. Using these data, they generate a model integrating all of the parameters to model sarcomeres at four discrete timepoints of development, recapitulating key phases of sarcomere formation and growth.

      Major comments:

      • Line 200 & 901 - Figure S1B - The authors make a strong statement about the use of liquid versus hardening media, and it is clear from the image provided in Figure S1 that there is a difference in the apparent sarcomere width. The identity of the "liquid media" versus the "hardening media" should be clearly identified in the Results, in addition to the legend for Figure S1. The authors show that "glycerol-based solutions" increase sarcomere width, but the Materials only list 90% glycerol and PBS. However, a frequently used liquid mounting media is Vectashield. Based on the literature, measurements in liquid Vectashield show diameters significantly less than 2.2 microns observed here with presumably 90% glycerol or PBS. Can the authors qualify this statement, or provide data that all forms of liquid mounting media cause this effect? Does this also apply to hemi-thorax and sectioned preparations, or just isolated myofibrils?

      • Can the authors comment on whether the length of fixation or fixation buffer solution, in addition to the mounting medium, make a difference on sarcomere length and diameter measurements? This is another source of variation in published protocols.

      • Line 237-238. The authors conclude that premyofibrils are much thinner than previously measured. The use of Airyscan to more accurately measure myofibril width at this timepoint is a good contribution, as indeed diffraction and light scatter likely contribute to increased width measured in light microscopy images. I also wonder, though, how well the IMP software performs in measuring width at 36h APF, given how irregular the isolated myofibrils at this stage look (wide z-lines but thinner and weaker H and I bands as shown in Fig. 2B)? Also, how much of the difference in sarcomere width arises due to effects of "stripping" components off of the sarcomere at the earliest timepoint (for example alpha-actinin or Zasp proteins)? Myofibrils at early timepoints do contain more than 4-12 sarcomeres in a line (they extend the full length of the myofiber), so it is possible they are breaking due to the detergent and mechanical disruption induced by the isolation method.

      • Line 312 - What does "stable association" mean in this context? The authors mention early timepoints lack stable association of alpha-Actinin or Zasp52, and they reference Fig. S4C, but this figure only shows 72h and 24 AE, not 36h and 48 h APF. Previous reports have seen localization of both alpha-Actinin and Zasp52, so presumably the detergent or mechanical isolation is stripping these components off of the isolated myofibrils up until 72h. This same type of issue comes up again in

      • Lines 325-334, where the authors talk about 3E8 and MAC147. They state that 3E8 signal significantly declines in later stages and that MAC147 is not suitable to label myofibrils in young pupae, but they only show data from 72 APF and 24 AE (which looks to have decent staining for both 3E8 and MAC147). A clearer explanation here would be helpful. Figure 3B. The authors show the H, Z, and I lengths in B', B', and B' and discuss these lengths in the text (lines 305-320). It would also be nice to actually have the plots showing the measured/calculated lengths for thin and thick filaments. These are mentioned in the results, but I cannot find the plots in the figures and there is no panel reference.

      • Line 400. Does the model in Figure 4 actually have molecular resolution as the authors claim? From these views, thick and thin filaments appear to be represented by cylindrical objects. Localization of specific molecules would require further modeling with individual proteins. Or do the authors mean localization from STORM imaging relative to the ends of the thick and/or thin filaments? The model itself is a useful contribution, but based on Figure 4, resolution of individual molecules is not evident.

      • The main Results section of the text is condensed into 4 figures. However, I found myself flipping back and forth between the main figures and the supplement continuously, especially parts of Supplemental Figures 1, 3, 4, and 5. With such large amounts of detail in the Results relying on the supplement, it may be worth considering reorganizing the main and supplemental figures, and having 7 main figures, to include important panels that are currently in the supplement (esp. Fig S1B, S1C, S1D, S3B, S4, S5).

      Minor comments:

      • On the plots in Fig. S1B, D, and F, it is hard to see the color of the dots because the red error bars are on top of them. Can the other distribution dots be tinted the correct color or the x-axis labels be added, so it is clear which dataset is which?

      • Line 142 needs a reference to Figure S1, Panel E, which shows the accuracy and precision measurements.

      • Lines 198 - is this range from the above publications? Needs to be clearly cited.

      • Figure S3B is confusing - why do the blow-ups overlap both the top (presumably microdissected) and the bottom (presumably isolated) images? The identity of microdissected images should be labeled, as they are hard to see underneath of the blown-up images and the identity of individual image planes wasn't immediately obvious.

      • Line 298. By "misaligned," do the authors mean the pointed ends are not uniformly anchored in the z-disc, leading to the wide z-disc measurements? At this early stage, I'm not sure "misaligned" is the right word - perhaps "were not yet aligned in register at the z-disc" or something similar.

      • Figure S6 - spelling mistake in label of panel A, "sarcomer" should be "sarcomere"

      • Line 487. Spelling "Zaps52" should be "Zasp52"

      • Line 887. Spelling "Myofilement" should be "Myofilament"

      • Line 946-947. In the legend for Supp. Fig. 3., the authors should specify which published datasets on sarcomere length are shown in the figure by including the references in the legend. Presumably the "isolated individual myofibrils" are the blue "this study" lines, leaving the "microdissected muscles" as the magenta "previous reports" on the figure. Without the reference, it is not clear if these are microdissected, isolated myofibrils, hemi-thorax sections, cryosections, or another preparation method for the "previous reports" data.

      Referee Cross-commenting

      I agree with the comments from the other reviewers. Many of the major themes are consistent across the reviews, including regarding the model, preparation methods, and the software tool.

      Significance

      Strengths: This manuscript is an important contribution to the field of sarcomere development. The authors use modern technologies to revisit variation in morphometric measurements in the literature, and they identify parameters that influence this variation. Notably, sex-specific differences, DLM versus DVM measurements, and mouting media are potential contributors to the variability. Combining TEM and STORM with a confocal timecourse of isolated myofibrils, they refine previously published values of sarcomere length and width, and add more comprehensive data for filament length, number and spacing. This highly accurate timecourse demonstrates continual growth of sarcomeres after 48 h APF, and correct some inconsistencies from previous large-scale timecourse datasets. These data are very valuable to the field, especially Drosophila muscle biologists, and will serve as a comparative resource for future studies.

      Weaknesses: At early timepoints, loss of sarcomere components through mechanical or detergent-mediated artifacts may influence the authors' measurements. In addition, isolating myofibrils is not always the most ideal approach, as it loses information on myofiber structure as well as organization and structure of the myofibrils in vivo. The point regarding liquid versus hardening mounting media is valuable, but remains to be tested and validated with the diverse liquid and hardening media used by other labs. The IMA software seems to be designed specifically for analysis of isolated myofibrils, and it is unclear if it would work for other types of IFM preparations. A last point is that TEM and STORM may not be available on a regular basis to many labs, hindering wide implementation of the approach used in this manuscript to generate very accurate and detailed measurements of sarcomere morphometrics.

      Audience: Scientists who study sarcomerogenesis or Drosophila muscle biology.

      My expertise: I study muscle development in the Drosophila model.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this work, the authors present a careful study of the lattice of the indirect flight muscle (IFM) in Drosophila using data from a morphometric analysis. To this end, an automated tool is developed for precise, high-throughput measurements of sarcomere length and myofibril width, and various microscopy techniques are used to assess sub-sarcomeric structures. These methods are applied to analyze sarcomere structure at multiple stages in the process of myofibrillogenesis. In addition, the authors present various factors and experimental methods that may affect the accurate measurement of IFM structures. Although the comprehensive structural study is appreciated, there are major issues with the presentation/scope of the work that need to be addressed:

      Major Comments

      1. The main weakness of the paper is in its claim of presenting a model of the sarcomere. Indeed, the paper reports a structural study that is drawn onto a 3D schematic. There is no myofibrillogenesis model that would provide insights into mechanisms. Therefore, the use of the word model is grossly overstated.

      2. In general, the major focus and contribution of the work is unclear. How does the comprehensive nature of the measurements contribute to existing literature?

      3. Figure labels are often rather confusing - for example it is unclear why there is a B, B', B' etc instead of B,C,D, etc.

      4. Some comments in the text are not clearly tied to the figures. For example, in lines 108-109, are the authors referring to the shadow along the edges of the myofibril when saying they are not clearly defined (Figure 1C)?

      5. In line 116, it is unclear what "surrounding structures" the authors are referring to if the myofibrils are isolated.

      6. In lines 141-142, there is no reference of data to back up the claim of validation.

      7. In line 170, the authors mention the mef2-Gal4/+ strain as a Gal4 driver line but do not clearly state how this strain is different from the wildtypes or how this impacts their results.

      8. In lines 182-185, the authors discuss the effects of tissue embedding on morphometrics. Were factors such as animal sex, age, fiber type, etc. conserved in these experiments? If not, any differences in results may be confounding.

      9. In lines 199-201, the authors discuss results of myofibril diameter using different preparation methods, yet no data is cited to support the claims. In line 220, the phrase "6 independent experiments" is unclear. Is each independent experiment performed using a different animal? Furthermore, are 6 experiments performed for each time point?

      10. In line 254, the authors refer to "number of sarcomeres". It must be clearly stated if this refers to sarcomeres per myofibril, image area, etc.

      11. In line 274, the authors refer to "myofilament number". It must be clearly stated if this refers to myofilaments per myofibril, image area, etc.

      12. In line 299, the authors mention that thin filaments measured less than 560 nm in length, yet no data is cited to support this.

      13. In the "Quantifying sarcomere growth dynamics" section of the summary (starting from line 402) the authors introduce data that would be more naturally placed in the results and discussion section.

      14. In lines 422-423, it is not mentioned what the controls are for.

      15. In the caption of Figure 1C, it is not mentioned what the red dashed lines in the microscope images represent.

      16. In the caption of Figure 1D, the difference between the lighter and darker grey points is not mentioned.

      17. In line 849, the stated p-value (0.003) does not match that mentioned in the figure (0.0003).

      18. In line 874, it is not clear what an "independent experiment" refers to (different animal, etc.?).

      19. Figure 2A is hard to read. Using different colored dots for different time points might help.

      20. The significant figures presented in Figure 4 give a completely inaccurate representation of the variability of the measurements achieved with these techniques.

      21. In line 877, it should be mentioned that the number of filaments is counted per myofibril. The y-axes in the figure should also be adjusted to clarify this.

      22. In line 883, it is not clear what an "independent experiment" refers to (different animal, etc.?).

      23. The statement of sample sizes in all figures is a little confusing.

      24. In lines 1007-1008, the authors imply that the lattice model is needed for calculation of myofilament length. However, from the equations and previous data, it seems that this can be estimated using the confocal and dSTORM images.

      25. A more specific discussion of future directions is needed to put this paper in context. For example: Can anything from the overall process be used to better understand sarcomere dynamics in larger animals/humans? Can this be applied to disease modelling?

      26. One of the major claims of the paper is that there is a measurable variability with sex and other parameters. However, this data is never clearly summarized, presented (except for supplement), or discussed for its implications.

      Minor Comments

      1. Lines 60-65 seem to break the flow of the introduction. As the authors discuss existing methods in literature for IFM analysis in the previous couple sentences, the following sentences should clearly state the limitations of existing methods/current gap in literature and a general idea of what the current work is contributing.

      2. In line 104, the acronym for ZASPs is not spelled out.

      Referee Cross-commenting

      I agree as well.

      Significance

      In summary, this paper provides a multi-scale characterization of Drosophila flight muscle sarcomere structure under a variety of conditions, which is potentially a significant contribution for the field. However, the paper scope is overstated in that it does not provide an actual sarcomere model. Further, there are multiple issues with data presentation that impact the readability of the manuscript.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02953

      Corresponding author(s): Andreas, Villunger

      1. General Statements [optional]

      *We would like to thank the reviewers for their constructive input and overall support. We appreciate to provide a provisional revision plan, as outlined here, and are happy to engage in additional communication with journal editors via video call, in case further clarifications are needed. *

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1

      __Evidence, reproducibility and clarity __

      Summary: This manuscript by Leone et al describes the role of the PIDDosome in cardiomyocytes. Using a series of whole body and cardiomyocyte specific knockouts, the authors show that the PIDDosome maintains correct ploidy in these cells. It achieves this through inducing cell cycle arrest in cardiomyocytes in a p53 dependent manner. Despite this effect on ploidy, PIDDosome-deficient hearts show no structural or functional defects. Statistics and rigor appear to be adequate.

      We thank this referee for taking the time to evaluate our work and their valuable comments. We assume that this reviewer by mistake indicates that the phenomenon we describe, depends on p53. As outlined in the abstract and throughout the manuscript, the effect is independent of p53, but may additionally still involve p21, acting along or parallel to the PIDDosome.

      Major comments: 1. Figure 1 uses fluorescent intensity of a nuclear stain to determine ploidy per nucleus and they further separate the results into mononucleated, binucleated or multinucleated cells. It is hard to know how to interpret these results without further information or controls. Is there a good positive control that can be used to help to show whether this assay is quantitative? The differences are larger with the Raidd and caspase-2 knockouts than with the Pidd knockouts but this is not addressed.

      *We appreciate this concern. Regarding a "good positive control" we can say that we follow state-of the art in the cardiomyocyte field and studies by the Evans (PMID: 36622904), Kuhn (PMID: 32109383), Bergmann (PMID: 26544945) and Patterson labs (PMID: 28783163, 36912240) all use the identical approach to discriminate 2n from 4n nuclei in microscopy images at the cellular level. The fact that the majority of rodent CM nuclei is indeed diploid (PMID: 31175264, 31585517 and 32078450) and a large number of nuclei has been evaluated to assess their mean fluorescence intensity (MFI) reduces the risk of a systematic bias in our analysis. Moreover, we have used an orthogonal approach that is indeed quantitative to define DNA content, i.e,. flow-cytometry based evaluation of DNA content in isolated CM nuclei (Fig. 1C). We hence are confident our assays are quantitative. *

      Regarding the fact that loss of Pidd1 causes a more saddle phenotype, we can offer to discuss this in light of the fact that Pidd1 has additional functions, outside the PIDDosome (PMID: 35343572), and that we made similar observations when analyzing ploidy in hepatocytes (PMID: *31983631). Given the fact that all components of the PIDDosome show a similar phenotype, and that this phenotype is mimicked by loss of the protein that connects PIDD1 and centrosomes, ANKRD26 (Fig. 4a), we are confident that this biological variation in our analysis is not affecting our conclusions. *

      On line 459 the authors state that the increase in polyploidy in PIDDosome knockouts occurs in adult hood but this is not directly tested. In fact, in the next section the polyploidy is assessed in early postnatal development. This statement should be explained or removed.

      We see that we have made an unclear statement here. In fact, we first noted increases in ploidy in adult heart and then define the time window in development when this happens. This sentence will be rephrased.

      In Figure 4. The authors obtained RNAseq data for P1, P7 and P14 but only show the differences with and without caspase-2 at P7. Given that the differences in ploidy are more significant at P14 (Fig 3D), all the comparisons should be shown along with analysis of whether the same genes/gene families are altered in the absence of caspase-2.

      The reason why we focus on postnatal day 7 (P7) is that data from Alkass et al (PMID: 26544945) and other labs (PMID: 31175264 ) document that on this day the initial wave of binucleation peaks. Hence, we hypothesized that the PIDDosome must be active in most CM, which aligns well with the increased mRNA levels of all of its components (Figure 3). Interestingly, it seems that its action is tightly regulated, as mRNA of PIDDosome components drop on P10, suggesting PIDDosome shut-down or downregulation. Similar findings have been noted in the liver (PMID: *31983631). Alkass and colleagues also show that very few CMs enter another round of DNA synthesis between P7 and P14, and hence possible transcriptome changes in the absence of the PIDDosome will be strongly diluted. *

      Please note that on P1, there is no difference between genotypes to be expected as all CM are mononucleated diploids and cytokinesis competent, as previously demonstrated (PMID: *26544945). Moreover, PIDDosome expression levels are extremely low (Fig. 3A). As such, no difference between genotypes are expected on P1. In addition, on P14 the ploidy phenotype observed in PIDDosome knockout mice reaches the maximum and ploidy increases are comparable to adult tissue. Thus, at this time the trigger for PIDDosome activation (cytokinesis failure) is no longer observed as the majority of CMs are post-mitotic, (PMID: 26247711). As such the impact of PIDDosome activation on the P14 transcriptome is most likely negligible. However, if desired, we can expand our bioinformatics analysis summarizing findings made related to DEGs over time in wt animals by comparing genotypes also on day 1 and day 14. In light of the above, analysis between genotypes on P7 holds still appears as the one most meaningful. *

      Some validation of the RNAseq and/or proteomics results would be an important addition to this study

      We agree with this notion and propose to validate key candidates related to cardiomyocyte proliferation and polyploidization, some of which we found to be differentially expressed at the mRNA level on day 7in the RNAseq data (e.g., p21, Foxm1, Kif18a, Lin37 and others)

      Regarding the proteomics results, we face the challenge that we can only try to confirm if candidate proteins are likely caspase substrates in silico using DeepCleave*, and potentially pick one or two candidates linked to CM differentiation for further analysis in vitro and in heterologous cell based assays (e.g. 293T cells), as no bona-fide ventricular cardiomyocyte cell lines exist. Primary postnatal CMs are extremely difficult to transfect, nor they proliferate without drug-treatment, or fail cytokinesis ex vivo. *

      Figure 4D: the authors make the conclusion that p21 is downstream of PIDD (et p53 independent). However, this is not supported by the data because the increase in 4N cells/decrease in 2N cells, although statistically significant, is nowhere near that of caspase-2 KO and caspase-2/p21 KO. Statistics should also compare p32KO with c2KO. In the absence of any other data, the more likely conclusion is that p21 is not involved.

      *We agree that the findings related to the impact seen upon loss of p21 suggest that it is not the only effector involved in ploidy control and it may not even be an effector engaged by caspase-2, as C2/p21 DKO mice have an even higher ploidy increase, albeit not statistically significant. However, it is important to highlight that p21 (Cdkn1a) was found to be downregulated in our transcriptomic analysis suggesting an involvement in the caspace2-cascade. We are happy to highlight this when presenting the results and in the discussion. *

      *We assume that this referee refers to p73 KO data that should be compared to Casp2 KO data (could be read as p73 or p53, but the latter we compare side by side with Casp2 in Fig. 4 already). As p73 KO mice were not found to be viable beyond day 7 (our attempt to find animals on day 10 failed, in line with published literature (PMID: 24500610, 10716451)), we can only offer to compare this data set to the data presented in Figure 3C, where we have analyzed ploidy increases on day 7 from wt and PIDDosome mutant mice. This re-analysis will show that only Caspase-2 mutant mice display a significant ploidy increase on P7, when compared to wt or p73 mutant animals, while no difference are noted between wt and p73 mutant mice (to be included in new Suppl. Fig. 3C) *

      Minor comments: Suggest moving Figure 4A to Figure 3 as it seems to fit better there based on the citation of this figure in the text

      *We can see some benefit in this recommendation and included panel 4A now in an updated version of Figure 3. *

      Recommend enhancing the brightness of microscopy images in Figure 1E and 2D

      We will try to improve image quality, may have been due to PDF conversion

      Significance

      This study provides interesting information for the role of the PIDDosome in protecting from polyploidy and adds to the body of work by this same group studying this pathway in the liver.

      The main weakness in terms of significance is the lack of a phenotype in the hearts of these animals. Therefore, it is clear that ploidy (or at least PIDDosome dependent ploidy) has minimal impact on cardiac development.

      We respectfully disagree with the comment that the lack of impact on cardiac function constitutes a weakness of our findings. Several studies on ploidy control in the liver (PMID 34228992) but importantly also heart (PMID: 36622904) have failed to document a clear impact of increased ploidy on organ function. This does not infer insignificance, but maybe rather that the context where this becomes relevant has not been identified. We are happy expand on this in our discussion

      The authors mention that they have not tried giving these mice an myocardial infarct (MI) or inducing any other type of cardiac damage. Although it is understood that these experiments are likely outside of the scope of the present study, without this information the impact of this study is moderate. I recommend expanding the discussion to provide a more in-depth possible rationale as to why ploidy perturbations do not lead to structural changes like in the liver.

      Despite this, the insights to the pathway itself are interesting to investigators in the caspase-2 field if a little underdeveloped, especially concerning the role of p21.

      My expertise is in cell death and caspase biology (especially caspase-2). I have sufficient expertise to evaluate all parts of this paper.

      *As mentioned above, we will amend our conclusions on p21, in light of potential findings made when validating DEG candidates, as stated above. *

      *We hope that the changes and amendments proposed here will be satisfactory to this referee to recommend publication of a revised manuscript. *

      Reviewer #2

      __Evidence, reproducibility and clarity: __

      __Summary: __

      In this study, the authors investigated the role of the PIDDosome during cardiomyocyte polyploidization. PIDDosome is a multi-protein complex activating the endopeptidase Caspase-2, and shown to be involved in eliminating cells with extra centrosomes or in response to genotoxic stress (Burigotto & Fava, 2021, Sladky and Villunger, 2020). In both cases, the PIDDosome is recruited in a ANKRD26-dependent manner at the centrosomes leading to p53 stabilization and cell death (Burigotto & Fava, 2021; Evans et al., 2020; Burigotto et al., 2021).

      Here, by studying mouse cardiomyocyte differentiation, the authors showed that PIDDosome is imposing ploidy restriction during cardiomyocyte differentiation. Importantly, in contrast to a previous report in the liver (Sladky et al., 2020), they showed that PIDDosome acts in a p53-independent manner in cardiomyocytes. Indeed, they suggested that PIDDosome controls ploidy in cardiomyocytes through p21 activation.

      We want to thank this reviewer for the time taken to evaluate our work and provide critical feedback that will help to improve our revised manuscript.

      __Major comments: __

      In general the conclusions of the authors are well supported by the experiments. However, I would suggest the following experiments/analysis to strengthen the paper:

      The authors should improve the Figure 1 to help the readers who are not familiar with cardiomyocyte polyploidization. For instance, I would suggest to add a scheme to summarize cardiomyocyte polyploidization (in terms of nuclear size, mono vs multi and so on).

      We agree that a visual summary of the postnatal timing of CM polyploidization will be helpful for the generalist not familiar with the topic and have added a scheme, adapted from a study by Alkass et al. (PMID: *26544945), who elegantly defined the timing of this process during postnatal mice life (now Fig. 1A). *

      Based on the images they presented in 1B, the authors should also measure the nuclear area or volume in the different conditions in which components of the PIDDosome were depleted. Indeed, these two parameters should be easier to conceptualize for the readers (instead of the fluorescence nuclear intensity). This could help to understand if the nuclear size is maintained between the different conditions and if this is comparable between mono, bi or multinucleated cardiomyocytes.

      We have acquired this data and it can be used to provide additional information on nuclear area and/or volume. We propose to focus on re-analyzing data from wt, Casp2 and XMLC2CRE/Casp2f/f mice. The additional information can be included in Figures 1 & 2, respectively.

      • In Figure 2A, the authors presented cross section of heart from animals showing that PIDDosome depletion has no effect on heart size. This is a surprising result since cardiomyocytes have higher ploidy levels and this could have an effect on their function. Since the importance of this observation, the authors should present a quantification of the heart size in the different conditions shown in Figure 2A.

      We agree with this comment. We can measure the heart vs. body weight ratio or tibia length in adult Casp2-/- vs. WT (3 month old) in order to indirectly evaluate possible increases in CM size linked to increased ploidy.

      Also, the percentage of cardiomyocytes presenting higher levels of ploidy seems quite low. The authors should discuss this point. In particular because this could explain the absence of consequences on heart size and function at steady state.

      We agree with this conclusion and will expand on this in our discussion. It is important to note that as opposed to findings made in liver (PMID: *31983631), genetic manipulation of ploidy regulators such as E2f7/8 (PMID: 36622904), only led to modest changes in CM ploidy, suggesting that either a small band-width compatible with normal heart function exists, or that additional mechanisms exist that take control when these thresholds set by the PIDDosome or E2f7/8 are exceeded. These mechanisms could involve Cyclin G (PMID: 20360255), or TNNI3K (PMID: 31589606). Importantly, a recent publication has shown that overexpression of Plk1(T210D) and Ect2 from birth causes increased heart weight coupled with a minor decrease in CM size. These mice undergo to premature death (PMID: 39912233) suggesting that CM polyploidization is a tight regulated process regulated by several independent mechanisms during heart development. *

      In Figure 2D, the authors measured the cardiomyocyte cross-sectional area and concluded that removing PIDDosome components have no effect on cardiomyocyte cell size. Since it has been shown that ploidy increase is normally associated with an increase in cell area, the authors should measure cell area of cardiomyocytes analyzed in Figure 1B. It could be then interesting to establish a correlation with nuclear area and the mono, bi or multinucleated status. This will strengthen the results showing that ploidy increases without affecting cell area.

      Indeed, studies in PIDDosome deficient livers suggest that tissue is containing fewer but bigger cells (PMID: *31983631). As opposed to the liver the percentage of cardiomyocytes presenting higher levels of ploidy is relatively low. Thus, a possible increase in CM size in PIDDosome deficient mice may be masked in heart cross-sections. In order to better correlate the ploidy with cell size, we propose to reanalyze our microscopy images used to extract the data displayed in Fig. 1D. We may run into the problem though that the number of cells acquired may become limiting to achieve sufficient statistical power. In this case we could pool data from different PIDDosome mutant CM to increase statistical power. Again, we propose to initially prioritize wt vs. Casp2 vs. XMLC2/Casp2f/f mice. In addition, we can offer to quantify heart to body weight ratio or tibia length as an additional read-out (see answer to a previous reviewer comment). *

      The authors should discuss the fact that PIDDosome depletion lead only to a mild increase in ploidy levels (4N) in a small percentage of cardiomyocyte. If the PIDDosome is controlling ploidy, one could expect that removing it should lead to a drastic increase in the ploidy levels. Is PIDDosome depletion leading to cell death in some cardiomyocyte? The authors should discuss this point in the discussion or if relevant show a staining with an apoptosis marker. Is another mechanism compensating to prevent higher ploidy levels in cardiomyocytes?

      These are valid thoughts, some of which we contemplated before. In part, we have addressed them in our response to Reviewer#1, above, discussing similar findings made in E2f7/8 deficient hearts (PMID: 36622904), or Cyclin G overexpressing hearts (PMID: 20360255), where also only modest changes in ploidy were achieved. Together these observations are suggesting alternative control mechanism able to act, or limited tolerance towards larger shifts in ploidy, incompatible with proper cell function and survival. Towards this end, we can offer to test if we find increased signs of cell death in PIDDosome mutant hearts by TUNEL staining of histological sections. Of note, we did not find evidence for such a phenomenon in the liver (PMID: 31983631).

      Even if the authors presented RNAseq data suggesting that the PIDDosome is activated during cardiomyocyte differentiation, they should clearly demonstrate this point to strengthen the message of the paper. Indeed, the conclusions are based on the absence of PIDDosome components triggering higher ploidy in cardiomyocytes. However, we don't know whether (and when) the PIDDosome is activated during cardiomyocyte differentiation to control their ploidy levels. I would suggest to analyze PIDDosome activation markers by immunofluorescence in *cardiomyocytes at different developmental stages. *

      *We agree with this referee that direct proof of PIDDosome activation would be helpful and that we only infer back from loss of function phenotypes when and where the PIDDosome becomes activated. However, several technical issues prevent us from collecting more direct evidence of PIDDosome activation in the developing heart. 1) Polyploidization in heart CM appears to happen gradually in CM from day 3 on with a peak at day 7 (PMID: 26544945). Hence, this is not a synchronous process, where we could pinpoint simultaneous activation of the PIDDosome in all cells at the same time, which would facilitate biochemical analysis, e.g., by western blotting for signs of Caspase-2 activation (i.e. the loss of its pro-form, PMID: 28130345). 2) Our most reliable readout, MDM2 cleavage by caspase-2 giving rise to specific fragments detectable in western, is not applicable to mouse tissue, as the antibody we use only detects human MDM2 (PMID: 28130345) and no other MDM2 Ab we tested gave satisfactory results. Independent of that, 3) we do not see involvement of p53 in CM ploidy control (arguing against a role of MDM2). *

      *As such, we can only offer to look at extra centrosome clustering in postnatal binucleated CM (as also suggested further below), as a putative trigger for PIDDosome activation. However, this has been published by the first author of this study before (PMID 31301302). Given that we have made the significant effort to time resolve the increase in ploidy in postnatal mice (please note that several hearts needed to be pooled for each time point, analyzed in multiple biological replicates), we think that our conclusions are well-justified based on the genetic data provided. *

      Concerning the methods, the authors must add the references for each product they used and not only the origin. When relevant, the RRID should be indicated. Without this information the method and the data cannot be reproduced.

      We will update this information where relevant to reproduce our results

      Minor comments:

      In general, the text and the figures are clear. Nevertheless, I would suggest the following changes:

      • Figures 1B, 2B and 2C: the y-axis must start at 0.

      We will adopt axes accordingly

      Figure 4A: The authors should stain centrosomes in cardiomyocytes. This should strengthen the conclusion taken by the authors based on the results obtained in mice depleted for ANKRD26. Indeed, for the moment they are insufficient to conclude about the role of the centrosomes. The authors should show that centrosomes cluster in cardiomyocytes (a condition necessary for PIDDosome activation in polyploid cells) and if possible that component of the PIDDosome are recruited here.

      *This point is well taken and addressed in part above. Clustering of extra centrosomes has been documented and published by the first author of this study in rat polyploid cardiomyocytes (PIMID; cited). We can offer to show clustering of centrosomes in mouse CM isolated from day 7 hearts, but while PIDD1 can be detected well in MEF, we repeatedly failed to stain fro PIDD1 in primary CMs. *

      Figure 4F: I would suggest to modify the working model to emphasize more the differences between WT and PIDDosome KO.

      We will aim to improve this cartoon/graphical abstract

      The prior studies are referenced appropriately.

      Reviewer #2 (Significance (Required)):

      How polyploid cells control their ploidy levels during differentiation remains poorly understood. The data presented here represent thus an advance concerning this question. The actual model concerning PIDDosome activation relies on the presence of extra centrosomes that drives the ANKDR26-dependent recruitment of the PIDDosome. Then, Caspase 2 is activated leading to a p53-p21 dependent cell cycle arrest (Burigotto & Fava, 2021, Sladky and Villunger, 2020; Janssens & Tinel, 2012; Evans et al., 2020; Burigotto et al., 2021). In this study, the authors showed that similar pathway takes place during cardiomyocyte differentiation to control ploidy levels. These data are reminiscent of previous work showing PIDDosome involvement during hepatocyte polyploidization (Sladky et al. 2020). Together, these data highlight the prominent role of the PIDDosome complex in controlling ploidy levels in physiological context. Importantly, this study identified that the classical p53-dependent cell cycle arrest described after PIDDosome activation is not involved here. Instead, the data established that independently of p53, p21 contribute to control cardiomyocyte ploidy. In consequence, this study extends the initial pathway associated with PIDDosome activation and suggest that other mechanisms could take place to restrain cell proliferation upon PIDDosome activation. Overall, this makes this paper significant and of interest for the following fields: polyploidy, heart/cardiomyocyte development and PIDDosome.

      My field of expertise includes polyploidy, cell cycle and genetic instability.

      We thank this reviewer for the time taken and the positive feedback provided.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      N/A

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *As outlined above, limited tools are available to validate putative caspase-2 substrates, identified in proteomics analysis, in an impactful manner. *
      • *Also, as discussed above, we deem myocardial infarction experiments in mice as unsuitable to improve our work, as with all likely-hood, they will yield negative results. *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors investigated the role of the PIDDosome during cardiomyocyte polyploidization. PIDDosome is a multi-protein complex activating the endopeptidase Caspase-2, and shown to be involved in eliminating cells with extra centrosomes or in response to genotoxic stress (Burigotto & Fava, 2021, Sladky and Villunger, 2020). In both cases, the PIDDosome is recruited in a ANKRD26-dependent manner at the centrosomes leading to p53 stabilization and cell death (Burigotto & Fava, 2021; Evans et al., 2020; Burigotto et al., 2021).

      Here, by studying mouse cardiomyocyte differentiation, the authors showed that PIDDosome is imposing ploidy restriction during cardiomyocyte differentiation. Importantly, in contrast to a previous report in the liver (Sladky et al., 2020), they showed that PIDDosome acts in a p53-independent manner in cardiomyocytes. Indeed, they suggested that PIDDosome controls ploidy in cardiomyocytes through p21 activation.

      Major comments:

      In general the conclusions of the authors are well supported by the experiments. However, I would suggest the following experiments/analysis to strengthen the paper:

      • The authors should improve the Figure 1 to help the readers who are not familiar with cardiomyocyte polyploidization. For instance, I would suggest to add a scheme to summarize cardiomyocyte polyploidization (in terms of nuclear size, mono vs multi and so on). Based on the images they presented in 1B, the authors should also measure the nuclear area or volume in the different conditions in which components of the PIDDosome were depleted. Indeed, these two parameters should be easier to conceptualize for the readers (instead of the fluorescence nuclear intensity). This could help to understand if the nuclear size is maintained between the different conditions and if this is comparable between mono, bi or multinucleated cardiomyocytes.
      • In Figure 2A, the authors presented cross section of heart from animals showing that PIDDosome depletion has no effect on heart size. This is a surprising results since cardiomyocytes have higher ploidy levels and this could have an effect on their function. Since the importance of this observation, the authors should present a quantification of the heart size in the different conditions shown in Figure 2A. Also, the percentage of cardiomyocytes presenting higher levels of ploidy seems quiet low. The authors should discuss this point. In particular because this could explain the absence of consequences on heart size and function at steady state.
      • In Figure 2D, the authors measured the cardiomyocyte cross-sectional area and concluded that removing PIDDosome components have no effect on cardiomyocyte cell size. Since it has been shown that ploidy increase is normally associated with an increase in cell area, the authors should measure cell area of cardiomyocytes analyzed in Figure 1B. It could be then interesting to establish a correlation with nuclear area and the mono, bi or multinucleated status. This will strengthen the results showing that ploidy increases without affecting cell area.
      • The authors should discuss the fact that PIDDosome depletion lead only to a mild increase in ploidy levels (4N) in a small percentage of cardiomyocyte. If the PIDDosome is controlling ploidy, one could expect that removing it should lead to a drastic increase in the ploidy levels. Is PIDDosome depletion leading to cell death in some cardiomyocyte? The authors should discuss this point in the discussion or if relevant show a staining with an apoptosis marker. Is another mechanism compensating to prevent higher ploidy levels in cardiomyocytes?
      • Even if the authors presented RNAseq data suggesting that the PIDDosome is activated during cardiomyocyte differentiation, they should clearly demonstrate this point to strengthen the message of the paper. Indeed, the conclusions are based on the absence of PIDDosome components triggering higher ploidy in cardiomyocytes. However, we don't know whether (and when) the PIDDosome is activated during cardiomyocyte differentiation to control their ploidy levels. I would suggest to analyze PIDDosome activation markers by immunofluorescence in cardiomyocytes at different developmental stages.

      Concerning the methods, the authors must add the references for each product they used and not only the origin. When relevant, the RRID should be indicated. Without this information the method and the data cannot be reproduced.

      The statistics are well indicated in the figures and in the figure legends.

      Minor comments:

      In general, the text and the figures are clear. Nevertheless, I would suggest the following changes:

      • Figures 1B, 2B and 2C: the y-axis must start at 0.
      • Figure 4A: The authors should stain centrosomes in cardiomyocytes. This should strengthen the conclusion taken by the authors based on the results obtained in mice depleted for ANKRD26. Indeed, for the moment they are insufficient to conclude about the role of the centrosomes. The authors should show that centrosomes cluster in cardiomyocytes (a condition necessary for PIDDosome activation in polyploid cells) and if possible that component of the PIDDosome are recruited here.
      • Figure 4F: I would suggest to modify the working model to emphasize more the differences between WT and PIDDosome KO.

      The prior studies are referenced appropriately.

      Significance

      How polyploid cells control their ploidy levels during differentiation remains poorly understood. The data presented here represent thus an advance concerning this question.

      The actual model concerning PIDDosome activation relies on the presence of extra centrosomes that drives the ANKDR26-dependent recruitment of the PIDDosome. Then, Caspase 2 is activated leading to a p53-p21 dependent cell cycle arrest (Burigotto & Fava, 2021, Sladky and Villunger, 2020; Janssens & Tinel, 2012; Evans et al., 2020; Burigotto et al., 2021). In this study, the authors showed that similar pathway takes place during cardiomyocyte differentiation to control ploidy levels. These data are reminiscent of previous work showing PIDDosome involvement during hepatocyte polyploidization (Sladky et al. 2020). Together, these data highlight the prominent role of the PIDDosome complex in controlling ploidy levels in physiological context.

      Importantly, this study identified that the classical p53-dependent cell cycle arrest described after PIDDosome activation is not involved here. Instead, the data established that independently of p53, p21 contribute to control cardiomyocyte ploidy. In consequence, this study extend the initial pathway associated with PIDDosome activation and suggest that other mechanisms could take place to restrain cell proliferation upon PIDDosome activation.

      Overall, this makes this paper significant and of interest for the following fields: polyploidy, heart/cardiomyocyte development and PIDDosome.

      My field of expertise includes polyploidy, cell cycle and genetic instability.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Leone et al describes the role of the PIDDosome in cardiomyocytes. Using a series of whole body and cardiomyocyte specific knockouts, the authors show that the PIDDosome maintains correct ploidy in these cells. It achieves this through inducing cell cycle arrest in cardiomyocytes in a p53 dependent manner. Despite this effect on ploidy, PIDDosome-deficient hearts show no structural or functional defects. Statistics and rigor appears to be adequate.

      Major comments:

      1. Figure 1 uses fluorescent intensity of a nuclear stain to determine ploidy per nucleus and they further separate the results into mononucleated, binucleated or multinucleated cells. It is hard to know how to interpret these results without further information or controls. Is there a good positive control that can be used to help to show whether this assay is quantitative. The differences are larger with the Raidd and caspase-2 knockouts than with the Pidd knockouts but this is not addressed.
      2. On line 459 the authors state that the increase in polyploidy in PIDDosome knockouts occurs in adult hood but this is not directly tested. In fact in the next section the polyploidy is assessed in early postnatal development. This statement should be explained or removed
      3. In Figure 4. The authors obtained RNAseq data for P1, P7 and P14 but only show the differences with and without caspase-2 at P7. Given that the differences in ploidy are more significant at P14 (Fig 3D), all the comparisons should be shown along with analysis of whether the same genes/gene families are altered in the absence of caspase-2
      4. Some validation of the RNAseq and/or proteomics results would be an important addition to this study
      5. Figure 4D: the authors make the conclusion that p21 is downstream of PIDD (et p53 independent). However, this is not supported by the data because the increase in 4N cells/decrease in 2N cells, although statistically significant, is nowhere near that of caspase-2 KO and caspase-2/p21 KO. Statistics should also compare p32KO with c2KO. In the absence of any other data, the more likely conclusion is that p21 is not involved.

      Minor comments:

      Suggest moving Figure 4A to Figure 3 as it seems to fit better there based on the citation of this figure in the text Recommend enhancing the brightness of microscopy images in Figure 1E and 2D

      Significance

      This study provides interesting information for the role of the PIDDosome in protecting from polyploidy and adds to the body of work by this same group studying this pathway in the liver. The main weakness in terms of significance is the lack of a phenotype in the hearts of these animals. Therefore, it is clear that ploidy (or at least PIDDosome dependent ploidy) has minimal impact on cardiac development. The authors mention that they have not tried giving these mice an MI or inducing any other type of cardiac damage. Although it is understood that these experiments are likely outside of the scope of the present study, without this information the impact of this study is moderate. I recommend expanding the discussion to provide a more indepth possible rationale as to why ploidy perturbations do not lead to structural changes like in the liver. Despite this, the insights to the pathway itself are interesting to investigators in the caspase-2 field if a little underdeveloped, especially concerning the role of p21.

      My expertise is in cell death and caspase biology (especially caspase-2). I havesufficient expertise to evaluate all parts of this paper.

    4. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02953

      Corresponding author(s): Andreas, Villunger

      [The “revision plan” should delineate the revisions that authors intend to carry out in response to the points raised by the referees. It also provides the authors with the opportunity to explain their view of the paper and of the referee reports.

      • *

      The document is important for the editors of affiliate journals when they make a first decision on the transferred manuscript. It will also be useful to readers of the reprint and help them to obtain a balanced view of the paper.

      • *

      If you wish to submit a full revision, please use our "Full Revision" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      *We would like to thank the reviewers for their constructive input and overall support. We appreciate to provide a provisional revision plan, as outlined here, and are happy to engage in additional communication with journal editors via video call, in case further clarifications are needed. *

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      • *

      Reviewer #1

      __Evidence, reproducibility and clarity __

      Summary: This manuscript by Leone et al describes the role of the PIDDosome in cardiomyocytes. Using a series of whole body and cardiomyocyte specific knockouts, the authors show that the PIDDosome maintains correct ploidy in these cells. It achieves this through inducing cell cycle arrest in cardiomyocytes in a p53 dependent manner. Despite this effect on ploidy, PIDDosome-deficient hearts show no structural or functional defects. Statistics and rigor appear to be adequate.

      We thank this referee for taking the time to evaluate our work and their valuable comments. We assume that this reviewer by mistake indicates that the phenomenon we describe, depends on p53. As outlined in the abstract and throughout the manuscript, the effect is independent of p53, but may additionally still involve p21, acting along or parallel to the PIDDosome.

      Major comments: 1. Figure 1 uses fluorescent intensity of a nuclear stain to determine ploidy per nucleus and they further separate the results into mononucleated, binucleated or multinucleated cells. It is hard to know how to interpret these results without further information or controls. Is there a good positive control that can be used to help to show whether this assay is quantitative? The differences are larger with the Raidd and caspase-2 knockouts than with the Pidd knockouts but this is not addressed.

      *We appreciate this concern. Regarding a “good positive control” we can say that we follow state-of the art in the cardiomyocyte field and studies by the Evans (PMID: 36622904), Kuhn (PMID: 32109383), Bergmann (PMID: 26544945) and Patterson labs (PMID: 28783163, 36912240) all use the identical approach to discriminate 2n from 4n nuclei in microscopy images at the cellular level. The fact that the majority of rodent CM nuclei is indeed diploid (PMID: 31175264, 31585517 and 32078450) and a large number of nuclei has been evaluated to assess their mean fluorescence intensity (MFI) reduces the risk of a systematic bias in our analysis. Moreover, we have used an orthogonal approach that is indeed quantitative to define DNA content, i.e,. flow-cytometry based evaluation of DNA content in isolated CM nuclei (Fig. 1C). We hence are confident our assays are quantitative. *

      Regarding the fact that loss of Pidd1 causes a more saddle phenotype, we can offer to discuss this in light of the fact that Pidd1 has additional functions, outside the PIDDosome (PMID: 35343572), and that we made similar observations when analyzing ploidy in hepatocytes (PMID: *31983631). Given the fact that all components of the PIDDosome show a similar phenotype, and that this phenotype is mimicked by loss of the protein that connects PIDD1 and centrosomes, ANKRD26 (Fig. 4a), we are confident that this biological variation in our analysis is not affecting our conclusions. *

      On line 459 the authors state that the increase in polyploidy in PIDDosome knockouts occurs in adult hood but this is not directly tested. In fact, in the next section the polyploidy is assessed in early postnatal development. This statement should be explained or removed.

      We see that we have made an unclear statement here. In fact, we first noted increases in ploidy in adult heart and then define the time window in development when this happens. This sentence will be rephrased.

      In Figure 4. The authors obtained RNAseq data for P1, P7 and P14 but only show the differences with and without caspase-2 at P7. Given that the differences in ploidy are more significant at P14 (Fig 3D), all the comparisons should be shown along with analysis of whether the same genes/gene families are altered in the absence of caspase-2.

      The reason why we focus on postnatal day 7 (P7) is that data from Alkass et al (PMID: 26544945) and other labs (PMID: 31175264 ) document that on this day the initial wave of binucleation peaks. Hence, we hypothesized that the PIDDosome must be active in most CM, which aligns well with the increased mRNA levels of all of its components (Figure 3). Interestingly, it seems that its action is tightly regulated, as mRNA of PIDDosome components drop on P10, suggesting PIDDosome shut-down or downregulation. Similar findings have been noted in the liver (PMID: *31983631). Alkass and colleagues also show that very few CMs enter another round of DNA synthesis between P7 and P14, and hence possible transcriptome changes in the absence of the PIDDosome will be strongly diluted. *

      Please note that on P1, there is no difference between genotypes to be expected as all CM are mononucleated diploids and cytokinesis competent, as previously demonstrated (PMID: *26544945). Moreover, PIDDosome expression levels are extremely low (Fig. 3A). As such, no difference between genotypes are expected on P1. In addition, on P14 the ploidy phenotype observed in PIDDosome knockout mice reaches the maximum and ploidy increases are comparable to adult tissue. Thus, at this time the trigger for PIDDosome activation (cytokinesis failure) is no longer observed as the majority of CMs are post-mitotic, (PMID: 26247711). As such the impact of PIDDosome activation on the P14 transcriptome is most likely negligible. However, if desired, we can expand our bioinformatics analysis summarizing findings made related to DEGs over time in wt animals by comparing genotypes also on day 1 and day 14. In light of the above, analysis between genotypes on P7 holds still appears as the one most meaningful. *

      Some validation of the RNAseq and/or proteomics results would be an important addition to this study

      We agree with this notion and propose to validate key candidates related to cardiomyocyte proliferation and polyploidization, some of which we found to be differentially expressed at the mRNA level on day 7in the RNAseq data (e.g., p21, Foxm1, Kif18a, Lin37 and others)

      Regarding the proteomics results, we face the challenge that we can only try to confirm if candidate proteins are likely caspase substrates in silico using DeepCleave*, and potentially pick one or two candidates linked to CM differentiation for further analysis in vitro and in heterologous cell based assays (e.g. 293T cells), as no bona-fide ventricular cardiomyocyte cell lines exist. Primary postnatal CMs are extremely difficult to transfect, nor they proliferate without drug-treatment, or fail cytokinesis ex vivo. *

      Figure 4D: the authors make the conclusion that p21 is downstream of PIDD (et p53 independent). However, this is not supported by the data because the increase in 4N cells/decrease in 2N cells, although statistically significant, is nowhere near that of caspase-2 KO and caspase-2/p21 KO. Statistics should also compare p32KO with c2KO. In the absence of any other data, the more likely conclusion is that p21 is not involved.

      *We agree that the findings related to the impact seen upon loss of p21 suggest that it is not the only effector involved in ploidy control and it may not even be an effector engaged by caspase-2, as C2/p21 DKO mice have an even higher ploidy increase, albeit not statistically significant. However, it is important to highlight that p21 (Cdkn1a) was found to be downregulated in our transcriptomic analysis suggesting an involvement in the caspace2-cascade. We are happy to highlight this when presenting the results and in the discussion. *

      *We assume that this referee refers to p73 KO data that should be compared to Casp2 KO data (could be read as p73 or p53, but the latter we compare side by side with Casp2 in Fig. 4 already). As p73 KO mice were not found to be viable beyond day 7 (our attempt to find animals on day 10 failed, in line with published literature (PMID: 24500610, 10716451)), we can only offer to compare this data set to the data presented in Figure 3C, where we have analyzed ploidy increases on day 7 from wt and PIDDosome mutant mice. This re-analysis will show that only Caspase-2 mutant mice display a significant ploidy increase on P7, when compared to wt or p73 mutant animals, while no difference are noted between wt and p73 mutant mice (to be included in new Suppl. Fig. 3C) *

      Minor comments: Suggest moving Figure 4A to Figure 3 as it seems to fit better there based on the citation of this figure in the text

      *We can see some benefit in this recommendation and included panel 4A now in an updated version of Figure 3. *

      Recommend enhancing the brightness of microscopy images in Figure 1E and 2D

      We will try to improve image quality, may have been due to PDF conversion


      Significance

      This study provides interesting information for the role of the PIDDosome in protecting from polyploidy and adds to the body of work by this same group studying this pathway in the liver.

      The main weakness in terms of significance is the lack of a phenotype in the hearts of these animals. Therefore, it is clear that ploidy (or at least PIDDosome dependent ploidy) has minimal impact on cardiac development.

      We respectfully disagree with the comment that the lack of impact on cardiac function constitutes a weakness of our findings. Several studies on ploidy control in the liver (PMID 34228992) but importantly also heart (PMID: 36622904) have failed to document a clear impact of increased ploidy on organ function. This does not infer insignificance, but maybe rather that the context where this becomes relevant has not been identified. We are happy expand on this in our discussion

      • *

      The authors mention that they have not tried giving these mice an myocardial infarct (MI) or inducing any other type of cardiac damage. Although it is understood that these experiments are likely outside of the scope of the present study, without this information the impact of this study is moderate. I recommend expanding the discussion to provide a more in-depth possible rationale as to why ploidy perturbations do not lead to structural changes like in the liver.

      Despite this, the insights to the pathway itself are interesting to investigators in the caspase-2 field if a little underdeveloped, especially concerning the role of p21.

      My expertise is in cell death and caspase biology (especially caspase-2). I have sufficient expertise to evaluate all parts of this paper.

      *As mentioned above, we will amend our conclusions on p21, in light of potential findings made when validating DEG candidates, as stated above. *

      *We hope that the changes and amendments proposed here will be satisfactory to this referee to recommend publication of a revised manuscript. *

      • *


      Reviewer #2

      __Evidence, reproducibility and clarity: __

      __Summary: __

      In this study, the authors investigated the role of the PIDDosome during cardiomyocyte polyploidization. PIDDosome is a multi-protein complex activating the endopeptidase Caspase-2, and shown to be involved in eliminating cells with extra centrosomes or in response to genotoxic stress (Burigotto & Fava, 2021, Sladky and Villunger, 2020). In both cases, the PIDDosome is recruited in a ANKRD26-dependent manner at the centrosomes leading to p53 stabilization and cell death (Burigotto & Fava, 2021; Evans et al., 2020; Burigotto et al., 2021).

      Here, by studying mouse cardiomyocyte differentiation, the authors showed that PIDDosome is imposing ploidy restriction during cardiomyocyte differentiation. Importantly, in contrast to a previous report in the liver (Sladky et al., 2020), they showed that PIDDosome acts in a p53-independent manner in cardiomyocytes. Indeed, they suggested that PIDDosome controls ploidy in cardiomyocytes through p21 activation.

      We want to thank this reviewer for the time taken to evaluate our work and provide critical feedback that will help to improve our revised manuscript.

      __Major comments: __

      In general the conclusions of the authors are well supported by the experiments. However, I would suggest the following experiments/analysis to strengthen the paper:

      The authors should improve the Figure 1 to help the readers who are not familiar with cardiomyocyte polyploidization. For instance, I would suggest to add a scheme to summarize cardiomyocyte polyploidization (in terms of nuclear size, mono vs multi and so on).

      We agree that a visual summary of the postnatal timing of CM polyploidization will be helpful for the generalist not familiar with the topic and have added a scheme, adapted from a study by Alkass et al. (PMID: *26544945), who elegantly defined the timing of this process during postnatal mice life (now Fig. 1A). *

      Based on the images they presented in 1B, the authors should also measure the nuclear area or volume in the different conditions in which components of the PIDDosome were depleted. Indeed, these two parameters should be easier to conceptualize for the readers (instead of the fluorescence nuclear intensity). This could help to understand if the nuclear size is maintained between the different conditions and if this is comparable between mono, bi or multinucleated cardiomyocytes.

      We have acquired this data and it can be used to provide additional information on nuclear area and/or volume. We propose to focus on re-analyzing data from wt, Casp2 and XMLC2CRE/Casp2f/f mice. The additional information can be included in Figures 1 & 2, respectively.

      • In Figure 2A, the authors presented cross section of heart from animals showing that PIDDosome depletion has no effect on heart size. This is a surprising result since cardiomyocytes have higher ploidy levels and this could have an effect on their function. Since the importance of this observation, the authors should present a quantification of the heart size in the different conditions shown in Figure 2A.

      We agree with this comment. We can measure the heart vs. body weight ratio or tibia length in adult Casp2-/- vs. WT (3 month old) in order to indirectly evaluate possible increases in CM size linked to increased ploidy.

      Also, the percentage of cardiomyocytes presenting higher levels of ploidy seems quite low. The authors should discuss this point. In particular because this could explain the absence of consequences on heart size and function at steady state.

      We agree with this conclusion and will expand on this in our discussion. It is important to note that as opposed to findings made in liver (PMID: *31983631), genetic manipulation of ploidy regulators such as E2f7/8 (PMID: 36622904), only led to modest changes in CM ploidy, suggesting that either a small band-width compatible with normal heart function exists, or that additional mechanisms exist that take control when these thresholds set by the PIDDosome or E2f7/8 are exceeded. These mechanisms could involve Cyclin G (PMID: 20360255), or TNNI3K (PMID: 31589606). Importantly, a recent publication has shown that overexpression of Plk1(T210D) and Ect2 from birth causes increased heart weight coupled with a minor decrease in CM size. These mice undergo to premature death (PMID: 39912233) suggesting that CM polyploidization is a tight regulated process regulated by several independent mechanisms during heart development. *

      • *

      In Figure 2D, the authors measured the cardiomyocyte cross-sectional area and concluded that removing PIDDosome components have no effect on cardiomyocyte cell size. Since it has been shown that ploidy increase is normally associated with an increase in cell area, the authors should measure cell area of cardiomyocytes analyzed in Figure 1B. It could be then interesting to establish a correlation with nuclear area and the mono, bi or multinucleated status. This will strengthen the results showing that ploidy increases without affecting cell area.

      Indeed, studies in PIDDosome deficient livers suggest that tissue is containing fewer but bigger cells (PMID: *31983631). As opposed to the liver the percentage of cardiomyocytes presenting higher levels of ploidy is relatively low. Thus, a possible increase in CM size in PIDDosome deficient mice may be masked in heart cross-sections. In order to better correlate the ploidy with cell size, we propose to reanalyze our microscopy images used to extract the data displayed in Fig. 1D. We may run into the problem though that the number of cells acquired may become limiting to achieve sufficient statistical power. In this case we could pool data from different PIDDosome mutant CM to increase statistical power. Again, we propose to initially prioritize wt vs. Casp2 vs. XMLC2/Casp2f/f mice. In addition, we can offer to quantify heart to body weight ratio or tibia length as an additional read-out (see answer to a previous reviewer comment). *

      The authors should discuss the fact that PIDDosome depletion lead only to a mild increase in ploidy levels (4N) in a small percentage of cardiomyocyte. If the PIDDosome is controlling ploidy, one could expect that removing it should lead to a drastic increase in the ploidy levels. Is PIDDosome depletion leading to cell death in some cardiomyocyte? The authors should discuss this point in the discussion or if relevant show a staining with an apoptosis marker. Is another mechanism compensating to prevent higher ploidy levels in cardiomyocytes?

      These are valid thoughts, some of which we contemplated before. In part, we have addressed them in our response to Reviewer#1, above, discussing similar findings made in E2f7/8 deficient hearts (PMID: 36622904), or Cyclin G overexpressing hearts (PMID: 20360255), where also only modest changes in ploidy were achieved. Together these observations are suggesting alternative control mechanism able to act, or limited tolerance towards larger shifts in ploidy, incompatible with proper cell function and survival. Towards this end, we can offer to test if we find increased signs of cell death in PIDDosome mutant hearts by TUNEL staining of histological sections. Of note, we did not find evidence for such a phenomenon in the liver (PMID: 31983631).

      Even if the authors presented RNAseq data suggesting that the PIDDosome is activated during cardiomyocyte differentiation, they should clearly demonstrate this point to strengthen the message of the paper. Indeed, the conclusions are based on the absence of PIDDosome components triggering higher ploidy in cardiomyocytes. However, we don't know whether (and when) the PIDDosome is activated during cardiomyocyte differentiation to control their ploidy levels. I would suggest to analyze PIDDosome activation markers by immunofluorescence in *cardiomyocytes at different developmental stages. *

      *We agree with this referee that direct proof of PIDDosome activation would be helpful and that we only infer back from loss of function phenotypes when and where the PIDDosome becomes activated. However, several technical issues prevent us from collecting more direct evidence of PIDDosome activation in the developing heart. 1) Polyploidization in heart CM appears to happen gradually in CM from day 3 on with a peak at day 7 (PMID: 26544945). Hence, this is not a synchronous process, where we could pinpoint simultaneous activation of the PIDDosome in all cells at the same time, which would facilitate biochemical analysis, e.g., by western blotting for signs of Caspase-2 activation (i.e. the loss of its pro-form, PMID: 28130345). 2) Our most reliable readout, MDM2 cleavage by caspase-2 giving rise to specific fragments detectable in western, is not applicable to mouse tissue, as the antibody we use only detects human MDM2 (PMID: 28130345) and no other MDM2 Ab we tested gave satisfactory results. Independent of that, 3) we do not see involvement of p53 in CM ploidy control (arguing against a role of MDM2). *

      *As such, we can only offer to look at extra centrosome clustering in postnatal binucleated CM (as also suggested further below), as a putative trigger for PIDDosome activation. However, this has been published by the first author of this study before (PMID 31301302). Given that we have made the significant effort to time resolve the increase in ploidy in postnatal mice (please note that several hearts needed to be pooled for each time point, analyzed in multiple biological replicates), we think that our conclusions are well-justified based on the genetic data provided. *

      Concerning the methods, the authors must add the references for each product they used and not only the origin. When relevant, the RRID should be indicated. Without this information the method and the data cannot be reproduced.

      We will update this information where relevant to reproduce our results

      Minor comments:

      In general, the text and the figures are clear. Nevertheless, I would suggest the following changes:

      • Figures 1B, 2B and 2C: the y-axis must start at 0.

      We will adopt axes accordingly

      Figure 4A: The authors should stain centrosomes in cardiomyocytes. This should strengthen the conclusion taken by the authors based on the results obtained in mice depleted for ANKRD26. Indeed, for the moment they are insufficient to conclude about the role of the centrosomes. The authors should show that centrosomes cluster in cardiomyocytes (a condition necessary for PIDDosome activation in polyploid cells) and if possible that component of the PIDDosome are recruited here.

      *This point is well taken and addressed in part above. Clustering of extra centrosomes has been documented and published by the first author of this study in rat polyploid cardiomyocytes (PIMID; cited). We can offer to show clustering of centrosomes in mouse CM isolated from day 7 hearts, but while PIDD1 can be detected well in MEF, we repeatedly failed to stain fro PIDD1 in primary CMs. *

      Figure 4F: I would suggest to modify the working model to emphasize more the differences between WT and PIDDosome KO.

      We will aim to improve this cartoon/graphical abstract

      The prior studies are referenced appropriately.

      Reviewer #2 (Significance (Required)):

      How polyploid cells control their ploidy levels during differentiation remains poorly understood. The data presented here represent thus an advance concerning this question. The actual model concerning PIDDosome activation relies on the presence of extra centrosomes that drives the ANKDR26-dependent recruitment of the PIDDosome. Then, Caspase 2 is activated leading to a p53-p21 dependent cell cycle arrest (Burigotto & Fava, 2021, Sladky and Villunger, 2020; Janssens & Tinel, 2012; Evans et al., 2020; Burigotto et al., 2021). In this study, the authors showed that similar pathway takes place during cardiomyocyte differentiation to control ploidy levels. These data are reminiscent of previous work showing PIDDosome involvement during hepatocyte polyploidization (Sladky et al. 2020). Together, these data highlight the prominent role of the PIDDosome complex in controlling ploidy levels in physiological context. Importantly, this study identified that the classical p53-dependent cell cycle arrest described after PIDDosome activation is not involved here. Instead, the data established that independently of p53, p21 contribute to control cardiomyocyte ploidy. In consequence, this study extends the initial pathway associated with PIDDosome activation and suggest that other mechanisms could take place to restrain cell proliferation upon PIDDosome activation. Overall, this makes this paper significant and of interest for the following fields: polyploidy, heart/cardiomyocyte development and PIDDosome.

      My field of expertise includes polyploidy, cell cycle and genetic instability.

      We thank this reviewer for the time taken and the positive feedback provided.

      • *

      • *

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      • *

      N/A

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *

      • *As outlined above, limited tools are available to validate putative caspase-2 substrates, identified in proteomics analysis, in an impactful manner. *

      • *Also, as discussed above, we deem myocardial infarction experiments in mice as unsuitable to improve our work, as with all likely-hood, they will yield negative results. *
    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors investigated the role of the PIDDosome during cardiomyocyte polyploidization. PIDDosome is a multi-protein complex activating the endopeptidase Caspase-2, and shown to be involved in eliminating cells with extra centrosomes or in response to genotoxic stress (Burigotto & Fava, 2021, Sladky and Villunger, 2020). In both cases, the PIDDosome is recruited in a ANKRD26-dependent manner at the centrosomes leading to p53 stabilization and cell death (Burigotto & Fava, 2021; Evans et al., 2020; Burigotto et al., 2021).

      Here, by studying mouse cardiomyocyte differentiation, the authors showed that PIDDosome is imposing ploidy restriction during cardiomyocyte differentiation. Importantly, in contrast to a previous report in the liver (Sladky et al., 2020), they showed that PIDDosome acts in a p53-independent manner in cardiomyocytes. Indeed, they suggested that PIDDosome controls ploidy in cardiomyocytes through p21 activation.

      Major comments:

      In general the conclusions of the authors are well supported by the experiments. However, I would suggest the following experiments/analysis to strengthen the paper:

      • The authors should improve the Figure 1 to help the readers who are not familiar with cardiomyocyte polyploidization. For instance, I would suggest to add a scheme to summarize cardiomyocyte polyploidization (in terms of nuclear size, mono vs multi and so on). Based on the images they presented in 1B, the authors should also measure the nuclear area or volume in the different conditions in which components of the PIDDosome were depleted. Indeed, these two parameters should be easier to conceptualize for the readers (instead of the fluorescence nuclear intensity). This could help to understand if the nuclear size is maintained between the different conditions and if this is comparable between mono, bi or multinucleated cardiomyocytes.
      • In Figure 2A, the authors presented cross section of heart from animals showing that PIDDosome depletion has no effect on heart size. This is a surprising results since cardiomyocytes have higher ploidy levels and this could have an effect on their function. Since the importance of this observation, the authors should present a quantification of the heart size in the different conditions shown in Figure 2A. Also, the percentage of cardiomyocytes presenting higher levels of ploidy seems quiet low. The authors should discuss this point. In particular because this could explain the absence of consequences on heart size and function at steady state.
      • In Figure 2D, the authors measured the cardiomyocyte cross-sectional area and concluded that removing PIDDosome components have no effect on cardiomyocyte cell size. Since it has been shown that ploidy increase is normally associated with an increase in cell area, the authors should measure cell area of cardiomyocytes analyzed in Figure 1B. It could be then interesting to establish a correlation with nuclear area and the mono, bi or multinucleated status. This will strengthen the results showing that ploidy increases without affecting cell area.
      • The authors should discuss the fact that PIDDosome depletion lead only to a mild increase in ploidy levels (4N) in a small percentage of cardiomyocyte. If the PIDDosome is controlling ploidy, one could expect that removing it should lead to a drastic increase in the ploidy levels. Is PIDDosome depletion leading to cell death in some cardiomyocyte? The authors should discuss this point in the discussion or if relevant show a staining with an apoptosis marker. Is another mechanism compensating to prevent higher ploidy levels in cardiomyocytes?
      • Even if the authors presented RNAseq data suggesting that the PIDDosome is activated during cardiomyocyte differentiation, they should clearly demonstrate this point to strengthen the message of the paper. Indeed, the conclusions are based on the absence of PIDDosome components triggering higher ploidy in cardiomyocytes. However, we don't know whether (and when) the PIDDosome is activated during cardiomyocyte differentiation to control their ploidy levels. I would suggest to analyze PIDDosome activation markers by immunofluorescence in cardiomyocytes at different developmental stages.

      Concerning the methods, the authors must add the references for each product they used and not only the origin. When relevant, the RRID should be indicated. Without this information the method and the data cannot be reproduced.

      The statistics are well indicated in the figures and in the figure legends.

      Minor comments:

      In general, the text and the figures are clear. Nevertheless, I would suggest the following changes:

      • Figures 1B, 2B and 2C: the y-axis must start at 0.
      • Figure 4A: The authors should stain centrosomes in cardiomyocytes. This should strengthen the conclusion taken by the authors based on the results obtained in mice depleted for ANKRD26. Indeed, for the moment they are insufficient to conclude about the role of the centrosomes. The authors should show that centrosomes cluster in cardiomyocytes (a condition necessary for PIDDosome activation in polyploid cells) and if possible that component of the PIDDosome are recruited here.
      • Figure 4F: I would suggest to modify the working model to emphasize more the differences between WT and PIDDosome KO.

      The prior studies are referenced appropriately.

      Significance

      How polyploid cells control their ploidy levels during differentiation remains poorly understood. The data presented here represent thus an advance concerning this question.

      The actual model concerning PIDDosome activation relies on the presence of extra centrosomes that drives the ANKDR26-dependent recruitment of the PIDDosome. Then, Caspase 2 is activated leading to a p53-p21 dependent cell cycle arrest (Burigotto & Fava, 2021, Sladky and Villunger, 2020; Janssens & Tinel, 2012; Evans et al., 2020; Burigotto et al., 2021). In this study, the authors showed that similar pathway takes place during cardiomyocyte differentiation to control ploidy levels. These data are reminiscent of previous work showing PIDDosome involvement during hepatocyte polyploidization (Sladky et al. 2020). Together, these data highlight the prominent role of the PIDDosome complex in controlling ploidy levels in physiological context.

      Importantly, this study identified that the classical p53-dependent cell cycle arrest described after PIDDosome activation is not involved here. Instead, the data established that independently of p53, p21 contribute to control cardiomyocyte ploidy. In consequence, this study extend the initial pathway associated with PIDDosome activation and suggest that other mechanisms could take place to restrain cell proliferation upon PIDDosome activation.

      Overall, this makes this paper significant and of interest for the following fields: polyploidy, heart/cardiomyocyte development and PIDDosome.

      My field of expertise includes polyploidy, cell cycle and genetic instability.

    6. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Leone et al describes the role of the PIDDosome in cardiomyocytes. Using a series of whole body and cardiomyocyte specific knockouts, the authors show that the PIDDosome maintains correct ploidy in these cells. It achieves this through inducing cell cycle arrest in cardiomyocytes in a p53 dependent manner. Despite this effect on ploidy, PIDDosome-deficient hearts show no structural or functional defects. Statistics and rigor appears to be adequate.

      Major comments:

      1. Figure 1 uses fluorescent intensity of a nuclear stain to determine ploidy per nucleus and they further separate the results into mononucleated, binucleated or multinucleated cells. It is hard to know how to interpret these results without further information or controls. Is there a good positive control that can be used to help to show whether this assay is quantitative. The differences are larger with the Raidd and caspase-2 knockouts than with the Pidd knockouts but this is not addressed.
      2. On line 459 the authors state that the increase in polyploidy in PIDDosome knockouts occurs in adult hood but this is not directly tested. In fact in the next section the polyploidy is assessed in early postnatal development. This statement should be explained or removed
      3. In Figure 4. The authors obtained RNAseq data for P1, P7 and P14 but only show the differences with and without caspase-2 at P7. Given that the differences in ploidy are more significant at P14 (Fig 3D), all the comparisons should be shown along with analysis of whether the same genes/gene families are altered in the absence of caspase-2
      4. Some validation of the RNAseq and/or proteomics results would be an important addition to this study
      5. Figure 4D: the authors make the conclusion that p21 is downstream of PIDD (et p53 independent). However, this is not supported by the data because the increase in 4N cells/decrease in 2N cells, although statistically significant, is nowhere near that of caspase-2 KO and caspase-2/p21 KO. Statistics should also compare p32KO with c2KO. In the absence of any other data, the more likely conclusion is that p21 is not involved.

      Minor comments:

      Suggest moving Figure 4A to Figure 3 as it seems to fit better there based on the citation of this figure in the text Recommend enhancing the brightness of microscopy images in Figure 1E and 2D

      Significance

      This study provides interesting information for the role of the PIDDosome in protecting from polyploidy and adds to the body of work by this same group studying this pathway in the liver. The main weakness in terms of significance is the lack of a phenotype in the hearts of these animals. Therefore, it is clear that ploidy (or at least PIDDosome dependent ploidy) has minimal impact on cardiac development. The authors mention that they have not tried giving these mice an MI or inducing any other type of cardiac damage. Although it is understood that these experiments are likely outside of the scope of the present study, without this information the impact of this study is moderate. I recommend expanding the discussion to provide a more indepth possible rationale as to why ploidy perturbations do not lead to structural changes like in the liver. Despite this, the insights to the pathway itself are interesting to investigators in the caspase-2 field if a little underdeveloped, especially concerning the role of p21.

      My expertise is in cell death and caspase biology (especially caspase-2). I havesufficient expertise to evaluate all parts of this paper.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This article investigates the phenomenon of intracellular protein agglomeration. The authors distinguish between agglomeration and aggregation, both physically characterising them and developing a simple but elegant assay to differentiate the two. Using microscopy and structural analysis, this research demonstrates that unlike aggregates, agglomerates retain their folded structures (and are not misfolded), and do not colocalise with chaperones or interact with the proteostasis machinery which targets and breaks down misfolded proteins. The inert nature of agglomerates was further confirmed in fitness assays, though they were observed to disrupt the yeast proteome. Overall, agglomerated proteins were described and characterised, and shown to be largely neutral in vivo.

      The claims and conclusions were well supported by the data. Microscopy and CD spectra (previously published) were used to confirm the nature of agglomerates and to rule out colocalistion with proteostasis machinery. This was confirmed by testing ubiquitination.

      The fitness of yeast cells carrying enzymatically-inactive agglomerates was assayed by generating growth curves over 24 hours. The growth rate and doubling time were taken from these growth curves as a proxy for relative fitness. The authors mention not wanting to mask differences in lag, log or stationary phases between mutants. This could be achieved by using the area under each growth curve, rather than growth rate or doubling time alone. No further experimentation would be needed, and area under the curve may provide a more holistic metric to measure fitness by.

      The results indicate that agglomerates confer a slight fitness advantage. The authors do not speculate on a reason for this. I would be interested to know why they thought this might be.

      Referees cross-commenting

      I have read the reports from the other reviewers and agree with their comments.

      Significance

      Protein filamentation is observed across the tree of life, and contributes greatly to cell structure and organisation (Wagstaff, J., Löwe, J. Prokaryotic cytoskeletons: protein filaments organizing small cells. Nat Rev Microbiol 16, 187-201 (2018).). Recent work in this field has shown that self-assembly is also important for enzyme function (S. Lim, G. A. Jung, D. J. Glover, D. S. Clark, Enhanced Enzyme Activity through Scaffolding on Customizable Self-Assembling Protein Filaments. Small 2019, 15, 1805558.). Previous work from several of these authors demonstrated that the ability of a protein to filament is subject to selection (Garcia-Seisdedos H, Empereur-Mot C, Elad N, Levy ED. Proteins evolve on the edge of supramolecular self-assembly. Nature. 2017 Aug 10;548(7666):244-247. doi: 10.1038/nature23320. Epub 2017 Aug 2. PMID: 28783726.). It has become increasingly clear that protein assemblies are ubiquitous, evolvable and perhaps overlooked in research.

      This research explores a specific type of filamentation, named agglomeration, unique in that the protein which assemble are not misfolded (Romero-Romero ML, Garcia-Seisdedos H. Agglomeration: when folded proteins clump together. Biophys Rev. 2023;15: 1987-2003.). This is particularly of biomedical interest due to its role in disease, such as sickle cell anaemia (J. Hofrichter, P.D. Ross, & W.A. Eaton, Kinetics and Mechanism of Deoxyhemoglobin S Gelation: A New Approach to Understanding Sickle Cell Disease*, Proc. Natl. Acad. Sci. U.S.A. 71 (12) 4864-4868, https://doi.org/10.1073/pnas.71.12.4864 (1974).) The current research adds to the field by specifically exploring agglomerates in the most detailed methodology to date.

      The novelty of this research lies especially in two areas; (1) establishing a method for distinguishing between aggregation and agglomeration, and (2) the finding that agglomerates are largely innocuous in vivo. The method established for defining agglomerates is simple, elegant and well-described in this paper's methods. The authors then probe cellular responses to agglomeration via both proteostasis machinery and cellular fitness. They noted no disruption to fitness and observed little targeting of agglomerates by chaperones. The experiments were thorough, conclusive, and resulted in interesting findings.

      The inertia of this type of protein filament is unexpected; agglomerates are large and have been associated with disease. The results of this study, however, indicate that agglomerates are non-toxic and well-tolerated in vivo. The authors speculate that agglomerates may have evolved in a non-adaptive process, which is evolutionary very interesting. They also posit that these results could lead to synthetic biology applications such as a tracking expression or as a molecular sensor. This work is of great interest and impact both in cell biology, biomedicine and in-vivo biology.

      Personal note: I come from a background of enzyme evolution and have viewed the work in this light.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a very interesting paper investigating the fitness and cellular effects of mutations that drive dihedral protein complex into forming filaments. The Levy group have previously shown that this can happen relatively easily in such complexes and this paper now investigates the cellular consequences of this phenomenon. The study is very rigorous biophysically and very surprisingly comes up empty in terms of an effect: apparently this kind of self-assembly can easily be tolerated in yeast, which was certainly not my expectation. This is a very interesting result, because it implies that such assemblies may evolve neutrally because they fulfill the two key requirements for such a trajectory: They are genetically easily accessible (in as little as a single mutation), and they have perhaps no detrimental effect on fitness. This immediately poses two very interesting questions: Are some natural proteins that are known to form filaments in the cell perhaps examples of such neutral trajectories? And if this trait is truly neutral (as long as it doesn't affect the base biochemical function of the protein in question), why don't we observe more proteins form these kinds of ordered assemblies.

      I have no major comments about the experiments as I find that in general very carefully carried out. I have two more general comments:

      1. The fitness effect of these assemblies, if one exists, seems very small. I think it's worth remembering that even very small fitness effects beyond even what competition experiments can reveal could in principle be enough to keep assembly-inducing alleles at very low frequencies in natural populations. Perhaps this could be acknowledged in the paper somewhere.
      2. The proteins used in this study I think were chosen such that they do not have an important function in yeast that could be disrupted by assembly This allows the effect of the large scale assemblies to be measured in isolation. If I deduced this correctly, this should probably be pointed out agin in this paper (I apologise if I missed this).
      3. The model system in which these effects were tested for is yeast. This organism has a rigid cell wall and I was wondering if this makes it more tolerant to large scale assemblages than wall-less eukaryotes. Could the authors comment on this?

      Minor points:

      In Figure 2D, what are the fits? And is there any analysis that rules out expression effects on the mutant caused by higher levels of the wild-type? The error bars in Figure 2E are not defined.

      Significance

      This is a remarkably rigours paper that investigates whether self-assembly into large structures has any fitness effect on a single celled organism. This is very relevant, because a landmark paper from the Levy group showed that many proteins are very close in genetic terms to forming such assemblies. The general expectation I think would have been that this phenomenon is pretty harmful. This would have explained why such filaments are relatively rare as far as we know. This paper now does a large number of highly rigours experiments to first prove beyond doubt that a range of model proteins really can be coaxed into forming such filaments in yeast cells through a very small number of mutations. Its perhaps most surprising result is that this does not negatively affect yeast cells.

      From an evolutionary perspective, this is a very interesting and highly surprising result. It forces us to rethink why such filaments are not more common in Nature. Two possible answers come to mind: First, it's possible that filamentation is not directly harmful to the cell, but that assembling proteins into filaments can interfere with their basic biochemical function (which was not tested for here).

      Second, perhaps assembly does cause a fitness defect, but one so small that it is hard to measure experimentally. Natural selection is very powerful, and even fitness coefficients we struggle to measure in the laboratory can have significant effects in the wild. If this is true, we might expect such filaments to be more common in organisms with small effective population sizes, in which selection is less effective.

      A third possibility is of course that the prevalence of such self-assembly is under-appreciated. Perhaps more proteins than we currently know assemble into these structures under some conditions without any benefit or detriment to the organism.

      These are all fascinating implications of this work that straddle the fields of evolutionary genetics and biochemistry and are therefore relevant to a very wide audience. My own expertise is in these two fields. I also think that this work will be exciting for synthetic biologists, because it proves that these kinds of assemblies are well tolerated inside cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this work, the authors used yeast cell as a model system to study the abovementioned question. They established a model protein system using fluorescently labeled proteins that can form both agglomerates and aggregates. Using imaging experiments, they arguably showed that agglomerates do not colocalize with the proteostasis machinery, echoing what was observed by proteomics results. The proteomics results after pull down assay to study the interactome revealed that agglomerate-size-dependent changes were dependent on the cell-wall and plasma-membrane proteins. On the other hand, as expected, the misfolded proteins (aggregates) showed heavy involvement of proteostasis network components.

      Although the experiments still lack some controls and failed to support some of the conclusions, I found this work is a nice complement of the field to emphasize the point that "aggregates" and "agglomerates" are two different states, which is often mistaken by lots of researchers in recent years, in particular with the membraneless organelles (LLPS). I support its publication after the authors may consider the following suggestions and make necessary improvement.

      Major concerns:

      My major concern was raised by the lack of evidence to support the model system's folding state in the cell. 1. In Figure 1 and 2, I found the evidence to distinguish the folded state of proteins in the cells was limited. The concept of using hybrid imaging technique to prove the folding state is not a common experiment. The description of Figure 2 was very limited. I am sure the general audience can be convinced that the model proteins were actually folded and form agglomeration. 2. In addition, for mutants formed aggregates, the authors may consider to perform fractionation or crosslinking or native page experiment to show the evidence of protein misfolding and aggregation. 3. Have the authors considered to use FRAP assay to distinguish "aggregates" and "agglomerates" states in the cell? Does each of the state display different dynamics in the cell?

      Minor concerns:

      1. In Figure 3, it is very interesting to see such patten. I wonder why some of the chaperones were not responsive to misfolded proteins but some were very addicted to proteostasis. Could you elaborate more on this point? Are they chaperone sensitive, namely selective to 60/10, 70/40 or 90 system?
      2. In Figure 6, I suggest to add GO analysis and KEGG analysis to distinguish pathways and functional mismatch between "aggregates" and "agglomerates" interactomics.
      3. The quantity control for the proteomics studies is needed, namely the reproducibility of 3 repeats?
      4. This may beyond the scope of this work. I am interested whether the authors could point out whether similar works can be done in mammalian cells. What is the model system for mammalian cell that can form "agglomerates".

      Referees cross-commenting

      I read through the other two reviewers' comments, which I found reasonable. It seems like all reviewers agreed that this work is of enough significance for the field only with several technical concerns.

      Significance

      The submitted manuscript emphasized on a very important but often misleading concept: "aggregates" and "agglomerates" are two different states of protein structures in the cell with distinct physiological roles. However, these two states are of very similar phenotype: punctate structure in the cell. While the proteostasis network has been well-established for its central role of protein quality control and coping with misfolded and aggregated proteome, the authors attempted to profile the mechanism and physiological impact of mutation-induced folded-state protein filamentation, namely a model of "agglomerates". Such overarching goal of this work clearly pointed out the novelty of this work. Clearly, this is a new angle and aspect remained to be clarified for the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02887

      Corresponding author(s): Philippe Bastin

      1. General Statements [optional]

      • *

      We thank the reviewers for their constructive suggestions. We are delighted to see that they appreciated our work and its interest for the broad cell biology community, as well as the potential impact of the inducible expression of tagged tubulin as a new tool to investigate microtubule assembly at large.

      We are now providing a full revision that contains two major modifications and that addresses all the minor points detailed below. The two major modifications are:

      • A simplification and a shortening of the text as requested by reviewers 1 and 3
      • The addition of a new experiment evaluating the role of the locking protein CEP164C to gain insight into the mechanism, as suggested by reviewers 1 and 2 Briefly, CEP164C is a protein localised to the transition fibres (structures that dock the basal body of the flagellum to the membrane) of only the old flagellum. Its depletion leads to an excessive elongation of the old flagellum and the production of a shorter new flagellum, suggesting competition between the two flagella for tubulin incorporation (Atkins et al., 2021). In the new figure 5, we have expressed tagged tubulin in the CEP164CRNAi cell line and formally demonstrated simultaneous incorporation in both flagella. Unexpectedly, the new flagellum incorporated more tubulin than the old one, suggesting a bias of tubulin targeting in favour of the new flagellum and the existence of additional contributors to the Grow-and-Lock model.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1

      Evidence, reproducibility and clarity

      The manuscript by Daniel Abbühl on "A novel approach to tagging tubulin reveals MT assembly dynamics of the axoneme in Trypanosoma brucei" uses an innnovative approach to label tubulin, which allows the authors to unveil new mechanisms in flagellar length regulation.

      The manuscript is very nice and will be very interesting for the cell biology community and therefore should be accepted. In some parts it becames a bit complex with all the models and complex phrasing, I wonder whether the text could be simplified to be more appealing. I have a few minor comments:

      We agree that some of the explanations are lengthy and complex. We have simplified the explanations and hopefully made the models more accessible. Complexity comes from the fact that trypanosomes do not have a synchronized cell cycle.

      -From the model the authors show in Figure 8- there should be a way of pulsing the cells in G1 for a short amount of time -2 hours- and getting both flagella tips labelled. But the authors seem to require longer labelling to get that result. This should be better explained.

      We are not quite sure what is meant here with both flagella as in G1-phase, all cells are mono-flagellated. We do see mono-flagellated cells with a labelled tip after 2 hours, both with the HALO-tag or the Ty-1-tubulin system.

      In regard to bi-flagellate cells, we believe that incorporation in the OF happened at the beginning of G1-phase when the cell was mono-flagellated. If tubulin is present at that point, it will be incorporated at the tip. This cell then approaches the end of G1-phase and starts to initiate NF assembly. Since tagged tubulin is already present it will be incorporated along the whole length of the NF.

      A short induction of 2h would not suffice as it wouldn't cover the duration of the G1-phase and the initiation of a NF (duration of G1-phase is ~4h). We attempted to explain this in Fig. 4 and reworked the text to make this clearer.

      -Why do some cells not express the construct? Weren´t they all selected?

      We never managed to get a cell line where inducible expression is present in 100% of cells. Here, around 95% of cells were positive for Ty-1-tubulin after 24h of induction. Non-expression is not a phenomenon restricted to this tubulin cell line but also observed with other ectopically expressed proteins (e.g. Sunter et al. JCS 2015, Bastin et al. MCB 1999). All these cell lines represent clonal populations and are resistant to antibiotic treatment, however not all cells express the respective protein. For each experiment where we believed the number of expressing cells matter (for example the washout), we quantified in how many cells Ty-1-tubulin was present in the cell body microtubules.

      -"The linear regression line in Fig. 3C was corrected by subtracting 45 minutes from each timepoint due to the previously reported delay between addition of tetracycline and the expression of the respective protein". However, in the authors data the delay may amount to one hour (western analysis- S4). Shouldn´t they use their data.

      Indeed, the western blot shows expression after 1-hour, however we did not take a 45-minute timepoint, so we don't know if the protein was detectable at that time. In addition, IFA is more sensitive than western blot. We cannot say exactly when the average cell starts to express the induced protein.

      -Fig 3: To measure the timepoints of flagella growth, wouldn´t it be better to do it with NF that started to grow before induction, rather than starting to grow after induction, to be sure that the timing of incorporation is fully accounted for?

      We indeed did consider only NFs, which started to grow before induction, as suggested by the reviewer. In the revised version the description of the experiment can be found on page 9 line 22 - 28.

      -Although it is not the focus of the manuscript it would have been very interesting to use the CEP164C mutant to see whether it would change the dynamics of incorporation and fully test their model and discussion.

      This is a great suggestion, so we performed some experiments to address this issue. When CEP164C was knocked down before Ty-1-tubulin expression, integration is seen at the distal tip of both NF and OF. This is coherent with the idea of removal of the locking protein from the OF. However, lengths of the green segments in NF and OF do not have the same length (NF ~6 µm, OF ~2 µm), which indicates that CEP164C might not be the only protein involved in regulating flagellum length. A new figure explaining this experiment was added (Fig. 5, Fig. S6). We believe this data provides novel insight on the locking mechanism and strengthens the manuscript.

      -In some parts of the manuscript/supplemental material the authors say they insert the Ty-1- tag one aminoacid after the acetylated lysine- other parts they say two aminoacids after- this should be consistent.

      We thank the reviewer for spotting these mistakes, we have changed the text accordingly.

      -Fig. S1: 'Binding epitope of the TAT-1 antibody is highlighted in red'. There is no highlighting in red in this figure?

      This sentence was removed.

      -Fig. S2: Western blots are not very clear. What is the 'X' present in the C (first lane)? Weight of markers should be shown also in S4.

      Molecular weight markers have been added. X is an empty lane, we have now indicated this in the figure legend.

      -Fig 5: 'C: Frequency of bi-flagellated cells grouped by the different types of' The authors didn't finish the sentence.

      Previous Fig. 5 is now Fig. 6. Sentence has been completed. "Frequency of bi-flagellated cells grouped by different types of old flagella"

      -Fig. S7: The 'B' is missing in both picture and legend.

      This has been added


      Significance

      This study advances our knowledge of flagellar length regulation and maintenance. Moreover, the tools designed in this work will be very useful for the cell biology community in general.


      Reviewer #2

      Evidence, reproducibility and clarity

      Summary: The length of the old flagellum of Trypanosome is constant during G1 phase as well as during cell cycle progression when the new flagellum is assembled. The authors have previously proposed a "Grow and Lock" model for the flagellar length control in which no flagellar building blocks are incorporated. To test this hypothesis, the authors used a tagging strategy for alpha-tubulin and tracking its incorporation. The authors showed that the new flagellum incorporates new tubulins, as is expected. For the mature flagellum, tubulins are incorporated at the flagellar tip and only when the cells start to assemble the new flagellum. Thus, it shows that old flagellum is stable but not completely locked for the incorporation of tubulins.

      Major comments: The study is methodologically rigorous, integrating fluorescence microscopy, biochemical approaches, and proteomic analyses to validate the functionality of the tagged tubulin. The use of both inducible expression and endogenous protein tagging (HaloTag) strengthens the conclusions. This study has supported the "Grow-and-Lock" model" that the authors previously proposed. In addition, they have revealed that the stability of the old flagellum is temporally controlled.

      The data showed that brief incorporation of tubulins at the tip of the old flagellum occurs when the cells start to form the new flagellum. I thought the assembly of the new flagellum occurs during the cell division. However, in the abstract, it says that "The restriction is lifted briefly after the bi-flagellated cell has divided." Is my understanding wrong?

      We believe incorporation at the tip of the "OF" occurred after the cell has divided, when the OF daughter is mono-flagellated. It happens before this daughter cells starts assembling its new flagellum is formed. Of course, when looking at biflagellated cells, the NF as well as the tip of the OF will be green, but our data supports that incorporation happened in G1-phase and not during the biflagellated stage as the lock seals the OF before the NF emerges. To clarify on terminology: The bi-flagellate stage begins when basal bodies are duplicated, shortly after the beginning of S-phase and ends with cytokinesis. This means G1-phase and the mono-flagellated stage are nearly the same (Woodward and Gull, JCS1990) and occupy ~40% of the cell cycle.

      P12, "The cartoon in Fig. 5A illustrates the progression of the cells in scenario 2 (Fig. 4A) over the duration of one cell cycle (~9 hours)" I thought that one cell cycle should start with cell with only one flagellum, followed by assembly of a new flagellum during cell division, the cell then divides when the new flagellum is almost completely assembled. If my understanding is correct, perhaps the cartoon should be modified accordingly.

      Indeed, the cell cycle starts with a cell in G1-phase. Here, we have chosen the initiation of a NF assembly as our starting point because we focused the investigation on bi-flagellated cells. We have now illustrated the cell cycle (adapted from Woodward and Gull 1990) and when cells are biflagellated in Fig. 6A (revised version).

      Minor comments:

      1) Several references are not correctly formatted. P3: (Flavin and Slaughter, 1974) (Rosenbaum 1969). P10, (Sherwin et al., 1987)(Sheriff et al., 2014) 2) In several places there are no space between the number and the unit. For eample, P3, 9 - 24µm/h. 7, 1μg/m; P8, 50kDa; P9, 1M; 8-9h; P11, 2.9µm/h and etc. 3) P11, Flagella were extracted. I thought the cells were extracted.

      Thank you for pointing these out, we have changed these in the text.


      Significance

      Cilia and eukaryotic flagella are considered dynamic structures in which the flagellar components especially tubulins under constant turnovers even in steady state. This work demonstrates that in Trypanosome the stable old flagellum is temporally controlled for tubulin turnovers, suggesting a tight regulation of microtubule dynamics. Future elucidation of the regulatory mechanism will be more interesting. This work will be interesting to the field of cilia and microtubules. In addition, the new technique used for tracking tubulins will also be interesting.

      I am an expert on ciliary biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      This study seeks to investigate the mechanism by which the length of an eukaryotic cilium is set and maintained in a constant state. The flagellated protist Trypanosoma brucei serves as the study model and the authors take advantage of the genetic tools that allow precise modification and tagging of flagellar proteins and they build on prior knowledge about the well-characterised flagellar assembly cycle, which allows tracking the assembly of a new flagellum alongside an existing old one in the course of one cell cycle. The group of Bastin has previously reported a very interesting "Grow-and-Lock Model for the Control of Flagellum Length in Trypanosomes" and this current manuscript provides a test of this model, and a refinement. Key to this is an advance in technique, reported here, namely expression of an epitope tagged version of alpha tubulin. The epitope is inserted in an internal loop, which apparently for the first time provides a traceable tubulin that is reliably incorporated into the cytoskeleton (subpellicular array, spindle and cilium). Expressing an inducible version of this Ty-1-tubulin allows for a set of experiments that measure the place and timing of tubulin incorporation into cilia. The results are largely confirmatory of previous findings (incorporation exclusively into the new flagellum, at the distal end, linear growth rate that matches previous estimates). Examination of tubulin incorporation patterns then reveal additional information about the old flagellum: evidence from Ty-1-tubulin labelling, corroborated by incorporation patterns of another ciliary protein (RSP 4/6) suggest that the "lock" on the old flagellum is relieved for short periods after cell division, leading to a refined model presented in Figure 8.

      Major comments:

      This study provides an elegant test of the grow-and-lock model and the major conclusions are supported by the data. I have no major concerns.

      Minor comments:

      There are several minor points that could be addressed to make the manuscript easier to follow (and adding line numbers to the manuscript would help with reviewing).

      The introduction is quite long. Some of the well-established background information on the T. brucei cell cycle could be shortened. If the paper is intended for a broader audience, it would be valuable instead to cite studies that have succeeded in tagging tubulin and tracing its incorporation in other cilia. Could the Ty-1-tubulin approach be relevant more broadly or are simpler methods already established?

      The introduction has been shortened, we now also cite two published studies that tracked tubulin integration in Chlamydomonas and C. elegans respectively.

      On p.6 the rationale for endogenous tagging was to "reduce the risk of artifacts portentially due to untimely expression or unnatural protein levels". However most of the experiments were done with ectopically expressed inducible Ty-1-tubulin. For the experiments it is crucial to use an inducible system but the authors may wish to comment why the risk of artifacts was no longer a concern.

      The reasoning here was that in case the Ty-1-tubulin would not have been incorporated into MTs, we could have attributed it solely to the presence of the tag and no other factors, but this was not the case. This therefore allowed us to move to the inducible expression system.

      On p.7 / Fig S2A-B there appears to be a mistake in the presentation. Spindles are mentioned in the text - I can't see any in the figure. Fig S2A and B both show cytoskeletons, but the text suggests only B is about cytoskeletons. None of the blot shows BB2 staining of different cell fractions, contrary to statements in the text. The letter codes in the panel (T, C, D) don't match the codes in the legend (T, P, S).

      We thank the reviewer for spotting the mistakes. A panel with the spindle was added in Fig. S2. We did not stain fraction blots of the in-situ tagged cell lines with BB2. However, this was done with the inducible cell line and is shown in Fig. 1D. Letter code in the legend was adapted to match the figure.

      Figure 1. The evidence for incorporation into spindles is not strong. The structure indicated by the arrive could be a spindle but it's not very clear. There is a great example of a labelled spindle only in figure S5A. Here, at the start, it would be good to show a panel of cells in successive cell cycle stages (best, whole cells and cytoskeletons) to clearly show the structures that are labelled with Ty-1-tubulin.

      The current Fig. 1B (Fig. 1A before) depicts whole cells of an induced and a non-induced culture; we show whole cells to provide a complete picture of tubulin integration. A panel with detergent extracted cytoskeletons from the in situ tagged cell line has been added to Fig. 1A. We chose to show cytoskeletons or isolated flagella instead of whole cells because (1) the flagella are easier to see and (2) it formally demonstrates that tagged tubulin is incorporated in MTs.

      In general, tubulin labelling of the spindle was more consistently observed in whole cells as we did not use spindle preserving extraction buffers when preparing cytoskeletons. However, we did observe clear spindles in cytoskeletons as well (see Fig. S5 for example). The same was observed for the beta-tubulin specific KMX1 antibody in the past which is the gold standard to visualize the spindle (Sasse and Gull JCS1988). Regardless, a panel depicting spindle progression through mitosis using staining of Ty-1-tubulin has been added in Fig. S2 (The panel is a mix of whole cells and cytoskeletons).

      On p.8 (end of first paragraph) there is reference to cell cycle analyses, but no data is shown. Also on p.8, please clarify what the evidence is that "a fraction of cells did not respond to tetracycline". The fact that they remain unstained by Ty-1-tubulin is not in itself evidence they did not respond to tetracycline.

      We did not show the cell cycle data as it was similar to non-induced and does not provide any new information in our opinion. Hence, the sentence has been removed.

      The reviewer is correct that we do not have evidence that these cells did not respond to tetracycline. Some cells remained completely devoid of Ty-1-tubulin even after multiple days of induction. This was typically between 5-10% of cells. In experiments where the exact number is important, we counted the amount of "non-expressers" in whole cells.

      Figure S4A. The blot for the soluble fraction is not of great quality. I don't see how the conclusion was reached that the Ty-1-tubulin bands were faint.

      The blot of the soluble fraction that was stained with BB2 had to be exposed a lot longer compared to the blot stained with TAT-1. The soluble blots were repeated with the same result (lots of background noise when using BB2, a clear blot with TAT-1). In the TAT-1 blot only the endogenous tubulin band is clearly visible, with some very faint signal above corresponding to the Ty-1-tubulin. Soluble Ty-1-tubulin with BB2 or TAT-1 is visible in Fig. 1D after longer inductions.

      On p.11, it would be interesting to compare measured elongation rates with previously measured estimates for flagellum growth, comparing the growth rates, and relating them to cell cycle times in the corresponding experiments (which vary slightly between labs and studies).

      We attempted to address this in the discussion by comparing our experiments to the assembly rate measured with the PFR as reporter (Bastin et al. 1999). We could mention the corresponding doubling times in correlation to how many cells are bi-flagellated, but this was only done with the Ty-1-tubulin cell line and not with the PFR. In our experiments the average doubling time was ~9 hours with 52% of cells being bi-flagellated. This was measured with FTZC (marker of the transition zone at the base of the flagellum) and Mab25 (marker of the axoneme of the flagellum) which will lead to a slight underestimate of the real number of bi-flagellated cells, as the NF is initially very close which makes it difficult to notice/differentiate from the old one.

      Figure S6. I find the presentation of this figure confusing. It should be revised with clearer labelling of "cell cycle 1", "cell cycle 2", and the precise meaning of "type 3" should be clarified. There are two instances of "type 1" in the drawing, but one of these seems to fulfil the criteria of "type 3" (OF 1-4µm).

      We agree with the reviewer and therefore decided to remove this figure. We also considered the comments of the other two reviewers about complexity of the manuscript and changed the text of figure 5 to make it more approachable. This includes a simpler explanation for the expected amounts of flagella.

      Figure 7. In panel A, the absence of label at the NF distal end is not total, a purple line is still visible. Was any quantitation attempted (signal intensity, changes in length of labelled fragments over time?). Minimally, say how many cells were analysed for the numbers in panels D and E, and how many times this experiment was done.

      We agree with the reviewer that the decrease in the TMR signal in the NF of the cell in the original Fig. 7A (currently Fig. 8A) is gradual and not abrupt. Similarly to the Ty-1-tubulin experiments where the tagged protein becomes progressively more available (increasing intensity), the intensity of TMR-ligand becomes progressively less abundant (gradually decreasing intensity) as new (not TMR labelled) protein gets synthesized during the period of NF construction, progressively diluting the initially fully labeled population of RSP4/6. The slope of the gradient may differ between axonemal constituents, as it reflects the kinetics of protein synthesis, degradation, its incorporation into the axoneme, as well as the size of the soluble protein pool in the cytosol. We classify this type of signal as gradients, as opposed to the sharp decrease. At initial times after TMR-ligand washout (e.g. 4 hours in Fig. 8C), this long gradient is observed at the distal end of NFs and in some uniflagellated cells (NF-inheriting daughters). The distal ends of OFs in these experiments (if not fully labelled) display a sharp decrease, as do frequent uniflagellated cells, likely OF-inheriting daughters. The existence of these two different patterns demonstrates that two different mechanisms are responsible for incorporation of fresh RSP4/6 into the NF and OF axoneme, respectively. While incorporation into the NF is gradual, incorporation into the distal region of the OF is stepwise (restricted in time). Numbers of cells quantified for the table in Fig. 8 have been added. The NFs and OFs displaying the patterns of the gradient and sharp decrease, respectively, were observed in multiple experiments.

      Reviewer #3 (Significance (Required)):

      • General assessment: strengths and limitations

      Strengths: Trypanosoma brucei is a powerful model system in which to ask detailed questions about the assembly dynamics and hierarchy of microtubule-based cytoskeletal structures in general and cilia in particular. This elegant and well-designed study overcomes a previous technical limitation by allowing for the direct labelling of alpha tubulin, one of the main building blocks of the ciliary axoneme. The study sets out to test a specific hypothesis (grow-and-lock model) and provides evidence in support, leading to a refined model for cilia length regulation in trypanosomes.

      Limitations: With this system, visualisation of new tubulin incorporation requires de novo synthesis. There is a time lag between inducing expression of Ty-1-tubulin with tetracycline and being able to visualize the tagged proteins that needs to be taken into consideration. This time lag was estimated based on previous studies and the relatively quick appearance of Ty-1-tubulin on Western blots (within hours). This inevitably creates a situation where levels of tagged tubulin change rapidly, creating gradients of signal intensity (and variations in levels) that lead to some uncertainty in estimations of length of labelled microtubule fragments. Furhtermore, the epitope label is not compatible with live cell imaging, restricting analyses to fixed cells. The Ty-1-tubulin data is well ducmented; the RSP4/6 data appear to corroborate these findings but are less extensively documented.

      • Advance: The results succeed in integrating several recent findings from different research groups into a refined coherent model about cilia length regulation in trypanosomes. The tubulin tagging method could be gainfully transferred to other systems (although the state of the field in tubulin tagging in other systems is not clearly laid out in the paper).

      This paper could be of interest to a broad cell biology community interested in cilia and cytoskeletal dynamics.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This study seeks to investigate the mechanism by which the length of an eukaryotic cilium is set and maintained in a constant state. The flagellated protist Trypanosoma brucei serves as the study model and the authors take advantage of the genetic tools that allow precise modification and tagging of flagellar proteins and they build on prior knowledge about the well-characterised flagellar assembly cycle, which allows tracking the assembly of a new flagellum alongside an existing old one in the course of one cell cycle. The group of Bastin has previously reported a very interesting "Grow-and-Lock Model for the Control of Flagellum Length in Trypanosomes" and this current manuscript provides a test of this model, and a refinement. Key to this is an advance in technique, reported here, namely expression of an epitope tagged version of alpha tubulin. The epitope is inserted in an internal loop, which apparently for the first time provides a traceable tubulin that is reliably incorporated into the cytoskeleton (subpellicular array, spindle and cilium). Expressing an inducible version of this Ty-1-tubulin allows for a set of experiments that measure the place and timing of tubulin incorporation into cilia. The results are largely confirmatory of previous findings (incorporation exclusively into the new flagellum, at the distal end, linear growth rate that matches previous estimates). Examination of tubulin incorporation patterns then reveal additional information about the old flagellum: evidence from Ty-1-tubulin labelling, corroborated by incorporation patterns of another ciliary protein (RSP 4/6) suggest that the "lock" on the old flagellum is relieved for short periods after cell division, leading to a refined model presented in Figure 8.

      Major comments:

      This study provides an elegant test of the grow-and-lock model and the major conclusions are supported by the data. I have no major concerns.

      Minor comments:

      There are several minor points that could be addressed to make the manuscript easier to follow (and adding line numbers to the manuscript would help with reviewing).

      The introduction is quite long. Some of the well-established background information on the T. brucei cell cycle could be shortened. If the paper is intended for a broader audience, it would be valuable instead to cite studies that have succeeded in tagging tubulin and tracing its incorporation in other cilia. Could the Ty-1-tubulin approach be relevant more broadly or are simpler methods already established?

      On p.6 the rationale for endogenous tagging was to "reduce the risk of artifacts portentially due to untimely expression or unnatural protein levels". However most of the experiments were done with ectopically expressed inducible Ty-1-tubulin. For the experiments it is crucial to use an inducible system but the authors may wish to comment why the risk of artifacts was no longer a concern.

      On p.7 / Fig S2A-B there appears to be a mistake in the presentation. Spindles are mentioned in the text - I can't see any in the figure. Fig S2A and B both show cytoskeletons, but the text suggests only B is about cytoskeletons. None of the blot shows BB2 staining of different cell fractions, contrary to statements in the text. The letter codes in the panel (T, C, D) don't match the codes in the legend (T, P, S).

      Figure 1. The evidence for incorporation into spindles is not strong. The structure indicated by the arrive could be a spindle but it's not very clear. There is a great example of a labelled spindle only in figure S5A. Here, at the start, it would be good to show a panel of cells in successive cell cycle stages (best, whole cells and cytoskeletons) to clearly show the structures that are labelled with Ty-1-tubulin.

      On p.8 (end of first paragraph) there is reference to cell cycle analyses, but no data is shown. Also on p.8, please clarify what the evidence is that "a fraction of cells did not respond to tetracycline". The fact that they remain unstained by Ty-1-tubulin is not in itself evidence they did not respond to tetracycline.

      Figure S4A. The blot for the soluble fraction is not of great quality. I don't see how the conclusion was reached that the Ty-1-tubulin bands were faint.

      On p.11, it would be interesting to compare measured elongation rates with previously measured estimates for flagellum growth, comparing the growth rates, and relating them to cell cycle times in the corresponding experiments (which vary slightly between labs and studies).

      Figure S6. I find the presentation of this figure confusing. It should be revised with clearer labelling of "cell cycle 1", "cell cycle 2", and the precise meaning of "type 3" should be clarified. There are two instances of "type 1" in the drawing, but one of these seems to fulfil the criteria of "type 3" (OF 1-4µm).

      Figure 7. In panel A, the absence of label at the NF distal end is not total, a purple line is still visible. Was any quantitation attempted (signal intensity, changes in length of labelled fragments over time?). Minimally, say how many cells were analysed for the numbers in panels D and E, and how many times this experiment was done.

      Significance

      General assessment: strengths and limitations

      Strengths: Trypanosoma brucei is a powerful model system in which to ask detailed questions about the assembly dynamics and hierarchy of microtubule-based cytoskeletal structures in general and cilia in particular. This elegant and well-designed study overcomes a previous technical limitation by allowing for the direct labelling of alpha tubulin, one of the main building blocks of the ciliary axoneme. The study sets out to test a specific hypothesis (grow-and-lock model) and provides evidence in support, leading to a refined model for cilia length regulation in trypanosomes.

      Limitations: With this system, visualisation of new tubulin incorporation requires de novo synthesis. There is a time lag between inducing expression of Ty-1-tubulin with tetracycline and being able to visualize the tagged proteins that needs to be taken into consideration. This time lag was estimated based on previous studies and the relatively quick appearance of Ty-1-tubulin on Western blots (within hours). This inevitably creates a situation where levels of tagged tubulin change rapidly, creating gradients of signal intensity (and variations in levels) that lead to some uncertainty in estimations of length of labelled microtubule fragments. Furhtermore, the epitope label is not compatible with live cell imaging, restricting analyses to fixed cells. The Ty-1-tubulin data is well ducmented; the RSP4/6 data appear to corroborate these findings but are less extensively documented.

      Advance: The results succeed in integrating several recent findings from different research groups into a refined coherent model about cilia length regulation in trypanosomes. The tubulin tagging method could be gainfully transferred to other systems (although he state of the field in tubulin tagging in other systems is not clearly laid out in the paper).

      This paper could be of interest to a broad cell biology community interested in cilia and cytoskeletal dynamics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The length of the old flagellum of Trypanosome is constant during G1 phase as well as during cell cycle progression when the new flagellum is assembled. The authors have previously proposed a "Grow and Lock" model for the flagellar length control in which no flagellar building blocks are incorporated. To test this hypothesis, the authors used a tagging strategy for alpha-tubulin and tracking its incorporation. The authors showed that the new flagellum incorporates new tubulins, as is expected. For the mature flagellum, tubulins are incorporated at the flagellar tip and only when the cells start to assemble the new flagellum. Thus, it shows that old flagellum is stable but not completely locked for the incorporation of tubulins.

      Major comments:

      The study is methodologically rigorous, integrating fluorescence microscopy, biochemical approaches, and proteomic analyses to validate the functionality of the tagged tubulin. The use of both inducible expression and endogenous protein tagging (HaloTag) strengthens the conclusions. This study has supported the "Grow-and-Lock" model" that the authors previously proposed. In addition, they have revealed that the stability of the old flagellum is temporally controlled.

      The data showed that brief incorporation of tubulins at the tip of the old flagellum occurs when the cells start to form the new flagellum. I thought the assembly of the new flagellum occurs during the cell division. However, in the abstract, it says that "The restriction is lifted briefly after the bi-flagellated cell has divided." Is my understanding wrong?

      P12, "The cartoon in Fig. 5A illustrates the progression of the cells in scenario 2 (Fig. 4A) over the duration of one cell cycle (~9 hours)" I thought that one cell cycle should start with cell with only one flagellum, followed by assembly of a new flagellum during cell division, the cell then divides when the new flagellum is almost completely assembled. If my understanding is correct, perhaps the cartoon should be modified accordingly.

      Minor comments:

      1. Several references are not correctly formatted. P3: (Flavin and Slaughter, 1974) (Rosenbaum 1969). P10, (Sherwin et al., 1987)(Sheriff et al., 2014)
      2. In several places there are no space between the number and the unit. For eample, P3, 9 - 24µm/h. 7, 1μg/m; P8, 50kDa; P9, 1M; 8-9h; P11, 2.9µm/h and etc.
      3. P11, Flagella were extracted. I thought the cells were extracted.

      Significance

      Cilia and eukaryotic flagella are considered dynamic structures in which the flagellar components especially tubulins under constant turnovers even in steady state. This work demonstrates that in Trypanosome the stable old flagellum is temporally controlled for tubulin turnovers, suggesting a tight regulation of microtubule dynamics. Future elucidation of the regulatory mechanism will be more interesting. This work will be interesting to the field of cilia and microtubules. In addition, the new technique used for tracking tubulins will also be interesting.

      I am an expert on ciliary biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Daniel Abbühl on "A novel approach to tagging tubulin reveals MT assembly dynamics of the axoneme in Trypanosoma brucei" uses an innnovative approach to label tubulin, which allows the authors to unveil new mechanisms in flagellar length regulation.

      The manuscript is very nice and will be very interesting for the cell biology community and therefore should be accepted. In some parts it becames a bit complex with all the models and complex phrasing, I wonder whether the text could be simplified to be more appealing. I have a few minor comments:

      • From the model the authors show in Figure 8- there should be a way of pulsing the cells in G1 for a short amount of time -2 hours- and getting both flagella tips labelled. But the authors seem to require longer labelling to get that result. This should be better explained.
      • Why do some cells not express the construct? Weren´t they all selected?
      • "The linear regression line in Fig. 3C was corrected by subtracting 45 minutes from each timepoint due to the previously reported delay between addition of tetracycline and the expression of the respective protein". However, in the authors data the delay may amount to one hour (western analysis- S4). Shouldn´t they use their data.
      • Fig 3: To measure the timepoints of flagella growth, wouldn´t it be better to do it with NF that started to grow before induction, rather than starting to grow after induction, to be sure that the timing of incorporation is fully accounted for?
      • Although it is not the focus of the manuscript it would have been very interesting to use the CEP164C mutant to see whether it would change the dynamics of incorporation and fully test their model and discussion.
      • In some parts of the manuscript/supplemental material the authors say they insert the Ty-1- tag one aminoacid after the acetylated lysine- other parts they say two aminoacids after- this should be consistent.
      • Fig. S1: 'Binding epitope of the TAT-1 antibody is highlighted in red'. There is no highlighting in red in this figure?
      • Fig. S2: Western blots are not very clear. What is the 'X' present in the C (first lane)? Weight of markers should be shown also in S4.
      • Fig 5: 'C: Frequency of bi-flagellated cells grouped by the different types of' The authors didn't finish the sentence.
      • Fig. S7: The 'B' is missing in both picture and legend.

      Significance

      This study advances our knowledge of flagellar length regulation and maintenance. Moreover the tools designed in this work will be very useful for the cell biology community in general.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for providing valuable comments and suggestions for improving the manuscript.

      Response to reviewer comments:

      Reviewer-1

      Comment 1: Major concern is the study lacks rigor in several areas where n=2, results are not quantified with statistics. They need to run power analysis and increase their samples sizes. Please include statistics on all measurements. Filamentous actin staining and alpha-sma is used to visualize mechanosensing but also in other cell activities such as cell contractility for movement, cell to substrate adhesion, cell division, etc. They need to query more mechanosensing related pathways (Piezo1/2, Yap/taz-Hippo, integrin-Focal Adhesion Kinase, etc) to show that mechanosensing changed.

      Response: We have increased the sample size to a minimum of n=3 in most cases. However, a few experiments will require more time to increase sample size, as mentioned below.

      Our data emphasized the role of Rac1 and SRF. We understand that other molecular players may also be involved in sensing or responding to mechanical forces, but surveying multiple families of candidates without a specific hypothesis or functional experiment is beyond the scope of this study.

      __Comment 2: __Fig. 1: In panel E, the cranial bone area measurement is not normalized to mitigate the possible effect of individual differences.

      Response: We have re-quantified the data with normalization to the length of the skull.

      __Comment 3: __In Fig. 2 the authors mentioned many phenotypical changes (bone length changes, gap thickness change, apex thickness change, etc.) based on histology stain, none of them are quantified to show a significant difference between Rac1-WT and Rac1-KO.

      Response: In Fig. 2A, we present the gross morphology of the Rac1-KO embryos and only discuss the tissue defects like edema, hematoma, and hypoplasia, confirmed through H&E as shown in Fig. 2C. We also show the apical limits of the intact calvaria in Fig. 2D, consistent with the calvaria defects observed at birth. In fact, we do not discuss any “bone length changes, gap thickness, or apex thickness change” in this section as suggested by the reviewer. To address the request for more quantification we have added measurement of the edematous area of the apical mesenchyme at E14.5 (Fig. 2C), and this is now shown in Suppl. Fig. 1E. We also added quantification of embryo genotypes and Chi-square tests, now shown in Suppl. Fig. 1D.

      Comment 4: Fig. 2 In panel D, with only 2 embryos per group is not enough for quantitation

      Response: We plan to increase the number of embryos during the revision period.

      Comment 5: Fig. 2 In panel D, the two arrows in the Rac1-KO mutants are not easy to catch.

      Response: We made the arrows bigger and bolder.

      Comment 6: Fig. 3 The thickness quantification is not performed.

      Response: We added quantification in Fig. 3D.

      Comment 7: Fig. 3 The images show an obvious curve change of the apex between the control and mutant. Such change is not discussed in the results. Is it due to histology issue?

      Response: We do not think it is due to technical issues but reflects a real change in the shape of the apex of the head. We modified the graphical representation in Figure 3E to reflect this change in curvature. We also added the following sentence to the results on page 7: “We also noted a loss of curvature in the apex of the Rac1-KO head at E13.5, which correlated with loss of aSMA+ mesenchymal cells and thinning of the EMM (Fig. 3E).”

      __Comment 8: __The merged layer did not show S100a6. While the authors are showing apical expansion of the mesenchyme toward the dermis and meninges, it is hard to track where they are without a merged image.

      Response: We added merged images.

      Comment 9: Fig. 4 In panel B, 2 biological replicates per genotype are very low.

      __Response: __The effect of Rac1-KO on cell cycle is already known (Moore et al. 1997; Nikolova et al. 2007; Gahankari et al. 2021), and our result is supported by in vivo quantification of Tom+Edu+ cells in different regions of the embryonic head shown in Fig. 4A. We prefer not to repeat this assay.

      Comment 10: Fig. 4 There is no cell death data.

      Response: We will generate data on cell death during the revision period.

      __Comment 11: __Fig. 5 In panel B, the GAPDH western plot bands in the mutants seem to be thinner than those of controls.

      Response: We verified equal loading with a Ponceau stain, so this minor change in the GAPDH level could be due to biological differences in the protein level. Nevertheless, by our estimation this minor difference does not explain away the major difference in Rac1 and Srf levels.

      __Comment 12: __Though the immunostain showed a decrease in signal intensity, it is hard to know whether the decrease is significant enough across all Rac1-KO mutants. They need to measure the fluorescence intensity and perform statistics.

      Response: We will generate better images of SRF staining and quantify the difference between Rac1-WT and Rac1-KO during the revision period.

      Comment 13: Fig. 6: Similar as Fig. 2, there is no quantification and n=1 per genotype is not enough

      Response: During the revision period we will increase the number of E12.5 Srf-KO and Srf-WT embryos to n=3 for Figure 6G. All other panels currently have n=7 or greater.

      Comment 14: Fig. 7: Need quantification between Srf-KO and Rac1-KO with statistics to show they are not different, but both significantly different from WTs

      Response: In Figure 7D we have added quantification of aSMA area in Srf-KO and Rac1-KO. These results show that both mutants have a similar phenotype with reduced aSMA expression compared to their respective WT littermates, which supports the conclusion that they work in the same pathway. We do not agree with the reviewer that the two mutants should show no statistical difference, because Rac1 and Srf are different genes with overlapping but also non-overlapping functions. During the revision period we will add more Srf-KO embryos and repeat the statistical analysis.

      Comment 15: Supplement Fig.2: No image showing the time point before E11.5.

      Response: We will add an E10.5 time point during the revision period.

      Comment 16: Supplement Fig.3: The ventral view of Rac1-WT does not have the same angle as it shows in Rac1-KO. Makes harder to see the difference between control and mutant.

      Response: We adjusted the brightness/contrast to make the difference clearer.

      Comment 17: Supplement Fig.4 &7: The alkaline phosphatase stained area needs to be normalized to some other metric because the embryos could be different size.

      Response: We normalized to the width of the eye and is now represented in Suppl. Fig. 4 and 7.

      Comment 18: Supplement Fig 6 A: The legend and figure don't match. Is it E13.5 or 14.5. Panel 6B needs better images without curling of the tissue.

      Response: This has been fixed. The immunostaining images in Suppl. Fig. 6A is E14.5. Panel B is now replaced with better images in the revised manuscript.


      Reviewer-2

      __Comment 1.1: __In Fig. 5, links between Rac1, SRF, αSMA, and contractility in mesenchymal cells are shown. Molecular analyses (Western blot and qPCR) were performed using primary cultured mesenchymal cells (prepared after freed from the epidermal population). Although use of cells prepared from E18.5 embryos may have been chosen by the authors for the safe isolation of the mesenchymal population without contamination of epidermal cells, this reviewer finds that anti-SRF immunoreactivity is weaker at E13.5 than at E12.5 (throughout the section including the mesencephalic wall) and therefore wonder whether SRF expression changes in a stage-dependent manner. So, simply borrowing results obtained from E18.5-derived cells for describing the scenario around E12.5 and E13.5 is a little disappointing point found only here in this study.

      Response: In fact, the reason we chose E18.5 was to get enough cells to do the experiments in Figure 5A-D without extensive passaging and/or immortalization, which would undoubtedly cause the cells to deviate from their in vivo character as they become adapted to growing on plastic with 10% serum. Therefore, we prefer not to change the cells as suggested by the reviewer.

      __Comment 1.2: __In Fig. 5F, it is difficult to clearly see "reduction" of SRF immunoreactivity in Rac1-KO. Therefore, quantification of %SRF+/totalTomato+ would be desired.

      Response: __We will generate better images of SRF staining and quantify the difference between Rac1-WT and Rac1-KO during the __revision period.

      __Comment 1.3: __Separately, direct comparison of spontaneous centripetal shrinkage of the apical/dorsal scalp tissues, which will occur in 30 min when prepared at E12.5 or E13.5 (Tsujikawa et al., 2022), between WT and Rac1-KO would strengthen the results in Fig. 5D. As KO is specific to the mesenchyme, the authors do not have to worry about removal of the epidermal layer (which would be much more difficult at E12.5-13.5 than E18.5). If the degree of centripetal shrinkage of the "epidermis plus mesenchyme" layers were smaller in Rac1-KO, it would be interpreted to be mainly due to poorer recoiling activity and contractility of the Rac1-KO mesenchymal tissue.

      Response: __We will try to perform the centripetal shrinkage assays as shown by Tsujikawa et al., during the __revision period.

      Comment 2: The authors favor "apical" vs. "basolateral" to tell the relative positions in the embryonic head, not only in the adult head. But "apical" vs. "basolateral" should be accompanied with dorsal vs. ventral at least at the first appearance. Apical-to-basal axis or apex vs. basolateral by itself can provide, in many contexts, impressions that epithelial layers/cells are being discussed. Please note that the authors also use "caudal" (in the embryonic head). Usually, a universally defined anatomical axis perpendicular to the rostral-to-caudal axis is the dorsal-to-ventral axis.

      Response: Apologies for confusing terminology. The terminology is now defined uniformly according to the anatomical axis.

      Comment 3: One of the authors' statements in ABSTRACT "In control embryos, α-smooth muscle actin (αSMA) expression was spatially restricted to the apical mesenchyme, suggesting a mechanical interaction between the growing brain and the overlying mesenchyme" and a similar one in RESULTS "αSMA was not detected in the basolateral mesenchyme of either genotype from E12.5-E14.5 (Suppl. Fig. 4A), suggesting restriction of the mechanosensitive cell state to the apical mesenchyme" need to be at least partly revised, taking previous publication about the normal αSMA pattern in the embryonic head into account more carefully. Tsujikawa et al. (2022) described "Low-magnification observations showed superficial immunoreactivity for alpha smooth muscle actin (αSMA), which has been suggested to function in cells playing force-generating and/or constricting roles; this immunoreactivity was continuously strong throughout the dorsal (calvarial) side of the head but not ventrally toward the face, producing a staining pattern similar to a cap (Figure 2A)" . Therefore, in this new paper, descriptions like "we observed ...., consistent with ....(2022)" or "we confirmed .... (2022)" would be more accurate and appropriate regarding this specific point. Such a minor change does not reduce this study's overall novelty at all.

      Response: Thank you for the correction. We have replaced the terminology and cited the article (Tsujikawa et al., 2022) appropriately, crediting their finding.

      Comment 4: It would be very helpful if the authors provide a schematic illustration in which physiological and pathological scenarios (at the molecular, cellular, and tissue levels found or suggested by this study) are shown.

      Response: We have added a schematic representation of the molecular changes happening in the apical head development because of Rac1- and Srf-KO, and it is represented in Suppl. Fig. 7C.


      Comment 5: Despite being put in the title, "mechanosensing" by mesenchymal cells is not directly assessed in this study. If appropriate, something like "mechano-functioning" would be closer to what the authors demonstrated.

      __Response: __We changed the title to refer to “mechano-responsive mesenchyme”. We think this is appropriate because the cells of interest have reduced aSMA and reduced proliferation, both of which are known to occur, at least in part, as responses to mechanical inputs.

      Reviewer-3

      Comment 1: Prrx1-Cre targets calvarial mesenchyme and Suzuki et al., 2009 showed that Prrx1-Cre mediated loss of Rac1 lead to calvarial bone phenotype due to incomplete fusion of the skull. While this phenotype was not studied in detail, the statement in the intro and discussion that the calvarial phenotype has not been recapitulated in mice is incorrect.

      Response: Suzuki et al showed incomplete fusion of the skull. Although the skull is a tissue that is affected in AOS, it is not akin to the scalp and calvaria aplasia that typifies AOS. Our result stands apart from this. We clarified our position as such:

      Introduction (page 4): “Nevertheless, the calvaria phenotype seen in AOS individuals has not been explored in detail or fully recapitulated in mice.”

      Discussion (page 11): Previous studies have demonstrated the role of Rac1 in mesenchyme-derived tissues, but they did not recapitulate AOS phenotypes.”

      Comment 2: The authors show that Pdgfra-Cre induced knockout of Rac1 leads to lower-than-expected numbers of Rac1-cKO embryos at E18.5 and P1. Phenotypic analysis shows that the earliest phenotype is blebbing and hematoma in the nasal region at E11.5/12.5. It is stated that this was resolved at E18.5. It is unclear if this is truly a resolution of the phenotype or that these embryos fail to survive until E18.5. Do 100% of the Rac1-cKO embryos exhibit the blebbing/hematoma at E11.5/12.5? What is the observed number/percentage of Rac1-cKO embryos at E11.5/12.5? If the observed percentage of Rac1-cKO is similar to that at E18.5 (lower than the expected 25%), this would support resolution. If the observed ratio is as expected at E11.5/12.5, then this would support embryonic loss before E18.5 rather than phenotypic resolution.

      Response: Please note that 100% (n=12) of E12.5 Rac1-KO embryos displayed nasal and mild caudal edema as exhibited in Fig. 2A, but none (n=16) had blebbing/hematoma by E18.5. We added tables for the number of embryos recovered at E12.5 and E18.5 to Supplemental Figure 1. These results show that the percentage of mutants at E12.5 was 21.42%, not significantly different from the expected frequency (p = 0.5371). At E18.5, the percentage dropped slightly to 18.3%, but still not significantly different from expected (p = 0.1545). The significant change in frequency of blebbing/hematoma from E12.5 to E18.5, without any significant change in the frequency of mutants, supports phenotypic resolution of the early blebbing/hematoma.

      Comment 3: It is stated that brain shape is altered in Rac1-cKO embryos at E14.5 and E18.5 and concluded that these shape differences are secondary to the cranial defects. Pdgfra+ cells gives rise to the meninges and if the Pdgfra-Cre line recapitulates this expression, then loss of the ubiquitously expressed Rac1 in the meninges could lead to a primary defect in the brain, which may lead to secondary defects in the calvarium and scalp. Their conclusion should recognize other possibilities.

      Response: We agree it is possible that there are meninges defects that secondarily change the shape of the brain, and we added a mention of this possibility. It is highly unlikely that scalp defects are only secondary to brain changes because the first observable phenotypes are in the EMM that gives rise to the scalp.

      Comment 4: The TdTom staining in wholemount at E13.5 (Supplemental Figure 2B) is difficult to appreciate in the image shown.

      Response: At E11.5 there is good contrast between labeled cranial structures and non-labeled body. At E13.5, Tomato appears in most of the mesenchymal cells in the embryo, so there is not as much contrast. The lack of contrast at E13.5 may cause the reviewer think there is something wrong with the image.

      Comment 5: The idea that the EMM laminates into the meninges and scalp layers is not new and should be properly cited (Vu et al., 2021, Scientific Reports). The following paper should also be cited on the use of alpha-SMA (Acta2) as a marker of the anterior calvaria mesenchyme: Holms et al., 2020 Cell Reports.

      Response: Thank you. We are happy to add those citations.

      Comment 6: It is concluded that meningeal development is maintained in the cKO; however, this conclusion was based on a single marker (S100a6) that is both expressed in the presumptive meninges and dermis and greatly reduced overall in the cKO. This conclusion should be softened or other markers used to show that the meninges is indeed normal.

      Response: We softened the conclusion on the meninges in the revised manuscript, as this part of the phenotype is was not our focus but it would be a good thing to look at in the future.

      Comment 7: The overlap of S100a6 and alpha-SMA is difficult to appreciate in the images shown in Figure 3. Since this is important to the conclusion, co-staining should be done. If co-staining cannot be done due to the primary antibodies' origins, then ISH should be done.

      Response: We added merged images.

      Comment 8: It is concluded that reduced alpha-SMA suggests an early failure of Rac-cKO cells to respond to the mechanical environment. While this is one possibility, the reduction of alpha-SMA may simply be due to a reduction of these cells resulting from failed differentiation, decreased proliferation, or increased apoptosis.

      Response: We think the fact that aSMA is downregulated in cultured cells strongly argues against it being a trivial consequence of reduce proliferation etc. Nevertheless, we softened our conclusion to allow for some of these things to also contribute to the reduced aSMA expression. We will check apoptosis during the revision period.

      Comment 9: The conclusion that alpha-SMA is a transient population only present in apical cranial mesenchyme between E12.5-14.5 is not consistent with prior studies: Holms et al., 2020 Cell Reports; Holms et al., 2021 Nature Communications; Farmer et al., 2021 Nature Communications; Takeshita et al., 2016 JBMR.

      Response: There is no contradiction. Our statements are based on antibody staining where it is very evident that a-SMA-expressing cells are detectable throughout the apical mesenchyme between E12.5 and E14.5. But at E18.5 we do not see this kind of broad aSMA expression the apical head, suggesting a transient and spatially restricted population of cells in the apical mesenchyme. This is consistent with the studies from Tsujikawa et al., 2022 and Angelozzi et al., 2022. The papers mentioned by the reviewer are only focused on the suture mesenchyme. They do not claim there is broad aSMA/Acta2 expression in the apical head, but only in a spatially restricted subpopulation of suture mesenchymal cells.

      Comment 10: In the SRF immunostaining results in control and Rac1-cKO embryos, it is difficult to appreciate the nuclear localization at E12.5 in Figure 5E, as the DAPI is over saturated, and the image quality is poor. The image quality is also poor in Figure 5F.

      Response: We will generate better images of SRF staining and quantify the difference between Rac1-WT and Rac1-KO during the revision period.

      Comment 11: To what extent is the expression/localization of MRTF, the transcriptional co-activator of SRF, altered in the calvarial mesenchyme of Rac1-cKO embryos? Changes in MRTF would strengthen the link between Rac1 and SRF.

      Response: We do not know how MRTF expression/localization changes in the embryo tissue, but western blot data on Rac1-KO fibroblasts revealed a reduction in expression/nuclear localization of MRTF-A/B that mirrored the changes in SRF. We added these blots to Figure 5A. However, as noted at the end of the discussion, MRTF is not always required for SRF function in vivo ( Dinsmore, Elife 2022). The MRTFA/B-KO is a possibility for future work.

      Comment 12: Hypoplasia of the apical mesenchyme (Figure 6G, inset 1) in Srf-cKO is difficult to see.

      Response: During the revision period we will increase the number of E12.5 Srf-KO and Srf-WT embryos to n=3 for Figure 6G and replace the picture with a better one.

      Comment 13: Generally, the organization of the data into many main and supplemental Figures makes the flow difficult to follow.

      __Response____: __We understand the concern, but we have tried our best to organize the most important data into main figures and the relevant but less essential data into supplemental figures.

      Comment 14: SFR interacts with Pdgfra interacts genetically with Srf in neural crest cells in craniofacial development, with Srf being a target of PDGFRa signaling (Vasudevan and Soriano, 2015, Dev Cell). Since the Pdgfra-Cre line used here is hemizygous, is important that the control used to look at SRF expression in the Rac1-cKO is Pdgfra-Cre+.

      Response: It is standard practice to include some Cre+ mice in the control set to reveal whether Cre has toxic effects in the cells of interest. To the reviewer’s concern about genetic interactions between the Pdgfra gene and Srf, this should not be relevant here because the Pdgfra-Cre used in our study is a transgene and does not affect the endogenous Pdgfra gene.

      Comment 15: The text size in all figures is too small and varies throughout, making it difficult to read.

      Response: To fit the panel in the Word document, the figure is resized. This should not be an issue in the final manuscript.

      Comment 16: Details about the pulse-chase timing of the EdU experiments should be included in the results. Also, does n = 3 for each stage and each genotype? I would be helpful to include a representative section for a control and cKO littermate pair.

      Response: The details are now included in the methods section. Yes, n=3 in each stage and genotype (Fig. 4A). The representative images are also included.

      Comment 17: The relative sizing of the panels within and between figures is haphazard. Some are very large and others very small (Figure 2, 6, Supplemental Figure 1, 2, 6, 7).

      Response: The image panels are fixed in the revised manuscript.

      Comment 18: In Figure 5A and F, the titles "E12.5" and "E13.5" are in italics.

      Response: The fonts for the figures are fixed in the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: This manuscript by Rathnakar et al. examines the role of the small GTPase Rac1 in apical closure of the scalp and skull. Rac1 activity is regulated the guanine nucleotide exchange factor DOCK6 and the GTPase AHGAP31. Loss of function variants in DOCK6 and gain of function variants in AHGAP31 lead to sustained inactivation of Rac1 in Adams-Oliver syndrome (AOS), which is characterized by aplasia cutis congenita, underlying calvarial defects, and limb abnormalities. While Rac1 is thought to be a key in the pathogenesis of AOS, how decreased in Rac1 activity impact development of the head is not well-understood. The authors find that conditional loss of Rac1 in cranial mesenchyme (using Pdgfra-Cre), leads to AOS-like abnormalities in the scalp and skull. They go on to show that these abnormalities are linked to reduced alpha-SMA expression in the early migrating mesenchyme (EMM), decreased osteoprogenitor cells in the supraorbital mesenchyme (SOM), decreased proliferation, and the contractile function of fibroblasts. They also find that Rac1 cKO leads to reduced expression of the mechanosensitive transcription factor SRF. Finally, they show that loss of SRF in cranial mesenchyme (using Pdgfra-Cre) leads to an AOS-like scalp and skull phenotype that has mechanistic overlap with their findings in the Rac1 cKO.

      Major:

      1. Prrx1-Cre targets calvarial mesenchyme and Suzuki et al., 2009 showed that Prrx1-Cre mediated loss of Rac1 lead to calvarial bone phenotype due to incomplete fusion of the skull. While this phenotype was not studied in detail, the statement in the intro and discussion that the calvarial phenotype has not been recapitulated in mice is incorrect.
      2. The authors show that Pdgfra-Cre induced knockout of Rac1 leads to lower-than-expected numbers of Rac1-cKO embryos at E18.5 and P1. Phenotypic analysis shows that the earliest phenotype is blebbing and hematoma in the nasal region at E11.5/12.5. It is stated that this was resolved at E18.5. It is unclear if this is truly a resolution of the phenotype or that these embryos fail to survive until E18.5. Do 100% of the Rac1-cKO embryos exhibit the blebbing/hematoma at E11.5/12.5? What is the observed number/percentage of Rac1-cKO embryos at E11.5/12.5? If the observed percentage of Rac1-cKO is similar to that at E18.5 (lower than the expected 25%), this would support resolution. If the observed ratio is as expected at E11.5/12.5, then this would support embryonic loss before E18.5 rather than phenotypic resolution.
      3. It is stated that brain shape is altered in Rac1-cKO embryos at E14.5 and E18.5 and concluded that these shape differences are secondary to the cranial defects. Pdgfra+ cells gives rise to the meninges and if the Pdgfra-Cre line recapitulates this expression, then loss of the ubiquitously expressed Rac1 in the meninges could lead to a primary defect in the brain, which may lead to secondary defects in the calvarium and scalp. Their conclusion should recognize other possibilities.
      4. The TdTom staining in wholemount at E13.5 (Supplemental Figure 2B) is difficult to appreciate in the image shown.
      5. The idea that the EMM laminates into the meninges and scalp layers is not new and should be properly cited (Vu et al., 2021, Scientific Reports). The following paper should also be cited on the use of alpha-SMA (Acta2) as a marker of the anterior calvaria mesenchyme: Holms et al., 2020 Cell Reports.
      6. It is concluded that meningeal development is maintained in the cKO; however, this conclusion was based on a single marker (S100a6) that is both expressed in the presumptive meninges and dermis and greatly reduced overall in the cKO. This conclusion should be softened or other markers used to show that the meninges is indeed normal.
      7. The overlap of S100a6 and alpha-SMA is difficult to appreciate in the images shown in Figure 3. Since this is important to the conclusion, co-staining should be done. If co-staining cannot be done due to the primary antibodies' origins, then ISH should be done.
      8. It is concluded that reduced alpha-SMA suggests an early failure of Rac-cKO cells to respond to the mechanical environment. While this is one possibility, the reduction of alpha-SMA may simply be due to a reduction of these cells resulting from failed differentiation, decreased proliferation, or increased apoptosis.
      9. The conclusion that alpha-SMA is a transient population only present in apical cranial mesenchyme between E12.5-14.5 is not consistent with prior studies: Holms et al., 2020 Cell Reports; Holms et al., 2021 Nature Communications; Farmer et al., 2021 Nature Communications; Takeshita et al., 2016 JBMR.
      10. In the SRF immunostaining results in control and Rac1-cKO embryos, it is difficult to appreciate the nuclear localization at E12.5 in Figure 5E, as the DAPI is over saturated, and the image quality is poor. The image quality is also poor in Figure 5F.
      11. To what extent is the expression/localization of MRTF, the transcriptional co-activator of SRF, altered in the calvarial mesenchyme of Rac1-cKO embryos? Changes in MRTF would strengthen the link between Rac1 and SRF.
      12. Hypoplasia of the apical mesenchyme (Figure 6G, inset 1) in Srf-cKO is difficult to see.
      13. Generally, the organization of the data into many main and supplemental Figures makes the flow difficult to follow.
      14. SFR interacts with Pdgfra interacts genetically with Srf in neural crest cells in craniofacial development, with Srf being a target of PDGFRa signaling (Vasudevan and Soriano, 2015, Dev Cell). Since the Pdgfra-Cre line used here is hemizygous, is important that the control used to look at SRF expression in the Rac1-cKO is Pdgfra-Cre+.

      Minor:

      1. The text size in all figures is too small and varies throughout, making it difficult to read.
      2. Details about the pulse-chase timing of the EdU experiments should be included in the results. Also, does n = 3 for each stage and each genotype? I would be helpful to include a representative section for a control and cKO littermate pair.
      3. The relative sizing of the panels within and between figures is haphazard. Some are very large and others very small (Figure 2, 6, Supplemental Figure 1, 2, 6, 7).
      4. In Figure 5A and F, the titles "E12.5" and "E13.5" are in italics.

      Significance

      Overall, this is an interesting study that shares mechanistic insight into the scalp and skull deformities in AOS. The overall presentation of the work, particularly the figures, should be improved and streamlined to enhance clarity and better emphasize the novelty of the study. In addition, the conclusions are not always well-supported by the results and the interpretation of the results do not fully consider and cite previous studies.

      Audience: Developmental Biologists

      Expertise: Craniofacial development and disease

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In mice lacking Rac1 in the PDGFRa+ mesenchymal cell lineage, the authors found Adams-Oliver syndrome (AOS)-like defects of the apical/dorsal scalp and calvaria, which was accompanied by the secondary brain protrusion by E18.5. The primary phenotype emerged at E11.5 and worsened from E12.5 to E14.5 in the apical/dorsal region of the embryonic head, with limited lateral expansion as well as reduced thickening/stratification of the mesenchymal layer expressing α-smooth muscle actin (αSMA). Very similar in vivo abnormalities were obtained when serum response factor (SRF), known as a mechanotransducing factor, was removed in PDGFRα+ mesenchymal cells. Rac1-lacking mesenchymal cells proliferated poorly in vivo and contracted weakly in culture, with reduced expression of SRF and αSMA. Based on these results and previously obtained understanding that the developing apical/dorsal mesenchyme is mechanically stretched by the underlying brain, the authors conclude that the mechanosensing-triggered morphogenetic behaviors of the apical/dorsal mesenchymal cells (i.e., proliferation, stratification, and contraction, which all lead to physical stability or mechanical resilience of that layer) is mediated by Rac1 and SRF. The authors also suggest that this molecular mechanism for the physiological maturation of the apical/dorsal mesenchyme may underlie the ventral-to-dorsal progression of osteogenesis, absence of which explains AOS pathogenesis.

      Major comments:

      In Fig. 5, links between Rac1, SRF, αSMA, and contractility in mesenchymal cells are shown. Molecular analyses (Western blot and qPCR) were performed using primary cultured mesenchymal cells (prepared after freed from the epidermal population). Although use of cells prepared from E18.5 embryos may have been chosen by the authors for the safe isolation of the mesenchymal population without contamination of epidermal cells, this reviewer finds that anti-SRF immunoreactivity is weaker at E13.5 than at E12.5 (throughout the section including the mesencephalic wall) and therefore wonder whether SRF expression changes in a stage-dependent manner. So, simply borrowing results obtained from E18.5-derived cells for describing the scenario around E12.5 and E13.5 is a little disappointing point found only here in this study. In Fig. 5F, it is difficult to clearly see "reduction" of SRF immunoreactivity in Rac1-KO. Therefore, quantification of %SRF+/totalTomato+ would be desired. Separately, direct comparison of spontaneous centripetal shrinkage of the apical/dorsal scalp tissues, which will occur in 30 min when prepared at E12.5 or E13.5 (Tsujikawa et al., 2022), between WT and Rac1-KO would strengthen the results in Fig. 5D. As KO is specific to the mesenchyme, the authors do not have to worry about removal of the epidermal layer (which would be much more difficult at E12.5-13.5 than E18.5). If the degree of centripetal shrinkage of the "epidermis plus mesenchyme" layers were smaller in Rac1-KO, it would be interpreted to be mainly due to poorer recoiling activity and contractility of the Rac1-KO mesenchymal tissue.

      Minor comments:

      1. The authors favor "apical" vs. "basolateral" to tell the relative positions in the embryonic head, not only in the adult head. But "apical" vs. "basolateral" should be accompanied with dorsal vs. ventral at least at the first appearance. Apical-to-basal axis or apex vs. basolateral by itself can provide, in many contexts, impressions that epithelial layers/cells are being discussed. Please note that the authors also use "caudal" (in the embryonic head). Usually, a universally defined anatomical axis perpendicular to the rostral-to-caudal axis is the dorsal-to-ventral axis.
      2. One of the authors' statements in ABSTRACT "In control embryos, α-smooth muscle actin (αSMA) expression was spatially restricted to the apical mesenchyme, suggesting a mechanical interaction between the growing brain and the overlying mesenchyme" and a similar one in RESULTS "αSMA was not detected in the basolateral mesenchyme of either genotype from E12.5-E14.5 (Suppl. Fig. 4A), suggesting restriction of the mechanosensitive cell state to the apical mesenchyme" need to be at least partly revised, taking previous publication about the normal αSMA pattern in the embryonic head into account more carefully. Tsujikawa et al. (2022) described "Low-magnification observations showed superficial immunoreactivity for alpha smooth muscle actin (αSMA), which has been suggested to function in cells playing force-generating and/or constricting roles; this immunoreactivity was continuously strong throughout the dorsal (calvarial) side of the head but not ventrally toward the face, producing a staining pattern similar to a cap (Figure 2A)" . Therefore, in this new paper, descriptions like "we observed ...., consistent with ....(2022)" or "we confirmed .... (2022)" would be more accurate and appropriate regarding this specific point. Such a minor change does not reduce this study's overall novelty at all.
      3. It would be very helpful if the authors provide a schematic illustration in which physiological and pathological scenarios (at the molecular, cellular, and tissue levels found or suggested by this study) are shown.
      4. Despite being put in the title, "mechanosensing" by mesenchymal cells is not directly assessed in this study. If appropriate, something like "mechano-functioning" would be closer to what the authors demonstrated.

      Significance

      This study advances understanding of a key aspect of the molecular mechanisms underlying the normal mammalian craniofacial development, unveiling the role of Rac1 and SRF in the apical/dorsal mesenchymal layer which has inter-tissue mechanical relationships with the embryonic brain underneath. This study also advances understanding of Adams-Oliver Syndrome pathogenesis, demonstrating the biological significance of the normal inter-tissue mechanical relationships in the developing mammalian head. This study may have opened a door for the genetic/molecular dissection toward the tissue-level mechano-engineering, which would stimulate development of next-generation organoids or assembloids. Broad audience including developmental biologists/neuroscientists, molecular/cellular biologists, pathologists, clinical geneticists, and pediatricians would be interested in this work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper "Mouse scalp development requires Rac1 and SRF for the maintenance of mechanosensing mesenchyme", the authors demonstrated that deletion of Rac1 (Rac1-KO) with a PDGFRαCreTG mouse model led to absence of skull apex and a blebbing formation while the limbs were not impacted. Rac1-KO mice showed the Rac1 regulated expansion of the apical mesenchyme toward the very apex meningeal and dermis layer and the osteogenic differentiation of supra orbital arch mesenchyme. Rac1 also regulates the proliferation of apical mesenchyme, dermis differentiation, and mechanosensing of the cranial mesenchyme cells. The authors also indicated Rac1 was a regulator of Srf by showing the deletion of Rac1 lead to lower Srf mRNA level and SRF protein expression. Deletion of Srf showed similar phenotypes as Rac1-KO mice.

      Major concern is the study lacks rigor in several areas where n=2, results are not quantified with statistics. They need to run power analysis and increase their samples sizes. Please include statistics on all measurements. Filamentous actin staining and alpha-sma is used to visualize mechanosensing but also in other cell activities such as cell contractility for movement, cell to substrate adhesion, cell division, etc. They need to query more mechanosensing related pathways (Piezo1/2, Yap/taz-Hippo, integrin-Focal Adhesion Kinase, etc) to show that mechanosensing changed.

      Comments by figure.

      Fig. 1: In panel E, the cranial bone area measurement is not normalized to mitigate possible effect of individual differences.

      Fig. 2:

      1. While the authors mentioned many phenotypical changes(bone length changes, gap thickness change, apex thickness change, etc) based on histology stain, none of them are quantified to show a siginificant difference between Rac1-WT and Rac1-KO.
      2. In panel D, with only 2 embryos per group is not enough for quantitation.
      3. In panel D, the two arrows in the Rac1-KO mutants are not easy to catch.

      Fig. 3:

      1. The thickness quantification is not performed.
      2. The images show an obvious curve change of the apex between the control and mutant. Such change is not discussed in the results. Is it due to histology issue?
      3. The merged layer did not show S100a6. While the authors are showing apical expansion of the mesenchyme toward the dermis and meninges, it is hard to track where they are without a merged image.

      Fig.4:

      1. In panel B, 2 biological replicates per genotype are very low
      2. There is no cell death data.

      Fig. 5:

      1. In panel B, the GPDH western plot bands in the mutants seem to be thinner than those of controls.
      2. Though the immunostain showed a decrease in signal intensity, it is hard to know whether the decrease is significant enough across all Rac1-KO mutants. They need to measure the fluorescence intensity and perform statistics.

      Fig. 6: Similar as Fig. 2, there is no quantification and n=1 per genotype is not enough.

      Fig. 7: Need quantification between Srf-KO and Rac1-KO with statistics to show they are not different but both significantly different with WTs.

      Supplement Fig.2: No image showing the time point before E11.5.

      Supplement Fig.3: The ventral view of Rac1-WT does not have the same angle as it shows in Rac1-KO. Makes harder to see the difference between control and mutant.

      Supplement Fig.4 &7: The alkaline phosphatase stained area needs to be normalized to some other metric because the embryos could be different size.

      Supplement Fig 6 A: The legend and figure don't match. Is it E13.5 or 14.5. Panel 6B needs better images without curling of the tissue.

      Significance

      Please see my comments above. This work is broadly of interest to developmental biologist, fracture healing, and human genetics fields.

      The paper is easy to understand and follow. The massive amount of histology and immunostaining images make it easy to identify the point the authors want to show. All the figures are well-labeled and visually informative. The experiment sequence is logic. The gene deletion models provide solid and direct evidence on the necessity of their function during early head development. The discussion is thoughtfully written and clear. The authors discuss the connection of Rac1 and SRF with other signaling pathways, which makes them promising target toward Adams-Oliver syndrome.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their comments

      __Reviewer 1 __

      This is review of the manuscript „A simple method to efficiently generate structural variation in plants" by Bechen et al. The manuscript presents a very interesting and innovative approach to generate structural variant mutations (including large ones) in the genome of Arabidopsis thaliana using a simple chemical treatment with TOPII inhibitor etoposide. Authors show that unlike chemical mutagens commonly used for induction of SNPs (EMS, sodium azide...), etoposide-treatment caused structural variants like DNA deletions, insertions, inversions and translocations. These mutations were identified by the whole genome short and long read sequencing that also indicated a WT-like frequency of SNPs. This finding can potentially help inducing mutations similar to high energy radiation in potentially any plant. First, the manuscript provides description of the unusual phenotypes found after etoposide treatment and their Mendelistic inheritance. Based on this, authors performed whole genome sequencing and mutation detection, validation. The experimental part ends by transcriptome analysis that authors use as the approach to identify the causal mutations. This part is, in my opinion, the weakest part of the manuscript and would benefit from further clarification or even additional experiments (see below). Overall the manuscript is very clear and contains all necessary information. The only part that was confusing to me, was the section focusing on the transcriptome analysis.

      __Response: __Thank you for your appreciation of the study. As detailed below, we have changed our presentation of the RNA-seq results to better describe their purpose.

      Major points: Line 222: In the section „RNA-Seq identifies genes that are associated with structural variation and mutant phenotypes", authors suggest that the changes in the transcript amount were used to identify causal mutations. I got confused by this section. Exach of the examples represents unique situation and thus only single cases are presented which makes it hard to estimate robustness of the presented approach. Also, the presented mutations have prominent phenotypes that were already heavily studied in the past and therefore the possible causal genes are mostly known. Therefore, I am not sure how this approach would stand in case of traits with unknown underlying genes.

      __R____esponse: __Our intent was not to present a new method for mapping causative mutations. Like any other induced genetic mutation, there are many possible strategies for identifying the causative locus (loci), such as mapping-by-sequencing via a segregating F2 population (as mentioned below). Organization of the manuscript’s results has been changed to reflect that mRNA-seq was conducted to learn more about the phenotypes, not to definitively identify causal genes/mutations. We have also added additional text to the discussion to clarify candidate mutation mapping approaches on line 376: “How do we identify genetic changes that are causative for phenotypes of interest? Future studies will accelerate candidate-gene discovery by employing structural-variant callers, de novo genome assembly-based approaches, and RNA-Seq based mapping (Mahmoud et al 2019). Although our study did not aim to use RNA-Seq to identify mutations, it provides an example of how RNA-Seq data in tandem with genome sequencing can help shortlist potential causal mutations. In cases where a potential causative variant is not obvious, these strategies can be combined with traditional genetic mapping approaches. However, mapping-by-sequencing approaches might not be easily applicable to some classes of mutants. For example, inversions or translocations can suppress recombination and reduce the efficacy of mapping-by-sequencing.”

      When refering to the case with the chromosomal inversion, I do not see how one will be able to map a candidate based on the relatively mild expression (but maybe I am missing something here). Similarly, the „mapping" approach applied to the variegated line would not be possible on a trait that is less studied and the candidates are not well known. I wonder why authors did not perform association mapping on a bulk of phenotypically mutant plants collected from a segregating F2 backcross population. This might be a more robust way of linking the phenotype with a mutation.

      Response: Our primary goal with the manuscript was to demonstrate that etoposide-treatment induces mutant phenotypes and structural variation. Identifying the causal mutation for every phenotype is outside the scope of the present study. As described above, we have added additional text in the discussion to briefly describe candidate mutation mapping approaches that researchers can use.

      Discussion section: I am missing discussion on how etoposide could be causing such structural variants.

      Response: Etoposide’s mode of action is well-studied in animal systems and has been described in text that has been moved from results to introduction. Starting on line 90 it reads “Topo II relaxes torsional stress from DNA supercoiling generated during DNA replication or transcription by transiently breaking both strands and then ligating them after passing a DNA segment through the break. Between strand breakage and ligation, Topo II is covalently linked to DNA via a tyrosine residue, forming a topoisomerase cleavage complex [37]. This complex is stabilized by the inhibitor etoposide. A collision between covalently-linked Topo II and DNA polymerases during DNA replication, or with RNA polymerases during transcription, leads to removal of the Topo II enzyme, which results in the generation of double-stranded breaks (DSBs) [38–41]. The imprecise repair of DSBs leads to genomic rearrangements and structural variation in mouse spermatocytes, fibroblasts, and in human cells [42–44]. Previously, it was shown that treatment with etoposide inhibits plant growth[45,46] and causes fragmentation of chromosomes during meiosis in Arabidopsis [45]. However, its potential as a mutagen that can induce structural variation has not been investigated"

      Minor points:

      Line 70: Possibly add sodium azide. It is frequently used as mutagen for some plant species.

      __Response: __Added sodium azide to line 76.

      Line 122: „...etoposide is an excellent mutagen for efficiently creating large-effect mutations." This cannot be claimed at this point because the sequence analysis data were not shown yet. Please reformulate.

      Response: We changed this line (now line 133) to: “The large proportion of plants showing visible phenotypes suggested that etoposide could be an excellent mutagen for efficiently creating large-effect mutations.”

      **Referees cross-commenting** My main issue was the mapping protocol using transcriptomic changes. It is hard to believe that this approach would work well on unknown/less studied traits. What is your opinion?

      Response: Identifying the causal mutation for every phenotype is outside the scope of the present study. Organization of the manuscript’s results has been changed to reflect that RNA-seq was conducted to learn more about the phenotypes, not to definitively identify causal genes/mutations. In some instances (BR-like dwarf), the RNA-seq data, combined with prior knowledge, suggested a causal variant (AS1), which was further bolstered by the identification of structural variants. Note that it is only for the variegated mutant that we definitively identified the causal mutation (IM) by genetic complementation.

      Reviewer #1 (Significance (Required)):

      Strengths - innovative way on how to induce structural variant mutations in plants.

      Limitations - The approach on how to map the mutations needs more development. At this point i tis not clear how well the approach will work in other plant species.

      Audience - basic and applied plant scientists

      Response: We have adjusted our discussion of the role of mRNA-seq in the study, and added comments on approaches for mapping causative mutations. We hope this now clarifies the overall strategy. Topoisomerase’s sensitivity to etoposide inhibition is conserved amongst tested plants and animal species. We have changed the sentence and added references to introduction to show that etoposide acts on other plant species (line 100): “Previously, it was shown that treatment with etoposide impacts genome stability and inhibits plant growth in Arabidopsis thaliana, Allium cepa, and Lathyrus sativus [46-49] and causes fragmentation of chromosomes during meiosis in Arabidopsis [48]. “ In addition, preliminary (unpublished) work in our labs shows that etoposide has mutagenic impacts on legumes and Brassicaceae crop species. We therefore believe that this protocol should be widely applicable to other plant species.

      ===

      Reviewer 2

      • The manuscript describes mutagenesis of Arabidopsis by a topoisomerase II inhibitor. The method is effective, resulting in good density of SV and no detectable SNV. The authors provide a full characterization of the mutants, their phenotypes, and their genomes. The 34-sample selected for genomic analysis is sufficient to make firm conclusions.

      • The manuscript is clearly written and illustrated.

      • The manuscript does a very good job at covering the phenotypic and molecular analysis for this type of mutagenesis. For example, they highlight the difference between short and long reads in the identification of SV.

      • The figures are very clear, with the exception of Fig. 3, which I found harder to follow. It would be enhanced by describing the candidate lesion(s) in the first panel of each mutant series. This would clarify the expectation. For example, larger indels (not examined here) should be associated with higher (insertion) or lower (deletion) expression of the affected genes. In the cases presented in Fig.3, the structural changes do not suggest obvious hypotheses. The authors examine the regions near breakpoint of inversions or near small indels. It makes sense, but it does not make the figure very digestible. The connected text in the results, on the other hand, is very clear. Perhaps, making the conclusions in the figure legend as well? As a connected thought, it would have been useful to provide expression data for a large indel exemplifying the cis/trans nature of regulatory changes.

      __Response: __Thank you for your helpful comments. The RNA-seq data is now presented before the DNA sequencing data to clarify the role of the RNA-seq data in this study. The previous Figure 3 is now Figure 2. We have clarified the presentation by combining original Fig. 3C and 2E into a single panel (Fig. 4F); moving 3J to Figure 4E; removing what was 3I; and moving the original 3B, 3E-G to Figure S9. In Figure S9, a label for the type of SV was added to the scatterplots of expression of genes surrounding the SVs. For further clarity, the panel describing the inversion in BR-like dwarf is also now plotted in the same way as those of short-internode dwarf. We agree it would be informative to provide expression data for a large indel to determine the extent of cis or trans effects, but we did not find any large indels in the samples we sequenced with long-reads.

      We agree with the reviewer that some of the mutants we have generated would be great material for further studying cis/trans nature of regulatory changes. We are strongly interested in this question; it is certainly a subject for a future publication.

      • Deletions and other rearrangements may affect meiosis as noted by the authors. In addition, they can display gametophytic phenotypes and a deficit in transmission. The likelihood increases with the size of the indel. Large indels are not transmitted. Accordingly, for indels above a certain size, it is not possible to determine the number of causal loci from F2 ratios.

      __Response: __We agree that the impact of very large SVs on meiosis alters segregation ratios and prevents determination of causal loci from F2 ratios. Impacts on meiosis will also likely impact our ability to use techniques based on bulk segregant analysis to finely map causal mutations. It is important to note that such mutations comprise only a fraction of all detected SVs.

      Identification of multiple loci causing a phenotype in plants carrying large SVs will therefore require other approaches. For example, structural variation callers can be used to identify the boundaries of SVs like duplications or deletions. RNA-Seq can be used to identify genes at the SV whose expression is highly effected by cis-regulatory changes. To test if those SVs are responsible for the phenotype, genes at SV boundaries along with the novel promoters can then be reintroduced as transgenes. This should enable one to identify combinations of mutated genes that are responsible for the phenotypes.

      • Although the use of topo II inhibitors for mutagenesis in plants is novel, the mutagenic effects described here are well documented in animals. This should be acknowledged (e.g. Heisig, Mutagen. 2009; Ferguson, Env Mol Mutagen. 1994)

      __Response: __We have acknowledged prior work demonstrating the mutagenic impact of etoposide using primary literature as well as more recent references. These include references # 43-45.

      **Referees cross-commenting** I also found the expression analysis confusing and in need of revision. One reason is that a set of clear expectations were not provided. I believe that the RNAseq analysis is expected to help identify the gene(s) that underlie a trait. For example, genes located on an indel are likely to display expression proportional to copy number. Also, a new junction or translocation could influence expression of the gene next to the break point. The authors should make this clear in the figure and the text.

      Response: Organization of manuscript’s results has been changed to reflect that mRNA-seq was conducted to learn more about the phenotypes, not to definitively identify causal genes/mutations.

      Reviewer #2 (Significance (Required)):

      • I appreciated the description of the method. It should be widely applicable. In arabidopsis, it requires sustained growth in the presence of the inhibitor. This could limit its applicability. For example, it may not be effective with pollen because exposure by a short soaking period may not be sufficient. Culturing of large seeded species is possible, but adds complexity. In this context, radiations have advantages. I do agree with the authors on the difficulty in identifying a source. However, once one is found, radiation treatment is very simple and convenient.

      • The manuscript describes a useful tool and the connected spectrum of mutations. It has the novelty, quality, and relevance to represent a significant contribution to plant biology and to be of broad interest.

      __Response: __Thank you for your feedback. For Arabidopsis, we germinated and grew seeds on media containing etoposide for about two weeks (see Methods). In work that is not yet ready for publication, we have taken the same approach with a legume species and other Brassicaceae that have substantially larger seeds. We find that we need to use a higher dose of etoposide to induce phenotypes, but that it is easy to germinate and grow large-seeded plants for a couple of weeks on media containing etoposide. We don’t anticipate that seed size will be limiting for this method.

      We agree that this technique will not work for pollen; it will only work for tissues with significant levels of DNA replication. However, this technique alleviates the need for collecting pollen. For example, pollen irradiation has been used to create poplar with structural variants. However, if using etoposide-based mutagenesis, one could grow poplar seeds, cuttings, explants, embryos, or calli on etoposide-containing media.

      Reviewer 3


      Summary:

      The manuscript entitled "A simple method to efficiently generate structural variation in plants" by Bechen et al. investigated an efficient mutagen for inducing large structural variations in plants, replacing traditional irradiation methods with a chemical mutagenesis strategy. The study examined the effects of etoposide, a DNA topoisomerase II inhibitor, on structural variations and demonstrated that etoposide treatment induces a wide range of phenotypic and genome changes, including inversions, duplications, and deletions. Additionally, the authors analyzed the relationship between gene expression changes and genomic alterations to identify potential causal genes underlying specific phenotypes. While their findings provide clear and reliable evidence of structural variations induced by etoposide, I have several suggestions to enhance the clarity of their results, as detailed below.

      __Response: __We thank the reviewer for their feedback. It has helped improve the presentation and clarity of our results.

      Major comments:

      1. Lines 166-169: My understanding is that you selected etoposide-treated M1 plants based on specific phenotypes, and observed their M2 and M3 progeny, categorizing them as either phenotype-positive or phenotype-negative. In Table S3, phenotypes other than BR-like dwarf, virescent, and short internode dwarf are not mentioned. Does this indicate that these other lines did not exhibit heritable phenotypic traits? If other lines showed some phenotype changes, could you incorporate progeny relationships along with phenotype information into Table S3? Additionally, in Figures S2 and S3, you reference 26A lines. Did they exhibit similar phenotypic changes among them?

      __Response:____ __Unfortunately, we do not have detailed phenotypes of each chosen M1 line. Most lines had one or more of the phenotypes we mention in the results – “Those exposed to 160 µM of etoposide exhibited significantly more abnormal phenotypes than DMSO only or 80 µM etoposide plants, including loss of apical dominance, gnarled leaves, reduced plant size, seed abortion, and lower seed number at maturity (Figure 1A).” It is important to note that M2 phenotypes were not observed in M1.

      Table S3 is now Table S7. Only some mutants lines or lineages that were sequenced had one of the scored phenotypes in M2 (10B,13B, 1A, 1B, 21A, 24B, 26A, 34C, 5A,9A). Of these, we have formally tracked inheritance of only 13B, 1A, 34C, and 5A over multiple generations. We do not have similar data for other sequenced lines . We also sequenced some lines (17B, 21B, 26C) without any mutant phenotypes to assess if plants lacking visible phenotypes still carried SVs. Indeed, as described in Table S9, all three of these lines carried small SVs, which might not affect genes that create an obvious visible phenotype under our growth or observation conditions.

      Yes. 26A siblings all exhibited the same flat leaf phenotype.

      Overall, our data suggests that mutant phenotypes and their causal SVs can be stably transmitted through multiple rounds of meiosis.

      Lines 187-189 and Figure S4: The assessment of repeat copy number variation provides valuable insights. However, based on the figure, the conclusion that "etoposide treatment likely did not trigger genomic instability in repetitive DNA" is difficult to interpret. Could you modify the figure into a box plot with raw data points and include a statistical analysis to support this conclusion?

      __Response: __The figure has been modified to a box plot and is now presented as main Figure 3. Wilcox test has been performed and shows no significant difference in read depth over NOR2, NOR4, and telomere regions between control and etoposide-treated lines.

      Line 200 and Figure S8A: You state that SNV analysis identified a similar number of SNVs in treated and control plants. However, this is not easily interpretable from the figure. Could you include a statistical comparison between etoposide-treated and control plants? For example, EMS mutagenesis is known to induce specific G/C → A/T transitions. Did etoposide-treated and control plants exhibit the same types of nucleotide changes, or were there differences in the mutation spectrum?

      Response: This figure has been modified to assess the entire mutational spectrum of SNVs, including statistical comparisons, and is now part of Figure 3. We have also added the following text on line 244: “However, SNV analysis identified a comparable spectrum and number of SNVs in etoposide-treated and control lines (Figure 3), suggesting that etoposide did not induce excess SNVs.”

      Lines 219-220: Your conclusion clearly demonstrates the detection of numerous structural variations using both short- and long-read sequencing technologies. Could you provide a summary table listing the detected mutation positions? Since short-read sequencing is generally less effective in detecting large structural variations, I am particularly interested in evaluating the accuracy of Lumpy Express in identifying mutations.

      Response: Short-read sequencing and Lumpy Express are unsurprisingly less effective in detecting large structural variations when compared with long-read based approaches. SVs detected by Nanopore were missed by short-read sequencing and Lumpy Express. However, it is hard to benchmark the efficacy of Lumpy Express as only a few lines were sequenced by both Nanopore long-read sequencing and short-read sequencing. After removing SVs that were also present in control lines, we could identify only one SV detected by both Lumpy Express and Nanopore sequencing; this SV is a deletion. In plant 1A_4_5, which was sequenced by short reads, Lumpy Express called a 70 bp deletion at Chr 5: 5776579. SV calling using Nanopore-generated sequence of a sibling plant, 1A_4_11, identified a 70 bp deletion at Chr 5:5776578. The SVs identified by Lumpy Express are presented in Table S9. Those identified from long-read data are in Table S11.

      1. Figures 3E-G: To facilitate a clearer comparison of the effects of structural variations on gene expression between BR-like dwarf and short internode dwarf, could you add an average trend line to the figures, similar to Figure 3B?

      Response: An average trend line has been added to these plots. These data are now presented in Figure S9.

      Minor comments:

      Line 105 and Figure 1A: In the manuscript, etoposide concentrations are stated as 0, 40, 80, and 160 μM, whereas Figure 1A labels the concentrations as 0, 80, 160, and 320 μM. Should the figure be updated to 0, 40, 80, and 160 μM for consistency?

      __Response: __Thank you for the comment. We updated the results section to make concentrations listed consistent with methods (0, 20, 40, 80,160, 320, and 640 µM) and added additional description of seedling growth such that all concentrations are described. We also updated references to the figure such that only sentences regarding concentrations with photos reference Fig 1A.

      Figure 1B legend: Typographical error: "roundsof" → "rounds of".

      Response: Corrected.

      Line 109: Do you have a summary table for the M1 generation? If so, could you provide it as a supplementary table?

      Response: We regretfully do not have the data to populate a summary table for the M1 generation. As described above, most M1 plants have one or more of the following phenotypes mentioned in text: “loss of apical dominance, gnarled leaves, reduced plant size, seed abortion, and lower seed number at maturity “

      Line 119: Figure 1B only defines developmental stages. To improve clarity, consider revising "Figure 1B" to "Figure 1B-F", allowing readers to easily understand the corresponding figures.

      __Response: __We updated the Figure reference to Fig. 1B-F.

      Line 121: The citation "(Figure S1, Table S1)" would be clearer if placed at the end of the sentence.

      __Response: __The citation was moved to the end of the sentence.

      Lines 137, 148, 167: To maintain consistency with Figure 1C-F and the manuscript's logical flow, could you standardize the order of phenotypes as "virescent, short internode dwarf, and BR-like dwarf" instead of the current variation?

      __Response: __We have standardized the order of the figures and the discussions of phenotypes in the text as: BR-like dwarf, short-internode dwarf, virescent, then variegated.

      Line 139: Why is "Figure 1B" referenced at this position? Would it be more appropriate to remove this reference?

      Response: Figure 1B shows that the phenotypes were able to be transmitted at least until the M5 generation, thus the reference.

      Figure S7 legend: Typographical error: "to to" → "to".

      __Response: __Corrected.

      Figure S8B (Chromosome 5 labels): Could you adjust the position labels to maintain a consistent format with other chromosomes?

      Response: This figure has been modified on request of another reviewer such that this is no longer applicable.

      Lines 262, 277, 279: "Figure S11" should be corrected to "Figure S10".


      __Response: __Corrected.

      Line 269: "Figure S10" should be corrected to "Figure S11B-H".


      __Response: __Corrected.

      Reviewer #3 (Significance (Required)):

      In mutation studies aimed at inducing large-scale genomic variations, irradiation has traditionally been the primary method for mutagenesis. However, this study proposes a more efficient and accessible alternative using chemical mutagenesis with a DNA topoisomerase II inhibitor. Genomic analysis of mutants generated through this treatment revealed extensive genomic alterations, with a mutation frequency exceeding that of gamma irradiation-induced mutants. These findings suggest that this approach has the potential to advance mutation research for plant biologists and breeders seeking efficient methods for trait improvement. Furthermore, the authors integrate RNA-seq analysis for selected traits, demonstrating a systematic workflow for candidate gene identification and facilitating the determination of causal genes.


    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript entitled "A simple method to efficiently generate structural variation in plants" by Bechen et al. investigated an efficient mutagen for inducing large structural variations in plants, replacing traditional irradiation methods with a chemical mutagenesis strategy. The study examined the effects of etoposide, a DNA topoisomerase II inhibitor, on structural variations and demonstrated that etoposide treatment induces a wide range of phenotypic and genome changes, including inversions, duplications, and deletions. Additionally, the authors analyzed the relationship between gene expression changes and genomic alterations to identify potential causal genes underlying specific phenotypes. While their findings provide clear and reliable evidence of structural variations induced by etoposide, I have several suggestions to enhance the clarity of their results, as detailed below.

      Major comments:

      1. Lines 166-169: My understanding is that you selected etoposide-treated M1 plants based on specific phenotypes, and observed their M2 and M3 progeny, categorizing them as either phenotype-positive or phenotype-negative. In Table S3, phenotypes other than BR-like dwarf, virescent, and short internode dwarf are not mentioned. Does this indicate that these other lines did not exhibit heritable phenotypic traits? If other lines showed some phenotype changes, could you incorporate progeny relationships along with phenotype information into Table S3? Additionally, in Figures S2 and S3, you reference 26A lines. Did they exhibit similar phenotypic changes among them?
      2. Lines 187-189 and Figure S4: The assessment of repeat copy number variation provides valuable insights. However, based on the figure, the conclusion that "etoposide treatment likely did not trigger genomic instability in repetitive DNA" is difficult to interpret. Could you modify the figure into a box plot with raw data points and include a statistical analysis to support this conclusion?
      3. Line 200 and Figure S8A: You state that SNV analysis identified a similar number of SNVs in treated and control plants. However, this is not easily interpretable from the figure. Could you include a statistical comparison between etoposide-treated and control plants? For example, EMS mutagenesis is known to induce specific G/C → A/T transitions. Did etoposide-treated and control plants exhibit the same types of nucleotide changes, or were there differences in the mutation spectrum?
      4. Lines 219-220: Your conclusion clearly demonstrates the detection of numerous structural variations using both short- and long-read sequencing technologies. Could you provide a summary table listing the detected mutation positions? Since short-read sequencing is generally less effective in detecting large structural variations, I am particularly interested in evaluating the accuracy of Lumpy Express in identifying mutations.
      5. Figures 3E-G: To facilitate a clearer comparison of the effects of structural variations on gene expression between BR-like dwarf and short internode dwarf, could you add an average trend line to the figures, similar to Figure 3B?

      Minor comments:

      Line 105 and Figure 1A: In the manuscript, etoposide concentrations are stated as 0, 40, 80, and 160 μM, whereas Figure 1A labels the concentrations as 0, 80, 160, and 320 μM. Should the figure be updated to 0, 40, 80, and 160 μM for consistency?

      Figure 1B legend: Typographical error: "roundsof" → "rounds of".

      Line 109: Do you have a summary table for the M1 generation? If so, could you provide it as a supplementary table?

      Line 119: Figure 1B only defines developmental stages. To improve clarity, consider revising "Figure 1B" to "Figure 1B-F", allowing readers to easily understand the corresponding figures.

      Line 121: The citation "(Figure S1, Table S1)" would be clearer if placed at the end of the sentence.

      Lines 137, 148, 167: To maintain consistency with Figure 1C-F and the manuscript's logical flow, could you standardize the order of phenotypes as "virescent, short internode dwarf, and BR-like dwarf" instead of the current variation?

      Line 139: Why is "Figure 1B" referenced at this position? Would it be more appropriate to remove this reference?

      Figure S7 legend: Typographical error: "to to" → "to".

      Figure S8B (Chromosome 5 labels): Could you adjust the position labels to maintain a consistent format with other chromosomes?

      Lines 262, 277, 279: "Figure S11" should be corrected to "Figure S10".

      Line 269: "Figure S10" should be corrected to "Figure S11B-H".

      Significance

      In mutation studies aimed at inducing large-scale genomic variations, irradiation has traditionally been the primary method for mutagenesis. However, this study proposes a more efficient and accessible alternative using chemical mutagenesis with a DNA topoisomerase II inhibitor. Genomic analysis of mutants generated through this treatment revealed extensive genomic alterations, with a mutation frequency exceeding that of gamma irradiation-induced mutants. These findings suggest that this approach has the potential to advance mutation research for plant biologists and breeders seeking efficient methods for trait improvement. Furthermore, the authors integrate RNA-seq analysis for selected traits, demonstrating a systematic workflow for candidate gene identification and facilitating the determination of causal genes.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      • The manuscript describes mutagenesis of Arabidopsis by a topoisomerase II inhibitor. The method is effective, resulting in good density of SV and no detectable SNV. The authors provide a full characterization of the mutants, their phenotypes, and their genomes. The 34-sample selected for genomic analysis is sufficient to make firm conclusions.
      • The manuscript is clearly written and illustrated.
      • The manuscript does a very good job at covering the phenotypic and molecular analysis for this type of mutagenesis. For example, they highlight the difference between short and long reads in the identification of SV.
      • The figures are very clear, with the exception of Fig. 3, which I found harder to follow. It would be enhanced by describing the candidate lesion(s) in the first panel of each mutant series. This would clarify the expectation. For example, larger indels (not examined here) should be associated with higher (insertion) or lower (deletion) expression of the affected genes. In the cases presented in Fig.3, the structural changes do not suggest obvious hypotheses. The authors examine the regions near breakpoint of inversions or near small indels. It makes sense, but it does not make the figure very digestible. The connected text in the results, on the other hand, is very clear. Perhaps, making the conclusions in the figure legend as well? As a connected thought, it would have been useful to provide expression data for a large indel exemplifying the cis/trans nature of regulatory changes.
      • Deletions and other rearrangements may affect meiosis as noted by the authors. In addition, they can display gametophytic phenotypes and a deficit in transmission. The likelihood increases with the size of the indel. Large indels are not transmitted. Accordingly, for indels above a certain size, it is not possible to determine the number of causal loci from F2 ratios.
      • Although the use of topo II inhibitors for mutagenesis in plants is novel, the mutagenic effects described here are well documented in animals. This should be acknowledged (e.g. Heisig, Mutagen. 2009; Ferguson, Env Mol Mutagen. 1994)

      Referees cross-commenting

      I also found the expression analysis confusing and in need of revision. One reason is that a set of clear expectations were not provided. I believe that the RNAseq analysis is expected to help identify the gene(s) that underlie a trait. For example, genes located on an indel are likely to display expression proportional to copy number. Also, a new junction or translocation could influence expression of the gene next to the break point. The authors should make this clear in the figure and the text.

      Significance

      • I appreciated the description of the method. It should be widely applicable. In arabidopsis, it requires sustained growth in the presence of the inhibitor. This could limit its applicability. For example, it may not be effective with pollen because exposure by a short soaking period may not be sufficient. Culturing of large seeded species is possible, but adds complexity. In this context, radiations have advantages. I do agree with the authors on the difficulty in identifying a source. However, once one is found, radiation treatment is very simple and convenient.
      • The manuscript describes a useful tool and the connected spectrum of mutations. It has the novelty, quality, and relevance to represent a significant contribution to plant biology and to be of broad interest.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is review of the manuscript „A simple method to efficiently generate structural variation in plants" by Bechen et al. The manuscript presents a very interesting and innovative approach to generate structural variant mutations (including large ones) in the genome of Arabidopsis thaliana using a simple chemical treatment with TOPII inhibitor etoposide. Authors show that unlike chemical mutagens commonly used for induction of SNPs (EMS, sodium azide...), etoposide-treatment caused structural variants like DNA deletions, insertions, inversions and translocations. These mutations were identified by the whole genome short and long read sequencing that also indicated a WT-like frequency of SNPs. This finding can potentially help inducing mutations similar to high energy radiation in potentially any plant. First, the manuscript provides description of the unusual phenotypes found after etoposide treatment and their Mendelistic inheritance. Based on this, authors performed whole genome sequencing and mutation detection, validation. The experimental part ends by transcriptome analysis that authors use as the approach to identify the causal mutations. This part is, in my opinion, the weakest part of the manuscript and would benefit from further clarification or even additional experiments (see below).

      Overall the manuscript is very clear and contains all necessary information. The only part that was confusing to me, was the section focusing on the transcriptome analysis.

      Major points:

      Line 222: In the section „RNA-Seq identifies genes that are associated with structural variation and mutant phenotypes", authors suggest that the changes in the transcript amount were used to identify causal mutations. I got confused by this section. Exach of the examples represents unique situation and thus only single cases are presented which makes it hard to estimate robustness of the presented approach. Also, the presented mutations have prominent phenotypes that were already heavily studied in the past and therefore the possible causal genes are mostly known. Therefore, I am not sure how this approach would stand in case of traits with unknown underlying genes. When refering to the case with the chromosomal inversion, I do not see how one will be able to map a candidate based on the relatively mild expression (but maybe I am missing something here). Similarly, the „mapping" approach applied to the variegated line would not be possible on a trait that is less studied and the candidates are not well known. I wond why authors did not perform association mapping on a bulk of phenotypically mutant plants collected from a segregating F2 backcross population. This might be a more robust way of linking the phenotype with a mutation.

      Discussion section: I am missing discussion on how etoposide could be causing such structural variants.

      Minor points:

      Line 70: Possibly add sodium azide. It is frequently used as mutagen for some plant species. Line 122: „...etoposide is an excellent mutagen for efficiently creating large-effect mutations." This cannot be claimed at this point because the sequence analysis data were not shown yet. Please reformulate.

      Referees cross-commenting

      My main issue was the mapping protocol using transcriptomic changes. It is hard to believe that this approach would work well on unknown/less studied traits. What is your opinion?

      Significance

      Strengths Innovative way on how to induce structural variant mutations in plants.

      Limitations The approach on how to map the mutations needs more development. At this point i tis not clear how well the approach will work in other plant species.

      Audience basic and applied plant scientists

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02888

      Corresponding author(s): Christian, Fankhauser

      General Statements

      We were pleased to see that the three reviewers found our work interesting and provided supportive and constructive comments.

      Our answers to their comments and/or how we propose to address them in a revised manuscript are included in bold.

      1. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: Plant systems sense shading by neighbors via the phytochrome signaling system. In the shade, PHYTOCHROME-INTERACTING FACTORS (PIFs) accumulate and are responsible for transcriptional reprogramming that enable plants to mobilize the "shade-avoidance response". Here, the authors have sought to examine the role of chromatin in modulating this response, specifically by examining whether "open" or "closed" chromatin regions spanning PIF target genes might explain the transcriptional output of these genes. They used a combination of ATAC-seq/CoP-qPCR (to detect open regions of chromatin), ChIP (to assay PIF binding) and RNA-seq (to measure transcript abundance) to understand how these processes may be mechanistically linked in Arabidopsis wild-type and pif mutant lines. They found that some chromatin accessibility changes do occur after LRFR (shade) treatment (32 regions after 1h and 61 after 25 h). While some of these overlap with PIF-binding sites, the authors found no correlation between open chromatin states and high levels of transcription. Because auxin is an important component of the shade-avoidance response and has been shown to control chromatin accessibility in other contexts, they examined whether auxin might be required for opening these regions of chromatin. They find that in an auxin biosynthesis mutant, there is a small subset of PIF target genes whose chromatin accessibility seems altered relative to the wild-type. Likewise, they found that chromatin accessibility for certain PIF targets is altered in phyB and pif mutant and propose that PIFs are necessary for changing the accessibility of chromatin in these genes. The authors thus propose that PIF occupancy of already open regions, rather than increased accessibility, underly the increase in transcript of abundance of these target genes in response to shade.

      Major comments: *• I find that the data generally support the hypothesis presented in the manuscript that chromatin accessibility alone does not predict transcription of PIF target genes in the shade. That said, I think that a paragraph from the discussion (lines 321-332) would benefit from some careful rephrasing. I think it is perfectly reasonable to propose that PIF occupancy is more predictive of shade-induced transcriptional output than chromatin accessibility, but I think that calling PIF occupancy "the key drivers" (line 323) or "the main driving force" (line 76) risks ignoring the observation that levels of PIF occupancy specifically do not predict expression levels of PIF target genes (Pfeiffer et al., 2014, Mol Plant). For PIL1 and HFR1, the authors have shown that PIF promoter occupancy and transcript levels are correlated, but the central finding of Pfeiffer et al. was that this pattern does not apply to the majority of PIF direct target genes. Finding factors (i.e. histone marks) that convert PIF-binding information into transcriptional output appears to have been the impetus for the experiments devised in Willige et al., 2021 and Calderon et al., 2022. It is great that the authors have outlined in the discussion that there are a number of factors that modulate PIF transcriptional activating activity but I think that the emphasis on PIF-binding explaining transcript abundance should be moderated in the text. *

      We appreciate the reviewers’ comments and will address it by introducing appropriate changes to the discussion. One element that should be pointed out is that the study of Willige et al., 2021 allows us to look at sites where PIF7 is recruited in response to the shade stimulus (a low R/FR treatment) and relate this to higher transcript abundance of the nearby genes. The study of Pfeiffer et al., 2014 which analyses PIF ChIP studies from several labs does not include this dynamic view of PIF recruitment in response to a stimulus. For example, this study re-analyses data from our lab, Hornitschek et al., 2012, in which we did PIF5 ChIP in low R/FR, but we did not compare that to high R/FR to enable an analysis of sites where we see recruitment of PIF5 in response to a shade cue. In the revised manuscript we will also include a new figure comparing PIF7 recruitment and changes in gene expression at direct PIF target genes.

      • I think that the hypothesis could be further supported by incorporating the previously published ChIP-seq data on PIF1, PIF3 and PIF5 binding. Given these data are published/publicly available, I think it would be helpful to note which of the 72 DARs are bound by PIF1, PIF3 and/or PIF5. Especially so given that PIF5 (Lorrain et al., 2008, Plant J) and PIF1/PIF3 (Leivar et al., 2012, Plant Cell) contribute at least in some capacity to transcriptional regulation in response to shade. At the very least, it might help explain some of the observed increases in nucleosome accessibility observed for genes that don't have PIF4 or PIF7-binding.* This is a thoughtful suggestion. Our choice to focus on PIF7 target genes is dictated by two reasons. First, the finding that amongst all tested PIFs, PIF7 is the major contributor to the control of low R/FR (neighbor proximity) induced responses in seedlings (e.g. Li et al., 2012; de Wit et al., 2016; Willige et al., 2021). In addition, the PIF7 ChIP-seq and gene expression data from the Willige et al., 2021 paper was obtained using growth conditions very similar to the ones we used, hence allowing us to compare it to our data. As the reviewer suggests, other PIFs also contribute to the low R/FR response and hence looking at ChIP-seq for those PIFs in publicly available data is also informative. One limitation of this data is that ChIP-seq was not always done in seedlings grown in conditions directly comparable to the conditions we used (except for PIF5, see above). Nevertheless, we have performed this analysis with the available data suggested by the reviewer and intend to include the results in the revised version of the manuscript, presumably updated Figure 4B.

      • In the manuscript, there are several instances where separate col-0 (wild type) controls have been used for identical experiments. Specifically, qPCR (Fig 3C, Fig S7C/D and Fig S8C/D), CoP-qPCR (Fig 5B/5C and Fig S8E/F) and hypocotyl measurements (Fig S7A/B and Fig S8A/B). In the cases of the hypocotyl measurements, there appear to be hardly any differences between col-0 controls indicating the measurements can be confidently compared between genotypes.

      We appreciate this comment but to be comprehensive, we like to include a Col-0 control for each experiment (whenever possible) and hence also include the data when available.

      • In some cases of qPCR and CoP-qPCR experiments however, the differences in values obtained from col-0 samples that underwent identical experimental treatments appear to vary significantly. In Figure 3C for example, the overall trend for PIL1 expression in col-0 is the same (e.g. HRFR levels are low, LRFR1 levels are much higher and LRFR25 levels drop down to some intermediate level) but the expression levels themselves appear to differ almost two-fold for the LRFR 1h timepoint (~110 on the left panel vs ~60 for the right panel). Given the size of the error bars, it appears that these data represent the mean from only one biological replicate. PIL1 expression levels at LRFR 1h as reported in Fig S7C and D also show similar ~2-fold differences. __This is a good comment. Having looked at PIL1 gene induction by low R/FR in dozens of similar experiments made us realize that indeed while the PIL1 induction is always massive, the extent is somewhat variable. Based on the data that we have (including from RNA-seq) we are convinced that this is due to the very low level of expression of PIL1 in high R/FR conditions. Given that induction by low R/FR is expressed as fold increase relative to baseline high R/FR expression, small changes in the lowly expressed PIL1* in high R/FR leads to seemingly significant differences in its induction by low R/FR across experiments.__

      All qPCR data is represented by three biological replicates, and the variation between them per experiment is low, which is reflected in the size of the SD error bars. Data on technical and biological replicates in each panel will be clearly indicated in the revised figure legends.

      • I would recommend that the authors explicitly describe the number of biological replicates used for each experiment in the methods section. If indeed these experiments were only performed once, I think the authors should be very careful in the language used in describing their conclusions and in assigning statistical significance. One possibility that could also be helpful would be normalizing LRFR 1h and LRFR 25h values to HRFR values and plotting these data somewhere in the supplemental data. If, for example, the relative levels of PIL1 are different between replicates but the fold-induction between HRFR and LRFR 1h are the same, this would at least allay any concerns that the experimental treatments were not the same. I understand that doing so precludes comparison between genotypes, but I do think it's important to show that at least the control data are comparable between experiments.

      * All qPCR and CoP-qPCR experiments have been performed with three 3 biological replicates as described in Materials and Methods section, and these are represented in the Figures. Relative gene expression in the qPCR experiments was normalized to two housekeeping genes YLS8 and UBC21 and afterwards to one biological replicate of Col-0 control in HRFR. As indicated for the previous comment information about replicates will be included in the updated figure legends.

      • Similarly, for the CoP-qPCR experiments presented in Fig 5B and 5C, the col-0 values for region P2 between Fig 5B and 5C shows that while HRFR and LRFR 1h look comparable, the values presented for LRFR 25h are quite different.

      * This comment of the reviewer prompted us to propose a different way of representing the data that is clearer (new Figure 5B and 5C). We believe that this facilitates the comparison between the genotypes. Enrichment over the input was calculated for the chromatin accessibility of each region. Chromatin accessibility was further normalized against two open control regions on the promoters of ACT2 (AT3G18780, region chr3:6474579: 6474676) and RNA polymerase II transcription elongation factor (AT1G71080 region chr1:26811833:26811945). The difference between previous representation is that the regions are not additionally subtracted to Col-0 in HRFR. We will update the Materials and Methods and figure legend sections with this information.

      Minor comments: • Presentation of Supplemental Figure 7A/7B and Supplemental Figure 8A/8B could be changed to make the data more clear (i.e. side-by-side rather than superimposed).

      We propose changing the presentation of the hypocotyl length data to show the values for days side-by-side as the Reviewer suggests.

      • I think that the paragraph introducing auxin (lines 25-37) could be reduced to 1-2 sentences and merged into a separate introductory paragraph given that the SAV3 work makes up a relatively minor component of the manuscript.

      * We agree with the reviewer and will reduce the paragraph about auxin and merge it with the previous paragraph about transcription.

        • For Figure 3A, I would strongly encourage the authors to show some of the raw western blot data for PIF4, PIF5 and PIF7 protein abundance (and loading control), not just the normalized values. This could be put into supplemental data, but I think it should accompany the manuscript.

      * We agree that presenting the raw data that was used for quantification is important. We will include the western blots used for quantifying PIF4, PIF5 and PIF7 protein abundance (and loading control DET3). This information will presumably be included to the Supplementary Figure 3C (figure number to be confirmed once we decide on all new data to be presented).

      • Lines 145-147 "we observed a strong correlation between PIF4 protein levels (Figure 3A) and PIL1 promoter occupancy (Figure 3B), and a similar behavior was seen with PIF7 (Figure 3B)." According to Fig 3A, there is no statistically significant increase in PIF7 abundance after 1h shade. There is an apparent increase in PIF7 promoter occupancy, but the variation appears too large for it to be statistically significant. I agree that qualitatively there is a correlation for PIF4 but I think the description of the behavior of PIF7 should be rephrased.

      * __As suggested by the reviewer, we will rephrase this paragraph to more accurately account for our data and also what was reported by others (e.g. Willige et al, 2021, in Li et al, 2012) regarding the regulation PIF7 levels and phosphorylation in response to a low R/FR treatment. __

      • There appear to be issues in the coloring of the labels (light blue dots vs dark blue dots) for the PIF7 panels of Fig 3B and Supplemental Fig 3B.*

      We thank the reviewer for pointing this out. This will be clarified by appropriate changes in the figure to avoid confusion in the revised version of Figure 3B.

      Reviewer #1 (Significance (Required)):

      This authors here have sought to examine the possibility that the transcriptional responses to shade mediated by the phy-PIF system might involve large-scale opening or closing of chromatin regions. This is an important and unanswered question in the field despite several studies that have looked at the role of histone variants (H2A.Z) and modifications (H3K4me3 and H3K9ac) in modulating PIF transcriptional activating activity. The authors have shown that, at least in the case of the transcriptional response to shade mediated by PIF7 (and to an extent PIF4), large-scale changes in chromatin accessibility are not occurring in response to shade treatment.

      The results presented in this study support the hypothesis that large-scale changes in chromatin accessibility may have already occurred before plants see shade. This opens the possibility that perhaps the initial perception of light by etiolated (dark-grown seedlings) might trigger changes in chromatin accessibility, opening up chromatin in regions encoding "shade-specific" genes and/or closing chromatin in regions encoding "dark-specific" genes.

      The audience for this particular manuscript encompasses a fairly broad group of biologists interested in understanding how environmental stimuli can trigger changes in chromatin reorganization and transcription. The results here are important in that they rule out chromatin accessibility changes as underlying the changes in transcription between the short-term and long-term shade responses. They also reveal that there are a few cases in which chromatin accessibility does change in a statistically-significant manner in response to shade. These regions, and the molecular players which regulate their accessibility, merit further exploration.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The study by Paulisic et al. explores the variations in chromatin accessibility landscape induced by plant exposure to light with low red/far-red ratios (LRFR), which mimicks neighbor shade perception. The authors further compare these changes with the genome association of PIF4 and PIF7 transcription factors - two major actors of gene expression regulation in response to LRFR. While this is not highlighted in the main text, the analyses of chromatin accessibility are performed on INTACT-mediated nucleus sorting, presumably to ensure proper and clean isolation of nuclei.

      Major comments

      • Why is the experimental setup exposing plants to darkness overnight? Does this affect the response to LRFR, by a kind of reset of phytochrome signaling? I guess this choice was made to maintain a strong circadian rhythm. Yet, given that PIF genes are clock-regulated, I am afraid that this choice complicates data interpretation concerning the specific effects of LRFR exposure.

      There appears to be some confusion which prompts us to better explain our protocol both by changing Figure 1A (that outlines the experimental conditions) and in the text.

      Seedlings are grown in long day conditions because this is more physiologically relevant than growing them in constant light, which is a rather unnatural condition.

      The reviewer is correct that PIF transcription is under circadian control and the shade avoidance response is gated by the circadian clock (e.g. Salter et al., 2003). To prevent conflating circadian and light quality effects, all samples that are compared are harvested at the same ZT (circadian time – hours after dawn). This allows us to focus our analysis on light quality effects specifically. We are therefore convinced that our protocol does not complicate the interpretation of the LRFR effects reported here.

      • As a result of this setup, the 1h exposure to LRFR immediately follows HRFR while the 3h final LRFR exposure of the « 25h LRFR » samples immediately follows a long period of darkness. Can this explain why in several instances (e.g., at the ATHB2 gene) 1h LRFR seems to have stronger effects than 25h LRFR on chromatin accessibility?* Please check the explanation above. Both samples are harvested at the same ZT (ZT3, meaning 3 hours after dawn). The 1h LRFR seedlings went through the night, had 2 hours of HRFR then 1h of LRFR. The 25h are harvested at the very same ZT, meaning 3h after dawn. Importantly, the HRFR control was also harvested at ZT3, meaning 3h after dawn. As indicated above this protocol allows us to focus on the light quality effects by comparing samples that are all harvested at the same ZT.

      We expect that the changes in Fig. 1A and associated text changes will clarify this issue.

      • Lane 42 cites the work by Calderon et al 2022 as « Transcript levels of these genes increase before the H3K4me3 levels, implying that H3K4me3 increases as a consequence of active transcription ». Despite this previous study being reviewed and published, such a strong conclusion should be taken cautiously, and I disagree with it. The study by Calderon et al compares RNA-seq with ChIP-seq data, two methodologies with very different sensitivity, especially when employing bulk cells/whole seedlings as starting materials. For example, a gene strongly induced in a few cells will give a good Log2FC in RNA-seq data analysis (as new transcripts are produced after a low level of transcripts before shade) but, even though its chromatin variations would follow the same temporality or would even precede gene induction, this would be invisible in bulk ChIP-seq data analysis (which averages the signal of all cells together). I understand the rationale for relying on the conclusions made in an excellent lab with strong expertise in light signaling, but I recommend being cautious when relying on these conclusions to interpret new data.* We agree with this comment, and we will change the text to reflect this.

      • The problem is that the same issue holds true when comparing ATAC-seq and RNA-seq data. ATAC signals reflect average levels over all cells while RNA-seq data can be influenced by a few cell highly expressing a given gene. Even though authors carefully sorted nuclei using an INTACT approach, this should be discussed, in particular when gene clusters (such as cluster C-D) show no match between chromatin accessibility and transcript level variations. In this regard, is PIF7 expressed in many cells or a small niche of cells upon LRFR exposure? The conclusions on its role in chromatin accessibility, analyzed here as mean levels of many different seedling cells, could be affected by PIF7 activity pattern (e.g., at lane 293). __This is a good comment. PIF7 is expressed in the cotyledons and leaves in LD conditions (Kidokoro et al, 2009, Galvao et al, 2019), and few available scRNA-seq datasets indicate an enrichment of PIF7 in the epidermis (Kim et al, 2021, Lopez-Anido et al, 2021). LRFR exposure only mildly represses PIF7* expression as seen in Figure 3A and also in our bulk RNA-seq study (Table S4). We will discuss this potential limitation to our study in a revised version of the manuscript.__

      • Lane 89, the conclusion linking DNA methylation and DNA accessibility is unclear to me, this may be rephrased. Also, it should be noted that in gene-rich regions, most DNA methylation is located along the body of moderately to highly transcribing genes (gene-body methylation) while promoters of active and inactive genes are most frequently un-methylated.* We will rephrase to better reflect the presence or absence of DNA methylation on promoter regions of shade regulated genes that contain accessible sites.

      • Figure 3B shows a few ChIP-qPCR results with important conclusions. Why not sequencing the ChIPped DNA to obtain a genome-wide view of the PIF4-PIF7 relationships at chromatin, and also consequently a more robust genome-wide normalization?

      * Several studies have shown that in the conditions that we studied here: transfer of seedlings from high R/FR (simulated sun) to low R/FR (neighbor proximity), amongst all PIFs, PIF7 is the one that plays the most dominant function (e.g. Li et al., 2012; de Wit et al., 2016; Willige et al., 2021). PIF4 and PIF5 also contribute but to a lesser extent. Given that Willige et al., 2021 did extensive ChIP-seq studies for PIF7 using similar conditions to the ones we used, we decided to rely on their data (that we re-analyzed), rather than performing our own PIF7 ChIP-seq analysis. While also performing a ChIP-seq analysis for PIF4 in similar conditions might be useful (this data is not available as far as we know), we are not convinced that doing that experiment would substantially modify the message. In the revised version we will also include analysis of the data from Pfeiffer et al., 2014, which comprises a ChIP-seq. dataset for PIF5 (the closest paralog of PIF4) initially performed by Hornitschek et al., in 2012 in low R/FR conditions (see comment to reviewer 1 above). For new ChIP-seq, we would have to make this experiment from scratch with substantially more material than what we used for the targeted ChIP-qPCR analyses. We thus do not feel that such an investment (time and money) is warranted.

        • Given the known functional interaction between PIF7 and INO80, it would be relevant to test whether changes in chromatin accessibility at ATHB2 and other genes are affected in ino80 mutant seedlings. __We agree with the reviewer that this is potentially an interesting experiment. This will allow us to determine whether the nucleosome histone composition has an influence on nucleosome positioning at selected shade-regulated genes (e.g. ATHB2). We note that according to available data, the effect of INO80 would be expected once PIF7 started transcribing shade-induced genes. We therefore propose comparing the WT with an ino80 mutant for their seedling growth phenotype, expression of selected shade marker gene (e.g. ATHB2*) and chromatin accessibility before (high R/FR) and after low R/FR treatment at selected shade marker genes. This will allow us to determine whether INO80 influences chromatin accessibility prior to a low R/FR treatment and/or once the treatment started. Our plan is to include this data in a revised version of the manuscript. __
      • On the same line, it would be interesting to test whether PIF7 target regions with pre-existing accessible chromatin would exist in ino80 mutant plants. In other words, testing a model in which chromatin remodeling by INO80 defines accessibility under HRFR to enable rapid PIF recruitment and DNA binding upon LRFR exposure.*

      See our answer just above.

      Minor comments

      *• In Figure 1C, it seems that PIF7 target genes do not match the set of LRFR-downregulated genes (even less than at random). Why not exclude these 4 genes from the analyses? *

      This is correct. There are indeed only 4 downregulated PIF7 target genes as we define them. Removing these genes from the analyses does not change our interpretation of the data and hence for completeness we propose keeping them in a revised version of the manuscript

      • Figure 3A shows the quantification of protein blots, but I did not find the corresponding images. These should be shown in the figure or as a supplementary figure with proper controls.

      * We will include the raw Westen blots used for quantification of PIF4, PIF5 and PIF7 in the revised version of the manuscript

        • Lane 102, it is unclear why PIF7 target genes were defined as the -3kb/TSS domains while Arabidopsis intergenic regions are on average much shorter. Gene regulatory regions, or promoters, are typically called within -1kb/TSS regions to avoid annotating a ChIP peak to the upstream gene or TE. A better proxy of PIF7 typical binding sites in gene regulatory regions could be determined by analysing the mean distance between PIF7 peak coordinates and the closest TSS. Typically, a gene meta-plot would give this information. __We agree that the majority of PIF7 binding peaks are close to the 5’ of the TSS based on the PIF7 binding distribution meta-plot. But several known PIF binding sites are actually further upstream than 1kb 5’ of the TSS (e.g. ATHB2 and HFR1). However, we re-analyzed the data using your suggestion with -2kb/TSS and -1kb/TSS and while the number of target genes is reduced, it does not change our conclusions about PIF7 binding sites being located on accessible chromatin regions. Importantly, some well characterized LRFR induced genes such as HFR1* would not be annotated correctly if only peaks closest to the gene TSS were taken into account, without flanking genes. In this case only the neighboring AT1G02350 would be annotated, hence missing some important PIF7 target genes. Taking this into consideration we will not modify this part of the analysis in a revised manuscript.__
      • Figure 4B, what's represented in the ATAC-seq heatmap: does a positive z-score represent high accessibility?*

      On the ATAC-seq heatmap we have represented z-scores of the average CPM (counts per million) for accessible chromatin regions. Z-scores are calculated by subtracting the average CPM from the median of averaged CPMs for each accessible chromatin region and then divided by the standard deviation (SD) of those averaged CPMs across all groups per accessible region (in our case a group is an average of three biological replicates for either HRFR, 1h or 25h of LRFR). In that sense, z-score indicates a change in accessibility, where higher z-score indicates opening of the region and lower z-score indicates a region becoming more closed when compared among the three light treatments (HRFR, 1h or 25h of LRFR). We will make sure that this is clear in the revised manuscript. Reviewer #2 (Significance (Required)):

      Contradicting the naive hypothesis that PIFs may target shade-inducible genes to « open » chromatin of shade-inducible genes with the help of chromatin remodelers, such as INO80, the study highlights that PIF7 typically associates with pre-existing accessible chromatin states. Thus, even though this is not stated, results from this study indicate that PIF7 is not a pioneer transcription factor. The data seem very robust, and while some conclusions need clarification, it should be of great interest to the community of scientists studying plant light signaling and shade responses.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In their manuscript, Paulisic et al. investigate whether the transcriptional response of Arabidopsis seedlings to shade depends on chromatin accessibility, with a specific focus on PIF7-regulated genes. To this end, they perform ATAC-seq and RNA-seq, along with other experiments, on seedlings exposed to short and long shade and correlate the results with previously reported PIF7 and PIF4 ChIP-seq data. Based on their findings, they propose that shade-mediated transcriptional regulation may not require extensive remodeling of DNA accessibility. Specifically, they suggest that the open chromatin conformation allows PIFs to easily access and recognize their binding motifs, rapidly initiating gene expression in response to shade. This transcriptional response primarily depends on a transient increase in PIF stability and gene occupancy, with changes in chromatin accessibility occurring in only a small number of genes.

      Major comments: * • I have one issue that, in my opinion, requires more attention. To define the PIF7 target genes, which were later used to estimate whether PIF7 binds to open or closed chromatin and affects DNA accessibility after its binding, the authors compared the 4h LRFR data point from Willige et al. (2021) ChIP-seq with their 1h RNA-seq data point. This comparison might have missed early genes where PIF7 binds before the 1h time point but is no longer present on DNA at 4h. I understand the decision to choose the 4h Willige et al. ChIP-seq data point, performed under LD conditions, as it matches the data in this study, rather than the 5min-30min data points, which were conducted in constant light. However, if possible, it would be interesting to also compare the RNA-seq data with the early PIF7 binding genes to assess how many additional PIF7 target genes could be identified based on that comparison and whether this might alter the conclusions. If the authors do not agree with this point, it should at least be emphasized that the ChIP-seq data and the RNA-seq/ATAC-seq data were performed under different LRFR conditions (R/FR 0.6 vs. 0.1), which may lead to the misidentification of PIF7 target genes in the manuscript.*

      1) This is an interesting suggestion, we therefore reanalyzed 5, 10 and 30 min ChIP-seq timepoints from Willige et al, 2021 and compared them to 4h of LRFR (ZT4). We have crossed these lists of potential PIF7 targets with our 1h LRFR PIF457 dependent genes based on our RNA-seq. While some PIF7 targets appear only in early time points 5-10 min of LRFR exposure, overall, the number and composition of PIF7 target genes is rather constant across these timepoints. We propose to include these additional analyses in a revised version of the manuscript as a supplemental figure. However, these additional analyses do not influence our general conclusions.

      2) The comment regarding the R/FR ratio is important. We will point this out although the conditions used by Willige et al., 2021 and the ones we used are similar, they are not exactly the same in terms of R/FR ratio. Importantly, in both studies the early transcriptional response largely depends on the same PIFs, many of the same response genes are induced (e.g. PIL1, AtHB2, HFR1, YUC8, YUC9 and many others) and the physiological response (hypocotyl elongation) is similar. This shows that this low R/FR response yields robust responses.

      Minor comments: • In Fig. 1D, please describe the meaning of the blue shaded areas and the blue lines under the ATAC-seq peaks, as they do not always correlate.

      The shaded areas and the bars define the extension of the ATAC-seq accessible chromatin peaks. We will add the meaning of the shaded areas and the blue bars in the Figure legend and correct the colors in a revised manuscript

      • In Fig. 1E, it could be helpful to note that the 257 peaks in the right bar correspond to the peaks associated with the 177 genes in the left bar.* We will update Figure 1E and Figure legends for better understanding as the Reviewer suggested.

      • In lines 116, 119, and 122, I believe it should read "Fig. 2" instead of "Fig. 2A."* We thank the Reviewer for noticing the error that we will correct.

      • Lines 138-139: "PIF7 total protein levels were overall more stable, and only a mild and non-significant increase of PIF7 levels was seen at 1 h of LRFR." Since PIF7 usually appears as two bands in HRFR and only one band in LRFR, how was the protein level of PIF7 quantified in Fig. 3A? Additionally, I was wondering about the authors' thoughts on the discrepancy with Willige et al. (2021, Extended Data Fig. 1d), where PIF7 abundance seems to increased after 30 min and 2 h of LRFR.* PIF7 protein levels were quantified by considering both the upper and the lower band in HRFR (total PIF7) and normalizing its levels to DET3 loading control. We still observe an increase in the total PIF7 protein levels at 1h of LRFR, however this change was not statistically significant in these experiments. In our conditions as in Willige et al, 2021, the increase in PIF7 protein levels to short term shade seems consistent as is the pronounced shift or disappearance of the upper band (phosphorylated form) on the Western blots (raw data will be available in the revised manuscript). We will introduce text changes referring to the phosphorylation status of PIF7 in our conditions.

      • Line 150: "... many early PIF target genes (Figure 3C)." Since only PIL1 is shown in Fig. 3C, I would recommend revising this sentence. Alternatively, the data could be presented, as in Fig. 2, for all the PIF7 target genes with transient expression patterns.

      * We will introduce changes in the text to reflect that we only show PIL1 in the main Figure 3C.

      • Line 204: I'm not sure if Supplementary Fig. 7C-D is correct here. If it is, could the order of the figures be changed so that Supplementary Fig. 7C-D becomes Supplementary Fig. 7A-B?*

      The order of the panels A-B in the Supplementary Figure 7 follows the order of the text in the manuscript and is mentioned before panels C-D. It refers to the sentence “Overexpression of phyB resulted in a strong repression of hypocotyl elongation in both HRFR and LRFR, while the absence of phyB promoted hypocotyl elongation (Supplementary Figure 7A-B).”

        • Line 208: "In all three cases...". Please clarify what the three cases refer to. __We will change the text to more explicitly refer to the differentially accessible regions (DARs) of the genes ATHB2 and HFR1* shown in Figure 5A.__
      • Line 231: Should Fig. 5C also be cited here in addition to Supplementary Fig. 7?* We will add the reference to Figure 5C that was missing.

      *• In Supplementary Table 3, more information is needed. For example, it could mention: "This data is presented in Fig. 3 and is based on datasets from ChIP-seq, RNA-seq, etc."

      *

      The table will be updated with more information as suggested by the Reviewer.

      • In the figure legend of Fig. 4B, please check the use of "( )".*

      We will correct the error and include the references inside the parenthesis.

      Reviewer #3 (Significance (Required)):

      Paulisic et al. present novel discoveries in the field of light signaling and shade avoidance. Their findings extend our understanding of how DNA organization, prior to shade, affects PIF binding and how PIF binding remodels DNA accessibility. The data presented support the conclusions well and are backed by sufficient experimental evidence.

      2. Description of the revisions that have already been incorporated in the transferred manuscript

      The manuscript has not been modified yet.

      3. Description of analyses that authors prefer not to carry out

      • *

      Reviewer 2 asked for new ChIP-seq analyses for PIF7 and PIF4. For reasons that we outlined above, we believe that such analyses are not required, and we currently do not intend performing these experiments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In their manuscript, Paulisic et al. investigate whether the transcriptional response of Arabidopsis seedlings to shade depends on chromatin accessibility, with a specific focus on PIF7-regulated genes. To this end, they perform ATAC-seq and RNA-seq, along with other experiments, on seedlings exposed to short and long shade and correlate the results with previously reported PIF7 and PIF4 ChIP-seq data. Based on their findings, they propose that shade-mediated transcriptional regulation may not require extensive remodeling of DNA accessibility. Specifically, they suggest that the open chromatin conformation allows PIFs to easily access and recognize their binding motifs, rapidly initiating gene expression in response to shade. This transcriptional response primarily depends on a transient increase in PIF stability and gene occupancy, with changes in chromatin accessibility occurring in only a small number of genes.

      Major comments:

      I have one issue that, in my opinion, requires more attention. To define the PIF7 target genes, which were later used to estimate whether PIF7 binds to open or closed chromatin and affects DNA accessibility after its binding, the authors compared the 4h LRFR data point from Willige et al. (2021) ChIP-seq with their 1h RNA-seq data point. This comparison might have missed early genes where PIF7 binds before the 1h time point but is no longer present on DNA at 4h. I understand the decision to choose the 4h Willige et al. ChIP-seq data point, performed under LD conditions, as it matches the data in this study, rather than the 5min-30min data points, which were conducted in constant light. However, if possible, it would be interesting to also compare the RNA-seq data with the early PIF7 binding genes to assess how many additional PIF7 target genes could be identified based on that comparison and whether this might alter the conclusions. If the authors do not agree with this point, it should at least be emphasized that the ChIP-seq data and the RNA-seq/ATAC-seq data were performed under different LRFR conditions (R/FR 0.6 vs. 0.1), which may lead to the misidentification of PIF7 target genes in the manuscript.

      Minor comments:

      - In Fig. 1D, please describe the meaning of the blue shaded areas and the blue lines under the ATAC-seq peaks, as they do not always correlate.

      - In Fig. 1E, it could be helpful to note that the 257 peaks in the right bar correspond to the peaks associated with the 177 genes in the left bar.

      - In lines 116, 119, and 122, I believe it should read "Fig. 2" instead of "Fig. 2A."

      Lines 138-139: "PIF7 total protein levels were overall more stable, and only a mild and non-significant increase of PIF7 levels was seen at 1 h of LRFR."

      Since PIF7 usually appears as two bands in HRFR and only one band in LRFR, how was the protein level of PIF7 quantified in Fig. 3A? Additionally, I was wondering about the authors' thoughts on the discrepancy with Willige et al. (2021, Extended Data Fig. 1d), where PIF7 abundance seems to increased after 30 min and 2 h of LRFR. Line 150: "... many early PIF target genes (Figure 3C)." Since only PIL1 is shown in Fig. 3C, I would recommend revising this sentence. Alternatively, the data could be presented, as in Fig. 2, for all the PIF7 target genes with transient expression patterns. - Line 204: I'm not sure if Supplementary Fig. 7C-D is correct here. If it is, could the order of the figures be changed so that Supplementary Fig. 7C-D becomes Supplementary Fig. 7A-B? - <br /> - Line 208: "In all three cases...". Please clarify what the three cases refer to. - <br /> - Line 231: Should Fig. 5C also be cited here in addition to Supplementary Fig. 7? - <br /> - In Supplementary Table 3, more information is needed. For example, it could mention: "This data is presented in Fig. 3 and is based on datasets from ChIP-seq, RNA-seq, etc." - <br /> - In the figure legend of Fig. 4B, please check the use of "( )".

      Significance

      Paulisic et al. present novel discoveries in the field of light signaling and shade avoidance. Their findings extend our understanding of how DNA organization, prior to shade, affects PIF binding and how PIF binding remodels DNA accessibility. The data presented support the conclusions well and are backed by sufficient experimental evidence.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study by Paulisic et al. explores the variations in chromatin accessibility landscape induced by plant exposure to light with low red/far-red ratios (LRFR), which mimicks neighbor shade perception. The authors further compare these changes with the genome association of PIF4 and PIF7 transcription factors - two major actors of gene expression regulation in response to LRFR. While this is not highlighted in the main text, the analyses of chromatin accessibility are performed on INTACT-mediated nucleus sorting, presumably to ensure proper and clean isolation of nuclei.

      Major comments

      • Why is the experimental setup exposing plants to darkness overnight? Does this affect the response to LRFR, by a kind of reset of phytochrome signaling? I guess this choice was made to maintain a strong circadian rhythm. Yet, given that PIF genes are clock-regulated, I am afraid that this choice complicates data interpretation concerning the specific effects of LRFR exposure.
      • As a result of this setup, the 1h exposure to LRFR immediately follows HRFR while the 3h final LRFR exposure of the « 25h LRFR » samples immediately follows a long period of darkness. Can this explain why in several instances (e.g., at the ATHB2 gene) 1h LRFR seems to have stronger effects than 25h LRFR on chromatin accessibility?
      • Lane 42 cites the work by Calderon et al 2022 as « Transcript levels of these genes increase before the H3K4me3 levels, implying that H3K4me3 increases as a consequence of active transcription ». Despite this previous study being reviewed and published, such a strong conclusion should be taken cautiously, and I disagree with it. The study by Calderon et al compares RNA-seq with ChIP-seq data, two methodologies with very different sensitivity, especially when employing bulk cells/whole seedlings as starting materials. For example, a gene strongly induced in a few cells will give a good Log2FC in RNA-seq data analysis (as new transcripts are produced after a low level of transcripts before shade) but, even though its chromatin variations would follow the same temporality or would even precede gene induction, this would be invisible in bulk ChIP-seq data analysis (which averages the signal of all cells together). I understand the rationale for relying on the conclusions made in an excellent lab with strong expertise in light signaling, but I recommend being cautious when relying on these conclusions to interpret new data.
      • The problem is that the same issue holds true when comparing ATAC-seq and RNA-seq data. ATAC signals reflect average levels over all cells while RNA-seq data can be influenced by a few cell highly expressing a given gene. Even though authors carefully sorted nuclei using an INTACT approach, this should be discussed, in particular when gene clusters (such as cluster C-D) show no match between chromatin accessibility and transcript level variations. In this regard, is PIF7 expressed in many cells or a small niche of cells upon LRFR exposure? The conclusions on its role in chromatin accessibility, analyzed here as mean levels of many different seedling cells, could be affected by PIF7 activity pattern (e.g., at lane 293).
      • Lane 89, the conclusion linking DNA methylation and DNA accessibility is unclear to me, this may be rephrased. Also, it should be noted that in gene-rich regions, most DNA methylation is located along the body of moderately to highly transcribing genes (gene-body methylation) while promoters of active and inactive genes are most frequently un-methylated.
      • Figure 3B shows a few ChIP-qPCR results with important conclusions. Why not sequencing the ChIPped DNA to obtain a genome-wide view of the PIF4-PIF7 relationships at chromatin, and also consequently a more robust genome-wide normalization?
      • Given the known functional interaction between PIF7 and INO80, it would be relevant to test whether changes in chromatin accessibility at ATHB2 and other genes are affected in ino80 mutant seedlings.
      • On the same line, it would be interesting to test whether PIF7 target regions with pre-existing accessible chromatin would exist in ino80 mutant plants. In other words, testing a model in which chromatin remodeling by INO80 defines accessibility under HRFR to enable rapid PIF recruitment and DNA binding upon LRFR exposure.

      Minor comments

      • In Figure 1C, it seems that PIF7 target genes do not match the set of LRFR-downregulated genes (even less than at random). Why not exclude these 4 genes from the analyses?
      • Figure 3A shows the quantification of protein blots, but I did not find the corresponding images. These should be shown in the figure or as a supplementary figure with proper controls.
      • Lane 102, it is unclear why PIF7 target genes were defined as the -3kb/TSS domains while Arabidopsis intergenic regions are on average much shorter. Gene regulatory regions, or promoters, are typically called within -1kb/TSS regions to avoid annotating a ChIP peak to the upstream gene or TE. A better proxy of PIF7 typical binding sites in gene regulatory regions could be determined by analysing the mean distance between PIF7 peak coordinates and the closest TSS. Typically, a gene meta-plot would give this information.
      • Figure 4B, what's represented in the ATAC-seq heatmap: does a positive z-score represent high accessibility?

      Significance

      Contradicting the naive hypothesis that PIFs may target shade-inducible genes to « open » chromatin of shade-inducible genes with the help of chromatin remodelers, such as INO80, the study highlights that PIF7 typically associates with pre-existing accessible chromatin states. Thus, even though this is not stated, results from this study indicate that PIF7 is not a pioneer transcription factor. The data seem very robust, and while some conclusions need clarification, it should be of great interest to the community of scientists studying plant light signaling and shade responses.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Plant systems sense shading by neighbors via the phytochrome signaling system. In the shade, PHYTOCHROME-INTERACTING FACTORS (PIFs) accumulate and are responsible for transcriptional reprogramming that enable plants to mobilize the "shade-avoidance response". Here, the authors have sought to examine the role of chromatin in modulating this response, specifically by examining whether "open" or "closed" chromatin regions spanning PIF target genes might explain the transcriptional output of these genes. They used a combination of ATAC-seq/CoP-qPCR (to detect open regions of chromatin), ChIP (to assay PIF binding) and RNA-seq (to measure transcript abundance) to understand how these processes may be mechanistically linked in Arabidopsis wild-type and pif mutant lines. They found that some chromatin accessibility changes do occur after LRFR (shade) treatment (32 regions after 1h and 61 after 25 h). While some of these overlap with PIF-binding sites, the authors found no correlation between open chromatin states and high levels of transcription. Because auxin is an important component of the shade-avoidance response and has been shown to control chromatin accessibility in other contexts, they examined whether auxin might be required for opening these regions of chromatin. They find that in an auxin biosynthesis mutant, there is a small subset of PIF target genes whose chromatin accessibility seems altered relative to the wild-type. Likewise, they found that chromatin accessibility for certain PIF targets is altered in phyB and pif mutant and propose that PIFs are necessary for changing the accessibility of chromatin in these genes. The authors thus propose that PIF occupancy of already open regions, rather than increased accessibility, underly the increase in transcript of abundance of these target genes in response to shade.

      Major comments:

      I find that the data generally support the hypothesis presented in the manuscript that chromatin accessibility alone does not predict transcription of PIF target genes in the shade. That said, I think that a paragraph from the discussion (lines 321-332) would benefit from some careful rephrasing. I think it is perfectly reasonable to propose that PIF occupancy is more predictive of shade-induced transcriptional output than chromatin accessibility, but I think that calling PIF occupancy "the key drivers" (line 323) or "the main driving force" (line 76) risks ignoring the observation that levels of PIF occupancy specifically do not predict expression levels of PIF target genes (Pfeiffer et al., 2014, Mol Plant). For PIL1 and HFR1, the authors have shown that PIF promoter occupancy and transcript levels are correlated, but the central finding of Pfeiffer et al. was that this pattern does not apply to the majority of PIF direct target genes. Finding factors (i.e. histone marks) that convert PIF-binding information into transcriptional output appears to have been the impetus for the experiments devised in Willige et al., 2021 and Calderon et al., 2022. It is great that the authors have outlined in the discussion that there are a number of factors that modulate PIF transcriptional activating activity but I think that the emphasis on PIF-binding explaining transcript abundance should be moderated in the text.

      I think that the hypothesis could be further supported by incorporating the previously published ChIP-seq data on PIF1, PIF3 and PIF5 binding. Given these data are published/publicly available, I think it would be helpful to note which of the 72 DARs are bound by PIF1, PIF3 and/or PIF5. Especially so given that PIF5 (Lorrain et al., 2008, Plant J) and PIF1/PIF3 (Leivar et al., 2012, Plant Cell) contribute at least in some capacity to transcriptional regulation in response to shade. At the very least, it might help explain some of the observed increases in nucleosome accessibility observed for genes that don't have PIF4 or PIF7-binding.

      In the manuscript, there are several instances where separate col-0 (wild type) controls have been used for identical experiments. Specifically, qPCR (Fig 3C, Fig S7C/D and Fig S8C/D), CoP-qPCR (Fig 5B/5C and Fig S8E/F) and hypocotyl measurements (Fig S7A/B and Fig S8A/B). In the cases of the hypocotyl measurements, there appear to be hardly any differences between col-0 controls indicating the measurements can be confidently compared between genotypes.

      In some cases of qPCR and CoP-qPCR experiments however, the differences in values obtained from col-0 samples that underwent identical experimental treatments appear to vary significantly. In Figure 3C for example, the overall trend for PIL1 expression in col-0 is the same (e.g. HRFR levels are low, LRFR1 levels are much higher and LRFR25 levels drop down to some intermediate level) but the expression levels themselves appear to differ almost two-fold for the LRFR 1h timepoint (~110 on the left panel vs ~60 for the right panel). Given the size of the error bars, it appears that these data represent the mean from only one biological replicate. PIL1 expression levels at LRFR 1h as reported in Fig S7C and D also show similar ~2-fold differences.

      I would recommend that the authors explicitly describe the number of biological replicates used for each experiment in the methods section. If indeed these experiments were only performed once, I think the authors should be very careful in the language used in describing their conclusions and in assigning statistical significance. One possibility that could also be helpful would be normalizing LRFR 1h and LRFR 25h values to HRFR values and plotting these data somewhere in the supplemental data. If, for example, the relative levels of PIL1 are different between replicates but the fold-induction between HRFR and LRFR 1h are the same, this would at least allay any concerns that the experimental treatments were not the same. I understand that doing so precludes comparison between genotypes, but I do think it's important to show that at least the control data are comparable between experiments.

      Similarly, for the CoP-qPCR experiments presented in Fig 5B and 5C, the col-0 values for region P2 between Fig 5B and 5C shows that while HRFR and LRFR 1h look comparable, the values presented for LRFR 25h are quite different.

      Minor comments:

      Presentation of Supplemental Figure 7A/7B and Supplemental Figure 8A/8B could be changed to make the data more clear (i.e. side-by-side rather than superimposed).

      I think that the paragraph introducing auxin (lines 25-37) could be reduced to 1-2 sentences and merged into a separate introductory paragraph given that the SAV3 work makes up a relatively minor component of the manuscript.

      For Figure 3A, I would strongly encourage the authors to show some of the raw western blot data for PIF4, PIF5 and PIF7 protein abundance (and loading control), not just the normalized values. This could be put into supplemental data, but I think it should accompany the manuscript.

      Lines 145-147 "we observed a strong correlation between PIF4 protein levels (Figure 3A) and PIL1 promoter occupancy (Figure 3B), and a similar behavior was seen with PIF7 (Figure 3B)." According to Fig 3A, there is no statistically significant increase in PIF7 abundance after 1h shade. There is an apparent increase in PIF7 promoter occupancy, but the variation appears too large for it to be statistically significant. I agree that qualitatively there is a correlation for PIF4 but I think the description of the behavior of PIF7 should be rephrased.

      There appear to be issues in the coloring of the labels (light blue dots vs dark blue dots) for the PIF7 panels of Fig 3B and Supplemental Fig 3B.

      Significance

      This authors here have sought to examine the possibility that the transcriptional responses to shade mediated by the phy-PIF system might involve large-scale opening or closing of chromatin regions. This is an important and unanswered question in the field despite several studies that have looked at the role of histone variants (H2A.Z) and modifications (H3K4me3 and H3K9ac) in modulating PIF transcriptional activating activity. The authors have shown that, at least in the case of the transcriptional response to shade mediated by PIF7 (and to an extent PIF4), large-scale changes in chromatin accessibility are not occurring in response to shade treatment.

      The results presented in this study support the hypothesis that large-scale changes in chromatin accessibility may have already occurred before plants see shade. This opens the possibility that perhaps the initial perception of light by etiolated (dark-grown seedlings) might trigger changes in chromatin accessibility, opening up chromatin in regions encoding "shade-specific" genes and/or closing chromatin in regions encoding "dark-specific" genes.

      The audience for this particular manuscript encompasses a fairly broad group of biologists interested in understanding how environmental stimuli can trigger changes in chromatin reorganization and transcription. The results here are important in that they rule out chromatin accessibility changes as underlying the changes in transcription between the short-term and long-term shade responses. They also reveal that there are a few cases in which chromatin accessibility does change in a statistically-significant manner in response to shade. These regions, and the molecular players which regulate their accessibility, merit further exploration.

      My fields of expertise are photobiology, photosynthesis and early seedling development.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements [optional]

      There were several points that were raised by multiple reviewers, which we respond to as follows.

      1. The reviewers pointed to a lack of clear comparison with experimental data. Perhaps this was insufficiently clear in the first submission, but the analysis of ECT-2 localization during cytokinesis was intended as a validation of the model, parameterized based on polarization and applied without further modification to cytokinesis. These situations differ in numerous respects: centrosome number, centrosome size, and we used several experimental conditions to control centrosome positioning. To address this more extensively, in the revised submission we analyzed our data further (Longhini and Glotzer, 2022) to extract profiles of ECT-2 and myosin. We used these profiles both to constrain model parameters (Appendix B.3) and to compare with model predictions for both polarization and cytokinesis (Figs. 3 and 5).
      2. All of the reviewers pointed to our assumption that myosin indirectly recruits ECT-2. We apologize for a lack of clarity in the original draft about this. We had intended to convey the hypothesis that ECT-2 is recruited by a species that is advected with myosin, but for the sake of the minimal model we do not introduce any extra equations for this species and instead assume it colocalizes with myosin. In the revised manuscript, we address this by clearly listing the assumption (#2 on p. 7), and by comparing to an alternative model (Eq. (S4) and Fig. S7) that accounts directly for a third advected species. We also document specifically (second panel from left in Fig. 4) why the short residence time of ECT-2 makes patterning by pure advection impossible. That said, we still do not know the identity of this factor.
      3. The reviewers pointed out that our use of the M4 term to limit contractility was dubious. This was a (probably misguided) attempt to use previously-published models to constrain our model. In the revised submission, we replaced this term with a more general nonlinear term Mk, where we first demonstrate that k = 1 is insufficient to match the data (p. 32), then consider k = 2,3. We present results in the main text for k = 2, while Fig. S5 shows that the corresponding results for k = 3 are not very different. Put another way, we empirically demonstrate that the specific form of this nonlinear term is not important, as long as it prevents contractile instabilities (as pointed out by one of the reviewers).
      4. Apparently, our extension of the model to cytokinesis, and the evidence for validation of the model, was not clear in the original draft. Because of this, we reformulated the section (3.4) and figure (5) on cytokinesis. We identified four representative examples of centrosome positions, then compared the experimental profile of ECT-2 accumulation to the model result. For simplicity, we also eliminated the simulations of the non-phosphorylatable inactive copy of ECT-2 (“ECT-2 6A”). A more detailed analysis of that data revealed that the pattern of accumulation of ECT-2 6A at cleavage furrowing was more similar to the end of polarization, indicating that this copy of ECT-2 appears to have much slower turnover than the endogenous copy (as would expected from phosphorylation-dependent membrane displacement).
      5. Fundamentally, our study addresses a similar question to (Illukkumbura et al., 2023), in the sense that we seek to understand how cortical flows could pattern ECT-2 and myosin, even though the residence time of ECT-2 is very low. Despite the similarities, it differs from the cited study in that ECT-2 is not an inert component that is asymmetrically distributed, but rather a component which regulates myosin levels and cortical flows, ultimately feeding back on its own accumulation. Due to these similarities and differences, we added an expository section in the discussion (p. 18) comparing our results to those of that study.

      Point-by-point description of the revisions

      This section is mandatory. Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript.

      Reviewer 1

      In this article, Maxian et al. propose a model combining 1-d simulations of ECT-2 and Myosin concentration at the cortex through binding/unbinding and advection at the cortex, with an input for AIR-1 cortical concentration based on the spatial localisation of the centrosomes in the cytoplasm. The objective of the authors is to recapitulate the role of (1) AIR-1, (2) its effector ECT-2 and (3) the downstream effector, driver of cortical flows, the molecular motors Myosin, in two key physiological processes, polarization and cell division. This is important as work over the last 10 years have emphasized the role of AIR-1 in embryo polarization. Previous biochemical-mechanical models have focused on RhoA/Myosin interactions (Nishikawa et al, 2017), the importance of a negative feedback and excitable RhoA dynamics (Michaux et al, 2018), or anterior PARs/posterior PARs/Myosin (Gross et al, 2019). The authors thus attempt to provide a new descriptive model in which RhoA is implicit, instead focusing on the role of centrosome localization on AIR-1 localization, and providing a framework to explore polarity establishment and cell division based on these 3 simple players. The first part of the model is very reminiscent of previously published models, while the second instead provides a link between the initial polarizing cue AIR-1 and polarization. Based on this description, the model is precisely tuned to achieve polarization while matching experimental observations of flow speed and ECT-2 A/P enrichment shape. The results are therefore certainly new and interesting.

      Thank you for the positive assessment!

      Major comments:

      1. The authors use the position of the centrosomes as a static entry, resulting in a static AIR1 input. Is this true, or are the positions of the centrosomes dynamically modulated over the course of the different processes simulated here (for example as a consequence of cortical flows?), and if so, is the assumption of immobile position?

      We assume that the centrosomes are fixed on the timescale of the cortical dynamics, and study how the cortex responds to a static AIR-1 signal (see clarifying comment on p. 4). In Fig. S4, we show that the cortex responds rapidly to changes in the existence or position of the AIR-1 signal. As such, slower dynamics might be the result of slowly moving centrosomes, as we show in supplementary simulations (Fig. S8).

      1. While in its principle the model is quite simple and elegant, the detailed form of the equations describing the interactions between the players is more complex. Are all these required? If they are crucially important for the behavior of the model, these should be described more thoroughly, and if possible rooted more directly in experimental results:

      Thank you for this comment. We agree that there were several non-trivial terms in our “minimal” model. Our guiding principle for the revision was to reduce complexity and better justify the terms that are included.

      (a) kMEMEc _(Linear enhancement term): why would myosin impact E concentration? The authors state, p.7, ”There is a modest increase in the recruitment rate of ECT-2 due to cortical myosin (directly or indirectly), in a myosin concentration-dependent manner (Longhini and Glotzer, 2022).” I could not find the data supporting this assumption Longhini and Glotzer apparently rather point to a modulation of cortical flows. (”During anaphase, asymmetric ECT-2 accumulation is also myosin-dependent, presumably due to its role in generating cortical flows.”). Embedding this effect in the recruitment rate instead of expecting it from the model thus appears awkward. Could the authors specify how they came to this conclusion, which the authors might have derived from observations made in their previous work, but maybe did not fully document there?

      This is an important issue. Since it was raised by all of the reviewers, we addressed it in our general comments. Throughout the manuscript (Figs. 4 and S4), we tried to highlight that cortical flows are insufficient to localize ECT-2, while the recruitment hypothesis provides a better match to the experimental data. The recruitment by an advected species was speculated upon in Longhini and Glotzer: ”Rather, we favor a model in which the association of ECT-2 with the cortex involves interactions with cortical component(s) that are concentrated by cortical flows.”

      (b) kEME2Mc (ECT-2 non-linear impact on Myosin): does the specific form of the value to convey the enhancement (square form) have an impact on the results?

      The specific form does not have an impact. In fact, in the revised version, our experimental data shows an asymmetry in myosin that is actually lower than ECT-2. As such, a nonlinear term here lacks justification, and we switched to a linear term of the form kEMEMc (see model equations on p. 6).

      (c) KfbM4 ”The form of this term is a coarse-grained version of previously-published work (Michaux et al., 2018).” Myosin feedback on myosin localization proportionally to_ M4 _does not seem to directly derive from Michaux et al. Please detail this points more extensively and detail the derivation, in the supplements if not in the main text.

      Based on this comment and that of reviewer 2, we decided to switch to a more general term for nonlinear negative feedback, as discussed in point 3 in general comments.

      (d) P23. Parameter values: ”This is 1.5 times longer than the estimate for single molecules (Nishikawa et al., 2017; Gross et al., 2019) to reflect the more long-lived nature of myosin foci during establishment phase (Munro et al., 2004).” Not sure what the authors mean by more long-lived duration of foci during establishment phase. Seems rather arbitrary.

      This was a misstatement on our part. A closer look at Gross et al. revealed that, under conditions similar to those we simulate (initial polarity establishment), the residence time of myosin is about 15 s (off rate 0.06 s−1). We modified our justification (p. 30) to include this. We also looked at the effect of longer myosin residence time on polarity establishment (Fig. S8).

      1. It would be very helpful (and indeed more convincing) to include a direct comparison between modeling results and experimental counterpart whenever possible. This might not be possible for some data (e.g. Fig. 3d from Cowan et al), but should be possible for other, in particular Fig. 3c and Fig. 5b, for the flow speed and ECT-2 profiles. In Fig. 5b in particular, previously published experimental data could be produced to give the reader to compare model with experiments (possibly provided as an inset, at least for the wild type conditions).

      We tried to bring in more data based on what was available from previous work (Longhini and Glotzer, 2022). Frame intervals of 10 s prohibited a PIV analysis for flow speeds, and punctate myosin profiles often made it difficult to measure myosin concentration. We were, however, able to extract the ECT-2 concentration from our previous movies and compare it to the model results. We included these comparisons in Figs. 3 and 5, with accompanying discussion in the text.

      Minor comments: 1. Fig. 5b: ECT-2 C 6A(dhc-1) do not seem to be referenced or discussed in the main text.

      Also, why present the results for the flow for 2 conditions and the ECT-2 localisation for 4? Or does the variation of ECT-2 not impact the flow profile?

      As discussed in general comments, we decided to reformulate the cytokinesis figure to incorporate more experimental data. Since we have detailed data on ECT-2 localization, we presented these in Fig. 5 for four experimental conditions, comparing each to the model.

      1. p.6: Given that the non-normalized data is used in the main text, and the normalized only appears in the supplemental, maybe star the dimensionless and remove all hats from the main for greater legibility?

      We changed the notation to make the main text variables (dimensional) unadorned, while the dimensionless variables in the SI now have hats.

      1. p.6: Eqn 1a: carrot missing on 3rd E?

      This is now a moot point because of the previous comment.

      1. p.14: replace_“embryo treatment” with ”experimental conditions”?

      We changed “embryo treatment” to “experimental conditions” globally.

      1. p.21, S4a: add_ A = A/A(Tot)

      We added it in the last display on p. 28.

      1. p.22: ”L = 134.6_ µ_m” - please write 134_ µ_m to retain the precision of original measurements

      We made this change.

      1. p.22: Please provide formula for all dimensionless values as a table at the end of the supplemental for the eager but less-mathematically proficient reader.

      We added Table 1 to list the relationship between dimensional and dimensionless parameters.

      Reviewer 2

      The manuscript by Maxian, Longhini and Glotzer presents purely modeling work performed by the first author in conjunction with the already published experimental work by Longhini and Glotzer (eLife, 2022). The aim of the manuscript is to provide a mathematical model that connects the actomyosin contractility of the cell cortex in C. elegans zygote with the activity of the centrosomal kinase AurA (AIR-1 in C. elegans). The major claim of the authors is that their model, fitted to the experimental data pertaining to the zygote polarization, also describes dynamics during the zygote cytokinesis. In the model, the authors provide a heuristic approach to the biochemical dynamics, reducing their treatment to two variables: myosin and Ect2 Rho GEF. The biochemical model is integrated with a simple 1D active gel-type model for the cortical flow. The model uses static diffusive field of activity of AurA kinase in the cytoplasm as an input to their chemo-mechanical model.

      Major concerns:

      1. The biochemical model is highly heuristic and several major assumptions are poorly justified. Thus, the authors explicitly introduce recruitment of Ect2 by myosin, something apparently based on the experimental observations by Longhini and Glotzer in 2022, which had not been biochemically confirmed since with a clear molecular mechanism.

      This is an important issue, and we appreciate your concern which was shared by the other reviewers. As discussed above on p. 1, we tried to justify this assumption better by (a) clearly stating it on p. 7, and (b) demonstrating that the dynamics we observe in live embryos are impossible without it. The model confirms what was pointed out by Longhini and Glotzer, that the short residence time of ECT-2, combined with in vivo flow speeds on the order of 10 µm/min, make it impossible for cortical flows alone to redistribute ECT-2.

      1. The contribution of AurA is introduced highly schematically as a term based on enzyme inhibition biochemistry that increases the off rate of Ect2. The major assumption of the model is that AurA phosphorylates Ect2 strictly on the membrane (cortex) of the cell. Why? No molecular justification is given. If the authors cannot provide clear justification, this major assumption has to be clearly declared as such. The phosphorylation/dephosphorylation dynamics of Ect2 is not considered at all.

      We clarified that the species we consider in the model (E) is unphosphorylated ECT-2, so that the negative flux comes from either unbinding or phosphorylation. Of course, AIR-1 phosphorylates ECT-2 in the cytoplasm as well, but our model only tracks the binding of unphosphorylated ECT-2 to the cortex. We clarified this on p. 6.

      1. In the equation for myosin, the authors introduce disassembly/ inactivation term proportional to the fourth order of concentration of myosin. Why? This is a major assumption, which appears to be derived from the work by Michaux et al. 2018. There the authors (Michaux et al.) postulated that the rate of inactivation of RhoA GTPase was somehow proportional to the fourth power of RhoA concentration. It appears that Maxian et al. further assume that the myosin concentration is fast variable enslaved by Rho, so that_ M ∼ _[RhoA]. They then presumably assume that if the rate of degradation/ inactivation of Rho is proportional to the fourth power of Rho concentration, so is true for myosin (M). This is a logical error and is not justified. An important question, why do the current authors need this unusual assumption with such a high power of M disassembly/inactivation? Perhaps, this is because without this rather dubious term the cortex flow produces a blow-up of myosin concentration? This would be expected in their mechanical model - the continuous flow of actomyosin not compensated by cortex disassembly generally causes blow-up of biochemical concentrations transported by the flow, this is a known problem of the “simple” active gel model used by the authors. Maxian et al. have to provide clear derivation of the term −KfbM4 _and also demonstrate why they need this exotic assumption.

      As mentioned above in general comments, this was a misguided attempt on our part to use previous literature to directly assign values to model parameters. In the revised manuscript, we considered a more general term for the nonlinear feedback. The fitting occurs in Fig. S3, where we impose the ECT-2 profile during pseudo-cleavage and try to fit the myosin profile. k = 1 is eliminated because the ECT-2 and myosin have different asymmetries. Higher order nonlinearities (k = 2,3) are successful in fitting the experimental data. In the main text, we present results from k = 2, then use Fig. S5 to present results on the k = 3 case.

      1. The equation for myosin M has a membrane-binding term, which is second order in concentration of Ect2~E2, without which the model will not show the instability that the authors need. The only justification given is that ”some nonlinearity is required”. A proper derivation should be given here.

      Our experimental data shows an asymmetry in myosin that is actually lower than ECT-2. As such, a nonlinear term in the binding rate lacks justification, and we switched to a linear term of the form kEMEMc (see model equations on p. 6).

      1. The diffusion coefficients for Ect2 and myosin are chosen to be the same. Why? Clearly these molecules so different in size - myosin being a gigantic cluster monster of size_ 300nm _believed to be bound to actin, should have a much smaller diffusion coefficient?

      Thank you for raising this point. We used the same diffusion coefficient for simplicity; because its dimensionless value is less than 10−4, diffusion is relatively unimportant in shaping the concentration fields. If we assume instead, for instance, that myosin cannot diffuse in the membrane, while ECT-2 has a ten-fold larger diffusion coefficient, the steady state profiles of ECT-2 and myosin are changed by at most 5% (see Fig. S6).

      1. There are confusing statements regarding the role of actomyosin flows. In the beginning of the manuscript, the authors seem to state that since Ect2 has a high off rate, the effect of the flow on Ect2 localization is negligible in comparison with direct binding to myosin. Later, the authors state that flows are absolutely essential for the patterning. The authors need to clearly explain where and how the flows are important or not.

      Thank you for pointing out this confusion. In the revised manuscript, we tried to be explicit that the combination of recruitment and flows is essential for patterning ECT-2. We did this in Figs. 4 and 5 by showing the results of simulations without recruitment (Fig. 4) and without recruitment and flows (Fig. 5).

      Minor points:

      1. page 9. Why is the rate of dephosphorylation of AurA is named Koff?

      We changed the notation to kinac to reflect inactivation.

      1. page 10. “Note that the model is calibrated to predict... which matches experimental observations” - this sentence needs changing. You want to say that you fit the model to experiments in the Longhini and Glotzer paper. There is no prediction here.

      We removed this sentence.

      1. page 14. “A plot of Ect-2 accumulation as a function of distance from the nearest cortex...” - clearly the word ”centrosome” is meant here instead of ”cortex”.

      What was meant by this sentence was the distance from the centrosome to the nearest cortex pole (anterior or posterior). We modified it to make this more clear (p. 15).

      1. page 16. ”Inactive, non-phosphorylatable version of Ect-2...” - non-phosphorylatable is clear, but why inactive?

      As discussed in general comments we decided to simplify the cytokinesis figure and remove the simulations with non-phosphorylatable ECT-2. While it is not relevant, the ECT-2 6A variant represents a fragment of the protein that lacks the catalytic domain. Our original goal was to use these data to track the ECT-2 localization without perturbing the system biochemistry, but the data gave the hint of longer exchange kinetics, which confounded our analysis.

      Reviewer 3

      _Maxian et al. developed a mathematical model to explain the essential elements and interactions necessary and sufficient for the polarisation of the C. elegans zygote. The initiation of zygote polarisation has been extensively studied in recent years, highlighting the role of the centrosomal kinase Aurora-A (AIR-1) in controlling the cortical distribution of RhoGEF (ECT-2) and actomyosin contractility during polarisation. Although genetic experiments have demonstrated their function in this process, it remains to be tested whether these factors and their interactions are sufficient to induce polarisation.

      This work has provided a theoretical framework to predict the activity of AIR-1 in the cytoplasm and at the cell cortex, and the cortical distribution of ECT-2 and myosin-II (NMY-2). This framework can recapitulate the dynamic rearrangement of ECT-2 and myosin-II during polarisation, with centrosomes positioned at the posterior pole of the zygote. This model can explain, at least in part, the asymmetric distribution of ECT-2 and myosin-II in the zygote undergoing cytokinesis, suggesting that the mechanism of AIR-1-mediated control of ECT-2 and myosin-II would regulate patterning during polarisation and cytokinesis. This theoretical framework is developed with reasonable assumptions based on previous genetic experiments (except for the myosin-dependent regulation of ECT-2; see comments below).

      Thank you for the positive assessment!

      Major issues:

      1. The authors insist that this model correctly predicts the spatio-temporal dynamics of ECT-2 and myosin-II during polarisation and cytokinesis. However, the predicted results do not reproduce the in vivo pattern of ECT-2 in both phases. ECT-2 is cleared from the posterior cortex and establishes a graded pattern across the antero-posterior axis during polarisation (see their previous publication in eLife 2022, 11, e83992, Fig1A -480s) and cytokinesis (see eLife 2022, 11, e83992, Fig1C 60s and 120s). During both stages, ECT-2 does not show local enrichment at the boundary between the anterior and posterior cortical domains in vivo. In fact, when comparing the predicted results with the in vivo pattern of ECT-2 and cortical flow, the authors used non-quantitative descriptions such as ’in good agreement’, ’a realistic magnitude’,, ’resemble’. These vague descriptions should be revised and a quantitative assessment of ECT-2 distribution between in silico and in vivo should be included in a revised manuscript.

      As mentioned on p. 1, in the revised manuscript we interacted with the data in a much stronger way. We first used data during pseudo-cleavage to infer the ECT-2/myosin relationship. We then examined (Fig. 3) quantitatively how the ECT-2 accumulation during polarization matches the experimental data (it matches early but not later stages). We repeated this for cytokinesis in Fig. 5, where we compared the ECT-2 profile across four experimental conditions to the model prediction.

      1. I assume that the strange local enrichment of ECT-2 at the anteroposterior boundary is due to their assumption that the binding rate of ECT-2 is increased by a linear increase via cortical myosin-II (page 6). This assumption is not directly supported by experimental evidence. A previous study by the same group (eLife 2022, 11, e83992) showed that a progressive increase in ECT-2 concentration at the anterior cortex is partially accompanied by an increase in cortical flow and transport of myosin-II from the posterior pole to the anterior cortex. This observation supports the idea that ECT-2 may associate with cortical components transported by myosin-II based cortical flow. This unrealistic assumption makes the predicted distribution pattern of ECT-2 almost identical to that of cortical myosin-II, resulting in an increase in the concentration of ECT-2 at the anteroposterior boundary where myosin-II forms pseudocleavages and cleavage furrows. The authors should clarify why their mathematical model used this assumption and provide a comprehensive analysis and evaluation of the parameter value for an ECT-2-myosin-II interaction.

      In the revised manuscript, we outlined the justification for this assumption after presenting the model equations. In the Appendix, we were able to constrain all parameters except the recruitment term. Then, we provided an analysis of how polarization changes when the recruitment term is increased. We show that the ECT-2 asymmetries with myosin flows are the same as those simply due to AIR-1 inhibition (since the lifetime of ECT2 is small). Adding indirect recruitment gives asymmetries that resemble experimental data from early establishment of polarity. We showed this both by assuming “myosin” (a species which colocalizes with myosin) recruits ECT-2 (Fig. S4) and by simulating an alternative model (Eq. (S4)) where an explicit species that is advected with cortical flows recruits myosin (Fig. S7).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: In this paper, the authors perform a screen by feeding C. elegans different E. coli genetic mutants and examining the effect on the expression of fat-7, a stearoyl-CoA 9-desturase, which has been associated with longevity. They identify 26 E. coli strains that decrease fat-7 expression, all of which slow development and increase lifespan. RNA sequencing of worms treated with 4 of these strains identified genes involved in defense against oxidative stress among those genes that are commonly upregulated. Feeding C. elegans these 4 bacterial strains results in increased ROS and activation of the mitochondrial unfolded protein response, which appears to contribute to lifespan extension as these bacterial strains do not increase lifespan when the mitochondrial unfolded protein response transcription factor ATFS-1 is disrupted. Finally, the authors demonstrate a role for iron levels in mediating these phenotypes: iron supplementation inhibits the phenotypes caused by the identified bacterial strains, while iron chelation mimics these phenotypes. Response: We thank the reviewer for an excellent summary of our work.

      Major comments: The proposed model involves an increase in ROS levels activating the UPRmt and then leading to lifespan extension. If the elevation is ROS levels is contributing then treatment with antioxidants should prevent UPRmt activation and lifespan extension. Response: This is an excellent point. We will treat the FAT-7-suppressing diets with antioxidants and observe the effect on C. elegans UPRmt activation and lifespan.

      The authors suggest that iron depletion may disrupt iron-sulfur cluster proteins. The Rieske iron-sulfur protein ISP-1 from mitochondrial electron transport chain complex III has previously been associated with lifespan. Point mutations affecting the function of ISP-1 or RNAi decreasing the levels of ISP-1 both result in increased lifespan (PMID 20346072, 11709184). Thus, iron depletion may be increasing ROS, activating UPRmt and increasing lifespan through decreasing ISP-1 levels.

      Response: The reviewer has raised an intriguing possibility that the increased lifespan on the FAT-7-suppressing diets could be because of perturbation of ISP-1 function. While ISP-1 levels may not be directly affected by the mutant diets, ISP-1 function might be perturbed on these diets as ISP-1 function requires iron-sulfur clusters. Therefore, we will study the lifespan of isp-1(qm150) mutant on the FAT-7-suppressing diets to explore whether the lifespan extension on these diets is ISP-1 dependent.

      All of the Kaplan-meier survival plots are missing statistical analyses. Please add p-values.

      Response: The p-values for all the survival plots are included in the respective figure legends.

      It would be helpful to include a model diagram of the proposed mechanisms in the main figures.

      Response: We will make a model diagram after completing the experiments suggested by the reviewers.

      Minor comments: Rather than "mutant diets" it would be more informative to call these "FAT-7-decreasing diets"

      Response: We have changed “mutant diets” to “FAT-7-suppressing diets” throughout the manuscript.

      Is it surprising that none of the bacterial strains increased FAT-7 levels? Why do you think this is?

      Response: Yes, it was indeed surprising to find only bacterial strains that reduced FAT-7 levels and none that increased them. One possible explanation is that these bacterial mutants may not directly regulate fat-7 expression. Instead, they might alter the overall dietary composition, which is known to influence fat-7 levels. It appears that none of the tested mutants modified the diet in a manner that would lead to fat-7 upregulation.

      Page 5. "We hypothesized that diets reducing FAT-7 might elevate oleic acid levels". Since FAT-7 converts stearic acid to oleic acid, wouldn't deceasing FAT-7 levels decrease oleic acid levels and increase stearic acid levels?

      Response: FAT-7 expression is regulated by a feedback mechanism and is sensitive to the fatty acid composition within host cells; elevated levels of unsaturated fatty acids, such as oleic acid, suppress FAT-7 expression. There are two possible ways bacterial mutants could lead to reduced FAT-7 levels: (1) by directly inhibiting FAT-7 expression, which would be expected to result in increased stearic acid levels; or (2) by supplying higher amounts of oleic acid through their composition, thereby suppressing FAT-7 expression via feedback regulation. We focused on the second possibility, as elevated oleic acid levels—like those seen with FAT-7-suppressing diets—are known to promote C. elegans lifespan. To avoid confusion, we have revised the statement to: “We hypothesized that bacterial diets might reduce FAT-7 expression because they have elevated levels of oleic acid”.

      Page 6. The authors cite Bennett et al. 2014 for the statement that "Activation of the UPRmt has been associated with lifespan extension". This paper reaches the opposite conclusion "Activation of the mitochondrial unfolded protein response does not predict longevity in Caenorhabditis elegans". Also, in the Bennett paper and PMID 34585931, it is shown that constitutive activation of ATFS-1 decreases lifespan. Thus, the relationship between the UPRmt and lifespan is not straightforward. These points should be mentioned.

      Response: The reviewer has raised an important point. We have now included a paragraph in the discussion to highlight these points. The revised manuscript reads: “All 26 FAT-7-suppressing diets identified in our study elevated hsp-6p::GFP expression and extended C. elegans lifespan. Although UPRmt activation and lifespan extension were consistently observed across these diets, there was no strong correlation between hsp-6p::GFP levels and the degree of lifespan extension. The role of the UPRmt in promoting longevity remains controversial (Bennett et al., 2014; Soo et al., 2021; Wu et al., 2018). For instance, gain-of-function mutations in atfs-1 have been shown to reduce lifespan (Bennett et al., 2014; Soo et al., 2021). However, a recent study demonstrated that mild UPRmt activation can extend lifespan, whereas strong activation has the opposite effect (Di Pede et al., 2025). These findings suggest that UPRmt contributes to longevity only under specific conditions and at specific activation levels. In our study, lifespan extension on FAT-7-suppressing diets was dependent on ATFS-1, indicating that UPRmt activation was necessary for this effect.

      Page 6. "Our transcriptomic analysis suggested elevated ROS". Rather than refer to gene expression, it would be better to refer to the ROS measurements that were performed.

      Response: We have changed it to the following sentence: “Our ROS measurement analysis suggested elevated ROS levels in worms fed FAT-7-suppressing diets.

      The long-lived mitochondrial mutants isp-1 and nuo-6 have increased ROS, UPRmt activation and increased lifespan. Multiple studies have examined gene expression in these long-lived mutant strains. How does gene expression in these mutants compare to worms treated with the FAT-7-decreasing E. coli mutants? While not necessary for this publication, it would be interesting to see whether the FAT-7-decreasing E. coli strains can increase isp-1 and nuo-6 lifespan.

      Response: We will compare the gene expression changes observed in isp-1 and nuo-6 mutants with the gene expression changes observed in worms exposed to FAT-7-suppressing diets. Additionally, we will examine the lifespan of isp-1 mutants on the mutant diets. These data will be included in the revised manuscript.

      SEK-1 is also involved in the p38-mediated innate immune signaling pathway, which has been shown to contribute to longevity in C. elegans. In fact, disruption of sek-1 using RNAi decreased the lifespan of several long-lived mutant strains PMID 36514863.

      Response: We thank the reviewer for highlighting this point. We have now added that the role of SEK-1 in regulating lifespan on FAT-7-suppressing diets could also be because of its role in innate immunity. The revised manuscript reads: “Notably, SEK-1 also regulates innate immunity and is essential for the extended lifespan observed in several long-lived C. elegans mutants (Soo et al., 2023). Therefore, its effect on lifespan in response to FAT-7-suppressing diets may also stem from its role in innate immune regulation.

      Figure 2. Why were cyoA and ycbk chosen to show the full Kaplan-meier survival plot?

      Response: These were selected randomly to show the range of the lifespan phenotype observed.

      Figure 2, panel D. A better title may be "Mean Survival (Percent increase from control)"

      Response: We have made this change.

      While not necessary for this paper, it would be interesting to determine whether the FAT-7-decreasing E. coli strains alter resistance to oxidative stress.

      Response: We will study the survival of worms on these diets upon supplementation with paraquat.

      Figure 4. It may be interesting to include a correlation plot comparing hsp-6::GFP fluorescence and lifespan. It looks like the magnitudes of increase for each phenotype are not correlated.

      Response: We have added a new Figure (Figure S4) to show the correlation between hsp-6::GFP fluorescence levels and percent change in mean lifespan. Indeed, there is no correlation between these phenotypes.

      Reviewer #1 (Significance (Required)):

      Overall, this is an interesting paper and the experiments are rigorously performed. The bacterial screen was comprehensive and was followed up by careful mechanistic experiments. This paper will be of interest to researchers studying the biology of aging. A diagram of the working model of the underlying mechanisms would enhance the paper. Response: We thank the reviewer for highlighting the significance of the study. We will include a model in the revised manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Das et al. investigate how different bacterial mutants affect the lifespan of C. elegans. The authors screened a library of E. coli mutants using a fat-7 reporter and identified 26 strains that reduce fat-7 expression, cause developmental delay, induce the mitochondrial unfolded protein response (using hsp-6 reporter), and increase worm lifespan. Among these, they focused on four strains and demonstrated that the effects of these mutants on developmental delay, fat-7 expression, and hsp-6 induction could be suppressed by iron supplementation. Furthermore, they showed that iron depletion alone is sufficient to induce fat-7 expression in worms. The lifespan extension observed in worms fed these mutant bacterial strains depends on SKN-1, SEK-1, and HLH-30. Overall, this is a well-written manuscript that highlights the role of iron in regulating fat-7 expression. However, the findings from the initial screen do not significantly expand upon what is already known in the literature. Many of the identified hits overlap with those reported by Zhang et al. (2019), which also highlighted the role of iron in developmental delay and hsp-6 induction. While the lifespan data and the role of fat-7 are novel aspects of this study, the authors have not conducted detailed mechanistic investigations to address key questions, such as: 1) How does the deletion of these bacterial genes alter the metabolic state of the diet? 2) How do these metabolic changes influence fat-7 expression in worms? 3) How does the downregulation of fat-7 contribute to longevity? Addressing these points would strengthen the mechanistic insights of the study.

      Response: We thank the reviewer for a thoughtful summary of our work and for the valuable feedback provided to improve the manuscript. We would like to emphasize that the screening conditions and objectives of our study were fundamentally different from those of Zhang et al. (2019). Furthermore, Zhang et al. (2019) did not investigate the effects of the bacterial mutants identified in their screens on C. elegans lifespan. Notably, the 26 bacterial mutants identified in our screen do not overlap with those reported in previous studies that examined bacterial strains promoting C. elegans longevity. As detailed below, we will address the points raised by the reviewer that will certainly strengthen the mechanistic insights of the study.

      Here are my detailed comments: 1. Suppressing FAT-7 levels in C. elegans does not inherently increase lifespan. To directly attribute this effect to FAT-7, it would be important to attempt a rescue experiment to restore FAT-7 expression and assess whether the lifespan extension persists. Additionally, measuring oleic acid levels in these mutants would help determine whether a high-oleic-acid diet is suppressing FAT-7 expression. The role of oleic acid cannot be ruled out using fat-2 mutants (Fig. 3B), as fat-2 mutants accumulate oleic acid when fed WT bacteria, but this may not translate to endogenous oleic acid accumulation in conditions where FAT-7 is suppressed.

      Response: We thank the reviewer for these useful suggestions. We will overexpress FAT-7 under a pan-tissue promoter (eft-3) and study lifespan on FAT-7-suppressing diets. Moreover, to explore whether oleic acid has any role in enhancing lifespan on FAT-7-suppressing diets, we will study the lifespan of worms on these diets upon supplementing with oleic acid along with wild-type bacterium control.

      To understand the host-microbe interaction in this study, it is important to determine what specific changes in the bacteria contribute to the observed phenotypes in worms. Identifying these bacterial factors will provide a clearer picture of their role in influencing worms stress signaling and lifespan.

      Response: The phenotypes observed in C. elegans across all the identified bacterial mutants are remarkably consistent, including increased UPRmt activation, reduced FAT-7 levels, delayed development, and extended lifespan. This consistency suggests that a common underlying factor is driving these effects. Although the bacterial mutants appear genetically diverse, gene expression data from C. elegans, along with comparisons to the findings of Zhang et al. (2019), indicate that elevated levels of reactive oxygen species (ROS) may represent this shared factor. These results suggest that bacterial ROS play a central role in mediating the host-microbe interactions underlying the observed phenotypes. To further support this hypothesis, we will directly measure ROS levels in the identified bacterial mutants. Additionally, we will test whether antioxidant treatment can suppress the C. elegans phenotypes, thereby establishing a causal role for bacterial ROS.

      It is important to rule out any changes in food consumption in worms fed these bacterial mutants, as differences in feeding amount could attribute to the observed lifespan effects.

      Response: We will carry out pharyngeal pumping rate measurements to study whether there is any difference in food consumption in worms fed these bacterial mutants.

      In figure 5A to 5G, please include the same-day controls to help clarify how iron supplementation effects these phenotypes relative to the control. For example, in Fig. 5F, it appears that iron extends the lifespan of worms fed the control diet. It would be clearer if appropriate controls were included in all of these figures or summarized in a table to help understand the impact of iron.

      Response: We will include these controls in the revised manuscript.

      How does iron depletion affect the levels of fat-7, and how does this contribute to the activation of the longevity pathways discussed in the manuscript.

      Response: This is an intriguing question. There are at least two possible explanations: (1) oxidative stress may directly downregulate fat-7 expression, and (2) iron depletion could reduce ferroptosis, which in turn may influence fatty acid metabolism. In the revised manuscript, we will include data on how oxidative stress affects FAT-7 expression.

      Minor comments 1. Please include a detailed table of the lifespan data for all replicates as a supplementary table.

      Response: We have included the details of survival curves for all the data in the new Table S2.

      In the Methods section, specify at what stage the worms were exposed to iron and the iron chelator for the lifespan experiments.

      Response: The L1-synchronized worms were exposed to iron and iron chelator plates and allowed to develop till the late L4 stage before being transferred to lifespan assay plates that also contained the respective supplements. This information is now included in the Methods section.

      Please clarify whether equal optical density (O.D.) of cells was seeded for both the WT and mutant strains, and mention if the mutants exhibit any growth defects.

      Response: We have examined the growth of the bacterial mutants and found that they do not exhibit growth defects. Therefore, for all the assays, NGM plates were seeded with saturated cultures of all the bacterial strains. We have now included the growth curves data in the manuscript (Figure S4).

      Reviewer #2 (Significance (Required)):

      Significance General Assessment: This study by Das et al. explores the impact of bacterial mutants on C. elegans lifespan. A key strength of the study is the identification of bacterial mutants that influence the expression of the gene encoding fatty acid desaturase (fat-7) and lifespan in C. elegans. Furthermore, the study highlights the role of iron in regulating fat-7 expression, suggesting that iron imbalance may play a crucial role in modulating fatty acid metabolism. However, the study's main limitation is that it does not significantly extend the current understanding of the microbial modulation of host metabolism and aging, as many of the identified bacterial hits overlap with those previously reported in Zhang et al. (2019). The manuscript would benefit from more in-depth mechanistic exploration, especially with regard to how specific bacterial factors influence the metabolic state of the worms and how these changes ultimately modulate fat-7 expression and longevity.

      Response: We thank the reviewer for highlighting the significance of our study. Once again, we would like to emphasize that the screening conditions and objectives of our study differed fundamentally from those of Zhang et al. (2019). Furthermore, Zhang et al. did not investigate the impact of the bacterial mutants identified in their screen on C. elegans lifespan. As outlined above, we will address the reviewer’s comments, which will undoubtedly strengthen the mechanistic insights of our study.

      Advance: This study presents a conceptual advance by exploring the iron-dependent regulation of fat-7 expression and lifespan in C. elegans, linking bacterial mutations with key longevity pathways (SKN-1, SEK-1, and HLH-30). The novelty lies in the direct investigation of the bacterial-induced changes in fat-7 expression, though the role of iron in these mutants for development and induction of mito-UPR was previously shown in the literature. This study also adds to the growing body of work on C. elegans as a model for studying aging and host-microbe interactions, particularly in understanding how diet and microbial exposure affect metabolic processes and lifespan.

      Response: We thank the reviewer for highlighting the advancement made by our study.

      Audience: This research will primarily interest specialized audiences in aging research, microbiology, and metabolism, especially those focused on host-microbe interactions. Keywords of my expertise: Host-microbe interactions, metabolism, system biology, C. elegans, aging.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Das et al. investigate how different bacterial mutants affect the lifespan of C. elegans. The authors screened a library of E. coli mutants using a fat-7 reporter and identified 26 strains that reduce fat-7 expression, cause developmental delay, induce the mitochondrial unfolded protein response (using hsp-6 reporter), and increase worm lifespan. Among these, they focused on four strains and demonstrated that the effects of these mutants on developmental delay, fat-7 expression, and hsp-6 induction could be suppressed by iron supplementation. Furthermore, they showed that iron depletion alone is sufficient to induce fat-7 expression in worms. The lifespan extension observed in worms fed these mutant bacterial strains depends on SKN-1, SEK-1, and HLH-30.

      Overall, this is a well-written manuscript that highlights the role of iron in regulating fat-7 expression. However, the findings from the initial screen do not significantly expand upon what is already known in the literature. Many of the identified hits overlap with those reported by Zhang et al. (2019), which also highlighted the role of iron in developmental delay and hsp-6 induction. While the lifespan data and the role of fat-7 are novel aspects of this study, the authors have not conducted detailed mechanistic investigations to address key questions, such as: 1) How does the deletion of these bacterial genes alter the metabolic state of the diet? 2) How do these metabolic changes influence fat-7 expression in worms? 3) How does the downregulation of fat-7 contribute to longevity? Addressing these points would strengthen the mechanistic insights of the study.

      Here are my detailed comments:

      1. Suppressing FAT-7 levels in C. elegans does not inherently increase lifespan. To directly attribute this effect to FAT-7, it would be important to attempt a rescue experiment to restore FAT-7 expression and assess whether the lifespan extension persists. Additionally, measuring oleic acid levels in these mutants would help determine whether a high-oleic-acid diet is suppressing FAT-7 expression. The role of oleic acid cannot be ruled out using fat-2 mutants (Fig. 3B), as fat-2 mutants accumulate oleic acid when fed WT bacteria, but this may not translate to endogenous oleic acid accumulation in conditions where FAT-7 is suppressed.
      2. To understand the host-microbe interaction in this study, it is important to determine what specific changes in the bacteria contribute to the observed phenotypes in worms. Identifying these bacterial factors will provide a clearer picture of their role in influencing worms stress signaling and lifespan.
      3. It is important to rule out any changes in food consumption in worms fed these bacterial mutants, as differences in feeding amount could attribute to the observed lifespan effects.
      4. In figure 5A to 5G, please include the same-day controls to help clarify how iron supplementation effects these phenotypes relative to the control. For example, in Fig. 5F, it appears that iron extends the lifespan of worms fed the control diet. It would be clearer if appropriate controls were included in all of these figures or summarized in a table to help understand the impact of iron.
      5. How does iron depletion affect the levels of fat-7, and how does this contribute to the activation of the longevity pathways discussed in the manuscript.

      Minor comments

      1. Please include a detailed table of the lifespan data for all replicates as a supplementary table.
      2. In the Methods section, specify at what stage the worms were exposed to iron and the iron chelator for the lifespan experiments.
      3. Please clarify whether equal optical density (O.D.) of cells was seeded for both the WT and mutant strains, and mention if the mutants exhibit any growth defects.

      Significance

      General Assessment: This study by Das et al. explores the impact of bacterial mutants on C. elegans lifespan. A key strength of the study is the identification of bacterial mutants that influence the expression of the gene encoding fatty acid desaturase (fat-7) and lifespan in C. elegans. Furthermore, the study highlights the role of iron in regulating fat-7 expression, suggesting that iron imbalance may play a crucial role in modulating fatty acid metabolism. However, the study's main limitation is that it does not significantly extend the current understanding of the microbial modulation of host metabolism and aging, as many of the identified bacterial hits overlap with those previously reported in Zhang et al. (2019). The manuscript would benefit from more in-depth mechanistic exploration, especially with regard to how specific bacterial factors influence the metabolic state of the worms and how these changes ultimately modulate fat-7 expression and longevity.

      Advance: This study presents a conceptual advance by exploring the iron-dependent regulation of fat-7 expression and lifespan in C. elegans, linking bacterial mutations with key longevity pathways (SKN-1, SEK-1, and HLH-30). The novelty lies in the direct investigation of the bacterial-induced changes in fat-7 expression, though the role of iron in these mutants for development and induction of mito-UPR was previously shown in the literature. This study also adds to the growing body of work on C. elegans as a model for studying aging and host-microbe interactions, particularly in understanding how diet and microbial exposure affect metabolic processes and lifespan.

      Audience: This research will primarily interest specialized audiences in aging research, microbiology, and metabolism, especially those focused on host-microbe interactions.

      Keywords of my expertise: Host-microbe interactions, metabolism, system biology, C. elegans, aging.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this paper, the authors perform a screen by feeding C. elegans different E. coli genetic mutants and examining the effect on the expression of fat-7, a stearoyl-CoA 9-desturase, which has been associated with longevity. They identify 26 E. coli strains that decrease fat-7 expression, all of which slow development and increase lifespan. RNA sequencing of worms treated with 4 of these strains identified genes involved in defense against oxidative stress among those genes that are commonly upregulated. Feeding C. elegans these 4 bacterial strains results in increased ROS and activation of the mitochondrial unfolded protein response, which appears to contribute to lifespan extension as these bacterial strains do not increase lifespan when the mitochondrial unfolded protein response transcription factor ATFS-1 is disrupted. Finally, the authors demonstrate a role for iron levels in mediating these phenotypes: iron supplementation inhibits the phenotypes caused by the identified bacterial strains, while iron chelation mimics these phenotypes.

      Major comments:

      The proposed model involves an increase in ROS levels activating the UPRmt and then leading to lifespan extension. If the elevation is ROS levels is contributing then treatment with antioxidants should prevent UPRmt activation and lifespan extension.

      The authors suggest that iron depletion may disrupt iron-sulfur cluster proteins. The Rieske iron-sulfur protein ISP-1 from mitochondrial electron transport chain complex III has previously been associated with lifespan. Point mutations affecting the function of ISP-1 or RNAi decreasing the levels of ISP-1 both result in increased lifespan (PMID 20346072, 11709184). Thus, iron depletion may be increasing ROS, activating UPRmt and increasing lifespan through decreasing ISP-1 levels.

      All of the Kaplan-meier survival plots are missing statistical analyses. Please add p-values.

      It would be helpful to include a model diagram of the proposed mechanisms in the main figures.

      Minor comments:

      Rather than "mutant diets" it would be more informative to call these "FAT-7-decreasing diets"

      Is it surprising that none of the bacterial strains increased FAT-7 levels? Why do you think this is?

      Page 5. "We hypothesized that diets reducing FAT-7 might elevate oleic acid levels". Since FAT-7 converts stearic acid to oleic acid, wouldn't deceasing FAT-7 levels decrease oleic acid levels and increase stearic acid levels?

      Page 6. The authors cite Bennett et al. 2014 for the statement that "Activation of the UPRmt has been associated with lifespan extension". This paper reaches the opposite conclusion "Activation of the mitochondrial unfolded protein response does not predict longevity in Caenorhabditis elegans". Also, in the Bennett paper and PMID 34585931, it is shown that constitutive activation of ATFS-1 decreases lifespan. Thus, the relationship between the UPRmt and lifespan is not straightforward. These points should be mentioned.

      Page 6. "Our transcriptomic analysis suggested elevated ROS". Rather than refer to gene expression, it would be better to refer to the ROS measurements that were performed.

      The long-lived mitochondrial mutants isp-1 and nuo-6 have increased ROS, UPRmt activation and increased lifespan. Multiple studies have examined gene expression in these long-lived mutant strains. How does gene expression in these mutants compare to worms treated with the FAT-7-decreasing E. coli mutants? While not necessary for this publication, it would be interesting to see whether the FAT-7-decreasing E. coli strains can increase isp-1 and nuo-6 lifespan.

      SEK-1 is also involved in the p38-mediated innate immune signaling pathway, which has been shown to contribute to longevity in C. elegans. In fact, disruption of sek-1 using RNAi decreased the lifespan of several long-lived mutant strains PMID 36514863.

      Figure 2. Why were cyoA and ycbk chosen to show the full Kaplan-meier survival plot?

      Figure 2, panel D. A better title may be "Mean Survival (Percent increase from control)"

      While not necessary for this paper, it would be interesting to determine whether the FAT-7-decreasing E. coli strains alter resistance to oxidative stress.

      Figure 4. It may be interesting to include a correlation plot comparing hsp-6::GFP fluorescence and lifespan. It looks like the magnitudes of increase for each phenotype are not correlated.

      Significance

      Overall, this is an interesting paper and the experiments are rigorously performed. The bacterial screen was comprehensive and was followed up by careful mechanistic experiments. This paper will be of interest to researchers studying the biology of aging. A diagram of the working model of the underlying mechanisms would enhance the paper.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):*

      Summary: Chitin is a critical component of the extracellular matrix of arthropods and plays an essential role in the development and protection of insects. There are two chitin synthases in insects: Type A (exoskeletons) and Type B (for the peritrophic matrix in the gut). The study aims to investigate the specificity and mechanisms of the two chitin synthases in D. melanogaster and to clarify whether they are functionally interchangeable. Various genetic manipulations and fluorescence-based labeling were used to analyze the expression, localization, and function of Kkv and Chs2 in different tissues. Chs2 is expressed in the PR cells of the proventriculus and is required for chitin deposition in the peritrophic matrix. Kkv can deposit chitin in ectodermal tissues but not in the peritrophic matrix, whereas Chs2 can deposit chitin in the peritrophic matrix but not in ectodermal tissues. The subcellular localization of chitin synthases is specific to the tissues in which they are expressed. Kkv localizes apically in ectodermal tissues, whereas Chs2 localizes apically in the PR cells of the proventriculus. Altogether, Kkv and Chs2 cannot replace each other. The specificity of chitin synthases in D. melanogaster relies on distinct cellular and molecular mechanisms, including intracellular transport pathways and the specific molecular machinery for chitin deposition.*

      • *

      Congratulations on this incredible story and manuscript, which is straightforward and well-written. However, I have some comments that may help to improve it.

      We thank the reviewer for this very positive comment. We have addressed all comments to clarify and improve our manuscript.

      Major comments: 1.) Funny thing: the Chs2 mutant larva shows a magenta staining below the chitin accumulation of the esophagus, which looks like a question mark in 1H but cannot be found in control. Is that trachea reaching the pv?

      We assume that the reviewer refers to Fig 1N. As the reviewer suspects, this corresponds to a piece of trachea. Figure 1N shows a single section, making it difficult to identify what this staining corresponds to. We are providing below a projection of several sections where it is easier to identify the staining as tracheal tissue (arrow).

      We are now marking this pattern as trachea (tr) in the manuscript Figure 1N

      2.) Also, though it is evident that the PM chitin is lost in Ch2 mutants, could it be that the region is disturbed and cells express somewhere else chitin? There are papers by Fuß and Hoch (e.g., Mech of Dev, 79, 1998; Josten, Fuß et al., Dev. Biol.267, 2004) using markers such as Dve, Fkh, Wg, Delta, and Notch, etc. for precisely marking the endodermal/ectodermal region in the embryonic foregut/proventriculus. It would be beneficial to show, along with chitin and Chs expression patterns, the ectoderm/endoderm cells. This is particularly important as the authors report endodermal expression of Chs2 in embryos but don't use co-markers of the endodermal cells.

      We agree with the reviewer that this is an important issue and we note that Reviewer 2 also raised the same point. Therefore, we have addressed this issue.

      We obtained an antibody against Dve, kindly provided by Dr. Hideki Nakagoshi. Dve marks the endodermal region in the proventriculus (Fuss and Hoch, 1998, Fuss et al., 2004, Nakagoshi et al., 1998).This antibody worked nicely in our dissected L3 digestive tracts and allowed us to mark the endodermal region. We also obtained an antibody against Fkh, kindly provided by Dr. Pilar Carrera. Fkh marks the ectodermal foregut cells (Fuss and Hoch, 1998, Fuss et al., 2004). While, in our hands, this antibody performed well in embryonic tissues, we observed no staining in our dissected L3 digestive tracts. The reason for this is unclear, but we suspect technical limitations may be responsible (the ectodermal region of the proventriculus is very internal, potentially hindering antibody penetration). To circumvent this inconvenience, we tested a FkhGFP tagged allele available in Bloomington Stock Center. Fortunately, we were able to detect GFP in ectodermal cells of L3 carrying this allele. Using this approach, we conducted experiments to detect Fkh and Dve in the wild type or in Df(Chs2) conditions (Fig S1). In addition, we used these markers to map the expression of Kkv and Chs2 in the proventriculus (Fig 4).

      Altogether the results using these endodermal/ectodermal markers confirmed the presence of a cuticle adjacent to the FkhGFP-positive cells and a PM adjacent to the PR cells, marked by Dve. This PM is absent in Df(Chs2) L3 escapers, however, the general pattern of Fkh/Dve expression is not affected. Finally, we show that Chs2-expressing cells are positive for Dve while Kkv-expressing cells are not. We were unable to conduct an experiment demonstrating Kkv and Fkh co-expression due to technical incompatibilities, as both genes require the use of GFP-tagged alleles to visualise their expression. However, we believe that our imaging of Dve/Kkv clearly shows that Kkv expressing cells lack Dve expression and are localised in the internal (ectodermal) region of the proventriculus (Fig 4E).

      3.) The origin of midgut chitin accumulation is unclear. Chitin can come from yeast paster. Can the authors check kkv and chs2 mutants for food passage and test starving L1 larvae to detect chitin accumulation in the midgut without feeding them?

      This is a very interesting point that has also intrigued us.

      We observed that, in addition to the PM layer lining the midgut epithelium, CBP staining also revealed a distinct luminal pattern. Our initial hypothesis was that this pattern corresponded to the PM. However, its presence in Df(Chs2) larval escapers clearly indicates that this is not the case. Unfortunately, we cannot assess this pattern in kkv mutants, as these die at eclosion and do not proceed to larva stages.

      As the reviewer suggests, a likely possibility is that the luminal pattern originates from components in the food. These could correspond to yeast, as suggested by the reviewer, or possibly remnants of dead larvae present in the media (although Drosophila is considered herbivore in absence of nutritional stress).

      To assess whether the luminal pattern originates from the food we conducted two independent experiments. In experiment 1, we collected larvae reared under normal food conditions. Newly emerged L3 larvae were transferred in small numbers to minimise cannibalism (Ahmad et al., 2015) to new Petri plates containing moist paper. Larvae were starved for 3,4 or 5 days. Larvae starved for more than 5 days did not survive. We then dissected the guts and analysed CBP staining. We observed the presence of luminal CBP staining in these larvae, along with the typical PM signal in the proventriculus and along the midgut. In experiment 2, we collected larvae directly on agar plates containing only agar (without yeast or any other nutrients). We allowed the larvae to develop. These larvae showed minimal growth. We dissected the guts of these small larvae (which were challenging to dissect) and analysed CBP staining. Again, we detected presence of luminal CBP staining.

      These experiments indicate that, despite starvation, a luminal chitin pattern is still detected, suggesting that it is unlikely to originate from food. However, we cannot unequivocally rule out the possibility that the cannibalistic, detrivorous or carnivorous behavior of the nutrionally stressed larvae (Ahmad et al., 2015) in our experiments may influence the results. Therefore, more experiments would be required to address this point.

      In summary, while we cannot provide a definitive answer to the reviewer's question, nor fully satisfy our own curiosity, we would like to note that this specific observation is unrelated to the main focus of our study, as we have confirmed that the luminal pattern is not dependent on Chs2 function.

      Portions of midgut of starved larvae under the regimes indicated, stained for chitin (CBP, magenta). Note the presence of the luminal chitin pattern in the midgut

      4.) Subcellular localization assays require improved analysis, such as a co-marker for the apical membrane and statistical analysis with co-localization tools, showing the overlap at the membrane and intracellularly with membrane co-markers and KDEL.

      We have addressed the point raised by the reviewer. To analyse and quantify Chs2 subcellular localisation, particularly considering the observed pattern, we decided to use both a membrane and an ER marker. As a membrane marker we used srcGFP expressed in tracheal cells (see answer to point 7 of Reviewer 1) and as an ER marker we used KDEL. In this analysis, tracheal cells also expressed Chs2, which was visualised using the Chs2 antibody generated in the lab.

      To assess the colocalisation of Chs2 with each marker we used the JaCop pluggin in Fiji. We analysed individual cells from different embryos stained for membrane/ER/Chs2 using single confocal sections (to avoid artificial colocalisation). Images were processed as described in Materials and Methods. We obtained the Pearson's correlation coefficient (r), which measures the degree of colocalisation, for Chs2/srcGFP and Chs2/KDEL, n=36 cells from 9 different embryos. The average r value for Chs2/srcGFP was 0,064, while the average for Chs2/KDEL was around 0,7. r ranges between -1 and 1, where 1 indicates perfect correlation, 0 no correlation, and -1 perfect anti-correlation. Typically, an r value of 0.7 and above is considered a strong positive correlation, whereas a value below 0,1 is regarded as very weak or no correlation. Thus, our colocalisation analysis supports the hypothesis that Chs2 is primarily retained in the ER when expressed in non-endogenous tissues, likely unable to reach the membrane.

      We have reorganised the figures and now present an example of Chs2/srcGFP/KDEL subcellular localisation in tracheal cells and the colocalisation analysis in Fig 5H. The colocalisation analysis is described in the Materials and Methods section.

      Minor comments:

      5.) The authors used "L3 larval escapers." It would be interesting to know if the lack of Chs2 and the peritrophic matrix cause any physiological defects or lethality.

      The point raised by the reviewer is very interesting and relevant. The peritrophic matrix is proposed to play several important physiological roles, including the spatial organisation of the digestive process, increasing digestive efficiency, protection against toxins and pathogens, and serving as a mechanical barrier. Therefore, it is expected that the absence of chitin in the PM of the Df(Chs2) larval escapers may cause various physiological effects.

      Analysing these effects is a complex task, and it constitutes an entire research project on its own. In addressing the physiological requirements of the PM, we aim to analyse adult flies and assess various parameters, including viability, digestive transit dynamics, gut integrity, resistance to infections, fitness and fertility.

      A critical initial challenge in conducting a comprehensive analysis of the physiological requirements of the PM is identifying a suitable condition to evaluate the absence of Chs2. In this work we are using a combination of two overlapping deficiencies that uncover Chs2, along with a few additional genes (as indicated in Fig S1F). This deficiency condition presents two major inconveniences: first, the observed defects could be caused or influenced by the absence of genes other than Chs2, preventing us from conclusively attributing the defects to Chs2 loss (unless we rescued the defects by adding Chs2 back as we did in the manuscript). Second, the larva escapers, which are rare, do not survive to adulthood (indicating lethality but preventing us from analysing specific physiological aspects).

      To overcome these limitations, we are currently working to identify a genetic condition in which we can specifically analyse the absence of Chs2. We have identified several available RNAi lines and we are testing their efficiency in preventing chitin deposition in the PM. Additionally, we are characterising a putative null Chs2 allele, Chs2CR60212-TG4.0. This stock contains a Trojan-GAL4 gene trap sequence in the third intron, inserted via CRISPR/Cas9. As described in Flybase (https://flybase.org/), the inserted cassette contains a 'Trojan GAL4' gene trap element composed of a splice acceptor site followed by the T2A peptide, the GAL4 coding sequence and an SV40 polyadenylation signal. When inserted in a coding intron in the correct orientation, the cassette should result in truncation of the trapped gene product and expression of GAL4 under the control of the regulatory sequences of the trapped gene. We already know that, when crossed to a reporter line (e.g. UAS-GFP or UAS-nlsCherry) this line reproduces the Chs2 expression pattern, suggesting that the insertion may generate a truncated Chs2 protein. This line would represent an ideal tool to assess the absence of Chs2, and we are currently characterising it for further analysis

      In summary, we fully agree with the reviewer that investigating the physiological requirements of the PM is a compelling area of research, and we are actively addressing this question. However, this investigation constitutes a substantial and independent research effort that we believe is beyond the scope of the current manuscript at this stage.

      6.) The order identifiers are missing for materials and antibodies, e.g., anti-GFP (Abcam), but Abcam provides several ant-GFP; which was used? Please provide order numbers that guarantee the repeatability for others.

      We have now added all identifiers for materials and reagents used, in the materials and methods section.

      7.) Figure S5C, C', what marks GFP (blue) in the trachea? Maybe I have overlooked the description. What is UASsrcGFP? What is the origin of this line?

      We apologise for not providing a more detailed description of the UASsrcGFP line. This line corresponds to RRID BDSC#5432, as now indicated in Materials and Methods section.

      In this transgene, the UAS regulatory sequences drive the expression of GFP fused to Tag:Myr(v-src). As described in Flybase (https://flybase.org/), the P(UAS-srcEGFP) construct contains the 14 aa myristylation domain of v-src fused to EGFP. This tag is commonly used to target proteins of interest to the plasma membrane. The construct was generated by Eric Spana and is available in Drosophila stock centers.

      We typically use this transgene as a plasma membrane marker to outline cell membrane contours. In our experiments, srcGFP, under the control of the btlGal4 promoter, was used to visualise the membrane of tracheal cells in relation to Chs2 accumulation. As indicated in point 4, we have now transferred the images of srcGFP/Chs2/KDEL to the main Figures and used it for colocalisation analyses.

      8.) The authors claim that they validated the anti-Chs2 antibody. However, they show only that it recognizes a Cht2 epitope via ectopic expression. For more profound validation, immune staining is required in deletion mutants, upon knockdown, or upon expression of recombinant proteins, which is not shown.

      We generated an antibody against Chs2. We found that the antibody does not reliably detect the endogenous Chs2 protein, and so we find no pattern in the proventriculus or any other tissue in our immunostainings. It is very possible that the combination of low endogenous levels of Chs2 with a sub-optimal antibody (or low titer) leads to this result. In any case, as the antibody does not detect endogenous Chs2, it cannot be validated by analysing the expression upon Chs2 knockdown. In contrast, our antibody clearly detects specific staining in various tissues (e.g. trachea, salivary glands, gut) when Chs2 is expressed using the Gal4/UAS system, confirming its specificity for Chs2. It is worth to point that it is not unusual to find antibodies that are not sensitive enough to detect endogenous proteins but can detect overexpressed proteins (e.g

      (Lebreton and Casanova, 2016)).

      As an additional way to validate the specificity of our antibody, we have used the chimeras generated, as suggested by the reviewer. As indicated in the Materials and Methods section, the Anti-Chs2 was generated against a region comprising 1222-1383 aa in Chs2, with low homology to Kkv. This region is present in the kkv-Chs2GFP chimera but absent in Chs2-KkvGFP (see Fig 7A). Accordingly, our antibody recognises kkv-Chs2GFP but does not recognise Chs2-KkvGFP (Fig S7).

      We have revised the text in chapter 6 (6. Subcellular localisation of Chs2 in endogenous and ectopic tissues) to clarify these points and we have added the validation of the antibody using the chimeras in chapter 8 (8. Analysis of Chs2-Kkv chimeras) and Fig S7

      9) The legend and text explaining Fig. 4 D-E' can be improved. The authors used the Crimic line, which is integrated into the third ("coding") intron. This orientation can lead to the expression of Gal4 and cause a truncated version of the protein (according to Flybase). Is Chs2 expression reduced in the crimic mutant? If the mutation causes expression of a truncated version, the Chs2 antibody may not be able to detect it as it recognizes a fragment between 1222 and 1383 aa? Also, I'm unsure whether the Chs2 antibody or GFP was used to detect expression in PR cells. The authors describe using Ch2CR60212>SrcGFP together with Chs2+ specific antibodies.

      We apologise for the confusion.

      As the reviewer points, Chs2CR60212-TG4.0 contains a Trojan-GAL4 gene trap sequence in the third intron, inserted via CRISPR/Cas9. As described in Flybase (https://flybase.org/), the inserted cassette contains a 'Trojan GAL4' gene trap element composed of a splice acceptor site followed by the T2A peptide, the GAL4 coding sequence and an SV40 polyadenylation signal. When inserted in a coding intron in the correct orientation, the cassette should result in truncation of the trapped gene product and expression of GAL4 under the control of the regulatory sequences of the trapped gene.

      We found that when crossed to UAS-GFP or UAS-nlsCherry, this line reproduces a expression pattern that must correspond to Chs2. As the antibody that we generated is not suitable for detecting Chs2 endogenous expression, we resorted to using this combination, Chs2CR60212-TG4.0 crossed to a reporter line (such asUAS-GFP or UAS-nlsCherry), to visualise Chs2 expression by staining for GFP/Cherry in the intestinal tract and in the embryo (Figures 4 and S4).

      We realise that the Figure labelling we used in our original submission is very misleading, and we apologise for this. In the original figures we had labelled the staining combination with Kkv, Chs2, Exp as if we had used these antibodies. However, in all cases, we used GFP to visualise the pattern of these proteins in the genetic combinations indicated in the figures. We have corrected this in our revised version. We have also updated the text (Chapter 5), figures and figure legends.

      As the reviewer points, the insertion in Chs2CR60212-TG4.0 is likely to generate a truncated Chs2 protein. We cannot confirm this using the Chs2 antibody we generated because it does not recognise the endogenous Chs2 pattern. Nevertheless, as indicated in point 5, we are currently characterising this line. Our preliminary results indicate a high complexity of effects from this allele that require thorough analysis, as it may be acting as a dominant negative.

      Reviewer #1 (Significance (Required)):

      Significance: The manuscript's strength and most important aspects are the genetic analysis, expression, and localization studies of the two Chitin synthases in Drosophila embryos and larvae. However, beyond this manuscript, the development of mechanistic details, such as interaction partners that trigger secretion and action at the apical membranes and the role of the coiled-coil domain, will be interesting.

      The manuscript uses "first-class" genetics to describe the different roles of the two Chitin synthases in Drosophila, comparing ectodermal chitin (tracheal and epidermal chitin) with endodermal (midgut) chitin. Such a precise analysis has not been investigated before in insects. Therefore, the study deeply extends knowledge about the role of Chitin synthases in insects.

      The audience will specialize in basic research in zoology, developmental biology, and cell biology regarding - how the different Chitin synthases produce chitin. Nevertheless, as chitin is relevant to material research and medical and immunological aspects, the manuscript will be fascinating beyond the specific field and thus for a broader audience.

      I'm working on chitin in the tracheal system and epidermis in Drosophila.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __ Drosophila have two different chitin synthase enzymes, Kkv and Chs2, and due to unique expression patterns and mutant phenotypes, it is relatively clear that they have different functions in producing either the cuticle-related chitin network (Kkv) or the chitin associated with the peritrophic matrix (PM). However, what is unknown is whether the different functions in making cuticle vs PM chitin is related to differences in cellular expression and/or enzyme properties within the cell. The authors exploit the genetic tractability of Drosophila and their ability to image cuticle vs PM chitin production to examine whether these 2 enzymes can substitute each other. They conclude that these two proteins are not equivalent in their capacity to generate chitin. The data are convincing; however, it is currently presented in a subjective fashion, which makes it difficult to interpret. Additionally, in my opinion there is some interpretation that requires softening or alternatively interpreted.

      We are pleased that the reviewer finds our data convincing. However, we acknowledge the reviewer's concern that our data was presented in a subjective manner, and we apologise for this. In response, we have carefully reviewed the entire manuscript and revised our data presentation to ensure a more objective tone. Numerous changes (including additional quantifications, new experiments and clarifications) have been incorporated throughout the text. These revisions are highlighted in the marked-up version. We hope that this revision provides a more accurate and objective presentation of our work.

      Major Comments:

      1- While the imaging is lovely, there are some things that are difficult to see in the figures. For example, the "continuous, thin and faint 'chitin' layer that lined the gut epithelium" is very difficult to visualise in the control images. Can they increase the contrast to help the reader appreciate this layer? This is particularly important as we are asked to appreciate a loss of this layer in the absence of Chs2.

      We have tried to improve the figures so that the PM layer in the midgut region is more clearly visible. We have added magnifications of small sections at the midgut lumen/epithelium border in grey to help visualise the PM. These improvements have been made in Figures 1,2,S1,S2,S3 and we believe that they better illustrate our results.

      2- All the mutant analysis is presented subjectively. For example, the authors state that they "found a consistent difference of CBP staining when they compared the 'Chs2' escapers to the controls". How consistent is consistent? Can this be quantified? What is the penetrance of this phenotype? They say that the thin layer is absent in the midgut and the guts are thinner. Could they provide more concrete data?

      As indicated above, we have reviewed the text to provide a more objective description of the phenotypes.

      We have quantified the defects in the Df(Chs2) mutant conditions. For this quantification we dissected intestinal tracts of control and Df(Chs2) larva escapers. We fixed, stained and mounted them together. The control guts expressed GFP in the midgut region as a way to distinguish control from mutants. We analysed the presence or absence of chitin in the PM. We found absence of chitin in the proventricular lumen and in the midgut in all Df(Chs2) guts and presence of chitin there in all control ones (n=12 Df(Chs2) guts, n=9 control guts, from 5 independent experiments). The results indicate a fully penetrant phenotype of lack of chitin in Df(Chs2) larva escapers (100% penetrance). We have added this quantification in the text, chapter 2 (2. Chs2 deposits chitin in the PM).

      To quantify the thickness of the guts, we took measurements of the diameter in control and Df(Chs2) guts at two comparable distance positions from the proventriculus (position 1, position 2, see image). Our quantifications indicated thinner tubes in mutant conditions.

      Image shows the anterior part of the intestinal tract, with the proventriculus encircled in white. Positions 1 and 2 indicate where the diameter quantifications were taken. Scatter plots quantifying the diameter at the two different positions in control and Chs2 larval escapers. Bars show mean {plus minus} SD. p=p value of unpaired t test two-tailed with Welch's correction.

      However, we are aware that our analysis of the thickness of the gut is not accurate, because we have not used markers to precisely measure at the same position in all guts and because we have not normalised the measurement position in relation to the whole intestinal tract (mainly due to technical issues).

      In relation to the fragility, we noticed that the guts of Chs2 larval escapers tended to break more easily during dissection than control guts, however, we have not been able to quantify this parameter in a reliable and objective manner.

      Since we consider that the requirement of Chs2 for PM deposition is sufficiently demonstrated, and that aspects such as gut morphology or fragility relate to the physiological requirements of the PM, which we are beginning to address as a new independent project (see our response to point 5 of Reviewer 1), we have decided to remove the sentence 'We also noticed that the guts of L3 escapers were thinner and more fragile at dissection." from the manuscript to avoid subjectivity.

      3- They state that Chs2 was able to restore accumulation of chitin in the PM of the proventriculus and the midgut. Please quantify. Additionally, does this restore the morphology of the guts (related to the comment above on the thinner guts in the absence of Chs2)?

      We have quantified the rescue of chitin deposition in the PM when Chs2 is expressed in PR cells in a Df(Chs2) mutant background. For this quantification we used the following genetic cross: PRGal4/Cyo; Df(Chs2)/TM6dfdYFP (females) crossed to UASChs2GFP or UASChs2/Cyo; Df(Chs2)/TM6dfdYFP. We selected Df(Chs2) larval escapers by the absence of TM6 (recognisable by the body shape). Among these larval escapers, we identified the presence of Chs2 in PR cells by the expression of GFP or Chs2. We found absence of chitin in the proventriculus and in the midgut in all Df(Chs2) guts that did not express Chs2 in PR cells (n=8/8 Df(Chs2)). In contrast, chitin was present in those intestinal tracts where Chs2 expression was detected in PR cells (n=8/8 PRGal4-UASChs2; Df(Chs2) guts, from 5 independent experiments). The results indicate a full rescue of chitin deposition by Chs2 expression in PR cells in Df(Chs2) mutant larvae. We have added this quantification in the text, chapter 2 (2. Chs2 deposits chitin in the PM).

      As requested by the reviewer, we have also conducted measurements to quantify gut thickness. We performed an analysis similar to the one described in point 2, this time comparing the diameter of Df(Chs2) and PRGal4-UASChs2;Df(Chs2) guts at positions 1 and 2 (see image in point 2 of Reviewer 2). Our quantifications indicated that guts were thicker when Chs2 is expressed in the PR region in Df(Chs2) larval escapers.

      As discussed in point 2, we have decided not to include these results in the manuscript, as this type of analysis requires a more comprehensive investigation.

      Scatter plots quantifying the diameter at the two different positions in Chs2 larval escapers and Chs2 larval escapers expressing Chs2 in PR cells. Bars show mean {plus minus} SD. p=p value of unpaired t test two-tailed with Welch's correction.

      4- This may be beyond the scope of this paper, but I find it interesting that the PM chitin is deposited in the proventricular lumen. Yet it forms a thin layer that lines the entire midgut? Any idea how this presumably dense chitin network gets transported throughout the midgut to line the epithelium? I imagine that this is unlikely due to diffusion, especially if they see an even distribution across the midgut. Do they see any evidence of a graded lining (i.e. is it denser in the midgut towards the proventriculus and does this progressively decrease as you look through the midgut?)?

      Insect peritrophic matrices have been classified into Type I and II (with some variations) depending on their origin (extensively reviewed in (Peters, 1992, Hegedus et al., 2019). Type I PMs are typically produced by delamination as concentric lamellae along the length of the midgut. Type II PMs, in contrast, are produced in a specialised region of the midgut that corresponds to the proventriculus and are typically more organised than Type I. In Type II PMs, distinct layers originate from distinct cell clusters in the proventriculus. It has been proposed that as food passes, it becomes encased by the extruded PM, which then slides down to ensheath the midgut. Drosophila larvae have been proposed to secrete a type II PM: through PM implantation experiments, Rizki proposed that the proventriculus is required to generate the PM in Drosophila larvae (Rizki, 1956). Our experiments confirmed this hypothesis: we show that expressing Chs2 exclusively in PR cells is sufficient to produce a PM along the midgut. Furthermore, we also show that expressing Chs2 in the midgut is not sufficient to produce a PM layer lining the midgut, at least at larval stages.

      The type II PM in Drosophila is proposed to be fully organised into four layers in the proventricular region (also referred as PM formation zone) before reaching the midgut (Peters, 1992, King, 1988, Rizki, 1956, Zhu et al., 2024). However, the mechanism by which the PM is subsequently transported into the midgut remains unclear. PM movement posteriorly is thought to depend on to the pressure exerted by continuous secretion of PM material (Peters, 1992). Early work by Wigglesworth (1929, 1930) proposed that the PM is secreted into the proventricular lumen, becomes fully organised, and is then pushed down by a press mechanism involving the aposed ectodermal/endodermal walls of the proventriculus. Rizki suggested that muscular contractions of the proventriculus walls may play a role, and that peristaltic movements of the gut add a pulling force to push the PM into the midgut (Rizki, 1956). Nevertheless, to our knowledge, the exact mechanism is still not fully understood.

      In response to the reviewer's question, the level of resolution of our analysis does not allow us to determine whether there is a graded PM lining along the midgut. However, available data using electron microscopy approaches suggest that the PM is a fully organised structure composed of four layers that is secreted and transported to line the midgut (King, 1988, Zhu et al., 2024).

      5- The authors state that expression of kkv in tracheal cells of kkv mutants perfectly restores accumulation of chitin in the luminal filaments. Is this really 100% restoration? They also reference a paper here, which may have quantified this result.

      We previously reported that the expression of kkv in tracheal cells restores chitin deposition in kkv mutants (Moussian et al,2015). However, our previous study did not quantify this rescue. As requested by the reviewer, we have now quantified the extent of the rescue.

      To perform this quantification, we used the following genetic cross:

      btlGa4/(Cyo); kkv/TM6dfdYFP (females) crossed to +/+; kkv UASkkvGFP/TM6dfdYFP (males)

      We stained the resulting embryos with CBP (to detect chitin) and GFP. GFP staining allowed us to identify the kkv mutants (by the absence of dfdYFP marker) and to simultaneously identify the embryos that expressed kkvGFP in tracheal cells (through btlGal4-driven expression). Since btlGal4 is homozygous viable, most females carried two copies of btlGal4.

      We compared the following embryo populations across 4 independent experiments:

      1. Cyo/+; kkv/kkv UASkkvGFP (kkv mutants not expressing kkv in the trachea)
      2. btlGal4/+; kkv/kkv UASkkvGFP (kkv mutants expressing kkv in the trachea) Results:

      3. Cyo/+; kkv/kkv UASkkvGFP ---- 0/6 embryos deposited chitin in trachea

      4. btlGal4/+; kkv/kkv UASkkvGFP ---- 27/27 embryos deposited chitin in trachea These results indicate complete restauration of chitin deposition in kkv mutants when kkv is expressed in tracheal cells (100% rescue).

      To further investigate whether Chs2 can compensate for kkv function in ectodermal tissues, we performed a similar quantification using the following genetic cross:

      btlGa4/(Cyo); kkv/TM6dfdYFP (females) crossed to UASChs2GFP/UASChs2GFP; kkv UASkkvGFP/TM6dfdYFP (males)

      We compared the following embryo populations across 2 independent experiments:

      1. Cyo/UASChs2GFP; kkv/kkv (kkv mutants not expressing Chs2 in the trachea)
      2. btlGal4/ UASChs2GFP; kkv/kkv (kkv mutants expressing Chs2 in the trachea) Results:

      3. Cyo/UASChs2GFP; kkv/kkv ---- 0/4 embryos deposited chitin in trachea

      4. btlGal4/ UASChs2GFP; kkv/kkv ---- 0/16 embryos deposited chitin in trachea These results indicate no restauration of chitin deposition in kkv mutants expressing Chs2 in the trachea (0% rescue).

      We have now incorporated these quantifications in the text, chapter 4 (4. Chs2 cannot replace Kkv and deposit chitin in ectodermal tissues.)

      6- They ask whether Kkv overexpression in the proventriculus can rescue Chs2 mutants... and vice versa, whether Chs2 overexpression in ectodermal cells can rescue kkv mutants. They show that kkv overexpression leads to an intracellular accumulation of chitin in the proventriculus. However, Chs2 overexpression in the trachea did not lead to any accumulation of chitin in the cells. They tailored their experiments and the associated discussion to address the hypothesis that there is potentially some difference in trafficking of these components. However, another possibility, which they have not ruled out, is that the different ability of kkv and Chs2 to produce chitin inside cells of the proventriculus and ectoderm, respectively, is potentially related to different enzymatic activities and cofactors required for chitin formation in these different cell types. Is this another potential explanation for the differences that they observe?

      We note that Kkv overexpression in any cell type (e.g. ectoderm, endoderm) consistently leads to chitin polymerisation. In ectodermal tissues, Kkv expression, in combination with Exp/Reb activity, results in extracellular chitin deposition. In the absence of Exp/Reb, Kkv expression leads to the accumulation of intracellular chitin punctae (De Giorgio et al., 2023, Moussian et al., 2015); this work). This correlates with the accumulation of Kkv at the apical membrane and presence of Kkv-containing vesicles, regardless of the presence of Exp/Reb (De Giorgio et al., 2023, Moussian et al., 2015); Figure 6, S6). In endodermal tissues, regardless of the presence of Exp/Reb, Kkv cannot deposit chitin extracellularly and instead produces intracellular chitin punctae. This correlates with a diffuse accumulation of Kkv in the endodermal cells (PR cells, or gut cells in the embryo) but presence of Kkv-containing vesicles (Figure 6, S6).

      In previous work we showed that Kkv's ability to polymerise chitin is completely abolished when it is retained in the ER. Indeed, we found that a mutation in a conserved WGTRE region leads to ER retention, the absence of Kkv-containing vesicles in the cell, and absence of intracellular chitin punctae or chitin deposition (De Giorgio et al., 2023).

      These findings indicate a correlation between Kkv subcellular localisation and chitin polymerisation/extrusion. Therefore, we hypothesise that intracellular trafficking and subsequent subcellular localisation play a crucial role in regulating Kkv activity (De Giorgio et al., 2023; this work).

      We find that Chs2 is expressed in PR cells (Figure 4) and observe that only in these PR cells does Chs2 localise apically (Fig 5A-D, S5A,B). This localisation correlates with the ability of Chs2 to deposit chitin in the PM and the presence of intracellular chitin punctae in PR cells (Fig 1F). When Chs2 is expressed in other cells types, we detect it primarily in the ER and observed no Chs2-containing vesicles (vesicles are suggestive of trafficking). This localisation correlates with the inability of Chs2 to produce intracellular chitin punctae or extracellular chitin deposition.

      Again, these results suggest a correlation between Chs2 subcellular localisation and chitin polymerisation/extrusion, aligning with the results observed for Kkv. Therefore, we hypothesise in this work that the intracellular trafficking and subsequent subcellular localisation of Chs2 play a crucial role in regulating its activity.

      Our hypothesis is consistent with seminal work in yeast chitin synthases, which has demonstrated the critical role of intracellular trafficking, and particularly ER exit, in regulating chitin synthase activity (reviewed in (Sanchez and Roncero, 2022).

      That said, we cannot exclude other explanations that are also compatible with the observed results. As pointed out by the reviewer, it is possible that Chs2 and Kkv require different enzymatic activities and/or cofactors for chitin polymerisation/deposition, which may be specific to different cell types. Indeed, we know that the auxiliary proteins Exp/Reb are specifically expressed in certain ectodermal tissues (Moussian et al., 2015). These mechanisms could act jointly or in parallel with the regulation of intracellular trafficking, or could even regulate this intracellular trafficking itself.

      Identifying the exact mechanisms controlling Kkv and Chs2 intracellular trafficking would be necessary to determine whether additional mechanisms (specific cofactors or enzymatic activities) are also involved or even serve as the primary regulatory elements.

      We have introduced these additional possibilities in the discussion section.

      7- They co-express Chs2 and Reb and show that this does not lead to chitin production or secretion. In the discussion they conclude that Chs2 does not "seem to be dependent on 'Reb' activity". I think that this statement potentially needs softening. They show that Reb is not sufficient in to induce Chs2 chitin production in cells that do not normally make a PM. However, they do not show that it is not essential in cells that normally express Chs2 and make PM.

      We fully agree with the reviewer's observation and thank her/him for pointing it out.

      As indicated by the reviewer, we show that co-expression of Reb and Chs2 in different tissues does not lead to an effect distinct from that observed with Chs2 expression alone. In addition, in the discussion we mention that we could not detect expression of reb/exp in PR cells, which aligns with the findings from Zhu et al, 2024, indicating no expression of reb/exp in the midgut cells of the adult proventriculus, as assessed by scRNAseq. We found that exp is expressed in the ectodermal cells of the larval proventriculus (Fig S4D), correlating with kkv expression in this region and cuticle deposition. These findings led us to propose that Chs2 does not seem to be dependent on Exp/Reb activity.

      However, in our original manuscript, we did not directly address whether Exp/Reb are required in the cells that normally express Chs2. As a result, we could not conclude that Chs2 relies on a set of auxiliary proteins different from Exp/Reb, and therefore a different molecular mechanism to that of Kkv in regulating chitin deposition.

      To address this specific point, we have conducted a new experiment to test Exp/Reb requirement in PR cells. We co-expressed RNAi lines for Exp/Reb in these cells and found that chitin deposition in the PM was not prevented. This further supports the hypothesis that Exp/Reb activity is not necessary for Chs2 function. We have added this experiment to Chapter 4 and Fig S3I,J.

      8- They looked at the endogenous expression pattern of kkv and Chs2 and say that they found accumulation of Kkv in the proventriculus and no accumulation in the midgut. Siimilarly, they look at the expression of Chs2 and detect it in cells of the proventriculus. Are there markers of these different cell types that they could use to colocalize these enzymes?

      We agree with the reviewer that this is an important issue and we note that Reviewer 1 also raised the same point. Therefore, we have addressed this issue.

      We obtained an antibody against Dve, kindly provided by Dr. Hideki Nakagoshi. Dve marks the endodermal region in the proventriculus (Fuss and Hoch, 1998, Fuss et al., 2004, Nakagoshi et al., 1998).This antibody worked nicely in our dissected L3 digestive tracts and allowed us to mark the endodermal region. We also obtained an antibody against Fkh, kindly provided by Dr. Pilar Carrera. Fkh marks the ectodermal foregut cells (Fuss and Hoch, 1998, Fuss et al., 2004, Nakagoshi et al., 1998). While, in our hands, this antibody performed well in embryonic tissues, we observed no staining in our dissected L3 digestive tracts. The reason for this is unclear, but we suspect technical limitations may be responsible (the ectodermal region of the proventriculus is very internal, potentially hindering antibody penetration). To circumvent this inconvenience, we tested a FkhGFP tagged allele available in Bloomington Stock Center. Fortunately, we were able to detect GFP in ectodermal cells of L3 carrying this allele. Using this approach, we conducted experiments to detect Fkh and Dve in relation to chitin accumulation in the wild type (Fig S1). In addition, we used these markers to map the expression of Kkv and Chs2 in the proventriculus (Fig 4). Our results using these endodermal/ectodermal markers confirmed the presence of a cuticle adjacent to the FkhGFP-positive cells and a PM adjacent to the PR cells, marked by Dve. Additionally, we show that Chs2-expressing cells are positive for Dve while Kkv-expressing cells are not. We could not conduct an experiment showing Kkv and Fkh co-expression due to technical incompatibilities, as we have to use GFP tagged alleles for both Kkv and Fkh to reveal their expression. However, we believe that our imaging of Dve/Kkv clearly shows that Kkv expressing cells lack Dve expression and localise in the internal (ectodermal) region of the proventriculus (Fig 4E).

      9- They overexpress Chs2 in cells of the midgut and see that it colocalises with an ER marker. They conclude that it is retained in the ER, which again, for them suggests that it has a trafficking problem in these cells. However, they are overexpressing it in these cells and this strong accumulation that they observe in the ER could simply be due to the massive expression levels. Additionally, they cannot conclude that it doesn't get out of the ER at all. They could be correct in thinking that there may be a trafficking issue, but this experiment does not conclusively show that Chs2 is entirely retained in the ER when expressed in ectopic tissues. I wonder if their interpretation needs softening or whether they should potentially address alternative hypotheses.

      The reviewer raises two distinct issues: 1) the localisation of overexpressed proteins 2) Chs2 ER retention.

      We agree that massive overexpression can lead to artifactual subcellular localisation due to saturation of the secretory pathway, causing ER accumulation. In our experiments, we overexpressed Kkv and Chs2 in different tissues (trachea, salivary glands, embryonic gut, and larval proventriculus), inducing high levels of both chitin synthases.

      For Kkv, we observed distinct subcellular localisation patterns in ectodermal versus endodermal tissues (illustrated in new Fig S6). In ectodermal tissues such as the trachea, large amounts of KkvGFP were detected, most of it localising apically. We also detected a more general KkvGFP distribution throughout the cell, including the ER, particularly at early stages. Additionally, we observed many KkvGFP-positive vesicles, reflecting exocytic and endocytic trafficking, as described previously (De Giorgio et al., 2023). The presence of these vesicles (as well as the apical localisation) indicates that KkvGFP is able to exit the ER. Indeed, our previous work demonstrated that when Kkv is retained in the ER, it does not localise apically or appear in vesicles (De Giorgio et al, 2023). In endodermal tissues, as described in our manuscript, KkvGFP did not exhibit polarised apical localisation and instead showed a diffuse pattern with some cortical enrichment. However, the presence of KkvGFP-containing vesicles still suggests that the protein is capable of exiting the ER also in these endodermal tissues.

      We observed a different subcellular pattern when we overexpressed Chs2GFP. In tissues where Chs2 is not normally expressed (e.g., trachea, salivary gland, embryonic gut), we did not detect apical or membrane accumulation (see Fig. 5,S5, S6 and response to point 4 of Reviewer #1). Nor did we observe accumulation of Chs2GFP in intracellular vesicles. Instead, Chs2GFP showed strong colocalisation with an ER marker (see Fig. 5,S5, S6 and response to point 4 of Reviewer #1). In contrast, when overexpressed in PR cells, we detected apical enrichment (Fig 5A-D, S5A,B). This indicates that despite massive expression levels, Chs2 can exit the ER in particular tissues.

      Taken together, our results strongly suggest that overexpressed Kkv can exit the ER in the different tissues analysed, whereas most Chs2GFP is retained in the ER in tissues other than PR cells. This correlates with the ability of overexpressed KkvGFP to polymerise chitin (either in intracellular puncta or deposited extracellularly depending on the presence of Exp/Reb) in all analysed tissues. Conversely, Chs2 was unable to polymerise chitin (either in intracellular puncta or extracellularly regardless of Exp/Reb presence) in tissues other than PR cells.

      Nevertheless, we acknowledge that we cannot definitively conclude that all Chs2 protein is entirely retained in the ER. We have included this caveat in our revised manuscript (Chapter 6 and Discussion section).

      Minor Comments: - No mention of Fig 3I in the results section and the order discussed in the results does not match the order in the figure.

      We apologise for these inconsistencies. We have addressed this issue in the text, figure legend, and the image order in Figure 3 and Figure S3.

      • In the results please provide some information on what the CRIMIC collection is and how it allows you to see Chs2 expression for non-experts.

      We have addressed this point in chapter 5 in the revised version, and we now provide a more detailed explanation of the CRIMIC Chs2CR60212-TG4.0 allele.

      Further details of this allele are also provided in our responses to points 5 and 9 of Reviewer 1.

      Reviewer #2 (Significance (Required)):

      Drosophila produce different types of chitinous structures that are required for either the exoskeleton of the animal or for proper gut function (peritrophic matrix). Additionally, most insects have two enzymes involved in the production of chitin and current data suggests that they have unique roles in producing either the exoskeleton or the peritrophic matrix. However, it is unclear whether their different functions are due to differences in cell type expression or differences in physiological activity of the enzymes. The authors exploit Drosophila to drive these 2 enzymes in different cell types that are known to produce the exoskeleton or the peritrophic matrix to determine whether they can functionally substitute mutant backgrounds. Their results give us a hint that these enzymes are not equivalent. What the authors were unable to address is why they are not equivalent. They hypothesise that the different physiological functions of the enzymes may be related to trafficking differences within their respective cell types. While this is an interesting hypothesis, the date are not really clear yet to make this conclusion.

      This work will be of interest to anyone interested in chitinous structures in insects and the cell biology of chitin-related enzymes.

      Literature


      AHMAD, M., CHAUDHARY, S. U., AFZAL, A. J. & TARIQ, M. 2015. Starvation-Induced Dietary Behaviour in Drosophila melanogaster Larvae and Adults. Sci Rep, 5__,__ 14285.

      DE GIORGIO, E., GIANNIOS, P., ESPINAS, M. L. & LLIMARGAS, M. 2023. A dynamic interplay between chitin synthase and the proteins Expansion/Rebuf reveals that chitin polymerisation and translocation are uncoupled in Drosophila. PLoS Biol, 21__,__ e3001978.

      FUSS, B. & HOCH, M. 1998. Drosophila endoderm development requires a novel homeobox gene which is a target of Wingless and Dpp signalling. Mech Dev, 79__,__ 83-97.

      FUSS, B., JOSTEN, F., FEIX, M. & HOCH, M. 2004. Cell movements controlled by the Notch signalling cascade during foregut development in Drosophila. Development, 131__,__ 1587-95.

      HEGEDUS, D. D., TOPRAK, U. & ERLANDSON, M. 2019. Peritrophic matrix formation. J Insect Physiol, 117__,__ 103898.

      KING, D. G. 1988. Cellular organization and peritrophic membrane formation in the cardia (proventriculus) of Drosophila melanogaster. J Morphol, 196__,__ 253-82.

      LEBRETON, G. & CASANOVA, J. 2016. Ligand-binding and constitutive FGF receptors in single Drosophila tracheal cells: Implications for the role of FGF in collective migration. Dev Dyn, 245__,__ 372-8.

      MOUSSIAN, B., LETIZIA, A., MARTINEZ-CORRALES, G., ROTSTEIN, B., CASALI, A. & LLIMARGAS, M. 2015. Deciphering the genetic programme triggering timely and spatially-regulated chitin deposition. PLoS Genet, 11__,__ e1004939.

      NAKAGOSHI, H., HOSHI, M., NABESHIMA, Y. & MATSUZAKI, F. 1998. A novel homeobox gene mediates the Dpp signal to establish functional specificity within target cells. Genes Dev, 12__,__ 2724-34.

      PETERS, W. 1992. Peritrophic Membranes, Springer Berlin, Heidelberg.

      RIZKI, M. T. M. 1956. The secretory activity of the proventriculus of Drosophila melanogaster. Journal of Experimental Zoology, 131__,__ 203-221.

      SANCHEZ, N. & RONCERO, C. 2022. Chitin Synthesis in Yeast: A Matter of Trafficking. Int J Mol Sci, 23.

      ZHU, H., LUDINGTON, W. B. & SPRADLING, A. C. 2024. Cellular and molecular organization of the Drosophila foregut. Proc Natl Acad Sci U S A, 121__,__ e2318760121.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Drosophila have two different chitin synthase enzymes, Kkv and Chs2, and due to unique expression patterns and mutant phenotypes, it is relatively clear that they have different functions in producing either the cuticle-related chitin network (Kkv) or the chitin associated with the peritrophic matrix (PM). However, what is unknown is whether the different functions in making cuticle vs PM chitin is related to differences in cellular expression and/or enzyme properties within the cell. The authors exploit the genetic tractability of Drosophila and their ability to image cuticle vs PM chitin production to examine whether these 2 enzymes can substitute each other. They conclude that these two proteins are not equivalent in their capacity to generate chitin. The data are convincing; however, it is currently presented in a subjective fashion, which makes it difficult to interpret. Additionally, in my opinion there is some interpretation that requires softening or alternatively interpreted.

      Major Comments:

      • While the imaging is lovely, there are some things that are difficult to see in the figures. For example, the "continuous, thin and faint 'chitin' layer that lined the gut epithelium" is very difficult to visualise in the control images. Can they increase the contrast to help the reader appreciate this layer? This is particularly important as we are asked to appreciate a loss of this layer in the absence of Chs2.
      • All the mutant analysis is presented subjectively. For example, the authors state that they "found a consistent difference of CBP staining when they compared the 'Chs2' escapers to the controls". How consistent is consistent? Can this be quantified? What is the penetrance of this phenotype? They say that the thin layer is absent in the midgut and the guts are thinner. Could they provide more concrete data?
      • They state that Chs2 was able to restore accumulation of chitin in the PM of the proventriculus and the midgut. Please quantify. Additionally, does this restore the morphology of the guts (related to the comment above on the thinner guts in the absence of Chs2)?
      • This may be beyond the scope of this paper, but I find it interesting that the PM chitin is deposited in the proventricular lumen. Yet it forms a thin layer that lines the entire midgut? Any idea how this presumably dense chitin network gets transported throughout the midgut to line the epithelium? I imagine that this is unlikely due to diffusion, especially if they see an even distribution across the midgut. Do they see any evidence of a graded lining (i.e. is it denser in the midgut towards the proventriculus and does this progressively decrease as you look through the midgut?)?
      • The authors state that expression of kkv in tracheal cells of kkv mutants perfectly restores accumulation of chitin in the luminal filaments. Is this really 100% restoration? They also reference a paper here, which may have quantified this result.
      • They ask whether Kkv overexpression in the proventriculus can rescue Chs2 mutants... and vice versa, whether Chs2 overexpression in ectodermal cells can rescue kkv mutants. They show that kkv overexpression leads to an intracellular accumulation of chitin in the proventriculus. However, Chs2 overexpression in the trachea did not lead to any accumulation of chitin in the cells. They tailored their experiments and the associated discussion to address the hypothesis that there is potentially some difference in trafficking of these components. However, another possibility, which they have not ruled out, is that the different ability of kkv and Chs2 to produce chitin inside cells of the proventriculus and ectoderm, respectively, is potentially related to different enzymatic activities and cofactors required for chitin formation in these different cell types. Is this another potential explanation for the differences that they observe?
      • They co-express Chs2 and Reb and show that this does not lead to chitin production or secretion. In the discussion they conclude that Chs2 does not "seem to be dependent on 'Reb' activity". I think that this statement potentially needs softening. They show that Reb is not sufficient in to induce Chs2 chitin production in cells that do not normally make a PM. However, they do not show that it is not essential in cells that normally express Chs2 and make PM.
      • They looked at the endogenous expression pattern of kkv and Chs2 and say that they found accumulation of Kkv in the proventriculus and no accumulation in the midgut. Siimilarly, they look at the expression of Chs2 and detect it in cells of the proventriculus. Are there markers of these different cell types that they could use to colocalize these enzymes?
      • They overexpress Chs2 in cells of the midgut and see that it colocalises with an ER marker. They conclude that it is retained in the ER, which again, for them suggests that it has a trafficking problem in these cells. However, they are overexpressing it in these cells and this strong accumulation that they observe in the ER could simply be due to the massive expression levels. Additionally, they cannot conclude that it doesn't get out of the ER at all. They could be correct in thinking that there may be a trafficking issue, but this experiment does not conclusively show that Chs2 is entirely retained in the ER when expressed in ectopic tissues. I wonder if their interpretation needs softening or whether they should potentially address alternative hypotheses.

      Minor Comments:

      • No mention of Fig 3I in the results section and the order discussed in the results does not match the order in the figure.
      • In the results please provide some information on what the CRIMIC collection is and how it allows you to see Chs2 expression for non-experts.

      Significance

      Drosophila produce different types of chitinous structures that are required for either the exoskeleton of the animal or for proper gut function (peritrophic matrix). Additionally, most insects have two enzymes involved in the production of chitin and current data suggests that they have unique roles in producing either the exoskeleton or the peritrophic matrix. However, it is unclear whether their different functions are due to differences in cell type expression or differences in physiological activity of the enzymes. The authors exploit Drosophila to drive these 2 enzymes in different cell types that are known to produce the exoskeleton or the peritrophic matrix to determine whether they can functionally substitute mutant backgrounds. Their results give us a hint that these enzymes are not equivalent. What the authors were unable to address is why they are not equivalent. They hypothesise that the different physiological functions of the enzymes may be related to trafficking differences within their respective cell types. While this is an interesting hypothesis, the date are not really clear yet to make this conclusion.

      This work will be of interest to anyone interested in chitinous structures in insects and the cell biology of chitin-related enzymes.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Chitin is a critical component of the extracellular matrix of arthropods and plays an essential role in the development and protection of insects. There are two chitin synthases in insects: Type A (exoskeletons) and Type B (for the peritrophic matrix in the gut). The study aims to investigate the specificity and mechanisms of the two chitin synthases in D. melanogaster and to clarify whether they are functionally interchangeable. Various genetic manipulations and fluorescence-based labeling were used to analyze the expression, localization, and function of Kkv and Chs2 in different tissues.

      Chs2 is expressed in the PR cells of the proventriculus and is required for chitin deposition in the peritrophic matrix. Kkv can deposit chitin in ectodermal tissues but not in the peritrophic matrix, whereas Chs2 can deposit chitin in the peritrophic matrix but not in ectodermal tissues. The subcellular localization of chitin synthases is specific to the tissues in which they are expressed. Kkv localizes apically in ectodermal tissues, whereas Chs2 localizes apically in the PR cells of the proventriculus. Altogether, Kkv and Chs2 cannot replace each other. The specificity of chitin synthases in D. melanogaster relies on distinct cellular and molecular mechanisms, including intracellular transport pathways and the specific molecular machinery for chitin deposition. Congratulations on this incredible story and manuscript, which is straightforward and well-written. However, I have some comments that may help to improve it.

      Major comments:

      1. Funny thing: the Chs2 mutant larva shows a magenta staining below the chitin accumulation of the esophagus, which looks like a question mark in 1H but cannot be found in control. Is that trachea reaching the pv?
      2. Also, though it is evident that the PM chitin is lost in Ch2 mutants, could it be that the region is disturbed and cells express somewhere else chitin? There are papers by Fuß and Hoch (e.g., Mech of Dev, 79, 1998; Josten, Fuß et al., Dev. Biol.267, 2004) using markers such as Dve, Fkh, Wg, Delta, and Notch, etc. for precisely marking the endodermal/ectodermal region in the embryonic foregut/proventriculus. It would be beneficial to show, along with chitin and Chs expression patterns, the ectoderm/endoderm cells. This is particularly important as the authors report endodermal expression of Chs2 in embryos but don't use co-markers of the endodermal cells.
      3. The origin of midgut chitin accumulation is unclear. Chitin can come from yeast paster. Can the authors check kkv and chs2 mutants for food passage and test starving L1 larvae to detect chitin accumulation in the midgut without feeding them?
      4. Subcellular localization assays require improved analysis, such as a co-marker for the apical membrane and statistical analysis with co-localization tools, showing the overlap at the membrane and intracellularly with membrane co-markers and KDEL.

      Minor comments:

      1. The authors used "L3 larval escapers." It would be interesting to know if the lack of Chs2 and the peritrophic matrix cause any physiological defects or lethality.
      2. The order identifiers are missing for materials and antibodies, e.g., anti-GFP (Abcam), but Abcam provides several ant-GFP; which was used? Please provide order numbers that guarantee the repeatability for others.
      3. Figure S5C, C', what marks GFP (blue) in the trachea? Maybe I have overlooked the description. What is UASsrcGFP? What is the origin of this line?
      4. The authors claim that they validated the anti-Chs2 antibody. However, they show only that it recognizes a Cht2 epitope via ectopic expression. For more profound validation, immune staining is required in deletion mutants, upon knockdown, or upon expression of recombinant proteins, which is not shown.
      5. The legend and text explaining Fig. 4 D-E' can be improved. The authors used the Crimic line, which is integrated into the third ("coding") intron. This orientation can lead to the expression of Gal4 and cause a truncated version of the protein (according to Flybase). Is Chs2 expression reduced in the crimic mutant? If the mutation causes expression of a truncated version, the Chs2 antibody may not be able to detect it as it recognizes a fragment between 1222 and 1383 aa? Also, I'm unsure whether the Chs2 antibody or GFP was used to detect expression in PR cells. The authors describe using Ch2CR60212>SrcGFP together with Chs2+ specific antibodies.

      Referees cross-commenting I fully agree with the comments of Rev#2

      Significance

      The manuscript's strength and most important aspects are the genetic analysis, expression, and localization studies of the two Chitin synthases in Drosophila embryos and larvae. However, beyond this manuscript, the development of mechanistic details, such as interaction partners that trigger secretion and action at the apical membranes and the role of the coiled-coil domain, will be interesting.

      The manuscript uses "first-class" genetics to describe the different roles of the two Chitin synthases in Drosophila, comparing ectodermal chitin (tracheal and epidermal chitin) with endodermal (midgut) chitin. Such a precise analysis has not been investigated before in insects. Therefore, the study deeply extends knowledge about the role of Chitin synthases in insects.

      The audience will specialize in basic research in zoology, developmental biology, and cell biology regarding - how the different Chitin synthases produce chitin. Nevertheless, as chitin is relevant to material research and medical and immunological aspects, the manuscript will be fascinating beyond the specific field and thus for a broader audience.

      I'm working on chitin in the tracheal system and epidermis in Drosophila.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Dear Editor,

      Thank you for reviewing our article. We are happy to see that the reviewers are positive on our manuscript. We have tried to address nearly all their comments. Find below a point-by-point answer.

      With best regards,

      Bruno Lemaitre and Asya Dolgikh

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This work defines NimB1 protein as a PS binding bridging molecule but with a negative regulatory role in efferocytosis. Specifically, the authors demonstrate via a variety of genetic, cell biological, and other approaches that loss of NimB1 leads to Drosophila macrophages being more adherent to apoptotic targets and engulf them more robustly. The authors also nicely demonstrate that the function of NimB1 differs from NimB4, and the double mutant demonstrating PS-binding yet, distinct roles. Further, the authors show that NimB1 does not affect bacterial phagocytosis.

      Overall, this is a well-done study. The authors have already done a very thorough job addressing the key points and I congratulate the authors.

      My only minor comment is that the authors could try to make the comment better about whether or not such a 'negative regulatory' bridging molecules may exist in other species, and particularly mammals. If so, this is quite novel. The authors refer to CD47 but this is a membrane protein. The other minor comment is whether the authors ever tried express the PS binding domains as a fusion protein - this would provide a more direct evidence for the binding to PS (although the authors do competitive inhibition with Annexin V). This could be commented upon although testing this is not necessary if they have not already done so.

      We greatly appreciate the reviewer’s positive feedback. In the revised manuscript, we have now included a more detailed discussion of mammalian proteins with analogous roles, specifically referencing Draper isoforms (I and II), the CD300 receptor family, and surfactant proteins A and B (see page 16).

      Reviewer #1 (Significance (Required)):

      The identification of the negative regulator bridging protein NimB1 is novel and could be broadly interesting to those studying efferocytosis.

      Regarding the suggestion to overexpress just the putative PS-binding domain of NimB1, we agree this could strengthen the evidence for its PS-binding function. However, generating a new transgenic fly line would require significant additional time. Moreover, the presence of a PS-binding motif was also proposed in the recent study on Orion (Ji et al., 2023), which we have cited in our manuscript. The Orion binds PS through a conserved RRY motif. This motif is critical for Orion’s ability to directly interact with PS and facilitate its secretion. Mutagenesis experiments disrupting the RRY motif—specifically substituting arginine residues with alanines—abolished Orion’s PS-binding capacity, demonstrating the essential role of this sequence. Functional assays also validated that Orion competes with Annexin V, a well-established PS-binding protein, for access to PS-exposing surfaces (Ji et al., 2023).

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this study, Dolgikh and colleagues propose a first investigation about the role of the drosophila Nimrod protein NimB1. Although the role of several members of the family in phagocytosis has been explored, the function of Nimrod type B proteins is less addressed. Within silico analysis, they first see a strong similarity between NimB1 and NimB4. They show that NimB1 is primarily expressed in phagocytes, and as NimB4 can bind phosphatidylserines (PS), leading to a possible shared role in efferocytosis. Using transgenic and null drosophila models, the authors then compare the impact of NimB1 overexpression or deficiency. They compare the effects shown to NimB4 and Draper deficient lines, as these two proteins were previously shown to play a role in efferocytosis. They propose that NimB1 is a secreted protein that binds apoptotic cells. They show that NimB1 deficiency changes the adhesion properties of macrophages. The major finding is that NimB1 delays the early stages of efferocytosis, contrary to NimB4 and Draper that on the contrary facilitate efferocytosis. In contrast, the authors propose that NimB1 increases the formation of phagosomes.

      We appreciate the reviewer’s acknowledgment that our key discovery centered around NimB1 functioning as a negative regulator of efferocytosis. This finding highlights NimB1’s distinct role compared to NimB4 and Draper, which instead promote the process.

      Major comments:

      One of the major technical challenges here was to generate models to allow the detection of the protein in cellulo and in vivo. Although the results are convincing in transgenic lines NimB1 expression is driven by the endogenous promoter, one could still argue that the GFP tags would lead to changes in the localization of the protein.

      We understand the concern regarding potential localization changes introduced by GFP tags. However, in previous studies, the same fosmid construct was applied to NimB4-sGFP, and produced a distinctly different expression pattern—NimB4-sGFP expression was more pronounced and clearly present in the glial cells in the brain (Petrignani et al, 2021: Figure EV1A). The fact that the NimB1-sGFP and NimB4-sGFP fosmids localized to different tissues suggests that possible any mis-localization changes due to the GFP tag do not override localization properties intrinsic to the proteins.

      In line with the previous comment, to show that NimB1 is a secreted protein, the authors use an overexpression model. How to be sure, that overexpression itself does not lead to increased secretion, or shedding from the membrane?

      The observation that uas-NimB1-RFP accumulates in the nephrocytes upon Lpp-Gal4 (fat body) expression, and the presence of a signal peptide suggests that this protein can be secreted.

      We cannot exclude that in endogenous condition, NimB1, remains attached to hemocytes. We have confirmed that the Lpp driver is not expressed in nephrocytes.

      Would an experiment with a control consisting in a known protein secreted by macrophages lead to the same staining pattern (positive control)? Could another methodology like a Western Blot on supernatants from hemocyte cell culture (over)expressing NimB1, with an anti-RFP staining, be envisaged?

      We have already performed similar experiment with other secreted proteins such as NimB4-GFP (Petrignani et al., 2021: Figure: 1B). In the revised article, we have added Viking-RFP as a positive control of a secreted protein (Figure S1F). Figure S2 shows a Western blot with hemolymph extract. We detected NimB1-RFP at its expected molecular weight of 44 kDa, verifying that is present into the hemolymph (Supplementary Document S2 C).

      It sems counterintuitive that phagocytes from Draper and NimB4 null mutants with defects in efferocytosis show increased load of apoptotic cells (Figure 6C and D in both unchallenged and injury condition). Do the authors have precedent data to cite going to the same direction? Are cell debris engulfed but not degraded efficiently?

      The observation that Draper and NimB4 null mutants have an increased load of apoptotic cells has already been reported in the literature. Several studies have now shown that Draper is not always required for the initial uptake of apoptotic corpses but is critical for phagosome maturation (Meehan et al., 2016; Serizier et al., 2022; Serizier & McCall, 2017). In our article on NimB4 (Petrignani et al., 2021), we have previously shown that the accumulation of immature phagosomes that are not properly eliminated indirectly impairs the uptake of new apoptotic corpses. This explains why efferocytosis is then impaired only at late time points, when unresolved phagosomes have accumulated to the threshold that prevents further phagocytosis.

      In Figure 6D it seems indeed that NimB4, NimB1/NimB4 and Draper mutants do not accumulate more apoptotic material upon injury. However, levels for NimB4 is close to the one obtained with NimB1 mutants. Is it statistically true? If yes, what could be the reason for this similarity? In any case, as some important conclusion relies on the comparison between UC and injury conditions, adequate statistics and representations could be proposed.

      We thank the reviewer for this pertinent observation and the opportunity to clarify. In the unchallenged (UC) condition, NimB4sk2 and draperΔ5 mutants indeed exhibit significantly elevated levels of apoptotic cell (AC) content in macrophages compared to wild-type and NimB1 mutant genotypes (****p crimic and NimB1229/NimB1crimic* mutants show significantly lower levels in the UC condition, consistent with a role for NimB1 in early recognition or regulation of phagocytic initiation, not in corpse degradation.

      In contrast, upon injury (90 minutes post-challenge) we observe a statistically significant increase in apoptotic material in NimB1 mutants compared to UC hemocytes of the same genotype (****p sk2 and draperΔ5* mutants between the UC and 90 min conditions (ns for NimB4). This is consistent with their known defect in corpse degradation, which results in saturation of phagocytic capacity at baseline, and an inability to respond further upon challenge with apoptotic cells.

      While the absolute levels of apoptotic material in injured NimB1 and UC NimB4 mutants appear similar at first glance, statistical testing confirms that they are significantly different. NimB4 mutant macrophages retain apoptotic debris due to defective degradation, whereas NimB1 mutants have an increase in newly acquired apoptotic content due to enhanced uptake.

      Additionally, NimB161, NimB4sk2 double mutants display a partial increase in apoptotic load upon injury (****p To directly address the reviewer’s suggestion, we have now recalculated and visualized key comparisons with appropriate statistical testing, as shown in Revision Figure 1. All statistical analyses were conducted using unpaired two-tailed Student’s t-tests. This additional figure allows clearer evaluation of genotype-specific differences at both baseline and post-injury conditions and supports our conclusions that NimB1 and NimB4 regulate distinct stages of phagocytosis. We have also clarified the text to better explain that both NimB4 and Draper mutants accumulate unresolved apoptotic material under baseline conditions, and do not accumulate further material upon challenge, due to a block in phagosome maturation.

      Revisions Figure 1.

      __Quantification of phagocytic events in wild-type and mutant macrophages under unchallenged and post-injury conditions __

      (A) Comparison of phagocytic events per frame in w1118 (wild-type), NimB1crimic, NimB1229/NimB1crimic, NimB4sk2, NimB161,NimB4 sk2, and draperΔ5 larvae under unchallenged conditions (UC) and 90 minutes after injury (90 min). Data are presented as individual data points with means. Statistical significance was determined using Student's t-test (*P (B) Direct comparison of phagocytic events between NimB1crimic (red) and NimB4sk2 (gray), and between NimB1229/crimic (dark red) and NimB4sk2 (gray) under both unchallenged (UC) and post-injury (90 min) conditions.

      The authors claim with analyses of Figure 8C and D, that NimB1 mutants show acidic vehicles normal in size and fluorescence intensity. However, statistical differences are still observed compared to control condition, which is also seen in representative images shown.

      In Figure 8C and D, we provide two quantitative measures to clarify the size and intensity of acidic vesicles. First, we show that mean fluorescence in hemocytes is elevated for all NimB and draper mutants compared to wild type, indicating an overall increase in internalized material. However, we also quantified the number of vesicles per hemocyte and found that NimB1 mutants exhibit significantly more vesicles. Despite this increase, the representative images do not show an obvious enlargement of individual vesicles, suggesting that while more material is being taken up, the vesicles themselves are not enlarged. The enlarged vesicles in case of NimB4 or draper mutant would result from the unresolved cargo (Petrignani et al., 2021). This distinction underscores that higher fluorescence values reflect increased cargo internalization, rather than the larger vesicular structures that result from impaired degradation as in NimB4 or draper mutants.

      Minor comments:

      In figure 2D, what allows to say the expression is restricted in macrophages? Is it the colocalization with SIMU being a macrophage-specific marker?

      In Figure 2D, we relied on SIMU as a macrophage-specific marker in Drosophila embryos to determine that NimB1 expression is restricted to macrophages. Previous research has demonstrated that SIMU is predominantly expressed in embryonic macrophages (where it is essential for apoptotic cell clearance) (Kurant et al., 2008; Roddie et al., 2019). Consequently, the colocalization of NimB1 signal with SIMU-positive cells strongly indicates that NimB1 is confined to macrophages during this developmental stage.

      In figure S3B and C, it appears that double NimB1/NimB4 mutants exhibit less spreading than single ones (especially NimB4). Is it the case (statistical significance). If yes what could be the explanation?

      Yes, the double NimB1, NimB4 mutants exhibit higher number of hemocytes and significantly reduced cell spreading compared to single mutants. The phenotype is similar to NimC1, eater double mutants (Melcarne et al., 2019) which also show higher number of hemocytes, reduced cell spreading and also diminished capacity to phagocytose apoptotic cells (and, in the case of NimC1, Eater, bacteria as well) (Melcarne et al., 2019). A likely explanation lies in impaired membrane remodeling critical for pseudopod extension and phagosome formation. Studies have shown that defects in actin polymerization or membrane tension can hinder pseudopod extension, reducing phagocytic efficiency (Lee et al., 2007; Masters et al., 2013). Same for the decreased ability of these mutants to form filopodium, a process essential for effective target engagement and engulfment. Filopodia play a significant role in capturing particles and directing them toward the macrophage body for engulfment (Horsthemke et al., 2017). Disruptions in these pathways lead to reduced phagocytic efficiency and a more rounded macrophage morphology, as the cells fail to spread properly (Horsthemke et al., 2017; Lillico et al., 2018). Other than these general observations, we do not have an explanation as to why NimB1, NimB4 double mutants specifically show a higher number of hemocytes and reduced cell spreading.

      Several graphs are identical between figure 4 and S4. It is probably not useful and complicates reading.

      We agree that duplicating these graphs complicates the presentation. Therefore, we have removed the redundant graphs in the supplementary materials, ensuring the data are shown only once to maintain clarity and ease of reading

      As TEM images shown in Figure 8B do not lead to quantitative data, I would put it as supplementary file.

      We agree that the TEM images in Figure 8B do not provide strictly quantitative data. To streamline the main manuscript, we have relocated these images to the supplementary section in the revised version

      Reviewer #2 (Significance (Required)):

      This study uses several approaches and models to address the role of NimB1 in efferocytosis. Both In Vitro and In Vivo approaches are proposed. They give insight into the role of this protein with unknown function so far. Some statistical analysis could be performed to improve the clarity of conclusions. One of the important aspects is the secreted nature of NimB1.However, additional approaches could be proposed to confirm this.

      Basic immunologists and cell biologists would be interested in reading this article that highlights the delicate equilibrium between pro and anti-efferocytosis molecules.

      I am an immunologist/cell biologist with expertise in lysosomal catabolism. As I work on mouse models or Human samples, my mastering of drosophila as a model is limited.

      We thank the reviewer for the positive evaluation of our work. In this revision, we have added further detail to clarify the properties of NimB1 as a secreted protein and strengthen our data presentation. By providing additional clarity on methods and interpretations, we hope immunologists and cell biologists—including those who do not routinely work with Drosophila—will find our findings more accessible.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This paper investigates the role of NimB1, a secreted member of the Nimrod family in Drosophila, in the process of efferocytosis, the clearance of apoptotic cells by macrophages. Previous studies have identified NimB4, another secreted Nimrod protein, as a positive regulator of efferocytosis, enhancing both apoptotic cell binding and phagosome maturation. In contrast, the authors propose that NimB1 functions as a negative regulator, slowing down the early stages of apoptotic cell binding and internalization. This regulatory balance is suggested to fine-tune efferocytosis to maintain homeostasis.

      The primary aim of this study was to characterize the function of NimB1 to better understand the roles of proteins within the NimB family.

      This study identifies a novel function for NimB1 in modulating the early stages of efferocytosis, adding to our understanding of how Nimrod proteins fine-tune apoptotic cell clearance. The authors establish a clear phenotypic contrast between NimB1 and NimB4, which provides a compelling framework for understanding how positive and negative regulators coordinate phagocytosis. It also highlights the multiple roles of the secreted members of the Nimrod scavenger receptor family, which have remained so far poorly investigated.

      This is an interesting study that could be strengthened by additional validation and broader experimental support. As the authors point out in the discussion, it is known that PS bridging molecules contribute to phagocytosis and that the contribution of positive and negative players finely tune phagocytosis in flies and mammals. Clarifying the mode of action of NimB1 in those processes would higher the impact of this interesting piece of work. For example, does NimB1 interact with NimB4 and if so, what is the role of this interaction? How does NimB1 integrate in the signaling cascade that allows scavenger receptors to bind PS? Does it act similar to Orion by enhancing the PS binding of a scavenger receptor?

      Key Findings • NimB1 and NimB4 are structurally similar, as supported by AlphaFold2 modeling, suggesting functional relatedness. • NimB1 is expressed in macrophages, secreted into the hemolymph, and binds apoptotic cells in a phosphatidylserine (PS)-dependent manner. • NimB1 is induced by challenge. • NimB1 mutants display a hyper-phagocytic phenotype, with faster recognition and internalization of apoptotic cells. • NimB1 loss enhances macrophage adhesion and actin remodeling, while bacterial phagocytosis remains unaffected, suggesting a specific role in apoptotic clearance. • NimB1 acts early in the phagocytic process, while NimB4 functions at later stages, particularly in phagosome maturation.

      We thank the reviewer for their positive assessment and are pleased that our findings identify NimB1 as a novel secreted negative regulator of efferocytosis, underscoring a greater level of regulatory complexity in apoptotic cell clearance.

      Unfortunately, attempts to produce functional NimB1 protein were not successful, limiting our ability to address some of the reviewer’s suggestions experimentally. Despite these challenges, the evidence we present—particularly from our genetic assays—clearly indicates that NimB1 exerts an inhibitory influence during the early steps of apoptotic cell binding, distinguishing it from the late-stage promoting function of NimB4.

      Major comments:

      Figure 1: AlphaFold is a valuable tool for generating hypotheses, however predicted structures should not be presented as definitive evidence of similarity, particularly without complementary experimental validation. This section would be stronger if the structural predictions were explicitly framed as predictions. In the absence of such data, the interpretation should be toned down.

      We agree with the reviewer and we have now framed our observation as prediction and toned down our interpretation. We also note that the similarities between NimB4 and NimB1 are also underlined by the phylogenetic analysis and expression pattern.

      Figure 2DE: Given its basal level in homeostatic conditions, it would have been useful to look at the NimB1-GFP upon challenge. Also, the authors show only a single larval macrophage with no comparison point. To strengthen this result, the authors could include another protein quantification method, such as western blotting. Alternatively, labelling of NimB1>UASmRFP in embryo that present the highest expression levels would also strengthen this result.

      Unfortunately, we cannot currently perform additional experiments on embryos within the scope of this project because those experiments were performed by our collaborators in Haifa (Estee Kurant Lab). Repeating them would require sending the lines to their lab and accommodating their experimental schedule and manpower constraints.

      In supplementary Figure S1F: the authors overexpress NimB1-RFP using the fat body driver Lpp-Gal4 and show larvae with RFP in the nephrocyte. Could filet preparations be shown? Could the authors present evidence that the Lpp driver is not expressed in the nephrocytes (or refer to literature)?

      The Lpp-Gal80 driver is described as fat body-specific and has been used to manipulate gene expression in the fat body in many other studies. We have checked Lpp-Gal80>UAS-GFP expression in larvae and did not observe expression in larval nephrocytes. The whole live larvae were observed under the microscope with no prior filet preparations. To provide the evidence that Lpp is not expressed in the nephrocytes we are providing the images of the whole larvae expressing GPF from the Lpp, as per genotype: Lgg-Gal80>UAS-GFP (see below, Revisions Figure 2).


      Revisions Figure 2.

      __Expression pattern of Lpp-Gal80>UAS-GFP in Drosophila larvae __

      Representative fluorescence microscopy images showing GFP expression driven by the Lpp-Gal80 system in Drosophila larvae. The images display dorsal (top) and ventral (bottom) views of the same larva, demonstrating the pattern of expression throughout the fat body tissue. Green fluorescence indicates cells expressing the GFP reporter under the control of the Lpp promoter, which is predominantly active in the larval fat body.

      The results on the increased number of hemocytes observed in the double NimB1, NimB4 mutant animals (Figure S3A) remains not only disconnected from the rest of the data but also unexplained. Providing a mechanistic view may require a significant amount of work that may indicate an additional role of the two NimBs but will not add to our understanding of the role of NimB1 in phagocytosis. Nevertheless, it would be at least useful to know whether in the double mutant the lymph gland is still intact, as its precocious histolysis could account for the elevated number of hemocytes. If that were the case, that could indicate that lacking the two NimBs triggers an inflammatory state that affects the lymph gland, meaning that the pathway controlling phagocytosis also has a systemic impact on development. When checking the representative Figure S4D, it seems that very large cells are present in the double mutants, even larger than in the single mutants. These could be (pre)lamellocytes, which constitute activated hemocytes, known to impact the status of the lymph gland. If the enhanced number of hemocytes does not depend on lymph gland histolysis, a simple immunolabeling with the anti-PH3 antibody would assess the proliferative phenotype of the double mutant hemocytes. At least this piece of data would provide a better explanation for the observed phenotype.

      We thank the reviewer for this interesting comment. We cannot explain why NimB1, NimB4 double mutants have more hemocytes. It is unclear to us if this is a secondary consequence of defects in efferocytosis or linked to another function of these two NimBs, such as a role in adhesion. We did look at the lymph gland and our preliminary observations suggest that NimB1, NimB4 double mutants have an easily ruptured or fragile lymph gland, which could explain the higher number and the roundish shape of hemocytes in circulation as proposed by the reviewer. Lacking expertise on lymph gland, we prefer not to include this data, as they are not central to the main message of this article on role of NimB1 on efferocytosis. We have also noted the presence of lamellocytes in unchallenged NimB1, NimB4 double mutant larvae, as well as excessive lamellocyte production compared to controls upon clean injury (see below, Revisions Figure 3). We have mentioned the presence of lamellocytes in NimB1, NimB4 double mutants in the revised version. We prefer not include this new data directly in the article because this not central to the main message of the article.


      __Revisions Figure 3. __

      A.

      B.

      Lamellocyte recruitment following a clean injury in L3 Drosophila larvae:

      (A) Quantification of lamellocytes per 50 frames of x63 microscopy lens in w1118 (wild-type), NimB1crimic, NimB4sk2, NimB161, NimB4sk2, and draperΔ5 larvae under unchallenged conditions (UC) and 3 hours after clean injury (3h). Arrowheads indicate lamellocytes.

      (B) Representative confocal microscopy images of hemocytes isolated from challenged NimB161, NimB4sk2 larvae. Cells were fixed and stained with Phalloidin (green) to label F-actin and DAPI (blue) to visualize nuclei. The smaller inset (40x magnification) shows a detailed view of individual lamellocytes with characteristic morphology, while the larger field (20x magnification) displays the overall view on the hemocytes. Scale bar = 50 μm.

      Figure 6: The connection between the ex-vivo (Figure 5) and in vivo (Figure 6) assays should be clarified. In the first type of assay, the lack of NimB4 results in reduced internalization (while lack of NimB1 enhances it). In the in vivo assay, more fragments are seen within the cell (hence internalized), using the NimB4 mutant. Also, in the ex-vivo assay, the lack of NimB1 does not affect the first steps ('attachment' and 'membrane'), while NimB4 does, yet it is proposed that NimB1 acts in the early steps (page 11-12). In that case, wouldn't we expect the double mutant NimB1 NmB4 to have the NimB1 phenotype?

      The apparent discrepancy between our ex vivo and in vivo assays reflects the different methodologies and what each assay measures. In the ex vivo assay (Figure 4), we add exogenous fluorescently-labeled apoptotic cells to measure new engulfment events. Here, NimB4 mutant macrophages show reduced phagocytic index because they are already saturated with unresolved phagosomes, limiting their capacity to uptake additional corpses, as previously described by (Petrignani et al., 2021). This reduced uptake capacity is reflected in the decreased phagocytic index observed.

      In contrast, our in vivo assay (Figure 6) uses DAPI staining to visualize all internalized material, including previously engulfed debris. As expected, we observe accumulation of DAPI signals in NimB4 mutant macrophages under unchallenged conditions, reflecting their inability to process and clear phagosomes rather than enhanced engulfment. This phenotype highlights the role of NimB4 in phagosome maturation rather than initial uptake.

      Regarding the role of NimB1 in early phagocytic steps, while attachment and membrane measurements in the ex vivo assay don't show significant differences in NimBcrimic mutants individually, our other experiments demonstrate that NimB1 functions as a negative regulator during early recognition phases. The predominance of the NimB4 phenotype in the NimB1crimic, NimB4 double mutant parallels observations in draper mutants, where double mutants lacking both Draper I (positive regulator) and Draper II (negative regulator) display the Draper I phenotype (Logan et al., 2012). This suggests that phagosome maturation defects (the NimB4 phenotype) present a more severe bottleneck in the phagocytic process than enhanced early uptake (the NimB1crimic phenotype), explaining why the double mutant primarily exhibits accumulation of unresolved phagosomes rather than accelerated uptake. We have re-written this part of the article to clarify these points (see page 11).

      Figure 8A: a definition of the phagocytic cup mentioned in the text (page 12, 2nd paragraph) as well as the homogenization of the scale bars in Figure 8A would clarify the interpretation of Figure 8A. The structures shown for w1118 seem quite distant from the structures highlighted for NimB1crimic.

      According to reviewer 2, we have now moved this figure to the supplement. The reviewer is correct and we have modified the associated text to clarify the interpretation of the images (see page 12-13).

      The same scale should be used across different panels in Figure 8. This is particularly important since the authors mention the size of the lysotracker vesicles to conclude on their levels of maturity. This data and conclusions would be strengthened by a quantification of the vacuole sizes and the combination with markers of phagosome/lysosome maturation levels. It would help disentangling the complementary roles of NimB1 and NimB4.

      The scale bar has been homogenized.

      Minor comments:

      Figure 2BC: is there a particular reason to shift from Rp49 to Rpl32 as normalizing gene in Figure 2B and C? This prevents the comparison of NimB1 expression levels across the different graphs.

      We thank the reviewer for highlighting this point. We changed the housekeeping gene from Rp49 to RPL32 in Figure 2C to unify the normalization strategy across all experiments and allow comparisons throughout the manuscript.

      Page 9, 2nd paragraph and Figure S3C: the authors mention "Actin structure revealed an increased ratio of filopodia to lamellipodia across all mutants". A clear definition of the parameters defining filopodia and lamellipodia is required to fully appreciate the meaning of the ratio.

      We thank the reviewer for the comment. To address this comment, we have included a clear definition of the parameters used to distinguish filopodia and lamellipodia on page 9. In particular, in the revised version we now specify that filopodia were defined as thin, spike-like actin-rich protrusions, while lamellipodia were defined as broad, sheet-like structures at the cell periphery. These criteria were applied consistently for quantification.

      Figure S5B: a bar is missing in the right graph (% of cells containing AC, NimB1>UAS-NimB1-RFP). Page 10 2nd paragraph. The authors mention "draper mutants displayed impaired apoptotic cell binding and engulfment" referring to Figure 4. Figure S4 provide a more convincing illustration of this statement, since the decreased phagocytic index in Drpr KO is mostly due to less cells phagocytosing and not less material phagocytosed.

      We thank the reviewer for the careful examination. In Figure S5B, the missing bar was due to its color being too close to the background color, making it difficult to distinguish. We have now corrected this by adjusting the color to ensure it is clearly visible.

      Regarding the comment on page 10, we agree that Figure S4 more clearly illustrates the impaired apoptotic cell binding and engulfment observed in draper mutants, particularly through the reduced percentage of hemocytes engaging in phagocytosis. We have now clarified the statement in the text to ensure consistency and to guide the reader appropriately to Figure S4 (10).

      Figure 6: not easy to distinguish the DAPI labelling relative to the nucleus vs. that of apoptotic fragments.

      This is a good point. We have changed the images for clearer demonstration of the DAPI labelling. See Figure 6.

      Figure 7B: the number of cells used to generate the violin plot should be indicated in the legend or the method section.

      We have mentioned the number of cells used in the quantification (n-50 per genotype) in the figure legend.

      A schematic figure recapitulating the data would help

      We have added a schematic figure recapitulating the data. See Figure 9 with associated text.

      Page 11 last line: homeostatic rather than hemostatic.

      Thank you for this comment. We have changed it.

      Reviewer #3 (Significance (Required)):

      This study identifies a novel function for NimB1 in modulating the early stages of efferocytosis, adding to our understanding of how Nimrod proteins fine-tune apoptotic cell clearance. The authors establish a clear phenotypic contrast between NimB1 and NimB4, which provides a compelling framework for understanding how positive and negative regulators coordinate phagocytosis. It also highlights the multiple roles of the secreted members of the Nimrod scavenger receptor family, which have remained so far poorly investigated.

      This is an interesting study that could be strengthened by additional validation and broader experimental support. As the authors point out in the discussion, it is known that PS bridging molecules contribute to phagocytosis and that the contribution of positive and negative players finally tune phagocytosis in flies and mammals. Clarifying the mode of action of NimB1 in those processes would higher the impact of this interesting piece of work. For example, does NimB1 interact with NimB4 and if so, what is the role of this interaction? How does NimB1 integrate in the signaling cascade that allows scavenger receptors to bind PS? Does it act similar to Orion by enhancing the PS binding of a scavenger receptor?

      We thank the reviewer for the insightful comments and suggestions. Indeed, understanding the mode of action of NimB1 in the regulation of efferocytosis would significantly strengthen the impact of our findings. Our data, supported by structural and phylogenetic analyses, indicate that NimB1 and NimB4 share a conserved phosphatidylserine (PS)-binding motif, suggesting that these proteins may interact functionally. Preliminary biochemical observations, together with structural predictions, raise the possibility of a direct or indirect interaction between NimB1 and NimB4, although this remains to be experimentally confirmed.

      Our observations from NimB1 and NimB4 double mutants reveal that the phenotype closely resembles that of NimB4 single mutants, indicating that NimB4 plays a dominant role in the downstream maturation steps of phagosome processing. These findings are consistent with a model in which NimB1 may modulate early phagocytic uptake, possibly by competing with NimB4 for PS binding or by limiting NimB4 accessibility to apoptotic cells, thereby fine-tuning the rate of efferocytosis.

      Regarding the integration into the signaling cascade, while NimB1 and Orion both recognize PS, our data suggest that they function through distinct mechanisms. Orion enhances PS binding to Draper receptor isoforms to promote apoptotic corpse recognition. In contrast, NimB1 appears to act as an inhibitory modulator, potentially masking PS or limiting receptor engagement, thus slowing the phagocytic response. Further functional studies, including receptor-binding assays, will be important to determine whether NimB1 acts by altering receptor-ligand interactions or through a different regulatory pathway.

      Future experiments investigating the potential direct interactions between NimB1 and NimB4, their respective affinities for PS, and their influence on phagocytic receptor dynamics will be necessary to better understand NimB1’s precise mode of action. Such studies will help clarify how secreted regulators fine-tune efferocytosis in Drosophila and may offer broader insights into conserved principles of phagocytic regulation across species.

      __ __

      List of References:

      Horsthemke, M., Bachg, A. C., Groll, K., Moyzio, S., Müther, B., Hemkemeyer, S. A., Wedlich-Söldner, R., Sixt, M., Tacke, S., Bähler, M., & Hanley, P. J. (2017). Multiple roles of filopodial dynamics in particle capture and phagocytosis and phenotypes of Cdc42 and Myo10 deletion. The Journal of Biological Chemistry, 292(17), 7258–7273. https://doi.org/10.1074/jbc.M116.766923

      Ji, H., Wang, B., Shen, Y., Labib, D., Lei, J., Chen, X., Sapar, M., Boulanger, A., Dura, J.-M., & Han, C. (2023). The Drosophila chemokine–like Orion bridges phosphatidylserine and Draper in phagocytosis of neurons. Proceedings of the National Academy of Sciences, 120(24), e2303392120. https://doi.org/10.1073/pnas.2303392120

      Kurant, E., Axelrod, S., Leaman, D., & Gaul, U. (2008). Six-Microns-Under Acts Upstream of Draper in the Glial Phagocytosis of Apoptotic Neurons. Cell, 133(3), 498–509. https://doi.org/10.1016/j.cell.2008.02.052

      Lee, W. L., Mason, D., Schreiber, A. D., & Grinstein, S. (2007). Quantitative Analysis of Membrane Remodeling at the Phagocytic Cup. Molecular Biology of the Cell, 18(8), 2883–2892. https://doi.org/10.1091/mbc.E06-05-0450

      Lillico, D. M. E., Pemberton, J. G., & Stafford, J. L. (2018). Selective Regulation of Cytoskeletal Dynamics and Filopodia Formation by Teleost Leukocyte Immune-Type Receptors Differentially Contributes to Target Capture During the Phagocytic Process. Frontiers in Immunology, 9. https://doi.org/10.3389/fimmu.2018.01144

      Masters, T. A., Pontes, B., Viasnoff, V., Li, Y., & Gauthier, N. C. (2013). Plasma membrane tension orchestrates membrane trafficking, cytoskeletal remodeling, and biochemical signaling during phagocytosis. Proceedings of the National Academy of Sciences, 110(29), 11875–11880. https://doi.org/10.1073/pnas.1301766110

      Meehan, T. L., Joudi, T. F., Timmons, A. K., Taylor, J. D., Habib, C. S., Peterson, J. S., Emmanuel, S., Franc, N. C., & McCall, K. (2016). Components of the Engulfment Machinery Have Distinct Roles in Corpse Processing. PLOS ONE, 11(6), e0158217. https://doi.org/10.1371/journal.pone.0158217

      Melcarne, C., Ramond, E., Dudzic, J., Bretscher, A. J., Kurucz, É., Andó, I., & Lemaitre, B. (2019). Two Nimrod receptors, NimC1 and Eater, synergistically contribute to bacterial phagocytosis in Drosophila melanogaster. The FEBS Journal, 286(14), 2670–2691. https://doi.org/10.1111/febs.14857

      Petrignani, B., Rommelaere, S., Hakim-Mishnaevski, K., Masson, F., Ramond, E., Hilu-Dadia, R., Poidevin, M., Kondo, S., Kurant, E., & Lemaitre, B. (2021). A secreted factor NimrodB4 promotes the elimination of apoptotic corpses by phagocytes in Drosophila. EMBO Reports, 22(9), e52262. https://doi.org/10.15252/embr.202052262

      Roddie, H. G., Armitage, E. L., Coates, J. A., Johnston, S. A., & Evans, I. R. (2019). Simu-dependent clearance of dying cells regulates macrophage function and inflammation resolution. PLoS Biology, 17(5), e2006741. https://doi.org/10.1371/journal.pbio.2006741

      Serizier, S. B., & McCall, K. (2017). Scrambled Eggs: Apoptotic Cell Clearance by Non-Professional Phagocytes in the Drosophila Ovary. Frontiers in Immunology, 8, 1642. https://doi.org/10.3389/fimmu.2017.01642

      Serizier, S. B., Peterson, J. S., & McCall, K. (2022). Non-autonomous cell death induced by the Draper phagocytosis receptor requires signaling through the JNK and SRC pathways. Journal of Cell Science, 135(20), jcs250134. https://doi.org/10.1242/jcs.250134

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper investigates the role of NimB1, a secreted member of the Nimrod family in Drosophila, in the process of efferocytosis, the clearance of apoptotic cells by macrophages. Previous studies have identified NimB4, another secreted Nimrod protein, as a positive regulator of efferocytosis, enhancing both apoptotic cell binding and phagosome maturation. In contrast, the authors propose that NimB1 functions as a negative regulator, slowing down the early stages of apoptotic cell binding and internalization. This regulatory balance is suggested to fine-tune efferocytosis to maintain homeostasis.

      The primary aim of this study was to characterize the function of NimB1 to better understand the roles of proteins within the NimB family.

      This study identifies a novel function for NimB1 in modulating the early stages of efferocytosis, adding to our understanding of how Nimrod proteins fine-tune apoptotic cell clearance. The authors establish a clear phenotypic contrast between NimB1 and NimB4, which provides a compelling framework for understanding how positive and negative regulators coordinate phagocytosis. It also highlights the multiple roles of the secreted members of the Nimrod scavenger receptor family, which have remained so far poorly investigated.

      This is an interesting study that could be strengthened by additional validation and broader experimental support. As the authors point out in the discussion, it is known that PS bridging molecules contribute to phagocytosis and that the contribution of positive and negative players finely tune phagocytosis in flies and mammals. Clarifying the mode of action of NimB1 in those processes would higher the impact of this interesting piece of work. For example, does NimB1 interact with NimB4 and if so, what is the role of this interaction? How does NimB1 integrate in the signaling cascade that allows scavenger receptors to bind PS? Does it act similar to Orion by enhancing the PS binding of a scavenger receptor ?

      Key Findings

      • NimB1 and NimB4 are structurally similar, as supported by AlphaFold2 modeling, suggesting functional relatedness.
      • NimB1 is expressed in macrophages, secreted into the hemolymph, and binds apoptotic cells in a phosphatidylserine (PS)-dependent manner.
      • NimB1 is induced by challenge.
      • NimB1 mutants display a hyper-phagocytic phenotype, with faster recognition and internalization of apoptotic cells.
      • NimB1 loss enhances macrophage adhesion and actin remodeling, while bacterial phagocytosis remains unaffected, suggesting a specific role in apoptotic clearance.
      • NimB1 acts early in the phagocytic process, while NimB4 functions at later stages, particularly in phagosome maturation.

      Major comments

      1. Figure 1: AlphaFold is a valuable tool for generating hypotheses, however predicted structures should not be presented as definitive evidence of similarity, particularly without complementary experimental validation. This section would be stronger if the structural predictions were explicitly framed as predictions. In the absence of such data, the interpretation should be toned down.
      2. Figure 2DE : Given its basal level in homeostatic conditions, it would have been useful to look at the NimB1-GFP upon challenge. Also, the authors show only a single larval macrophage with no comparison point. To strengthen this result, the authors could include another protein quantification method, such as western blotting. Alternatively, labelling of NimB1>UASmRFP in embryo that present the highest expression levels would also strengthen this result.
      3. In supplementary Figure S1F : the authors overexpress NimB1-RFP using the fat body driver Lpp-Gal4 and show larvae with RFP in the nephrocyte. Could filet preparations be shown? Could the authors present evidence that the Lpp driver is not expressed in the nephrocytes (or refer to literature)?
      4. The results on the increased number of hemocytes observed in the double NimB1, NimB4 mutant animals (Figure S3A) remains not only disconnected from the rest of the data but also unexplained. Providing a mechanistic view may require a significant amount of work that may indicate an additional role of the two NimBs but will not add to our understanding of the role of NimB1 in phagocytosis. Nevertheless, it would be at least useful to know whether in the double mutant the lymph gland is still intact, as its precocious histolysis could account for the elevated number of hemocytes. If that were the case, that could indicate that lacking the two NimBs triggers an inflammatory state that affects the lymph gland, meaning that the pathway controlling phagocytosis also has a systemic impact on development. When checking the representative Figure S4D, it seems that very large cells are present in the double mutants, even larger than in the single mutants. These could be (pre)lamellocytes, which constitute activated hemocytes, known to impact the status of the lymph gland. If the enhanced number of hemocytes does not depend on lymph gland histolysis, a simple immunolabeling with the anti-PH3 antibody would assess the proliferative phenotype of the double mutant hemocytes. At least this piece of data would provide a better explanation for the observed phenotype.
      5. Figure 6: The connection between the ex-vivo (Figure 5) and in vivo (Figure 6) assays should be clarified. In the first type of assay, the lack of NimB4 results in reduced internalization (while lack of NimB1 enhances it). In the in vivo assay, more fragments are seen within the cell (hence internalized), using the NimB4 mutant. Also, in the ex-vivo assay, the lack of NimB1 does not affect the first steps ('attachment' and 'membrane'), while NimB4 does, yet it is proposed that NimB1 acts in the early steps (page 11-12). In that case, wouldn't we expect the double mutant NimB1 NmB4 to have the NimB1 phenotype?
      6. Figure 8A : a definition of the phagocytic cup mentioned in the text (page 12, 2nd paragraph) as well as the homogenization of the scale bars in Figure 8A would clarify the interpretation of Figure 8A. The structures shown for w1118 seem quite distant from the structures highlighted for NimB1crimic.
      7. The same scale should be used across different panels in Figure 8. This is particularly important since the authors mention the size of the lysotracker vesicles to conclude on their levels of maturity. This data and conclusions would be strengthened by a quantification of the vacuole sizes and the combination with markers of phagosome/lysosome maturation levels. It would help disentangling the complementary roles of NimB1 and NimB4.

      Minor comments:

      Figure 2BC : is there a particular reason to shift from Rp49 to Rpl32 as normalizing gene in Figure 2B and C? This prevents the comparison of NimB1 expression levels across the different graphs. Page 9, 2nd paragraph and Figure S3C: the authors mention "Actin structure revealed an increased ratio of filopodia to lamellipodia across all mutants". A clear definition of the parameters defining filopodia and lamellipodia is required to fully appreciate the meaning of the ratio. Figure S5B: a bar is missing in the right graph (% of cells containing AC, NimB1>UAS-NimB1-RFP). Page 10 2nd paragraph. The authors mention "draper mutants displayed impaired apoptotic cell binding and engulfment" referring to Figure 4. Figure S4 provide a more convincing illustration of this statement, since the decreased phagocytic index in Drpr KO is mostly due to less cells phagocytosing and not less material phagocytosed. Figure 6: not easy to distinguish the DAPI labelling relative to the nucleus vs. that of apoptotic fragments. Figure 7B: the number of cells used to generate the violin plot should be indicated in the legend or the method section. A schematic figure recapitulating the data would help. Page 11 last line: homeostatic rather than hemostatic.

      Significance

      This study identifies a novel function for NimB1 in modulating the early stages of efferocytosis, adding to our understanding of how Nimrod proteins fine-tune apoptotic cell clearance. The authors establish a clear phenotypic contrast between NimB1 and NimB4, which provides a compelling framework for understanding how positive and negative regulators coordinate phagocytosis. It also highlights the multiple roles of the secreted members of the Nimrod scavenger receptor family, which have remained so far poorly investigated.

      This is an interesting study that could be strengthened by additional validation and broader experimental support. As the authors point out in the discussion, it is known that PS bridging molecules contribute to phagocytosis and that the contribution of positive and negative players finely tune phagocytosis in flies and mammals. Clarifying the mode of action of NimB1 in those processes would higher the impact of this interesting piece of work. For example, does NimB1 interact with NimB4 and if so, what is the role of this interaction? How does NimB1 integrate in the signaling cascade that allows scavenger receptors to bind PS? Does it act similar to Orion by enhancing the PS binding of a scavenger receptor ?

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Dolgikh and colleagues propose a first investigation about the role of the drosophila Nimrod protein NimB1. Although the role of several members of the family in phagocytosis has been explored, the function of Nimrod type B proteins is less addressed. With in silico analysis, they first see a strong similarity between NimB1 and NimB4. They show that NimB1 is primarily expressed in phagocytes, and as NimB4 can bind phosphatidylserines (PS), leading to a possible shared role in efferocytosis. Using transgenic and null drosophila models, the authors then compare the impact of NimB1 overexpression or deficiency. They compare the effects shown to NimB4 and Draper deficient lines, as these two proteins were previously shown to play a role in efferocytosis. They propose that NimB1 is a secreted protein that binds apoptotic cells. They show that NimB1 deficiency changes the adhesion properties of macrophages. The major finding is that NimB1 delays the early stages of efferocytosis, contrary to NimB4 and Draper that on the contrary facilitate efferocytosis. In contrast, the authors propose that NimB1 increases the formation of phagosomes.

      Major comments:

      • One of the major technical challenges here was to generate models to allow the detection of the protein in cellulo and in vivo. Although the results are convincing in transgenic lines NimB1 expression is driven by the endogenous promoter, one could still argue that the GFP tags would lead to changes in the localization of the protein.
      • In line with the previous comment, to show that NimB1 is a secreted protein, the authors use an overexpression model. How to be sure, that overexpression itself does not lead to increased secretion, or shedding from the membrane ?
      • Would an experiment with a control consisting in a known protein secreted by macrophages lead to the same staining pattern (positive control)? Could another methodology like a Western Blot on supernatants from hemocyte cell culture (over)expressing NimB1, with an anti-RFP staining, be envisaged?
      • It sems counterintuitive that phagocytes from Draper and NimB4 null mutants with defects in efferocytosis show increased load of apoptotic cells (Figure 6C and D in both unchallenged and injury condition). Do the authors have precedent data to cite going to the same direction? Are cell debris engulfed but not degraded efficiently?
      • In Figure 6D it seems indeed that NimB4, NimB1/NimB4 and Draper mutants do not accumulate more apoptotic material upon injury. However, levels for NimB4 is close to the one obtained with NimB1 mutants. Is it statistically true? If yes, what could be the reason for this similarity ? In any case, as some important conclusion relies on the comparison between UC and injury conditions, adequate statistics and representations could be proposed.
      • The authors claim with analyses of Figure 8C and D, that NimB1 mutants show acidic vehicles normal in size and fluorescence intensity. However, statistical differences are still observed compared to control condition, which is also seen in representative images shown.

      Minor comments:

      • In figure 2D, what allows to say the expression is restricted in macrophages ? Is it the colocalization with SIMU being a macrophage-specific marker?
      • In figure S3B and C, it appears that double NimB1/NimB4 mutants exhibit less spreading than single ones (especially NimB4). Is it the case (statistic significance). If yes what could be the explanation?
      • Several graphs are identical between figure 4 ans S4. It is probably not useful and complicates reading.
      • As TEM images shown in Figure 8B do not lead to quantitative data, I would put it as supplementary file.

      Significance

      This study uses several approaches and models to address the role of NimB1 in efferocytosis. Both In Vitro and In Vivo approaches are proposed. They give insight into the role of this protein with unknown function so far. Some statistical analysis could be performed to improve the clarity of conclusions. One of the important aspects is the secreted nature of NimB1.However, additional approaches could be proposed to confirm this.

      Basic immunologists and cell biologists would be interested in reading this article that highlights the delicate equilibrium between pro and anti-efferocytosis molecules.

      I am an immunologist/cell biologist with expertise in lysosomal catabolism. As I work on mouse models or Human samples, my mastering of drosophila as a model is limited.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This work defines NimB1 protein as a PS binding bridging molecule but with a negative regulatory role in efferocytosis. Specifically, the authors demonstrate via a variety of genetic, cell biological, and other approaches that loss of NimB1 leads to Drosophila macrophages being more adherent to apoptotic targets and engulf them more robustly. The authors also nicely demonstrate that the function of NimB1 differs from NimB4, and the double mutant demonstrating PS-binding yet, distinct roles. Further, the authors show that NimB1 does not affect bacterial phagocytosis.

      Overall, this is a well-done study. The authors have already done a very thorough job addressing the key points and I congratulate the authors.

      My only minor comment is that the authors could try to make the comment better about whether or not such a 'negative regulatory' bridging molecules may exist in other species, and particularly mammals. If so, this is quite novel. The authors refer to CD47 but this is a membrane protein. The other minor comment is whether the authors ever tried express the PS binding domains as a fusion protein - this would provide a more direct evidence for the binding to PS (although the authors do competitive inhibition with Annexin V). This could be commented upon although testing this is not necessary if they have not already done so.

      Significance

      The identification of the negative regulator bridging protein NimB1 is novel and could be broadly interesting to those studying efferocytosis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      • *

      *We thank the reviewers for finding the manuscript enjoyable and well-written, with experiments that were performed well, show solid results and provide useful data for the community. The reviewers have provided meaningful feedback to improve this study. We have addressed the comments point-by-point below. The main text will also be further modified to incorporate new analysis where it has not yet been done. *

      • *

      • *

      2. Description of the planned revisions

      Reviewer 1:

      Summary OTX2 is a pivotal transcription factor that regulates the fate choice between somatic and primordial germ cell (PGC) lineages in early mouse development. In the current study, the authors use in vitro stem cell models to demonstrate that OTX2 mediates this developmental fate decision through controlling chromatin accessibility, whereby OTX2 helps to activate putative enhancers that are associated with somatic fate. By extension, those somatic-associated regulatory regions therefore become inaccessible in cells adopting PGC identity in which Otx2 is downregulated.

      Comments I enjoyed reading this manuscript. The experiments have been carried out well and for the most part the results provide convincing evidence to support the claims and conclusions in the manuscript. I particularly liked the experiments using the inducible Otx2 transgene to examine the acute changes in chromatin accessibility following restoration of OTX2.

      I include some suggestions below to the authors for additional analyses that I feel would further strengthen their study.

      I also felt that the authors focus almost exclusively on the subset of OTX2-bound sites that lose accessibility in the absence of OTX2. But, as they show in several figure panels, these sites tend to be the minority and that most OTX2-occupied sites do not lose accessibility in Otx2-null cells (actually, more sites tend to gain accessibility). I encourage the authors to modify the text and some of the analyses to give a better balance to their study. We are pleased that this reviewer enjoyed our manuscript. As suggested by the reviewer, we included analyses on the regions that are bound by OTX2 but do not show an increase in accessibility (see section 3 reviewer 1 point 6). The text will be expanded to include the new data and to include the description of the subset of OTX2 sites that do not show accessibility changes in the absence of OTX2. We have responded to other points they raised as detailed in the sections below

      • *

      Figure 1: The authors write: "...OTX2 binds mostly to putative enhancers." Whether these distal sites are enhancers is not sufficiently evidenced in the manuscript, but it is important information to collect to support their model of OTX2 function. The authors should strengthen their analysis by examining whether OTX2 peaks are enriched at previously defined enhancer regions.

      We plan to compare OTX2 bound regions with defined lists of enhancers identified in ESCs grown in Serum/LIF (e.g. Whyte et al 2013) and, if available, in 2i/LIF and EpiLCs. We will also analyse publicly available datasets for H3K4me1 (enhancer marker) and H3K27ac (marker of active regulatory regions) at the regions bound by OTX2 in ESCs and EpiLCs.

      Figure 2: I'm still puzzled why the authors did not examine flow-sorted WT+cyto cells?

      *We agree with the reviewer that it would be interesting to examine flow-sorted WT +cyto PGCLCs. Unfortunately, the expression of CD61 and SSEA1 only becomes visible from day 4 of PGCLC differentiation. Therefore, we were not able to isolate PGCLC at day 2 from WT cells differentiated in the presence of cytokines. We then used OTX2-/- cells at day 2 to model PGCLCs. This is based on the assumption that because day 6 Otx2-/- PGCLCs are transcriptionally similar to sorted day 6 WT cells (Zhang, Zhang et al Nature 2018), the same will be true at day 2. We will modify the text in the final version of this manuscript to clarify this point that has also been raised by reviewers 2 and 3. *

      • *

      Figure 3: I would be tempted to put Figure S3A and S3B into Figure 3. It would be better to show all 1246 DARs together, either ordered by OTX2 CT&RUN signal, or presented in two pre-defined groups (OTX2-bound vs unbound). I also suggest that the author show OTX2 signals and ATAC-seq signals for the 3028 DARs that gain accessibility in Otx2-null EpiLCs (this could be added to a supplemental figure).

      Although the analysis has been carried out and the figures have been amended, the main text will be modified in a future updated version of the manuscript to incorporate these results.

      • *

      Figure 3: What is special about the 8% of OTX2-bound site that lose accessibility, versus the 92% of sites that do not?

      *The 8% of the OTX2-bound regions that lose accessibility in the absence of OTX2 appear to be more sensitive to the loss of OTX2. One possible explanation is that the accessibility of the rest of OTX2 bound regions relies on other TFs, such as OCT4, that are expressed in EpiLCs. We will modify the main text to discuss this interesting point raised by the reviewer. *

      Figure 6F: If the 4221 sites are split into those bound by OTX2 versus those that are not (related to Figure 6C) then is there a difference? i.e. are the OTX2-bound sites opening up?

      We separated the 4,221 sites in OTX2 bound and unbound. The result is reported below:

      *Although there is a slight increase in accessibility in the OTX2 bound subset, the average accessibility reaches less than ¼ of the accessibility of these regions when OTX2 is present from day 0 to day 4, while the OTX2 unbound regions do not show an increase in accessibility. Although we can not rule out that a longer treatment with tamoxifen may lead to higher accessibility in the OTX2 bound subset, the dynamics are extremely slower compared to the EpiLC regions where accessibility reaches 50% of the d0-d4 sample in just 1 hour of tamoxifen treatment. *

      • *

      Is there any evidence that OTX2 binds and compacts PGCLC enhancers in somatic cells? I appreciate this is different to the main thrust of the authors' model, but being able to show that OTX2 does not compact these sites lends further support to their preferred model of OTX2 opening sites of somatic lineages.

      *Comparing the ATAC-seq in PGCLCs with ESCs and EpiLCs, we identify a subset of regions that are open in PGCLC only (PGCLC-specific accessible regions, see below). These regions do not show binding of OTX2 in WT EpiLCs or the d0-d2 Tam sample, suggesting that OTX2 does not bind and compact PGCLC-specific enhancers. *

      • *

      PGCLC-specific regions showing high accessibility only in PGCLCs.

      • OTX2 CUT&RUN signal in WT EpiLC, OTX2-ERT2 PGCLCs in presence or absence of Tamoxifen, showing that OTX2 does not bind PGCLC-specific regions even when it is overexpressed in GK15 medium.*

      *These analyses will be incorporated in the manuscript. *

      • *

      Discussion: Have prior studies established a connection between OTX2 and chromatin remodellers that can open chromatin? Or, if not, then perhaps this could be proposed as a line of future research.

      We thank the reviewer for suggesting to amplify the discussion on the possible connection between OTX2 and chromatin remodellers. Although there is no evidence in the literature of a direct interaction between OTX2 and chromatin remodellers, this can not be excluded. The connection might also be indirect: OTX2 is known to interact with OCT4, which in turn interacts and recruits to chromatin the catalytic subunit of the SWI/SNF complex, BRG1. This point will be discussed in a modified version of the manuscript.

      • *

      • *

      Reviewer 2:

      Barbieri and Chambers explore the role of OTX2 on mouse pluripotency and differentiation. To do so, they examine how the chromatin accessibility and OTX2 binding landscape changes across pluripotency, the exit of pluripotency towards formative and primed states, and through to PGCLC/somatic differentiation. The work mostly represents a resource for the community, with possible implications for our understanding of how OTX2 might mediate the germline-soma switch of fates. While the findings of the work are modest, the results seem solid and the manuscript is clear and well-written.

      *We are pleased that this reviewer found our results solid and the manuscript clear. *

      I have some comments as indicated below:

      1. The comparison between Otx2-/- cells in the presence of PGCLC cytokines compared to WT cells in the absence of cytokines seems like it is missing controls to me. I assume the authors wanted to enable homogeneous populations to facilitate their bulk sequencing methods, but it seems to me like they are comparing apples with oranges. It would have been better to have the reciprocal situations (Otx2-/- cells in basal differentiation medium, and WT cells in PGCLC cytokines) with a sorting strategy to better unpick the differences between the presence and absence of Otx2 in the 2 protocols. Having said that, the authors are careful not to draw many comparisons between those populations so I don't think this omission affects their current claims. They should however clarify whether the flow cytometry (Supp Fig2) was used for sorting cells or if all cells were taken for bulk sequencing. *We agree with the reviewer that it would be of interest to compare the PGCLC and somatic population derived from the OTX2-/- cells in GK15 without cytokines with the same populations derived from WT cells differentiated in the presence of cytokines. Our work aims to identify what happens at the stages of PGCLC differentiation when cells are still competent for both germline and somatic differentiation. Previous work from the lab showed that this dual competence is lost after day 2, therefore we focus our attention on this time of differentiation. Unfortunately, the two surface markers characteristics of PGCs (CD61 and SSEA1) are not expressed at day2 and, therefore we are not able to sort PGCLCs derived from OTX2-/- cells in GK15 without cytokines or WT cells differentiated in the presence of cytokines. As recognised by this reviewer, we aimed to obtain two homogenous populations that can model PGCLCs and somatic cells. This is based on data obtained at day 6 when Otx2-/- PGCLCs show a similar transcriptome to sorted day 6 WT cells (Zhang, Zhang et al Nature 2018) and the assumption that the same will be true at day 2. We will clarify that the supplementary Figure 2 is not a sorting strategy. As this point has been raised by reviewers 1 and 2 as well, we will modify the text to clarify the choice and the assumption behind using OTX2-/- cells in the presence of cytokines and WT cells in the absence of cytokines to model PGCLCs and somatic cells respectively. *

      2. *

      Throughout the text, the authors subject cells (WT / Otx2-/- /Otx2ER ) to different protocols to look at accessibility and Otx2 binding, but with no mention of the cell fate differences that occur in these different conditions. For instance, it is unclear to me to which fate the WT cells without PGCLC cytokines go - I presume this is neural but perhaps this is a mixed fate, given that they are in GK15 rather than N2B27. Likewise, the OTX2ER experiments may promote a mixed population between PGCLC/somatic fates, and this is never described. Ideally transcriptomic data would be collected, but failing that, qPCR data should be obtained to examine this more closely.

      *We are planning to generate RT-qPCR data for germ layer markers (ectoderm, endoderm and mesoderm) in WT cells in GK15 without cytokines at day 2, as well as OTX2-ERT2 cells with and without Tamoxifen at day 2 (noTam, d0-d2) and day 4 (no Tam, d0-d4). *

      The authors also state that "OTX2 facilitates Fgf5 transcription' (page10) but provide no transcriptional data to substantiate this claim. Again RT-qPCR would help make this point.

      *We will analyse the level of Fgf5 by RT-qPCR in OTX2-ERT2 EpiLCs treated for 1 hour and 6 hours with Tamoxifen to show the effect of OTX2 on Fgf5 transcription. *

      • *

      It is unclear to me what the 'increase[d] accessibility' (eg abstract final sentence, Figure 3E) really means at the cellular level. Does this indicate that more cells have this site open, and does this have implications for the heterogeneity of cell fates observed? Since the authors are concerned with fate decisions, this seems like an important consideration that should at least be discussed.

      The possibility that the increased accessibility is due to higher heterogeneity in the population is interesting and it will be included in the discussion in a revised version of the manuscript.

      • *

      • *

      Reviewer 3:

      In this manuscript, the authors perform OTX2 CUT&RUN and ATAC-seq in Otx2-null and WT ESCs, EpiLCs and PGCLCs to understand whether the role of OTX2 in restricting mouse germline entry that they previously described (Zhang Nature 2018) mechanistically depends on chromatin remodeling. They identify differentially accessible regions (DARs) between Otx2-null and WT cells at different stages of differentiation and show that many of these are OTX2 bound in WT. They then show using cells expressing OTX2-ER^T2 in Otx2-null Epiblast cells that when OTX2 is moved into the nucleus, the regions that were differentially closed in Otx2-null open within an hour, suggesting chromatin accessibility is directly controlled by OTX2 (rather than indirect effects involving transcription and translation which one would expect to take longer). The scope is narrow, but this is nice work and useful data for the mouse PGC field. However, there are a few places where the data could be strengthened, and the writing is a little confusing in places, for example by stating as fact in early sections what is not proven until later.

      We thank the reviewer for finding our work nice and useful for the mouse PGC field, and for the useful comments to improve the manuscript. We have included new analysis and modified the text as suggested to improve the writing, avoiding early statements that were not fully proven until later in the manuscript. We have responded to other points they raised as detailed below and in the next section.

      • *

      1) "we compared Otx2-/- cells cultured in the presence of PGC-promoting cytokines with wild-type cells cultured in the absence of PGC-promoting cytokines. Under these conditions Otx2-/- cells produce an essentially pure (>90%) CD61+/SSEA1+ population that we refer to as PGCLCs, while wild-type cells yield a cell population from which PGCLCs are absent"

      This is not a controlled comparison since one cannot separate the day 2 effect of cytokines from that of the Otx2 knockout. The manuscript would be strengthened if the authors include WT somatic and PGCLCs from the +cytokine conditions, which could be easily sorted out as shown in Supp. Fig. 2. Ideally they would also include Otx2-null somatic cells, although Supp. Fig. 2 shows those are rare under the conditions considered.

      *This work aimed to analyse early stages of EpiLC to PGCLC differentiation when cells are still competent for both somatic and germline differentiation. This stage has been described previously to be at day 2 of differentiation in GK15 + cytokines (PGCLC differentiation medium, Zhang, Zhang et al, Nature 2018). Unfortunately, CD61 and SSEA1 are not expressed at day 2 of PGCLC differentiation, and they start to be expressed on the cell surface by day 4. Consequently, it is impossible to sort cells at day 2 using the CD61+/SSEA1+ strategy. To overcome this problem, we used WT cells grown in GK15 without cytokines to model a population of somatic cells and OTX2-/- cells grown in GK15+ cytokines to model a homogeneous population of PGCLCs. As explained in a similar point raised by reviewers 2 and 3, we assumed that, as OTX2-/- cells grown in the presence of cytokines are transcriptionally similar to sorted WT cells at day 6 (Zhang, Zhang et al, Nature 2018), OTX2-/- cells at day 2 are similar to their WT counterpart at day 2. The main text will be modified to clarify that we are using homogeneous populations to model both PGCLC and somatic cells and that Figure S2 does not show a sorting strategy. *

      • *

      3) "In ESCs, OTX2 binds We are planning to perform a statistical analysis to ascertain that the small number of DARs bound by OTX2 are or are not bound by chance by OTX2.

      • *

      4) It would be good if the discussion was broadened to include both human and other transcription factors that are involved. How much of these conclusions could one expect to carry over to human or other mammals? There is some work from the Surani lab considering OTX2 in human. One could even look at published ATAC or OTX2 chip-seq data in hPSCs and potentially learn something interesting. Furthermore, there are studies on other transcription factors modulating chromatin accessibility in the decision between germline and somatic cells, for example PRDM1, PRDM14 (refs in e.g. Tang et al Nat Rev Gen 2016) or TFAP2A (at least in human (Chen et al Cell Rep 2019)). Do these factors affect the same genes? Is a coherent picture emerging of their respective roles in germline entry?

      *As suggested by the reviewer, we will discuss the role of OTX2 in human PGCLC formation and include studies on PGC-specific transcription factors concerning changes in chromatin accessibility in germline and somatic cells. This will be included in a revised version of the manuscript. *

      • *

      • *

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1:

      1. Figure 1: The authors report in the methods that they performed OTX2 CUT&RUN in biological duplicates. It would strengthen their results if they showed in Figure S1 some representative data from each replicate separately to show the consistency. As suggested by the reviewer to show consistency between replicates, two representative tracks of the two CUT&RUN replicates at the Tet2 (ESCs) and Fgf5 (EpiLCs) loci have been included in Figure S1A. The corresponding tracks of the average bigwig files are reported in Figure 1E. The main text (page 5) and the figure legends have been amended to incorporate the new panels.

      2. *

      Figure 2: I think it would be helpful to remind the reader here that Otx2 is normally downregulated in PGCs, and that Otx2 expression is maintained (at least initially) in somatic cells. This would help explain the logic behind the choice of samples that were profiled.

      We modified the text with the following sentence, as suggested by the reviewer, emphasising the level of OTX2 in early somatic vs early PGCLCs: “Otx2 expression is rapidly downregulated in the EpiLC to PGCLC transition while its expression is maintained longer in cells entering the somatic lineage [8]*” (page 7). *

      • *

      Figure 2D: I appreciate that the highlighted region at the Tet2 locus is a DAR, but from the genome tracks it looks as though the region still has high accessibility. Are there any other examples to exemplify a more obvious DAR? Additionally, since twice as many DARs gain accessibility in Otx2-null ESCs compared to lose accessibility, why not show examples of these as well? The same is true of EpiLCs. (Or alternatively, provide a good explanation for why not to show these other categories)

      We substituted the Tet2 DAR with a more clear example of ESC DAR located in the Hes1 locus that shows low accessibility in Otx2-/- ESCs versus WT ESCs. Examples of ESC DARs and EpiLC DARs that show higher accessibility in Otx2-/- vs WT cells have been added as new panels 2E (DAR in Pebp4 locus) and 2G (DAR in Tdh locus). We also simplified the panels showing only ATAC-seq tracks in WT and OTX2-/- cells, either ESCs (2D-E) or EpiLCs (G-H). Text and figure legends have been modified to accommodate the changes made in Figure 2.

      • *

      Figure 3: I would be tempted to put Figure S3A and S3B into Figure 3. It would be better to show all 1246 DARs together, either ordered by OTX2 CT&RUN signal, or presented in two pre-defined groups (OTX2-bound vs unbound). I also suggest that the author show OTX2 signals and ATAC-seq signals for the 3028 DARs that gain accessibility in Otx2-null EpiLCs (this could be added to a supplemental figure).

      Figures S3A and S3B have been moved to the main figure. Figure S3A is now part of Figure 3C, where all the 1,246 DARs are shown together, separated into two groups (OTX2-bound and -unbound). Figure S3B is now part of Figure 3F. A new heatmap showing the OTX2 and ATAC-seq signals for the 3028 regions that gain accessibility in Otx2-/- EpiLCs has been added as new Figure S3B. Only 28 out of the 3,028 regions overlap an OTX2 peak as shown in the new Figure S3A. These regions appear to be already open in ESCs (Figure S3C) and they do not fully close when OTX2 is absent. This can be explained by either a) the lack of expression of an OTX2 target gene that represses these regions or b) the continuous expression of a gene that is usually repressed by OTX2 in the transition to EpiLCs. In both cases, OTX2 does not directly repress these regions. Figure legends have been amended to incorporate the new panels. The main text will be modified to incorporate these results.

      • *

      Figure 6: Do the PGCLCs with OTX2 expression have chromatin accessibility profiles similar to somatic cells? Consider adding WT somatic cell data to Figure 6A, which could be an interesting comparison with the Tam d0-d2 samples.

      *The heatmap showing the ATAC-seq signal at the additional OTX2-induced regions in somatic cells has been added to Figure 6A. The data show that the regions induced by OTX2 are not open in somatic cells generated in GK15. One possible explanation is the overexpression of OTX2 induces the opening of neural-associated regions, but neural differentiation is not fully supported in GK15 medium (see reviewer 2, point 3). As suggested by reviewer 2, we will perform RT-qPCR of germ layer markers to analyse the identity of somatic cells grown in GK15 (without cytokines) and somatic cells induced by OTX2 overexpression. *

      • *

      • *

      • *

      Reviewer 2:

      The authors focus solely on the activating role of Otx2 in their data, but given the substantial proportion of DARs that decrease following Otx2 depletion, I presume it is possible that it also has a repressive effect? Either way, this should be discussed.

      *As also suggested by reviewer 1 (point 6), we analysed the accessibility level and the OTX2 signal at the 3,028 regions that gain accessibility in Otx2-/- EpiLCs (new Figure S3A-C). These regions show high accessibility in ESCs suggesting that these are ESC regions that do not close properly in the transition to EpiLCs in the absence of OTX2. OTX2 CUT&RUN show a low to absent signal at these regions, with just 28 regions overlapping EpiLCs DARs that show higher accessibility in Otx2-/- cells, suggesting that OTX2 does not have a direct suppressive effect on them. *

      • *

      The authors state that d2 PGCLCs "show an intermediate position between ESCs and EpiLCs" based on the PCA location. They should be careful to qualify that this is only in the first 2 principal components, because it may well be the case (and is likely) that in other components the PGCLC population is far removed from the pluripotent states.

      • The text has been updated as follows: d2 PGCLCs “show an intermediate position between ESCs and EpiLCs on both PC1 and PC2”.*

      • *

      Reviewer2 Minor Suggestions:

      1. Presumably the regions bound by OTX2 in Tet2, Mycn and Fgf5 (Fig1E) are called enhancers because these are known from existing literature. It would be helpful to cite the relevant references to this in the text for those unfamiliar with these. References (Whyte et al, Cell, 2018 – Tet2 and Mycn, Buecker et al, Cell Stem Cell, 2013, Thomas et al, Mol Cell 2021 – Fgf5) have been added to the text and the figure legends.

      On page 13, the authors say "To determine whether OTX2 expression is essential to maintain chromatin accessibility in somatic cells..." but this does not seem to be what they test because they are using PGCLC medium. Perhaps I misunderstood, but this could be clarified.

      *Expression of OTX2 during the first 2 days of PGCLC differentiation leads to a block of germline differentiation as previously shown in Zhang, Zhang et al, Nature 2018. After 2 days of tamoxifen treatment, cells have acquired somatic fate and cells will undergo somatic differentiation even after tamoxifen is withdrawn after day 2. Nevertheless, we agree with the reviewer that the sentence is of difficult interpretation and we modified the sentence as shown below and as reported in the updated manuscript: “To determine whether OTX2 expression is essential to maintain chromatin accessibility in cells differentiating in the presence of PGC-inducing cytokines after day 2” (page 12). *

      On page 14 the authors claim, "These results indicate that...the partner proteins that OTX2 act alongside differ...". While this may be the case, their results do not substantiate this, it is just speculation. Should be toned down.

      The text has been modified as follows: "These results suggest that...the partner proteins that OTX2 act alongside differ..."

      Page 18, PGCLC differentiation method sections needs to be described as such (ie. Add "For PGCLC differentiation..." before the second paragraph)

      *The text “For PGCLC differentiation” has been added at the beginning of the PGCLC differentiation method section. *

      It would be helpful to indicate time on the protocol schematics (eg Fig4A, 5A, 5D etc) as I had to keep checking the methods to find out how long the full differentiation time-course was.

      *Indication of time has been added to Figures 1, 2, 4, 5 and 6. *

      Since the authors compare between the Tam d0-d2 treatments assessed at d2 versus d4 (Figure5B vs 5E) it would be helpful to make the colourbars the same scale, for both ATAC and Cut&Run datasets.

      *The heatmap in Figure 5B has been modified. The colourbars of Figure 5B and 5E are now using the same scale. *

      • *

      Reviewer 3:

      1) As a minor point related to this, the second sentence is confusing since it kind of sounds like Otx2-/- and WT cells are compared under the same conditions unless one carefully reads the previous sentence.

      The text has been modified to clarify the different medium conditions for WT and OTX2-/- cells, as follows: “In the presence of PGC-inducing cytokines, Otx2-/- cells produce an essentially pure (>90%) CD61+/SSEA1+ population that we refer to as PGCLCs, while wild-type cells differentiated in GK15 medium without cytokines yield a cell population from which PGCLCs are absent” (page 7).

      • *

      2) "This suggests that OTX2 acts as a pioneer TF to regulate the accessibility of enhancers E1, E2 and E3."

      This is from the text corresponding to Fig. 2. That data actually only shows that Otx2-null cells have DARs, so somehow OTX2 affects chromatin accessibility but it could be indirect by controlling transcription of genes that modify chromatin accessibility. It is not until figure 4 that the data suggests that OTX2 directly affects accessibility, perhaps as a pioneer TF.

      The authors continue to make many statements about the direct action of OTX2 before the data supporting this is shown, on which I got hung up as a reader. I suggest the authors edit the manuscript to improve this. E.g. "OTX2 may directly control accessibility at these sites (Figure 3E)." and the fact that in 3E and other figure, it says "DARs increased by OTX2 binding" which at that point is not proven, so would better say "Otx2-null vs WT DARs" or something like that.

      The sentence "This suggests that OTX2 acts as a pioneer TF to regulate..” has been removed from the text (page 9). The sentence “OTX2 may directly control accessibility at these sites” has been modified with “*suggesting that the presence of OTX2 affects accessibility at these sites” (page 9). The sentence “ Together, these results suggest that OTX2 is required to open these chromatin regions” has been modified to “Together, these results suggest that OTX2 is required for the accessibility of these chromatin regions”. *

      The subset of DARs that increase in WT EpiLC and are bound by OTX2 that was called “DARs increased by OTX2 binding” has been renamed as “DARs higher in WT with OTX2 binding”. For consistency, the subset of DARs showing increased accessibility in WT EpiLCs that are not bound by OTX2 are now called “DARs higher in WT without OTX2 binding” (Figure 3, Figure 4, main text and figure legends). We will further revise the manuscript to avoid statements or hypotheses that are not yet supported by data throughout the text.

      • *

      Reviewer 3 – minor comments:

      1) "Comparing wild-type and Otx2-/- ESCs identified 375 differentially accessible regions (DARs) with increased accessibility in wild-type cells, and 743 regions with higher accessibility in Otx2-/- ESCs (Figures 2C). An example of ESC DARs where accessibility is increased in cells expressing OTX2 is the intragenic enhancer of Tet2. Tet2 is expressed at high levels in ESCs but at low levels in EpiLCs."

      The authors compare Otx2-null and WT ESCs then proceed to give an example comparing ESCs to EpiLCs, instead of Otx2-null vs WT ESCs, which is confusing.

      Furthermore, here and in other places the authors describe ESCs as not expressing OTX2. However, they also show CUT&RUN data for OTX2 in ESCs etc, clearly indicating that it is expressed, just lower (otherwise how could one get anything?).

      *We originally chose Tet2 enhancer as an example of the 375 ESC DAR with higher accessibility in WT vs Otx2-/- ESCs as it shows a slightly decreased level of accessibility and OTX2 binding in ESCs. Therefore, the sentence “where accessibility is increased in cells expressing OTX2” refers to WT cells (expressing OTX2) when compared to Otx2-/- cells (OTX2-null). The text has been changed to describe the new panel. The rest of the main text will be checked and modified where appropriate to avoid possible misinterpretations. *

      *We also appreciate that the change in accessibility is not clearly visible in the original Figure 2, as also pointed out by Reviewer 1 (point 6). In the updated Figure 2, we show a region in the Hes1 locus as an example of the 375 ESC DARs. Moreover, we simplified the panels showing ATAC-seq tracks of WT and OTX2-/- ESC (Fig. 2D-E) or EpiLCs (Fig. G-H). *

      2) "In contrast, in EpiLCs, OTX2 binds almost 40% (446 out of 1,246) of the DARs that are more accessible in wild-type than in Otx2-/- cells (Figure 3B-C). Notably, these regions are mainly located distal to genes (91%, Figure 3D), despite the increased fraction of promoter regions bound by OTX2 in EpiLCs (Figure S1A)."

      Are the authors rounding percentages with 2 significant digits, as suggested by the "91%"? If so, 446/1245 ~ 36%, not 40%.

      *The text has been modified from “OTX2 binds almost 40%” to “OTX2 binds 36%”. *

      3) The results in Figure 4 are nice and the real meat of the paper. One suggestion: It would be helpful is Fig. 4B were split up between the 446 and 800 genes instead of showing all 1246, and if the WT control was shown in the same figure as well.

      *Panels with the 446 and 800 regions have been added to Figure 4 instead of the panels with all 1246 regions. WT control has been inserted in Figure 4. The main text and the figure legends have been updated accordingly. *

      4) "Enforced OTX2 expression opens additional somatic regulatory regions" - it would be clearer to say "OTX2 overexpression opens additional somatic regulatory regions", since this is really about DARs between EpiLCs that already express OTX2 and those forced to express higher than WT endogenous levels by the OTX2-ER system?

      We thank the reviewer for their suggestion. The text has been modified (page 12)

      • *

      • *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors perform OTX2 CUT&RUN and ATAC-seq in Otx2-null and WT ESCs, EpiLCs and PGCLCs to understand whether the role of OTX2 in restricting mouse germline entry that they previously described (Zhang Nature 2018) mechanistically depends on chromatin remodeling. They identify differentially accessible regions (DARs) between Otx2-null and WT cells at different stages of differentiation and show that many of these are OTX2 bound in WT. They then show using cells expressing OTX2-ER^T2 in Otx2-null Epiblast cells that when OTX2 is moved into the nucleus, the regions that were differentially closed in Otx2-null open within an hour, suggesting chromatin accessibility is directly controlled by OTX2 (rather than indirect effects involving transcription and translation which one would expect to take longer). The scope is narrow, but this is nice work and useful data for the mouse PGC field. However, there are a few places where the data could be strengthened, and the writing is a little confusing in places, for example by stating as fact in early sections what is not proven until later.

      Major Comments:

      1. "we compared Otx2-/- cells cultured in the presence of PGC-promoting cytokines with wild-type cells cultured in the absence of PGC-promoting cytokines. Under these conditions Otx2-/- cells produce an essentially pure (>90%) CD61+/SSEA1+ population that we refer to as PGCLCs, while wild-type cells yield a cell population from which PGCLCs are absent"

      This is not a controlled comparison since one cannot separate the day 2 effect of cytokines from that of the Otx2 knockout. The manuscript would be strengthened if the authors include WT somatic and PGCLCs from the +cytokine conditions, which could be easily sorted out as shown in Supp. Fig. 2. Ideally they would also include Otx2-null somatic cells, although Supp. Fig. 2 shows those are rare under the conditions considered.

      As a minor point related to this, the second sentence is confusing since it kind of sounds like Otx2-/- and WT cells are compared under the same conditions unless one carefully reads the previous sentence. 2. "This suggests that OTX2 acts as a pioneer TF to regulate the accessibility of enhancers E1, E2 and E3."

      This is from the text corresponding to Fig. 2. That data actually only shows that Otx2-null cells have DARs, so somehow OTX2 affects chromatin accessibility but it could be indirect by controlling transcription of genes that modify chromatin accessibility. It is not until figure 4 that the data suggests that OTX2 directly affects accessibility, perhaps as a pioneer TF.

      The authors continue to make many statements about the direct action of OTX2 before the data supporting this is shown, on which I got hung up as a reader. I suggest the authors edit the manuscript to improve this. E.g. "OTX2 may directly control accessibility at these sites (Figure 3E)." and the fact that in 3E and other figure, it says "DARs increased by OTX2 binding" which at that point is not proven, so would better say "Otx2-null vs WT DARs" or something like that. 3. "In ESCs, OTX2 binds <10% (30 out of 375) of DARs that are more accessible in wild-type cells than in Otx2-/- cells (Figure 3A), suggesting that accessibility of ESC DARs is directly due to OTX2 in a small subset of DARs."

      When a small number of DARs are OTX2 bound, it does not necessarily suggest that that small set is directly affected by OTX2. It could just mean no DARs are controlled directly by OTX2 and then some are bound by chance by OTX2. Some appropriate statistical null hypotheses about the occurrence of OTX2 motifs might help to see if 10% is more than chance. 4. It would be good if the discussion was broadened to include both human and other transcription factors that are involved. How much of these conclusions could one expect to carry over to human or other mammals? There is some work from the Surani lab considering OTX2 in human. One could even look at published ATAC or OTX2 chip-seq data in hPSCs and potentially learn something interesting. Furthermore, there are studies on other transcription factors modulating chromatin accessibility in the decision between germline and somatic cells, for example PRDM1, PRDM14 (refs in e.g. Tang et al Nat Rev Gen 2016) or TFAP2A (at least in human (Chen et al Cell Rep 2019)). Do these factors affect the same genes? Is a coherent picture emerging of their respective roles in germline entry?

      Minor comments:

      1. "Comparing wild-type and Otx2-/- ESCs identified 375 differentially accessible regions (DARs) with increased accessibility in wild-type cells, and 743 regions with higher accessibility in Otx2-/- ESCs (Figures 2C). An example of ESC DARs where accessibility is increased in cells expressing OTX2 is the intragenic enhancer of Tet2. Tet2 is expressed at high levels in ESCs but at low levels in EpiLCs."

      The authors compare Otx2-null and WT ESCs then proceed to give an example comparing ESCs to EpiLCs, instead of Otx2-null vs WT ESCs, which is confusing. Furthermore, here and in other places the authors describe ESCs as not expressing OTX2. However, they also show CUT&RUN data for OTX2 in ESCs etc, clearly indicating that it is expressed, just lower (otherwise how could one get anything?). 2. "In contrast, in EpiLCs, OTX2 binds almost 40% (446 out of 1,246) of the DARs that are more accessible in wild-type than in Otx2-/- cells (Figure 3B-C). Notably, these regions are mainly located distal to genes (91%, Figure 3D), despite the increased fraction of promoter regions bound by OTX2 in EpiLCs (Figure S1A)."

      Are the authors rounding percentages with 2 significant digits, as suggested by the "91%"? If so, 446/1245 ~ 36%, not 40%. 3. The results in Figure 4 are nice and the real meat of the paper. One suggestion: It would be helpful is Fig. 4B were split up between the 446 and 800 genes instead of showing all 1246, and if the WT control was shown in the same figure as well. 4. "Enforced OTX2 expression opens additional somatic regulatory regions" - it would be clearer to say "OTX2 overexpression opens additional somatic regulatory regions", since this is really about DARs between EpiLCs that already express OTX2 and those forced to express higher than WT endogenous levels by the OTX2-ER system?

      Significance

      Also see summary. Understanding what restricts cells to germline vs somatic lineages is an important question. By providing functional data showing that OTX2 directly controls chromatin accessibility, the authors add an important layer of understanding to their previous finding that OTX2 plays a key role in preventing mouse germline entry. The use of their previously established OTX2-null cells expressing OTX2-ER to rapidly induce nuclear OTX2 in a mutant background or the most part makes their experiments elegant and convincing. In focusing on the role of one gene in one event in one species, it is specialized and narrow in scope and will mostly be of interest to experts in the field, but there is nothing wrong with that.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Barbieri and Chambers explore the role of OTX2 on mouse pluripotency and differentiation. To do so, they examine how the chromatin accessibility and OTX2 binding landscape changes across pluripotency, the exit of pluripotency towards formative and primed states, and through to PGCLC/somatic differentiation. The work mostly represents a resource for the community, with possible implications for our understanding of how OTX2 might mediate the germline-soma switch of fates. While the findings of the work are modest, the results seem solid and the manuscript is clear and well-written. I have some comments as indicated below:

      • The comparison between Otx2-/- cells in the presence of PGCLC cytokines compared to WT cells in the absence of cytokines seems like it is missing controls to me. I assume the authors wanted to enable homogeneous populations to facilitate their bulk sequencing methods, but it seems to me like they are comparing apples with oranges. It would have been better to have the reciprocal situations (Otx2-/- cells in basal differentiation medium, and WT cells in PGCLC cytokines) with a sorting strategy to better unpick the differences between the presence and absence of Otx2 in the 2 protocols. Having said that, the authors are careful not to draw many comparisons between those populations so I don't think this omission affects their current claims. They should however clarify whether the flow cytometry (Supp Fig2) was used for sorting cells or if all cells were taken for bulk sequencing.
      • The authors focus solely on the activating role of Otx2 in their data, but given the substantial proportion of DARs that decrease following Otx2 depletion, I presume it is possible that it also has a repressive effect? Either way, this should be discussed.
      • Throughout the text, the authors subject cells (WT / Otx2-/- /Otx2ER ) to different protocols to look at accessibility and Otx2 binding, but with no mention of the cell fate differences that occur in these different conditions. For instance, it is unclear to me to which fate the WT cells without PGCLC cytokines go - I presume this is neural but perhaps this is a mixed fate, given that they are in GK15 rather than N2B27. Likewise, the OTX2ER experiments may promote a mixed population between PGCLC/somatic fates, and this is never described. Ideally transcriptomic data would be collected, but failing that, qPCR data should be obtained to examine this more closely.
      • The authors also state that "OTX2 facilitates Fgf5 transcription' (page10) but provide no transcriptional data to substantiate this claim. Again RT-qPCR would help make this point.
      • The authors state that d2 PGCLCs "show an intermediate position between ESCs and EpiLCs" based on the PCA location. They should be careful to qualify that this is only in the first 2 principal components, because it may well be the case (and is likely) that in other components the PGCLC population is far removed from the pluripotent states.
      • It is unclear to me what the 'increase[d] accessibility' (eg abstract final sentence, Figure 3E) really means at the cellular level. Does this indicate that more cells have this site open, and does this have implications for the heterogeneity of cell fates observed? Since the authors are concerned with fate decisions, this seems like an important consideration that should at least be discussed.

      Minor Suggestions:

      • Presumably the regions bound by OTX2 in Tet2, Mycn and Fgf5 (Fig1E) are called enhancers because these are known from existing literature. It would be helpful to cite the relevant references to this in the text for those unfamiliar with these.
      • On page 13, the authors say "To determine whether OTX2 expression is essential to maintain chromatin accessibility in somatic cells..." but this does not seem to be what they test because they are using PGCLC medium. Perhaps I misunderstood, but this could be clarified.
      • On page 14 the authors claim, "These results indicate that...the partner proteins that OTX2 act alongside differ...". While this may be the case, their results do not substantiate this, it is just speculation. Should be toned down.
      • Page 18, PGCLC differentiation method sections needs to be described as such (ie. Add "For PGCLC differentiation..." before the second paragraph)
      • It would be helpful to indicate time on the protocol schematics (eg Fig4A, 5A, 5D etc) as I had to keep checking the methods to find out how long the full differentiation time-course was.
      • Since the authors compare between the Tam d0-d2 treatments assessed at d2 versus d4 (Figure5B vs 5E) it would be helpful to make the colourbars the same scale, for both ATAC and Cut&Run datasets.

      Significance

      The study examines the binding of OTX2 and subsequent chromatin accessibility in pluripotent, primed and differentiated (PGCLC/Somatic) cell states, including through Otx2-/- cell lines and temporally-controlled exogenous expression of Otx2. As such, it represents a valuable resource into the potential direct targets of Otx2 and their change in accessibility state across cell types. The work is likely to be of interest to those working on understanding the exit of pluripotency, gene regulatory networks, and chromatin remodelling. My expertise is in cell fate decisions, pluripotency regulation and PGC(LC) differentiation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      OTX2 is a pivotal transcription factor that regulates the fate choice between somatic and primordial germ cell (PGC) lineages in early mouse development. In the current study, the authors use in vitro stem cell models to demonstrate that OTX2 mediates this developmental fate decision through controlling chromatin accessibility, whereby OTX2 helps to activate putative enhancers that are associated with somatic fate. By extension, those somatic-associated regulatory regions therefore become inaccessible in cells adopting PGC identity in which Otx2 is downregulated.

      Comments

      I enjoyed reading this manuscript. The experiments have been carried out well and for the most part the results provide convincing evidence to support the claims and conclusions in the manuscript. I particularly liked the experiments using the inducible Otx2 transgene to examine the acute changes in chromatin accessibility following restoration of OTX2.

      I include some suggestions below to the authors for additional analyses that I feel would further strengthen their study.

      I also felt that the authors focus almost exclusively on the subset of OTX2-bound sites that lose accessibility in the absence of OTX2. But, as they show in several figure panels, these sites tend to be the minority and that most OTX2-occupied sites do not lose accessibility in Otx2-null cells (actually, more sites tend to gain accessibility). I encourage the authors to modify the text and some of the analyses to give a better balance to their study.

      1. Figure 1: The authors report in the methods that they performed OTX2 CUT&RUN in biological duplicates. It would strengthen their results if they showed in Figure S1 some representative data from each replicate separately to show the consistency.
      2. Figure 1: The authors write: "...OTX2 binds mostly to putative enhancers." Whether these distal sites are enhancers is not sufficiently evidenced in the manuscript, but it is important information to collect to support their model of OTX2 function. The authors should strengthen their analysis by examining whether OTX2 peaks are enriched at previously defined enhancer regions.
      3. Figure 2: I think it would be helpful to remind the reader here that Otx2 is normally downregulated in PGCs, and that Otx2 expression is maintained (at least initially) in somatic cells. This would help explain the logic behind the choice of samples that were profiled. That said, I'm still puzzled why the authors did not examine flow-sorted WT+cyto cells?
      4. Figure 2D: I appreciate that the highlighted region at the Tet2 locus is a DAR, but from the genome tracks it looks as though the region still has high accessibility. Are there any other examples to exemplify a more obvious DAR? Additionally, since twice as many DARs gain accessibility in Otx2-null ESCs compared to lose accessibility, why not show examples of these as well? The same is true of EpiLCs. (Or alternatively, provide a good explanation for why not to show these other categories)
      5. Figure 2: the authors write: "This suggests that OTX2 acts as a pioneer TF...". However, at this point in the manuscript, there is no evidence to support that OTX2 might have pioneer activity. I think this claim would be better suited to later in the manuscript, or in the discussion, following the finding that reintroduction of OTX2 can induce chromatin accessibility at previously closed sites.
      6. Figure 3: I would be tempted to put Figure S3A and S3B into Figure 3. It would be better to show all 1246 DARs together, either ordered by OTX2 CT&RUN signal, or presented in two pre-defined groups (OTX2-bound vs unbound). I also suggest that the author show OTX2 signals and ATAC-seq signals for the 3028 DARs that gain accessibility in Otx2-null EpiLCs (this could be added to a supplemental figure).
      7. Figure 3: What is special about the 8% of OTX2-bound site that lose accessibility, versus the 92% of sites that do not?
      8. Figure 6: Do the PGCLCs with OTX2 expression have chromatin accessibility profiles similar to somatic cells? Consider adding WT somatic cell data to Figure 6A, which could be an interesting comparison with the Tam d0-d2 samples.
      9. Figure 6F: If the 4221 sites are split into those bound by OTX2 versus those that are not (related to Figure 6C) then is there a difference? i.e. are the OTX2-bound sites opening up?
      10. Is there any evidence that OTX2 binds and compacts PGCLC enhancers in somatic cells? I appreciate this is different to the main thrust of the authors' model, but being able to show that OTX2 does not compact these sites lends further support to their preferred model of OTX2 opening sites of somatic lineages.
      11. Discussion: Have prior studies established a connection between OTX2 and chromatin remodellers that can open chromatin? Or, if not, then perhaps this could be proposed as a line of future research.

      Significance

      Strengths

      The results presented provide a careful dissection of the role of OTX2 in controlling chromatin accessibility in different stages of pluripotent to somatic and PGC fates. The authors do a good job of revealing the stage-specific differences in OTX2 occupancy and chromatin accessibility as well as the different responses following the acute reintroduction of OTX2.

      Limitations

      I felt that the authors could present/discuss a bit more on alternative possibilities and models, as it would help the reader to better understand why they favour one model over other ones, and presenting these other possibilities could also provide more support for their preferred model.

      Whether OTX2 is binding to putative enhancers is inferred but could be evidenced more strongly, as that is important for their model.

      Advance

      This study provides key information to understand the mechanisms of OTX2 function in cell fate choice. Similar functions have been shown in other contexts for other transcription factors, but this is a nicely done study and adds to our understanding of how transcription factors function in early development to direct cell-fate decisions.

      Expertise

      My field of expertise lies in the gene regulatory control of early developmental decisions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The study investigates the relationship between replication timing (RT) and transcription. While there is evidence that transcription can influence RT, the underlying mechanisms remain unclear. To address this, the authors examined a single genomic locus that undergoes transcriptional activation during differentiation. They engineered the Pln locus by inserting promoters of varying strengths to modulate transcription levels and assessed the impact on replication timing using Repli-seq. Key Findings: • Figure 1C and 1D: The data show that higher transcription levels correlate with an advanced RT, suggesting that transcriptional activity influences replication timing. • Figure 2: To determine whether transcription alone is sufficient to alter RT, the authors inserted an hPGK reporter at different genomic locations. However, given the findings in Figure 1, which suggest that this is not the primary mechanism, • Figure 3: The authors removed the marker to examine whether the observed effects were due to the promoter-driven Pln locus, which has significantly larger then the marker. • Figure 4: The study explores the effect of increased doxycycline (Dox) treatment at the TRE (tetracycline response element), further supporting the role of transcription in RT modulation. • Figure 5: The findings demonstrate that Dox-induced RT advancement occurs rapidly, is reversible, and correlates with transcription levels, reinforcing the hypothesis that transcription plays a direct role in influencing replication timing. • Figure 6. Shows that during differentiation transcription of Pln is not required for RT advancement.

      Overall, the study presents a compelling link between transcription and replication timing, though some experimental choices warrant further clarification. I have no major comments.

      __Minor Comments: __Overall, the results are convincing, and the study appears to be well-conducted. In Figure 2, the authors use the hPGK promoter. However, it is unclear why they did not use the constructs from the previous experiments. Given that the hPGK promoter did not advance RT in Figure 1, the results in Figure 2 may not be entirely unexpected.

      We took advantage of previously published cell lines using a PiggyBac Vector designed to pepper the reporter gene at random sites throughout the genome; the point of the experiment was to acquire supporting evidence for the hypothesis that any vector with its selectable marker driven by the hPGK promoter will not advance RT no matter where it is inserted. Since there are reports concluding that transcription per se is sufficient to advance RT, it was important to confirm that there was nothing unique about the particular vector or locus into which we inserted our panel of vectors.

      ACTION DONE: We have now added the following sentence to the results describing this experiment: “____By analyzing RT in these lines, we could evaluate the effect of a different hPGK vector on RT when integrated at many different chromosomal sites. “

      Additionally, the study does not formally exclude the possibility that Pln protein expression itself influences RT. In Figure 1, readthrough transcription at the Pln locus could potentially drive protein expression. It would be useful to know whether the authors address this point in the discussion.

      NOT DONE FOR NEED OF CLARIFICATION: It is unclear why a secreted neural growth factor would have a direct effect on replication timing in embryonic stem cells and, in particular, only in cis (remember there is a control allele that is unaffected). We would be happy to address this in the Discussion if we understood the reviewers’ hypothesis. We cannot respond to this comment without understanding the hypothesis being tested as we do not know how a secreted protein could affect the RT of one allele without affecting the other.

      Regarding the mechanism, if transcription across longer genomic regions contributes to RT changes, transcription-induced could DNA supercoiling play a role. For instance, could negative supercoiling generated by active transcription influence replication timing?

      Yes, many mechanisms are possible.

      ACTION DONE: ____We have added the following sentence to the discussion, referencing a seminal paper on that topic by Nick Gilbert: “ ____For example, long transcripts could remodel a large segment of chromatin, possibly by creating domains of DNA supercoiling (Naughton et. al., 2013____).____”

      It remains puzzling why Pln transcription does not contribute to replication timing during differentiation. Is there any evidence of chromatin opening during this process? For example, are ATAC-seq profiles available that could provide insights into chromatin accessibility changes during differentiation?

      We thank the reviewer for asking this as we should have mentioned something very important here. Lack of necessity for transcription implies that independent mechanisms are functioning to elicit the RT switch. In other work (Turner et. al., bioRxiv, provisionally accepted to EMBO J.), we have shown that specific cis elements (ERCEs) can function to maintain early replication in the absence of transcription.

      ACTION DONE: We now explicitly state in the Discussion: “____This is not surprising, given that ERCEs can maintain early RT in the absence of transcription (Turner, bioRxiv).”

      ACTION TO BE DONE SOON: We will provide a new Figure 6D showing ATAC-seq changes upon differentiation of mESCs to mNPCs and their location relative to the promoter/enhance deletion. As you will see, there is an ATAC-seq site that appears during differentiation, upstream of the deletion. We will hypothesize in the revised manuscript that these are the elements that drive the RT switch and that future studies need to investigate that hypothesis. We have also added the following sentences to the discussion after the sentence above, stating: “____In fact, new sites of open chromatin, consistent with ERCEs appear outside of the deleted Ptn transcription control elements after differentiation (soon to be revised Figure 6D). The necessity and sufficiency of these sites to advance RT independent of transcription will be important to follow up.”

      We also have preliminary data that are part of a separate project in the lab so they are not ready for publication, but are directly relevant to the reviewer’s question. This data shows evidence for a region upstream of the Ptn promoter/enhancer deletion described in Figure 6 that, when deleted, DOES have an effect on the RT switch during differentiation. This deletion overlaps an ATAC-seq site we will show in the new figure 6D.

      Reviewer #1 (Significance (Required)):

      This is a compelling basic single-locus study that systematically compares replication timing (RT) and transcription dynamics while measuring several key parameters of transcription.

      My relevant expertise lies in transcriptional regulation and understanding how noncoding transcription influences local chromatin and gene expression.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In the manuscript entitled: Transcription can be sufficient, but is not necessary, to advance replication timing", the authors use as they state a "reductionist approach" to address a long-standing question in the replication field on what level the process of transcription within a replication domain can alter the underlying replication timing of this domain. The authors use an elegant hybrid mouse embryonic stem cell line to discriminate the two allelic copies and focus on a specific replication domain harboring the neuronal Ptn gene that is only expressed upon differentiation. The authors first introduce four different promoters in the locus upstream of Ptn gene that drive expression of small transgenes. Only the promoters with highest transcriptional induction could advance RT. If the promoters are placed in such a way that they drive expression of the 96kb Ptn gene, then also some the weaker promoters can drive RT advancement, suggesting that it is a combination of transcriptional strength and size of the transcribed domain important for RT changes. Using a DOX-inducible promoter, the authors show that this happens very fast (3-6h after transcription induction) and is reversible as removal of DOX leads to slower RT again. Finally, deleting the promoter of Ptn gene and driving cells into differentiation still advances RT, allowing the authors to conclude that "transcription can be sufficient but not necessary to advance replication timing."

      Major comments: Overall, this is a well designed study that includes all necessary controls to support the author's conclusions. I think it is a very interesting system that the authors developed. The weakness of the manuscript is that there is no mechanistic explanation how such RT changes are achieved on a molecular basis. But I'm confident that the system could be indeed used to further dissect the mechanistic basis for the transcription dependence of RT advancements.

      Therefore, I support publication of this manuscript if a few comments below can be addressed.

      1) Figure 4 shows a titration of different DOX concentrations and provides clear evidence that the degree of RT advancement tracks well with the level of transcription. As the doses of DOX are quite high in this experiment, have the authors checked on a global scale to what extent transcription might be deregulated in neighbouring genes or genome-wide?

      The DOX concentration that we use for all experiments other than the titration is 2 µg/ml, which is quite standard. The high concentrations (up to 16µg/ml) are only used in the titration experiments shown in Figure 4 to demonstrate that we have reached a plateau. In fact, we stated in Materials and Methods that high doses of Dox led to cell toxicity. Looking at the transcription datasets, there are no significant changes in transcription below 8µg/ml, a few dozen significant changes at 8 and more such changes at 16µg/ml of DOX. The tables of genome wide RT and transcription are provided in the manuscript for anyone wishing to investigate the effects of Dox on cellular physiology but at the concentration used in all other experiments (2µg/ml) there are no effects on transcription.

      __ACTION DONE: We have now modified the statement in the Materials and Methods to read: “ ____Mild toxicity and changes in genome-wide transcription were observed at 8µg/ml and more so at 16µg/ml”. __

      2) One general aspect is that the whole study is only focused on the one single Ptn replication domain. Could the authors extend this rather narrow view a bit and also show RT data in the neighbouring domains. This would be particularly important for the DOX titration experiment that has the potential to induce transcriptional deregulation (see comment above).

      __ACTION DONE: We have now added to revised Supplemental Figure 4 a zoom out of 10 Mb surrounding the Ptn gene showing no detectable effects on RT at any of the titration concentrations. __

      __ACTION TO BE DONE SOON: To address the generalization of the findings (length and strength matter), we have repeated the ESC to NPC differentiation and performed both Repli-seq and BrU-seq to evaluate RT changes relative to total genomic nascent transcriptional changes. The sequencing reads for this experiment are in our analyst’s hands so we expect this to be ready within a few weeks. We will provide a new Figure 7 comparing genome-wide changes in RT vs. transcription to determine the significance of length and strength of transcription induction to RT advances and the necessity of transcriptional induction for RT advances. We and other laboratories have performed many integrative analyses of RNA-microarray/RNA-seq data vs. RT changes, but not total genomic nascent transcription and not with a focus on the effect of length and strength of transcription. For example, outcomes that would be consistent with our reductionist findings at the Ptn locus would be if we find domains that are advanced for RT with no induction of transcription (transcription not necessary) and little to no regions showing significant induction of transcription without RT advances. __

      3) Figure 5 shows that the full capacity to advance RT upon DOX induction of the Ptn gene is achieved after 3h to 6h of DOX induction, so substantially less than a full cell cycle in mEScs (12h). This result suggests that origin licensing/MCM loading cannot be the critical mechanism to drive the RT change because only a small fraction of the cells has undergone M/G1-phase where origins are starting to get loaded. As a large fraction of mESCs (60-70%) are S-phase cells in an asynchronous population, the mechanism is likely taking place directly in S-phase. Could the authors try to synchronize cells in G1/S using double-thymidine block, then induce DOX for 3h before allowing cells to reenter S-phase and then check replication timing of the domain? This can be compared to an alternative experiment where transcription is only induced for 3h upon release into S-phase. This could provide more mechanistic insights as to whether transcription is sufficient to drive RT changes in G1 versus S-phase cells.

      We agree that the timing of induction is such that it is very likely that alterations in RT can occur during S phase. The reviewer proposes a reasonable experiment that could be done, but it would require a long delay of this publication to develop and validate those synchronization protocols and we do not have personnel at this time to carry out the experiment. This would be a great initiating experiment for someone to pursue the mechanisms by which transcription can advance RT.

      ACTION DONE: We have added the following sentence to the Discussion section on mechanisms: ____The rapid nature of the RT change after induction of transcription suggests that RT changes can occur after the functional loading of inactive MCM helicases onto chromatin in telophase/early G1 (Dimitrova, JCB, 1999; Okuno, EMBO J. 2001; Dimitrova, J. Cell Sci, 2002), and possibly after S phase begins.

      Minor comments: • Figure 1B and Figure 6A. Quality of the genome browser snapshots could be improved and certain cryptic labelling such as "only Basic displayed by default" could be removed

      ACTION DONE: We have modified these figures.

      • The genome browser tracks appear a bit small across the figures and could be visually improved.

      ACTION DONE: We have modified the genome browser tracks to improve their presentation

      • In figure 1E we see an advancement in RT in Ptn gene caused by nearby enhanced Hyg-TK gene expression induced by mPGK promoter. However, in figure 3D we see mPGK promoter has reduced ability to advance RT of Ptn gene. It would be nice to address this discrepancy in the results.

      The reviewer’s point is well taken. We are not sure of the answer. You can see that the transcription is very low in both cases, while the RT shift is greater in one replicate vs. the other.

      ACTION DONE: We have, rather unsatisfactorily, added the following sentence to the results section describing Figure 3. “____We do not know why the mPGK promoter was so poor at driving transcription in this context.”

      Reviewer #2 (Significance (Required)):

      In my point of view, this is an important study that unifies a large amount of literature into a conceptual framework that will be interesting to a broad audience working on the intertwined fields of gene regulation, transcription and DNA replication, as well as cell fate switching and development.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __ In their manuscript, "Transcription can be sufficient, but is not necessary, to advance replication timing," Vouzas et al. take a systematic and reductionist approach to investigate a late-replicating domain on chromosome VI. Here, they examine the effect of transcribing a single gene locus, Pleiotrophin, on replication timing. When inserting or manipulating promoters or transcript lengths using CRISPR-Cas9, replication timing was altered in mESCs as judged by a combination of Repli-Seq, Bru-Seq, and RNA-Seq. Importantly, they found that transcription can be sufficient to advance replication timing depending on the length and strength of the expression of an ectopically transcribed gene. Taken together, the manuscript presents a compelling argument that transcription can advance replication timing but is not necessary for it.

      Major comments • A schematic or conceptual model summarising the major findings of transcription-dependent and independent mechanisms of RT advancement should be included in the discussion to add to the conceptual framework

      NOT DONE: We discussed this at length between the two senior authors and the first author and we do not feel ready to draw a summary model. We do not know what is advancing RT when transcription is induced or not induced, and we are not comfortable choosing one possible model of many. We hope that the added speculations on mechanism in the Discussion will sufficiently convey the future research that we feel needs to be done.

      ACTIONS DONE: In addition to the speculation on mechanism that already was in our Discussion section, we have added: On mechanisms of rapid induction of RT change, we have added to the Discussion: “____The rapid nature of the RT change after induction of transcription suggests that RT changes can occur after the functional loading of inactive MCM helicases onto chromatin in telophase/early G1 (Dimitrova, JCB, 1999; Okuno, EMBO J. 2001; Dimitrova, J. Cell Sci, 2002), and possibly after S phase begins.” And “For example, long transcripts could remodel a large segment of chromatin, possibly by creating domains of DNA supercoiling (Naughton et. al., 2013, PMID ____23416946).____ “ On mechanisms of RT advance in the absence of transcription, we have added the following to the Discussion: “____This is not surprising, given that ERCEs can maintain early RT in the absence of transcription (Turner, bioRxiv). In fact, chromatin features with the properties of ERCEs do appear outside of the deleted Ptn transcription control elements after differentiation (soon to be revised Figure 6C). The necessity and sufficiency of these new chromatin features to advance RT independent of transcription will be important to follow up.”

      • Vouzas et al. spend a substantial part of the manuscript to delve into the requirements to advance RT and even use a Doxycycline-based titration for temporal advancement of RT. Yet, all conclusions come from the use of hybrid-genome mouse embryonic stem cells (mESCs). Therefore, it remains speculative if and whether findings can be generalized to other cell types or organisms. The authors could include another organism/ cell type to strengthen the relevance of their findings to a broader audience, particular as they identified promoters that drive ectopic gene expression without affecting RT. Showcasing this in other model organisms would be of great interest.

      NOT DONE: To set this system up in another cell type or species would take a very long time. We also do not have personnel to carry that approach.

      ACTION TO BE DONE SOON: As an alternative approach that partially addresses this reviewer’s concern, we will provide a new Figure 7 with an analysis of RT changes vs. transcriptional changes when mESCs are differentiated to neural precursor cells. As described above in response to Revier #2s criticism #2, we have repeated the ESC to NPC differentiation and performed both Repli-seq and BrU-seq to evaluate RT changes relative to total genomic nascent transcriptional changes. The sequencing reads for this experiment are in our analyst’s hands so we expect this to be ready within a few weeks. We will compare genome-wide changes in RT vs. transcription to determine the significance of length and strength of transcription induction to RT advances and the necessity of transcriptional induction for RT advances. We and other laboratories have performed many integrative analyses of RNA-microarray/RNA-seq data vs. RT changes, but not total genomic nascent transcription and not with a focus on the effect of length and strength of transcription. For example, outcomes that would be consistent with our reductionist findings at the Ptn locus would be if we find domains that are advanced for RT with no induction of transcription (transcription not necessary) and little to no regions showing significant induction of transcription without RT advances.

      • OPTIONAL: as with the previous point, the authors went to great depth and length to show how ectopic manipulations affect RT changes on a single locus using genome-wide methods. In addition, the manuscript would benefit from the inclusion of other loci, particularly as transcription of the Ptn locus wasn't needed during differentiation to advance RT at all.

      NOT DONE: This rigorous reductionist approach is laborious and to set it up at one gene at a time at additional loci would be a huge effort taking quite a long time.

      ACTION TO BE DONE SOON: (same as response above) As an alternative approach that partially addresses this reviewer’s concern, we will provide a new Figure 7 with an analysis of RT changes vs. transcriptional changes when mESCs are differentiated to neural precursor cells. As described above in response to Reviewer #2s criticism #2, we have repeated the ESC to NPC differentiation and performed both Repli-seq and BrU-seq to evaluate RT changes relative to total genomic nascent transcriptional changes. The sequencing reads for this experiment are in our analyst’s hands so we expect this to be ready within a few weeks. We will compare genome-wide changes in RT vs. transcription to determine the significance of length and strength of transcription induction to RT advances and the necessity of transcriptional induction for RT advances. We and other laboratories have performed many integrative analyses of RNA-microarray/RNA-seq data vs. RT changes, but not total genomic nascent transcription and not with a focus on the effect of length and strength of transcription. For example, outcomes that would be consistent with our reductionist findings at the Ptn locus would be if we find domains that are advanced for RT with no induction of transcription (transcription not necessary) and little to no regions showing significant induction of transcription without RT advances.

      • The same point of Ptn not needing to be transcribed to advance RT of the respective domain, albeit being a very interesting observation, disturbs the flow of the manuscript, as the whole case was built around transcription and this particular locus-containing domain. Maybe one can adapt the storytelling to fit better within the overall framework.

      We would argue that demonstrating induction of Ptn, the only gene in this domain, is sufficient to induce early RT is a logical segway to asking whether, in the natural situation, induction is correlated with advance in RT. Our results show that transcription is sufficient but not necessary, which is expected if there are other mechanisms that regulate RT.

      __ACTION DONE: To make this transition more smooth, we have added the following sentence to the beginning of the results section describing Figure 6: “ ____This raises the question as to whether the natural RT advance that accompanies Ptn induction during differentiation requires Ptn transcription, or whether other mechanisms, such as ERCEs (Sima / Turner) can advance RT independent of transcription. “ __

      ACTION TO BE DONE SOON:____ To finish the work flow in a way that ties length and strength and sufficiency but not necessity in to the theme of natural cellular differentiation, we will provide a new Figure 7 with an analysis of RT changes vs. transcriptional changes when mESCs are differentiated to neural precursor cells, as described above.

      Minor comments • While citations are thorough, some references (e.g., "need to add Wang, Klein, Mol. Cell 2021") are incomplete.

      __ACTION TO BE DONE SOON: We apologize that some references seemed to not be incorporated into the reference manager Mendely. Since we are still planning to add one more figure soon and we will need to add some references for the datasets that will be shown in future Figure 6D, after that draft is ready, we will comb the manuscript for any references that were not entered and correct them. __

      • The text corresponding to Figure 1C could use more explanation for readers not familiar with the depiction of Repli-Seq data.

      ACTION DONE: “____Repli-seq labels nascent DNA with BrdU, followed by flow cytometry to purify cells in early vs. late S phase based on their DNA content, then BrdU-substituted DNA from each of these fractions is immunoprecipitated, sequenced and expressed as a log2 ration of early to late synthesized DNA (log2E/L). BrU-seq labels total nascent RNA, which is then immunoprecipitated an expressed as reads per million per kilobase (RPMK).”

      • Figure 1C needs labelling of the x-axes.

      ACTION DONE: We have now labeled the X axes.

      • Statistical analyses should be used consistently throughout the manuscript and explained in more detail, i.e. significance levels, tests, instead of "Significant differences....calculated using x".

      We used the same analysis for all the Repliseq data and the same analysis for all the Bruseq data. We agree that we did not present this consistently in the figure legends and methods.

      ACTION DONE:____ To correct the confusion we have clarified the statistical methods in the methods section and referred to methods in the figure legends as follows:

      The methods description of statistical significance for RT now reads: “____Statistical significance of RT changes for all windows in each sample, relative to WT, were calculated using RepliPrint (Ryba et al., 2011), with a p-value of 0.01 used as the cut-off for windows with statistically significant differences.”

      The methods description of statistical significance for transcription now reads: “____Differential expression analysis, including the calculation of statistically significant differences in expression, was conducted using the R package DESeq2____. In Figure 1, statistical significance was calculated relative to HTK expression in the parental cell line, which is expected to be zero, since the parental line does not have an HTK insertion. In all other Figures significance was calculated relative to Ptn expression in the parental line, which is expected to be zero, since the parental line does not express Ptn.____”

      The legend to Figure 1C now reads: The red shading indicates 50kb windows with statistically significant differences in RT between WT casteneus and modified 129 alleles, determined as described in Methods.

      The legend to Figure 1E now reads: “The asterisks indicate a significant difference in the levels of HTK expression relative to HTK expression in the parental cell line as described in Methods. ____There are no asterisks for the RT data, as statistical significance was calculated for individual 50kb windows as shown in panel (C).”

      Each time significance is measured in the subsequent legends, it is followed by the phrase “, determined as described in Methods” or “presented as in Figure 1C” or “presented as in Figure 1E” as appropriate.

      __ __ **Referees cross-commenting** __ Comment on Reviewer#1's review__, comment mentioning ATAC-Seq: Another way to look at this could be to investigate for origin usage changes (BrdU-Seq or GLOE-Seq) of chromosome 6 during differentiation.

      NOT DONE: Unfortunately we could not find any studies comparing origin mapping in mESCs and mNPCs.

      Comment on Reviewer#2's review, major comment 3: I do agree with their statement that origin loading cannot be the driver of RT change, as MCM2-7 double hexamer loading is strictly uncoupled from origin firing. Hence, any mechanism responsible for RT advance must happen at the G1/S phase transition or during S-phase, most likely due to the regulated activity of DDK/CDK or the limitation and preferred recruitment of firing factors to early origins. This could be tested through overexpression of said factors.

      NOT DONE: We agree that manipulating these factors would be a reasonable next approach to sort out mechanism. Due to limited resources and personnel, we will not be able to do this in a short period of time. We also argue that these are experiments for the next chapter of the story, likely requiring an entire PhD thesis (or multiple) to sort out.

      ACTION DONE: We have added the following sentence to the Discussion section on mechanisms: ____The rapid nature of the RT change after induction of transcription suggests that RT changes can occur after the functional loading of inactive MCM helicases onto chromatin in telophase/early G1 (Dimitrova, JCB, 1999; Okuno, EMBO J. 2001; Dimitrova, J. Cell Sci, 2002), and possibly after S phase begins.

      Reviewer #3 (Significance (Required)):

      General: This manuscript presents a compelling study investigating the relationship between transcription and replication timing (RT) using a reductionist approach. The authors systematically manipulated transcriptional activity at the Ptn locus to dissect the elements of transcription that influence RT. The study's strengths lie in its rigorous experimental design, clear results, and the reconciliation of seemingly contradictory findings in the existing literature. However, some aspects could be improved, particularly in exploring the mechanistic details of transcription-independent RT regulation at the investigated domain, the generalisability of the findings to other cells/organisms, and enhancing the presentation of certain data (explanation of e.g. Figure 1c, dense figure arrangement, lack of a summary figure illustrating key findings (e.g., correlation between transcription rate, readthrough effects, and RT advancement)).

      Advance: The manuscript directly addresses and reconciles contradictory findings in the literature regarding the effect of ectopic transcription on RT. Previous studies have reported varying effects, with some showing that transcription advances RT (Brueckner et al., 2020; Therizols et al., 2014), while others have shown no effect or only partial effects depending on the insertion site (Gilbert & Cohen, 1990; Goren et al., 2008). The current study conceptually advances the field by systematically testing different promoters and transcript lengths at a single locus (mechanistic insight), demonstrating that the length and strength of transcription, as well as promoter context, influence RT. This presents a unifying concept on how RT can be influenced. The authors also present a tunable system (technical advance) that allows rapid and reversible alterations of RT, which will certainly be useful for future studies and the field.

      Audience: The primary audience will be specialised researchers in the fields of replication timing, epigenetics, and gene regulation. This study may be of interest beyond the specific field of replication timing, such as cancer biology, developmental biology, particularly if a more broader applicability of its tools and concepts can be shown.

      Expertise: origin licensing, origin activation, MCM2-7, yeast and human cell lines

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In their manuscript, "Transcription can be sufficient, but is not necessary, to advance replication timing," Vouzas et al. take a systematic and reductionist approach to investigate a late-replicating domain on chromosome VI. Here, they examine the effect of transcribing a single gene locus, Pleiotrophin, on replication timing. When inserting or manipulating promoters or transcript lengths using CRISPR-Cas9, replication timing was altered in mESCs as judged by a combination of Repli-Seq, Bru-Seq, and RNA-Seq. Importantly, they found that transcription can be sufficient to advance replication timing depending on the length and strength of the expression of an ectopically transcribed gene. Taken together, the manuscript presents a compelling argument that transcription can advance replication timing but is not necessary for it.

      Major comments

      • A schematic or conceptual model summarising the major findings of transcription-dependent and independent mechanisms of RT advancement should be included in the discussion to add to the conceptual framework
      • Vouzas et al. spend a substantial part of the manuscript to delve into the requirements to advance RT and even use a Doxycycline-based titration for temporal advancement of RT. Yet, all conclusions come from the use of hybrid-genome mouse embryonic stem cells (mESCs). Therefore, it remains speculative if and whether findings can be generalized to other cell types or organisms. The authors could include another organism/ cell type to strengthen the relevance of their findings to a broader audience, particular as they identified promoters that drive ectopic gene expression without affecting RT. Showcasing this in other model organisms would be of great interest.
      • OPTIONAL: as with the previous point, the authors went to great depth and length to show how ectopic manipulations affect RT changes on a single locus using genome-wide methods. In addition, the manuscript would benefit from the inclusion of other loci, particularly as transcription of the Ptn locus wasn't needed during differentiation to advance RT at all.
      • The same point of Ptn not needing to be transcribed to advance RT of the respective domain, albeit being a very interesting observation, disturbs the flow of the manuscript, as the whole case was built around transcription and this particular locus-containing domain. Maybe one can adapt the storytelling to fit better within the overall framework.

      Minor comments

      • While citations are thorough, some references (e.g., "need to add Wang, Klein, Mol. Cell 2021") are incomplete.
      • The text corresponding to Figure 1C could use more explanation for readers not familiar with the depiction of Repli-Seq data.
      • Figure 1C needs labelling of the x-axes.
      • Statistical analyses should be used consistently throughout the manuscript and explained in more detail, i.e. significance levels, tests, instead of "Significant differences....calculated using x".

      Referees cross-commenting

      Comment on Reviewer#1's review, comment mentioning ATAC-Seq: Another way to look at this could be to investigate for origin usage changes (BrdU-Seq or GLOE-Seq) of chromosome 6 during differentiation.

      Comment on Reviewer#2's review, major comment 3: I do agree with their statement that origin loading cannot be the driver of RT change, as MCM2-7 double hexamer loading is strictly uncoupled from origin firing. Hence, any mechanism responsible for RT advance must happen at the G1/S phase transition or during S-phase, most likely due to the regulated activity of DDK/CDK or the limitation and preferred recruitment of firing factors to early origins. This could be tested through overexpression of said factors.

      Significance

      General: This manuscript presents a compelling study investigating the relationship between transcription and replication timing (RT) using a reductionist approach. The authors systematically manipulated transcriptional activity at the Ptn locus to dissect the elements of transcription that influence RT. The study's strengths lie in its rigorous experimental design, clear results, and the reconciliation of seemingly contradictory findings in the existing literature. However, some aspects could be improved, particularly in exploring the mechanistic details of transcription-independent RT regulation at the investigated domain, the generalisability of the findings to other cells/organisms, and enhancing the presentation of certain data (explanation of e.g. Figure 1c, dense figure arrangement, lack of a summary figure illustrating key findings (e.g., correlation between transcription rate, readthrough effects, and RT advancement)).

      Advance: The manuscript directly addresses and reconciles contradictory findings in the literature regarding the effect of ectopic transcription on RT. Previous studies have reported varying effects, with some showing that transcription advances RT (Brueckner et al., 2020; Therizols et al., 2014), while others have shown no effect or only partial effects depending on the insertion site (Gilbert & Cohen, 1990; Goren et al., 2008). The current study conceptually advances the field by systematically testing different promoters and transcript lengths at a single locus (mechanistic insight), demonstrating that the length and strength of transcription, as well as promoter context, influence RT. This presents a unifying concept on how RT can be influenced. The authors also present a tunable system (technical advance) that allows rapid and reversible alterations of RT, which will certainly be useful for future studies and the field.

      Audience: The primary audience will be specialised researchers in the fields of replication timing, epigenetics, and gene regulation. This study may be of interest beyond the specific field of replication timing, such as cancer biology, developmental biology, particularly if a more broader applicability of its tools and concepts can be shown.

      Expertise: origin licensing, origin activation, MCM2-7, yeast and human cell lines

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript entitled: Transcription can be sufficient, but is not necessary, to advance replication timing", the authors use as they state a "reductionist approach" to address a long-standing question in the replication field on what level the process of transcription within a replication domain can alter the underlying replication timing of this domain. The authors use an elegant hybrid mouse embryonic stem cell line to discriminate the two allelic copies and focus on a specific replication domain harboring the neuronal Ptn gene that is only expressed upon differentiation. The authors first introduce four different promoters in the locus upstream of Ptn gene that drive expression of small transgenes. Only the promoters with highest transcriptional induction could advance RT. If the promoters are placed in such a way that they drive expression of the 96kb Ptn gene, then also some the weaker promoters can drive RT advancement, suggesting that it is a combination of transcriptional strength and size of the transcribed domain important for RT changes. Using a DOX-inducible promoter, the authors show that this happens very fast (3-6h after transcription induction) and is reversible as removal of DOX leads to slower RT again. Finally, deleting the promoter of Ptn gene and driving cells into differentiation still advances RT, allowing the authors to conclude that "transcription can be sufficient but not necessary to advance replication timing."

      Major comments:

      Overall, this is a well designed study that includes all necessary controls to support the author's conclusions. I think it is a very interesting system that the authors developed. The weakness of the manuscript is that there is no mechanistic explanation how such RT changes are achieved on a molecular basis. But I'm confident that the system could be indeed used to further dissect the mechanistic basis for the transcription dependence of RT advancements. Therefore, I support publication of this manuscript if a few comments below can be addressed.

      1. Figure 4 shows a titration of different DOX concentrations and provides clear evidence that the degree of RT advancement tracks well with the level of transcription. As the doses of DOX are quite high in this experiment, have the authors checked on a global scale to what extent transcription might be deregulated in neighbouring genes or genome-wide?
      2. One general aspect is that the whole study is only focused on the one single Ptn replication domain. Could the authors extend this rather narrow view a bit and also show RT data in the neighbouring domains. This would be particularly important for the DOX titration experiment that has the potential to induce transcriptional deregulation (see comment above).
      3. Figure 5 shows that the full capacity to advance RT upon DOX induction of the Ptn gene is achieved after 3h to 6h of DOX induction, so substantially less than a full cell cycle in mEScs (12h). This result suggests that origin licensing/MCM loading cannot be the critical mechanism to drive the RT change because only a small fraction of the cells has undergone M/G1-phase where origins are starting to get loaded. As a large fraction of mESCs (60-70%) are S-phase cells in an asynchronous population, the mechanism is likely taking place directly in S-phase. Could the authors try to synchronize cells in G1/S using double-thymidine block, then induce DOX for 3h before allowing cells to reenter S-phase and then check replication timing of the domain? This can be compared to an alternative experiment where transcription is only induced for 3h upon release into S-phase. This could provide more mechanistic insights as to whether transcription is sufficient to drive RT changes in G1 versus S-phase cells.

      Minor comments:

      • Figure 1B and Figure 6A. Quality of the genome browser snapshots could be improved and certain cryptic labelling such as "only Basic displayed by default" could be removed
      • The genome browser tracks appear a bit small across the figures and could be visually improved.
      • In figure 1E we see an advancement in RT in Ptn gene caused by nearby enhanced Hyg-TK gene expression induced by mPGK promoter. However, in figure 3D we see mPGK promoter has reduced ability to advance RT of Ptn gene. It would be nice to address this discrepancy in the results.

      Significance

      In my point of view, this is an important study that unifies a large amount of literature into a conceptual framework that will be interesting to a broad audience working on the intertwined fields of gene regulation, transcription and DNA replication, as well as cell fate switching and development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study investigates the relationship between replication timing (RT) and transcription. While there is evidence that transcription can influence RT, the underlying mechanisms remain unclear. To address this, the authors examined a single genomic locus that undergoes transcriptional activation during differentiation. They engineered the Pln locus by inserting promoters of varying strengths to modulate transcription levels and assessed the impact on replication timing using Repli-seq.

      Key Findings:

      • Figure 1C and 1D: The data show that higher transcription levels correlate with an advanced RT, suggesting that transcriptional activity influences replication timing.
      • Figure 2: To determine whether transcription alone is sufficient to alter RT, the authors inserted an hPGK reporter at different genomic locations. However, given the findings in Figure 1, which suggest that this is not the primary mechanism,
      • Figure 3: The authors removed the marker to examine whether the observed effects were due to the promoter-driven Pln locus, which has significantly larger then the marker.
      • Figure 4: The study explores the effect of increased doxycycline (Dox) treatment at the TRE (tetracycline response element), further supporting the role of transcription in RT modulation.
      • Figure 5: The findings demonstrate that Dox-induced RT advancement occurs rapidly, is reversible, and correlates with transcription levels, reinforcing the hypothesis that transcription plays a direct role in influencing replication timing.
      • Figure 6. Shows that during differentiation transcription of Pln is not required for RT advancement.

      Overall, the study presents a compelling link between transcription and replication timing, though some experimental choices warrant further clarification. I have no major comments.

      Minor Comments:

      Overall, the results are convincing, and the study appears to be well-conducted. In Figure 2, the authors use the hPGK promoter. However, it is unclear why they did not use the constructs from the previous experiments. Given that the hPGK promoter did not advance RT in Figure 1, the results in Figure 2 may not be entirely unexpected.

      Additionally, the study does not formally exclude the possibility that Pln protein expression itself influences RT. In Figure 1, readthrough transcription at the Pln locus could potentially drive protein expression. It would be useful to know whether the authors address this point in the discussion.

      Regarding the mechanism, if transcription across longer genomic regions contributes to RT changes, transcription-induced could DNA supercoiling play a role. For instance, could negative supercoiling generated by active transcription influence replication timing?

      It remains puzzling why Pln transcription does not contribute to replication timing during differentiation. Is there any evidence of chromatin opening during this process? For example, are ATAC-seq profiles available that could provide insights into chromatin accessibility changes during differentiation?

      Significance

      This is a compelling basic single-locus study that systematically compares replication timing (RT) and transcription dynamics while measuring several key parameters of transcription.

      My relevant expertise lies in transcriptional regulation and understanding how noncoding transcription influences local chromatin and gene expression.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number____: RC-2024-02806R

      Corresponding author(s): Hamed Jafar-Nejad, Carmen Paradas

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Cho et al uses conditional and inducible conditional mouse models to characterize the function of protein O-glucosyltransferase 1 (POGLUT1), known to cause a type of Limb Girdle Muscular Dystrophy (LGMD-R21), in skeletal muscle satellite cells, differentiation and regeneration. The Authors find that conditional deletion of POGLUT1 in the myogenic progenitors leads to postnatal muscle defects and lethality by postnatal day 30 or so. Muscle progenitors lacking POGLUT1 undergo reduced proliferation and accelerated differentiation, possibly leading to impairment in muscle regeneration. This is supported by an inducible conditional deletion of POGLUT1 in adult satellite cells. Finally, in vitro experiments suggest that POGLUT1 is required for NOTCH pathway activation in myogenic cells and that POGLUT1 could potentially glycosylate specific residues in NOTCH3.

      • We thank the reviewer for the concise summary of the major findings of our manuscript and for her/his comments, which have helped us improve the manuscript.

        Major comments

      1. What is the control used in Figure 1B and other panels? The genotype should be clearly specified instead of saying controls, in all the figures. It will be easier to interpret the data if densitometry of the western blots is provided, normalized to GAPDH levels. Also, is there some POGLUT1 protein remaining in the satellite cells in the cKOs? Data should be shown for an adult time point also (P30 or later), in addition to P0 and P4, to see whether Poglut1 levels are reduced in adult stages in the muscle and satellite cells in the cKOs.
      • As described in the Methods section of the original submission, for all experiments involving Poglut1-cKO, siblings without Poglut1 deletion (Poglut1+/+, Poglut1flox/flox, or Pax7Cre; Poglut1+/+) were used as controls. To address the reviewer's comment, we added this information to the legend of all figures showing Poglut1-cKO
      • In the revised version, the densitometry is provided for all western blot images (Figures 1B and 7A). We added western blot with anti-POGLUT1 on whole muscle extract from P21 animals (Figure 1B), which is the latest stage at which we consistently obtain Poglut1-cKO PAX7 staining and injury studies indicate that at P21 there is a significant reduction in the number of satellite cells. Therefore, isolation of sufficient satellite cells from later stages is not feasible.

        1. In Figure 1C, can the tibia weight be normalized to total body weight instead of tibia length and analyzed? Similarly, can the grip strength measurements in Figure 1G be normalized to total body weight and represented? Grip strength measurements in neonates is tricky; the Authors should clearly explain how this was done. The NMJ defects can be characterized better, especially since the Authors discuss about Agrin in detail.
      • Normalizing tibialis anterior muscle weight to tibia bone length is commonly used and is especially useful when assessing muscle-specific growth or atrophy, as it provides a standardized measure to compare between different mice. To address the reviewer's comment, in the revised manuscript we have provided the tibialis anterior weight (not normalized), tibia length, and then the normalized TA weight (which was also shown in the first submission). Before normalization, the TA weight in cKO showed ~ 56% reduction compared to control (on average). The tibia length did not show a statistically significant difference between cKO and sibling control. After normalization to tibia length, the TA muscle weight still showed a substantial (55%) reduction in cKO mice compared to controls. These data indicate that a reduction in muscle size is the major if not the sole reason for the reduction observed in cKO body size.

      • We reanalyzed the grip strength based on the reviewer's comment and found that when normalized to body weight, the grip strength is not significantly different between cKO and control animals (please see the following figure for the reviewer). Accordingly, we have modified the Results section to avoid any direct suggestion that the muscle function itself is impaired in the cKO animals. Specifically, instead of writing "... suggesting that the muscle function is impaired in the mutant mice" we wrote " ... suggesting reduced muscle strength in the mutant mice". Moreover, at the end of this paragraph, instead of referring to "a severe muscle weakness" we just describe the phenotype as "muscle weakness". Our goal from this paragraph is to suggest that since the animals have much smaller muscles, they are weaker and cannot eat properly; that's why switching to soft food somewhat improves their survival. The observed NMJ defects suggest that there might be functional muscle defects as well. However, to draw conclusions about the muscle function we will need additional experiments, which we believe are beyond the scope of the current manuscript. If the reviewer believes the normalized grip test should be added to the manuscript, we will be happy to do so as a supplementary figure.
      • As stated in the original manuscript, grip test was done starting at P12. Following the Boston University Guideline (https://www.bu.edu/research/ethics-compliance/animal-subjects/animal-care/anesthesia/anesthesia-and-analgesia-neonatal-mice-and-rats-iacuc/; recommended by Jacksons Laboratory when working with neonatal mice), we consider P10 to be the end of the neonatal period in mice. This is also supported by the majority of the papers in literature. For example, in a paper entitled "Quantitative analysis of neonatal skeletal muscle functional improvement in the mouse" (PMID 18310108), the authors show that by P28, most anatomical and gene expression parameters of mouse skeletal muscle are close to adults, and that by P21, these parameters are closer to P28 than the early postnatal mice. Also, they show that the expression of neonatal myosin heavy chain is dramatically reduced between P7 and P14, with another 3-4-fold reduction between P14 and P21. Therefore, for muscle development, similar to many other contexts studied in mice, neonatal period ends between P7 and P14 (most likely P8-P10). Therefore, the grip tests have not been performed on neonatal mice. In the revised manuscript, we have clarified this issue in the Methods section.
      • The NMJ defects were not the main focus of the manuscript and were performed as a standard characterization of this new mouse model. We agree that additional experiments can be performed to better characterize the NMJ defects in this model, but we believe those experiments are beyond the scope of the current manuscript. Also, given the new data and text added to the revised manuscript, we removed the section related to NMJ and agrin from the Discussion to reduce the manuscript length.

        1. What are the small myofibers seen at the corners of the larger myofibers at P21 in the cKOs in Figure 2E? What is the point that the Authors want to conclude from Figure 2C and D? Clearly, if there are fewer satellite cells, Pax7 transcript levels will decrease. In Figure 2D, how are the satellite cell samples normalized between control and cKOs; did they start with equal number of satellite cells or equal amount of satellite cell RNA between control and cKOs? In Figure 2F, representative images for cKOs should be shown? The conclusion from Figure 2F-G is unclear. Since the Authors claim that the weak laminin staining is resolved in the cKOs by P21, why is α-dystroglycan hypoglycosylation seen in the P21 muscle?
      • The structures to which the reviewer is referring are blood vessels. As shown in the following figure generated for the reviewer, staining the muscle sections from control and cKO mice using the endothelial marker CD31 confirms the vascular nature of those structures.

      • The decrease in PAX7+ cells was shown based on antibody staining, but this does not necessarily mean that Pax7 mRNA is similarly reduced. Figure 2C and 2D show that loss of Poglut1 in these cells affects Pax7 expression at the mRNA level. We started with equal amounts of RNA for Figure 2D and have added additional text to describe RNA isolation and RT-PCR in the Methods section of the revised manuscript.

      • Representative images for cKOs are added to the revised Figure 2, as suggested.
      • The conclusion from 2F-G is that a statistically significant reduction is specifically observed in those PAX7+ cells which co-express M-cadherin, which is another well-established satellite cell marker. The data also indicate that some PAX7+ cells persist in the mutant muscle, highlighting the importance of examining the ability of mutant muscle to repair muscle injury, as shown in Figure 4.
      • The glycans on α-dystroglycan mediate its binding to laminin and other extracellular matrix proteins, but laminin expression level is not thought to regulate the degree of α-dystroglycan glycosylation. In other words, laminin staining intensity and the degree of α-dystroglycan are not causally related to each other. What we observed in P21 animals mirrors what we have reported for adult patient biopsies: normal level of extracellular matrix protein collagen VI associated with α-dystroglycan hypoglycosylation. It is worth mentioning that based on our previous report using primary LGMD-R21 patient myoblasts and C2C12 cells treated with a Notch inhibitor, reducing Notch signaling leads to α-dystroglycan hypoglycosylation, likely by altering the differentiation dynamics of the myoblasts (PMID: 27807076).

        1. Representative images should be shown as examples for all time points for both genotypes in Figure 3A. Figure 3H should be represented with statistically significant differences marked clearly. Have the Authors checked whether cell death contributes to the decrease in cultured satellite cells and Pax7+ cells in the cKOs in Figure 3F, G? Are the differences between the control and cKOs in fusion index and Myogenin expression (Figure 3J, K) statistically significant?
      • Representative images are added in a new supplementary figure (Figure S3A) as requested by the reviewer.

      • Thanks for pointing out the lack of statistics in Figure 3H. To address this point and a comment by reviewer 2, we removed those quantifications from the manuscript. Instead, we performed PAX7/MYOD/Ki67 co-staining on cultured cells and quantified the percentage of each cell state based on the expression of these three markers as a more accurate measure of quiescent versus cycling satellite cells, as well as progenitors and precursor cells. The revised quantification and statistical analysis is presented in the revised Figure 3H. This quantification is represented similarly to the data shown in Gattazzo et al 2020.
      • Thanks for bringing up the issue of cell death. To address this point, we performed two rounds of anti-Caspase 3 staining on a limited number of myogenic progenitors that we were able to isolate from cKO and control animals during the revisions. Unfortunately, the stainings did not work. To ensure that we do not rule out the possibility of apoptosis without experimental data, we added the following sentence to the Results section: "These data suggest reduced proliferation of myogenic cells upon loss of Poglut1, although we cannot exclude that cell death also contributes to the reduced cell number."
      • Thanks for pointing out the lack of statistics for these data. We have added additional independent samples to the data presented in 3J, K and found that the differences between the control and cKOs in fusion index and Myogenin expression are indeed statistically significant.

        1. Figure 4 has multiple problems in my opinion. First, P21 is too early a time point to be talking about regeneration, since satellite cells are just transitioning to become quiescent cells at this time (described in Lepper et al, Nature 2009). It is difficult to distinguish the regenerative and neonatal developmental roles of satellite cells in the experiment that the Authors have carried out. Another issue is that the Authors show a decrease in satellite cells in the cKOs; satellite cell function is clearly known to be required for regeneration. Therefore, there is not much novelty that the cKOs exhibit poor regenerative capability. The data shown in Figure 4B-E is more or less a repetition of what has been shown in Figures 2 and 3.
      • After carefully reviewing publications including the one suggested by the reviewer, we found that multiple studies have performed muscle injury experiments at P21 or even younger ages to study regeneration capabilities. Also, please consider that many of our mutant mice are lethal around weaning age which was another reason for selecting P21 as our injury timepoint. To better explain the logic for this choice, we have added the following sentences to the revised manuscript (new text is underlined: "To address this question, we induced muscle injury in mutant and control mice by injecting cardiotoxin (CTX) into TA muscles at P21 (Fig. 4A), an age at which we consistently obtain Poglut1-cKO animals without any dietary changes (Fig. 1E). Importantly, in WT C57BL/6 mice, 51% of PAX7+ cells are reported to be in the quiescent state at P21 (Gattazzo et al., 2020) and the TA muscles of P21 mice exhibit a robust regenerative response to cardiotoxin-induced injury (Lepper et al., 2009)."

      • While the reduction in satellite cells (SCs) in cKOs suggests impaired regeneration, it was essential to confirm this experimentally, as remaining SCs could still compensate. Therefore, it was necessary to assess whether the residual satellite cells could proliferate and contribute to muscle repair. Our results demonstrate that our mutant mice fail to repair muscle, providing evidence that the reduction in PAX7+ cells in Poglut1-cKO mice is functionally significant. The goal of these experiments was not to increase novelty; it was to increase experimental rigor and reproducibility so that we don't draw functional conclusions merely based on tissue staining.
      • While Figures 2 and 3 examine muscle sections and myoblast cultures, Figure 4B-E presents isolated fibers in an ex vivo culture system. This approach allows us to assess satellite cell activation and proliferation in a different injury model. Figure 4 lets us confirm that the proliferation defect observed in the cKO is consistent across multiple experimental conditions. Additionally, we utilized two forms of muscle injury-in vivo and ex vivo-to comprehensively evaluate the function of PAX7+ cells in WT versus mutant muscles. The ex vivo findings from EDL fibers align with our cell culture experiments in Figure 3, further supporting the observed defect in muscle repair. We believe this approach will increase the likelihood of obtaining reproducible data.

        1. In Figure 5C, D, have the Authors checked the Pax7+ cell numbers between the controls and the i-cKOs? Is the difference seen in 1X TAM due to preexisting reduction in satellite cells in the i-cKOs? What is the explanation by the Authors for the large number of Tomato+ fibers seen in the 2X TAM and 3X TAM uninjured muscle (Figure 3C, E)? Figure 5E is difficult to comprehend and should be represented in a clearer manner.
      • New PAX7 staining shows that most of the TOM+ cells (93% in i-cKO, 97% in control) co-express PAX7 (revised Figure 6D and 6E). If preexisting meant "before tamoxifen injection", a preexisting reduction in satellite cells should be highly unlikely, as the Pax7-Cre-ERT2 line has been used by multiple groups to study adult satellite cells, and we have not observed any abnormalities in the Poglut1[flox/flox] strain. Moreover, our control animals harbor both Pax7-Cre-ERT2 and the tdTomato transgenes. The observed difference in 1xTAM most likely reflects spontaneous activation and fusion of these cells following loss of Poglut1, as evidenced by the enhanced appearance of Tomato+ fibers in i-cKO muscles compared to control muscles. At 1xTAM, we can observe a modest increase in Tomato⁺ The reduction becomes more pronounced with repeated tamoxifen, consistent with progressive differentiation of the satellite cell pool and their fusion with myofibers.

      • For Figure 5E (now Figure 6C), we made some changes to make the graph clearer and added a sentence to the figure legend to make it easier to follow.

        1. Densitometry for Figure 6A should be included, since N1 ICD levels seem quite variable in the controls also. POGLUT1 expression and shRNA knockdown efficiency should be shown in Figure 6B. What is the correlation between the NOTCH3 glycosylation with reduced NOTCH pathway activation and satellite cell function? This should be clearly discussed. Why was the NOTCH3 glycosylation assay done in HEK293 cells?
      • In the revised version, we have included densitometry for Figure 6A (now Figure 7A).

      • In the revised version, we show qRT-PCR data indicating that C2C12 cells stably expressing Poglut1 shRNA show ~ 75% decrease in Poglut1 mRNA levels compared to control cells (now Figure 7B), in agreement with our original report of these cells (PMID 21490058). We also performed qRT-PCR for Pax7 and found a significant reduction in Pax7 mRNA levels in Poglut1-knockdown C2C12 cells, further supporting their usage in these signaling assays.
      • As discussed in detail in the Results and Discussion sections of the original submission, reports from our lab and others have provided strong evidence that glycosylation of NOTCH1 and NOTCH2 by POGLUT1 promotes signaling mediated by these receptors. Since it was not clear whether this phenomenon is ligand-specific, in the current manuscript we showed that both DLL1-madiated and JAG1-mediated signaling by NOTCH1 and NOTCH2 require the expression of POGLUT1 in the signal-receiving cells. Since NOTCH3 also plays a key role in muscle stem cell biology (albeit a not so well characterized one), we showed in the current manuscript that reducing POGLUT1 in the NOTCH3-expressing cell also leads to a significant reduction in signaling mediated by this receptor. Moreover, we found that NOTCH3 is broadly glycosylated by POGLUT1. In fact, during the revision, we succeeded in confirming four additional POGLUT1 target sites on the mouse NOTCH3 protein (added to the revised Figure 7D and Supplementary Figure S4). These data suggest that similar to NOTCH1 and NOTCH2, POGLUT1-medaited O-glucosylation of NOTCH3 is required for its signaling. To address the reviewer's comment, in the revised manuscript, we have added the following phrase to the Discussion to address the reviewer's comment: "..., strongly suggesting that O-glucosylation of NOTCH3 by POGLUT1 promotes its ligand-mediated activation."
      • HEK293 cells express robust levels of POGLUT1 and have been used in previous studies for the characterization of POGLUT1-mediated glycosylation of its target proteins. Analysis of overexpressed target proteins in this cell line is used to establish assay conditions and glycopeptide identification parameters and thereby sets the stage to analyze glycosylation of endogenous protein (in this case, mouse NOTCH3) in specific cell types. We note that in cases examined so far, the glycosylation data obtained from overexpressed proteins in HEK293 or other commonly used mammalian cell lines recapitulate the glycosylation of endogenously expressed POGLUT1 targets very well (both in cell culture and in vivo; for example see PMID 27268051).

        Reviewer #1 (Significance (Required)):

        Based on my expertise as a muscle and stem cell biologist, the manuscript is not clearly thought through, with not many novel inferences that one can draw from the data provided. While the manuscript could be informative to muscle biologists and stem cell investigators, several additional experiments are required to better characterize the phenotypes and provide meaningful conclusions from the study. The role of POGLUT1 in the muscle could be of great interest, especially in light of its role in LGMD-R21, as described by some of the Authors previously. Several pieces of data provided in the current manuscript are disjointed, with few connecting links, such as the NMJ characterization, the NOTCH glycosylation data and the regeneration experiments done on P21 neonates. Better quantitation of data is required as well, as detailed below. Overall, the manuscript may be revised to address the specific comments and reconsidered at a suitable journal.

      Response: We hope that our explanations, additional experiments, and changes to the text have helped address the concerns raised by the reviewer.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Poglut1 mutations have been identified in an autosomal recessive form of muscular dystrophy. Poglut1 encodes an O-glucosyltransferase that modifies Notch receptors and ligands but also has other substrates. In mice, null mutations in Poglut1 are embryonic lethal, which has previously precluded the analysis of Poglut1's role in muscle development and regeneration using murine models. To address this limitation, the authors generated a conditional Poglut1 allele and introduced mutations using Pax7Cre or Pax7CreERT2. They characterize the phenotypic consequences of these mutations and further demonstrate that signaling through the Notch1, Notch2, and Notch3 receptors requires Poglut1 using the C2C12 cell culture model.

      Some aspects of the manuscript's description of muscle stem cell behavior and regeneration are not fully up to date. These points should be addressed before publication.

      Specific points

      1) Evidence Supporting Reduced Notch Signaling as the Cause of the Phenotype The comparison of muscle phenotypes observed in other Notch signaling mutations strongly supports the hypothesis that the phenotype is due to reduced Notch signaling. Relevant studies, such as those addressing development (PMID: 17194759, PMID: 17360543) and those focusing on adult muscle and regeneration (PMID: 21989910, PMID: 22069237, PMID: 22045613, PMID: 30862660) should be cited and discussed in the manuscript. Including these references will strengthen the argument and provide a broader context for the findings.

      • We thank the reviewer for this comment. To address this point and a similar comment made by reviewer 3, we have modified the Introduction and Discussion in the revised manuscript. In the Introduction, we have added the following sentences and have cited all 6 references listed by the reviewer: "Loss of function studies for key components of the Notch signaling pathway including Rbpj and Dll1 indicated that disruption of this pathway during muscle development leads to premature differentiation and depletion of muscle progenitor cells (Schuster-Gossler et al., 2007; Vasyutina et al., 2007). In addition, multiple studies have shown that Notch signaling is required in adult mice to prevent the spontaneous or premature differentiation of satellite cells and to maintain a muscle stem cell pool capable of repairing muscle damage (Bjornson et al., 2012; Fukada et al., 2011; Lahmann et al., 2019; Mourikis et al., 2012)." In the third paragraph of the Discussion, we have highlighted the similarities between Poglut1-cKO and i-cKO phenotypes and the phenotypes observed in "animals with germline or conditional loss of various Notch pathway components (Bjornson et al., 2012; Fukada et al., 2011; Lahmann et al., 2019; Mourikis et al., 2012; Schuster-Gossler et al., 2007; Vasyutina et al., 2007)".

        2) Figure 1: NMJ Deficits - Pre- or Postsynaptic? The authors describe the mutant synaptic vesicles as showing a significantly reduced evoked neurotransmitter release (quantal content) compared to controls. This phrasing raises the question: are motor neurons mutated in these animals? It should be clarified why the synaptic vesicles are referred to as "mutant." To my knowledge, Pax7Cre does not recombine in motor neurons, and this discrepancy needs to be addressed. The text should be rephrased to accurately reflect the origins of the observed deficits.

      • We are not aware of any reports on recombination in motor neurons by the Pax-Cre line used in our study and had implied this in the initial submission in the following sentence in Discussion: "Therefore, even if agrin is indeed glycosylated by POGLUT1, loss of Poglut1 with Pax7-Cre is not expected to affect the glycosylation of agrin expressed by motoneurons." [please note that given the new data and text added to the revised manuscript, we removed the paragraph related to NMJ and agrin from the Discussion to reduce the manuscript length.] We agree that referring to synaptic vesicles as "mutant" can be misleading and have revised "mutant synaptic vesicles" to "in the NMJs of Poglut1-cKO LAL muscles". To address the discrepancy, we have added the following sentence to the Results section after describing a reduction in quantal content: "Since the Pax7-Cre strain used in our study is not reported to induce recombination in motor neurons (Murdoch et al., 2012), this presynaptic NMJ defect might be secondary to defects in postsynaptic NMJ abnormalities."

        3) Quantification of Myofibers with Internal Nuclei The statement, "We first quantified the ratio of myofibers with internal nuclei, which is an indication for recent fusion of myoblasts to myofibers," is not entirely accurate. Recent studies, such as PMID: 38569550, provide a more nuanced explanation of this phenomenon. The manuscript should reference this study and update the description to ensure it accurately reflects the current understanding of myofiber internal nuclei as markers of muscle pathology or regeneration.

      • We thank the reviewer for bringing this paper to our attention. In the revised manuscript, we have changed the above sentence to make it aligned with the observations of the paper mentioned by the reviewer: "We first quantified the ratio of myofibers with internal nuclei, recently reported to specifically result from the fusion of embryonic myogenic cells during limb myogenesis and also driven by myocyte-myocyte fusion in the first phase of postnatal muscle regeneration (Collins et al., 2024)." We also added the following sentence, which we believe makes our interpretation more accurate in light of Collins et al, 2024: "These data suggest enhanced differentiation of Poglut1-deficient progenitors into myocytes followed by continued myocyte-myocyte fusion and/or a delay in the peripheral migration of internal nuclei which normally occurs in the perinatal period (Collins et al., 2024)." This way, in the revised manuscript we do not link the presence of internal nuclei with recent fusion of myoblasts anymore. Finally, we also referred to Collins et al 2024 when describing the appearance of internal nuclei in injured muscles 5 days after injury (Figure 4A).

        4) Figure 3H: Cell Identification in Culture

        The use of PAX7+ MYOD− to identify quiescent cells in culture is not accurate. Instead, PAX7+ Ki67− should be used for this purpose. Similarly, PAX7− MYOD+ does not reliably identify differentiating cells. Instead, staining for MYOG should be used to ensure accuracy. The figure and accompanying text should be adjusted accordingly to reflect these updates.

      • To address this issue and a comment by reviewer 1, we performed PAX7/MYOD/Ki67 co-staining on cultured cells and quantified the percentage of each cell state based on the expression of these three markers as a more accurate measure of quiescent versus cycling satellite cells, as well as progenitors and precursor cells. The revised quantification and statistical analysis is presented in the revised Figure 3H. This quantification is represented similarly to the data shown in Gattazzo et al 2020. Based on the reviewer's comment and the results of these quantification, we have modified the Results section as follows (new text is underlined): "These observations indicate that loss of Poglut1 impairs the ability of muscle stem cells to remain in a quiescent state and suggest that the mutant myogenic progenitors might undergo premature differentiation." We then present the fusion index and myogenin staining data to conclude enhanced differentiation.

        5) Description of Satellite Cell Quiescence

        The statement, "About 2-3 weeks after birth, some of the PAX7+ cells generated by active proliferation of embryonic myogenic progenitors enter a quiescent state to generate adult satellite cells," is not entirely correct. The description should be updated based on the findings in PMID: 32763161, which provide a more accurate account of the transition of PAX7+ cells to quiescence and their role in generating adult satellite cells.

      • We thank the reviewer for bringing this paper to our attention. To address this issue and one of the concerns raised by Reviewer 1, we have made the following changes to the corresponding paragraph in the revised version: (A) We removed the statement highlighted by the reviewer. (B) We provided better justification for using P21 mice to assess muscle regeneration by quiescent satellite cells by adding the underlined sentences to this section: "To address this question, we induced muscle injury in mutant and control mice by injecting cardiotoxin (CTX) into TA muscles at P21 (Fig. 4A), an age at which we consistently obtain Poglut1-cKO animals without any dietary changes (Fig. 1E). Importantly, 51% of PAX7+ cells are reported to be in the quiescent state at P21 (Gattazzo et al., 2020) and the TA muscles of P21 mice exhibit a robust regenerative response to cardiotoxin-induced injury (Lepper et al., 2009)." We note that Gattazzo et al 2020 is PMID: 32763161, the paper with a more accurate account of satellite cell quiescence mentioned by the reviewer.

        6) Figure 5: Loss of Quiescence in Satellite Cells

        A hallmark phenotype of mutations in Notch signaling genesis the loss of quiescence in satellite cells when the mutation is introduced in the adult,. The authors should include data on Ki67 and MyoD expression in PAX7+ cells of mice with the Poglut1 mutation introduced in the adult by an analysis of the uninjured muscle. This would provide insight into the maintenance of quiescence in the mutant satellite cells.

      • Based on the reviewer's recommendation, we aimed to assess Ki67 and MyoD expression in PAX7⁺ cells of adult inducible-cKO (i-cKO) mice in the uninjured muscle. However, due to technical limitations including antibody species incompatibility and the number of available fluorescence channels, we were unable to perform simultaneous triple staining combined with tdTomato visualization on the same tissue sections. Instead, we utilized the Tomato reporter as a marker for PAX7⁺ cells, based on new data indicating that 93-97% of TOM⁺ cells co-express PAX7 (Figure 6E). We performed Ki67 and MyoD double staining and quantified the percentage of Tomato⁺ cells that expressed one or both of these markers. Our analysis showed that TOM⁺/Ki67⁻/MyoD⁻ (quiescent) satellite cells were significantly reduced in the i-cKO mice compared to controls, while the proportions of single-positive cells (Ki67⁺ or MyoD⁺) were increased (Figure 6F-G). These findings are consistent with loss of quiescence and increased activation of satellite cells upon loss of POGLUT1 in the adult mice.

      • During the revision, we performed additional experiments (not suggested by the reviewers) to better assess the role of Poglut1 in self-renewal of satellite cells upon injury. First, in a new cohort of i-cKO and control animals, we performed two rounds of tamoxifen injections but skipped the first round of injury and only induced CTX injury after the second round of recombination. Interestingly, analysis of 14 dpi muscles from these animals showed a full repair in both control and i-cKO mice (shown in revised Figure 5B). Second, we inspected the repaired control and i-cKO muscle after one round of recombination + injury for the presence of single tdTomato+ cells next to the repaired myofibers, which would represent the satellite cells formed after the repair. While single tdTomato+ cells were readily seen in the repaired muscle in control mice, we did not see any such cells next to the repaired muscle from i-cKO animals (shown in the revised Fig 5C, n = 4 animals per condition). Together, these new data provide strong evidence that loss of Poglut1 in adult muscle stem cells impairs their ability to return to quiescence and form new satellite cells upon repair of injured muscle.

        7) Figure 6A: Assay of Cleaved NOTCH1 Intracellular Domain

        The text describes that satellite cells isolated from Poglut1-cKO muscles showed a strong reduction in the level of cleaved (active) NOTCH1 intracellular domain compared to control cells. It is unclear whether these satellite cells were freshly isolated and directly assayed or cultured before the assay. Please specify this in the result section.

      • These satellite cells were freshly isolated from whole muscle, and the proteins were extracted directly from the isolated satellite cells without culturing them. This is now included in the Results section related to the revised Figure 7A.

        8) Figure 6: Notch Receptor Overexpression in C2C12 Cells The results section should explicitly explain that the Notch 1-3 receptors were overexpressed or transfected in the C2C12 cells. This detail is essential for understanding the experimental design.

      • Thank you. The C2C12 cells were transfected with mouse NOTCH receptors 1, 2, or 3 to overexpress each receptor. This is now included in the Results section as suggested.

        Reviewer #2 (Significance (Required)):

        The manuscript by Cho et al. demonstrates that the muscular dystrophy phenotype associated with Poglut1 mutations is caused by a Notch signaling deficit in muscle stem cells. Although previous studies had suggested this connection, alternative mechanisms could not be excluded, as Poglut1 glycosylates a multitude of proteins. Overall, this is a thorough and careful analysis of Poglut1's role in muscle development and regeneration, providing valuable insights into the mechanisms underlying this rare muscle disease.

      Response: We sincerely thank the reviewer for her/his positive assessment of our manuscript and for the constructive comments.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      POGLUT1 is a key enzyme essential for the glycosylation and signaling activity of Notch receptors in C2C12 muscle cells. Elegant work from multiple groups has demonstrated the essential role of Notch signaling in maintaining the quiescence of muscle stem cells known as satellite cells. For instance, genetic deletion of Rbpj in tamoxifen-inducible conditional knockout mouse models causes ectopic expression of MyoD, diminished Pax7 expression, and precocious differentiation and fusion of quiescent satellite cells without entering the cell cycle. This is accompanied by the complete loss of tissue regeneration following repetitive muscle injuries. Hypomorphic mutations of POGLUT1 cause limb-girdle muscular dystrophy (type R21), accompanied by a significant reduction of Notch signaling and a decrease in the number of satellite cells in patient muscles.

      Despite the substantial body of evidence establishing the key function of the POGLUT1-Notch-Pax7/MyoD axis in satellite cells, the in vivo function of POGLUT1 in mice has not been studied. In the present study, Cho et al. carefully examined muscle development and regeneration using the POGLUT1 conditional knockout mouse model. The results, at both phenotypic and cellular/molecular levels, align perfectly with the previously established working model of the POGLUT1-Notch-Pax7/MyoD axis. Overall, the experiment is well-designed and executed, and the data are generally of high quality supporting the conclusions. No specific experimental issues was identified.

      Major suggestions include adding references in the discussion section: 1) studies of Rbpj cKO in mice (PMID: 22069237; 22045613), as genetic deletion of POGLUT1 using the same Pax7 driver showed almost identical phenotypes across all levels: phenotype, cellular fate changes, and gene expression changes; 2) studies of Notch-Rbpj targets (Fig. S1, PMID: 29795344), as this paper identified multiple genes encoding collagen V and VI as direct targets of Notch signaling in quiescent satellite cells. These findings are consistent with the authors' observations in Fig. 2e.

      • We thank the reviewer for the suggestion to add these highly relevant papers to the manuscript. 1) To address this comment and the specific point #1 raised by reviewer 2, we have highlighted the similarities between Poglut1-cKO and i-cKO phenotypes and the phenotypes observed in "animals with germline or conditional loss of various Notch pathway components (Bjornson et al., 2012; Fukada et al., 2011; Lahmann et al., 2019; Mourikis et al., 2012; Schuster-Gossler et al., 2007; Vasyutina et al., 2007)" in the Discussion. Mourikis et al 2012 and Bjornson et al 2012 are the two papers that the reviewer has referred to. 2) To address this point, we have added the following sentences to the Discussion: "Notch signaling has been shown to directly activate the transcription of several genes encoding collagen V and VI in muscle stem/progenitor cells, and collagen V expressed by satellite cells plays a key role in the niche to maintain satellite cell quiescence (Baghdadi et al., 2018). Therefore, ECM abnormalities caused by reduced Notch signaling might contribute to the loss of satellite cell quiescence in i-cKO animals as well." Baghdadi et al, 2018 is PMID: 29795344, to which the reviewer has referred.

        Minor suggestions: 1B: "P0" is an awkward term. Does it refer to newborn day 1 or a near full-term embryo?

      • By saying "P0", we are referring to the day the litters are born.

        Fig. 1C: The y-axis label includes an extra space.

      • The extra space is removed in the revised figure.

        Fig. 1D: Provide the N number for each genotype/diet group.

      • The N number has been added (now Figure 1E).

        Gene nomenclature: Use POGLUT1 exclusively for the human protein and Poglut1 for the mouse protein.

      • According to the guidelines of the International Committee for Standardized Genetic Nomenclature for Mice, protein symbols for mice should use all uppercase letters. Therefore, while the gene symbols for mouse and human are different in terms of uppercase versus lowercase usage, the protein symbols for mouse and human are usually identical. Please see the following webpage: https://www.informatics.jax.org/mgihome/nomen/gene.shtml#ps (accessed January 18, 2025).

        Reviewer #3 (Significance (Required)):

        In the present study, Cho et al. carefully examined muscle development and regeneration using the POGLUT1 conditional knockout mouse model. The results, at both phenotypic and cellular/molecular levels, align perfectly with the known working model of the POGLUT1-Notch-Pax7/MyoD axis in satellite cells and muscle regeneration. Overall, the experiment is well-designed and executed, and the data are generally of high quality.

      Response: We sincerely thank the reviewer for the positive evaluation of our manuscript as well as the helpful suggestions.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      POGLUT1 is a key enzyme essential for the glycosylation and signaling activity of Notch receptors in C2C12 muscle cells. Elegant work from multiple groups has demonstrated the essential role of Notch signaling in maintaining the quiescence of muscle stem cells known as satellite cells. For instance, genetic deletion of Rbpj in tamoxifen-inducible conditional knockout mouse models causes ectopic expression of MyoD, diminished Pax7 expression, and precocious differentiation and fusion of quiescent satellite cells without entering the cell cycle. This is accompanied by the complete loss of tissue regeneration following repetitive muscle injuries. Hypomorphic mutations of POGLUT1 cause limb-girdle muscular dystrophy (type R21), accompanied by a significant reduction of Notch signaling and a decrease in the number of satellite cells in patient muscles.

      Despite the substantial body of evidence establishing the key function of the POGLUT1-Notch-Pax7/MyoD axis in satellite cells, the in vivo function of POGLUT1 in mice has not been studied. In the present study, Cho et al. carefully examined muscle development and regeneration using the POGLUT1 conditional knockout mouse model. The results, at both phenotypic and cellular/molecular levels, align perfectly with the previously established working model of the POGLUT1-Notch-Pax7/MyoD axis. Overall, the experiment is well-designed and executed, and the data are generally of high quality supporting the conclusions. No specific experimental issues was identified.

      Major suggestions include adding references in the discussion section: 1) studies of Rbpj cKO in mice (PMID: 22069237; 22045613), as genetic deletion of POGLUT1 using the same Pax7 driver showed almost identical phenotypes across all levels: phenotype, cellular fate changes, and gene expression changes; 2) studies of Notch-Rbpj targets (Fig. S1, PMID: 29795344), as this paper identified multiple genes encoding collagen V and VI as direct targets of Notch signaling in quiescent satellite cells. These findings are consistent with the authors' observations in Fig. 2e.

      Minor suggestions:

      Fig. 1B: "P0" is an awkward term. Does it refer to newborn day 1 or a near full-term embryo?

      Fig. 1C: The y-axis label includes an extra space.

      Fig. 1D: Provide the N number for each genotype/diet group.

      Gene nomenclature: Use POGLUT1 exclusively for the human protein and Poglut1 for the mouse protein.

      Significance

      In the present study, Cho et al. carefully examined muscle development and regeneration using the POGLUT1 conditional knockout mouse model. The results, at both phenotypic and cellular/molecular levels, align perfectly with the known working model of the POGLUT1-Notch-Pax7/MyoD axis in satellite cells and muscle regeneration. Overall, the experiment is well-designed and executed, and the data are generally of high quality.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Poglut1 mutations have been identified in an autosomal recessive form of muscular dystrophy. Poglut1 encodes an O-glucosyltransferase that modifies Notch receptors and ligands but also has other substrates. In mice, null mutations in Poglut1 are embryonic lethal, which has previously precluded the analysis of Poglut1's role in muscle development and regeneration using murine models. To address this limitation, the authors generated a conditional Poglut1 allele and introduced mutations using Pax7Cre or Pax7CreERT2. They characterize the phenotypic consequences of these mutations and further demonstrate that signaling through the Notch1, Notch2, and Notch3 receptors requires Poglut1 using the C2C12 cell culture model.

      Some aspects of the manuscript's description of muscle stem cell behavior and regeneration are not fully up to date. These points should be addressed before publication.

      Specific points

      1. Evidence Supporting Reduced Notch Signaling as the Cause of the Phenotype

      The comparison of muscle phenotypes observed in other Notch signaling mutations strongly supports the hypothesis that the phenotype is due to reduced Notch signaling. Relevant studies, such as those addressing development (PMID: 17194759, PMID: 17360543) and those focusing on adult muscle and regeneration (PMID: 21989910, PMID: 22069237, PMID: 22045613, PMID: 30862660) should be cited and discussed in the manuscript. Including these references will strengthen the argument and provide a broader context for the findings. 2. Figure 1: NMJ Deficits - Pre- or Postsynaptic?

      The authors describe the mutant synaptic vesicles as showing a significantly reduced evoked neurotransmitter release (quantal content) compared to controls. This phrasing raises the question: are motor neurons mutated in these animals? It should be clarified why the synaptic vesicles are referred to as "mutant." To my knowledge, Pax7Cre does not recombine in motor neurons, and this discrepancy needs to be addressed. The text should be rephrased to accurately reflect the origins of the observed deficits. 3. Quantification of Myofibers with Internal Nuclei

      The statement, "We first quantified the ratio of myofibers with internal nuclei, which is an indication for recent fusion of myoblasts to myofibers," is not entirely accurate. Recent studies, such as PMID: 38569550, provide a more nuanced explanation of this phenomenon. The manuscript should reference this study and update the description to ensure it accurately reflects the current understanding of myofiber internal nuclei as markers of muscle pathology or regeneration. 4. Figure 3H: Cell Identification in Culture

      The use of PAX7+ MYOD− to identify quiescent cells in culture is not accurate. Instead, PAX7+ Ki67− should be used for this purpose. Similarly, PAX7− MYOD+ does not reliably identify differentiating cells. Instead, staining for MYOG should be used to ensure accuracy. The figure and accompanying text should be adjusted accordingly to reflect these updates. 5. Description of Satellite Cell Quiescence

      The statement, "About 2-3 weeks after birth, some of the PAX7+ cells generated by active proliferation of embryonic myogenic progenitors enter a quiescent state to generate adult satellite cells," is not entirely correct. The description should be updated based on the findings in PMID: 32763161, which provide a more accurate account of the transition of PAX7+ cells to quiescence and their role in generating adult satellite cells. 6. Figure 5: Loss of Quiescence in Satellite Cells

      A hallmark phenotype of mutations in Notch signaling genesis the loss of quiescence in satellite cells when the mutation is introduced in the adult,. The authors should include data on Ki67 and MyoD expression in PAX7+ cells of mice with the Poglut1 mutation introduced in the adult by an analysis of the uninjured muscle. This would provide insight into the maintenance of quiescence in the mutant satellite cells. 7. Figure 6A: Assay of Cleaved NOTCH1 Intracellular Domain

      The text describes that satellite cells isolated from Poglut1-cKO muscles showed a strong reduction in the level of cleaved (active) NOTCH1 intracellular domain compared to control cells. It is unclear whether these satellite cells were freshly isolated and directly assayed or cultured before the assay. Please specify this in the result section. 8. Figure 6: Notch Receptor Overexpression in C2C12 Cells

      The results section should explicitly explain that the Notch 1-3 receptors were overexpressed or transfected in the C2C12 cells. This detail is essential for understanding the experimental design.

      Significance

      The manuscript by Cho et al. demonstrates that the muscular dystrophy phenotype associated with Poglut1 mutations is caused by a Notch signaling deficit in muscle stem cells. Although previous studies had suggested this connection, alternative mechanisms could not be excluded, as Poglut1 glycosylates a multitude of proteins. Overall, this is a thorough and careful analysis of Poglut1's role in muscle development and regeneration, providing valuable insights into the mechanisms underlying this rare muscle disease.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Cho et al uses conditional and inducible conditional mouse models to characterize the function of protein O-glucosyltransferase 1 (POGLUT1), known to cause a type of Limb Girdle Muscular Dystrophy (LGMD-R21), in skeletal muscle satellite cells, differentiation and regeneration. The Authors find that conditional deletion of POGLUT1 in the myogenic progenitors leads to postnatal muscle defects and lethality by postnatal day 30 or so. Muscle progenitors lacking POGLUT1 undergo reduced proliferation and accelerated differentiation, possibly leading to impairment in muscle regeneration. This is supported by an inducible conditional deletion of POGLUT1 in adult satellite cells. Finally, in vitro experiments suggest that POGLUT1 is required for NOTCH pathway activation in myogenic cells and that POGLUT1 could potentially glycosylate specific residues in NOTCH3.

      Major comments

      1. What is the control used in Figure 1B and other panels? The genotype should be clearly specified instead of saying controls, in all the figures. It will be easier to interpret the data if densitometry of the western blots is provided, normalized to GAPDH levels. Also, is there some POGLUT1 protein remaining in the satellite cells in the cKOs? Data should be shown for an adult time point also (P30 or later), in addition to P0 and P4, to see whether Poglut1 levels are reduced in adult stages in the muscle and satellite cells in the cKOs.
      2. In Figure 1C, can the tibia weight be normalized to total body weight instead of tibia length and analyzed? Similarly, can the grip strength measurements in Figure 1G be normalized to total body weight and represented? Grip strength measurements in neonates is tricky; the Authors should clearly explain how this was done. The NMJ defects can be characterized better, especially since the Authors discuss about Agrin in detail.
      3. What are the small myofibers seen at the corners of the larger myofibers at P21 in the cKOs in Figure 2E? What is the point that the Authors want to conclude from Figure 2C and D? Clearly, if there are fewer satellite cells, Pax7 transcript levels will decrease. In Figure 2D, how are the satellite cell samples normalized between control and cKOs; did they start with equal number of satellite cells or equal amount of satellite cell RNA between control and cKOs? In Figure 2F, representative images for cKOs should be shown? The conclusion from Figure 2F-G is unclear. Since the Authors claim that the weak laminin staining is resolved in the cKOs by P21, why is α-dystroglycan hypoglycosylation seen in the P21 muscle?
      4. Representative images should be shown as examples for all time points for both genotypes in Figure 3A. Figure 3H should be represented with statistically significant differences marked clearly. Have the Authors checked whether cell death contributes to the decrease in cultured satellite cells and Pax7+ cells in the cKOs in Figure 3F, G? Are the differences between the control and cKOs in fusion index and Myogenin expression (Figure 3J, K) statistically significant?
      5. Figure 4 has multiple problems in my opinion. First, P21 is too early a time point to be talking about regeneration, since satellite cells are just transitioning to become quiescent cells at this time (described in Lepper et al, Nature 2009). It is difficult to distinguish the regenerative and neonatal developmental roles of satellite cells in the experiment that the Authors have carried out. Another issue is that the Authors show a decrease in satellite cells in the cKOs; satellite cell function is clearly known to be required for regeneration. Therefore, there is not much novelty that the cKOs exhibit poor regenerative capability. The data shown in Figure 4B-E is more or less a repetition of what has been shown in Figures 2 and 3.
      6. In Figure 5C, D, have the Authors checked the Pax7+ cell numbers between the controls and the i-cKOs? Is the difference seen in 1X TAM due to preexisting reduction in satellite cells in the i-cKOs? What is the explanation by the Authors for the large number of Tomato+ fibers seen in the 2X TAM and 3X TAM uninjured muscle (Figure 3C, E)? Figure 5E is difficult to comprehend and should be represented in a clearer manner.
      7. Densitometry for Figure 6A should be included, since N1 ICD levels seem quite variable in the controls also. POGLUT1 expression and shRNA knockdown efficiency should be shown in Figure 6B. What is the correlation between the NOTCH3 glycosylation with reduced NOTCH pathway activation and satellite cell function? This should be clearly discussed. Why was the NOTCH3 glycosylation assay done in HEK293 cells?

      Significance

      Based on my expertise as a muscle and stem cell biologist, the manuscript is not clearly thought through, with not many novel inferences that one can draw from the data provided. While the manuscript could be informative to muscle biologists and stem cell investigators, several additional experiments are required to better characterize the phenotypes and provide meaningful conclusions from the study. The role of POGLUT1 in the muscle could be of great interest, especially in light of its role in LGMD-R21, as described by some of the Authors previously. Several pieces of data provided in the current manuscript are disjointed, with few connecting links, such as the NMJ characterization, the NOTCH glycosylation data and the regeneration experiments done on P21 neonates. Better quantitation of data is required as well, as detailed below. Overall, the manuscript may be revised to address the specific comments and reconsidered at a suitable journal.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity)

      *This study examines the reorganization of the microtubule (MT) cytoskeleton during early neuronal development, specifically focusing on the establishment of axonal and dendritic polarity. Utilizing advanced microscopy techniques, the authors demonstrate that stable microtubules in early neurites initially exhibit a plus-end-out orientation, attributed to their connection with centrioles. Subsequently, these microtubules are released and undergo sliding, resulting in a mixed-polarity orientation in early neurites. Furthermore, the study elegantly illustrates the spatial segregation of microtubules in dendrites based on polarity and stability. The experiments are rigorously executed, and the microscopy data are presented with exceptional clarity. The following are my primary concerns that warrant further consideration by the authors. *

      1. Potential Bias in the MotorPAINT Assay: Kinesin-1 and kinesin-3 motors exhibit distinct preferences for post-translationally modified (PTM) microtubules. Given that kinesin-1 preferentially binds to acetylated microtubules over tyrosinated microtubules in the MotorPAINT assay, the potential for bias in the results arises. Have the authors explored the use of kinesin-3, which favors tyrosinated microtubules, to corroborate the observed microtubule polarity?

      We thank the reviewer for the careful assessment of our manuscript. As the reviewer noted, it has indeed been demonstrated that kinesin-1 prefers microtubules marked by acetylation (Cai et al., PLoS Biol 2009; Reed et al., Curr Biol 2006) and kinesin-3 prefers microtubules marked by tyrosination in cells (Guedes-Dias et al., Curr Biol 2019; Tas et al., Neuron 2017); however, these preferences are limited in vitro, as demonstrated for example in Sirajuddin et al. (Nat Cell Biol 2014). When motor-PAINT was introduced, it was verified that purified kinesin-1 moves over both acetylated and tyrosinated microtubules with no apparent preference in this assay (Tas et al., Neuron 2017). This could be due to the more in vitro-like nature of the motor-PAINT assay (e.g. some MAPs may be washed away) and/or because of the addition of Taxol during the gentle fixation step, which converts all microtubules into those preferred by kinesin-1. We will clarify this in the text.

      Planned revisions:

      • We will clarify the lack of kinesin-1 selectivity in motor-PAINT assays in the text by adding the following sentence in the main text when introducing motor-PAINT: Importantly, while kinesin-1 has been shown to selectively move on stable, highly-modified microtubules in cells (Cai et al., PLoS Biol 2009; Reed et al., Curr Biol 2006), this is not the case after motor-PAINT sample preparation (Tas et al., Neuron 2017).

      Axon-Like Neurites in Stage 2b Neurons: The observation of axon-like neurites in Stage 2b neurons, characterized by an (almost) uniformly plus-end-out microtubule organization, is noteworthy. Have the authors confirmed this polarity using end-binding (EB) protein tracking (e.g., EB1, EB3) in Stage 2b neurons? Do these neurites display distinct morphological features, such as variations in width? Furthermore, do they consistently differentiate into axons when tracked over time using live-cell EB imaging, rather than the MotorPAINT assay? Could stable microtubule anchoring impede free sliding in these neurites or restrict sliding into them? Investigating microtubule sliding dynamics in these axon-like neurites would provide valuable insights.

      We thank the reviewer for highlighting this finding. Early in development, cultured neurons are known to transiently polarize and have axon-like neurites that may or may not develop into the future axon (Burute et al., Sci Adv 2022; Schelski & Bradke, Sci Adv 2022; Jacobson et al., Neuron 2006). In the absence of certain molecular or physical factors (e.g. Burute et al., Sci Adv 2022; Randlett et al., Neuron 2011), this transient polarization is seemingly random and as such, we do not expect the axon-like neurites in stage 2b neurons to necessarily become the axon. Interestingly, anchoring stable microtubules in a specific neurite using cortically-anchored StableMARK (Burute et al., Sci Adv 2022) or stabilizing microtubules in a specific neurite using Taxol (Witte et al., JCB 2008) has been shown to promote axon formation, but these stable microtubules have slower turnover (perhaps necessitating the use of laser severing as in Yau et al., J Neurosci 2016) and may not always bear EB comets given that EB comets are less commonly seen at the ends of stable microtubules (Jansen et al., JCB 2023).

      Planned revision:

      • We will add additional details to the text to clarify the likely transient nature of this polarization in agreement with previous literature and specify that they are otherwise not morphologically distinct.
      • We will perform additional EB3 tracking experiments in Stage 2b neurons to examine potential differences between neurites.

      *Taxol and Microtubule Sliding: Taxol-induced microtubule stabilization is known to induce the formation of multiple axons. Does taxol treatment diminish microtubule sliding and prevent polarity reversal in minor neurites, thereby facilitating their development into axons? *

      We thank the reviewer for this interesting suggestion. Taxol converts all microtubules into stable microtubules. Given that the initial neurites tend to be of mixed polarity, having stable microtubules pointing the "wrong" way may impede sliding and polarity sorting. Alternatively, since it is precisely the stable microtubules that we see sliding between and within neurites using StableMARK, Taxol may also increase the fraction of microtubules undergoing sliding. Because of this, it is not straightforward to predict how Taxol affects microtubule (re-)orientation and sliding. Preliminary motor-PAINT experiments do suggest that the multiple axons induced by Taxol treatment all contain predominantly plus-end-out microtubules, as expected, and that this is the case from early in development. We will further develop these findings to include them in our manuscript.

      Planned revision:

      • We have already performed some experiments in which we treat neurons with 10 nM Taxol and verify that we observe the formation of multiple axons by motor-PAINT. We will perform additional experiments in which we add this low dose of Taxol to the cells and determine its effect on microtubule sliding dynamics.

      *Sorting of Minus-End-Out Microtubules (MTs) in Developing Axons: Traces of minus-end-out MTs are observed proximal to the soma in both Stage 2b axon-like neurites and Stage 3 developing axons (Figure S4). Does this indicate a clearance mechanism for misoriented MTs during development? If so, is this sorting mechanism specific to axons? Could dynein be involved? Pharmacological inhibition of dynein (e.g., ciliobrevin-D or dynarrestin) could assess whether blocking dynein disrupts uniform MT polarity and axon formation. *

      We indeed think that a clearance mechanism is involved for removing misoriented microtubules in the axon after axon specification. Many motor proteins have been implicated in the polarity sorting of microtubules in neurons and for axons, dynein is believed to play a role (Rao et al., Cell Rep 2017; del Castillo et al., eLife 2015; Schelski & Bradke, Sci Adv 2022). A few of these studies already employed ciliobrevin, noting that it increases the fraction of minus-end-out microtubules in axons (Rao et al., Cell Rep 2017) and reduces the rate of retrograde flow of microtubules in immature neurites (Schelski & Bradke, Sci Adv 2022). These findings are in line with the suggestion of the reviewer. Interestingly, however, as we highlight in the discussion, the motility we observe for polarity reversal is extremely slow on average (~60 nm/minute) because the microtubule end undergoes bursts of motility and periods in which it appears to be tethered and rather immobile. Given that most neurites are non-axon-like, we assume these sliding events are mostly not taking place in axons or axon-like neurites. These events may thus be orchestrated by other motor proteins (e.g. kinesin-1, kinesin-2, kinesin-5, kinesin-6, and kinesin-12) that have been implicated in microtubule polarity sorting in neurons. We do observe retrograde sliding of stable microtubules in these neurites at a median speed of ~150 nm/minute, which is again much slower than typical motor speeds and occurs in almost all neurites and not specifically in one or two axon-like neurites. It is thus unclear which motors may be involved, and it is difficult to predict how any drug treatments would affect microtubule polarity.

      Dissecting the mechanisms of microtubule sliding will require many more experiments and will first require the recruitment and training of a new PhD student or postdoc. Therefore, we feel this falls outside the scope of the current work, which carefully maps the microtubule organization during neuronal development and demonstrates the active polarity reversal of stable microtubules during this process.

      Planned revision:

      • We will expand our discussion of the potential mechanisms facilitating polarity sorting in axons and axon-like neurites in the discussion.

      Impact of Kinesin-1 Rigor Mutants on MT Polarity and Dynamics: Would the expression of kinesin-1 rigor mutants alter MT dynamics and polarity? Validation with alternative methods, such as microtubule photoconversion, would be beneficial.

      It is important to note that StableMARK and its effects on microtubule stability have been extensively verified in the paper in which it was introduced (Jansen et al., JCB 2023). At low expression levels (where StableMARK has a speckled distribution along microtubules), StableMARK does not alter the stability of microtubules (e.g., they are still disassembled in response to serum starvation), alter their post-translational modification status or their distribution in the cell, or impede the transport of cargoes along them. Given that we chose to image neurons with very low expression levels of StableMARK (as inferred by the speckled distribution along microtubules), we expect its effects on the microtubule cytoskeleton to be minimal.

      Planned revision:

      • We will clarify the potential effects of StableMARK in the manuscript. We will perform experiments with photoactivatable tubulin to examine whether we still see microtubules that live for over 2 hours. We will furthermore examine whether it allows us to see microtubule sliding between neurites similar to work performed in the Gelfand lab (Lu et al., Curr Biol 2013).

      *Molecular Motors Driving MT Sliding: Which specific motors drive MT sliding in the soma and neurites? If a motor drives minus-end-out MTs into neurites, it must be plus-end-directed. The discussion should clarify the polarity of the involved motors to strengthen the conclusions. *

      We thank the reviewer for highlighting this point and will improve our discussion to clarify the polarity of the involved motors.

      Planned revision:

      • We will expand our discussion of the motors potentially involved in sliding microtubules when revising the manuscript.

      Stability of Centriole-Derived Microtubules: Microtubules emanating from centrioles are typically young and dynamic. How do they acquire acetylation and stability at an early stage? Do centrioles exhibit active EB1/EB3 comets in Stage 1/2a neurons? If these microtubules are severed from centrioles, could knockdown of MT-severing proteins (e.g., Katanin, Spastin, Fidgetin) alter microtubule polarity during neuronal development? A brief discussion would be valuable.

      We thank the reviewer for raising these interesting questions and suggestions. As suggested, we will include a brief discussion of these issues. What is known about the properties of stable microtubules is limited, so it is currently unclear how they are made. For example, we do not know if they are converted from labile microtubules or nucleated by a distinct pathway. If they are nucleated by a distinct pathway, do these microtubules grow in a similar manner as labile microtubules and do they have EB comets at their plus-ends (given that EB compacts the lattice (Zhang et al., Cell 2015, PNAS 2018) and stable microtubules have an expanded lattice in cells (de Jager et al., JCB 2025))? If they are converted, does something first cap their plus-end to limit further growth (given that EB comets are rarely observed at the ends of stable microtubules (Jansen et al., JCB 2023))?

      We also do not know how the activity of the tubulin acetyltransferase αTAT1 is regulated. Is its access to the microtubule lumen regulated or is its enzymatic activity stimulated by some means (e.g., microtubule lattice conformation or a molecular factor)?

      We find the possibility that microtubule severing enzymes release these stable microtubules from the centrioles very exciting and hope to test the effects of their absence on microtubule polarity in the future. We will discuss this in the manuscript as suggested.

      Planned revision:

      • We will expand our discussion about the centriole-associated stable microtubules in the revised manuscript. Minor Points

      • In Movies 3 and 4, please use arrowheads or pseudo-coloring to highlight microtubules detaching from specific points. In Movie 5, please mark the stable microtubule that rotates within the neurite. These annotations would enhance clarity.

      Planned revision:

      • We will add arrowheads/traces to the movies to enhance clarity.* *

      The title states: 'Stable microtubules predominantly oriented minus-end-out in the minor neurites of Stage 2b and 3 neurons.' However, given that the minus-end-out percentage increases after nocodazole treatment but only reaches a median of 0.48, 'predominantly' may be an overstatement. Please consider rewording.

      We thank the reviewer for catching this mistake and will adjust the statement to better reflect the median value.

      Planned revision:

      • We will reword this statement in the revised text.

      *Please compare the StableMARK system with the K560Rigor-SunTag approach described by Tanenbaum et al. (2014). What are the advantages of StableMARK over the SunTag method? *

      While the SunTag is certainly a powerful tool to visualize molecules at low copy number, we believe that StableMARK is more appropriate than the K560Rigor-SunTag tool for our assays due to two main reasons. Firstly, K560Rigor-SunTag is based on the E236A kinesin-1 mutation, while StableMARK is based on the G234A mutation. These are both rigor mutations of kinesin-1 but behave differently; the E236A mutant is strongly bound to the microtubule in an ATP-like state (neck linker docked), while the G234A mutant is also strongly bound, but not in an ATP-like state (Rice et al., Nature 1999). This means that they may have different effects on or preferences of the microtubule lattice. Indeed, while StableMARK (G234A) has been shown to preferentially bind microtubules with an expanded lattice (Jansen et al., JCB 2023; de Jager et al., JCB 2025), this may not be the case for the E236A mutant. In support of this, it has been shown that, while nucleotide free kinesin-1 can expand the lattice of GDP-microtubules at high concentrations (>10% lattice occupancy) in vitro (Peet et al., Nat Nanotechnol 2018; Shima et al., JCB 2018), kinesin-1 in the ATP-bound state does not maintain this expanded lattice (Shima et al., JCB 2018). Thus, we expect the kinesin-1 rigor used by Tanenbaum et al. (Cell 2014) to not be specific for stable microtubules (with an expanded lattice) in cells. In addition, given the dense packing of microtubules in neurites (not well-established in developing neurites, but with an inter-microtubule distance of ~25 nm in axons and ~65 nm in dendrites (Chen et al., Nature 1992)), the very large size of the SunTag could be problematic. The K560Rigor-SunTag tool from Tanenbaum et al. (Cell 2014) is bound by up to 24 copies of GFP (each ~3 nm in size), meaning that it may obstruct or be obstructed by the dense microtubule network in neurites.

      Planned revision:

      • Given that, unlike the K560Rigor-SunTag construct, StableMARK has been carefully validated as a live-cell marker for stable microtubules, we believe that the above discussion goes beyond the scope of the manuscript.* *

      Microscopy data (Movies 2, 3, and 4) show microtubule bundling with StableMARK labeling, which is absent in tubulin immunostaining. Could this be an artifact of ectopic StableMARK expression? If so, a brief note addressing this potential effect would be beneficial.

      As with any overexpression, there is a risk of artifacts. We feel that in the cells presented, the risk of artifacts is limited because we have chosen neurons expressing StableMARK at very low levels. Prior work has demonstrated that in cells where StableMARK has a speckled appearance on microtubules, it has limited undesired effects on stable microtubules or the cargoes moving along them (Jansen et al., JCB 2023). Perhaps some of the apparent differences in the amount of bundling can be explained in that the expansion microscopy images shown may have less apparent bundling because of the improved z-resolution and thus optical sectioning. Any z-slice imaged using expansion microscopy will contain fewer microtubules, so bundling may be less obvious. If we compare the amount of bundling seen in StableMARK expressing cells with the amount of bundling of acetylated microtubules (a marker for stable microtubules) in DMSO/nocodazole treated (non-electroporated) cells imaged by confocal microscopy in Figure S7, we feel that the difference is not so large. Nonetheless, we can briefly address this potential effect in the text.

      Planned revision:

      • We will improve the transparency of the manuscript by briefly mentioning this in the text. Reviewer #1 (Significance)

      It is an important paper challenging established ideas of microtubule organization in neurons. It is important to the wide audience of cell and neurobiologists.__ __

      Reviewer #2 (Evidence, reproducibility and clarity)

      *The manuscript uses state-of-the-art microscopy (e,g. expansion microscopy, motorPAINT) to observe microtubule organization during early events of differentiation of cultured rat hippocampal neurons. The authors confirm previous work showing that microtubules in neurites and dendrites are of mixed polarity whereas they are of uniform plus-end-out polarity in axons. They show that stable microtubules (labeled with antibody against acetylated tubulin) are located in the central region of neurite cross-section across all differentiation stages. They show that acetylated microtubules are associated with centrioles early in differentiation but less so at later stages. And they show that stable microtubules can move from one neurite to another, presumably by microtubule sliding. *

      Comments

      1. *I found the manuscript difficult to read. There are lots of "segregations" of microtubules occurring over these stages of neuronal differentiation: segregation between the center of a neurite and the outer edge with respect to neurite cross-section, segregation between the region proximal to the cell body and the region distal to the cell body, and segregation over time (stages). The authors don't do a good job of distinguishing these and reporting the major findings in a way that is clear and straightforward. *

      We thank the reviewer for their feedback and will go over the text to make it easier to read. Within neurites, we use the word 'segregated' in the manuscript to mean that the microtubules form two spatially separate populations across the width of the neurites (i.e., their cross-section if viewed in 3D). Because of variability seen in the neurites of this stage, this segregation does not always present as a peripheral vs. central enrichment of the different populations of microtubules as we sometimes observed two side-by-side populations instead. We will make sure that we properly define this in the manuscript to avoid any confusion.

      When discussing other types of segregation, we tried to use different wording such as when discussing the proximal-distal distribution of microtubules with different orientations in axon-like neurites in this excerpt:

      Sometimes these axons and axon-like neurites had a small bundle of minus-end-out microtubules proximal to the soma (Figure S4). This suggests that plus-end-out uniformity emerges distally first in these neurites, perhaps by retrograde sliding of these minus-end-out microtubules (see Discussion).

      When discussing changes related to a particular stage, we instead aimed to list which stage we were talking about, such as seen in the discussion:

      Emerging neurites of early stage 2 neurons already contain microtubules of both orientations and these are typically segregated. These emerging neurites also contain segregated networks of acetylated (stable) and tyrosinated (labile) microtubules. In later stage 2, stage 3, and stage 4 neurons, stable (nocodazole-resistant) microtubules are oriented more minus-end-out compared to the total (untreated) population of microtubules; however, in early stage 2 neurons, stable microtubules are preferentially oriented plus-end-out, likely because their minus-ends are still anchored at the centrioles at this stage. The fraction of anchored stable microtubules decreases during development, while the appearance of short stumps of microtubules attached to the centrioles suggests that these microtubules may be released by severing.

      We appreciate the reviewer's concerns and will review the text carefully for clarity.

      Planned revision:

      • We will carefully go through the text when revising the manuscript to ensure that these distinctions are clear and consider using synonyms or other descriptors where they would enhance clarity.

      *The major focus is on microtubule changes between stages 2a and 2b. This is introduced in the text and in the methods but not reflected in Figure 1A which should serve as an orientation of what is to come. It would be helpful to move the information about stages to the main text and/or Figure 1A. *

      We thank the reviewer for pointing this out and will be more explicit about the distinction between stages 2a and 2b in the main text and make the suggested change to Figure 1A.

      Planned revision:

      • We will incorporate the suggested changes in the revised manuscript.

      For Figure 1, the conclusions are generally supported by the data with the exception of the data for stage 2b in 1D and 1H. The images in D and the line scan in H suggest that for stage 2b, minus-end-out are on one edge whereas the plus-end-out are on the other edge of the neurite cross-section. But this is only true for one region along this example neurite. If the white line in D was moved proximal or distal along the neurite, the line scan for stage 2b would look like those of stages 2a and 3.

      We thank the reviewer for noting this in the figure. For these earlier stages in neuronal development, the distribution of different types of microtubules within the neurite is more variable and does not always adhere to the central-peripheral distribution described for more mature neurons (Tas et al., Neuron 2017). We did not intend to suggest that neurites of stage 2b neurons consistently have a different radial distribution of microtubules of opposite orientation, but rather that microtubules of the same orientation tend to bundle together. Sometimes this bundling produces a central or peripheral enrichment, as described for mature neurons (Tas et al., Neuron 2017) and as seen in Figure 1D-F at certain points along the length of the neurites, and sometimes the bundling simply produces two side-by-side populations. To reflect this diversity, we chose two different examples in the figure. The line scans presented in Figure 1H were taken approximately at the midpoint of the presented ROIs. In addition, as our imaging in this case is two-dimensional, we do not want to make explicit claims about the radial distribution of the different populations of microtubules.

      Planned revision:

      • We will adjust our description of this figure in the main text to be more explicit about how we interpret these results. We will ensure that it is apparent that we do not think there is a specific radial distribution of microtubules depending on the developmental stage.

      *For Figure 2, I found it difficult to relate panels A-F to panels G-J. I recommend combining 2G-J with 3A-B for a separate figure focused on the orientation of stable microtubules across different stages. *

      We thank the reviewer for this suggestion and will take it into consideration when preparing the revised manuscript, making sure that our figure organization is well justified.

      For Figure 3, it is difficult to reconcile the traces with the corresponding images - that is, there are many acetylated microtubules in the top view image that appear to contact centrioles but are not in the tracing. Perhaps the tracings would more accurately reflect the localization of the acetylated microtubules in the top view images if a stack of images was shown rather than the max projections. Or if the authors were to stain for CAMSAPs to identify non-centrosomal microtubules. I find the data unconvincing but I do believe their conclusion because it is consistent with published data in the field. The data need to be quantified.

      We thank the reviewer for noting this. Importantly, the tracing was done on a three-dimensional stack of images, whereas we present maximum projections of a few slices in Figure 3C for easy visualization. Projection artifacts indeed make it look as though some additional microtubules are attached to the centrioles, whereas in the three-dimensional stacks it is apparent that they are not. We can include the z-stacks as supplementary material so that readers can also verify this themselves. We will additionally clarify that this is the case in the text related to Figure 3C.

      Planned revision:

      • We will better explain how the tracing was done in the methods section and make a brief note of the projection artifacts in the main text.
      • We will also include the z-stacks as supplementary data.

      *I have a major concern with the conclusions of Figure 4. Here the authors use StableMARK to argue that microtubules do not depolymerize in one neurite and then repolymerize in another neurite but rather can be moved (presumably by sliding) from one neurite to another. The problem is that StableMARK-decorated microtubules do not depolymerize. So yes, StableMARK-decorated microtubules can move from one neurite to another but that does not say anything about what normally happens to microtubules during neuronal differentiation. In addition, the text says that the focus on Figure 4 is on how microtubules change between stages 2a and 2b but data is only shown for stage 2b. *

      As noted by the reviewer, StableMARK can indeed hyperstabilize microtubules when over-expressed; however, it is important to note that this strongly depends on the level of overexpression of the marker. This is discussed in detail in the paper introducing StableMARK, where it is described that at low expression levels, StableMARK does not alter the stability of microtubules (i.e., StableMARK decorated microtubules can still depolymerize/disassemble and they are disassembled in response to serum starvation), alter their post-translational modification status or their distribution in the cell, or impede the transport of cargoes along them (Jansen et al. JCB 2023). Despite this, we agree that it is important to validate these findings in our experimental system (primary rat hippocampal neurons) and so we plan to perform experiments with photoactivatable tubulin to verify the long lifetime of stable microtubules and aim to also observe microtubule sliding (similar to assays performed in the Gelfand lab (Lu et al., Curr Biol 2013)) in the absence of StableMARK.

      Planned revision:

      • We will confirm our findings using photoactivatable tubulin. We hope to demonstrate the long lifetime of the microtubules in this case and observe the sliding of microtubules by another means.
      • We will also revise the text to better explain the potential impacts of StableMARK and that we chose the lowest expressing cells we could find so early after electroporation.

      *The data are largely descriptive and it is of course important to first describe things before one can dive into mechanism. But most of the findings confirm previous work and new findings are limited to showing that e.g. microtubule segregation appears earlier than previously observed. *

      Our study is the first to use Motor-PAINT to carefully map changes in microtubule orientations during neuronal development. Furthermore, it is the first to use the recently introduced live-cell marker for stable microtubules to directly demonstrate the active polarity reversal of stable microtubules during this process.

      Optional: It would be nice if the authors could investigate some potential mechanisms. For example, does knockdown or knockout of severing enzymes prevent the loss of centriolar microtubules shown in Figure 3? Does knockdown or knockout of kinesin-2 or EB1 prevent the reorientation of microtubules (Chen et al 2014)?

      We agree with the reviewer that these are exciting experiments to perform, and we hope to unravel the mechanisms underlying microtubule reorganization in future work. However, this will require many more experiments, as well as the recruitment and training of a new PhD student or postdoc, given that the first author has left the lab. Therefore, we feel that this falls outside the scope of the current work, which carefully maps the microtubule organization during neuronal development and demonstrates the active polarity reversal of stable microtubules during this process.

      *Overall, the methods are presented in such a way that they can be reproduced. One exception is in the motor paint sample prep section: is it three washes for 1 min each or three washes over 1 min? *

      We thank the reviewer for pointing out this mistake and will adjust this step in the methods section accordingly.

      Planned revision:

      • We will revise the methods section to read 'washed three times for 1 minute each'.

      *No statistical analysis is provided. The spread of the data in the violin plots is very large and it is difficult to ascertain how strongly one should make conclusions based on different data spreads between different conditions. *

      We thank the reviewer for noting this and will add statistical tests to the graphs showing the fraction of minus-end-out microtubules in different stages/conditions.

      Planned revision:

      • We will include statistical tests in the specified graphs.

      For Figure S5, the excluded data (axons and axon-like neurites) should also be shown.

      We thank the reviewer for this suggestion and will include this data.

      Planned revision:

      • We will adjust this supplemental figure to also include the specified data.

      *For the movies, it would be helpful to have the microtubule moving from one neurite to another identified in some way as it is difficult to tell what is going on. *

      We thank the reviewer for pointing this out.

      Planned revision:

      • We will trace the microtubule in this movie to enhance clarity.* * Reviewer #2 (Significance)

      A strength of the study is the state-of-the-art microscopy (e,g. expansion microscopy, motorPAINT) and its application to a classic experimental model (rat hippocampal neurons). The information will be useful to those interested in the details of neuronal differentiation. A limitation of the study is that it appears to mostly confirm previous findings in the field (microtubule segregation, loss of centriolar anchoring, microtubule sliding). The advance to the field is that the manuscript shows that these events occur earlier in differentiation that previously known.

      • *

      Reviewer #3 (Evidence, reproducibility and clarity)

      *The study by Iwanski and colleagues explores the establishment of the specific organisation of the neuronal microtubule cytoskeleton during neuronal differentiation. They use cultures of dissociated primary hippocampal rat neurons as a model system, and apply the optimised motor-PAINT technology, expansion microscopy/immunofluorescence and live cell imaging to investigate the polarity establishment and the distribution of differentially modified microtubules during early development. *

      They show that in young neurons microtubules are of mixed polarity, but at this stage already the stable (acetylated) microtubules are preferentially oriented plus-end-out, and are connected to the centrioles. In later stages, the stable microtubules are released from the centrioles and reverse their orientation by moving around inside the cell body and the neurites.

      *Overall, the conclusions are well supported by the presented data. The experiments are conducted thoroughly, the figures are clearly presented (for minor comments, see below) and the manuscript is well and clearly written. *

      Major comments

      1. What is the proportion of neurons with different types of neurites (axon-like, non-axon-like) in stage 2b? (middle paragraph page 5 and Fig 1E). Please provide a quantification. * How was the quantification in Fig 2B-D-F done? Why do the curves all start at 0? Please provide a scheme explaining these measurements. Furthermore, the data in Fig 2B do not reflect the statement "the segregation (...) was less evident" than in later stages (top of page 6): while it is less evident than in stage 2b, it is extremely similar to stage 3. Please revise accordingly.*

      We thank the reviewer for pointing out these important details. We will make the suggested changes in the text, adding the proportion of neurons with different types of neurites and adjusting statement mentioned.

      The radial intensity distributions were quantified as described in Katrukha et al. (eLife 2021). In the methods section, we describe the process in brief:

      To analyze the radial distribution of acetylated and tyrosinated microtubules in expanded neurites, deconvolved image stacks were processed using custom scripts in ImageJ (v1.54f) and MATLAB (R2024b) as described in detail elsewhere (Katrukha et al., 2021). Briefly, on maximum intensity projections (XY plane), we drew polylines of sufficient thickness (300 px) to segment out neurite portions 44 µm (10 µm when corrected for expansion factor) in length proximal to the cell soma. Using Selection > Straighten on the corresponding z-stacks generated straightened B-spline interpolated stacks of the neurite sections. These z-stacks were then resliced perpendicularly to the neurite axis (YZ-plane) to visualize the neurite cross-section. From this, we could semi-automatically find the boundary of the neurite in each slice using first a bounding rectangle that encompasses the neurite (per slice) and then a smooth closed spline (approximately oval). To build a radial intensity distribution from neurite border to center, closed spline contours were then shrunken pixel by pixel in each YZ-slice while measuring ROI area and integrated fluorescence intensity. From this, we could ascertain the average fluorescence intensity per contour iteration, allowing us to calculate a radial intensity distribution by calculating the radius corresponding to each area (assuming the neurite cross-section is circular).

      The curves thus all start at 0 because no intensity "fits" into a circle of radius 0 and then gradually increase because very few microtubules "fit" into circles with the smallest radii.

      Planned revision:

      • We will revise the text to include the suggested changes and add a brief statement to the methods section to explain why the curves start at 0.* *

      *It should be stressed in the text, that the modification-specific antibodies only detect modified microtubules. Thus, in figure 3, in the absence of total tubulin staining, it is possible that there are more microtubules than revealed with the anti-acetylated tubulin antibody. A possible explanation should be discussed. *

      We thank the reviewer for highlighting this point and will adjust the text accordingly.

      Planned revision:

      • We will clarify this in the revised text by adding the following sentence: In addition, given that we specifically stained for acetylated tubulin (a marker for stable microtubules), it is possible that other non-acetylated and thus perhaps dynamic microtubules are also associated with the centrioles.* *

      *OPTIONAL: As discussed in the manuscript's discussion, testing some of the proposed mechanisms regulating microtubule cytoskeleton architecture in development (motors, crosslinkers, severing enzymes) would significantly increase the impact of this study. Exploring these phenomena in a more complex system (3D culture, brain explants) closer to the intricate character of the brain than the 2D dissociated neurons would be a real game-changer. *

      We agree that sorting out the mechanisms driving microtubule reorganization would be very exciting. However, this will require many more experiments, as well as the recruitment and training of a new PhD student or postdoc, given that the first author has left the lab. Therefore, we feel this falls outside the scope of the current work, which carefully maps the microtubule organization during neuronal development and demonstrates the active polarity reversal of stable microtubules during this process.

      Minor comments

      1. *It could be useful to write on each panel whether the images were obtained with expansion or motor-PAINT technique: the rendering of the figures is very similar, and despite the different colour scheme can be confusing. *

      We thank the reviewer for pointing this out.

      Planned revision:

      • We will incorporate this suggestion when revising our manuscript.

      Reviewer #3 (Significance)

      This manuscript provides insights into the establishment of the microtubule cytoskeleton architecture specific to highly polarised neurons. The imaging techniques used, improved from the ones published before (motor-PAINT: Kapitein lab in 2017, U-ExM: Hamel/Guichard lab in 2019), yield beautiful and convincing data, marking an improvement compared to previous studies.

      *However, the novelty of some of the findings is relatively limited. Indeed, a mixed microtubule orientation in very young neurites has already been shown (Yau et al, 2016, co-authored by Kapitein), as has the separate distribution of acetylated and tyrosinated / stable and labile / plus-end-out and plus-end-in microtubules in dendrites (Tas, ..., Kapitein, 2017). *

      *On the other hand, observation of the live movement of microtubules with the resolution allowing to see single (stable) microtubules is new and important. It provides an exciting setup to explore the mechanisms of polarity reversal of microtubules in neuronal development and it is regrettable that these mechanisms have not been explored further. *

      *The association of (stable) microtubules with the centrioles is also a technically challenging analysis. Despite not being able to visualise all microtubules, but only acetylated ones, these data are novel and exciting. *

      *This work will be of interest for neuronal cell biologists, developmental neurobiologists. The impact would be larger if the mechanistic questions were addressed using these sophisticated methodologies. *

      *This reviewer's expertise is the regulation of the microtubule cytoskeleton and its impact on molecular, cellular and organism levels. *

      • *


    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The study by Iwanski and colleagues explores the establishment of the specific organisation of the neuronal microtubule cytoskeleton during neuronal differentiation. They use cultures of dissociated primary hippocampal rat neurons as a model system, and apply the optimised motor-PAINT technology, expansion microscopy/immunofluorescence and live cell imaging to investigate the polarity establishment and the distribution of differentially modified microtubules during early development. They show that in young neurons microtubules are of mixed polarity, but at this stage already the stable (acetylated) microtubules are preferentially oriented plus-end-out, and are connected to the centrioles. In later stages, the stable microtubules are released from the centrioles and reverse their orientation by moving around inside the cell body and the neurites.

      Major comments:

      • Overall, the conclusions are well supported by the presented data.

      • What is the proportion of neurons with different types of neurites (axon-like, non-axon-like) in stage 2b? (middle paragraph page 5 and Fig 1E). Please provide a quantification. How was the quantification in Fig 2B-D-F done? Why do the curves all start at 0? Please provide a scheme explaining these measurements. Furthermore, the data in Fig 2B do not reflect the statement "the segregation (...) was less evident" than in later stages (top of page 6): while it is less evident than in stage 2b, it is extremely similar to stage 3. Please revise accordingly.

      • It should be stressed in the text, that the modification-specific antibodies only detect modified microtubules. Thus, in figure 3, in the absence of total tubulin staining, it is possible that there are more microtubules than revealed with the anti-acetylated tubulin antibody. A possible explanation should be discussed.

      • OPTIONAL: As discussed in the manuscript's discussion, testing some of the proposed mechanisms regulating microtubule cytoskeleton architecture in development (motors, crosslinkers, severing enzymes) would significantly increase the impact of this study. Exploring these phenomena in a more complex system (3D culture, brain explants) closer to the intricate character of the brain than the 2D dissociated neurons would be a real game-changer.

      Minor comments:

      • The experiments are conducted thoroughly, the figures are clearly presented (for minor comments, see below) and the manuscript is well and clearly written.

      • It could be useful to write on each panel whether the images were obtained with expansion or motor-PAINT technique: the rendering of the figures is very similar, and despite the different colour scheme can be confusing.

      Significance

      • This manuscript provides insights into the establishment of the microtubule cytoskeleton architecture specific to highly polarised neurons. The imaging techniques used, improved from the ones published before (motor-PAINT: Kapitein lab in 2017, U-ExM: Hamel/Guichard lab in 2019), yield beautiful and convincing data, marking an improvement compared to previous studies.

      • However, the novelty of some of the findings is relatively limited. Indeed, a mixed microtubule orientation in very young neurites has already been shown (Yau et al, 2016, co-authored by Kapitein), as has the separate distribution of acetylated and tyrosinated / stable and labile / plus-end-out and plus-end-in microtubules in dendrites (Tas, ..., Kapitein, 2017).

      • On the other hand, observation of the live movement of microtubules with the resolution allowing to see single (stable) microtubules is new and important. It provides an exciting setup to explore the mechanisms of polarity reversal of microtubules in neuronal development and it is regrettable that these mechanisms have not been explored further.

      • The association of (stable) microtubules with the centrioles is also a technically challenging analysis. Despite not being able to visualise all microtubules, but only acetylated ones, these data are novel and exciting.

      • This work will be of interest for neuronal cell biologists, developmental neurobiologists. The impact would be larger if the mechanistic questions were addressed using these sophisticated methodologies.

      • This reviewer's expertise is the regulation of the microtubule cytoskeleton and it's impact on molecular, cellular and organism levels.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript uses state-of-the-art microscopy (e,g. expansion microscopy, motorPAINT) to observe microtubule organization during early events of differentiation of cultured rat hippocampal neurons. The authors confirm previous work showing that microtubules in neurites and dendrites are of mixed polarity whereas they are of uniform plus-end-out polarity in axons. They show that stable microtubules (labeled with antibody against acetylated tubulin) are located in the central region of neurite cross-section across all differentiation stages. They show that acetylated microtubules are associated with centrioles early in differentiation but less so at later stages. And they show that stable microtubules can move from one neurite to another, presumably by microtubule sliding.

      Comments:

      • I found the manuscript difficult to read. There are lots of "segregations" of microtubules occurring over these stages of neuronal differentiation: segregation between the center of a neurite and the outer edge with respect to neurite cross-section, segregation between the region proximal to the cell body and the region distal to the cell body, and segregation over time (stages). The authors don't do a good job of distinguishing these and reporting the major findings in a way that is clear and straightforward.

      • The major focus is on microtubule changes between stages 2a and 2b. This is introduced in the text and in the methods but not reflected in Figure 1A which should serve as an orientation of what is to come. It would be helpful to move the information about stages to the main text and/or Figure 1A.

      • For Figure 1, the conclusions are generally supported by the data with the exception of the data for stage 2b in 1D and 1H. The images in D and the line scan in H suggest that for stage 2b, minus-end-out are on one edge whereas the plus-end-out are on the other edge of the neurite cross-section. But this is only true for one region along this example neurite. If the white line in D was moved proximal or distal along the neurite, the line scan for stage 2b would look like those of stages 2a and 3.

      • For Figure 2, I found it difficult to relate panels A-F to panels G-J. I recommend combining 2G-J with 3A-B for a separate figure focused on the orientation of stable microtubules across different stages.

      • For Figure 3, it is difficult to reconcile the traces with the corresponding images - that is, there are many acetylated microtubules in the top view image that appear to contact centrioles but are not in the tracing. Perhaps the tracings would more accurately reflect the localization of the acetylated microtubules in the top view images if a stack of images was shown rather than the max projections. Or if the authors were to stain for CAMSAPs to identify non-centrosomal microtubules. I find the data unconvincing but I do believe their conclusion because it is consistent with published data in the field. The data need to be quantified.

      • I have a major concern with the conclusions of Figure 4. Here the authors use StableMARK to argue that microtubules do not depolymerize in one neurite and then repolymerize in another neurite but rather can be moved (presumably by sliding) from one neurite to another. The problem is that StableMARK-decorated microtubules do not depolymerize. So yes, StableMARK-decorated microtubules can move from one neurite to another but that does not say anything about what normally happens to microtubules during neuronal differentiation. In addition, the text says that the focus on Figure 4 is on how microtubules change between stages 2a and 2b but data is only shown for stage 2b.

      • The data are largely descriptive and it is of course important to first describe things before one can dive into mechanism. But most of the findings confirm previous work and new findings are limited to showing that e.g. microtubule segregation appears earlier than previously observed.

      • Optional: It would be nice if the authors could investigate some potential mechanisms. For example, does knockdown or knockout of severing enzymes prevent the loss of centriolar microtubules shown in Figure 3? Does knockdown or knockout of kinesin-2 or EB1 prevent the reorientation of microtubules (Chen et al 2014)?

      • Overall, the methods are presented in such a way that they can be reproduced. One exception is in the motor paint sample prep section: is it three washes for 1 min each or three washes over 1 min?

      • No statistical analysis is provided. The spread of the data in the violin plots is very large and it is difficult to ascertain how strongly one should make conclusions based on different data spreads between different conditions.

      • For Figure S5, the excluded data (axons and axon-like neurites) should also be shown.

      • For the movies, it would be helpful to have the microtubule moving from one neurite to another identified in some way as it is difficult to tell what is going on.

      Significance

      A strength of the study is the state-of-the-art microscopy (e,g. expansion microscopy, motorPAINT) and its application to a classic experimental model (rat hippocampal neurons). The information will be useful to those interested in the details of neuronal differentiation. A limitation of the study is that it appears to mostly confirm previous findings in the field (microtubule segregation, loss of centriolar anchoring, microtubule sliding). The advance to the field is that the manuscript shows that these events occur earlier in differentiation that previously known.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study examines the reorganization of the microtubule (MT) cytoskeleton during early neuronal development, specifically focusing on the establishment of axonal and dendritic polarity. Utilizing advanced microscopy techniques, the authors demonstrate that stable microtubules in early neurites initially exhibit a plus-end-out orientation, attributed to their connection with centrioles. Subsequently, these microtubules are released and undergo sliding, resulting in a mixed-polarity orientation in early neurites. Furthermore, the study elegantly illustrates the spatial segregation of microtubules in dendrites based on polarity and stability. The experiments are rigorously executed, and the microscopy data are presented with exceptional clarity. The following are my primary concerns that warrant further consideration by the authors.

      1. Potential Bias in the MotorPAINT Assay: Kinesin-1 and kinesin-3 motors exhibit distinct preferences for post-translationally modified (PTM) microtubules. Given that kinesin-1 preferentially binds to acetylated microtubules over tyrosinated microtubules in the MotorPAINT assay, the potential for bias in the results arises. Have the authors explored the use of kinesin-3, which favors tyrosinated microtubules, to corroborate the observed microtubule polarity?

      2. Axon-Like Neurites in Stage 2b Neurons: The observation of axon-like neurites in Stage 2b neurons, characterized by an (almost) uniformly plus-end-out microtubule organization, is noteworthy. Have the authors confirmed this polarity using end-binding (EB) protein tracking (e.g., EB1, EB3) in Stage 2b neurons? Do these neurites display distinct morphological features, such as variations in width? Furthermore, do they consistently differentiate into axons when tracked over time using live-cell EB imaging, rather than the MotorPAINT assay? Could stable microtubule anchoring impede free sliding in these neurites or restrict sliding into them? Investigating microtubule sliding dynamics in these axon-like neurites would provide valuable insights.

      3. Taxol and Microtubule Sliding: Taxol-induced microtubule stabilization is known to induce the formation of multiple axons. Does taxol treatment diminish microtubule sliding and prevent polarity reversal in minor neurites, thereby facilitating their development into axons?

      4. Sorting of Minus-End-Out Microtubules (MTs) in Developing Axons: Traces of minus-end-out MTs are observed proximal to the soma in both Stage 2b axon-like neurites and Stage 3 developing axons (Figure S4). Does this indicate a clearance mechanism for misoriented MTs during development? If so, is this sorting mechanism specific to axons? Could dynein be involved? Pharmacological inhibition of dynein (e.g., ciliobrevin-D or dynarrestin) could assess whether blocking dynein disrupts uniform MT polarity and axon formation.

      5. Impact of Kinesin-1 Rigor Mutants on MT Polarity and Dynamics: Would the expression of kinesin-1 rigor mutants alter MT dynamics and polarity? Validation with alternative methods, such as microtubule photoconversion, would be beneficial.

      6. Molecular Motors Driving MT Sliding: Which specific motors drive MT sliding in the soma and neurites? If a motor drives minus-end-out MTs into neurites, it must be plus-end-directed. The discussion should clarify the polarity of the involved motors to strengthen the conclusions.

      7. Stability of Centriole-Derived Microtubules: Microtubules emanating from centrioles are typically young and dynamic. How do they acquire acetylation and stability at an early stage? Do centrioles exhibit active EB1/EB3 comets in Stage 1/2a neurons? If these microtubules are severed from centrioles, could knockdown of MT-severing proteins (e.g., Katanin, Spastin, Fidgetin) alter microtubule polarity during neuronal development? A brief discussion would be valuable.

      Minor Points:

      1. In Movies 3 and 4, please use arrowheads or pseudo-coloring to highlight microtubules detaching from specific points. In Movie 5, please mark the stable microtubule that rotates within the neurite

      2. In Movies 3 and 4, please use arrowheads or pseudo-coloring to highlight microtubules detaching from specific points. In Movie 5, mark the stable microtubule that rotates within the same neurite and the microtubule that exits and enters another neurite in the opposite orientation. These annotations would enhance clarity."

      3. The title states: 'Stable microtubules predominantly oriented minus-end-out in the minor neurites of Stage 2b and 3 neurons.' However, given that the minus-end-out percentage increases after nocodazole treatment but only reaches a median of 0.48, 'predominantly' may be an overstatement. Please consider rewording.

      4. Please compare the StableMARK system with the K560Rigor-SunTag approach described by Tanenbaum et al. (2014). What are the advantages of StableMARK over the SunTag method?

      5. Microscopy data (Movies 2, 3, and 4) show microtubule bundling with StableMARK labeling, which is absent in tubulin immunostaining. Could this be an artifact of ectopic StableMARK expression? If so, a brief note addressing this potential effect would be beneficial.

      Significance

      It is an important paper challenging established ideas of microtubule organization in neurons. It is important to the wide audience of cell and neurobiologists.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to warmly thank all the reviewers for their helpful and fair comments which will increase the quality of our manuscript.

      We would like to inform the reviewers that changes have been made concerning the Figures numbers as follows :

      Figure number in old version

      Figure number in revised manuscript

      1B

      S1C

      S1C

      S1D

      1C

      S2A

      S1D

      S2B

      S1E

      S2C

      1D

      1B

      S2

      S3

      S3

      S4

      S4

      S5

      1. Description of the planned revision

      Reviewer #1

      Major comments 3) Upon food supplementation with 20E the authors could not measure a significant effect on systemic growth or midgut maturation (Fig. S3), whereas the dose of 20E they fed (20µg/ml) was already much higher than endogenous 20E level they measured in the midgut (Fig. 2B).

      We thank the reviewer #1 for this comment.

      Fig. S3 is now Fig. S4

      First, the concentration of 20µg/mL is the final concentration in the fly food and is different from the levels of 20HE we measured in the organs and in the haemolymph, due to the different cell absorption and degradation of the product.

      This concentration of 20µg/mL corresponds to a molar concentration of approximately 0.04mM which is less than the common concentration of 20HE used in the literature in the food (1mM).

      Tiffany V. Roach, Kari F. Lenhart; Mating-induced Ecdysone in the testis disrupts soma-germline contacts and stem cell cytokinesis. Development 1 June 2024; 151 (11): dev202542. doi: https://doi.org/10.1242/dev.202542

      Ahmed, S.M.H., Maldera, J.A., Krunic, D. et al. Fitness trade-offs incurred by ovary-to-gut steroid signalling in Drosophila. Nature 584, 415-419 (2020). https://doi.org/10.1038/s41586-020-2462-y

      The authors should consider to feed larvae with RH5849 (Dr. Ehrenstorfer), which is an insecticide functioning as an ecdysone agonist and was designed for high stability (Wing et al, 1988). RH5849 was already successfully fed to adult Drosophila to investigate the impact of Ecdysone signalling on the adult midgut (Neophytou et al, 2023; Zipper et al, 2025; Zipper et al, 2020) and elicits 20E response. Furthermore, uptake of RH5849 is not limited by the levels of EcI.

      We thank the reviewer #1 for this comment. We ordered that compound and the experiment should be performed in July since the sending date is expected in late June.

      8) The authors should include a discussion of how Ecdysone signalling in postmitotic EC is regulating midgut size, which may include recent data from Edgar and Reiff labs (Ahmed et al, 2020; Zipper et al., 2025; Zipper et al., 2020).

      We thank the reviewer #1 for this comment. We would like to target a format of report for the journal, thus there are some constraints about the number of words. Of course, if the editor allows us to bypass that limit, we would be delighted to cite and discuss these papers.

      9) There are several recent publications showing a role for gut microbiota in regulating oestrogen metabolism in humans, and implications in oestrogen-related diseases such as endometriosis (Baker et al, 2017; Xholli et al, 2023). More precisely bacteria including Lactobacilli strains produce gut microbial β-glucuronidase enzymes, which reactivate oestrogens (Ervin et al, 2019; Hu et al, 2023). As Drosophila ecdysone is the functional equivalent of mammalian oestrogens (Aranda & Pascual, 2001; Martinez et al, 1991; Oberdörster et al, 2001) these publications should be discussed by the authors.

      We thank the reviewer #1 for this comment. We would like to target a format of report for the journal, thus there are some constraints about the number of words. Also, the topics of these papers seem a little bit out of the scope of our manuscript which is focused on the microbiota impact on midgut growth.

      Reviewer #2 Minor Comments

      Figure S2: columns A and B are box plots, while columns C and D are columns with error bars. Presentation of quantitative data should be uniform and ideally as box plots throughout.

      The authors thank the reviewer #2 for this advice and the figure will be further revised.

      Fig. S2 is now Fig. S3


      __Reviewer #3 __

      Major comments:

      The study relies on loss-of-function experiments to manipulate ecdysone signaling; gain-of-function experiments would provide an informative complement. Does feeding ecdysone phenocopy Lp association in GF larvae? Would ecdysone feeding have an additive effect with Lp association? Given the pleiotropic effects of ecdysone on larval phenotypes, a more targeted approach could be used to overexpress transgenes to augment ecdysone signaling.

      We thank the reviewer #3 for this comment. This thought is shared with reviewer #1 and this experiment will be repeated with RH5849. The results are expected in July.

      Minor comments:

      1. For gut and carcass length analysis, the EcR-RNAi and shd-RNAi conditions look slightly smaller in both GF and Lp conditions. Is there a genetic background effect on larval size? It would be helpful to calculate the interaction score between genotype and microbiome status via a 2-way ANOVA with post hoc tests.

      The authors thank the reviewer #3 for this comment. We will further analyse statistically that differences.


      6) In Fig. 3 the authors added the values for numbers of biological replica within the graphs. In Fig. 4 M-P they added the values for number of technical replicas. They should apply adding these two types of values to all graphs and I would suggest to make the difference between biological replica 'n' and technical replica 'N' obvious in the figure.

      The authors thank the reviewer #3 for this comment. We will modify these numbers in the Figures and/or we will clarify these numbers in the legends to not overwrite the Figures.


      The scope of the bibliography seems limited in scope. As one example, Shin et al., 2011 seems quite relevant for this study.

      We thank the reviewer #1 for this comment. We would like to target a format of report for the journal, thus there are some constraints about the number of words. Of course, if the editor allows us to bypass that limit we would be delighted to cite and discuss this paper.

      • *

      2. Description of the revisions that have already been incorporated in the transferred manuscript

      All changes are visible in red in the text of the revised manuscript.

      __Reviewer #1 __

      __Major remarks __

      1) In Fig.2 E - G there is a remarkable difference between controls in D compared to F and E compared to G. The difference between the controls in E and G is stronger than the shown significant difference of EcRRNAi to the control in E. How do the authors explain such a difference of the two (basically equal) controls and the high variance in control values shown in G?

      We thank the reviewer #1 for this comment. As mentioned in the material and methods, the controls are different due to the different RNAi construct. Thus, this can generate variability in such type of developmental experiment.

      Line 253: "UAS-EcRRNAi (BDSC 9327), UAS-dsmCherryRNAI (BDSC 35785), UAS-shadeRNAi (VDRC 108911), and respective RNAi control lines (KK60101)."

      Are the comparisons of control and EcRRNAi shown in D significantly different?

      As mentioned in the figure panel, the EcRRNAi GF and control GF are significantly different and this is discussed in the text as follows in Line 154: "This phenomenon could be explained by genetic background and/or by additional deleterious effect of germ-freeness, as well as a putative contribution of EcR to intestinal functions that are important for systemic growth independently of the contributions of microbiota to adaptive growth."


      4) Lines 167-169: the authors state that 'Size-matched Lp associated larvae, controlRNAi or EcRRNAi, show longer midguts than their relative GF condition (Fig. 3A, B)', but there are no significant statistics shown for this comparison in Fig. 3A, B.

      We thank the reviewer #1 for this comment and we agree that the sentence can be misleading. Thus, we reformulated it as : "Size-matched Lp-associated EcRRNAi larvae show longer midguts than their relative GF controls (Fig. 3A, B)."

      10) Fig. S4 is not mentioned at all in the manuscript.

      We thank the reviewer #1 for this comment and we added the reference to the supplementary Figure 4, now Figure S5 on Line 202 : "In the anterior part, the cells and nuclei are bigger in Lp-associated than GF animals (Fig. 4M-N, Fig.S5). For the posterior part, the cell area was significantly increased in Lp- monoassociated animals compared to GF cell while no change was shown for the nucleus area (Fig. 4O-P, Fig.S5)."

      Minor comments: • The authors are inconsistent in indicating their experimental groups. One example is Fig. S3: In A and B they write the GF groups non-italic, whereas the L.p. groups are written italic. In C - E they only partially write the L.p. groups italic. Furthermore, in A, C - E they write 'L.p.', whereas its written 'Lp' and missing the 'WJL' in B.

      We thank the reviewer #1 for this comment and we corrected that mistake in Fig. S3.

      Fig. S3 is now Fig. S4

      • Line 52: The last 'i' in 'Lactobacilli' is not italic.

      We thank the reviewer #1 for this comment and we corrected that mistake. • Line 122: Spelling error in 'Surpringsinly'

      We thank the reviewer #1 for this comment and we corrected that mistake. • Line 151: Spelling error in 'progenies'. Needs to read 'progeny'.

      We thank the reviewer #1 for this comment and we corrected that mistake. • Lines 231-235: Last part of the sentence is repetitive

      We thank the reviewer #1 for this comment and we corrected that mistake as "Our work paves the way to deciphering the signals delivered by the bacteria that are sensed at the host cellular level and to understand how this microbe-mediated Ecd-dependent midgut growth contributes to the Drosophila larval growth upon malnutrition."

      Reviewer #2 Minor Comments 1. Figure 1 is interesting but challenging to follow. The fonts are very small and challenging to read. Pink on blue background is particularly hard to read and doesn't seem necessary. As the entire manuscript follows from data in Figure 1, I would encourage the authors to revise it with a vie3w to making the results more accessible.

      The authors thank the reviewer #2 for this advice and the Figure 1 has been revised.

      Figure 4 is impressive and important for the overall manuscript. The authors should provide representative images to show how they measured cell area and nucleus area.

      The authors thank the reviewer #2.

      How cell area and nucleus area were measured is described in Figure S4. The reference to this supplementary Figure was missing in the initial manuscript and we deeply apologize for that.

      Reviewer #1 also pointed out that the reference of Figure S4 covering that point was missing in the text and we corrected that point.

      I struggled to follow this sentence (line 215): "Also, it will be interesting to test, beyond their shared growth phenotype, whether they respond differently at the mechanistical level to the presence of bacteria in the anterior compartment." I would encourage the authors to consider alternative formulations.

      The authors thank the reviewer #2 and revised that sentence as follows :

      "Also, it will be interesting to investigate whether the midgut comprises sub-populations of enterocytes that differ in their physiological functions. Indeed, these sub-populations could be differently distributed along the midgut and be localized on anterior and/or posterior parts. Thus, they could present varied responses to the presence of the bacteria."

      __Reviewer #3 __

      Major comments

      Figure 4 title is misleading. No manipulations of ecdysone signaling are performed to demonstrate whether scaling relationships across tissues differ depending on ecdysone. The same experiment should be performed using mex>EcR-RNAi larvae and/or mex>shd-RNAi larvae.

      We thank the reviewer #3 for this comment.

      We agree with the reviewer and the title has been changed as follows and mentioned in red in the manuscript : Midgut-specific adaptive growth promoted by Lp in Drosophila larvae.


      Minor comments:

      It is notable that mex>EcR-RNAi in germ-free larvae exacerbates developmental delay. A possible interpretation is that ecdysone signaling in the germ-free context promotes increased growth rate. Could the authors comment?

      We thank reviewer #3 for this comment.

      Since we described a local effect at the intestine level for Ecd it is unlikely but not totally excluded that intestinal Ecd promotes systemic growth.

      Our comments are here in the text :

      "This phenomenon could be explained by genetic background and/or by additional deleterious effect of germ-freeness, as well as a putative contribution of EcR to intestinal functions that are important for systemic growth independently of the contributions of microbiota to adaptive growth."

      Experimental variation is substantial between the control conditions of the EcR and Shd knockdown experiments; median control + Lp D50 in the EcR experiment is ~6 days whereas in the shade experiment it is ~9 days. Can the authors comment on this between-experiment variation, which seems substantial (similar to the effect size between control + Lp and control GF)?

      We thank reviewer #3 for this comment which was also highlighted by the reviewer #1 and we answered as follows :

      As mentioned in the material in methods, the controls are different due to the different RNAi construct. Thus, this can generate variability in such type of developmental experiment.

      Line 253: "UAS-EcRRNAi (BDSC 9327), UAS-dsmCherryRNAI (BDSC 35785), UAS-shadeRNAi (VDRC 108911), and respective RNAi control lines (KK60101)."

      As mentioned in the figure panel, the EcRRNAi GF and control GF are significantly different and this is discussed in the text as follows in Line 154: "This phenomenon could be explained by genetic background and/or by additional deleterious effect of germ-freeness, as well as a putative contribution of EcR to intestinal functions that are important for systemic growth independently of the contributions of microbiota to adaptive growth."

      The methods detail an ecdysone feeding protocol that I could not find used in the experiments. Please clarify.

      We thank reviewer #3 for this comment.

      We would like to highlight that this protocol is related to an experiment described in Fig. S3 (now Fig.S4) and that supplementary Figure was cited here in the text of the manuscript Line 179 as follows "While the systemic growth of animals is not affected by addition of 20E, a slight trend to faster midgut maturation of GF larvae is observed through the measurements of longer guts (Fig. S4)."

      Also, in supplementary data :

      Fig. S3 : Feeding larvae with 20E does not impact the gut growth.

      (A-B) Addition of 20E has no impact on larval developmental timing (DT) and their D50. From size-matched animals (C), Lp promotes intestinal growth compare to GF (D) but no significant difference is shown in the gut/carcass ratio (E). Animals receiving 20E are represented with color filled circles +Lp (blue), GF (black) and controls without 20E supplementation with empty circles.

      The manuscript would benefit from additional proofreading. The text contains spelling errors throughout. The in-text reference formatting is inconsistent. Figure legends could be improved to better describe the data.

      We thank reviewer #3 for this comment and following the different reviewers comments we improved the manuscript in that way.

      3. Description of analyses that authors prefer not to carry out

      Reviewer #1

      __Major remarks __ 2) The authors should consider investigating an EcIRNAi in addition to EcRRNAi. EcR functions as activator, but also as suppressor in the absence of Ecdysone and a EcRRNAi suppresses both functions of EcR. By knocking down EcI the authors would prevent uptake of Ecdysone and thus interfere only with the ligand-induced activating function of EcR.

      We thank reviewer #1 for this comment.

      This experiment has been performed using EcI RNAi but not shown here because in our hands the genetic tool was not efficient (RNA interference does not work effectively) and thus the experiment was not conclusive.

      No phenotype was observed in our study (see Figure attached). Also, the others Oatp family members were tested for their expression in midgut and were found close to null expression.

      5) Why are the authors comparing the carcass length of GF shade RNAi with L.p. control in Fig. 3 D?

      We thank reviewer #1 for this comment. For transparency of the results, these statistics were added. Because in these conditions GF larvae were difficult to rise at the same size than their relative Lp monoassociated. Hence, the carcass length was used to normalize the data.

      7) In Fig. S3C the authors compared L.p. WJL 20E with the GF EtOH control, where is the comparison to the corresponding L.p. WJL EtOH control? The L.p. WJL EtOH control is compared to GF 20E instead.

      We thank reviewer #1 for this comment that will help to clarify our experiment.

      Fig. S3 is now Fig. S4

      For the Fig. S4C, it is a larval size that allows to compare sizes in all conditions independently. That explains that statistics are shown between all conditions. To not overload the Figure the p values not different are not mentioned.

      Reviewer #2 Minor Comments 3. Figure S3 confuses me. It seems that addition of 20E to GF larvae leads to a significant reduction of larval size, and that mono-association with Lp also significantly shortens larval size. Data in Figure 4G suggest that Lp should not affect larval body length relative to GF larvae. Can the authors explain the apparent discrepancy?

      The authors thank the reviewer #2 for this question. Fig. S3 is now Fig. S4.

      This difference could be explained as follows :

      • The developmental experiment in Fig. S3B shows no difference between the two GF conditions. Thus, at the end of the is larval development, systemic growth is similar in both conditions.

      Because performed earlier during development, the larval size experiment shows higher variability in measurements of larval size. Moreover, less larvae are present in the GF 20E condition that could explained that difference.

      • We have previously shown that Lp mono-associated larvae grow faster than GF. Thus, to collect size-matched larvae on the same day, GF or Lp animals come from a different initial day of experiment. Due to biological variability, some differences in timing could be observed between GF and Lp animals.

      Reviewer #3

      Major comments

      1. The authors conclude that intestinal ecdysone signals are not required for Lp-promoted systemic growth. However, their data shows that circulating 20E titer increases in an Lp-dependent manner, and this circulating 20E presumably affects multiple tissues throughout the organism. Since EcR is broadly expressed, can the authors examine how EcR knockdown in other tissues influences systemic growth in Lp-associated larvae? Fat body-specific EcR knockdown seems particularly of interest here given the established relationship between fat body ecdysone signaling and growth (Delanoue et al., 2010). This additional analysis would help clarify whether ecdysone signaling in non-intestinal tissues mediates the Lp-associated growth phenotype.

      We thank reviewer #3 for this comment that will help to clarify our manuscript.

      We would like to emphasize that we never mention in this manuscript that intestinal ecdysone signals are not required for systemic growth. Nevertheless, we highlighted that it is required for Lp-related midgut growth and not rate limiting for Lp-promoted systemic growth:

      Line 179 : "While the systemic growth of animals is not affected by addition of 20E, a slight trend to faster midgut maturation of GF larvae is observed through the measurements of longer guts (Fig. S3). Thus, the intestinal Ecd signaling is required for the midgut growth effect mediated by Lp in a context of malnutrition."

      Line 227: "Specifically, intestinal Ecd signaling is not rate-limiting for Lp-mediated adaptive growth."

      While it will be very interesting to study the effects of Ecd modulation from Fat Body, we feel this is out of the scope of our manuscript that focused on the Lp-based intestinal growth.

      The experimental design compares larvae associated with live Lp versus germ-free larvae provided sterile PBS. Since Lp cells constitute a potential nutrient source for developing larvae, it's unclear whether gene expression differences arise from larvae digesting Lp cells as a nutrient source or from active, microbe-host signaling interactions. To conclusively address this ambiguity, the authors should perform RNA-seq on larvae inoculated with live versus heat-killed Lp. Alternatively, qPCR could be used to provide evidence for the extent to which changes in ecdysone-related gene expression specifically require live Lp.

      We thank reviewer #3 for this comment.

      We (the lab) previously showed that the systemic growth phenotype is supported by bacteria during development and that bacteria have to be alive to support optimal effects (Storelli et al 2018, PMID: 29290388; Consuegra et al 2020a, PMID: 32196485; Consuegra et al 2020b, PMID: 32563155). This topic of bacteria viability has also been directly addressed independently by colleagues and reported recently (da Silva Soares NF, PMID: 37488173). Hence, we did not design our RNAseq with inactivated bacteria. However, if the editor believes this is essential to provide qPCR results on Ecd-related gene expression in live vs inactivated bacteria associations, we shall provide them but at this stage we believe this notion is not core to our message.

      Shade is expressed in the larval midgut, however the larval fat body is thought to be a major site of 20E to 20HE conversion. Can the authors test how Shd knockdown in the fat body affects systemic growth in the Lp-associated condition?

      We thank reviewer #3 for this comment. Nevertheless, we think this is out of the scope of our manuscript that focused on the Lp-based intestinal growth.

      In the knockdown experiments, body size is not measured for larvae/pupae. Given that ecdysone signaling impacts pupal volume (Delanoue et al., 2010) and controls metamorphosis timing, D50 plots by pupal volume would be informative to give a rough estimate of growth rate. For example, do germ-free EcR-RNAi larvae, which develop slower, have an equivalent body size to germ-free control larvae?

      We thank the reviewer #3 for this comment.

      All experiments were done with size-matched larvae because the aim of this manuscript is to detail what is the impact of Lp on the relative midgut vs systemic growth. Hence, we are using animals of similar systemic size to study their midgut size and identify allometry changes (midgut/larval size ratios) at a similar developmental point, which is same larval systemic growth (here L3). Thus, we feel that focusing on growth rates and systemic sizes in different genetic conditions, while interesting in general, is out of the scope of the study since we focus our study on midgut/larval size allometry.


      __Minor comments __

      The number of pupae in the EcR-RNAi and shd-RNAi experiments (Fig 2D, F) differ. Were larval densities controlled during development?

      I could not find this mentioned in the methods, and it is an important control parameter as larval density impacts developmental growth. Presenting this data as % viability of a known number of larvae deposited in food would be preferable.

      We thank the reviewer #3 for this comment.

      As mentioned in the material and methods, 40 eggs from axenic animals were deposited on each tube. It is true that the final number of pupae is different and could come from differential viability of the genetic backgrounds used. It would be difficult to follow from the same tube the larval development because of the manipulation of gnotobiotics animals. Nevertheless, in all experiments more than 25% of initial eggs deposited in tubes emerged as adults.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Bai and colleagues present a detailed characterization of the larval midgut transcriptome, comparing size-matched germ-free and Lactobacillus plantarum (Lp)-associated L3 larvae. The authors identify Lp-mediated changes in ecdysone signaling genes, show increased ecdysone titer in Lp-associated larval hemolymph, and explore the functional role of intestinal ecdysone signaling and Lp in gut-specific vs systemic growth. This work offers insights into microbiome-intestinal EcR signaling that will attract broad interest in the Drosophila community.

      Major comments:

      1. The authors conclude that intestinal ecdysone signals are not required for Lp-promoted systemic growth. However, their data shows that circulating 20E titer increases in an Lp-dependent manner, and this circulating 20E presumably affects multiple tissues throughout the organism. Since EcR is broadly expressed, can the authors examine how EcR knockdown in other tissues influences systemic growth in Lp-associated larvae? Fat body-specific EcR knockdown seems particularly of interest here given the established relationship between fat body ecdysone signaling and growth (Delanoue et al., 2010). This additional analysis would help clarify whether ecdysone signaling in non-intestinal tissues mediates the Lp-associated growth phenotype.
      2. The experimental design compares larvae associated with live Lp versus germ-free larvae provided sterile PBS. Since Lp cells constitute a potential nutrient source for developing larvae, it's unclear whether gene expression differences arise from larvae digesting Lp cells as a nutrient source or from active, microbe-host signaling interactions. To conclusively address this ambiguity, the authors should perform RNA-seq on larvae inoculated with live versus heat-killed Lp. Alternatively, qPCR could be used to provide evidence for the extent to which changes in ecdysone-related gene expression specifically require live Lp.
      3. Shade is expressed in the larval midgut, however the larval fat body is thought to be a major site of 20E to 20HE conversion. Can the authors test how Shd knockdown in the fat body affects systemic growth in the Lp-associated condition?
      4. In the knockdown experiments, body size is not measured for larvae/pupae. Given that ecdysone signaling impacts pupal volume (Delanoue et al., 2010) and controls metamorphosis timing, D50 plots by pupal volume would be informative to give a rough estimate of growth rate. For example, do germ-free EcR-RNAi larvae, which develop slower, have an equivalent body size to germ-free control larvae?
      5. Figure 4 title is misleading. No manipulations of ecdysone signaling are performed to demonstrate whether scaling relationships across tissues differ depending on ecdysone. The same experiment should be performed using mex>EcR-RNAi larvae and/or mex>shd-RNAi larvae.
      6. The study relies on loss-of-function experiments to manipulate ecdysone signaling; gain-of-function experiments would provide an informative complement. Does feeding ecdysone phenocopy Lp association in GF larvae? Would ecdysone feeding have an additive effect with Lp association? Given the pleiotropic effects of ecdysone on larval phenotypes, a more targeted approach could be used to overexpress transgenes to augment ecdysone signaling.

      Minor comments:

      1. For gut and carcass length analysis, the EcR-RNAi and shd-RNAi conditions look slightly smaller in both GF and Lp conditions. Is there a genetic background effect on larval size? It would be helpful to calculate the interaction score between genotype and microbiome status via a 2-way ANOVA with post hoc tests.
      2. It is notable that mex>EcR-RNAi in germ-free larvae exacerbates developmental delay. A possible interpretation is that ecdysone signaling in the germ-free context promotes increased growth rate. Could the authors comment?
      3. The number of pupae in the EcR-RNAi and shd-RNAi experiments (Fig 2D, F) differ. Were larval densities controlled during development? I could not find this mentioned in the methods, and it is an important control parameter as larval density impacts developmental growth. Presenting this data as % viability of a known number of larvae deposited in food would be preferable.
      4. Experimental variation is substantial between the control conditions of the EcR and Shd knockdown experiments; median control + Lp D50 in the EcR experiment is ~6 days whereas in the shade experiment it is ~9 days. Can the authors comment on this between-experiment variation, which seems substantial (similar to the effect size between control + Lp and control GF)?
      5. The methods detail an ecdysone feeding protocol that I could not find used in the experiments. Please clarify.
      6. Figure S4 is not called out in the text.
      7. The scope of the bibliography seems limited in scope. As one example, Shin et al., 2011 seems quite relevant for this study.
      8. The manuscript would benefit from additional proofreading. The text contains spelling errors throughout. The in-text reference formatting is inconsistent. Figure legends could be improved to better describe the data.

      Significance

      The authors identify Lp-mediated changes in ecdysone signaling genes, show increased ecdysone titer in Lp-associated larval hemolymph, and explore the functional role of intestinal ecdysone signaling and Lp in gut-specific vs systemic growth. This work offers insights into microbiome-intestinal EcR signaling that will attract broad interest in the Drosophila community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Gut-resident bacteria promote growth and development in the host intestine. With remarkable genetic accessibility and broad physiological relevance, flies are an excellent experimental system to ask how intestinal bacteria modify growth. Previous studies established a role for Lactobacillus planetarium in larval fly growth, particularly during periods of nutrient deprivation. Growth is principally supported by signals from insulin-like peptides and the melting hormone ecdysone. In this manuscript, the authors present transcriptomic data that Lp association has large effects on transcription in larval hosts raised on low-protein diets, including signatures of ecdysone signalling activation. Through a series of follow up studies the authors present evidence that ecdysone signalling is particularly important for midgut growth in Lp-associated larvae raised on low protein food.

      Major Comments

      I have no major comments. The manuscript is well written, easy to follow, and data are interpreted appropriately. Most of my remarks are minor in nature and can be addressed by simple changes to the text or figures.

      Minor Comments

      1. Figure 1 is interesting but challenging to follow. The fonts are very small and challenging to read. Pink on blue background is particularly hard to read and doesn't seem necessary. As the entire manuscript follows from data in Figure 1, I would encourage the authors to revise it with a vie3w to making the results more accessible.
      2. Figure S2: columns A and B are box plots, while columns C and D are columns with error bars. Presentation of quantitative data should be uniform and ideally as box plots throughout.
      3. Figure S3 confuses me. It seems that addition of 20E to GF larvae leads to a significant reduction of larval size, and that mono-association with Lp also significantly shortens larval size. Data in Figure 4G suggest that Lp should not affect larval body length relative to GF larvae. Can the authors explain the apparent discrepancy?
      4. Figure 4 is impressive and important for the overall manuscript. The authors should provide representative images to show how they measured cell area and nucleus area.
      5. I struggled to follow this sentence (line 215): "Also, it will be interesting to test, beyond their shared growth phenotype, whether they respond differently at the mechanistical level to the presence of bacteria in the anterior compartment." I would encourage the authors to consider alternative formulations.

      Significance

      Strengths and limitations

      This is a well written manuscript that adds to our understanding of the effects of intestinal bacteria on host development. Except for labeling in Figure 1, all data are accessibly presented, and accurately interpreted.

      Advance

      The study accurately links gut microbes to endocrinological control of organ growth in the fly. It also advances our understanding of Drosophila intestinal development by showing how ecdysone is important for gut development.

      Audience

      The work is focussed in scope and will likely have greater appeal to groups that work primarily with the Drosophila model. However, the work will likely have a broader appeal to those that study microbial control of development in other animal models.

      My expertise

      This study aligns with my area of expertise.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript entitled 'Drosophila larval gut transcriptome reveals a microbe-mediated intestinal tissue growth via Ecdysone during adaptive growth' identifies endocrine ecdysone signalling to regulate Lactobacilli-mediated adaptive growth during Drosophila larval development in context of malnutrition. The authors performed transcriptomic analysis to pin down genes that are deregulated in Lactiplantibacillus plantarum (L.p.) associated animals compared to germ-free raised animals. Besides the newly highlighted ecdysteroid kinase-like genes the other isolated deregulated candidates will be of interest for the audience specialized on gut microbiomes. Furthermore, by knockdown of the ecdysone receptor and the converting enzyme shade in midgut enterocytes of L.p. associated and GF raised larvae Bai et al. validated a requirement for ecdysone signalling in midgut growth, but not systemic growth of malnourished animals.

      Although the manuscript is mostly well-written and concise, I have major remarks (and suggestions) that the authors need to consider during a major revision process to sustain their claims. In addition, in several occasions the shown graphs do not support the text statements statistically, which I point out in the following major remarks. In addition, an entire Figure is not referred to in the manuscript. Overall, that leaves a bit of a 'premature' impression of the manuscript.

      Major remarks:

      1. In Fig.2 E - G there is a remarkable difference between controls in D compared to F and E compared to G. The difference between the controls in E and G is stronger than the shown significant difference of EcRRNAi to the control in E. How do the authors explain such a difference of the two (basically equal) controls and the high variance in control values shown in G? Are the comparisons of control and EcRRNAi shown in D significantly different?
      2. The authors should consider investigating a EcIRNAi in addition to EcRRNAi. EcR functions as activator, but also as suppressor in the absence of Ecdysone and a EcRRNAi suppresses both functions of EcR. By knocking down EcI the authors would prevent uptake of Ecdysone and thus interfere only with the ligand-induced activating function of EcR.
      3. Upon food supplementation with 20E the authors could not measure a significant effect on systemic growth or midgut maturation (Fig. S3), whereas the dose of 20E they fed (20µg/ml) was already much higher than endogenous 20E level they measured in the midgut (Fig. 2B). The authors should consider to feed larvae with RH5849 (Dr. Ehrenstorfer), which is an insecticide functioning as an ecdysone agonist and was designed for high stability (Wing et al, 1988). RH5849 was already successfully fed to adult Drosophila to investigate the impact of Ecdysone signalling on the adult midgut (Neophytou et al, 2023; Zipper et al, 2025; Zipper et al, 2020) and elicits 20E response. Furthermore, uptake of RH5849 is not limited by the levels of EcI.
      4. Lines 167-169: the authors state that 'Size-matched Lp associated larvae, controlRNAi or EcRRNAi, show longer midguts than their relative GF condition (Fig. 3A, B)', but there are no significant statistics shown for this comparison in Fig. 3A, B.
      5. Why are the authors comparing the carcass length of GF shade RNAi with L.p. control in Fig. 3 D?
      6. In Fig. 3 the authors added the values for numbers of biological replica within the graphs. In Fig. 4 M-P they added the values for number of technical replicas. They should apply adding these two types of values to all graphs and I would suggest to make the difference between biological replica 'n' and technical replica 'N' obvious in the figure.
      7. In Fig. S3C the authors compared L.p. WJL 20E with the GF EtOH control, where is the comparison to the corresponding L.p. WJL EtOH control? The L.p. WJL EtOH control is compared to GF 20E instead.
      8. The authors should include a discussion of how Ecdysone signalling in postmitotic EC is regulating midgut size, which may include recent data from Edgar and Reiff labs (Ahmed et al, 2020; Zipper et al., 2025; Zipper et al., 2020).
      9. There are several recent publications showing a role for gut microbiota in regulating oestrogen metabolism in humans, and implications in oestrogen-related diseases such as endometriosis (Baker et al, 2017; Xholli et al, 2023). More precisely bacteria including Lactobacilli strains produce gut microbial β-glucuronidase enzymes, which reactivate oestrogens (Ervin et al, 2019; Hu et al, 2023). As Drosophila ecdysone is the functional equivalent of mammalian oestrogens (Aranda & Pascual, 2001; Martinez et al, 1991; Oberdörster et al, 2001) these publications should be discussed by the authors.
      10. Fig. S4 is not mentioned at all in the manuscript.

      Minor comments:

      • The authors are inconsistent in indicating their experimental groups. One example is Fig. S3: In A and B they write the GF groups non-italic, whereas the L.p. groups are written italic. In C - E they only partially write the L.p. groups italic. Furthermore, in A, C - E they write 'L.p.', whereas its written 'Lp' and missing the 'WJL' in B.
      • Line 52: The last 'i' in 'Lactobacilli' is not italic.
      • Line 122: Spelling error in 'Surpringsinly'
      • Line 151: Spelling error in 'progenies'. Needs to read 'progeny'.
      • Lines 231-235: Last part of the sentence is repetitive

      References

      Ahmed SMH, Maldera JA, Krunic D, Paiva-Silva GO, Penalva C, Teleman AA, Edgar BA (2020) Fitness trade-offs incurred by ovary-to-gut steroid signalling in Drosophila. Nature 584: 415-419

      Aranda A, Pascual A (2001) Nuclear hormone receptors and gene expression. Physiol Rev 81: 1269-1304

      Baker JM, Al-Nakkash L, Herbst-Kralovetz MM (2017) Estrogen-gut microbiome axis: Physiological and clinical implications. Maturitas 103: 45-53

      Ervin SM, Li H, Lim L, Roberts LR, Liang X, Mani S, Redinbo MR (2019) Gut microbial β-glucuronidases reactivate estrogens as components of the estrobolome that reactivate estrogens. J Biol Chem 294: 18586-18599

      Hu S, Ding Q, Zhang W, Kang M, Ma J, Zhao L (2023) Gut microbial beta-glucuronidase: a vital regulator in female estrogen metabolism. Gut Microbes 15: 2236749

      Martinez E, Givel F, Wahli W (1991) A common ancestor DNA motif for invertebrate and vertebrate hormone response elements. The EMBO journal 10: 263-268

      Neophytou C, Soteriou E, Pitsouli C (2023) The Sterol Transporter Npc2c Controls Intestinal Stem Cell Mitosis and Host-Microbiome Interactions in Drosophila. Metabolites 13

      Oberdörster E, Clay MA, Cottam DM, Wilmot FA, McLachlan JA, Milner MJ (2001) Common phytochemicals are ecdysteroid agonists and antagonists: a possible evolutionary link between vertebrate and invertebrate steroid hormones. J Steroid Biochem Mol Biol 77: 229-238

      Wing KD, Slawecki RA, Carlson GR (1988) RH 5849, a Nonsteroidal Ecdysone Agonist: Effects on Larval Lepidoptera. Science 241: 470-472

      Xholli A, Cremonini F, Perugi I, Londero AP, Cagnacci A (2023) Gut Microbiota and Endometriosis: Exploring the Relationship and Therapeutic Implications. Pharmaceuticals (Basel) 16 Zipper L, Corominas-Murtra B, Reiff T (2025) Steroid hormone-induced wingless ligands tune female intestinal size in Drosophila. Nature Communications 16: 436

      Zipper L, Jassmann D, Burgmer S, Görlich B, Reiff T (2020) Ecdysone steroid hormone remote controls intestinal stem cell fate decisions via the PPARγ-homolog Eip75B in Drosophila. eLife 9

      Significance

      Brief general assessment before a revision: The study provides important new insights into organ versus systemic growth and show that this is regulated by a central steroid hormonal pathway making this study interesting for a broad audience.

  3. Apr 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments and suggestions. Our plans for revisions are first summarized. Below you can find the original reviews and our responses and detailed plans (indicated by "Response").

      Revision plan summary:

      1. Many of the concerns can be addressed by changes in the text and better explanations of how the experiments were done. These changes are detailed in the point-by-point responses.
      2. The reviewers suggested experiments such as ChIP-seq and immunoprecipitation which require collection of a large number of mutants. Since our mutants are sterile, the line needs to be maintained as heterozygotes, from which we can pick out individual mutant worms. Therefore, with the current reagents it is impossible to collect mutants in sufficient quantities for ChIP-seq or IP. We understand that it limits the conclusions that can be drawn.
      3. For some figures, additional quantification of fluorescence signal will be done to show differences between mutant and wild type.
      4. A few experiments will be repeated:
      5. We will repeat the ATPase assays shown on Fig 1 with additional independently prepared and purified protein samples.
      6. Additional replicates will be performed for the few immunofluorescence experiments that were only performed once. Point-by-point responses:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Dosage compensation (DC) in C. elegans involves halving the gene expression from the two hermaphrodite X chromosomes to match the output of the single X in male worms. The key regulator of this repression is a specialized condensin complex, which is defined by a dedicated SMC-4 paralog, termed DPY-27. SMC-4 in other animals is an ATPase that functions as a motor of loop extrusion in cohesion complexes. In their current manuscript, Chawla et al. assessed whether DPY-27 has ATPase function and whether this activity is required for dosage compensation. It had previously been shown that an ATPase-deficient 'EQ' mutant DPY-27 protein interacts with other DC complex members, yet fails to localize to the X. This observation was made with an extra copy of DPY-GFP expressed in addition to the endogenous wildtype protein [Ref 77]. No dominant negative effect was observed. The authors have now engineered the 'EQ' mutation into the endogenous gene locus and genetically generated hetero- and homozygous ATPase mutant worms. Their data suggest that the ATPase activity is required or X-chromosome localization, complex assembly, chromosome compaction as well as enrichment of H4K20me1 on the dampened X chromosome.

      Major comments: 1. ATPase assays, Figure 1.Preparations of individual recombinant proteins may vary significantly and may occasionally show much reduced enzymatic activity. A conclusion about the failure of an ATPase activity should not be concluded from a single preparation, but several protein preps need to be tested, which then serve as 'biological replicates' for the in vitro reaction. Apparently, the ATPase assays shown only involved technical replicates, which is not sufficient.

      Response: We will express and purify additional protein samples and will repeat the assay.

      CRISPR-mediated engineering may lead to unwanted reactions, exemplified by the 'indel' mutation that was recovered in one clone. As a good practice and important control, the sequences of the mutated alleles in the worms should be determined by sequencing of PCR products. Restrictions enzyme cleavage or gel electrophoresis of the PCR products is not sufficient to document the nature of the mutation.

      Response: The sequence of the edit was confirmed by Sanger sequencing. We will make it clear in the text.

      All IF data need to be collected from at least 2 biological replicates, i.e. the experiment must have been carried out independently on two different days. The replicates should deliver consistent results. The number of independent replicates should be mentioned in each figure legend.

      Response: Most of our experiments were performed multiple times. We will indicate the number of replicates in the figure legends. The one or two experiments that were only performed once, will be repeated an additional time.

      The expression levels of wildtype and mutant proteins are concluded from IFM. This is very qualitative; quantitative measurements would strengthen the paper.

      Response: We will quantify fluorescence intensity on our existing images to show differences between mutant and wild type.

      Figure 4B: What are the criteria for classification of the three classes of mutant nuclei? To the uninitiated eye they look very similar. I am a bit worried about the human bias, if such diffuse staining are to be categorized. The two categories of localization need be documented better.

      Response: We will provide more images to show the range of phenotypes and provide a better explanation of how they were classified. We will also try a few ways to quantify “diffuseness” to provide a numerical readout.

      Figure 5: volume of the X chromosome. Related to (5): Apparently, the mask that contains the X chromosome was drawn by hand on each individual nucleus? I find it very difficult to see how the X chromosomal territory would be assessed in the examples shown. I would be good to see a panel of nuclei, in which the masks are visible. I think the analysis should be blinded, in which a researcher not involved in the analysis draws masks on coded nuclei and their classes are only revealed later. The same concern holds for the FISH/IP overlaps or DPY-27/SDC-2 overlaps.

      Response: The masks used were not drawn by hand but were based on fluorescence intensity thresholds. We will make a supplementary figure that shows the masks used for quantification to help clarify how the experiment and quantification were performed.

      For figure 5, age-matched hermaphrodites were analyzed. How was the age determined and what would be the consequence of age-variations? What is the effect of the mutations on development?

      Response: For our staining experiments, we routinely use young adult which we define as 24 hr past larval L4 stage. At this stage, young adults have started laying eggs. We have unpublished data that shows that dosage compensation and chromosome compaction deteriorates with age. To avoid using old worms in our assays, we pick L4 larvae, and then use them for experiments the following day.

      Minor comments: 8. The labeling of p-values as a-f in the figures with the values listed in a supplemental table is not comfortable. The p-values corresponding to the letters should be listed in the corresponding legends.

      Response: p values can be added to the figure or the figure legend (they are currently in supplementary tables).

      How were the concentrations of the ATPase preparations determined? It would help to see a proteins gel in the supplement to assess their purity.

      Response: Concentrations were determined using a spectrometer. We can show protein gels of the preparations as a supplementary figure.

      In figure 1, heterodimers are assumed, but not shown. Do they dimerize under these conditions?

      Response: We can cite papers from others that show heterodimerization in these conditions (for example, Hassler et al, 2019).

      Reviewer #1 (Significance (Required)):

      Significance: The involvement of the ATPase function of DPY-27 was somewhat expected, in light of the earlier findings published in reference 77 using a transgene. The current study confirms and extends these earlier findings. In principle, the genetic experiment presented here is stronger, if documented better.

      Strengths: The study investigates endogenous proteins and measures different phenomena known to be correlated from previous work. The data are internally consistent.

      Limitations: The lack of biological replicates, and unclear procedures of how to draw the IF masks that underlie the conclusions about X chromosome (co)localization and nuclear volume determination render the argument less convincing. For this reviewer, who is not in the C. elegans field, the analysis of mutant phenotypes is difficult to follow. The conclusions are based on only one type of experiment. In reference 77, the X chromosome binding was done by ChIP-seq, clearly a superior, complementary method.

      Response: As explained above, since the strain has to be maintained as a heterozygote, we are unable to collect enough mutants for a ChIP-seq experiment. We can perform and better document the experimental replicates and we can better explain the quantification methods used.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors analyzed the ATPase function of an SMC-4 variant required for dosage compensation in C. elegans. They made a single amino acid mutation that significantly reduced ATPase activity of the protein as shown by in vitro ATP hydrolysis. They showed that the mutation results in the phenotypic consequences of those shown for other DC mutants, including viability assay, immunofluorescence and DNA FISH. These results demonstrate the important role of ATPase activity in transcription repression.

      Major comments: - Are the key conclusions convincing? The key conclusion that DPY-27 has ATPase activity and using a classic mutation that reduces it largely eliminates its function is convincing. The interpretation of the IF experiments to build the model in the final figure requires stronger evidence, as commented below in additional experiment section.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes, as explained below.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The main issue with the current model is that the authors assume that the EQ proteins that they are analyzing is in complex with the rest of the condensin IDC subunits. However, there is no evidence in the paper suggesting that this occurs. The results are consistent with the possibility that a large portion of the DPY27-EQ is not in a complex.

      IP-western experiments comparing the proportion of other subunits pulled down by the wild type versus the EQ mutant (perhaps extract from ~50% EQ containing population could be reached) is needed to understand the incorporation of the EQ mutant in the complex. This is particularly important for the interpretation of the data in Figure 4A, where 70% of the nuclei show diffuse CAPG-1 and DPY-27 EQ. Is this signal due to disassembled subunits diffusing freely, or as depicted in the model figure, bound less stably everywhere? The immunofluorescence results are consistent with both EQ mutation 1) forming a full complex and unstably binding or 2) destabilizing the complex but incompletely assembled complexes sustaining a pool of free EQ detected by the immunofluorescence experiments.

      Response: We agree that to conclusively show interactions, an IP would be necessary. However, as explained above for ChIP, it is not possible to collect enough mutants to make enough protein extract for an IP. An IP in heterozygous worms is also not ideal, as it would be nearly impossible to distinguish wild protein from the mutant. The antibody we used recognizes the N terminus, which is identical in the two proteins. The only way to distinguish them would be mass spec. However, during the fragmentation process for mass spec, Q can deaminate to E, which would complicate interpretation of our data. To do this experiment properly, we would need to introduce a different tag into the mutant protein. With the current reagents, an IP is not possible.

      Instead, we have to rely on indirect evidence. The fact that DPY-27 and CAPG-1 colocalize (figure 4) does provide some support for the hypothesis. From previous studies,including our recent publication Trombley et al PLoS Genetics 2025, we know that the condensin IDC complex is not stable unless all subunits are present. It is therefore highly unlikely, although not impossible, that what we detect is diffuse individual subunits.

      We can make changes in the text to soften this claim and better discuss the caveats of the experiment and the conclusions.

      Along the same point, authors show that EQ protein that binds to the X is incapable of bringing H4K20me1, which is consistent with the possibility that a large portion of the EQ protein is not in a complex. : "To our surprise, we observed that there was no discernable enrichment of H4K20me1, even though there is discernable enrichment of DPY-27 EQ on the X chromosomes in the dpy-27 EQ mutants (Figure 8A).

      Response: There is an important difference. CAPG-1 and DPY-27 are both members of condensin IDC. The five subunits of this complex depend on each other for stability. DPY-21, the protein that introduces the H4K20me1 mark, also localizes to the X chromosomes, but is not part of condensin IDC. Condensin IDC is able to localize to the X chromosomes in the absence of DPY-21, and is not dependent on DPY-21 for stability. However, DPY-21 is dependent on condensin IDC for X localization (Yonker et al 2003). It is then possible that the mutant condensin IDC is X-bound, but it is unable to recruit DPY-21. We can clarify this in the text.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. It is unclear how long it would take to collect enough het/mutant worms can be collected for IP-western. Without additional evidence, interpretation of the data would be affected.

      Response: As explained above, collecting enough mutant worms is essentially impossible. Collecting enough heterozygotes is possible, but distinguishing the mutant protein from the wild type in hets is not.

      • Are the data and the methods presented in such a way that they can be reproduced? Yes
      • Are the experiments adequately replicated and statistical analysis adequate? Yes, except the presentation of the test (see minor comment below)

      Minor comments: - Specific experimental issues that are easily addressable. The use of letters for statistical test result is confusing and the figure legend is not clear about what actual p values were produced "Letters represent multiple comparison p values, with different letters indicating statistically significant differences, and any repeated letter demonstrating no significance. " Providing the values at a reasonably concise manner in the legend will help the reader a lot.

      Response: P values can be added to the figures, or the legend

      • Are prior studies referenced appropriately? The authors state that "Surprisingly, this mutant did not phenocopy the transgenic EQ mutant in [77], .." however in the previous paragraph, the authors state that the transgenic was expressed in the presence of wild type copy. Therefore, the endogenous mutant showing phenotypes rather than the transgenic is rather expected.

      Response: What we referred to were ways in which the protein behaved (for example in ability to bind to the X at all), and not mutant phenotypes of worms. We can clarify this in the text.

      The authors state that "One possible explanation could be that mitotic condensation has multiple drivers of equal consequence including changes in histone modifications [129], whereas condensation of dosage compensated X chromosomes is predominantly dependent on the DCC. " In a dpy-21 mutant, X chromosome decondenses but DPY-27 stays on the chromosome. Therefore, the effect of the EQ mutation may be due to lack of H4K20me1 enrichment in addition to the lack of loop extrusion.

      Response: We can add the role of H4K20me1 to the discussion.

      • Are the text and figures clear and accurate? Yes
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? The Pearson correlation coefficient for assessing colocalization between SDC-2 and DPY-27 was helpful for quantification, because there is a lot of background signal that makes the support for or lack of colocalization with the X in the other IF/FISH figures difficult to assess. Additionally, please provide information on how chromatic aberration was assessed when analyzing colocalization experiments.

      Response: Chromatic aberration was not considered for these experiments.

      Reviewer #2 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. Although long assumed to be a functional SMC, the demonstration of DPY-27 function depending on ATPase activity is important. This demonstrates that an X-specific condensin retained its SMC activity.

      • Place the work in the context of the existing literature (provide references, where appropriate). The authors do an adequate job in doing this in their discussion.

      • State what audience might be interested in and influenced by the reported findings. The field of 3D genome organization and function would be influenced by the reported findings.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Genomic analyses of 3D genome organization and gene expression.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors analyzed the ATPase function of an SMC-4 variant required for dosage compensation in C. elegans. They made a single amino acid mutation that significantly reduced ATPase activity of the protein as shown by in vitro ATP hydrolysis. They showed that the mutation results in the phenotypic consequences of those shown for other DC mutants, including viability assay, immunofluorescence and DNA FISH. These results demonstrate the important role of ATPase activity in transcription repression.

      Major comments:

      • Are the key conclusions convincing? The key conclusion that DPY-27 has ATPase activity and using a classic mutation that reduces it largely eliminates its function is convincing. The interpretation of the IF experiments to build the model in the final figure requires stronger evidence, as commented below in additional experiment section.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes, as explained below.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The main issue with the current model is that the authors assume that the EQ proteins that they are analyzing is in complex with the rest of the condensin IDC subunits. However, there is no evidence in the paper suggesting that this occurs. The results are consistent with the possibility that a large portion of the DPY27-EQ is not in a complex.

      IP-western experiments comparing the proportion of other subunits pulled down by the wild type versus the EQ mutant (perhaps extract from ~50% EQ containing population could be reached) is needed to understand the incorporation of the EQ mutant in the complex. This is particularly important for the interpretation of the data in Figure 4A, where 70% of the nuclei show diffuse CAPG-1 and DPY-27 EQ. Is this signal due to disassembled subunits diffusing freely, or as depicted in the model figure, bound less stably everywhere? The immunofluorescence results are consistent with both EQ mutation 1) forming a full complex and unstably binding or 2) destabilizing the complex but incompletely assembled complexes sustaining a pool of free EQ detected by the immunofluorescence experiments.

      Along the same point, authors show that EQ protein that binds to the X is incapable of bringing H4K20me1, which is consistent with the possibility that a large portion of the EQ protein is not in a complex. : "To our surprise, we observed that there was no discernable enrichment of H4K20me1, even though there is discernable enrichment of DPY-27 EQ on the X chromosomes in the dpy-27 EQ mutants (Figure 8A).

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. It is unclear how long it would take to collect enough het/mutant worms can be collected for IP-western. Without additional evidence, interpretation of the data would be affected.

      • Are the data and the methods presented in such a way that they can be reproduced? Yes

      • Are the experiments adequately replicated and statistical analysis adequate? Yes, except the presentation of the test (see minor comment below)

      Minor comments:

      • Specific experimental issues that are easily addressable. The use of letters for statistical test result is confusing and the figure legend is not clear about what actual p values were produced "Letters represent multiple comparison p values, with different letters indicating statistically significant differences, and any repeated letter demonstrating no significance. " Providing the values at a reasonably concise manner in the legend will help the reader a lot.

      • Are prior studies referenced appropriately? The authors state that "Surprisingly, this mutant did not phenocopy the transgenic EQ mutant in [77], .." however in the previous paragraph, the authors state that the transgenic was expressed in the presence of wild type copy. Therefore, the endogenous mutant showing phenotypes rather than the transgenic is rather expected.

      The authors state that "One possible explanation could be that mitotic condensation has multiple drivers of equal consequence including changes in histone modifications [129], whereas condensation of dosage compensated X chromosomes is predominantly dependent on the DCC. " In a dpy-21 mutant, X chromosome decondenses but DPY-27 stays on the chromosome. Therefore, the effect of the EQ mutation may be due to lack of H4K20me1 enrichment in addition to the lack of loop extrusion.

      • Are the text and figures clear and accurate? Yes
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? The Pearson correlation coefficient for assessing colocalization between SDC-2 and DPY-27 was helpful for quantification, because there is a lot of background signal that makes the support for or lack of colocalization with the X in the other IF/FISH figures difficult to assess. Additionally, please provide information on how chromatic aberration was assessed when analyzing colocalization experiments.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. Although long assumed to be a functional SMC, the demonstration of DPY-27 function depending on ATPase activity is important. This demonstrates that an X-specific condensin retained its SMC activity.

      • Place the work in the context of the existing literature (provide references, where appropriate). The authors do an adequate job in doing this in their discussion.

      • State what audience might be interested in and influenced by the reported findings. The field of 3D genome organization and function would be influenced by the reported findings.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Genomic analyses of 3D genome organization and gene expression.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Dosage compensation (DC) in C. elegans involves halving the gene expression from the two hermaphrodite X chromosomes to match the output of the single X in male worms. The key regulator of this repression is a specialized condensin complex, which is defined by a dedicated SMC-4 paralog, termed DPY-27. SMC-4 in other animals is an ATPase that functions as a motor of loop extrusion in cohesion complexes. In their current manuscript, Chawla et al. assessed whether DPY-27 has ATPase function and whether this activity is required for dosage compensation. It had previously been shown that an ATPase-deficient 'EQ' mutant DPY-27 protein interacts with other DC complex members, yet fails to localize to the X. This observation was made with an extra copy of DPY-GFP expressed in addition to the endogenous wildtype protein [Ref 77]. No dominant negative effect was observed. The authors have now engineered the 'EQ' mutation into the endogenous gene locus and genetically generated hetero- and homozygous ATPase mutant worms. Their data suggest that the ATPase activity is required or X-chromosome localization, complex assembly, chromosome compaction as well as enrichment of H4K20me1 on the dampened X chromosome.

      Major comments:

      1. ATPase assays, Figure 1.Preparations of individual recombinant proteins may vary significantly and may occasionally show much reduced enzymatic activity. A conclusion about the failure of an ATPase activity should not be concluded from a single preparation, but several protein preps need to be tested, which then serve as 'biological replicates' for the in vitro reaction. Apparently, the ATPase assays shown only involved technical replicates, which is not sufficient.

      2. CRISPR-mediated engineering may lead to unwanted reactions, exemplified by the 'indel' mutation that was recovered in one clone. As a good practice and important control, the sequences of the mutated alleles in the worms should be determined by sequencing of PCR products. Restrictions enzyme cleavage or gel electrophoresis of the PCR products is not sufficient to document the nature of the mutation.

      3. All IF data need to be collected from at least 2 biological replicates, i.e. the experiment must have been carried out independently on two different days. The replicates should deliver consistent results. The number of independent replicates should be mentioned in each figure legend.

      4. The expression levels of wildtype and mutant proteins are concluded from IFM. This is very qualitative; quantitative measurements would strengthen the paper.

      5. Figure 4B: What are the criteria for classification of the three classes of mutant nuclei? To the uninitiated eye they look very similar. I am a bit worried about the human bias, if such diffuse staining are to be categorized. The two categories of localization need be documented better.

      6. Figure 5: volume of the X chromosome. Related to (5): Apparently, the mask that contains the X chromosome was drawn by hand on each individual nucleus? I find it very difficult to see how the X chromosomal territory would be assessed in the examples shown. I would be good to see a panel of nuclei, in which the masks are visible. I think the analysis should be blinded, in which a researcher not involved in the analysis draws masks on coded nuclei and their classes are only revealed later. The same concern holds for the FISH/IP overlaps or DPY-27/SDC-2 overlaps.

      7. For figure 5, age-matched hermaphrodites were analyzed. How was the age determined and what would be the consequence of age-variations? What is the effect of the mutations on development?

      Minor comments:

      1. The labeling of p-values as a-f in the figures with the values listed in a supplemental table is not comfortable. The p-values corresponding to the letters should be listed in the corresponding legends.

      2. How were the concentrations of the ATPase preparations determined? It would help to see a proteins gel in the supplement to assess their purity.

      3. In figure 1, heterodimers are assumed, but not shown. Do they dimerize under these conditions?

      Significance

      Significance: The involvement of the ATPase function of DPY-27 was somewhat expected, in light of the earlier findings published in reference 77 using a transgene. The current study confirms and extends these earlier findings. In principle, the genetic experiment presented here is stronger, if documented better.

      Strengths: The study investigates endogenous proteins and measures different phenomena known to be correlated from previous work. The data are internally consistent.

      Limitations: The lack of biological replicates, and unclear procedures of how to draw the IF masks that underlie the conclusions about X chromosome (co)localization and nuclear volume determination render the argument less convincing. For this reviewer, who is not in the C. elegans field, the analysis of mutant phenotypes is difficult to follow. The conclusions are based on only one type of experiment. In reference 77, the X chromosome binding was done by ChIP-seq, clearly a superior, complementary method.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their thoughtful and detailed feedback, which we found highly constructive and encouraging. The comments have been invaluable in guiding improvements to the clarity, rigor, and impact of our manuscript. Below, we provide our responses and outline the specific revisions we plan to make in response to each point raised. It was extremely encouraging that all the comments were highly relevant to the study demonstrating careful work by experts in the field and they truly help to improve the clarity and message of the manuscript.

      2. Description of the planned revisions


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Gizaw et al characterizes the cholesterol biosynthetic pathway and the effect of its knockdown or inhibition on rhabdomyosarcoma tumor properties. The Authors find that the PROX1 transcription factor mediated cholesterol biosynthesis regulates rhabdomyosarcoma cell growth and proliferation. Blocking the cholesterol biosynthetic pathway leads to reduced proliferation, cell cycle arrest and ER-stress mediated enhanced apoptosis. Detailed transcriptomic analysis indicate gene expression patterns that support these findings. Reviewer #1 (Significance (Required)):

      Based on my expertise on rhabdomyosarcoma tumors, the manuscript is clear, concise and provides a significant advance to the field. Detailed mechanistic characterization is lacking, which takes away some of the significance of the findings, but the work done stands alone as description of the effect of the cholesterol biosynthetic pathway in rhabdomyosarcoma. Another aspect to be considered by the Authors is the potential specificity of targeting a ubiquitous pathway such as cholesterol biosynthesis, which is important to most cells and not only cancer cells. Overall, the manuscript may be revised to address the specific comments below.

      Responses to Reviewer #1 comments

      We thank the reviewer for the thoughtful and encouraging comments on our manuscript. We appreciate the recognition of the significance of our findings and the detailed suggestions provided. We are committed to addressing each of the reviewer's points to strengthen the manuscript and ensure clarity and rigor. Below, we outline how we plan to address each comment.

      Major Comments:

      1. __ Details of the healthy human myoblasts that are used in Figure 1A are not provided and should be updated. Evidence of PROX1 knockdown should be presented. What kind of pathways and gene ontology predictions were associated with the 225 genes that are commonly downregulated between all three cell lines in Figure 1A?__

      Response: In the revised manuscript, we will include complete information regarding the origin and characterization of the healthy human myoblasts used in the Figure 1A. We will also provide additional data confirming PROX1 knockdown. Furthermore, we will present more details on the gene ontology (GO) and pathway enrichment analyses, and include the full results as supplemental data to highlight key biological processes affected by PROX1 silencing.

      __ In Figure 2, while the effect of the shRNAs targeting DHCR7 or the DHCR7 inhibitor AY9944 are striking, it is not clear whether these effects are specific to rhabdomyosarcoma cells or cancer cells. A control, human myoblast cell line or another non-cancerous cell line should be used to repeat these experiments quantifying Caspase3/7 activity, cell growth etc. to assess the cancer cell specificity of such treatments. Evidence of DHCR7 knockdown at the protein level would add to the study.__


      Response: We fully agree with the reviewer's suggestion and will conduct additional experiments using non-cancerous human myoblasts to assess the specificity of DHCR7 inhibition. These will include assays for Caspase 3/7 activation, cell viability, and proliferation under similar conditions. We have already performed western blot validation of DHCR7 knockdown at the protein level in RMS cell lines and will include this data in the manuscript. We will also highlight in the discussion that RMS cells in our experiments were highly vulnerable when cultured with full media (incl. FBS), whereas previous studies with breast cancer cells have shown that their growth is affected by cholesterol biosynthesis inhibition only if they are cultured without serum (containing cholesterol). We also show that cholesterol supplementation does not rescue RMS cells demonstrating the essential role of de novo cholesterol synthesis.

      __ Western blots for Caspase3 quantification and a cell proliferation marker such as Cyclin D in shSCR and shDHCR7 tumor lysates would validate the data shown in the Figure 3. Are the shRNA constructs used inducible ones? If not, how do the Authors distinguish the effect of shDHCR7 on tumor engraftment versus tumor proliferation and growth? Many of the graphs need proper labeling of the axes and what the bars represent.__


      Response: We will include western blot analysis for cleaved Caspase 3 and Cyclin D1 in tumor lysates to support the observed effects on apoptosis and proliferation. We will clarify in the revised manuscript that the shRNA constructs used were constitutive. To distinguish between effects on tumor engraftment versus tumor growth, we will provide additional detail on how we controlled for initial cell viability and engraftment potential prior to injection. We will also revise figure panels to ensure all axes and error bars are clearly labeled.

      __ Gene ontology and pathway analysis will add to Figure 4.__


      Response: We will expand Figure 4 to include GO and pathway enrichment analyses of the RNA-seq data following DHCR7 knockdown. This will help illustrate the functional significance of the transcriptional changes and further support our conclusions regarding ER stress, apoptosis, and cell cycle regulation.

      __ In Figure 5A, how do the Authors explain the upregulation of cholesterol biosynthetic pathway genes upon shDHCR7 treatment? Are these effects seen at the protein level and if alternate pathways maintain cholesterol biosynthesis, how do the Authors think this strategy will be viable to treat such tumors? In Figure 5G-H, was a loading control used? If so, blots for that should be included.__


      Response: We will expand the discussion to address the compensatory transcriptional upregulation of cholesterol biosynthesis genes following DHCR7 knockdown, likely driven by SREBP-mediated feedback regulation. To support this, we will include western blot data for key enzymes in the pathway. We will also clarify that despite this transcriptional compensation, functional cholesterol synthesis is impaired due to DHCR7 silencing, which cannot be rescued by increased upstream pathway activity. Regarding Figure 5G-H, we will include the missing loading control images in the revised version. Protein normalization was performed using Stain-Free technology, which enables the quantification of total protein in each lane, and was analyzed using ImageLab 6.0.1 software (Bio-Rad). We will include the Stain-Free gel images to demonstrate equal protein loading and will also indicate the molecular weights of the presented proteins in the updated figure legend.

      __ Lines 286-287 refer to Figure S1G, H; it should be corrected to Figure S1I, J.__

      Response: We thank the reviewer for pointing this out. We will correct the figure citation in the revised manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript entitled "Targeting de novo cholesterol synthesis in rhabdomyosarcoma induces cell cycle arrest and triggers apoptosis through ER stress-mediated pathways" Gizaw et al investigate the crucial effect of targeting cholesterol biosynthesis in RMS. While this manuscript gives novel insights into putative therapeutic approach, there are some comments that should be address by the authors.

      Reviewer #2 (Significance (Required)):

      A nice and coherent study. Please see text above.


      Response to Reviewer #2

      We are grateful to the reviewer for the thoughtful and constructive comments on our manuscript. We appreciate your recognition of the novelty and therapeutic potential of our findings, and we thank you for highlighting specific areas that will help further improve the clarity, rigor, and reproducibility of our work. Below, we respond point-by-point to your comments and outline how we plan to address each issue in the revised version of the manuscript.

      Major Comments:

      1. __ The authors demonstrated a correlation between PROX1 levels and the cholesterol synthesis pathway. Which genes from the pathway are mostly affected? The manuscript could benefit from a graphical representation of the pathway showing up- and downregulated genes from the RNA-seq analysis. This will help in understanding why the authors decided to study HMGCR silencing as shown in Supplementary Figure 1A.__

      Response: We fully agree and will include a new graphical figure showing the cholesterol biosynthesis pathway, with up- and downregulated genes from our RNA-seq data visually mapped. This is, indeed, interesting as the whole pathway is consistently downregulated. We chose to study specifically these two rate-limiting genes in the pathway, as DHCR7 is the last enzyme in the mevalonate pathway and its inhibition does not affect other arms deviating from this pathway. It was also recently found to be highly upregulated in pancreatic cancer, suggesting its role in cancer development/growth. HMGCR was chosen as it is the target for statins, which are widely used in treating high cholesterol and shown to be rather safe in clinical use. We will add this rationale to the manuscript to clarify our focus on HMGCR and DHCR7.

      __ Based on the previous comment, are the genes from the cholesterol synthesis identified in the RNA-seq similar to those detected in the publicly available data set presented in Figure 1E? In addition, validation of changes of these genes should be performed in the RMS cell lines as well as in myoblasts.__


      Response: Yes, there is a significant overlap between the cholesterol biosynthesis genes identified in our RNA-seq dataset and those from the public dataset in Figure 1E. In the revised version, we will include this comparative analysis with the inclusion of the schematic figure (see our response #1). We also plan to perform qPCR validation of several key cholesterol biosynthesis genes in additional RMS cell lines and healthy myoblasts to reinforce the disease-specific regulation of this pathway.

      __ In Figure 3, the authors study the impact of DHCR7-silencing in tumor growth in vivo. Please, provide stainings also for DHCR7 to show that cells indeed have silenced DHCR7.__


      Response: Thank you for this important suggestion. We will include immunofluorescence staining for DHCR7 in xenograft tumor sections to confirm DHCR7 knockdown in vivo and visually validate the efficiency of our silencing strategy. We will also add qPCR results from the cells at the time when they were implanted confirming the deletion.

      __ In Figure 4, the RNA-seq data revealed downregulation in E2F genes as well as genes involved in cell cycle progression. It would be important that the authors provide examples of these genes and validate this data by performing qPCR.__


      Response: We will select representative cell cycle-related genes, including members of the E2F family and other G1/S and G2/M regulators, for qPCR validation in RMS cells following DHCR7 knockdown. Comparison to healthy myoblasts will be also performed. This will further substantiate the transcriptomic findings.

      __ In Figure 4J-M, cell cycle distribution using flow cytometry should be assessed in an additional cell line.__


      Response: We will repeat the flow cytometry-based cell cycle analysis in an additional RMS cell line to ensure reproducibility and confirm the generalizability of the observed G2/M arrest phenotype.

      __ In line 271, the authors described that PROX1 is associated with an increase in DHCR7. However, in the next paragraph they evaluated the effect of silencing HMGCR. Is this enzyme also increased? Please clarify.__


      Response: We appreciate the need for clarity. HMGCR expression is also elevated in RMS cells and regulated by PROX1. We will clarify this in the revised manuscript and update the text to explain the rationale behind examining both enzymes: HMGCR as the rate-limiting enzyme at the top of the cholesterol biosynthesis pathway, and DHCR7 as the final step enzyme. See also our response to question #1.

      __ The authors show that cholesterol biosynthesis is crucial in RMS. Would overexpression of DHCR7 in shDHCR7 cells rescue the anti-tumor effects? A rescue experiment would give information on whether this enzyme has a direct role in driving RMS cell behavior.__


      Response: This is an excellent suggestion. We are currently generating a DHCR7 rescue construct and plan to perform these experiments. While these data may not be available in time for the current revision, we will clearly outline this approach as a key next step in our Discussion section and incorporate results if available.

      Minor Comments:

      1. __ In line 287 "Supplementary Fig.1G and 1H" are mentioned, while it should be "Supplementary Fig.1I and 1J" since it regards the treatment with lovastatin.__

      Response: Thank you for catching this. We will correct the figure references accordingly.

      __ In line 340, authors mentioned the data "Supplementary Figure 4A and 4E", but there is not any corresponding data available in the Supplementary Information.__


      Response: We apologize for this oversight. These references will be corrected, and any missing supplementary data will be properly included and labeled.

      __ In the Legend of Figure 2L, authors mention "PRXO-1 silencing", this should be corrected to "shDHCR7". Also, please change "l" to capital "L".__


      Response: This will be corrected in the revised figure legend.

      __ In Figure 5G-H, please provide the data regarding loading control in the Western blot, as well as the molecular weights of the proteins presented.__


      Response: We thank the reviewer for this important point. For the Western blot analysis in Figure 5G-H, normalization was performed by quantifying the total protein in each lane using Bio-Rad's Stain-Free technology and analyzed with ImageLab 6.0.1 software. This approach allows for accurate lane-to-lane comparison without relying on a single housekeeping protein. We will add the Stain-Free total protein images as a supplemental figure (Supplementary Figure) and include the molecular weights for each of the proteins in the figure legend to improve clarity and reproducibility.

      __ Please, include the information of what black, red etc refer to in each figure. This information is missing in several figures including Figure 2D, 2K, 3C, 3J, 3K, 3L which makes it difficult to follow.__


      Response: We agree and will update all relevant figure legends to clearly explain color coding, symbols, and what each bar or line represents to improve figure clarity.

      __ The authors should indicate the numbers of biological replicates in individual experiments throughout whole figure legends.__


      Response: Thank you for the suggestion. We will include the number of biological replicates for each experiment in the figure legends to enhance transparency and reproducibility.


    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript entitled "Targeting de novo cholesterol synthesis in rhabdomyosarcoma induces cell cycle arrest and triggers apoptosis through ER stress-mediated pathways" Gizaw et al investigate the crucial effect of targeting cholesterol biosynthesis in RMS. While this manuscript gives novel insights into putative therapeutic approach, there are some comments that should be address by the authors.

      Major comments

      1. The author demonstrated a correlation between PROX1 levels and the cholesterol synthesis pathway. Which genes from the pathway are mostly affected? The manuscript could benefit from a graphical representation of the pathway showing up- and downregulated genes from the RNAseq analysis. This will help in understanding why the authors decided to study HMGCR silencing as shown in Supplementary Figure 1A.
      2. Based on the previous comment, are the genes from the cholesterol synthesis identified in the RNA-seq similar to those detected in the publicly available data set presented in Figure 1E? In addition, validation of changes of these genes should be performed in the RMS cell lines as well as in myoblasts.
      3. In Figure 3, the authors study the impact of DHCR7-silencing in tumor growth in vivo. Please, provide stainings also for DHCR7 to show that cells indeed have silenced DHCR7.
      4. In Figure 4, the RNAseq data revealed downregulation in E2F genes as well as genes involved in cell cycle progression. It would be important that the authors provide examples of these genes and validate this data by performing qPCR.
      5. In Figure 4J-M, cell cycle distribution using flow cytometry should be assessed in an additional cell line.
      6. In line 271, the authors described that PROX1 is associated with an increase in DHCHR7. However, in the next paragraph they evaluated the effect of silencing HMGCR. Is this enzyme also increased? Please clarify.
      7. The authors show that cholesterol biosynthesis is crucial in RMS. Would overexpression of the DHCR7 in shDHCR7 cells rescue the anti-tumor effects? A rescue experiment would give information on whether this enzyme has a direct role in driving RMS cell behavior.

      Minor comments:

      1. In line 287 "Supplementary Fig.1G and 1H" are mentioned, while it should be "Supplementary Fig.1I and 1J" since it regards the treatment with lovastan.
      2. In line 340, authors mentioned the data "Supplementary Figure 4A and 4E", but there is not any corresponding data available in the Supplementary Information.
      3. In the Legend of Figure 2L, authors mention "PRXO-1 silencing", this should be corrected to "shDHCR7". Also, please change "l" to capital "L".
      4. In Figure 5G-H, please provide the data regarding loading control in the Western blot, as well as the molecular weights of the proteins presented.
      5. Please, include the information of what black, red etc refer to in each Figure. This information is missing in several figures including Figure 2D, 2K, 3C, 3J, 3K, 3L which makes it difficult to follow.
      6. The authors should indicate the numbers of biological replicates in individual experiments through whole figure legends.

      Significance

      A nice and coherent study. Please see text above.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Gizaw et al characterizes the cholesterol biosynthetic pathway and the effect of its knockdown or inhibition on rhabdomyosarcoma tumor properties. The Authors find that the PROX1 transcription factor mediated cholesterol biosynthesis regulates rhabdomyosarcoma cell growth and proliferation. Blocking the cholesterol biosynthetic pathway leads to reduced proliferation, cell cycle arrest and ER-stress mediated enhanced apoptosis. Detailed transcriptomic analysis indicate gene expression patterns that support these findings.

      Major comments

      1. Details of the healthy human myoblasts that are used in Figure 1A are not provided and should be updated. Evidence of PROX1 knockdown should be presented. What kind of pathways and gene ontology predictions were associated with the 225 genes that are commonly downregulated between all three cell lines in Figure 1A?
      2. In Figure 2, while the effect of the shRNAs targeting DHRC7 or the DHRC7 inhibitor AY9944 are striking, it is not clear whether these effects are specific to rhabdomyosarcoma cells or cancer cells. A control, human myoblast cell line or another non-cancerous cell line should be used to repeat these experiments quantifying Caspase3/7 activity, cell growth etc. to assess the cancer cell specificity of such treatments. Evidence of DHRC7 knockdown at the protein level would add to the study.
      3. Western blots for Caspase3 quantification and a cell proliferation marker such as Cyclin D in shSCR and shDHRC7 tumor lysates would validate the data shown in the Figure 3. Are the shRNA constructs used inducible ones? If not, how do the Authors distinguish the effect of shDHRC7 on tumor engraftment versus tumor proliferation and growth? Many of the graphs need proper labeling of the axes and what the bars represent.
      4. Gene ontology and pathway analysis will add to Figure 4.
      5. In Figure 5A, how do the Authors explain the upregulation of cholesterol biosynthetic pathway genes upon shDHRC7 treatment? Are these effects seen at the protein level and if alternate pathways maintain cholesterol biosynthesis, how do the Authors think this strategy will be viable to treat such tumors? In Figure 5G-H, was a loading control used? If so, blots for that should be included.
      6. Lines 286-287 refers to Figure S1G, H; it should be corrected to Figure S1I, J.

      Significance

      Based on my expertise on rhabdomyosarcoma tumors, the manuscript is clear, concise and provides a significant advance to the field. Detailed mechanistic characterization is lacking, which takes away some of the significance of the findings, but the work done stands alone as description of the effect of the cholesterol biosynthetic pathway in rhabdomyosarcoma. Another aspect to be considered by the Authors is the potential specificity of targeting a ubiquitous pathway such as cholesterol biosynthesis, which is important to most cells and not only cancer cells. Overall, the manuscript may be revised to address the specific comments below.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study applies cellular and molecular assays, together with transcriptome analysis, to dissect how certain heterochromatin-based epimutations can confer resistance to caffeine and other drugs in fission yeast cells. The findings indicate that compromising the function of two mitochondrial proteins, Cup1 and Ppr4, leads to increased oxidants and the activation of the mito-nuclear retrograde response, which in turn causes the activation of the Pap1-mediated oxidative stress response, including the induction of transmembrane transporters to increase the efflux of drugs. This provides mechanistic insights into how the chromatin-mediated silencing of mitochondrial factors can result in fungal drug resistance. The authors also show that these phenotypes are variable within a cell population, allowing phenotypic plasticity to changing environments. This is a straightforward and clearly presented study, and the conclusions are generally justified based on the experiments presented.

      Minor comments:

      1. Fig. 3A: The legend needs more information to understand what is shown here. Does this show the normalized read counts (cpm?) for each gene scaled per average counts in all samples? Another possibility would be to show relative data for the two mutants compared to wild-type. Also, the labels for the bottom two clusters seem the wrong way round, i.e. the last cluster should be cup1-tt only. How many genes are shown here which made the cutoff?
      2. To strengthen some of the conclusions, it would be meaningful to calculate the significance of overlaps between key gene lists, given the size of the lists involved and the background gene list (Fig. 3B; Fig. 4).
      3. The font size indicating the significance of differences is too tiny in some bar plots (Fig. 5C-E; Fig. 6D; Fig. 7C).

      Referees cross-commenting

      In response to issues raised by Reviewer 1:

      In my opinion, the growth, TPF and ROS assays applied are robust and diagnostic to show a mitochondrial dysfunction. Additional assays, like Seahorse, would provide more specific insights about particular aspects of mitochondrial dysfunction, but this is not really relevant to this study. The key point is that the epimutations compromise mitochondrial function by downregulating mitochondrial proteins, which, in turn, are exploited by the cell to trigger a stress response that protects against antifungal compounds. The exact nature of the mitochondrial dysfunction, any changes in morphology, or details of differentially expressed genes are not critical for this mechanism, as it relies on downstream processes like the retrograde response that is activated by diverse mitochondrial problems.

      The question of whether heterochromatin-mediated resistance phenotypes are prevalent in human fungal pathogens is interesting and an important avenue for future study. But it is not evident to me how this could be addressed bioinformatically.

      Significance

      This manuscript builds on a previous study by the same group, which showed that different heterochromatin-based epimutations can provide cellular resistance to caffeine (Torres-Garcia et al. Nature, 2000). Here they use the UR1 and UR2 epimutations to highlight an example of how such mutations can generate antifungal resistance and phenotypic plasticity by exploiting side effects of mitochondrial dysfunction. Epimutations are an interesting case of cellular adaptation that lasts longer than gene-expression responses but are more readily reversible and flexible than genetic mutations, allowing bet-hedging by generating variable phenotypes in a clonal cell population. This study provides fresh insights into the downstream effects of epimutations causing altered cellular traits, thus complementing previous studies focusing on the patterns and mechanisms of establishing heterochromatin-based genomic islands. The current study is of interest to researchers working on genome regulation, mitochondrial function, cellular adaption/evolution, and has possible applications to combat antifungal resistance.

      Field of expertise: genome regulation, gene function, fission yeast, stress response

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This very interesting manuscript describes the impact of heterochromatin in triggering down-regulation of mitochondrial respiratory activity in S. pombe, and thereby causing increased efflux and consequently increased resistance to several compounds, including the azole class of antifungal drugs. The authors performed detailed mechanistic studies, focusing on two mitochondrial genes, cup1 and ppr4, which are under heterochromatin-dependent repression. Based on their findings, they conclude that reduced mitochondrial respiration causes increased levels of reactive oxygen species (ROS), which activates the transcription factor Pap1. Pap1 then upregulates several genes, including efflux pumps. The authors performed an excellent set of experiment to address heterogenous cell populations in the epimutants, which are described in the second part of the Results section and provide strong evidence of plastic drug resistance phenotypes.

      The manuscript is beautifully written and the data is presented well. Overall, the conclusions are supported by the data.

      I have a reservation with one particular conclusion that I discuss below under point 1. This can be addressed by modifications to the text. Under point 2, I suggest an easy to do experiment, which would strengthen the conclusion that ROS produced due to mitochondrial dysfunction are driving the drug resistance phenotype. This is an interesting mechanism, and the data in the manuscript supports it, but the authors' have not demonstrated it directly. They could do so by using antioxidants, as I suggest below.

      1. Most of the mechanistic analysis is centred around the transcription factor Pap1. The authors performed experiments to connect the production of ROS in mitochondrial mutants, with higher nuclear localisation of Pap1 and its activation of several genes, including the membrane transporter Cas5 and to a lesser extent Bfr1, which might be responsible for increased efflux. There is no question that efflux is elevated in mitochondrial mutants (a phenotype consistent with previous work in other yeast models). The authors also present data to show that inhibition of efflux reverses drug resistance. The data for Pap1's involvement is good in the cup1 mutant (one of the mitochondrial mutants that was studied) but not so much in the ppr4 mutant (the other mitochondrial mutant that was studied). There was little enrichment of Pap1 on the Cas5 promoter in the ppr4 mutant, and no effects of Pap1 on the expression of Cas5 in the ppr4 mutant (Fig 5C and 5D). While the pap1 mutation reduced resistance of the ppr4 mutant to drugs, the authors acknowledge that this could be due to increased sensitivity of the pap1 mutant to drugs. The enrichment of Pap1 on the bfr1 promoter was also modest in the mitochondrial mutants.

      I would therefore suggest that another transcription factor might be responsible for the upregulation of these efflux pumps and/or other efflux pumps are involved in Pap1' s contribution to drug resistance. The authors should consider modifying their conclusions on Pap1-dependent targets that are responsible for drug resistance in the mitochondrial mutants. 2. The authors' conclusion that increased ROS levels upon dysfunctional respiration might be driving the drug resistance phenotype in S. pombe (via Pap1 but perhaps other mechanisms too), presents a novel mechanistic link between mitochondria and drug resistance. I would suggest solidifying this conclusion by asking if antioxidants can reduce ROS levels and thereby decrease drug resistance in S. pombe. N-acetyl-L-cysteine could be used for this purpose.

      Significance

      That mitochondrial dysfunction causes drug resistance has been known for over 20 years. This manuscript describes a new mechanism, which relies on the formation of semi-stable epimutants, whereby the expression of genes encoding key mitochondrial proteins is down-regulated. As the authors propose, the beauty of epimutations is that they cause a heterogenous phenotype and are reversible, which would create an opportunity for the organism to use a bet-hedging strategy in drug. The ability to reverse the phenotype would be particularly important with using mitochondrial dysfunction as a strategy to increase drug resistance, because mitochondrial dysfunction lowers metabolic flexibility and growth rates for the organism. Therefore, it is only beneficial in the presence of drugs. This is to my knowledge one of the first logical mechanistic explanations for how fungal cells (but likely applicable more broadly) might use mitochondrial dysfunction to their advantage when needed, and then this can be reversed back to respiratory competence to maintain metabolic flexibility when drug selection is no longer present.

      This study will be of high interest to researchers studying drug resistance and how phenotypic plasticity and bet-hedging mechanisms are used by cells to survive toxic compounds. This is applicable across fields. This study will further be very interesting to the fields of antifungal drug resistance and fungal pathogenesis, and will provide the foundation for studying similar mechanisms in relevant fungal pathogens of animals and plants.

      My expertise is in metabolism and mitochondrial roles in fungal pathogens. I really enjoyed reading the manuscript.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript under review appears to present findings on heterochromatin-mediated antifungal resistance, specifically focusing on the role of mitochondrial dysfunction in the model organism Schizosaccharomyces pombe. However, there are several significant concerns regarding the novelty and robustness of the conclusions drawn by the authors.

      The key conclusions of the paper lack sufficient convincing evidence. While the authors attribute resistance phenotypes to heterochromatin-mediated repression, the evidence presented does not strongly support these claims. Significant claims should be qualified as preliminary or speculative, particularly those that extend beyond the experimental results provided. For example, asserting a definitive link between heterochromatin status and antifungal resistance mechanisms requires more comprehensive empirical data.

      Additional experiments are crucial to bolster the claims made in the manuscript. The authors rely heavily on growth assays on nonfermentable carbon sources to supposedly elucidate respiratory function. However, this approach is outdated, and advancements in the field should be employed for a more robust assessment of mitochondrial integrity and function. Techniques such as the Seahorse assay could provide critical insights into the respiration capacity of the mutants. Furthermore, the use of electron transport chain (ETC) inhibitors like antimycin A would offer stronger evidence regarding mitochondrial dysfunction. The current use of generalized DCF staining to assess reactive oxygen species (ROS) lacks specificity. MitoSox and MitoTracker should be utilized to measure mitochondrial ROS levels and examine mitochondrial morphology effectively.

      The authors claim that mitochondrial dysfunction correlates with significant changes in the transcriptome related to aerobic respiration, yet this crucial aspect lacks adequate elaboration in their analysis. Given that mitochondrial function is a primary theme of the manuscript, in-depth discussion and interpretation of the differentially expressed aerobic respiratory genes in the transcriptome data are necessary to validate their conclusions.

      Additionally, as a pathogenic fungal microbiologist, I express interest in investigating whether heterochromatin-mediated resistance phenotypes are prevalent in human fungal pathogens, including Candida albicans and Cryptococcus neoformans. A bioinformatic analysis could help address this inquiry and potentially broaden the relevance of the findings.

      Lastly, in the section "Cup1 and Ppr4 deficiencies...retrograde gene repression," the conclusions are made primarily based on transcriptome analysis and lack empirical confirmation through molecular biology techniques. This section should be revised to include comprehensive molecular evidence supporting the claims.

      Significance

      General Assessment: Strengths and Limitations

      Strengths:

      The study introduces a potentially novel mechanism of antifungal resistance in Schizosaccharomyces pombe through heterochromatin-mediated epimutations. This is particularly relevant in the context of rising antifungal resistance globally.

      The focus on mitochondrial dysfunction as a contributor to drug resistance provides a deeper understanding of how fungi adapt to environmental stressors.

      The manuscript raises important questions about the epigenetic factors influencing fungal resistance, which could inspire subsequent investigations in the field.

      Limitations:

      The research relies heavily on traditional methods for assessing respiratory function, which may not fully characterize the complexities of mitochondrial integrity and function. This may weaken the overall conclusions regarding mitochondrial dysfunction.

      The evidence supporting key claims is not robust enough to confidently assert a direct link between heterochromatin changes and antifungal resistance. The lack of confirmatory experiments using more advanced techniques limits the study's impact.

      The analysis of transcriptome data is insufficiently detailed, leaving significant gaps in understanding the specific mechanisms at play.

      Advance: Comparison to Existing Published Knowledge

      This study contributes to the existing literature by exploring the role of epimutations in antifungal resistance, aligning with emerging interests in epigenetic mechanisms in microbial adaptation. While previous studies have focused on genetic mutations and efflux mechanisms, this research attempts to link heterochromatin dynamics to resistance pathways, thereby filling a conceptual gap in understanding how eukaryotic microorganisms may adapt to antifungal treatments.

      However, the advances made by the study appear to be incremental rather than groundbreaking. ​While it does shed light on the potential role of heterochromatin in drug resistance, further empirical evidence and a stronger methodological approach are required to substantiate these findings convincingly.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Kemp et al. aimed to explore the transcriptional cell cycle regulation of replication-dependent (RD) histone genes at histone locus body (HLB) in Drosophila. They evaluate the accumulation of RNA pol II and RD histone transcripts at HLB during the cell cycle using live and fixed imaging of Drosophila tissues at different stages of development. They find that RNA pol II is enriched at HLB, not only during S phase when RD histone genes are transcribed but throughout the cell cycle. Outside of S phase, they detect short but not full-length RD histone transcripts suggesting a mechanism of RNA pol II pausing. Full length RD transcripts are only produced upon cyclin E/Cdk2 activation when cells enter S phase, arguing that Cyclin E/Cdk2 can activate transcription elongation. They propose that the elongation release triggered by Cyclin E/Cdk2 is the critical step linking RD histone gene expression and cell cycle progression rather than the recruitment of RNA pol II to HLB. The data are interesting and robust, however, additional experiments could reinforce the findings and the proposed model.

      Specific comments/concerns are listed below.

      1. In Figure 3, quantifications of the fluorescence at HLBs for mCherry-RBP1 and MXC-mScarlet should be provided.
      2. In Figure 5C, both 5' and 3' transcripts are observed in G214 cells. However, their accumulation in the cytoplasm is not visible. How do the authors explain this result? What happens in S14 cells?
      3. In Figure 6, the authors observed RD histone 3' transcripts only in replicating cells (EdU positive) while they detected 5' transcripts in both replicating and non-replicating cells. They argue that the appearance of 3' transcripts is due to the release from transcriptional pausing. To further support particular states in the transcriptional arrest, data by immunofluorescence using specific antibodies recognizing either RNA pol II ser5P or ser2P would determine whether the presence of 3' transcripts is associated with the accumulation at HLB of RNA pol II ser2P (elongating polymerase). Moreover, is there a correlation between P-MXC and RNA pol II ser2P?
      4. In Figure 7 panels C and D, the 5' transcripts should be shown. Although RD histone 3' transcripts accumulate in CyE+ embryonic cells, unfortunately, their presence at HLBs (pointed by arrows) is not visible in the image of panel E. To firm up conclusions quantifications of the 3' and 5' transcripts should be provided for CycE+ and CycEnull cells. In Hur et al., 2020, the authors looked at RD histone transcripts in WT embryo and CycE+/-/Cdk2+/- mutant. They found that the amount of H3 transcripts using a probe corresponding to the coding sequence is not changed in the mutant as compared to the WT. In contrast, they found that there is an increase of transcripts that are not correctly processed using probes downstream the stem-loop region. This seems inconsistent with the results presented here where a decrease of 3' transcripts is observed. This needs an explanation/discussion. Are such incorrectly processed transcripts observed in CycEnull mutant?
      5. The authors suggest that active Cyclin E/Cdk2 triggers the release of RNA pol II promoter-proximal pausing and therefore induces transcriptional elongation at RD histone genes when cells enter S phase. To further support this hypothesis, determining whether there is an enrichment of the elongation factor p-TEFb at HLB when Cyclin E/Cdk2 is active would help.
      6. Instead of using cycling E mutants, to determine whether it is the phosphorylation of MXC which directly impacts the elongation of RD histone genes, it would be interesting to generate phospho-null or phospho-mimetic mutant of MXC.
      7. In Suzuki et al., 2022, the authors described 3' RNA pol II pausing at RD histone genes. Although this study used human cells, it would be interesting to discuss that in addition to a promoter-proximal pausing that regulates transcription elongation, a 3' pausing could also regulate the transcription termination and 3' processing.
      8. In the discussion, the authors should point out some limitations of their studies linked to the method and could propose for the future that a more precise and molecular view of the pausing mechanism could be carried out using sequencing methods such as ChIP-seq of various isoforms of the RNA pol II (total, ser2P, ser5P) and elongation regulators (p-TEFb.....) and PRO-seq.

      Minor points:

      1. In Figure 1, for panels B and D as well as for panels C and E, to falicitate comparison of the localization of the different proteins, it would help to show the same developmental stages and the same image scales.
      2. In Figures 3 and 7 (C-F), the developmental stages should be indicated on the images, as it is done in the other figures.
      3. In the legend of Figure 7, it is indicated (D) and (E) instead of (C) and (D) in the sentence: "Endocycling midgut cells in (D) contain cytoplasmic histone mRNA which is absent in (E) (boxed regions)."

      Significance

      Kemp et al. aimed to explore the transcriptional cell cycle regulation of replication-dependent (RD) histone genes at histone locus body (HLB) in Drosophila. They evaluate the accumulation of RNA pol II and RD histone transcripts at HLB during the cell cycle using live and fixed imaging of Drosophila tissues at different stages of development. They find that RNA pol II is enriched at HLB, not only during S phase when RD histone genes are transcribed but throughout the cell cycle. Outside of S phase, they detect short but not full-length RD histone transcripts suggesting a mechanism of RNA pol II pausing. Full length RD transcripts are only produced upon cyclin E/Cdk2 activation when cells enter S phase, arguing that Cyclin E/Cdk2 can activate transcription elongation. They propose that the elongation release triggered by Cyclin E/Cdk2 is the critical step linking RD histone gene expression and cell cycle progression rather than the recruitment of RNA pol II to HLB.

      The data are interesting and robust, however, additional experiments could reinforce the findings and the proposed model.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Kemp et al. seek to define the molecular interactions that limit replication-dependent histone gene transcription to S-phase of the cell cycle. They use the Drosophila model system and leverage live-imaging tools, such as tagged proteins and Jabba trap, and RNA FISH in several tissues to determine that RNA Pol II is enriched at the locus throughout the cell cycle and is paused outside of S-phase. Therefore, they conclude that it is not Pol II recruitment to the locus that couples histone transcription to S-phase, but release of Pol II pausing.

      Major comments:

      The data presented are clean and well-presented. The claims are supported by the data without exaggeration. It would be helpful to provide -omics support for this entirely image-based analysis (e.g. PRO- or GRO-seq data from synchronized, sorted Drosophila cells may already exist- OPTIONAL).

      A major requirement is that the authors make clear in Introduction and Discussion that the observation of Pol Ii pausing at RD histone genes is not novel. This requires, at minimum, a discussion of Liu (2024) and Suzuki (2022). This allows readers to focus on the advance novel to this work, which is specifically the cell cycle coupling of Pol II pausing.

      As the authors are claiming different dynamics between Spt6 and RPB1 in Figure 1, they should provide similarly-staged embryos for comparison. For example, the authors should show RPB1 in early/mid S of NC 14, as this is when they see Spt6 variability. In theory, this should be relatively easy as these are stills from the live videos.

      Minor comments:

      The use of Spt6 live imaging early on was slightly confusing. The authors should consider moving this data later in the results or providing more written justification for why they investigated Spt6 (further than "to further explore the regulation of RNA pol II dynamics... p6). Similarly, Spt6 is included in the model figure, which might be a stretch given the only Spt6 data involves the timing of Spt6 colocalization with Mxc during the cell cycle.

      Misleading language/missed citations:

      p3: "600 kB array" is misleading. The whole locus is ~ 600 kB.

      p3: Mxc may remain at the locus throughout the cell cycle, so the whole HLB does not disassemble (Terzo, 2015).

      p4: H1-specific factors include cramped (Gibert and Karch, 2011; Bodner et al. 2024 bioRxiv)

      p4: Hodkinson, 2023 is not the correct reference. The correct reference is Hodkinson, 2024, Genetics.

      p5: The Drosophila HLB is detectable at NC 10 (White, 2011; Terzo, 2015) not White, 2007

      p5: A typo: "imagining"

      p7: The section title "RNA pol II is necessary for HLB assembly" is incorrect, as Figure 3 shows that pol II is NOT necessary for Mxc recruitment, but for HLB growth. Mxc, however, is necessary for pol II recruitment.

      p9: The authors should clarify what "HisC" means as this is the first usage.

      Figures/experiments:

      Fig 2: The authors should show the gating in Figure I that led to the three categories in Figure J. The legend/colors in Figure J are not necessary.

      An "easy" experiment would be to use the FUCCI cell lines and 5'/3' RNA FISH in combination (assuming fluorophores allow) - OPTIONAL

      Discussion:

      p13: The reference to the work of Gugliemlmi, 2013 should first come up in the Introduction, as it provides rationale.

      p13: "without engaging in transcription" is misleading, as pol II is transcribing, but paused.

      p15: It makes sense for pol II to pause at histone genes in G1, as they are preparing for the rapid burst of histone transcription needed in S phase. But what might be the functional rationale for pol II pausing in G2, if the HLB disassembles in M?

      Methods:

      It should be made clear how embryos were staged for live imaging, as it is likely by timing after cell cycle events. What is this timing? It would be best if this detail is not just mentioned in the methods, but also in the main text. This is especially important for readers not familiar with Drosophila embryogenesis. Please cite/acknowledge DGRC for Fly-FUCCI line (if appropriate)

      Significance

      This study provides convincing evidence that pol II is enriched at the histone locus and paused outside of S-phase. What limits the significance is that several prior studies concluded that Pol II is paused at the histone locus:

      Lu et al. bioRxiv 2024, "Integrator-mediated clustering of poised RNA polymerase II synchronizes histone transcription"

      Suzuki et al. Nat Comm 2022, "The 3' Pol II pausing at replication-dependent histone genes is regulated by Mediator through Cajal bodies' association with histone locus bodies"

      Neither of these studies is discussed or even cited in the manuscript, which is disappointing. Therefore, the advance is limited to the cell-cycle coupling of pausing. This is still important, as a major knowledge gap as outlined by the authors is that it is not clear how histone transcription is coupled to S-phase and they rule out Pol II pausing as a possible mechanism, and point toward Pol II pausing release.

      Moreover, there is also evidence (from these authors) that Mxc phosphorylation is not always coupled to histone transcription in Drosophila ovaries. This work is also not discussed or cited:

      Potter-Birriel et al. J Cell Sci 2021, "A region of SLBP outside the mRNA-processing domain is essential for deposition of histone mRNA into the Drosophila egg"

      The current research may be of interest to the broad cell cycle field, but it may also be useful as a model for those conducting basic, foundational research who seek to describe how Pol II is released from pausing. The histone locus may be of interest as a novel, facile model for pausing.

      Reviewer expertise: Drosophila, chromatin, gene expression

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      The following revisions are in progress:

      - From Reviewer-1: The authors observe defects in CNCCs through genomic experiments. It would be really nice to perform simple wound healing/scratch assays and/or transwell assays to test if the CNCC migration phenotype is reduced in the CHD3 KO as well which would support the transcriptomic data.

      As recommended by the Reviewer, we are performing a transwell assays to investigate whether CHD3 loss leads to defects in cell migration. These experiments should be completed in the next two weeks.

      __- From Reviewer-2: __Since CHD3 shows a progressive upregulation in expression during CNCC differentiation (Fig. 2E), one hypothesis can be that it is not necessary involved in the activation of the CNCC programs but instead it is involved in maintaining these programs active - by keeping regulatory elements accessible. Thus, authors should check expression of CNCC markers, and EMT genes at the same time point than Fig. 2E in both WT and KO cells.

      As recommended by the reviewer we are differentiating the cells to perform RT-qPCR timecourse for CNCC and EMT markers. These experiments will be completed in the next two weeks.

      __- From Reviewer-2: __It has been shown that CNCC regulatory elements controlling differentiation genes are primed/accessible prior migration (PMID: 31792380; PMID: 33542111). Since the authors claim "CHD3 may have the role of priming the developing CNCCs to respond to BMP by opening the chromatin at the BMP responsive enhancers", it will be good to perform ATAC-seq are several time point during the differentiation process to assess the dynamic of chromatin reorganization to see when the switch to mesoderm fate occurs and how accessibility of BMP responsive element changes in WT and KO cells during CNCC differentiation to be able to demonstrate the KO fail to make BMP responsive element accessible or whether it is a defect in the maintenance of this accessibility.

      As recommended by the Reviewer, we are differentiating the cells to perform ATAC-seq timecourse. These experiments will be completed in the next two/three weeks.

      2. Description of the revisions that have already been incorporated in the transferred manuscript

      The following revisions have already been carried out:

      Reviewer1

      1. Figure 1 presents nice confirmation of the CHD3 KO cell lines being used. However, given that these cell lines were previously published, I suggest moving these data to the supplement. As suggested by the Reviewer, we moved most of Figure 1 to the supplement, merging the remaining Figure 1 with Figure 2.

      In the results section for Figure 1, the authors discuss the CHD3 heterozygotes, but I only see the KO cell line data presented. It would be especially nice to see the protein levels of Chd3 in the het.

      As suggested, we have now performed western blot and qPCR for CHD3 in the heterozygous line and added it to Supplementary Figure S1.

      The authors discuss which genes are up and downregulated in the Chd3 KO D18 RNA-seq, and show a clear heatmap in Figure 2A for WT cells. The same heatmap for candidate genes discussed in the results would be appreciated for Chd3 KO.

      As recommended by the Reviewer, we have added CHD3-KO RNA-seq to the heatmap in Fig. 2A.

      In general 2-3 replicates are presented. While the authors are showing heatmaps for selected locations for individual clones, which is appreciated (ex: Figure 4B and Fig 6), the QC for data quality is missing. For example, show spearmean correlation across the genome for datasets as a supplement.

      We performed spearman correlation of ATAC-seq and RNA-seq data, which confirmed the replicates are very highly correlated, and created new dedicated supplemental figures (Supplemental Figures S3, S4, S5, S6, S7).

      In the section discussing the results presented in Figure 4, the authors discuss the ATAC-seq peak number changes and overlap with gene expression changes. However, the overlap with gene expression changes is not shown. Making a simple Venn diagram would help readers.

      As suggested, we added a Venn diagram with ATAC-seq/RNA-seq overlap in Figure 3D.

      In addition, showing a heatmap for unchanged ATAC-seq peaks can help to demonstrate the increase/decrease.

      As recommended, we have added an heatmap for unchanged ATAC-seq regions as Supplementary Figure S7.

      In Figure 6, the authors present ChIPseq data for CHD3 in D14 and D18 samples, focusing on locations losing or gaining accessibility. What is enrichment at unchanged sites? Is CHD3 specifically enriched at changed locations? Then what about over genes with altered gene expression vs not changed? Is CHD3 only bound to distal elements? Performing an analysis of the peak distribution, perhaps with ChromHMM or other methods to look at promoter vs enhancer vs other locations. These types of analyses could really enrich the interpretation of direct CHD3 function.

      Unfortunately, there is no ChromHMM data for neural crest cells, nor for closely related cell types. Therefore, to address the Reviewer's suggestion, we have taken two approaches: 1) We have further broken down the distribution of the peaks, dividing them between intergenic, intronic, exonic and TSS. Moreover, we have leveraged publicly available H3K27ac ChIP-seq data generated (by our group) in iPSC-derived CNCCs to identify CHD3 peaks that are decorated by this histone modification which typically marks active enhancers. This analysis revealed that 91% of the peaks are either intergenic (50%) or intronic (41%) and that ~a third of the peaks are decorated with H3K27ac in human iPSC-derived CNCCs, suggesting that they are bona-fide active enhancers in this cell type.

      Related to the above, I am not sure if there is a phenotypic test for enhanced mesoderm. I suspect only IF/expression and morphology are possible, which the authors did. However, sorting the cells (with some defined markers) to ask how many are mesoderm-like vs CNCC in WT vs CHD3 KO would give some information outside of the bulk expression data.

      The manuscript already included IF experiments for mesodermal markers, which clearly show that nearly all the cells acquired the mesodermal fate. See for example Brachyury IF in Figure 2E.

      Minor points Reviewer-1: 12. 1A seems to fit better with Figure 2. Done 13. The authors say that the KO cell lines are not defective in pluripotency, but Figures 1G suggests a slight decrease in SSEA-1. Is this reproducibly observed? It is not statistically significant and not reproducibly observed. 14. Would be nice to show number of up and downregulated genes in volcano plots for fast viewing of readers (ex: Fig 2B). We have modified the volcano plot as suggested. 15. Is it fair to use violin plots when data points are only 2-3 replicates (as in Figures 2C, 3D). To address this, we have layered the actual datapoints on top of the violin plots.

      The labels in Fig 4A and 5E are very hard to read.We have changed color to improve readability. 17. For browser tracks, the authors show very zoomed in examples (Fig 4C, and especially Fig 6C). showing a bit more of the area around these peaks would give readers a more clear appreciation of the data. Related to browser tracks, including more information just as including the gene expression changes (such as in Fig 6C) to enhance the interpretation of the impact of Chd3 binding, accessibility change and then, I presume, reduced Sox9 expression. Similar suggestion for Figure 4C, where I anticipate coordinate transcription changes of the associated genes. We have zoomed out the tracks, as suggested, and added expression data next to them. 19. Do the authors observe any clone variability between the two CHD3 KO clones? There is variability I see in some of the heatmaps, but don't know if that it is because of clones or technical variation. We do not observe any significant variability between the clones.

      Reviewer-2 1. What is the expression level of CHD3 in the heterozygote line? Does the remaining allele compensate for the loss which will explain the absence of phenotype?

      Ass suggested also by Reviewer-1, we have performed western blot for CHD3 in the heterozygous line and added it to Supplementary Figure S1. The bot shows that the remaining allele does not compensate. However it is likely that even a reduced amount of wild-type CHD3 is sufficient for proper CNCC specification.

      The authors should use the term "regulatory elements" instead of "enhancers" as they can act either as activator or repressors.

      As suggested, we have changed nomenclature from enhancers to cis-regulatory elements.

      On the same line, while the authors indicate "Motif analysis of the enhancers aberrantly active in CHD3-KO cells ", they haven't shown these are active. They should say they perform the analysis on regulatory elements aberrantly accessible in CHD3 KO. Done.

      See point 3 above.

      The rationale that led the authors to focus on genes typically expressed in the primitive streak and in the early pre-migratory mesoderm, and BMP responsive transcription factors could be better explained. Are they part of the most deregulated genes in the RNA-seq analysis?

      Not only mesodermal genes are among the most upregulated genes in the RNA-seq, but the motifs for the transcription factors encoded by these genes (e.g. TBR2, Brachyury, GATA, TBX3, TBX6) are among the most frequently represented in the aberrantly accessible cis-regulatory elements. The same applies to BMP responsive factor, but the other way around (they are downregulated and enriched in the aberrantly closed ATAC-seq regions).

      In the absence of CHD3, BMP response is not effective. While the authors nicely showed this is linked with changes in chromatin accessibility, it is necessary to check the expression levels of BMP receptors in CHD3 KO cells.

      We have checked the expression of these genes, and they were not differentially expressed. This is consistent with the downstream response being affected rather than ligand binding to the receptors.

      Aberrant early mesoderm signature of the CHD3-KO cells needs to be better shown. It is not obvious from the GO analysis in Fig. 2 and the authors then showed expression of some markers but it is unclear how they picked them up.

      See point 5: not only mesodermal genes are among the most upregulated genes in the RNA-seq, but the motifs for the transcription factors encoded by these genes (e.g. TBR2, Brachyury, GATA, TBX3, TBX6) are among the most frequently represented in the aberrantly accessible cis-regulatory elements. See for example expression levels of typical mesodermal genes below:

      EOMES - upregulated log2FC: 5.5

      TBXT - upregulated log2FC: 4.6

      MESP1 - upregulated log2FC: 4.7

      MIXL1 - upregulated log2FC: 5.4

      TBX6 - upregulated log2FC: 3.2

      MSGN1 - upregulated log2FC: 4.6

      HAND1 - upregulated log2FC: 5.5

      The authors claim CHD3 directly binds at BMP responsive enhancers, but in the figure, they show the data for all the region gaining or losing activity. It will be nice to add the information for the BMP responsive elements only.

      As recommended, we have added an heatmap for BMP responsive regions only, clearly showing that CHD3 binds them (Supplementary Figure S7).

      The authors need to support better that CHD3-KO express more Wnt signaling/activity.

      We have checked expression of many genes that are typically Wnt responsive during mesoderm specification (see also point 7). These include:

      EOMES - upregulated log2FC: 5.5

      TBXT - upregulated log2FC: 4.6

      MESP1 - upregulated log2FC: 4.7

      MIXL1 - upregulated log2FC: 5.4

      TBX6 - upregulated log2FC: 3.2

      MSGN1 - upregulated log2FC: 4.6

      HAND1 - upregulated log2FC: 5.5

      These data clearly support that the Wnt-mediated mesodermal program is markedly upregulated.

      Minor points Reviewer-2: 13. In the discussion, the authors could indicate whether CHD3 mutants somehow phenocopies some of the craniofacial defects observed in DLX5 mutant patients. Done. 14. It is not indicated were to find the data regarding expression epithelial and mesenchymal genes in the CHD3-KO cells. They are in the heatmap in Fig. 1C. 15. Authors could add in the discussion what is known about how CHD3 function changes from opening or closing chromatin is very intriguing a could be discussed. To our knowledge, nothing is known on this. CHD3 is significantly understudied.

      OPTIONAL: While this is not necessary for the current study, it is very intriguing that other CHD family member do not compensate. How this tissue or DNA sequence activity is achieved could be discussed. What are CHD4 or CHD5 expressed during CNCC differentiation? Could they be used to rescue the CHD3 KO phenotype? While this may be difficult to test, it could perhaps be discussed.

      We have added a paragraph on this in the discussion.

      3. Description of analyses that authors prefer not to carry out* *

      From Reviewer 1: Given the changes in the CHD3-KO accessibility are mostly gene distal, are there existing Hi-C/microC/promoter CaptureC or other that can be used to ask if these are interacting with the predicted genes?

      We are not aware of this type of essays being performed genome-wide in human CNCCs. The only studies performed in human CNCCs are SOX9-centred. Looking at 3D chromatin conformation would also be out of the scope of the paper.

      From Reviewer-2:

      OPTIONAL: Does increasing BMP concentration early during CHD3 KO differentiation has a better effect at rescuing CNCC differentiation?

      Indicated by Reviewer as OPTIONAL. We do not think that adding BMP earlier on would make a significant difference in rescuing CNCC differentiation.

      From Reviewer-1: Are the results observed NuRD-based or CHD3 NuRD independent functions? Looking at other NuRD subunit binding or effects in differentiation would help to dig into this a bit more. I realize this is a bit of a big ask, so I am not asking for everything. Are there existing binding data in CNCCs for a NuRD subunit that could be examined for overlap in where these changes occur, for example? I want to be clear I am not asking the authors to do all the experiments for an alternative NuRD subunit.

      There are no existing data on NuRD binding in CNCCs. However, while the Reviewer is definitely not recommending generating new data in this regard, we still decided to make an attempt at performing ChIP-seq for the core NuRD subunit MBD3 in our CNCC. We will only make one attempt (multiple replicates), and if it does not work we will not pursue this any further as the Reviewer clearly stated that this is not necessary nor required and we do not want to delay the resubmission.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, Mitchell et al. study the function of CHD3 - a member of the NuRD chromatin remodeling complex - during human cranial neural crest cells (CNCC) differentiation in vitro. To this end they use iPSC CHD3-KO lines. They first observed this deletion has no impact on pluripotency levels of mutant iPSC neither on their capacity to differentiate into the three germ layers derivatives. Yet, they showed these mutant cells are unable to form CNCC as they fail to induce EMT genes and undergo CNCC differentiation. Using ATACseq, the authors then showed CHD3 KO present a reorganization of the chromatin accessible landscape, biasing these cells from a CNCC to a mesoderm fate. They further determine that upon differentiation of CHD3 KO cells, BMP responsive regulatory elements are aberrantly closed, making the cells insensitive to the signaling, explaining how they then fail to generate CNCC. Using ChIP-seq, they confirmed a direct action of CHD3 in making these elements accessible as it normally binds to these chromatin regions to allow proper differentiation. In addition, they demonstrate these BMP responsive genes are bound by DLX5, a transcription factor essential for neural crest development. Finally, the authors showed that during CNCC differentiation, CHD3 KO cells experience an imbalance between BMP and WNT signaling, leading to these cells adopting a mesoderm instead of a CNCC identity. They finally, showed this can be partially rescued by reducing the amount of Wnt signaling - that decreases the mesoderm gene expression - however, it not sufficient to induce a neural crest identity.

      Major comments

      1. What is the expression level of CHD3 in the heterozygote line? Does the remaining allele compensate for the loss which will explain the absence of phenotype?
      2. Since CHD3 shows a progressive upregulation in expression during CNCC differentiation (Fig. 2E), one hypothesis can be that it is not necessary involved in the activation of the CNCC programs but instead it is involved in maintaining these programs active - by keeping regulatory elements accessible. Thus, authors should check expression of CNCC markers, and EMT genes at the same time point than Fig. 2E in both WT and KO cells.
      3. The authors should use the term "regulatory elements" instead of "enhancers" as they can act either as activator or repressors.
      4. On the same line, while the authors indicate "Motif analysis of the enhancers aberrantly active in CHD3-KO cells ", they haven't shown these are active. They should say they perform the analysis on regulatory elements aberrantly accessible in CHD3 KO.
      5. The rationale that led the authors to focus on genes typically expressed in the primitive streak and in the early pre-migratory mesoderm, and BMP responsive transcription factors could be better explained. Are they part of the most deregulated genes in the RNAseq analysis?
      6. In the absence of CHD3, BMP response is not effective. While the authors nicely showed this is linked with changes in chromatin accessibility, it is necessary to check the expression levels of BMP receptors in CHD3 KO cells.
      7. Aberrant early mesoderm signature of the CHD3-KO cells needs to be better shown. It is not obvious from the GO analysis in Fig. 2 and the authors then showed expression of some markers but it is unclear how they picked them up.
      8. It has been shown that CNCC regulatory elements controlling differentiation genes are primed/accessible prior migration (PMID: 31792380; PMID: 33542111). Since the authors claim "CHD3 may have the role of priming the developing CNCCs to respond to BMP by opening the chromatin at the BMP responsive enhancers", it will be good to perform ATACseq are several time point during the differentiation process to assess the dynamic of chromatin reorganization to see when the switch to mesoderm fate occurs and how accessibility of BMP responsive element changes in WT and KO cells during CNCC differentiation to be able to demonstrate the KO fail to make BMP responsive element accessible or whether it is a defect in the maintenance of this accessibility.
      9. The authors claim CHD3 directly binds at BMP responsive enhancers, but in the figure, they show the data for all the region gaining or losing activity. It will be nice to add the information for the BMP responsive elements only.
      10. Motifs enrichment analysis of regions gaining accessibility in CHD3 KO do not seems to be labeled as Wnt responsive elements. The authors need to support better that CHD3 KO express more Wnt signaling/activity.
      11. OPTIONAL: Does increasing BMP concentration early during CHD3 KO differentiation has a better effect at rescuing CNCC differentiation?
      12. OPTIONAL: While this is not necessary for the current study, it is very intriguing that other CHD family member do not compensate. How this tissue or DNA sequence activity is achieved could be discussed. What are CHD4 or CHD5 expressed during CNCC differentiation? Could they be used to rescue the CHD3 KO phenotype? While this may be difficult to test, it could perhaps be discussed.

      Minor comments

      1. In the discussion, the authors could indicate whether CHD3 mutants somehow phenocopies some of the craniofacial defects observed in DLX5 mutant patients.
      2. It is not indicated were to find the data regarding expression epithelial and mesenchymal genes in the CHD3-KO cells.
      3. Authors could add in the discussion what is known about how CHD3 function changes from opening or closing chromatin is very intriguing a could be discussed.

      Significance

      General assessment:

      The link between chromatin remodelers and craniofacial defects has been shown in several studies in the past, but it still remains unclear how mutation of a given factor leads to such tissue specific defects. This manuscript represents an interesting and detailed mechanistic study on the role of chromatin remodeler in cell fate decision, showing that reorganization of chromatin accessibility is essential to proper response to signaling pathway and cell differentiation.

      Advance:

      The authors manage to link how mutant-induced changes in chromatin accessibility biased the cells towards a mesoderm fate as they directly impact the capacity of the cells to respond to BMP signaling - these regions closing upon CHD3 loss. However, the question remains to know whether CHD3 acts as an initiating factor or instead in involved in maintaining these programs active. As noted by the authors, a clinical link (with patient-derived iPCS) would be of great interest but as it stands the story already provide a good mechanistic understanding on how CHD3 control CNCC differentiation.

      Audience:

      This manuscript will be of great interest for specialized audience, yet a broader public may find it interesting too.

      Reviewer field of expertise:

      Neural crest and craniofacial development, epigenetics, transcriptomics

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Mitchell et al examine the impact of CHD3 KO (or het) on iPSC differentiation to CNCCs to model how pathogenic CHD3 mutations promote a neurodevelopmental disorder. The authors perform genomic characterization of the KO and het mutants during this differentiation model, and identify loss of CHD3 results in BMP target gene repression and increased mesoderm fate. Finally, the authors attempt to rescue the observed defects by titrating Wnt levels and observe partial rescue. I think the paper is easy to follow, contains interesting data, and establishes a novel role for CHD3 in CNCC differentiation, which may have implications in the disorder highlighted. I have the following suggestions:

      1. Figure 1 presents nice confirmation of the CHD3 KO cell lines being used. However, given that these cell lines were previously published, I suggest moving these data to the supplement.
      2. In the results section for Figure 1, the authors discuss the CHD3 heterozygotes, but I only see the KO cell line data presented. It would be especially nice to see the protein levels of Chd3 in the het.
      3. The authors discuss which genes are up and downregulated in the Chd3 KO D18 RNAseq, and show a clear heatmap in Figure 2A for WT cells. The same heatmap for candidate genes discussed in the results would be appreciated for Chd3 KO. Only a subset of markers are shown in Fig 2C.
      4. In general 2-3 replicates are presented. While the authors are showing heatmaps for selected locations for individual clones, which is appreciated (ex: Figure 4B and Fig 6), the QC for data quality is missing. For example, show spearmean correlation across the genome for datasets as a supplement.
      5. In the section discussing the results presented in Figure 4, the authors discuss the ATAC-seq peak number changes and overlap with gene expression changes. However, the overlap with gene expression changes is not shown. Making a simple venn diagram would help readers. a. In addition, showing a heatmap for unchanged ATACseq peaks can help to demonstrate the increase/decrease.
      6. In Figure 6, the authors present ChIPseq data for CHD3 in D14 and D18 samples, focusing on locations losing or gaining accessibility. What is enrichment at unchanged sites? Is CHD3 specifically enriched at changed locations? Then what about over genes with altered gene expression vs not changed? Is CHD3 only bound to distal elements? Performing an analysis of the peak distribution, perhaps with ChromHMM or other methods to look at promoter vs enhancer vs other locations. These types of analyses could really enrich the interpretation of direct CHD3 function.
      7. Given the changes in the CHD3 KO accessibility are mostly gene distal, are there existing Hi-C/microC/promoter CaptureC or other that can be used to ask if these are interacting with the predicted genes?
      8. Are the results observed NuRD-based or CHD3 NuRD independent functions? Looking at other NuRD subunit binding or effects in differentiation would help to dig into this a bit more. I realize this is a bit of a big ask, so I am not asking for everything. Are there existing binding data in CNCCs for a NuRD subunit that could be examined for overlap in where these changes occur, for example? I want to be clear I am not asking the authors to do all the experiments for an alternative NuRD subunit.
      9. The authors observe defects in CNCCs through genomic experiments. It would be really nice to perform simple wound healing/scratch assays and/or transwell assays to test if the CNCC migration phenotype is reduced in the CHD3 KO as well which would support the transcriptomic data.
      10. Related to the above, I am not sure if there is a phenotypic test for enhanced mesoderm. I suspect only IF/expression and morphology are possible, which the authors did. However, sorting the cells (with some defined markers) to ask how many are mesoderm-like vs CNCC in WT vs CHD3 KO would give some information outside of the bulk expression data.
      11. I did not see a reviewer token for the GEO data, so I could not check the deposited datasets.

      Minor points

      1. 1A seems to fit better with Figure 2.
      2. The authors say that the KO cell lines are not defective in pluripotency, but Figures 1G suggests a slight decrease in SSEA-1. Is this reproducibly observed?
      3. Would be nice to show number of up and downregulated genes in volcano plots for fast viewing of readers (ex: Fig 2B).
      4. Is it fair to use violin plots when data points are only 2-3 replicates (as in Figures 2C, 3D)
      5. The labels in Fig 4A and 5E are very hard to read.
      6. For browser tracks, the authors show very zoomed in examples (Fig 4C, and especially Fig 6C). showing a bit more of the area around these peaks would give readers a more clear appreciation of the data.
      7. Related to browser tracks, including more information just as including the gene expression changes (such as in Fig 6C) to enhance the interpretation of the impact of Chd3 binding, accessibility change and then, I presume, reduced Sox9 expression. Similar suggestion for Figure 4C, where I anticipate coordinate transcription changes of the associated genes.
      8. Do the authors observe any clone variability between the two CHD3 KO clones? There is variability I see in some of the heatmaps, but don't know if that it is because of clones or technical variation.

      Referees cross-commenting

      I think that the other reviewer and I are inline with each other in terms of our reviews and thoughts on the manuscript, so I do not have anything to add.

      Significance

      The paper presented by Mitchell et al represents a new role for CHD3 in regulating CNCC differentiation and perhaps explains why CHD3 mutations exist in neurodevelopmental disorders such as Snijders Blok-Campeau Syndrome. Limitations are the reliance on genomic datasets and modeled differentiation, although this permits for more mechanistic studies.

      I believe the fields of neural development, stem cell, chromatin biology, and others will be interested in this manuscript.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We sincerely thank all three reviewers for their professional and constructive feedback. We appreciate the thorough evaluation of our manuscript and are committed to revising both the manuscript and supplemental materials based on the suggestions. We have carefully considered each comment and have addressed most of them in the initial revised version, which has been transferred. Additionally, we are currently conducting new experiments to provide the requested data to address a few comments. We are confident that these revision experiments will be completed in a couple of months or so, which will significantly enhance the quality of our study.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the manuscript by Agarwal and Ghosh, the authors examine yeast Scm3 function in the DNA damage response. They show that Scm3 loss results in DNA damage sensitivity and more Rad52 foci. Importantly, Scm3 is recruited to DSB sites using an HO-cut site and its loss results in an attenuated DNA damage checkpoint as measured by Rad53 phosphorylation. The authors demonstrate convincingly that Scm3, like its human counterpart HJURP, plays a role in the DNA damage response through altered Rad53 activation. However, what its specific role in DNA repair is remains ambiguous.

      Major comments:

      1. It is unusual to see multiple DNA repair foci as those observed in Figure 2B. What is the distribution of cells with 1, 2, 3, 4, or more foci? Are more observed in SCM3-AID cells perhaps suggesting that the DSB ends are not being clustered as would be expected in WT cells exposed to DNA damage?

      Response: As per the reviewer’s comment, we have included a graph (Figure R2) showing the distribution of cells with 1, 2, 3, 4, or more Rad52-GFP foci when they are treated with MMS. There are more cells with 4 or more foci when Scm3 is depleted (SCM3-AID + Auxin) compared to the wild type (SCM3-AID). The average number of Rad52-GFP foci per cell presented in Figure 2B (2.8 in the mutant vs. 1.9 in the wild type) is well in accord with the previous report (Conde and San-Segundo, 2008), where the same was reported as ~2.5 in the cells lacking a methyl transferase Dot1, vs ~ 1.5 in the wild type. More Rad52-GFP foci in MMS-treated cells lacking Scm3 may arise due to the creation of too many damaged sites to be accommodated in 1-2 foci and/or due to the inability of the cells to cluster the DSB ends.

      This result has been incorporated as a new supplementary Figure S4C and new text has been added in the revised manuscript as: “We further quantified the distribution of cells with 1, 2, 3, or >4 Rad52-GFP foci in wild type (SCM3-AID) or Scm3 depleted (SCM3-AID + auxin) cells treated with MMS. Scm3 depleted cells showed a significantly higher number of cells with more than >4 Rad52-GFP foci, suggesting the possibility of the creation of too many damaged sites to be accommodated in 1-2 foci or the inability of such cells to cluster the DSB ends.” in page 7, lines: 237-241.

      2. The peaks with increased Scm3 recruitment by ChIP-seq upon MMS is confusing as MMS does not induce specific damage at genomic locations. Is Scm3 being recruited at other genomic sites that might be more susceptible to DNA damage? Is Scm3 recruited to Pol2 sites for example? Or fragile sites?

      __Response: __We believe that the MMS induced increase in association of Scm3 with the non-centromeric chromatin loci depends on MMS sensitive vulnerable chromosomal sites. We agree with the reviewer that MMS might cause DNA damage at these sites, leading to Scm3 occupancy at those sites. Therefore, we compared the sites of Scm3 occupancy with possible such sites available from the literature that include fragile sites, RNA Pol II binding sites, double strand break hotspots, and coldspots. Based on our analysis, we have included the following lines in the ‘discussion’ section in page 16-17, lines 566-594 as follows:

      “Moreover, an overall increase in the chromatin association of Scm3 in response to MMS also suggests that Scm3 might be recruited to several repair centers or sites that are susceptible to DNA damage, for example, the fragile sites (Figure 3B, C, E, S6). These sites in yeast are DNA regions prone to breakage under replication stress, often corresponding to replication-slow zones (RSZs) (Lemoine et al., 2005). These regions include replication termination (TER) sequences, tRNA genes, long-terminal repeats (LTRs), highly transcribed genes, inverted repeats/palindromes, centromeres, autonomously replicating sequences (ARS), telomeres, and rDNA (Song et al., 2014). Since the helicase Rrm3 is often associated with these fragile regions (Song et al., 2014), we compared Scm3 binding sites with the top 25 Rrm3 binding sites from the literature (Azvolinsky et al., 2009). In untreated cells, Scm3 sites overlapped with three Rrm3 sites on chromosomes X, XII, and XIV. Whereas in MMS treated cells, overlapping was found with four Rrm3 sites, with two (on chromosomes XII and XIV) shared with untreated cells and two new sites were observed on chromosomes II and XII (Table R1). Mapping of the Scm3 sites with the tRNA genes and LTRs revealed that these sites from the untreated cells did not overlap with the LTRs (Raveendranathan et al., 2006). However, the same from the treated cells showed overlap with two LTRs on chromosome XVI. No overlap with tRNA genes was observed in the treated cells (Table R1). We next examined Scm3 occupancy at 71 TERs documented in the literature (Fachinetti et al., 2010). Scm3 was found to bind to 6 TERs in both untreated and MMS-treated cells. Notably, MMS treatment resulted in three new peaks, while three peaks were shared with untreated samples (Table R1). Lastly, we compared Scm3 sites with top 25 RNA Pol II sites obtained from the literature (Azvolinsky et al., 2009). In untreated cells, Scm3 was found at only one of these Pol II sites, whereas after MMS treatment, Scm3 sites overlapped with four such sites (Table R1). We further checked the occupancy of Scm3 at a few DSB hotspots (BUD23, ECM3, and CCT6) and DSB coldspot (YCR093W) as mentioned in the literature (Dash et al., 2024; Nandanan et al., 2021). However, we did not find Scm3 binding to these sites. Overall, in-silico analysis of the binding sites indicates that the non-centromeric enrichment of Scm3 occurs at sites that are amenable to DNA damage.”

      Table R1: The table summarising the occupancy of Scm3 in untreated or MMS treated conditions at the indicated regions

      Region

      Chromosome

      Scm3 occupancy

      Untreated

      MMS treated

      Rrm3 binding sites

      Chr II

      YES

      Chr X

      YES

      Chr XII

      YES

      Chr XII

      YES

      YES

      Chr XIV

      YES

      YES

      LTRs

      Chr XVI

      YES

      Chr XVI

      YES

      tRNA

      Chr XV

      YES

      TERs

      Chr IV

      YES

      Chr V

      YES

      Chr VI

      YES

      YES

      Chr VII

      YES

      Chr X

      YES

      Chr X

      YES

      Chr XIV

      YES

      YES

      Chr XV

      YES

      YES

      Chr XVI

      YES

      Pol II binding sites

      Chr II

      YES

      Chr X

      YES

      Chr XII

      YES

      Chr XII

      YES

      Chr XV

      YES

      The Table R1 has been incorporated as Table S1 in the revised manuscript.

      3. The phosphorylation aspect of Scm3 is intriguing and the authors show that Mec1 is not responsible for mediating its phosphorylation. Tel1 is another kinase that should be examined.

      __Response: __We thank the reviewer for the suggestion. We are in the process of examining the role of Tel1 kinase on Scm3 phosphorylation. The results from the experiment will be incorporated in the manuscript.

      Minor comment: 1. It is hard to see what MMS resistance the authors state is observed in Mif2-depleted cells in Figure S3. Perhaps this could be better explained or the claim removed.

      __Response: __We agree with the comment and have removed the claim from the manuscript.

      2. Protein levels of Scm3 or any of the other factors depleted with AID were never assessed.

      __Response: __We have assessed the protein level of Scm3 and a control protein, tubulin using western blotting as per the reviewer’s suggestion (Figure R3). We did not observe any significant change in the protein levels in SCM3-HA or SCM3-HA-AID cells, suggesting that the AID tagging of Scm3 per se did not make the cells non-functional and the protein was degraded as expected upon addition of auxin. Moreover, the SCM3-AID cells were used previously to examine the effect of Scm3 on kinetochore assembly (Lang et al., 2018).

      This result has been incorporated as Figure S2C, and new text has been added in the revised manuscript as: “The depletion of Scm3 was verified by observing a higher percentage of G2/M arrested cells and by western blot analysis verifying degradation of Scm3-AID after auxin treatment (Figure S2B, C).” in page 5, lines: 150-152.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes a study implicating the Scm3 protein from budding yeast in the DNA damage response (DDR). Scm3 is a chaperone protein, whose main role is considered to be the loading of CENP-A(Cse4) at centromeres to facilitate chromosome segregation. However, the human ortholog of Scm3, HJURP, is known to have a role in DDR and in this study the authors provide evidence that Scm3 is also involved in the DDR in yeast. For the most part, the results presented support the conclusions made.

      Main Points

      1. Figure 1 Could depletion of Scm3 arrest cells in late G2/M and it is this delay that causes increased sensitivity to DNA damaging agents? A control with nocodazole or other means - that also arrests cells at this point - might provide a nice control for this. Perhaps the other kinetochore mutants, used therein, achieve this control - but cell cycle phase would need to be assessed.

      __Response: __ We thank the reviewer for pointing out to a probable effect of the cell cycle stage on the observed MMS sensitivity. In fact, we were also concerned that the observed DNA damage sensitivity in Scm3 depleted cells might be due to G2/M arrest. To rule out this possibility, we monitored Rad52-GFP foci as a marker for DNA damage in the wild type and Scm3 depleted cells both arrested at G2/M using nocodazole (Figure S4). While Scm3 depleted condition exhibited >20% Rad52-GFP positive cells, less than 10% wild type cells showed the same in the absence of any DNA damaging agents (Figure S4E, no MMS, 60 mins). Upon challenging these cells with MMS in the presence of nocodazole, Scm3 depleted condition exhibited over 40% Rad52-GFP positive cells, whereas less than 20% wild-type cells harboured Rad52-GFP. This significant increase in Rad52-GFP positive cells when Scm3 is depleted clearly indicates that the observed MMS sensitivity in these cells is due to the absence of Scm3 rather than due to an effect of a cell cycle stage. Furthermore, we have also used Cdc20 depleted G2/M arrested cells as a wild type control to test the activation of the DNA damage checkpoint by Rad53 phosphorylation. These cells showed robust Rad53 activation in response to MMS, in contrast to poor activation in Scm3 depleted cells (Figure 6), suggesting that G2/M arrest is not the reason for the DNA damage sensitivity observed in the latter cells.

      However, as per the reviewer's suggestion, we examined the MMS sensitivity of the wild type cells arrested at G2/M by nocodazole. As expected, these cells did not show increased sensitivity which further confirms that the DNA damage sensitivity observed in the scm3 mutant is not due to G2/M arrest (Figure R4B). This result has been incorporated within Figure S3, replacing the earlier Figure S3.

      To include this result, we have included new text, and revised the result section in page 5-6, lines 160-181 as follows:

      “The increased sensitivity of scm3-depleted cells to DNA-damaging agents could be due to the weakening of the kinetochores as Scm3-mediated deposition of Cse4 promotes kinetochore assembly or due to the delay in cell cycle, as Scm3 depleted cells arrest in late G2/M phase (Camahort et al., 2007; Cho and Harrison, 2011). If either of these holds true, perturbation of the kinetochore by degradation of other kinetochore proteins or wild type cells arrested at metaphase must show a similar sensitivity to MMS. In budding yeast, Ndc10 is recruited to the centromeres upstream of Scm3 (Lang et al., 2018), whereas the centromeric localization of Mif2, another essential inner kinetochore protein, depends on Scm3 and Cse4 (Xiao et al., 2017). We constructed NDC10-AID and MIF2-AID strains and used them for our assay to represent the proteins independent or dependent on Scm3 for centromeric localization, respectively. We also included one non-essential kinetochore protein, Ctf19, a protein of the COMA complex, to remove any possible mis-judgement in distinguishing cell-growth-arrest phenotype occurring due to drug-sensitivity vs. auxin-mediated degradation of essential proteins. The COMA complex is directly recruited to the centromeres through interaction with the N terminal tail of Cse4, hence dependent on Scm3 (Chen et al., 2000; Fischböck-Halwachs et al., 2019). Mid-log phase cells were harvested and spotted on the indicated plates, however, we did not observe any increased sensitivity of such cells to MMS (Figure S3). Further, wild type cells, when challenged in the presence of nocodazole and MMS, also did not show any increased sensitivity to MMS. Therefore, the increased sensitivity to MMS in scm3 mutant but not in other kinetochore mutant or metaphase arrested cells indicates that Scm3 possesses an additional function in genome stability besides its role in kinetochore assembly.”

      Further we have also revised the discussion section to include the observed results in page 15, lines 502-510 as follows:

      “However, since the primary function of Scm3 is to promote kinetochore formation by depositing Cse4 at the centromeres, it is important to address if the observed sensitivity is due to perturbation in kinetochores or due to cell cycle delay imposed in the absence of Scm3. Therefore, we similarly partially depleted two essential kinetochore proteins, Ndc10 and Mif2, and deleted one non-essential kinetochore protein, Ctf19, in separate cells and also challenged wild type cells to metaphase block but failed to detect any increased sensitivity to DNA damage stress (Figure S3). These results indicate that the drug sensitivity phenotype of Scm3 depleted cells is not due to weakly formed kinetochores or cell cycle delay.”

      2. Mutants of the HR pathway in yeast (e.g. rad52∆ with mre11∆ for example) are typically epistatic. The observation that Scm3 depletion is not epistatic with rad52∆ (Figure 1C) suggests the Scm3 acts via another pathway than the classic Rad52 HR pathway. This should be pointed out and discussed.

      __Response: __We have now included the discussion “In yeast, although HR is the preferred repair pathway, in the case of perturbed HR, an alternate pathway named non-homologous end joining (NHEJ) can occur. The absence of epistatic interaction between SCM3 and RAD52 (Figure 1C) suggests that Scm3 may function in ways other than the Rad52-mediated classical HR pathway. In this context, it would be interesting to test how Scm3 might interact with the key proteins of the NHEJ pathway, such as Ku70/Ku80 and Lig4 (Gao et al., 2016). It is possible that Scm3 may promote a certain chromatin architecture facilitating the DSB ends to stay together to be accessible for NHEJ-mediated end joining.” in page 16, lines 541-548.

      3. Figure 2 should include auxin treatment of RAD52-GFP cells (without the Scm3 degron) to show that the auxin treatment alone does not increase Rad52 foci.

      __Response: __ We performed the suggested experiment and did not observe any significant increase in Rad52-GFP positive cells when treated the cells with auxin+DMSO as compared to only DMSO (Figure R5).

      This result has been incorporated as a new supplementary Figure S4A,B and new text has been added in the revised manuscript as “To rule out the possibility that auxin treatment alone can cause increased Rad52-GFP foci formation, we challenged the wild type (RAD52-GFP) cells with auxin or DMSO and counted the number of cells with Rad52-GFP foci. We did not observe any increase in Rad52-GFP positive cells when treated with auxin+DMSO as compared to only DMSO (Figure S4A, B).” in page 7, lines: 233-236.

      4. Line 246-247 For the data presented, it seems to me possible that Scm3 depleted cells may indicate a defective DDR pathway (as stated) or may indicate defects in DNA replication or an increase in some other form of DNA damage?

      __Response: __We agree with the reviewer’s comment that the depletion of Scm3 can cause replication error or other form of DNA damage in addition to the defect in DDR pathway. To include this, we have modified the sentence as “Taken together, Scm3 depleted cells exhibit more Rad52 foci, indicating a compromised DDR pathway in these cells. Although, defects in DNA replication or creation of other DNA lesions producing more foci also cannot be ruled out.” in page 8, lines 255-257.

      5. In Figure 1 and throughout, please describe in the figure legends how error bars and p values are derived, and the number of experiments involved.

      __Response: __We have now verified all the figure legends and described how error bars and p values are derived and have mentioned the number of experiments involved.

      Minor points Line 35 replace 'cell survival' with 'cell division' - non-dividing cells can survive fine without chromosome segregation. See also line 62.

      __Response: __We have now changed ‘cell survival’ with ‘cell division’ in lines 35 and 62.

      Line 52 and throughout, I suggest replacing CenH3 with CENP-A or Cse4. The term CenH3 is confusing since regional centromeres contain both CENP-A nucleosomes and H3 nucleosomes - the latter of which can also be called CenH3 nucleosomes.

      __Response: __We have replaced CenH3 with CENP-A or Cse4 at the appropriate locations.

      Lines 69-79 specific references are needed for the sentences starting "HJURP was so named...", "In addition,...", "As a corollary,..." and "Finally,..." The final sentence of this paragraph, starting "Perhaps due to..." is unclear.

      __Response: __We have included the reference as mentioned by the reviewer. Also, we have changed the last line as “Notably, HJURP has been visualized to be diffusely present throughout the nucleus (Dunleavy et al., 2009; Kato et al., 2007), which may be due to its global chromatin binding and involvement in DDR.” in page 3, lines 77-79.

      Line 96 "gross chromatin" is unclear; also line 476.

      __Response: __We have changed gross chromatin to “bulk of the chromatin.” and incorporated it into the main text.

      Line 103 "dimerize"

      __Response: __We have replaced ‘dimerizes’ with ‘dimerize’

      Line 109 "most" and "highly" don't work together - perhaps better to say "the functions appear conserved from humans to yeast".

      __Response: __We have changed the wording as the reviewer suggested.

      Line 175 "grown" to "phase", see also line 223.

      __Response: __We have changed the wording as the reviewer suggested.

      Line 293 delete "besides"

      __Response: __We have deleted the word ‘besides’.

      Figure 5 - panels C & D, please make x axis labels clearer - they are directly underneath the 2kb ChIP. They should include a horizontal bar to indicate that all 5 ChIP experiments are included in each time point.

      __Response: __We have now included a horizontal bar in both Figure 5 and the corresponding supplementary Figure S8, to better represent the ChIP experiments. We thank the reviewer for pointing this out.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary This manuscript studies the role of the Cse4 histone chaperone Scm3 in the S. cerevisiae DNA damage response. The authors show that decreased Scm3 levels exhibit genetic interactions with mutations in the Rad52 gene and sensitivity to MMS. They go on to show that some Scm3 co-localizes with sites of DNA damage using spreads and ChIP-seq techniques and that decreased levels of Scm3 have a reduced DNA damage checkpoint response. The Scm3 protein is also phosphorylated in response to DNA damage. Taken together, the authors propose a model whereby DNA damage recruits the Scm3 protein and Scm3 then helps mediate the checkpoint response. Overall, the data make a case that Scm3 has a relationship to the DNA damage checkpoint but the authors should be careful not to over-conclude that it has a precise role in checkpoint activation based on the data.

      Major Comments 1. The western blots in the paper are not always entirely convincing. In addition, they are not described in enough detail to understand if a membrane was cut or if multiple gels were run. For example, the tubulin loading in Figure 6D is interrupted toward the end of the blot and the bands in Figure 7D go in different directions for the two blots for the MMS treated cells. In figure 6B, there are no detectable phospho-forms of Rad53 detected on the upper blot for the WT and scm3 lanes even though quantification is given on the right. It would be good to present better examples of the westerns or at least better describe what the reader is visualizing so the quantification and conclusions can be understood. How were the blots quantified? How were the westerns run and processed?

      Response: We have now included a separate paragraph in materials and methods regarding the gel run, processing and quantification of the western blots in the revised manuscript for better understanding of the readers:

      “To detect Scm3-6HA, Rad53, and g-H2A, the total proteins isolated from the appropriate cells were run on 12%, 8%, and 15% SDS gels, respectively. The proteins were transferred to the membranes, which were cut to detect the above proteins and the control protein tubulin separately. For the quantification of the bands on the western blots, a region of interest (ROI) was made around the band of interest, and the intensity of the band was calculated using ImageJ. A same ROI from a no-band area of the blot was used to calculate the background intensity. The background intensity was subtracted from the band intensity. The same process was done for the tubulin bands. The intensity of the target bands (Scm3-6HA, Rad53, and g-H2A) was divided by control tubulin band intensity to get the normalized values for the target bands, which were plotted using GraphPad Prism 9.0 (Version 9.4.1) software.” This has been added in page 25, lines: 881-890.

      Furthermore, we will again perform the experiments for a better representation of the western blots in figures 6B, D, and 7D.

      2. The argument that scm3 depletion leads to a defect in DNA damage checkpoint activation is not strongly supported. Monitoring exit from the cell cycle by multibudding is not the most rigorous assay, especially since the image shows one cell with 5 nuclei. The authors should release cells from G1 into auxin and MMS and monitor cell cycle progression at least one other cell cycle marker, such as anaphase onset, DNA replication and/or Pds1 levels.

      Response: As per the reviewer’s suggestion, in order to support our argument that the absence of Scm3 causes a defect in DNA damage checkpoint activation, we will examine if these cells abrogate G2/M arrest and show an early anaphase onset. For this, we will monitor the levels of Pds1, as a marker of anaphase onset, along the cell cycle in wild type and Scm3-depleted cells both deleted for Mad2 to remove any inadvertent effect of spindle assembly checkpoint. The schematics of the experimental workflow is given in Figure R1. Typically, the cells will be released from alpha factor arrest in the absence or presence of auxin (for the depletion of Scm3) and in the absence or presence of MMS. The samples will be harvested at the indicated time points and will be analyzed for:

      1. Western blot: Pds1-Myc (to detect anaphase onset)
      2. Western blot: Rad53 and p-Rad53 (to detect DNA damage activation)
      3. Immunofluorescence: Tubulin (to detect cell cycle stages) The results of the above experiment will be incorporated in the revised manuscript.

      3. The quantification in Figure 3B is not clear. Is it done on a per/nuclei basis? What pools of Scm3 and Ndc10 are being normalized?

      Response: The intensity was calculated as done before (Mittal et al., 2020, Shah et al., 2023). Typically, the intensity was first measured from the total signal of Scm3/Ndc10 from each chromatin mass or spread (DAPI) by making a polygon (ROI) around the Scm3/Ndc10+DAPI signal. The same ROI was dragged to the background area, from where two separate intensities were calculated. The average of the background intensities was then subtracted from the Scm3/Ndc10 intensity obtained from the same spread to get the normalized intensity depicting each dot in the box plot of Figure 3B. At least 30 spreads were quantified in a similar manner.

      We have mentioned this in the materials and methods section under “Microscopic image analysis.” section in page 22, lines 770-777 as follows: “For intensity calculation, a Region of Interest (ROI) was drawn around the Scm3/Ndc10/g-H2A+DAPI signal, and the intensity of Scm3/Ndc10/g-H2A was measured from each chromatin mass or spread (DAPI). An ROI of the same size was put elsewhere in the background area, from where two separate intensities were calculated. The average of the background intensities was then subtracted from the Scm3/Ndc10/g-H2A intensity obtained from the same spread to get the normalized intensity depicting each dot in the box plot of the respective figures as mentioned previously (Mittal et al., 2020; Shah et al., 2023).”

      Minor Comments 1. The model is elegant but there are chromatin pools (beyond the kinetochore pool) of Scm3 that do not contain Rad52 and/or gamma-H2X and vice versa. It would be helpful if the authors could speculate on how to reconcile these different pools. It might be premature to suggest such a detailed model at this point since the function of Scm3 in the checkpoint is still very unclear so I would encourage the authors to make a less detailed model.

      Response: By showing the green hallow, we have depicted the nuclear pool of Scm3, and we have not shown that the pool contains DDR proteins viz., Rad52 or g-H2A. Rather, we have shown the recruitment of these proteins at the DNA damage sites. Since the focus of this manuscript is on the non-centromeric functions of Scm3, we have not shown the kinetochore pool of Scm3. Although the model is a detailed one, the contribution from this work has been mentioned legitimately at every stage so that the readers can judge the merit of this work. We believe that a detailed model would provide a better perspective to the readers to correlate the revealed as well as yet-to-reveal functions of Scm3 in a spatiotemporal manner with the other players of the DDR pathway. Therefore, we prefer to keep the model in a detailed form.

      2. The chip-seq data is not publicly accessible. There is no reference to the data being available to review.

      __Response: __The data will be uploaded to the public domain.

      3. Line 103: not clear what "both the proteins dimerize" means...probably should be "both proteins dimerize"

      __Response: __We have changed the wording to “both proteins dimerize”.

      4. The argument that Ndc10 does not have a growth defect on MMS is a weak conclusion given that almost no control cells grow on auxin in the absence of MMS.

      __Response: __We have now repeated the spotting assay with a lesser concentration of auxin and replaced Figure S3 with a new Figure S3 (Figure R4) to better represent and conclude that the loss of Ndc10 does not cause MMS sensitivity.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This manuscript studies the role of the Cse4 histone chaperone Scm3 in the S. cerevisiae DNA damage response. The authors show that decreased Scm3 levels exhibit genetic interactions with mutations in the Rad52 gene and sensitivity to MMS. They go on to show that some Scm3 co-localizes with sites of DNA damage using spreads and ChIP-seq techniques and that decreased levels of Scm3 have a reduced DNA damage checkpoint response. The Scm3 protein is also phosphorylated in response to DNA damage. Taken together, the authors propose a model whereby DNA damage recruits the Scm3 protein and Scm3 then helps mediate the checkpoint response. Overall, the data make a case that Scm3 has a relationship to the DNA damage checkpoint but the authors should be careful not to over-conclude that it has a precise role in checkpoint activation based on the data.

      Major Comments

      1. The western blots in the paper are not always entirely convincing. In addition, they are not described in enough detail to understand if a membrane was cut or if multiple gels were run. For example, the tubulin loading in Figure 6D is interrupted toward the end of the blot and the bands in Figure 7D go in different directions for the two blots for the MMS treated cells. In figure 6B, there are no detectable phospho-forms of Rad53 detected on the upper blot for the WT and scm3 lanes even though quantification is given on the right. It would be good to present better examples of the westerns or at least better describe what the reader is visualizing so the quantification and conclusions can be understood. How were the blots quantified? How were the westerns run and processed?

      2. The argument that scm3 depletion leads to a defect in DNA damage checkpoint activation is not strongly supported. Monitoring exit from the cell cycle by multibudding is not the most rigorous assay, especially since the image shows one cell with 5 nuclei. The authors should release cells from G1 into auxin and MMS and monitor cell cycle progression at least one other cell cycle marker, such as anaphase onset, DNA replication and/or Pds1 levels.

      3. The quantification in Figure 3B is not clear. Is it done on a per/nuclei basis? What pools of Scm3 and Ndc10 are being normalized?

      Minor Comments

      1. The model is elegant but there are chromatin pools (beyond the kinetochore pool) of Scm3 that do not contain Rad52 and/or gamma-H2X and vice versa. It would be helpful if the authors could speculate on how to reconcile these different pools. It might be premature to suggest such a detailed model at this point since the function of Scm3 in the checkpoint is still very unclear so I would encourage the authors to make a less detailed model.

      2. The chip-seq data is not publicly accessible. There is no reference to the data being available to review.

      3. Line 103: not clear what "both the proteins dimerize" means...probably should be "both proteins dimerize"

      4. The argument that Ndc10 does not have a growth defect on MMS is a weak conclusion given that almost no control cells grow on auxin in the absence of MMS.

      Significance

      Significance

      This is the first examination of the role of Scm3 in the DNA damage response in S. cerevisiae. My expertise is in the chromatin and segregation fields, but I believe this work will be of interest to the DNA damage field as well. While the homologs of Scm3 are known to have a role in DNA damage, it was unclear if this was conserved in budding yeast. The data in this manuscript are consistent with findings in other organisms but the precise role of the chaperone in the DNA damage response is still unclear.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes a study implicating the Scm3 protein from budding yeast in the DNA damage response (DDR). Scm3 is a chaperone protein, whose main role is considered to be the loading of CENP-A(Cse4) at centromeres to facilitate chromosome segregation. However, the human ortholog of Scm3, HJURP, is known to have a role in DDR and in this study the authors provide evidence that Scm3 is also involved in the DDR in yeast. For the most part, the results presented support the conclusions made.

      Main Points:

      1. Figure 1 Could depletion of Scm3 arrest cells in late G2/M and it is this delay that causes increased sensitivity to DNA damaging agents? A control with nocodazole or other means - that also arrests cells at this point - might provide a nice control for this. Perhaps the other kinetochore mutants, used therein, achieve this control - but cell cycle phase would need to be assessed.

      2. Mutants of the HR pathway in yeast (e.g. rad52∆ with mre11∆ for example) are typically epistatic. The observation that Scm3 depletion is not epistatic with rad52∆ (Figure 1C) suggests the Scm3 acts via another pathway than the classic Rad52 HR pathway. This should be pointed out and discussed.

      3. Figure 2 should include auxin treatment of RAD52-GFP cells (without the Scm3 degron) to show that the auxin treatment alone does not increase Rad52 foci.

      4. Line 246-247 For the data presented, it seems to me possible that Scm3 depleted cells may indicate a defective DDR pathway (as stated) or may indicate defects in DNA replication or an increase in some other form of DNA damage?

      5. In Figure 1 and throughout, please describe in the figure legends how error bars and p values are derived, and the number of experiments involved.

      Minor points:

      1. Line 35 replace 'cell survival' with 'cell division' - non-dividing cells can survive fine without chromosome segregation. See also line 62.

      2. Line 52 and throughout, I suggest replacing CenH3 with CENP-A or Cse4. The term CenH3 is confusing since regional centromeres contain both CENP-A nucleosomes and H3 nucleosomes - the latter of which can also be called CenH3 nucleosomes.

      3. Lines 69-79 specific references are needed for the sentences starting "HJURP was so named...", "In addition,...", "As a corollary,..." and "Finally,..." The final sentence of this paragraph, starting "Perhaps due to..." is unclear.

      4. Line 96 "gross chromatin" is unclear; also line 476.

      5. Line 103 "dimerize"

      6. Line 109 "most" and "highly" don't work together - perhaps better to say "the functions appear conserved from humans to yeast".

      7. Line 175 "grown" to "phase", see also line 223.

      8. Line 293 delete "besides"

      9. Figure 5 - panels C & D, please make x axis labels clearer - they are directly underneath the 2kb ChIP. They should include a horizontal bar to indicate that all 5 ChIP experiments are included in each time point.

      Significance

      This is a nice complement to the human work on HJURP and provides convincing evidence that Scm3 can be used to model the function of HJURP. Since yeast is such a tractable model, this work provides a route to study the role of this chaperone in DNA damage repair, which may also be true for human HJURP. The work itself is perhaps not too surprising, but is a solid advance in our understanding of the role of Scm3.

      My own expertise is in yeast DNA repair and chromosome segregation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript by Agarwal and Ghosh, the authors examine yeast Scm3 function in the DNA damage response. They show that Scm3 loss results in DNA damage sensitivity and more Rad52 foci. Importantly, Scm3 is recruited to DSB sites using an HO-cut site and its loss results in an attenuated DNA damage checkpoint as measured by Rad53 phosphorylation. The authors demonstrate convincingly that Scm3, like its human counterpart HJURP, plays a role in the DNA damage response through altered Rad53 activation. However, what its specific role in DNA repair is remains ambiguous.

      Major comment:

      1. It is unusual to see multiple DNA repair foci as those observed in Figure 2B. What is the distribution of cells with 1, 2, 3, 4, or more foci? Are more observed in SCM3-AID cells perhaps suggesting that the DSB ends are not being clustered as would be expected in WT cells exposed to DNA damage?

      2. The peaks with increased Scm3 recruitment by ChIP-seq upon MMS is confusing as MMS does not induce specific damage at genomic locations. Is Scm3 being recruited at other genomic sites that might be more susceptible to DNA damage? Is Scm3 recruited to Pol2 sites for example? Or fragile sites?

      3. The phosphorylation aspect of Scm3 is intriguing and the authors show that Mec1 is not responsible for mediating its phosphorylation. Tel1 is another kinase that should be examined.

      Minor comment:

      1. It is hard to see what MMS resistance the authors state is observed in Mif2-depleted cells in Figure S3. Perhaps this could be better explained or the claim removed.

      2. Protein levels of Scm3 or any of the other factors depleted with AID were never assessed.

      Significance

      As mentioned above, a clear link for Scm3 in DNA damage repair has now been established in this work but its function in this process is descriptive.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02825

      Corresponding author(s): Padinjat, Raghu

      Key to revision plan document:

      Black: reviewer comments

      Red: response to reviewer comment-authors

      Blue: specific changes that will be done in a revision-authors

      1. General Statements [optional]

      We thank the reviewers for their detailed comments on our manuscript and appreciating the novelty, quality and thoroughness of the work. Detailed responses to individual queries and revision plans are indicated below.

      2. Description of the planned revisions

      Reviewer 1:

      Summary The study by Sharma et al uses iPSC and neural differentiation in 2D and 3D to investigate how mutation in the OCRL gene affects neural differentiation and neurons. Mutation in the OCRL gene the cause of Lowe Syndrome (LS), a neurodevelopmental disorder. Neural cultures derived from LS patient iPSCs exhibited reduced excitability and increased glial markers expression. Additional data show increased levels of DLK1, cleaved Notch protein, and HES5 indicate upregulated Notch signaling in OCRL mutated neural cells. Treatment of brain organoids with a PIP5K inhibitor restored calcium signalling in neurons. These findings describe new dysregulated phenotypes in neural cultures of OCRL mutated cell shedding light on the underlaying caus of Lowe Syndrome.

      Major comments

      1. In general, I think the use of iNeurons usually means direct reprogramming from a somatic cell to neurons without the iPSC stage. Could be confusing to use this term for iPSC derived neurons. Thank you for pointing this out. We agree and will remove this term and replace it with a more suitable one in the revised manuscript.

      Please add at least one more replicate of WP cell line to the single nuclei RNAseq.

      There is no cell line called WP1 in the manuscript. We believe the reviewer was likely referring to WT1 (wild-type 1).

      10xgenomics guidelines highlight that the statistical power of a multiome experiment relies on several factors including sequencing depth, total number of cells per sample, sample size and number of cells per cell type of interest (10xgenomics). In this study, we performed a multiome experiment and obtained high-quality reads from 20,000 nuclei for each sample for both the modalities: snRNA seq and snATAC seq. The multiome kit recommends a lower limit is 10,000 nuclei per sample. Thus the number of cells sampled per cell line is double the suggested minimum. Therefore, and consistent with other single-cell seq studies already published, our study followed the approach where biological replicates were not included ( for e.g see PMID: 39487141, GSE238206; PMID: 31651061; PMID: 32109367, GSE144477; PMID: 40056913, GSE279894; PMID: 38280846 GSE250386; PMID: 36430334, GSE213798; PMID: 33333020, GSE123722; PMID: 32989314, GSE145122; PMID: 38711218, GSE243015, PMID: 38652563, GSE236197). Furthermore, single-cell RNA-seq inherently treats each individual cell as as a replicate (Satija lab guidelines, PMID: 29567991; Wellcome Sanger Institute), reducing the necessity for additional biological replicates. Overall this appears to be the current standard in the field which we have followed.

      Importantly, we took additional steps to validate the predictions our single-nuclei RNA-seq findings experimentally. For this we used a 3D brain organoid system. We confirmed key observations noted initially in 2D neural stem cells using a brain organoid model. This approach allowed us to confirm key predictions from the single cell sequencing data set. For example, in Lowe Syndrome patient derived organoids and OCRL-KO organoids, we noted increased DLK1 levels (Fig5.C-D, H-I) as well as increased GFAP+ cells and gene expression in brain organoids (Fig.S4E,F). These complementary approaches strengthen our confidence in the biological relevance of our findings from the single nuclei sequencing experiments.

      The WT1 and the patient lines are rarely analysed together with the WT2 and KO lines, thus it is tricky to understand if the KO line is mimicking the patient lines? Please, add more merged analyses. Co-analysing all lines:

      (i)would show if the KO line is more similar to the patient lines or to the WT1 or somewhere in between.

      1. ii) Could answer questions about the variation in phenotypes between the genetic backgrounds. iii) Elucidate how much variability there is between the two WT lines in your assays. If the two WT lines vary much then conclusions about phenotypes in the patients and KO lines might need to be rethought? The reviewer is right is noting that throughout the manuscript we have analysed the patient lines with WT1 and the KO line with WT2. This was a conscious decision which we believe is the correct one for the following reasons:

      It is well recognized and discussed in the literature that genetic background can be a key factor contributing to phenotypes observed in cells differentiated from iPSC (Anderson et al., 2021, PMID: 33861989; Brunner et al., 2023, PMID: 36385170; Hockemeyer and Jaenisch, 2016, PMID: 27152442; Soldner and Jaenisch, 2012, PMID: 30340033; Volpato and Webber, 2020, PMID: 31953356). Therefore, as a matter of abundant precaution, in this study we have tried to use the closest possible genetically matched control lines for analysis.

      The patient lines used in this study for Lowe syndrome were all derived from a family in India of Indian ethnic origin. Therefore, in order to reduce the potential impact of genetic background contributing to potential phenotypes, we have used a control line derived from an individual of Indian ethnic background; this line has previously been developed and published by our group (PMID: 29778976 DOI: 10.1016/j.scr.2018.05.001). By contrast, the OCRLKO line was generated using the control line NCRM5 (WT2); this line is derived from a Caucasian male (RRID: CVCL_1E75). Therefore, whenever we have analyzed OCRLKO, we have used NCRM5 as the control; throughout the manuscript, NCRM5 is referred to as WT2.

      However, in deference to the reviewer’s concerns we have performed a few analyses to compare the extent of variability between the two control lines.

      Figure Legend: Replotted [Ca2+]i transients data from LS patient lines, OCRLKO and two control cell lines WT1 and WT2. (A) There is no statistical difference in the frequency of [Ca2+]i transients between WT 1 and WT2. Test used-Mann Whitney test. (B) Plot with WT1 and WT2 data combined versus all three LS lines and OCRLKO combined. Test used-Mann Whitney test. (C) WT1 and WT2 combined plotted against three individual patient lines and OCRLKO. Statistical test used One-way ANOVA. (total neurons analysed: WT1:808; WT2:267; LSP2:150; LSP3:462; LSP4:463; OCRLKO:411)

      (i) We compared the frequency of calcium transients between neurons of age 30 DIV between WT1 and WT2 (Panel A above). We found no significant difference between these.

      Additionally, as suggest we combined the data from both control lines into a single set and that from all the LSP patient lines and OCRLKO into another one (Panel B above). At the end of the analysis the difference between control and OCRL depleted cells remains. Please note the large number of cells studied in each genotype.

      We also combined both control lines into a single control data set and compared it to each patient line and OCRLKO. We find that each patient line and OCRLKO is still significantly different from the control set (panel C above).

      We did not find that OCRLKO to be significantly different from LSP2 or LSP4, indicating that the OCRLKO line closely aligns with the patient-derived lines, supporting the idea that the observed phenotype is primarily disease-driven rather than background-dependent. However, we did observe a significant difference between LSP3 and OCRLKO, highlighting some degree of inter-patient variability. Therefore, the key point is that the disease phenotype remains stable across different backgrounds, reinforcing the idea that the observed differences are driven by OCRL loss rather than background variability. This will be discussed in the revision.

      (ii) In our RTPCR assay for HES5, when WT1 and WT2 are plotted together, there is no significant difference observed (panel A below). Similarly, western blotting data for cNotch (panel C) and DLK1 (panel B) of pooled WT1 and WT2 together on one plot shows no significant difference (Unpaired t-test, Welch’s correction). Overall, based on the above data, WT1 and WT2 are not statistically different.

      Figure legend: Comparison of control lines WT1 and WT2. (A) comparison of HES5 transcripts. (B) Western blot for DLK1 levels. (C) Western blot for cleaved notch protein levels. Statistical test: Unpaired t-test, Welch’s correction.

      Please include more discussion and rational around the link between the expression pattern of OCRL and the various phenotypes shown. From the RNAseq data performed at the NSC state where the expression of OCRL is lower than in neurons there are considerable differences in cell type distribution between lines. How can this skew cell type distribution affect downstream differentiation and neuronal function?

      We would like to highlight that we did not perform bulk RNAseq in NSC and neurons; rather, we performed snRNA seq in NSCs (Fig3). The data in Fig.1E is mined from a publicly available resource dataset (Sidhaye et.al., 2023, PMID: 36989136) as mentioned in line 155, which is an integrated proteomics and transcriptomics generated from iPSC-derived human brain organoids at different stages of development in-vitro.

      Fig 1D and 1E do indeed show lower levels of OCRL expression in NSC compared to neurons. However, it is important to bear in mind that even though OCRL may be expressed at relatively low levels during the NSC stage, its enzymatic activity could still have a substantial impact. Therefore, even at low expression levels, OCRL could be modulating the PI(4,5)P2 pool in ways that significantly influence cellular functions, especially during early stages of neurodevelopment that alter cell-fate decisions thereby affecting neuronal excitability.

      Our working model posits that loss of OCRL leads to increased levels of PI(4,5)P2 which upregulates Notch pathway thereby leading to an increase in its downstream effector HES5. HES5 is a known transcription factor influencing gliogenesis and thus leading to a precocious glial shift in OCRL deficient NSCs as seen in our multiome dataset. This temporal perturbation in differentiation affects maturation of LS/OCRL-KO neurons and/or astrocytes leading to a defective neuronal excitability.

      Also, OCRL is expressed also at the iPSC state as shown in Figure 1I, do you see any phenotypes in iPSC? If not, explain how that could be.

      Yes, OCRL is indeed expressed in iPSCs as shown in Figure 1I. In an earlier paper from our lab that described the generation of these patient derived iPSC from Lowe syndrome patients (Akhtar et.al 2022 PMID: 35023542), we have reported that PIP2 levels are elevated at the iPSC stage as well as NSC stage in OCRL patient lines. We have not performed a detailed analysis of the iPSC stage for these lines as the focus of our investigation was primarily on the later stages of differentiation, particularly in neural progenitors and differentiated neurons. However, in response to the reviewer’s questions on why there are no obvious phenotypes at the iPSC we would suggest that this is due to compensation from the activity of other genes of the 5-phosphatse family. In support of this, we would cite our previous study (Akhtar et.al 2022 PMID: 35023542), in which we show that in LS patient derived lines, at the iPSC Stage, at least six other 5-phosphatases are upregulated.

      There is not enough data in the manuscript to show mechanistic links between OCRL, DLK1 and Notch so be aware not to overstate the conclusions.

      We appreciate the reviewer’s constructive comment regarding the mechanistic links between OCRL, DLK1, and Notch. Treatment of organoids and neurons with UNC-3230 PIP5K1C inhibitor rescues the observed phenotypes suggesting a role for a PIP2 dependent process, this process itself remains to be identified. We will adjust the wording in the manuscript during the revision to ensure that this comes through and the conclusions do not appear overstated.

      Line 173, please describe what mutation in the OCRL these patients have, is it a biallelic deletion? Is the protein totally absents? Please show western blot analyses of the protein in the patient lines.

      The patients from whom these LS lines were generated, the nature of the OCRL allele in them and the status of OCRL protein have all been previously been described in detail in a paper from our lab. This paper (Akhtar et.al 2022 PMID: 35023542) has been cited in the present manuscript at the very first occasion that the lines are described (Line 174, references 26 and 27). In addition, in the present manuscript, the protein status of OCRL in all the three patient lines is shown with a Western blot in Figure 3C.

      Would be good with a bit of clinical explanation of these patients? Do they have the same level of severity? Are there any differences between their clinical symptoms? This could be interesting to link to differences in cellular phenotypes.

      The clinical details of each patient are described in a preprint from our lab (Pallikonda et.al., 2021 bioRxiv 2021.06.22.449382).The potential reasons for the difference in severity, a very interesting scientific question, is also addressed in this preprint. Currently experimental analysis to support the proposed likely reasons is ongoing in our lab. We feel those analysis are beyond the scope of this manuscript and will be published later this year as a separate study.

      As described in in ref 26 and 27, LSP patients have a mutation in exon 8 leading to a stop codon. We mimicked this by CRISPR based genome editing to introduce a stop codon and protein truncation in exon 8 to generate of WT2 to OCRLKO. This is also described in supplementary Fig 1 of the present manuscript and the technical details of line generation are fully described in the materials and methods.

      Like the patient lines OCRLKO is a protein null allele-this is shown by Western blot in Fig 2D. Also in OCRLKO, the PIP2 levels are elevated (Fig 2E) recapitulating what has been described by us in (Akhtar et.al 2022 PMID: 35023542). We will explicitly state this detail around line 185.

      Figure 1I, could the protein levels at the different stages be quantified?

      Yes, we can and will do it in the revision

      Figure 3A, there seem to be much more cells in LSP2, making it tricky to compare with the other cell lines. Density during differentiation can affect the cell fate. Please, provide images from the different lines that are comparable with similar density.

      We controlled for cell density by seeding equal number of cells 50,000 cells/cm2 for all the genotypes, as mentioned in the material and methods. However, heterogeneity between lines during terminal differentiation is well-established, leading to crowding in some genotypes while not in others. Additionally, different growth rates during terminal differentiation also leads to crowded neural cultures as a function of genotype. Therefore, to complement our immunostaining data, we have provided western blot analyses showing increased GFAP protein levels in LS patient lines compared to controls. We will provide images from different lines that are comparable in density during the revision.

      Please provide quantification to the statement that there is fewer number of S100B cells in the LSP lines.

      As we haven’t quantified the number of S100B cells, we will remove that statement.

      Figure 3B, the images show cells very different, and it is tricky to compare similarities and differences, please provide images that look more similar to each other. Avoid images with clusters of cells or make sure to select representative images with clusters from each cell line. If the clustering is a phenotype explain and quantify that. Make sure the density is similar in all pictures.

      We will provide images of matched density during the revision. Also see response to comment above.

      Line 2018, the statement "In the same cultures, there was no change in the staining pattern of the neuronal markers MAP2 and CTIP2 (Fig 3B)" is not strengthened by the figure. Please provide new pictures or data to prove the statement.

      As CTIP2 staining is inherently observed in either clumps or sparsely distributed regions across WT1 and LSP genotypes, we will replace the CTIP2 marker with TBR1, which is also a deep layer cortical marker (layer VI-V), as shown below. Using this additional marker for neurons, we continue to see no change in staining pattern of neuronal markers MAP2 and TBR1. Corresponding images for each genotype are optically zoomed-in images of individual neurons positive for MAP2 and TBR1. Scale bar=50µm, 20µm.

      Figure 3E, please describe all markers in the picture, thus also MAP2, S100B, CTIP2 and draw conclusions. Try to show comparable pictures.

      This will be attended in the revision

      Fig 3D and G, what are the replicates? please explain.

      Each point represents a single neural induction done on iPSCs to generate NSCs and then terminally differentiated 30DIV cultures. Experiments were done across 3-6 independent neural inductions. This detail will be included in the revised figure legend.

      Figure 4 A, C, there is a large difference in the ratio of different cell types between the different cell lines, also between the LSP2 and LSP3. This would indicate either that the genetic background affects the phenotype to a large extent or that there is large variability between rounds of differentiation. To understand how much variability that comes from the differentiation and culturing: another replicate of WP cell from another donor (WT2) should be included (single nuclei RNAseq). Confirm that three independed rounds of differentiation of the WT1, WT2, LSP2, LSP3, LSP4, and OCRL-KO result in similar outcome when it comes to cell type distribution. Could be done with qPCR marker.

      For scientific reasons explained in response to the reviewer’s comment #2 we feel it is not necessary to perform replicates of the single nucleus multiome seq. However to allay the reviewer’s concern of variability between differentiations leading to a conclusion of altered cell state we present the following three suggestions for a revised manuscript:

      • We will perform multiple differentiations from iPSC to NSC and test the altered cell state using Q-PCR for transcripts of glial lineage markers.
      • Shown below are western blot analyses for WT1, LSP2, LSP3 and LSP4 NSCs (left). Analyses were done from 4 independent rounds of neural inductions and exhibit a significant increase in the levels of a astrocytic fate-determinant marker NF1A in LSP NSCs wrt to WT1 (Mann Whitney test used to measure statistical significance). Each point represents sample from an independent neural differentiation.

      • We would also like to highlight that we have already demonstrated increased GFAP levels in LS patient derived differentiated cultures and OCRLKO. These data, quantified in Fig 3D are done using samples derived from multiple differentiations of iPSC to NSC and then terminally differentiated. Thus the phenotype of enhanced glial cells in LS derived cultures, is most likely a consequence of the increased number of glial precursor cells is seen across multiple differentiations.

      Line 309, "astrocytic transcripts NF1A and GFAP was elevated" It is unclear from this sentence in which cell lines NF1A and GFAP is elevated? Please explain.

      We acknowledge the incompleteness in the statement. We will add the complete statement explaining the graphs. The levels of astrocytic transcripts NF1A and GFAP were elevated in LSP3 and LSP4 compared to WT1.

      Figure 5C, E, G, there is a large variation of Notch and Hes5 expression between the different

      This comment is incomplete.

      Figure 5H, unclear which of the bands that is DLK1 and how the bands relate to the quantification. The band at 50 kDa seems to be stronger in the WT2 than in the OCRL-KO but in the quantification in Figure 5I, it shows 2x more in the KO. Thus, the other way around.

      The datasheet of DLK1 antibody used (Abcam ab21682; RRID_AB731965) describes bands seen at 50,48, 45 and 15kDa. We have quantified the bands at 50kDa and 48-45kDa for all the genotypes. This will be explicitly stated in the revised figure legend.

      Figure 6, please show that the inhibitor is inhibiting PIP5KC.

      Have you titered the added concentration of the inhibitor?

      Figure legend: Fields of view from WT1 derived NSC expressing the plasma membrane PIP2 reporter. Plasma membrane distribution of the probe indicating PIP2 levels is shown in (A) untreated cells (B) treatment with 10mM and (C) 50mM UNC-3230 PIP5K1C inhibitor. Scale bar=50µm (D) Quantification of plasma membrane PIP2 levels using this reporter. Y-axis shows probe levels at PM; X-axis shows treatment conditions.

      Yes, we used a previously generated plasma membrane PH-PLC::mCherry reporter WT1-NSCs (Akhtar et.al., 2021) and carried out a dose-response experiment using 10mM and 50mM of the UNC-3230 PIP5K1C inhibitor as shown above. We quantified intensity of PI(4,5)P2::mCherry at the plasma membrane and plotted the mean intensity. We observed a significant decrease in plasma PI(4,5)P2 levels at 50mM (Statistical used: Mann Whitney test) but not 10mM and therefore we selected that concentration for our experiments.

      Figure 6B, why do the calcium data for the WT2+1Ci look so different to the other, the dots are much more spread and seem to fewer replicates that for the other sample, please explain.

      We had only analysed a few replicates for WT2+1Ci genotype. We analysed the remaining replicates and have updated the data as shown below. The revised data set resolves the reviewer’s concern. The revised data set will be included in the revision.

      Figure 6F, there is no significant differences between the bars but the statement in the text (sentence starts on line 332) indicate it is, please update the figure or remove the statement.

      We added more replicates (now total is 7-10 biological replicates each with 15-20 organoids) and updated the figure (panel B) is shown below. The differences between treated and untreated of OCRLKO are significant whereas there is no significant difference between wild type, treated and untreated (statistical test: Mann Whitney test).

      Revised figure will be included in the revision

      Figure 6G, the HES5 expression seem to behave very similar in both WT2 and OCRL-KO cells when the inhibitor is used. What does this mean? Seems to not be linked to OCRL. Explain.

      Thank you for your comment. In our initial experiment (shown in original version of manuscript), we observed a reduction in HES5 expression upon inhibitor treatment in both WT2 and OCRL-KO cells. However, to ensure robustness of our findings, we repeated the experiment across multiple, additional independent organoid differentiation batches. In this redone experiment, we no longer observe the previous trend. Instead, we see no significant changes in WT2 on inhibitor treatment, while OCRLKO cells show a reduction in HES5 expression upon inhibitor treatment (Panel A). Similarly, the protein levels of cNotch and DLK1 are not different between WT2 and WT2+1Ci (panel B and C). This strongly suggests loss of OCRL leading to elevated levels of PIP2 perturbs Notch pathway, resulting in higher cNotch and thereby increased effector expression of HES5. New data set will be included in the revision.

      Minor comments

      The panels in Figure 6 are not completely referred to correctly in the text, please check. Double check that all figure panels are referred to properly in the text

      Yes, we will correct it in the revised manuscript.

      Reviewer #1 (Significance (Required)): The manuscript is an interesting addition to the in vitro iPSC derived cellular modelling of neurodevelopmental disorder. Strengths: The use of both patient iPSC lines and CRISPR edited lines The use of both monolayer and 3D cultures We thanks the reviewer for their detailed critique. Addressing these has helped improve the manuscript. We thank the reviewer for appreciating the strengths of the manuscript. Weaknesses: the significance decrease a bit due too few replicates (only 1 WT line in each experiment) and the variability between the patients' cell lines. We thank the reviewer for this comment. As explained above we have added substantially more data and revised the analysis which should remove this concern.

      Reviewer 2:

      This paper describes the effects of loss of OCRL (the Lowe syndrome protein) upon the function and differentiation of neurones, using an in vitro iPSC model system. Cells derived from three related Lowe syndrome patients and an OCRL knockout, generated using CRISPR, were used for these experiments. The results show that upon loss of OCRL, differentiation of stem cells into neurones is reduced, with an increased number of cells adopting glial and astrocytic fates. The neurones that are generated have reduced calcium transients and electrical activity. Gene expression data combined with biochemical analysis indicate altered Notch activity, which may account for the altered cell fate data seen in the in vitro differentiation model. Finally, rescue of cell fate and neuronal activity is seen upon knockdown of a PIP5K, which indicates that these phenotypes are due to the elevated PIP2 levels seen on the OCRL-deficient cells.

      The results provide new insights into the pathogenesis of Lowe syndrome. I found the paper to be well done, and the data supports the conclusions of the authors. I have a few comments below that may improve the manuscript:

      We thank the reviewer for summarizing the comprehensive nature of our study and appreciating the value of our study in providing new insights into the pathogenesis of Lowe syndrome with respect to the brain. Thank you for appreciating that our study is well done, and that the data supports the conclusions of the authors.

      Major points

      1. The UMAP and ATAC-Seq data indicate different maps for the two different Lowe syndrome patient-derived cells (Fig 4 and Fig S3). This suggests that the cells are quite different, and therefore that changes seen in one Lowe syndrome patient may not be applicable to the others. I think this heterogeneity has important implications for the paper i.e. how general are findings obtained? Several different glioblast types are described (numbered 1-5)- how different or similar are these? We are unclear what the reviewer means by “ the UMAP and ATAC seq data indicate different maps…….”.

      UMAP is a technique for visually representing data generated by single cell analysis methods be it RNAseq or ATAC seq. Perhaps what the reviewer means is that the UMAP generated from RNA seq and ATAC seq data looks different from each other.

      We would like to reiterate that the UMAP generated from single cell RNA seq data is based on the complement of transcripts in each cell of the analysis compared to an existing single cell RNAseq data set, whereas the UMAP generated from ATACseq is generated from regions of open chromatin detected in and around genes and therefore presumably also reflecting ongoing gene expression. In principle the two analyses for any set of cells should indicate overall clustering into similar groups on UMAPs generated using both data sets, if the ATACseq based read out of transcription largely maps the RNAseq based read out of differences in transcription. However, it may not be reasonable to expect them to be identical. This is indeed what we see for our data set, and this has been represented in Fig 4E. The cell clusters detected based on GEX (gene expression i.e single cell RNA seq) analysis are plotted against the cells clusters detected from ATACseq data using a confusion matrix. As can be seen from this panel (Fig 4E), a very large fraction of cells falls on the diagonal indicated a large degree of similarity between clusters detected by both methods (GEX and ATACseq) of analysis. This can be reiterated more strongly during the revision by strengthening this statement.

      The PIP5K inhibitor seems to have a very strong effect on both WT and KO cells in terms of Notch activity (Fig 5G). This strongly suggests the effects of this inhibitor are not through OCRL and that changes in PIP2 induced by the inhibitor override those of OCRL. Thus, the experiments shown in Fig 5 seem not to be due to a rescue of OCRL activity as such.

      We think reviewer means Fig 6G and our response is as follows:

      In our initial experiment (shown in the current version of manuscript), we observed a reduction in HES5 expression upon inhibitor treatment in both WT2 and OCRLKO cells. However, to ensure robustness of our findings, we repeated the experiment across multiple, additional independent organoid differentiation batches. In this redone experiment, we no longer observe the previous trend. Instead, we see no significant changes in WT2 on inhibitor treatment, while OCRLKO cells show a reduction in HES5 expression upon inhibitor treatment (Panel A). Similarly, the protein levels of cNotch and DLK1 are not different between WT2 and WT2+1Ci (panel B and C). This strongly suggests loss of OCRL leading to elevated levels of PIP2 perturbs Notch pathway, resulting in higher cNotch and thereby increased effector expression of HES5. The figures updated with the new data will be included in the revision.

      Minor points

      1. The main text needs to say what synapsin is and why it was analysed. In Fig 1I, synapsin abundance declines at 90 days. This appears quite strange. The authors should comment on it in the text. We will add a line about use of synapsin in the western. Synapsin is only used qualitatively to highlight mature neuronal culture age, as was done in Sidhaye et.al PMID: 36989136.

      In the revised main text, we will add the following explanation: "We also analyzed the expression of synapsin-1, a synaptic vesicle protein that serves as a marker for mature synapses and functional neuronal networks. The presence of synapsin-1 indicates the development of synaptic connections in our cultures, providing evidence of neuronal maturation."

      .

      The decline and thereby variability in synapsin-1 protein levels has been reported before. Regarding the decline in synapsin-1 at 90 days, we can add the following discussion:

      "We observed a decline in synapsin-1 levels at 90 days in vitro (DIV) compared to earlier time points. This pattern has been previously reported in iPSC-derived neuronal models (Togo et.al PMID: 34629097 and Nazir et.al PMID: 30342961). Such variability in synapsin-1 expression over extended culture periods may reflect the dynamic nature of synaptic remodeling and maturation processes in vitro. It's important to note that synapsin-1 levels can fluctuate due to various factors, including culture conditions and the heterogeneity of neuronal populations present at different time points."

      In Fig 2A and 3B there are clumps of green cells (CTIP2 positive). I am concerned that the lack of uniformity in the cell distribution could impact other analysis performed, where certain fields of view have been analysed e.g. by imaging or electrophysiology e.g. calcium measurements.

      To address the reviewers concern about uniformity, in the revised manuscript, we will provide/replace the representative images of deep layer markers along with MAP2 from all genotypes showing the areas selected for analysis to demonstrate that data collection was performed in comparable regions across all experimental conditions. As answered in the response to reviewer 1, comment 11.

      The clumps of neurons (as seen in Fig2A) poses challenges for obtaining high-quality seals during patch-clamp recordings. To address this, we primarily selected areas with sparsely distributed neurons for electrophysiology experiments. This approach ensured robust recordings. To address this, we can provide a clarification in the Methods section to explicitly state that neurons used for all patch-clamp recordings were chosen from regions where cells were sparsely distributed.

      In case of calcium imaging experiments, we focused on both crowded and sparse fields of views across genotypes to avoid potential biases introduced by clumped cells. However, it is to be noted that during the stages of terminal differentiation there are NSCs undergoing proliferation, which makes the neuronal culture denser. We can provide video files as a supplementary material to demonstrate the types of areas used for calcium imaging experiments. Additionally, we will include a statement in the Methods section specifying that regions with uniform neuronal distribution were selected for calcium imaging to ensure consistency in our analysis.

      In Fig 2J and 2K are the differences between sampels significant? The error bars are huge.

      From line 204-209, we have not used the word “significantly different”. We acknowledge that the error bars in Figures 2J and 2K are indeed large, which is not uncommon in electrophysiological recordings from iPSC-derived neurons due to their inherent variability. We have intentionally refrained from claiming statistical significance for these specific comparisons. Instead, we describe the data as showing a pattern or trend of reduced currents in OCRLKO neurons compared to WT2. To improve clarity, we propose to add a statement in the results section acknowledging the variability in these measurements and explaining our interpretation of the data as a trend rather than a statistically significant difference.

      In Fig S4- it would be good to show gene expression analysis and GFAP staining

      We are not completely sure what this comment means. However the present figure shows double staining with GFAP and S100beta. These will be split and shown separately to enhance clarity.

      Fig 5A needs more annotation- fold change comparing what to what?

      We will add the annotation “fold change wrt to WT1”.

      There should be more information provided in the main text relating to DLK1. For example, it is shown to be secreted, but no information is provided on whether this is expected. Secreted? The DLK1 blot in Fig 5F is not convincing.

      We will add more information relating to DLK1 and secretion status.

      DLK1 is a non-canonical notch ligand that is indeed known to be secreted by neighboring cells to either activate/inhibit notch pathway. While we acknowledge the blot could have been better, however, variability in the blot could arise due to differences in secretion efficiency, or protein stability in the cell culture media that could have led to inconsistencies across LSP genotypes. However, as shown in the blot, the OCRLKO shows a clear enrichment of secreted-DLK1 compared to WT2.

      We have performed the western blot analyses across two independent differentiations of organoids from WT1, LSP2, LSP3, LSP4, WT2, OCRL-KO iPSCs in phenol-free neurobasal-A medium, and quantified secreted protein. We then loaded 40mg of protein per genotype. Shown below is the quantification. The quantification of mean intensity of DLK1 band shows a moderate increase in LSP2, and substantial increase in LSP3 and LSP4 organoids as compared to WT1. While OCRL-KO a substantial increase compared to its control, WT2. A revised figure will be used in the revision.

      Rationale for choosing PIP5K1C

      PIP5K1C is one of the major regulators maintaining appropriate levels of the synaptic pool of PI(4,5)P2, synaptic transmission and synaptic vesicle trafficking (Hara et al., 2013 PMID: 23802628; Morleo et al., 2023 PMID: 37451268; Wenk et al., 2001 PMID: 11604140). Therefore, we were interested in rescuing the physiological phenotype, we chose PIP5K1C. Additionally, in initial experiments we found that inhibiting PIP5K1B using ISA-2011B killed the organoids or lead to detachment of 2D neuronal cultures.

      Fig 6D is confusing. I suspect the figure labelling is not correct- it does not correlate with the graphs.

      We apologise for the error and will correct this.

      Reviewer #2 (Significance (Required)):

      This paper is significant because it provides important new information on the neurological features of Lowe syndrome. The approach is novel in terms of studying this condition. The findings are likely to be of interest to clinicians, cell biologists, neurobiologists and those studying human development. My expertise is in membrane traffic and OCRL/Lowe syndrome. I am not a neurobiologist.

      We thank the reviewer for appreciating the importance of our study, novelty of findings and newof our approach we have used. We would light to highlight that while extensive work has been done with respect to the renal phenotype of Lowe syndrome, the brain phenotypes have remained largely a black box. This is in part because mouse knockouts of OCRL have failed to recapitulate the brain related clinical phenotypes displayed by Lowe syndrome patients (for e.g. PMID: 30590522; PMCID: PMC6548226; DOI: 10.1093/hmg/ddy449). Our study of brain development defects in Lowe syndrome depleted cells provides the first insight into the cellular and developmental changes in this disorder.

      Reviewer 3:

      This paper by Sharma et al describes findings in an iPSC model of Lowe Syndrome. This is an important line of research because no mouse models phenocopy the neurodevelopmental aspects of the condition. They identified a potential role of Notch signaling in pathogenesis, a potentially druggable target. However, several issues need to be addressed.

      We thank the reviewer for appreciating the importance of our study in covering the basis of the neurodevelopmental phenotype of Lowe syndrome. Due to a lack of a mouse model, there was previously no understanding of how the clinical features related to the brain arise.

      Major issues

      1. The sample size is very small, which is understandable to some extent given the expense and difficulty doing research using iPSCs. However, there are a couple of opportunities to improve the sample size. For example, in the analysis of DLK1 and other proteins shown in Figure 5, the analysis amounts to a single control vs the 3 patient lines, and a single control vs the KO line. The separation is justified because a complete KO of the gene might result in differences compared to hypomorphic mutation that apparently affects the 3 cases. However, there is no reason why WT1 and WT2 shouldn't be combined. They are both random controls. This might not affect the results of the other proteins analyzed, NOTCH and HES5, but the significance of DLK1 could change. Nature of the allele in LS patient lines

      There is a misconception in the reviewer comment that the OCRL allele in the three Lowe syndrome lines is a hypomorph. This is not correct. In the patients from whom these LS lines were generated, the nature of the OCRL allele and the status of OCRL protein in cells have been previously described in detail in a peer-reviewed, published paper from our lab. This paper (Akhtar et.al 2022 PMID: 35023542) has been cited in the present manuscript at the very first occasion that the LS patient lines are described (Line 174, references 26 and 27). As described in in ref 26 and 27, LSP patients have a mutation in exon 8 leading to a stop codon. This results in a protein null allele of OCRL in all three patient lines. This has been shown in Fig 1B of Akhtar et.al 2022 by immunofluorescence using an OCRL specific antibody (PMID: 35023542). It has also been demonstrated by Western blot using an OCRL specific antibody for all three LS patient lines in Fig 3C and 5C of the present manuscript. The nature of the allele will be highlighted more clearly in the revision.

      *Combining WT1 and WT2 *

      We are not in favour of combining WT1 and WT2. The reason for this is as follows.

      It is well recognized and discussed that genetic background can be a key factor contributing to phenotypes observed in cells differentiated from iPSC (Anderson et al., 2021, PMID: 33861989; Brunner et al., 2023, PMID: 36385170; Hockemeyer and Jaenisch, 2016, PMID: 27152442; Soldner and Jaenisch, 2012, PMID: 30340033; Volpato and Webber, 2020, PMID: 31953356). As a result, it is recommended that a line closely matched for genetic background be used when assessing the validity of observed phenotypes. The patient lines used in this study for Lowe syndrome were all derived from a family in India of Indian ethnic origin. Therefore, in order to reduce the impact of genetic background contributing to potential phenotypes, we have used a control line (referred to in this manuscript as WT1) derived from an individual of Indian ethnic background; this line has previously been developed and published by our group (PMID: 29778976 DOI: 10.1016/j.scr.2018.05.001).”

      In the case of OCRLKO we have genome edited NCRM5 (a white Caucasian male control line) to introduce a stop codon in exon 8 to mimic the truncation seen in our LS patient lines. This allele is also protein null as shown by Western blot using an OCRL specific antibody. The data is shown in Fig 2D of the present manuscript. Therefore, we reiterate that all the LS patient lines in this study and OCRLKO are protein null alleles.

      Status of DLK1 levels

      We have performed a combined analysis of DLK1 levels in the two control lines and all the patient lines as well as OCRLKO. As shown below the result remains unchanged, namely that DLK1 levels are elevated in OCRL depleted cells in this model system.

      Figure legend: Quantification of DLK1 protein levels in control, LS patient and OCRLKO iPSC lines. Western blot intensities for each patient line and OCRLKO were normalized to GAPDH and then to the respective internal WT control (WT1 or WT2) resulting in fold-change values. For statistical analysis across genotypes, normalized fold-change values from different gels were pooled post hoc. All statistical testing was performed on fold-change values. Statistical test used: Mann Whitney test. (A) Values for WT1 and WT2 have been combined and plotted against individual values for three patient lines and OCRLKO (B) Values for WT1 and WT2 have been combined and plotted against combined values for all three LSP lines and OCRLKO.

      Reviewer comment: DLK1 expression brings up another point. This, along with MEG3 and MEG8 are imprinted genes, two of the top differentially expressed genes in this study. However, these findings can be questioned by the well-known phenomenon that the expression of some imprinted genes may not be properly maintained during iPSC reprogramming. Thus, the differential expression of these imprinted genes might be due to a reprogramming artifact rather than the effects of OCRL per se. Analyzing both controls together could mitigate this objection. However, even if it does, the potential dysregulation of imprinted genes in the development of iPSCs should be acknowledged and addressed.

      We are aware that the DLK1 locus is imprinted. However, we feel that reprogramming artifacts are very unlikely to explain the observed changes in DLK1 levels.

      It is important to note that the patient lines and WT1 were not directly re-programmed from White blood cells to iPSC and then used for differentiation and analysis. As detailed in our previous peer-reviewed publications WT1 (PMID: 29778976) and the patient LSP lines (PMID: 35023542) were first converted to lymphoblastoid cell lines and subsequently reprogrammed into iPSC.

      We think that re-programming induced imprinting changes are unlikely to be responsible for the elevated levels of DLK1 seen in LS patient lines. The reason is as follows:

      We compared DLK1 levels in WT2 and OCRLKO which is a CRISPR edited line that introduces a stop codon in exon 8. NCRM-5/WT2 was derived from CD34+ cord blood cells. What we found is that levels of DLK1 are elevated in OCRLKO compared to WT2. Since OCRLKO was generated by genome editing WT2, it must be the case that the level of imprinting of the DLK-DIO3 locus is comparable if not identical between the two lines. Therefore, the difference in DLK1 levels between WT2 and OCRLKO cannot be a consequence of different imprinting status of the DLK1 locus between these two lines. Rather, it strongly suggests a causal link to OCRL deficiency. Following on from this, the DLK1 levels are elevated in patient lines compared to the OCRLKO. We will highlight and discuss and explain this in the revised version.

      Similarly, in the calcium signaling experiment shown in fig.2, the KO and patient lines are justifiably separated. However, again, why not combine both controls in the comparison with the patient samples?

      The data has been reanalyzed and presented as requested by the reviewer. There is no change in the conclusion.

      For the reasons described above, it remains our preference to present each set of lines with the appropriate control; i.e WT1 and the three LS patient lines and WT2 with OCRLKO. However, as the reviewer has asked for it, we also present below analysis in which WT1 and WT2 and combined and LS patient lines and OCRLKO are combined. The replotted data is shown below. The essential conclusion shown in the main manuscript remains, namely that [Ca2+]i transients in LS depleted developing neurons is lower than in wild type.

      Figure Legend: Replotted [Ca2+]i transients from LS patient lines, OCRLKO and two control cell lines WT1 and WT2 (A) There is no statistical difference in the frequency of [Ca2+]i transients between WT 1 and WT2. Test used-Mann Whitney test. (B) Plot with WT1 and WT2 data combined v all three LS lines and OCRLKO combined. Test used-Mann Whitney test. (C) WT1 and WT2 combined plotted against three individual patient lines and OCRLKO. Statistical test used One-way ANOVA. (total neurons analyzed: WT1:808; WT2:267; LSP2:150; LSP3:462; LSP4:463; OCRLKO:411)

      Regarding the hypomorphic nature of the patient-specific iPSC, I do not see the OCRL variant that was found in the family. Please correct me if I missed that, and if it was omitted, it should be included. I suspect that the variant generates a hypomorphic OCRL protein because the authors show expression in Figure 1D. Hypomorphic OCRL mutations compared with complete KO could show differences in molecular phenotypes, as found in Barnes et al. (PMID: 30147856) in an analysis of F-actin and WAVE-1 expression.

      Nature of the allele in LS patient lines

      There is a misconception in the reviewer’s comment that the OCRL allele in the three Lowe syndrome lines is a hypomorph. This is incorrect. In the patients from whom these LS lines were generated, the nature of the OCRL allele in them and the status of OCRL protein have all previously been described in detail in a peer-reviewed, published paper from our lab. This paper (Akhtar et.al 2022 PMID: 35023542) has been cited in the present manuscript at the very first occasion that the LS patient lines are described (Line 174, references 26 and 27). As described in in ref 26 and 27, LSP patients have a mutation in exon 8 leading to a stop codon. This results in a protein null allele of OCRL in all three patient lines. This has been shown in Fig 1B of Akhtar et.al 2022 by immunofluorescence using an OCRL specific antibody (PMID: 35023542). It has also been demonstrated by Western blot using an OCRL specific antibody for all three LS patient lines in Fig 3C and 5C of the present manuscript.

      The data presented in Fig.1D, E is a publicly available resource data PMID: 36989136 as mentioned in line 155, which is an integrated proteomics and transcriptomics generated from control iPSC-derived human brain organoids at different stages of development in-vitro.

      Minor issue

      The authors use the term mental retardation on line 102 to describe the cognitive phenotype in Lowe Syndrome. Medical, legal, and advocacy groups have abandoned this term because it is viewed as offensive. It is being replaced by intellectual disability, although this term also is problematic. In any event, many conferences on autism and intellectual disabilities are attended by families, and high-functioning cases sometimes address an audience of scientists. They would object to the use of this term if presented in a talk by one of the co-authors.

      Thank you. We will rephrase this line.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Not applicable at this stage. The above is a revision plan.

      4. Description of analyses that authors prefer not to carry out

      We prefer to not carry out replicates of the single cell multiome analysis. As explained above the state of the art in the single cell analysis field is to not do so. The scientific reasons as to why such replicates are not required have been explained in the response to the reviewer comment.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper by Sharma et al describes findings in an iPSC model of Lowe Syndrome. This is an important line of research because no mouse models phenocopy the neurodevelopmental aspects of the condition. They identified a potential role of Notch signaling in pathogenesis, a potentially druggable target. However, several issues need to be addressed.

      Major issues

      1. The sample size is very small, which is understandable to some extent given the expense and difficulty doing research using iPSCs. However, there are a couple of opportunities to improve the sample size. For example, in the analysis of DLK1 and other proteins shown in Figure 5, the analysis amounts to a single control vs the 3 patient lines, and a single control vs the KO line. The separation is justified because a complete KO of the gene might result in differences compared to hypomorphic mutation that apparently affects the 3 cases. However, there is no reason why WT1 and WT2 shouldn't be combined. They are both random controls. This might not affect the results of the other proteins analyzed, NOTCH and HES5, but the significance of DLK1 could change. DLK1 expression brings up another point. This, along with MEG3 and MEG8 are imprinted genes, two of the top differentially expressed genes in this study. However, these findings can be questioned by the well-known phenomenon that the expression of some imprinted genes may not be properly maintained during iPSC reprogramming. Thus, the differential expression of these imprinted genes might be due to a reprogramming artifact rather than the effects of OCRL per se. Analyzing both controls together could mitigate this objection. However, even if it does, the potential dysregulation of imprinted genes in the development of iPSCs should be acknowledged and addressed.
      2. Similarly, in the calcium signaling experiment shown in fig.2, the KO and patient lines are justifiably separated. However, again, why not combine both controls in the comparison with the patient samples?
      3. Regarding the hypomorphic nature of the patient-specific iPSC, I do not see the OCRL variant that was found in the family. Please correct me if I missed that, and if it was omitted, it should be included. I suspect that the variant generates a hypomorphic OCRL protein because the authors show expression in Figure 1D. Hypomorphic OCRL mutations compared with complete KO could show differences in molecular phenotypes, as found in Barnes et al. (PMID: 30147856) in an analysis of F-actin and WAVE-1 expression.

      Minor issue

      The authors use the term mental retardation on line 102 to describe the cognitive phenotype in Lowe Syndrome. Medical, legal, and advocacy groups have abandoned this term because it is viewed as offensive. It is being replaced by intellectual disability, although this term also is problematic. In any event, many conferences on autism and intellectual disabilities are attended by families, and high-functioning cases sometimes address an audience of scientists. They would object to the use of this term if presented in a talk by one of the co-authors.

      Significance

      as noted above, this study is significant because there are no mammalian models available to address the neurodevelopmental aspects of Lowe Syndrome

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This paper describes the effects of loss of OCRL (the Lowe syndrome protein) upon the function and differentiation of neurones, using an in vitro iPSC model system. Cells derived from three related Lowe syndrome patients and an OCRL knockout, generated using CRISPR, were used for these experiments. The results show that upon loss of OCRL, differentiation of stem cells into neurones is reduced, with an increased number of cells adopting glial and astrocytic fates. The neurones that are generated have reduced calcium transients and electrical activity. Gene expression data combined with biochemical analysis indicate altered Notch activity, which may account for the altered cell fate data seen in the in vitro differentiation model. Finally, rescue of cell fate and neuronal activity is seen upon knockdown of a PIP5K, which indicates that these phenotypes are due to the elevated PIP2 levels seen on the OCRL-deficient cells.

      The results provide new insights into the pathogenesis of Lowe syndrome. I found the paper to be well done, and the data supports the conclusions of the authors. I have a few comments below that may improve the manuscript:

      Major points

      1. The UMAP and ATAC-Seq data indicate different maps for the two different Lowe syndrome patient-derived cells (Fig 4 and Fig S3). This suggests that the cells are quite different, and therefore that changes seen in one Lowe syndrome patient may not be applicable to the others. I think this heterogeneity has important implications for the paper i.e. how general are findings obtained? Several different glioblast types are described (numbered 1-5 )- how different or similar are these?
      2. The PIP5K inhibitor seems to have a very strong effect on both WT and KO cells in terms of Notch activity (Fig 5G). This strongly suggests the effects of this inhibitor are not through OCRL and that changes in PIP2 induced by the inhibitor override those of OCRL. Thus, the experiments shown in Fig 5 seem not to be due to a rescue of OCRL activity as such.

      Minor points

      1. The main text needs to say what synapsin is and why it was analysed. In Fig 1I, synapsin abundance declines at 90 days. This appears quite strange. The authors should comment on it in the text.
      2. In Fig 2A and 3B there are clumps of green cells (CTIP2 positive). I am concerned that the lack of uniformity in the cell distribution could impact other analysis performed, where certain fields of view have been analysed e.g. by imaging or electrophysiology e.g. calcium measurements.
      3. In Fig 2J and 2K are the differences between sampels significant? The error bars are huge.
      4. In Fig S4- it would be good to show gene expression analysis and GFAP staining for organoids made using the OCRL KO cells
      5. Fig 5A needs more annotation- fold change comparing what to what?
      6. There should be more information provided in the main text relating to DLK1. For example, it is shown to be secreted, but no information is provided on whether this is expected. Secreted? The DLK1 blot in Fig 5F is not convincing.
      7. Of the 3 PIP5Ks, only PIP5Kc was targeted. The rationale for picking only this one needs to be provided.
      8. Fig 6D is confusing. I suspect the figure labelling is not correct- it does not correlate with the graphs.

      Significance

      This paper is significant because it provides important new information on the neurological features of Lowe syndrome. The approach is novel in terms of studying this condition. The findings are likely to be of interest to clinicians, cell biologists, neurobiologists and those studying human development. My expertise is in membrane traffic and OCRL/Lowe syndrome. I am not a neurobiologist.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The study by Sharma et al uses iPSC and neural differentiation in 2D and 3D to investigate how mutation in the OCRL gene affects neural differentiation and neurons. Mutation in the OCRL gene the cause of Lowe Syndrome (LS), a neurodevelopmental disorder. Neural cultures derived from LS patient iPSCs exhibited reduced excitability and increased glial markers expression. Additional data show increased levels of DLK1, cleaved Notch protein, and HES5 indicate upregulated Notch signaling in OCRL mutated neural cells. Treatment of brain organoids with a PIP5K inhibitor restored calcium signalling in neurons. These findings describe new dysregulated phenotypes in neural cultures of OCRL mutated cell shedding light on the underlaying caus of Lowe Syndrome.

      Major comments

      In general, I think the use of iNeurons usually means direct reprogramming from a somatic cell to neurons without the iPSC stage. Could be confusing to use this term for iPSC derived neurons.

      Please add at least one more replicate of WP cell line to the single nuclei RNAseq.

      The WT1 and the patient lines are rarely analysed together with the WT2 and KO lines, thus it is tricky to understand if the KO line is mimicking the patient lines? Please, add more merged analyses. Co-analysing all lines:

      i) would show if the KO line is more similar to the patient lines or to the WT1 or somewhere inbetween.

      ii) Could answer questions about the variation in phenotypes between the genetic backgrounds.

      iii) Elucidate how much variability there is between the the two WT lines in your assays. If the two WT lines vary much then conclusions about phenotypes in the patients and KO lines might need to be rethought?

      Please include more discussion and rational around the link between the expression patten of OCRL and the various phenotypes shown. From the RNAseq data performed at the NSC state where the expression of OCRL is lower than in neurons there are considerable differences in cell type distribution between lines. How can this skew cell type distribution affect downstream differentiation and neuronal function? Also, OCRL is expressed also at the iPSC state as shown in Figure 1I, do you see any phenotypes in iPSC? If not, explain how that could be.

      There is not enough data in the manuscript to show mechanistic links between OCRL, DLK1 and Notch so be aware not to overstate the conclusions.

      Line 173, please describe what mutation in the OCRL these patients have, is it a biallelic deletion? Is the protein totally absents? Please show western blot analyses of the protein in the patient lines. Would be good with a bit of clinical explanation of these patients? Do they have the same level of severity? Are there any differences between their clinical symptoms? This could be interesting to link to differences in cellular phenotypes. Line 185, please be clear on how the CRISPR KO line genetically mimics the patients' lines.

      Figure 1I, could the protein levels at the different stages be quantified?

      Figure 3A, there seem to be much more cells in LSP2, making it tricky to compare with the other cell lines. Density during differentiation can affect the cell fate. Please, provide images from the different lines that are comparable with similar density. Please provide quantification to the statement that there is fewer number of S100B cells in the LSP lines. Figure 3B, the images show cells very different, and it is tricky to compare similarities and differences, please provide images that look more similar to each other. Avoid images with clusters of cells or make sure to select representative images with clusters from each cell line. If the clustering is a phenotype explain and quantify that. Make sure the density is similar in all pictures.

      Line 2018, the statement "In the same cultures, there was no change in the staining pattern of the neuronal markers MAP2 and CTIP2 (Fig 3B)" is not strengthened by the figure. Please provide new pictures or data to prove the statement.

      Figure 3E, please describe all markers in the picture, thus also MAP2, S100B, CTIP2 and draw conclusions. Try to show comparable pictures.

      Fig 3D and G, what are the replicates? please explain.

      Figure 4 A, C, there is a large difference in the ratio of different cell types between the different cell lines, also between the LSP2 and LSP3. This would indicate either that the genetic background affects the phenotype to a large extent or that there is large variability between rounds of differentiation. To understand how much variability that comes from the differentiation and culturing:

      i) another replicate of WP cell from another donor (WT2) should be included (single nuclei RNAseq)

      ii) Confirm that three independed rounds of differentiation of the WT1, WT2, LSP2, LSP3, LSP4, and OCRL-KO result in similar outcome when it comes to cell type distribution. Could be done with qPCR marker.

      Line 309, "astrocytic transcripts NF1A and GFAP was elevated" It is unclear from this sentece in which cell lines NF1A and GFAP is elevenated? Please explain.

      Figure 5C, E, G, there is a large variation of Cnotch and Hes5 experession between the different

      Figure 5H, unclear which of the bands that is DLK1 and how the bands relate to the quantification. The band at 50 kDa seems to be stronger in the WT2 than in the OCRL-KO but in the quantification in Figure 5I, it shows 2x more in the KO. Thus, the other way around.

      Figure 6, please show that the inhibitor is inhibiting PIP5KC. Have you titered the added concentration of the inhibitor?

      Figure 6B, why do the calcium data for the WT2+1Ci look so different to the other, the dots are much more spread and seem to fewer replicates that for the other sample, please explain.

      Figure 6F, there is no significant differences between the bars but the statement in the text (sentence starts on line 332) indicate it is, please update the figure or remove the statement.

      Figure 6G, the HES5 expression seem to behave very similar in both WT2 and OCRL-KO cells when the inhibitor is used. What does this mean? Seems to not be linked to OCRL. Explain.

      Minor comments

      The panels in Figure 6 are not completely referred to correctly in the text, please check. Double check that all figure panels are referred to properly in the text

      Significance

      The manuscript is an interesting addition to the in vitro iPSC derived cellular modelling of neurodevelopmental disorder.

      Strengths:

      The use of both patient iPSC lines and CRISPR edited lines The use of both monolayer and 3D cultures

      Weaknesses: the significance decrease a bit due too few replicates (only 1 WT line in each experiment) and the variability between the patients' cell lines.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Maxian et al. developed a mathematical model to explain the essential elements and interactions necessary and sufficient for the polarisation of the C. elegans zygote. The initiation of zygote polarisation has been extensively studied in recent years, highlighting the role of the centrosomal kinase Aurora-A (AIR-1) in controlling the cortical distribution of RhoGEF (ECT-2) and actomyosin contractility during polarisation. Although genetic experiments have demonstrated their function in this process, it remains to be tested whether these factors and their interactions are sufficient to induce polarisation.

      This work has provided a theoretical framework to predict the activity of AIR-1 in the cytoplasm and at the cell cortex, and the cortical distribution of ECT-2 and myosin-II (NMY-2). This framework can recapitulate the dynamic rearrangement of ECT-2 and myosin-II during polarisation, with centrosomes positioned at the posterior pole of the zygote. This model can explain, at least in part, the asymmetric distribution of ECT-2 and myosin-II in the zygote undergoing cytokinesis, suggesting that the mechanism of AIR-1-mediated control of ECT-2 and myosin-II would regulate patterning during polarisation and cytokinesis. This theoretical framework is developed with reasonable assumptions based on previous genetic experiments (except for the myosin-dependent regulation of ECT-2; see comments below).

      Issue #1

      The authors insist that this model correctly predicts the spatio-temporal dynamics of ECT-2 and myosin-II during polarisation and cytokinesis. However, the predicted results do not reproduce the in vivo pattern of ECT-2 in both phases. ECT-2 is cleared from the posterior cortex and establishes a graded pattern across the antero-posterior axis during polarisation (see their previous publication in eLife 2022, 11, e83992, Fig1A -480s) and cytokinesis (see eLife 2022, 11, e83992, Fig1C 60s and 120s). During both stages, ECT-2 does not show local enrichment at the boundary between the anterior and posterior cortical domains in vivo. In fact, when comparing the predicted results with the in vivo pattern of ECT-2 and cortical flow, the authors used non-quantitative descriptions such as 'in good agreement', 'a realistic magnitude', 'resemble'. These vague descriptions should be revised and a quantitative assessment of ECT-2 distribution between in silico and in vivo should be included in a revised manuscript.

      Issue #2

      I assume that the strange local enrichment of ECT-2 at the anteroposterior boundary is due to their assumption that the binding rate of ECT-2 is increased by a linear increase via cortical myosin-II (page 6). This assumption is not directly supported by experimental evidence. A previous study by the same group (eLife 2022, 11, e83992) showed that a progressive increase in ECT-2 concentration at the anterior cortex is partially accompanied by an increase in cortical flow and transport of myosin-II from the posterior pole to the anterior cortex. This observation supports the idea that ECT-2 may associate with cortical components transported by myosin-II based cortical flow. This unrealistic assumption makes the predicted distribution pattern of ECT-2 almost identical to that of cortical myosin-II, resulting in an increase in the concentration of ECT-2 at the anteroposterior boundary where myosin-II forms pheudo-cleavages and cleavage furrows. The authors should clarify why their mathematical model used this assumption and provide a comprehensive analysis and evaluation of the parameter value for an ECT-2-myosin-II interaction.

      Significance

      This work includes a valuable tool that can be used to explain other actomyosin-mediated polarisation processes. Although the paper provides useful insights in principle, the weakness of this work is that the model is designed with parameter sets that only recapitulate previously published phenotypes. Therefore, this paper confirms previous findings and provides less/no new mechanistic insights into cell polarisation. As such, this work would be of interest to specialised cell biologists and biophysicists working on the cytoskeleton and cell division, but will not be of general interest to biologists and biochemists.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Maxian, Longhini and Glotzer presents purely modeling work performed by the first author in conjunction with the already published experimental work by Longhini and Glotzer (eLife, 2022). The aim of the manuscript is to provide a mathematical model that connects the actomyosin contractility of the cell cortex in C. elegans zygote with the activity of the centrosomal kinase AurA (AIR-1 in C. elegans). The major claim of the authors is that their model, fitted to the experimental data pertaining to the zygote polarization, also describes dynamics during the zygote cytokinesis. In the model, the authors provide a heuristic approach to the biochemical dynamics, reducing their treatment to two variables: myosin and Ect2 Rho GEF. The biochemical model is integrated with a simple 1D active gel-type model for the cortical flow. The model uses static diffusive field of activity of AurA kinase in the cytoplasm as an input to their chemo-mechanical model. Major concerns: 1. The biochemical model is highly heuristic and several major assumptions are poorly justified. Thus, the authors explicitly introduce recruitment of Ect2 by myosin, something apparently based on the experimental observations by Longhini and Glotzer in 2022, which had not been biochemically confirmed since with a clear molecular mechanism. 2. The contribution of AurA is introduced highly schematically as a term based on enzyme inhibition biochemistry that increases the off rate of Ect2. The major assumption of the model is that AurA phosphorylates Ect2 strictly on the membrane (cortex) of the cell. Why? No molecular justification is given. If the authors cannot provide clear justification, this major assumption has to be clearly declared as such. The phosphorylation/dephosphorylation dynamics of Ect2 is not considered at all. 3. In the equation for myosin, the authors introduce disassembly/ inactivation term proportional to the fourth order of concentration of myosin. Why? This is a major assumption, which appears to be derived from the work by Michaux et al. 2018. There the authors (Michaux et al.) postulated that the rate of inactivation of RhoA GTPase was somehow proportional to the fourth power of RhoA concentration. It appears that Maxian et al. further assume that the myosin concentration is fast variable enslaved by Rho, so that M ~ [RhoA]. They then presumably assume that if the rate of degradation/ inactivation of Rho is proportional to the forth power of Rho concentration, so is true for myosin (M). This is a logical error and is not justified. An important question, why do the current authors need this unusual assumption with such a high power of M disassembly/inactivation? Perhaps, this is because without this rather dubious term the cortex flow produces a blow-up of myosin concentration? This would be expected in their mechanical model - the continuous flow of actomyosin not compensated by cortex disassembly generally causes blow-up of biochemical concentrations transported by the flow, this is a known problem of the "simple" active gel model used by the authors. Maxian et al. have to provide clear derivation of the term -kfb*M^4 and also demonstrate why they need this exotic assumption. 4. The equation for myosin M has a membrane-binding term, which is second order in concentration of Ect2 ~E^2, without which the model will not show the instability that the authors need. The only justification given is that "some nonlinearity is required". A proper derivation should be given here. 5. The diffusion coefficients for Ect2 and myosin are chosen to be the same. Why? Clearly these molecules so different in size - myosin being a gigantic cluster monster of ~300 nm believed to be bound to actin, should have a much smaller diffusion coefficient? 6. There are confusing statements regarding the role of actomyosin flows. In the beginning of the manuscript, the authors seem to state that since Ect2 has a high off rate, the effect of the flow on Ect2 localization is negligible in comparison with direct binding to myosin. Later, the authors state that flows are absolutely essential for the patterning. The authors need to clearly explain where and how the flows are important or not. Minor points: 1. page 9. Why is the rate of dephosphorylation of AurA is named Koff? 2. page 10. "Note that the model is calibrated to predict... which matches experimental observations" - this sentence needs changing. You want to say that you fit the model to experiments in the Longhini and Glotzer paper. There is no prediction here. 3. page 14. "A plot of Ect-2 accumulation as a function of distance from the nearest cortex..." - clearly the word "centrosome" is meant here instead of "cortex". 4. page 16. "Inactive, non-phosphorylatable version of Ect-2..." - non-phosphorylatable is clear, but why inactive?

      Significance

      This reviewer sees limited significance of this manuscript to the field in general. The modeling approach is hardly novel as it is based on a variety of published models, all cited by the authors, to be precise. The model, being very simplistic and heuristic, is not predictive. The main novelty of the current manuscript is the introduction of the effect of Aurora A on the activity of the actomyosin cortex. Since this is taken to be very schematic, simply via the effective increase in the off rate of Ect2, the model is showing that it is consistent with the earlier published experimental results by Longhini and Glotzer. This is to be expected. The main claim of the authors, that the model fitted to the polarization data also qualitatively describes the cytokinesis (there are no quantitative data to compare to) is probably valid, but the result is not surprising either. At best, the model can be labeled as fitted to the data and confirming the experimental results. Since it contains several postulated heuristic terms not properly justified on the mechanistic level, this is also not surprising.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this article, Maxian et al. propose a model combining 1-d simulations of ECT-2 and Myosin concentration at the cortex through binding/unbinding and advection at the cortex, with an input for AIR-1 cortical concentration based on the spatial localisation of the centrosomes in the cytoplasm. The objective of the authors is to recapitulate the role of (1) AIR-1, (2) its effector ECT-2 and (3) the downstream effector, driver of cortical flows, the molecular motors Myosin, in two key physiological processes, polarization and cell division. This is important as work over the last 10 years have emphasized the role of AIR-1 in embryo polarization. Previous biochemical-mechanical models have focused on RhoA/Myosin interactions (Nishikawa et al, 2017), the importance of a negative feedback and excitable RhoA dynamics (Michaux et al, 2018), or anterior PARs/posterior PARs/Myosin (Gross et al, 2019). The authors thus attempt to provide a new descriptive model in which RhoA is implicit, instead focusing on the role of centrosome localization on AIR-1 localization, and providing a framework to explore polarity establishment and cell division based on these 3 simple players. The first part of the model is very reminiscent of previously published models, while the second instead provides a link between the initial polarizing cue AIR-1 and polarization. Based on this description, the model is precisely tuned to achieve polarization while matching experimental observations of flow speed and ECT-2 A/P enrichment shape. The results are therefore certainly new and interesting.

      Major comments:

      1. The authors use the position of the centrosomes as a static entry, resulting in a static AIR-1 input. Is this true, or are the positions of the centrosomes dynamically modulated over the course of the different processes simulated here (for example as a consequence of cortical flows?), and if so, is the assumption of immobile position?
      2. While in its principle the model is quite simple and elegant, the detailed form of the equations describing the interactions between the players is more complex. Are all these required? If they are crucially important for the behavior of the model, these should be described more thoroughly, and if possible rooted more directly in experimental results, in particular:
        • k(ME)MEc (Linear enhancement term): why would myosin impact E concentration? The authors state, p.7, "There is a modest increase in the recruitment rate of ECT-2 due to cortical myosin (directly or indirectly), in a myosin concentration-dependent manner (Longhini and Glotzer, 2022)." I could not find the data supporting this assumption - Longhini and Glotzer apparently rather point to a modulation of cortical flows. ("During anaphase, asymmetric ECT-2 accumulation is also myosin-dependent, presumably due to its role in generating cortical flows."). Embedding this effect in the recruitment rate instead of expecting it from the model thus appears awkward. Could the authors specify how they came to this conclusion, which the authors might have derived from observations made in their previous work, but maybe did not fully document there?
        • k(EM)E^2Mc (ECT-2 non-linear impact on Myosin): does the specific of the value to convey the enhancement (square form) have an impact on the results?
        • k(fb)*M^4 "The form of this term is a coarse-grained version of previously-published work (Michaux et al., 2018)." Myosin feedback on myosin localization proportionally to M^4 does not seem to directly derive from Michaux et al... Please detail this points more extensively and detail the derivation, in the supplements if not in the main text. P23. Parameter values: "This is 1.5 times longer than the estimate for single molecules (Nishikawa et al., 2017; Gross et al., 2019) to reflect the more long-lived nature of myosin foci during establishment phase (Munro et al., 2004)." Not sure what the authors mean by more long-lived duration of foci during establishment phase. Seems rather arbitrary.
      3. It would be very helpful (and indeed more convincing) to include a direct comparison between modeling results and experimental counterpart whenever possible. This might not be possible for some data (e.g. Fig. 3d from Cowan et al), but should be possible for other, in particular Fig. 3c and Fig. 5b, for the flow speed and ECT-2 profiles. In Fig. 5b in particular, previously published experimental data could be produced to give the reader to compare model with experiments (possibly provided as an inset, at least for the wild type conditions).

      Minor comments

      Fig. 5b: ECT-2 C 6A(dhc-1) do not seem to be referenced or discussed in the main text. Also, why present the results for the flow for 2 conditions and the ECT-2 localisation for 4? Or does the variation of ECT-2 not impact the flow profile?

      p.6: Eqn 1a: ^ missing on 3rd E?

      p.6: Given that the non-normalized data is used in the main text, and the normalized only appears in the supplemental, maybe star the dimensionless and remove all hats from the main for greater legibility?

      p.14: replace "embryo treatment" with "experimental conditions"?

      p.21, S4a: add A=Â/Atot

      p.22: "L = 134.6 μm" - please write 134µm to retain the precision of original measurements

      p.22: Please provide formula for all dimensionless values as a table at the end of the supplemental for the eager but less-mathematically proficient reader.

      The authors' attention to providing specific citations including figure number corresponding to the specific point they reference in the papers they cite is appreciated.

      Significance

      General assessment:

      This modeling paper interestingly leverages existing experimental data to develop a new mathematical model of embryo polarization and cell division focusing on the role of AIR-1/Aurora Kinase. It combines classical 1-d advection/diffusion-reaction scheme with an upstream cue, AIR-1/Aurora Kinase, the profile of which is defined by the localization of the two centrosomes, and use the model as a framework to explore cortical flows and ECT-2 and Myosin cortical localization. Calibrated using information from polarization phase, the model recapitulates without any further tuning, in a variety of mutants, key localisation hallmarks of Ect-2 during cell division, simply based on the localization of the centrosomes. Finally, it provides strong, experimentally testable predictions of the validity of the proposed model.

      Advance:

      In particular, this study provide compelling evidence showing that their model, based on dynamics during polarization, is sufficient to explain the ultra-sensitivity of cortical ECT-2 accumulation to centrosome distance during cell division. Their model further predicts that short ECT-2 cortical residence time is required to prevent advection-mediated counter-flows of ECT-2 that would otherwise prevent polarization, a prediction testable experimentally by engineering modifications of ECT-2 cortical residence time.

      Audience:

      This is primarily a modeling paper. Although the bulk of the article is written to capture the interest of cellular biologists with a sound backgrounds in mathematics and an interest in minimal models of cell division and polarization, the overall conclusions and prediction are further-reaching and would be of interest to a larger audience with an interest in cell division, polarization, and the role of Aurora Kinase in these processes.

      Expertise:

      Developmental biology / Cell biology / Biological physics

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Referee #1 Major concerns:

      1) One major concern that I have about the sexual dimorphism in tolerance to nutrient deprivation is that the papers cited by the authors, and paradigms that are used broadly in the field, all use adult flies. The authors must show that in larvae, a completely different life stage from their citations, there is a sexual dimorphism in tolerance to nutrient deprivation.

      In our descriptions of previous literature that describes tolerance to nutrient deprivation, we have added language that specifies that the results from nutrient deprivation mentioned therein were performed in adults (lines 82, 91, 96, highlighted in the preliminary revision).

      In response to the concern from this reviewer that our data do not assay for nutrient deprivation in larvae, we would like to clarify that our “stress tolerance assay” more specifically demonstrates that developmental nutrient deprivation compromises male survival through pupariation to adulthood. While the effects of acute nutrient deprivation on developmental delay can be assayed in foraging or earlier larval stages, we have not tested whether ATF4 signaling is present and dimorphic in these stages and believe it to be beyond the scope of this study. In the revision, we will edit the text to be more precise in our conclusions with these data.

      Interestingly, Diaz et al 2023 (Genetics) show that male larvae have greater fat stores than female larvae. Considering fat is the main determinant of tolerance to nutrient deprivation it's not clear that females will actually survive nutrient deprivation longer as larvae. This is an essential test of whether female larvae do have increased tolerance to nutrient deprivation, which is the basic foundation of the authors' model.

      We thank the reviewer for making this clarifying point about the relationship between fat stores and nutrient deprivation. ____In response to the concern our data do not assay for nutrient deprivation in larvae (major point #1), we would like to clarify that our “stress tolerance assay” more specifically demonstrates that developmental nutrient deprivation compromises male survival through pupariation to adulthood. While the effects of acute nutrient deprivation on developmental delay can be assayed in foraging or earlier larval stages, we have not tested whether ATF4 signaling is present and dimorphic in these stages and believe it to be beyond the scope of this study. In the revision, we will edit the text to be more precise in our conclusions with these data.

      2) Another concern is the way that the authors "genetically induce nutrient deprivation using methioninase overexpression". As they acknowledge in the discussion (Line 381-390), methioninase overexpression will have many cellular effects. While there is no doubt that methionine levels would be lower in their model, it is less certain whether this is the main driver of the male-specific lethality.

      There are two potential solutions to this problem. First, the authors could change the text throughout the paper to more accurately describe their paradigm as "methioninase-induced lethality" rather than "nutrient deprivation". This would limit the scope of their scientific question and the conclusions they draw, but would eliminate the need for more experiments.

      The second solution would be to complete experiments to establish the following points: i) methioninase overexpression causes all the classical features of nutrient deprivation (e.g. changes to canonical signaling pathways such as TOR); ii) using other genetic means of nutrient deprivation such as slimfast-RNAi to see if those manipulations phenocopies the male-specific lethality they see with methioninase overexpression; iii) testing a role for ATF4 in mediating sex differences (if any) in other contexts such as slimfast-RNAi. This will take 2-3 months but is essential to draw any conclusions about whether their paradigm is truly a model for nutrient deprivation.

      We agree that methionine depletion is not the only cellular change effected by methioninase over-expression. For example, a molecular byproduct of methioninase metabolism via methioninase is the production of ammonia, which has recently been shown to indue ISR signaling in the context of____ alcohol-associated liver disease (Song et al. 2024, PMID 37995805). We believe our experimental controls and genetic rescues account for this and other possible effects in the interpretation of our data. ____To further establish the utility of methioninase overexpression as a genetic means of methionine deprivation (first described in Parkhitko et al. 2021, PMID 34588310), we will perform ____slimfastRNAi_ in the fat (another genetic means of reducing intracellular amino acid levels) per the reviewer’s suggestion. In these animals we will evaluate 1) ATF4 activity in L3 adipocytes using 4E-BPintron-GFP (1.5 months) , and 2) male vs. female lethality (as determined by counting eclosed adults) (2 months. If male lethality is observed with _UAS-slimfastRNAi _as with _methioninase ____expression, we will test the requirement for dimorphic ATF4 signaling in the fat for such male susceptibility to lethality/female resistance to lethality. (3 months)

      3) Another important point is that the authors state that sexually dimorphic ATF4 activity in the fat body is instructed by sexual identity in a cell-autonomous manner. Despite a clear decrease in ATF4 reporter levels in tra mutants, the fat body-specific tra-RNAi effect on the ATF4 reporter was less convincing. Together with the fact that changes to tra in the fat body affect insulin secretion from the insulin-producing cells, it is possible that the effect on ATF4 is not cell-autonomous. To conclusively test if sexual identity regulates ATF4 in a cell-autonomous manner the authors should use the flp-out system to make Tra-expressing or tra-RNAi-expressing clones in the fat body. This would take approximately 1.5 months to make the strain and test this.

      We thank the reviewer for making the astute observation that the effect of fat body-specific ____tra_ knockdown on female ATF4 reporter activity was more modest than whole-animal _tra_ mutants. We ascribe this to RNAi knockdown efficiency rather than non-autonomous effects of sexual identity on ATF4 expression in the fat. This is underscored by our data showing fat body knockdown of _spenito_ (_nito_), a sex determinant upstream of _tra____ that is shown to instruct female sexual identity in the larval fat (Diaz et al. 2023, PMID 36824729), does indeed reduce ATF4 levels in female fat to that of control male fat (Fig. 2K).

      4) As the authors show for the UAS-methioninase, other UAS lines used in the paper such as UAS-traF, UAS-tra-RNAi, UAS-dsx-RNAi may have leaky effects on gene/reporter expression. The authors must include a UAS only control to establish that the tra-RNAi, UAS-traF, UAS-dsx-RNAi do not affect gene/reporter expression.

      We thank the reviewer for suggesting that we evaluate the “leakiness” of all UAS lines used in this study (major point #4). To do this, we will quantify ATF4 reporter activity in the fat (4E-BPintron-GFP) in the presence of UAS lines but in the absence of GAL4 for ____UAS-traRNAi_, _UAS-traF_, and _UAS-dsxRNAi____ (1.5 months)

      5) I have concerns about the statistics used. In the methods and legends only t-tests are mentioned; however, when three groups are compared a one-way ANOVA with post-hoc tests must be used to correct for multiple comparisons. To compare differential responses to genetic/environmental manipulations between the sexes, a two-way ANOVA must be used. For example, to conclude that males and females have different responses in the two-way ANOVA, there must be a significant genotype:sex interaction. The p-values for comparisons between genotypes in either the one-way or two-way ANOVA must be derived from post-hoc tests within the ANOVA analysis.

      __We thank the reviewer for carefully assessing our usage of statistical analyses to interpret the data in the study. To the best of our understanding, such ANOVA analyses are helpful in evaluating significance when comparing multiple sample groups simultaneously. However, in all our analyses we are only ever comparing two samples at a time, making a two-tailed Student’s t-test with Welch’s correction (assuming unequal variance) to be the best statistical method. __

      Referee #1 Minor points

      1) Please ensure to make the reader aware of which life stage was tested in the literature cited supporting sexually dimorphic tolerance to nutrient deprivation.

      We thank the reviewer for pointing out this ambiguity in our description of previous and current work on nutrient deprivation tolerance. We address this minor point in tandem with major point #1 above ____by adding language that specifies that the results from nutrient deprivation mentioned therein were performed in adults (lines 82, 91, 96, highlighted in the preliminary revision).

      2) Published data about sex-specific mechanisms of metabolic regulation mean that the introduction should be more fully cited than it is. Even in the introduction "the molecular basis of these differences and how they impact tolerance to nutrient deprivation is still under investigation" is inaccurate, as there are published studies identifying some mechanisms (work on gut hormones and sex-specific effects on starvation resistance and body fat, role of ecdysone on body fat and feeding, sex-specific roles for brummer and Akh in regulating body fat, intestinal transit and gut size and feeding). Please adjust the paper to acknowledge this growing body of knowledge.

      We thank the reviewer for appropriately highlighting that there are other relevant studies in the context of sex-specific mechanisms of metabolic regulation in addition to those referenced in the original manuscript. Specifically, we will include additional citations and appropriate descriptions of previous work, such as those that report on sex-specific effects of starvation (i.e. Millington et al. 2022, PMID 35195254) and sex-specific roles for metabolic regulators such as Brummer/ATGL (Wat et al. 2020, PMID 31961851) and Adipokinetic hormone (Wat et al. 2021, PMID 34672260) in _Drosophila fat storage._ __

      3) Please list the ingredients per L so that individuals can replicate the diet easily.

      __We thank the reviewer for requesting additional details on the diet fed to animals in this study, which will improve the reproducibility of our findings. In the Methods section, we have now included additional details on the specific diet fed to animals used in this study (lines 465-468 in the preliminary revision).____ __

      4) Please cite grant numbers for all the community resources (e.g. Bloomington, DSHB), and please acknowledge FlyBase and its grants as well. For example, here are the instructions for citing BDSC https://bdsc.indiana.edu/about/acknowledge.html and similar instructions are available for the other resources.

      We thank the reviewer for underscoring the importance of citing grant numbers for all community resources used. We have added to the Acknowledgements section statements and grant numbers regarding use of community resources such as FlyBase, Bloomington Drosophila Stock Center, and DSHB (lines 533-538 in the preliminary revision).

      Referee #2:

      1. Figure 4 is an important part of this study, where the authors show a male-specific vulnerability to methioninase expression. They show that ATF4 RNAi confers vulnerability to methioninase expression even in females. An obvious question is whether ATF4 overexpression is sufficient to enhance resistance to methionine deprivation in males.

      We thank the reviewer for pointing out that the ability of increased ATF4 in male fat to enhance resistance to methionine deprivation was not interrogated. To examine this, ____we will quantify survival rates of males and females following dual over-expression of methioninase and ATF4 (3 months). We would like to state here that experimental over-expression of ATF4 at the levels induced by GAL4 activity is sometimes lethal, so this experiment may be difficult to execute/interpret due to technical limitations.

      Methioninase expression results (Figure 4) are interesting. Are the levels of methioninase expression similar between males and females?

      We thank the reviewer for asking for clarification on whether methioninase induction is similar between males and females. Whether methioninase induction is sexually dimorphic is likely a function of whether there is sexual dimorphism in the strength of the GAL4 driver used. While the drivers employed in this study are widely used for fat body expression, to our knowledge relative expression of ____Dcg-GAL4_ in males versus females has never been reported. Thus, we will perform qPCR to compare GAL4 and methioninase transcript levels in _Dcg-GAL4; UAS-methioninase____ male and female fat bodies (1 month).

      1. This manuscript focuses on ATF4, but there could be additional possible reasons for the sexually dimorphic ISR activity. For example, the degree of physiological stress that activates ISR could be different between males and females. I suggest comparing the levels of Phospho-eIF2alpha (or any other markers upstream of ATF4) in both sexes.

      We thank the reviewer for suggesting additional checks for sexual dimorphism in ISR activity in the fat, such as degree of eIF2α phosphorylation, which is directly upstream of ATF4 induction. Per their suggestion, we will compare p-eIF2α staining in male and female larval adipocytes (1.5 months).

      In Figures 1 to 3, the authors examine the intensity of ATF4 signaling after perturbing the sexual determination pathway. The methioninase experiments in Figure 4 are interesting, but there is nothing in this Figure linking male-specific vulnerability to sex determination genes. Examining the vulnerability to methioninase expression after perturbing the sexual determination genes would make Figure 4 integrate better with the rest of the manuscript.

      We thank the reviewer for highlighting that the role of male sexual identity in vulnerability of males to methioninase expression was not interrogated. Similar to our genetic interaction study proposed in point #1 from this reviewer, we will test whether feminizing male fat bodies (using UAS-traF over-expression) will change survival rate of males in our methioninase-expression paradigm (3 months).

      1. The authors write that they generated 4EBP intron-GFP because the 4EBP intron-DsRed signal was frequently observed in the cytoplasm (line 122). They seem to suggest that the DsRed reporter is less reliable than the GFP reporter. However, they continue to mix results using 4EBP intron-GFP (Fig. 4A) and 4EBP intron-DsRed (Fig. 4F). The two figures examine slightly different conditions (Fig 4A shows tra1 KO females, while Fig. 4F shows traF males). If the DsRed reporter is less reliable due to the signal from the cytoplasm, the authors should show results with the GFP reporter in traF males.

      We thank the reviewer for raising the legitimate concern that the ____4EBPintron-DsRed_ reporter used for some of the included quantifications in Fig. 3 might be less reliable then _4EBPintron-GFP_ that was generated for this study. We have updated the manuscript text (in the Results section) to more accurately describe the justification for building the _4EBPintron-GFP____ line (lines 122-127 in the preliminary revision).

      1. In Figure S1, the authors label 4EBP intron-GFP as Thor2p-GFP, which is confusing. There are other parts in the methods section referring to Thor2p. I suggest using consistent terminology throughout the manuscript.

      We thank the reviewer for pointing out this typo. We have modified the text accordingly in Figure S1.

      Referee #3 Major concerns:

      1) Sexually dimorphic ATF4 activity (Figure 1 and associated supplemental figure) as evidenced by reporter expression is the basis of this study, yet a detailed description of the immunofluorescence quantification is lacking. The methods sections needs to include information on how a) images were acquired (Were the same acquisition settings used across all images?), b) the intensity measurements were taken (What software was used? Does each data point in the distribution represent a single nucleus (the assumption is yes)? Is nuclear size adjusted for? Panels A' and B' have obvious differences in nuclear size which would in turn affect total intensity measurements), c) the sample size (How many fat images taken per animal per sample/genotype? How many trials were performed?)

      We thank the reviewer for requesting additional information describing the immunofluorescence quantification methods. ____We have now added an additional paragraph to the Methods section detailing image acquisition for quantifying reporter activity (lines 483-494 in the preliminary revision).

      2) While the authors nicely address the lack in specificity for two of the Gal4 driver lines used in the study limitation section, the fact that the one driver that is fat body-specific, 3.1Lsp2-Gal4, shows a modest, not statistically significant decrease in Figure 4C still raises some concern. There is another Lsp2-Gal4 line described in Lazareva et al., 2007 (PLoS Genetics) that drives expression in larval fat, perhaps to combat the issue of 3.1Lsp2-Gal4 have low activity, as mentioned by the authors. Alternatively, this phenotype could be assessed using Gal4 lines that only drive expression in the other tissues (if available). Otherwise, the conclusion that ISR/ATF4 signaling specifically in the fat mediates the starvation response needs to be toned down.

      We thank the reviewer for carefully analyzing our data showing survival during methioninase over-expression using different GAL4 drivers. ____The reviewer raises a valid concern that the GAL4 driver with highest specificity for the fat body (that is, with the least off-target tissue expression), ____3.1 Lsp2-GAL4_, induces the most modest methioninase-induced lethality (major point #2). We attribute this to the fact that _3.1 Lsp2-GAL4_ is reportedly (and in our hands) a weaker driver than _Dcg-GAL4_ in the larval fat body. We will demonstrate this experimentally by performing UAS-nucGFP expression using both _Dcg-GAL4_ and _3.1 Lsp2-GAL4____ side by side and quantifying nuclear GFP intensity in the larval fat (2 months).

      The reviewer also mentions that the other drivers with more statistically significant effects on male lethality (____Dcg-GAL4_ and _r4-GAL4_, Fig. 4) are not restricted to the fat body. Importantly, both these drivers are also expressed in the blood lineage (hemocytes). To examine whether ISR activation in hemocytes contributes to the female stress tolerance (and/or male lethality) observed upon methioninase induction, we will quantify male and female survival rate following methioninase induction in the blood lineage using a blood-specific driver, _HHLT-GAL4____ (Mondal et al 2014, PMID 25201876). (2.5 months)

      3) Several analyses rely on RNAi, and this is understandably important for tissue-specific knockdown of gene expression. At least one of the two following issues needs to be addressed: a) the efficiency of knockdown for each gene are not provided or reported on and b) only single RNAi lines were used for each gene targeted for knockdown.

      We thank the reviewer for pointing out that the original manuscript does not report on knockdown efficiencies of the RNAi lines used in the study. The RNAi lines from the Harvard Transgenic RNAi Project (TRiP) collection (traRNAi, dsxRNAi, nitoRNAi) have been verified in Yan & Perrimon 2015 (PMID 26324914). The ATF4RNAi line was verified in Grmai et al. 2024 (PMID 38457339). We have included all citations for these validation studies in Table S1 in the preliminary revision.

      Referee #3 Moderate concerns:

      1) Lines 137-141: It would be nice to see a gel that confirms that these newly designed primers detect the expected isoforms (supplemental perhaps).

      We thank the reviewer for requesting confirmation of isoform specificity of the primers used to detect ATF4 transcript in the fat body in Fig. 2B-C. Because these are qPCR primers, they were all designed to produce amplicons of nearly equal size. There is currently no reliable method to specifically deplete one ____ATF4_ isoform at a time, which would be the only way to experimentally demonstrate isoform specificity of each primer set. However, we have designed each primer pair to specifically detect isoform-specific regions of _ATF4_ mRNA and have verified specificity (and lack of off-target products in the _D. melanogaster_ genome) _in silico____ using Primer-BLAST (NCBI).

      2) Lines 278-282 and Figure 4D: Shouldn't the second and fourth bars be compared? Based on the hypothesis and conclusion, second bar females can resist nutrient stress because they have ATF4, but fourth bar females can't because they don't have ATF4 - is this difference statistically significant?

      We thank the reviewer for pointing out this missing statistical report that compares the second and fourth bars in Figure 4D ____(females expressing methioninase, with and without ATF4 knockdown). We have now performed this analysis and reported the p-value in text (lines 282-285 in the preliminary revision).

      3) For all scatter plot graphs, figure legends should indicate what the horizontal line represents (is this the average?). Also, error bars and what they represent (SD or SEM) are not included or described.

      We thank the reviewer for asking for additional details on our graph annotations. We have added language to explain that 1) horizontal lines on ATF4 reporter quantification graphs denote mean intensity (Fig. 1 legend, lines 567-568 in the preliminary revision) and 2) error bars on qPCR graphs represent SEM (Fig. 2 legend, line 583 in the preliminary revision).

      Referee #3 Minor concerns:

      1) Line 27: "counter parts" should be one word 2) Line 33: should the word "nutrient" be included before "stress" 3) Line 42: It would be nice to see a couple of examples of the "well documented across species" statement 4) Line 44-45: Add in the word "human" before population and use "women" instead of "females" 5) Line 53: There seems to be an issue with comma placement or word usage in the section of the sentence that reads "coincident with, or a comorbidity, for" 6) Lines 82-83: Mention of a couple examples would be nice 7) Line 104: Perhaps add the word "cellular" before "sexual" 8) Line 204: Delete the word "and" after "expression" 9) Line 234: Delete "a" before "significantly" 10) Line 276: Should "adult" be "adulthood" 11) For the discussion, a model schematic would nicely depict the findings as a whole 12) Line 330: May consider incorporating the following studies - Stobdan et al., 2019 and De Groef et al., 2021 13) Related to the point above: It would be great to see discussion/speculation of potential ATF4 targets that might be mediating this effect 14) Line 374: The placement of "yet unidentified" makes it seem like other ATF4 target genes aren't known, but really what is meant is that their sexually dimorphic expression is not known 15) Line 535: (beta-gal) "protein" instead of "gene"? 16) Figure S2: Please indicate what the two horizontal dotted lines are supposed to point out

      We thank the reviewer for carefully pointing out these minor yet critical text concerns. ____We have addressed all minor concerns raised by the reviewer in text edits to the preliminary revision, which are each highlighted in yellow in lines 27, 33, 44, 53, 105, 204, 236, 279, 375,554, 624 in the preliminary revision. The exceptions are points 3, 6, 11, 13, which we will address in the subsequent revision as described in the previous section.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: Using a combination of genetic and molecular tools, Grmai and colleagues present data showing the sexually dimorphic expression of ATF4, a transcription factor that mediates the integrated stress response, in larval fat tissue. Moreover, they find that higher basal ATF4 activity in female larvae supports the stronger resistance to nutrient deprivation that females exhibit compared to male larvae. The data are clearly described and nicely laid out in well-organized figures. Some major, moderate, and minor concerns, delineated below, regarding the approach and conclusions should be addressed prior to acceptance for publication.

      Major concerns:

      1. Sexually dimorphic ATF4 activity (Figure 1 and associated supplemental figure) as evidenced by reporter expression is the basis of this study, yet a detailed description of the immunofluorescence quantification is lacking. The methods sections needs to include information on how a) images were acquired (Were the same acquisition settings used across all images?), b) the intensity measurements were taken (What software was used? Does each data point in the distribution represent a single nucleus (the assumption is yes)? Is nuclear size adjusted for? Panels A' and B' have obvious differences in nuclear size which would in turn affect total intensity measurements), c) the sample size (How many fat images taken per animal per sample/genotype? How many trials were performed?)
      2. While the authors nicely address the lack in specificity for two of the Gal4 driver lines used in the study limitation section, the fact that the one driver that is fat body-specific, 3.1Lsp2-Gal4, shows a modest, not statistically significant decrease in Figure 4C still raises some concern. There is another Lsp2-Gal4 line described in Lazareva et al., 2007 (PLoS Genetics) that drives expression in larval fat, perhaps to combat the issue of 3.1Lsp2-Gal4 have low activity, as mentioned by the authors. Alternatively, this phenotype could be assessed using Gal4 lines that only drive expression in the other tissues (if available). Otherwise, the conclusion that ISR/ATF4 signaling specifically in the fat mediates the starvation response needs to be toned down.
      3. Several analyses rely on RNAi, and this is understandably important for tissue-specific knockdown of gene expression. At least one of the two following issues needs to be addressed: a) the efficiency of knockdown for each gene are not provided or reported on and b) only single RNAi lines were used for each gene targeted for knockdown.

      Moderate concerns:

      1. Lines 137-141: It would be nice to see a gel that confirms that these newly designed primers detect the expected isoforms (supplemental perhaps).
      2. Lines 278-282 and Figure 4D: Shouldn't the second and fourth bars be compared? Based on the hypothesis and conclusion, second bar females can resist nutrient stress because they have ATF4, but fourth bar females can't because they don't have ATF4 - is this difference statistically significant?
      3. For all scatter plot graphs, figure legends should indicate what the horizontal line represents (is this the average?). Also, error bars and what they represent (SD or SEM) are not included or described.

      Minor concerns:

      1. Line 27: "counter parts" should be one word
      2. Line 33: should the word "nutrient" be included before "stress"
      3. Line 42: It would be nice to see a couple of examples of the "well documented across species" statement
      4. Line 44-45: Add in the word "human" before population and use "women" instead of "females"
      5. Line 53: There seems to be an issue with comma placement or word usage in the section of the sentence that reads "coincident with, or a comorbidity, for"
      6. Lines 82-83: Mention of a couple examples would be nice
      7. Line 104: Perhaps add the word "cellular" before "sexual"
      8. Line 204: Delete the word "and" after "expression"
      9. Line 234: Delete "a" before "significantly"
      10. Line 276: Should "adult" be "adulthood"
      11. For the discussion, a model schematic would nicely depict the findings as a whole
      12. Line 330: May consider incorporating the following studies - Stobdan et al., 2019 and De Groef et al., 2021
      13. Related to the point above: It would be great to see discussion/speculation of potential ATF4 targets that might be mediating this effect
      14. Line 374: The placement of "yet unidentified" makes it seem like other ATF4 target genes aren't known, but really what is meant is that their sexually dimorphic expression is not known
      15. Line 535: (beta-gal) "protein" instead of "gene"?
      16. Figure S2: Please indicate what the two horizontal dotted lines are supposed to point out

      Significance

      Study novelty: This work begins to shed light on the underlying molecular mechanisms that mediate differential responses to nutrient deprivation in male and female larvae. The knowledge gained from Drosophila studies will very likely have implications for human adipose physiology given the known sex differences in adipose associated physiology and pathophysiology in men and women.

      General assessment: This study makes excellent use of the Drosophila melanogaster genetic toolkit to better understand the involvement of the ISR in mediating sexually dimorphic responses to nutrient deprivation. In addition, carefully thought-out figure layouts make the data easy to visualize. Limitations of the study include lack of specificity of fat body-specific driver lines and thus a potentially overstated conclusion.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      It is now well-established that the Integrated Stress Response (ISR) mediated by ATF4 plays important roles in metabolism and proteostasis. This manuscript by Grmai and colleagues reports that the sex determination genes tra and dsx allow higher levels of ATF4 expression in Drosophila. They further show that female flies depend on ATF4 to survive under conditions of metabolic stress.

      The presented data are technically sound, and the manuscript is generally very well written. It is a concise study with four Figures. The authors could have chosen to expand the scope: For example, they have shown the requirement, but not the sufficiency, of ATF4 in the sexually dimorphic nature of vulnerability to nutrient deprivation. They also demonstrate that ATF4 affects male-specific survival upon metabolic stress, which could be improved with additional experiments. These and other technical points are outlined below:

      1. Figure 4 is an important part of this study, where the authors show a male-specific vulnerability to methioninase expression. They show that ATF4 RNAi confers vulnerability to methioninase expression even in females. An obvious question is whether ATF4 overexpression is sufficient to enhance resistance to methionine deprivation in males.
      2. Methioninase expression results (Figure 4) are interesting. Are the levels of methioninase expression similar between males and females?
      3. This manuscript focuses on ATF4, but there could be additional possible reasons for the sexually dimorphic ISR activity. For example, the degree of physiological stress that activates ISR could be different between males and females. I suggest comparing the levels of Phospho-eIF2alpha (or any other markers upstream of ATF4) in both sexes.
      4. In Figures 1 to 3, the authors examine the intensity of ATF4 signaling after perturbing the sexual determination pathway. The methioninase experiments in Figure 4 are interesting, but there is nothing in this Figure linking male-specific vulnerability to sex determination genes. Examining the vulnerability to methioninase expression after perturbing the sexual determination genes would make Figure 4 integrate better with the rest of the manuscript.
      5. The authors write that they generated 4EBP intron-GFP because the 4EBP intron-DsRed signal was frequently observed in the cytoplasm (line 122). They seem to suggest that the DsRed reporter is less reliable than the GFP reporter. However, they continue to mix results using 4EBP intron-GFP (Fig. 4A) and 4EBP intron-DsRed (Fig. 4F). The two figures examine slightly different conditions (Fig 4A shows tra1 KO females, while Fig. 4F shows traF males). If the DsRed reporter is less reliable due to the signal from the cytoplasm, the authors should show results with the GFP reporter in traF males.
      6. In Figure S1, the authors label 4EBP intron-GFP as Thor2p-GFP, which is confusing. There are other parts in the methods section referring to Thor2p. I suggest using consistent terminology throughout the manuscript.

      Significance

      Overall, the authors report a novel and interesting observation because the sex determination pathway was not previously associated with ISR signaling. As many metabolic diseases show sex-specific outcomes, the main findings of this study will draw broad interest.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study aims to explore sexual dimorphism in tolerance to nutrient deprivation using Drosophila larvae as a model. In particular the authors focus on sex differences in the larval fat body. They show that ATF4, an ISR transcription factor, has higher mRNA levels in female fat bodies. ATF4 transcriptional activity is also higher based on a reporter of ATF4 function, where this female bias in expression is influenced by sex determination factors. When the authors

      Overall, this study is interesting, as it identifies previously unrecognized sex-specific regulation of ATF4, an important transcription factor that mediates cellular stress responses. The study also shows that sex determination genes regulate ATF4. However, I have concerns about the paradigms of nutrient deprivation used in the study, and about data interpretation and statistical analysis that should be addressed prior to publication to support the authors' conclusions.

      Major concerns

      1. One major concern that I have about the sexual dimorphism in tolerance to nutrient deprivation is that the papers cited by the authors, and paradigms that are used broadly in the field, all use adult flies. The authors must show that in larvae, a completely different life stage from their citations, there is a sexual dimorphism in tolerance to nutrient deprivation.

      Interestingly, Diaz et al 2023 (Genetics) show that male larvae have greater fat stores than female larvae. Considering fat is the main determinant of tolerance to nutrient deprivation it's not clear that females will actually survive nutrient deprivation longer as larvae. This is an essential test of whether female larvae do have increased tolerance to nutrient deprivation, which is the basic foundation of the authors' model. 2. Another concern is the way that the authors "genetically induce nutrient deprivation using methioninase overexpression". As they acknowledge in the discussion (Line 381-390), methioninase overexpression will have many cellular effects. While there is no doubt that methionine levels would be lower in their model, it is less certain whether this is the main driver of the male-specific lethality.

      There are two potential solutions to this problem. First, the authors could change the text throughout the paper to more accurately describe their paradigm as "methioninase-induced lethality" rather than "nutrient deprivation". This would limit the scope of their scientific question and the conclusions they draw, but would eliminate the need for more experiments.

      The second solution would be to complete experiments to establish the following points: i) methioninase overexpression causes all the classical features of nutrient deprivation (e.g. changes to canonical signaling pathways such as TOR); ii) using other genetic means of nutrient deprivation such as slimfast-RNAi to see if those manipulations phenocopies the male-specific lethality they see with methioninase overexpression; iii) testing a role for ATF4 in mediating sex differences (if any) in other contexts such as slimfast-RNAi. This will take 2-3 months but is essential to draw any conclusions about whether their paradigm is truly a model for nutrient deprivation. 3. Another important point is that the authors state that sexually dimorphic ATF4 activity in the fat body is instructed by sexual identity in a cell-autonomous manner. Despite a clear decrease in ATF4 reporter levels in tra mutants, the fat body-specific tra-RNAi effect on the ATF4 reporter was less convincing. Together with the fact that changes to tra in the fat body affect insulin secretion from the insulin-producing cells, it is possible that the effect on ATF4 is not cell-autonomous. To conclusively test if sexual identity regulates ATF4 in a cell-autonomous manner the authors should use the flp-out system to make Tra-expressing or tra-RNAi-expressing clones in the fat body. This would take approximately 1.5 months to make the strain and test this. 4. As the authors show for the UAS-methioninase, other UAS lines used in the paper such as UAS-traF, UAS-tra-RNAi, UAS-dsx-RNAi may have leaky effects on gene/reporter expression. The authors must include a UAS only control to establish that the tra-RNAi, UAS-traF, UAS-dsx-RNAi do not affect gene/reporter expression. 5. I have concerns about the statistics used. In the methods and legends only t-tests are mentioned; however, when three groups are compared a one-way ANOVA with post-hoc tests must be used to correct for multiple comparisons. To compare differential responses to genetic/environmental manipulations between the sexes, a two-way ANOVA must be used. For example, to conclude that males and females have different responses in the two-way ANOVA, there must be a significant genotype:sex interaction. The p-values for comparisons between genotypes in either the one-way or two-way ANOVA must be derived from post-hoc tests within the ANOVA analysis.

      Minor points

      1. Please ensure to make the reader aware of which life stage was tested in the literature cited supporting sexually dimorphic tolerance to nutrient deprivation.
      2. Published data about sex-specific mechanisms of metabolic regulation mean that the introduction should be more fully cited than it is. Even in the introduction "the molecular basis of these differences and how they impact tolerance to nutrient deprivation is still under investigation" is inaccurate, as there are published studies identifying some mechanisms (work on gut hormones and sex-specific effects on starvation resistance and body fat, role of ecdysone on body fat and feeding, sex-specific roles for brummer and Akh in regulating body fat, intestinal transit and gut size and feeding). Please adjust the paper to acknowledge this growing body of knowledge.
      3. Please list the diet ingredients per L so that individuals can replicate the diet easily.
      4. Please cite grant numbers for all the community resources (e.g. Bloomington, DSHB), and please acknowledge FlyBase and its grants as well. For example, here are the instructions for citing BDSC https://bdsc.indiana.edu/about/acknowledge.html and similar instructions are available for the other resources.

      Significance

      This study identifies for the first time the sex-specific regulation of ATF4, and reveals the sex determination genes that mediate this effect. A strength of the study is the characterization of sex-specific ATF4 regulation. Limitations of the study include the paradigm for nutrient deprivation, need for additional controls, and statistical analysis. If the concerns above are addressed, this study will be of interest to researchers studying organismal and cellular stress responses, stress signaling, and builds upon a growing body of knowledge of sex differences in stress responses (e.g. autophagy, infection responses).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1:

      Evidence, reproducibility and clarity

      The manuscript "WWOX deficiency impairs neurogenesis and neuronal function in human organoids" by Aqeilan and co-workers provides impressive set of studies, mostly utilizing cerebral organoids (CO), gaining insights into the roles of the gene WWOX in neuronal development and molecular etiology of WOREE and SCAR12, two devastating rare diseases originating from mutations in WWOX. Further, therapeutic modalities through the neuron-specific gene therapy are investigated using the WWOX k/o and WOREE and SCAR12 patient-derived COs. Among the major findings of this work one can highlight the identification of the main source of WWOX-expressing cells as radial glia (RG) cells; the discovery of the massive upregulation of Myc upon loss/decrease in WWOX expression in RGs; and the strong neuronal under-differentiation induced upon WWOX k/o and mutations. Regarding the latter finding, the authors report massive increase in RGs and concomitant drop in neuronal cells in WWOX k/o COs. In contrast, in WOREE and SCAR12 patient-derived COs, a more subtle under-differentiation is seen. Specifically, while WOREE but not SCAR12 patient-derived COs also show a certain increase in the RG proportion, both types of patient-derived COs demonstrate higher proportions of "young" neuronal cells as compared to wild-type COs. Thus, a picture can be drawn whereas complete loss of WWOX leads to strong under-differentiation mostly manifested as expansion of RGs and hence under-production of neuronal cells, while hypomorphic loss-of-function of WWOX in WOREE and in missense mutations in SCAR12 lead to the later defect in neuronal cell maturation. Overall, I find the work highly interesting, but I would like the authors to address one major issue and several minor ones.

      Major Comments

      Comment____:

      The major issue is related to the overall model the authors seem to build based on their data - or at least the overall model the reader may get from the paper. This model suggests that the loss / decrease in WWOX levels in RGs leads to Myc overexpression, that in turn affects the cell cycle and prevents neuronal differentiation. This model is highly attractive, but is probably incomplete, in the sense that it does not fully recapitulate the complicated picture. Indeed, all three types of mutated WWOX COs (WWOX k/o, WOREE patient-derived organoids, and SCAR12 patient-derived organoids) demonstrate strong - but equal levels of Myc upregulation. Yet the under-differentiation in each of these three types is different, as described above, and the disease manifestations among WOREE vs. SCAR12 patients are also different. Thus, another player (in addition to Myc) must be at place, that is differentially affected by the partial null mutations in WOREE and missense mutations in SCAR12. This point - ideally to be addressed experimentally - should be at least faced directly by the authors in the Discussion. Perhaps they can already point to such additional players based on their transcriptomics analysis.

      Response____:

      We thank the reviewer for this important point. We agree with the reviewer that the model of WWOX loss / decrease levels in RGs leading to MYC overexpression is incomplete, and that it is a limitation of our model. It seems plausible that other players have a high impact on the genotype and are potentially differently affected, resulting in this complexed phenotype. Following the reviewer advice, we plan to address this in the discussion as a limitation of the model, and we will compare how the expression levels of MYC change based on the genotype in comparison to the WT, using the single cell RNA-sequencing data. We would also like to clarify that MYC upregulation we observed in the patient lines in SOX2+/MYC+ populations, does not quantify expression levels of MYC, but rather positive/negative nuclear staining, in contrast to the high-resolution of scRNA-seq data.

      Minor Comments

      Comment____: 1. It would be useful if a table (perhaps supplementary) describing the details of the WWOX__ mutations__ in all the COs models studied in this paper were presented.

      Response____:

      We thank the reviewer for this suggestion, and we plan to prepare a table summarizing all the mutations in the COs models presented in the paper.

      Comment____: 2. For the new WOREE individual with complex genetics in WWOX: it is not clear why any WWOX protein is still present in this patient in Fig. S1D (please give an explanation or speculation); it is not clear which tissue was used for the Western blot in Fig. S1D; the data in Fig. S1D need to be quantified.

      Response____:

      We thank the reviewer for their observation and would like to clarify that the ‘upper’ band seen in WWOX bands in a nonspecific one that appears in the parent lines and the mutant offspring. We will quantify the WB levels and clearly state that they are the IPSCs in the figure legend.


      Comment____: 3. Western blot, quantified, should be performed on all COs under study, to compare the WWOX expression levels. Please also change the immunofluorescence shown in Fig. 1B (e.g. show WWOX in a different color), as the figure provided shows WWOX poorly in wild-type CO, and it is not clear how much it is removed in the mutant organoids. Why should there be no signal in the SCAR12 COs?

      Response____:

      We thank the reviewer for their observation, we will provide protein levels of WWOX in patient and KO cerebral organoids which will better clarify the decreased WWOX levels, specifically in SCAR12 (see WB figure below). We will also perform any necessary changes to the figure to enhance visualization of WWOX.


      Significance

      The manuscript "WWOX deficiency impairs neurogenesis and neuronal function in human organoids" by Aqeilan and co-workers provides impressive set of studies, mostly utilizing cerebral organoids (CO), gaining insights into the roles of the gene WWOX in neuronal development and molecular etiology of WOREE and SCAR12, two devastating rare diseases originating from mutations in WWOX. Further, therapeutic modalities through the neuron-specific gene therapy are investigated using the WWOX k/o and WOREE and SCAR12 patient-derived COs. Among the major findings of this work one can highlight the identification of the main source of WWOX-expressing cells as radial glia (RG) cells; the discovery of the massive upregulation of Myc upon loss/decrease in WWOX expression in RGs; and the strong neuronal under-differentiation induced upon WWOX k/o and mutations. Regarding the latter finding, the authors report massive increase in RGs and concomitant drop in neuronal cells in WWOX k/o COs. In contrast, in WOREE and SCAR12 patient-derived COs, a more subtle under-differentiation is seen. Specifically, while WOREE but not SCAR12 patient-derived COs also show a certain increase in the RG proportion, both types of patient-derived COs demonstrate higher proportions of "young" neuronal cells as compared to wild-type COs. Thus, a picture can be drawn whereas complete loss of WWOX leads to strong under-differentiation mostly manifested as expansion of RGs and hence under-production of neuronal cells, while hypomorphic loss-of-function of WWOX in WOREE and in missense mutations in SCAR12 lead to the later defect in neuronal cell maturation.

      Overall, I find the work highly interesting, but I would like the authors to address one major issue and several minor ones.

      Response____:

      We sincerely thank the reviewer for their thoughtful and constructive comments, which have greatly helped us improve the clarity and rigor of our manuscript. We appreciate the recognition of our work’s significance and the careful evaluation of both our major findings and methodological details. We have addressed all the raised points to the best of our ability and believe the manuscript will be substantially strengthened as a result. We are grateful for the reviewer’s time and valuable insights.

      Reviewer 2:

      Evidence, reproducibility and clarity

      Summary

      In this study, Steinberg et al aim to elucidate the role of WWOX in human neurogenesis and model WOREE and SCAR12 syndromes which are rare neurodevelopmental disorders. They chose to investigate its function in human brain organoids after generating WWOX KO and patient-derived iPSC lines. Their major finding is that radial glial cells, the main neural progenitor population during corticogenesis, are affected. Via single-cell-RNA-sequencing, they try to decipher the perturbed molecular mechanisms identifying MYC, a proto-oncogene, as a major player. At the end of their study, they proceed to gene therapy restoration and suggest that this could become a potential therapeutic intervention for WOREE and SCAR12 syndromes. The study aims to elucidate major cellular and molecular mechanisms that modulate neurodevelopment and neurodevelopmental disorders. Although sc-RNA-seq could potentially be of great interest and unravel major mechanisms, the authors do not follow this part, but only discuss potential future avenues. Here are some suggestions that could be useful to the authros.

      Major comments

      Comment____:

      A big part of the paper focuses on generating the iPSCs and characterizing the generated brain organoids and gene restoration of the phenotype via restoration of the WWOX gene expression (Fig.1, Fig.6, Fig.S1, Fig.S8, Fig.S10 and potentially Fig.S9 - this figure is not included) however, this has already been done by the same authors (first and last authors) in a previous publication. What are the differences in the line that have been generated in previous publication (Steinberg et al 2021, EMBO Mol. Med.)? If there are differences, the authors should make a thought comparison and explain why they generated different lines. If there is no difference, the authors should reduce to minimum this part and place it to supplementary.

      Response____:

      We thank the reviewer for pointing this out. We would like to clarify that some iPSC lines used in this publication were not introduced in the previous one (Steinberg et al 2021, EMBO Mol. Med.), including the wildtype JH-iPS11, and the new compound heterozygous WOREE line LM-iPS. In this paper, we aimed to widen our understanding of the effects of WWOX mutations through advanced techniques not applied before, and by adding these lines we were able to better generalize our findings as we did not depend on a single patient for both WOREE and SCAR12. Additionally, WWOX rescue in this current paper relies on AAV9-hSynI targeting that is more clinically relevant to gene therapy, as opposed to lenti-WWOX and AAVS1 WWOX in the previous publication. We will include the differences in a summary table.

      Comment____:

      • Fig.1E: in the pictures shown, the majority of the Satb2+ cells are colocalized with SOX2. Although a small portion of neurons have been shown from many studies that in brain organoids are co-localized to SOX2, in the pictures depicted this percentage is big. Also in ctrl condition the VZ-CP like areas are not easily recognized. The authors should check if this co-localization is a more general phenotype and if not choose more representative pictures.

      Response____:

      We thank the reviewer for their observation. We will check for the presence of a trend of colocalization between SATB2 and SOX2 and address the concern experimentally if needed. We will also choose pictures that better display VZ-CO areas in the control line.

      Comment____: Information about the number of organoids per batch used in each figure is not included. This needs to be added for each experiment. Data (at least the majority of them) should be collected from brain organoids from at least two batches.

      Response____:

      We thank the reviewer for their point, and we plan to better clarify the technical parts of the experiments, and if needed will include data from more batches.

      Comment____: The expression of WWOX in cortical development has been shown in the previous publication. Although sc-RNA data are validating the previous data and are adding more information, these data should be put as supplementary. Besides, in Fig.3G where authors aim to compare WWOX expression to MYC that fits nicely with their results depicting MYC as the most affected gene in KO and mutant line, when one looks at the WWOX expression only it seems that its expression is higher in CP that VZ. This is contrary to the conclusion that WWOX is mainly characterizes RGs. Why is that? Authors should at least discuss this.

      Response____: We agree with the reviewer that Fig.3G can be misleading, and we acknowledge that it can lead to the opposite conclusion of WWOX being mainly characterized in RGs. In Fig.3G the x axis displays positive values on the left, and negative scores on the right. Following the reviewer’s suggestion, we modified the graph to show positive values on the right, demonstrating how WWOX expression is higher in the VZ compared to the CP.

      Comment____: In this study, authors show that progenitors are reduced in WWOX-KO organoids, however in the previous publication SOX2 population is not majorly affected. Why are there such differences? Given that RGs are the main population affected as authors propose in this study, these differences must be at least discussed. Similar comments regarding neurons: in previous publication there is a minimal reduction of neurons in WWOX-KO brain organoids, while here authors describe major differences.

      Response____:

      We thank the reviewer for their remark and agree that it should be mentioned in our discussion. We believe that this is unfortunately due to the inherent issue of heterogeneity between organoids and could partly be attributed to the difference in the age of organoids at that timepoint (week 10 organoids in previous paper, week 7 organoids in this one), and the difference in control lines (WiBR3 hESCs vs JH-iPSCs). Additionally, while percentages of SOX2+ populations in WT organoids vary between the previous publication and this one, WWOX-KO organoids display similar levels of SOX2 upregulation in precious and current papers: 69% and 74%, respectively. We would also like to point out that calculations in scRNA-seq data convey the phenotype at a much higher resolution, as it identifies radial glia populations that are not necessarily SOX2+, further strengthening the validation of the SOX2+ RG quantifications that are present in this study.

      Comment____: Data from sc-RNA-seq analysis highlighting MYC as major differentially regulated gene are very interesting and seem to be key to the molecular pathway affected as authors suggest. Authors also validate this with immunostainings in brain orgnaoids. However, in Fig.3J MYC expression in ctrl is not depicted, even though in the respective graph it seems that 20% of SOX2+ cells co-express MYC. Please choose a more representative picture.

      Response____:

      We agree with the reviewer’s comments that the current image size and resolution limits the ability to appreciate the MYC staining in the control, and plan to use a more representative figure of the phenotype.

      Comment____: One of the main findings in this study is the cell cycle changes observed in WWOX-KO and mutant organoids. Given that the major novelty of the publication is the cellular and molecular mechanism implicated in WOREE and SCAR12 syndromes, authors should perform additional experiments towards this direction. One suggestion would be to perform stainings in brain organoids using markers of the different cell cycle phases (eg. KI67, cyclin a, BrdU/EdU, ph3). Also, treatment of organoids with different BrdU/EdU chase experiments would be important so as to measure exactly the length of each cell cycle phase.

      Response____:

      We appreciate the reviewer’s suggestions and plan to validate the findings through staining and quantifying percentages of proliferative RGs in WT vs mutant WWOX lines.

      Comment____: Regarding the molecular cascade, is WWOX directly affecting MYC of Wnt genes? Do they have information on upstream and downstream factors in the affected molecular pathway?

      Response____:

      We thank the reviewer for highlighting this important point. To address this question, encouraged by our results, we will compile genes belonging to the regulon of MYC and study the upstream and downstream factors in our transcriptional data. Additionally, we will look at protein expression levels of WNT genes in our organoid samples.

      Comment____: Restoration of phenotype via reinsertion of WWOX gene has already been done in the previous publications by the same authors. But what about MYC? Is MYC manipulation able to rescue the phenotype?

      Response____:

      We thank the reviewer for this insightful suggestion. We fully agree that understanding the role of MYC in the observed phenotype is of great interest. However, due to the essential and widespread role of MYC in both radial glia and neurons, we refrained from direct perturbation of MYC levels—either through knockdown or overexpression—as such manipulations may have broad, uncontrolled effects that could confound the interpretation of our findings. The potential deleterious consequences of MYC modulation in radial glia have been originally discussed in the Discussion section of the manuscript. In our revisions, we will further explore the role of MYC regulons in our scRNA-seq dataset to better understand their contribution to the WWOX-related phenotype.


      Comment____: Finally, MYC association to ribosome biogenesis as mentioned by the authors in discussion is very interesting. The authors should consider investigating this direction, as it will be a great addition to the mechanisms that regulate WOREE and SCAR12 syndromes which is the main focus of this study.

      Response____:

      We thank the reviewer for highlighting this point, and we agree that MYC's association with ribosome biogenesis is a fascinating topic to discuss. This could be connected to the alterations of the proliferative potential and to the anabolic state of the cell, and we plan to expand the discussion of this observation and its implication in the context of RGs and neurons*. *


      Minor comments

      Comment: - Line 115: authors say that the data they discuss are found in Fig.S2A, maybe they mean Fig.S1A?

      Response:

      We thank the reviewer for their observation, we will correct Fig.S2A to Fig.S1A and B.

      Comment: - Fig.S9 is missing, in the current version this Fig is the same with Fig.S10. Please change it.

      Response____:

      We thank the reviewer for pointing this out and apologize for this oversight. We acknowledge the error and will correct the duplication by replacing Fig. S9 with the intended figure in the revised version of the manuscript.


      Significance

      This study is the continuation of a previous publication the authors have published. The topic is very interesting and novel especially in modelling neurodevelopmental disorders in a human context, however, given that the main phenotype has already been published, the authors should include more effort in the molecular cascade. Clinical interventions if the molecular cascade is described would be of great importance to the field.

      Response____:

      We sincerely thank the reviewer for their thoughtful, constructive, and detailed review. We appreciate the time and effort taken to carefully read our manuscript and provide insightful suggestions, taking into consideration also our previous published work. The suggestion raised, especially regarding MYC-WNT axis and its potential link to ribosome biogenesis, will help us clarify, strengthen, and expand the scope of our study. We have carefully addressed each of the points raised and have incorporated the necessary experimental validations, clarifications, and revisions accordingly. We believe these changes have substantially improved the manuscript.

      Reviewer 3:

      Summary:

      The manuscript by Steinberg and colleagues describes cellular and molecular changes linked to mutations in WWOX, a gene implicated in rare neurodevelopmental disorders, WOREE and SCAR12 syndromes. By comparing immunofluorescene and single cell trascriptomics of unguided brain organoids from control and WWOX-knockout iPSCs, as well as 2D NSCs and in vivo fetal brain expression datasets, the authors identified radial glia as relevant cell types in which WWOX is expressed and affected by WWOX deficiency. Using immunofluorescence, single cell trascriptomics analysis and western blotting on week 16 organoids, the authors show that WWOX deficiency results in increased abundance of radial glia cells at the expenses of neuronal production. These changes are accompanied by accumulation of cells in G2/M and S phases, overexpression of c-MYC and Wnt activation. In addition to this, the authors characterize unguided brain organoids generated from iPSCs reprogrammed from patients affected by WOREE or SCARE12 syndromes. Using immunofluoresce and single cell trascriptomics, they find that, while RG abundance changes were very modest, patients's iPSC-derived neurons are enriched for signatures related to early development, suggesting delayed differentiation. Finally, the authors use patch clumping, calcium imaging and gene therapy in 16 weeks old organoids derived from control and patients-derived iPSCs, to demonstrate that WWOX restoration normalized hyperexcitability phenotypes in both WOREE and SCAR12 organoids. These results thus provide a proof-of-concept evidence that WWOX restoration in human cells is a valid strategy to correct for hyperexcitability pehnotypes in WWOX related syndromes.

      Major ____C____omments

      The study's main conclusions regarding neurodevelopmental phenotypes linked to WWOX deficiency and genotype-phenotype relationships are based on iPSC-derived brain organoid models analyzed using immunofluorescence, single-cell transcriptomics, and excitability recordings (cell-attached patch clamping, calcium imaging). While the analyses involve a diverse collection of iPSCs and two time points (7 and 16 weeks), the study falls short in providing sufficient experimental details and validation to fully support its conclusions. Additional quantification, replication, and functional validation would be necessary to solidify the study's conclusions. Some of these validations are achievable within a reasonable timeframe, while others would require a more substantial investment of time and resources as detailed below.

      Comment____:

      A key concern is the lack of experimental details and replicability. Number of individual organoids, number of images per organoid for IF, and whether multiple batches were used are only partially provided. While the authors report generating multiple WWOX knockout clones, the legends and methods do not specify whether multiple clones were used across different organoid experiments. The study states that four organoids were used for scRNA-seq, but it is unclear whether this means four organoids per genotype or one organoid per genotype was analyzed. These ambiguities make the claims appear rather preliminary.

      Response____:

      We thank the reviewer for pointing this out, and we acknowledge that the clarity of our description of the batch used in each experiment can be improved. Therefore, we will provide all these details, adding information on additional batches adopted for the different validations that were not included in the manuscript.

      Comment____:

      Another issue is the limited validation of scRNA-seq observations. Since scRNA-seq is often performed on a limited number of organoids, orthogonal validation is crucial to strengthen the findings. For example, changes in radial glia abundance and neuronal production observed in scRNA analyis (Figure 2-5) could be validated using immunofluorescence across genotypes and batches. Currently, IF stainings for Sox2 and TUBB3 are shown only at 7 weeks in Figure 1B, but no quantificative assessment is provided. Also, it is not clear if quantifications provided in Figure 1F refer to multiple organoids or batches.

      Response:

      We thank the reviewer for this important point. We would like to clarify that in Figure 1B, TUBB3 staining is primarily used for visualization purposes to provide anatomical context and delineate the overall architecture of the organoids, rather than for quantitative assessment of neuronal output. As such, the focus of our quantification in Figure 1F was on SOX2+ radial glial cells. That said, we agree that clearly stating the number of organoids and batches used in the quantification is important, and we will include this information in the figure legend for clarity.

      Comment____:

      Furthermore, the observations on cell cycle arrest, DNA damage, senescence, metabolic alterations, Wnt activation obtained via scRNA-seq could be further validated on organoid tissues using specific antibodies that the lab used before (e.g. yH2AX antibody in PMID: 34268881) or assays that have been developed elsewhere (some examples are reviewed in PMID: 38759644). As for feasibility, immunofluorescence validation of existing tissues is realistic, requiring validated antibodies and procedures, some additional imaging time and analysis (estimated 1-2 months, with some budget to purchase antibodies and cover imaging time costs). Feasibility of efforts related to validation across organoids and batches depends on the number of organoids used so far and available tissues. Generating new organoids would be indeed more time-consuming (≈ 6 months) and expensive (but extact costs would depend on number of clones, organoids and batches used), but feasible.

      Response____:

      We appreciate the reviewer’s thoughtful feedback and for drawing our attention to the review by Sandoval, Soraya O., Anderson, Stewart, et al. We also thank the reviewer for their suggestions and intend to explore the proposed modifications through immunostaining, particularly to address questions related to cell cycle changes, and Wnt pathway. However, regarding DNA damage, senescence, and cell cycle arrest, we do not believe additional validation is necessary, as our current manuscript does not present findings related to these aspects.


      Comment____:

      Another limitation is the lack of functional relevance of MYC alterations. The study confirms increased MYC expression via both scRNA-seq and immunofluorescence in organoid tissues. However, these results remain correlative and demonstrating the functional requirement of MYC overexpression in mediating WWOX-deficiency-related changes would significantly strengthen the study's conclusions. This would require additional differentiation experiments, including MYC overexpression or knockout models, to assess its direct impact. These efforts would represent a major conceptual advance by linking RG effects to MYC function and highlighting MYC-related therapeutic directions. These additonal experiments would require a substantial investment to generate the necessary regents (e.g. WWOX-KO and WT iPSCs with altered MYC levels) and additional time and costs for organoid analysis, mostly by immunofluorescence (estimated 6-8 months).

      Response____:

      We thank the reviewer for this insightful comment and fully agree that elucidating the functional contribution of MYC alterations in the context of WWOX deficiency would represent a major conceptual advance. We acknowledge that our current findings are correlative, based on scRNA-seq and immunostaining, and that direct manipulation of MYC could help establish causality.

      However, due to MYC’s essential and pleiotropic role in both progenitor and neuronal populations—including its regulation of cell cycle, metabolism, and apoptosis—we refrained from genetic overexpression or silencing approaches in this study. Such perturbations often lead to widespread, non-specific effects that can obscure the interpretation of lineage-specific phenotypes, particularly in a complex model like brain organoids.

      That said, we agree that further insight into the functional role of MYC is crucial. To this end, we plan to leverage our scRNA-seq dataset to analyze the activation state of MYC regulons across genotypes and cell types, and to assess how these regulons intersect with cell cycle dysregulation observed in WWOX-deficient radial glia. We also aim to integrate available transcriptomic data from primary cortical tissue to support the relevance of MYC pathway alterations in human development. While these analyses cannot replace experimental perturbation, we believe they can provide strong, hypothesis-generating evidence for MYC’s mechanistic involvement and help prioritize targeted experiments in future studies.

      Comment____:

      Another issue is the lack of patterning analysis in unguided organoids, which are known to exhibit high variability in regional identity (PMID: 28283582). While the authors acknowledge this limitation to some extent-abstaining from fine-resolution analysis (Lines 173-174)-this variability, combined with the limited number of organoids used, could be a major confounding factor in the phenotypic analyses, even at a broad resolution. Indeed, some of the reported differences across genotypes may stem from variability in organoid patterning rather than true genotype-driven effects. For example, the reduced SATB2 expression in KO and patient-derived organoids from Figure 1E-F could result from impaired cortical patterning rather than a direct effect of WWOX deficiency. Additionally, in Figure 6D and 6E, the fact that WOREE iPSC-derived organoids - but not SCAR12 organodis- show lower levels of both CTIP2 and SATB2, might reflect a shift toward a non-cortical identity rather than a direct WWOX-dependent phenotype. To rule out patterning variability as a contributing factor, the authors should analyze organoid regional identity across genotypes using immunostaining for dorsal and ventral forebrain markers. This would allow a more solid inference of genotype-specific effects on neurodevelopmental phenotypes. Patterning validation can be performed on existing organoid tissues (week 7) using validated antibodies (PMID: 28283582). As such, this analysis is expected to be relatively straightforward and feasible in a few weeks time. If the generation of new organoids is needed, such patterning validation should still be relatively feasible, as week 7 organoids are ideal for assessing regional identity. Analysis of patterning effects should also extend to 2D NSC cultures. In the 2D NSC models derived from WWOX-KO lines (Figure 3L, Figure S4A), the differentiation protocol includes patterning factors that promote ventral fates (SAG and IWP2). Interestingly, the quantification of MYC expression from unguided organoids and 2D NSCs (Figure 3K-L) reveals a major difference in the fraction of MYC-positive cells in WT conditions across the two culture models. A possible changes in the dorsal and ventral patterning of 3D and 2D cultures might explain these differences and implementing immunostaining for patterning markers in 2D would help clarify patterning contributions.

      Response____:

      We thank the reviewer for this thoughtful and constructive comment. We fully agree that regional identity variability in unguided cerebral organoids is a well-recognized challenge, and that systematic assessment of dorso-ventral patterning is important to confidently interpret genotype-driven phenotypes.

      We would like to clarify that the cerebral organoid protocol used here has consistently been shown to favor a dorsal forebrain identity (PMID: 23995685, 28562594, 32483384, 33328611), and in our previous work (PMID: 34268881), we demonstrated that WWOX mutations did not substantially alter dorsal identity in this model. Nevertheless, to directly address the reviewer’s concern, we plan to perform additional immunostaining for regional patterning markers on our existing week 7 organoid tissues and explore our scRNA-seq data to evaluate potential shifts in regional identity and rule out patterning-related confounders.

      Regarding the 2D NSC cultures, while the differentiation protocol included the ventralizing factor SAG, it did not include IWP2. We acknowledge the importance of validating patterning outcomes in this model as well and will do so using immunostaining.

      Comment____:

      There are also some concerns regarding WOREE and SCAR12 phenotypes. First, the genotypes of the patient-derived iPSCs are not clearly defined, making it difficult to establish clear genotype-phenotype relationships. The study uses iPSCs from four different patients (2 WOREE, 2 SCAR12), some of which were validated in a previous study (PMID: 34268881). However, it remains unclear how they were validated, and detailed genomic alterations of the four patients are not explicitly reported. Additionally, it is uncertain whether all variants result in a full loss of WWOX function, as protein loss evidence is only provided for one WOREE patient (Figure S1D). Also, the authors state that SCAR12 should have a milder phenotype (line 168), but it is unclear whether this claim is based on clinical evidence or genomic data from these specific patients. To improve genotype-phenotype comparisons, the authors should consider including a clear schematic summarizing the genomic alterations in all patient-derived lines and their expected disease severity.


      Response____:

      We thank the reviewer for this suggestion, and we agree that including a schematic summarizing the genomic alterations in all patient-derived lines and their severity will improve the genotype-phenotype comparison. We will include this clarification and provide additional information on how the mutations affect the protein level, and the genotype-phenotype correlations in WWOX mutants based on clinical and genetic evidence.


      Comment____:

      Second, the experimental design lacks appropriate controls for patient-derived iPSCs. All patient-derived iPSC comparisons are performed against a single reference male iPSC line, which is neither isogenic to WOREE nor SCAR12 iPSCs. This complicates the interpretation of differences between healthy and patient-derived organoids, as well as comparisons between WOREE and SCAR12 phenotypes. Given this design, it is impossible to draw solid conclusions about genotype-phenotype relationships. A more robust approach would involve including multiple healthy controls to account for genetic background variability or using isogenic parental or genetically corrected lines, which would provide a cleaner genetic comparison. A recent study (PMID: 36385170) discusses different study designs that could strengthen this aspect and might be useful for the authors to consult.


      Response____:

      We thank the reviewer for highlighting this and pointing us to the work of Brunner, Lammertse, van Berkel et al. While we agree that isogenic controls for each mutant line would be the ideal wild-type reference, generating these through genomic editing is particularly challenging, specifically for the compound heterozygous mutants. Instead and as suggested, we plan to include additional wild-type lines derived from healthy individuals, collected from different batches. We will use these to validate our key findings, including analyses of RG and SATB2+ cell populations, as well as MYC expression through immunofluorescence.


      Comment____:

      Third, the study presents seemingly conflicting results regarding WOREE and SCAR12 phenotypes. The authors present immunofluorescence (IF) and scRNA-seq data indicating that changes in radial glia (RG) abudance are not observed in these patient-derived organoids. However, using same methodologies, they indicate that neuronal production is affected, leading to the accumulation of early neuronal signatures in both WOREE and SCAR12 neurons. The study does not explore whether RG signatures might be altered in a way that could contribute to neuronal phenotypes. Also, Figure 1F suggests that while Sox2+ cell counts are not increased in SCAR12 organoids, SATB2 levels are still altered, indicating that Sox2 and SATB2 trends are not tightly coupled across genotypes.

      Furthermore, Figure 1 and 6 show that while both syndromes exhibit similar hyperexcitability, data in Figure 6 report that only WOREE organoids display reductions in SATB2 and CTIP2 counts and that this can be rescued by WWOX restoration. Some of these discrepancies could stem from patterning variability as discussed above. Also, neuronal firing rate across WOREE and SCAR12 iPSC-derived organoids (Figure 6B) was different at later stages, but was rather comparable at an earlier stages (Figure S1G). The reasons for these differences are not thoroughly discussed.

      To strengthen the discussion, the authors should address how RG alterations (if any) might contribute to neuronal phenotypes, provide a more detailed comparison between WOREE and SCAR12 organoids and the WWOX-KO model and elaborate on the distinct phenotypes of the two syndromes, including possible explanations for observed functional and molecular discrepancies.


      Response____:

      We thank the reviewer and agree that further investigation into the proposed link between WWOX deficiency and MYC-related alterations in radial glia would provide deeper insight into the downstream effects on neuronal populations. To this end, we will first illustrate how our model of radial glia alterations accounts for changes in neuronal production without affecting overall RG abundance. Second, we will expand our comparisons of RGs and MYC expression using patient-derived and control single-cell RNA-sequencing datasets. Third, we will address the discrepancies between neuronal hyperexcitability and SATB2/CTIP2 counts more comprehensively. Notably, while SATB2 is an early marker for several cortical neuron subtypes, it is not expressed in all neurons. In contrast, SOX2 is considered a pan-radial glia marker, which may help explain the differing expression trends observed.

      Comment:

      Lastly, the conclusions drawn about WWOX restoration via gene therapy are weakened by the lack of replication and validation (see points above).

      First, the authors claim successful WWOX restoration in neurons, but provide limited evidence that the infected population consists of neurons. NeuN staining (Figure 6A and S10) suggests some neuronal expression, but quantification of WWOX+ NeuN+ / WWOX+ total cells is missing. Given that IF data are already available, this additional quantification could be completed within a few weeks and would significantly strengthen the claim.


      Response____:

      We thank the reviewer for the suggestion, and based on this, we will quantify the WWOX+ NeuN+ / WWOX+ total cells, incorporating data from additional batches to strengthen the analysis.


      Comment____:

      Second, the rationale for restoring WWOX in neurons is unclear, given that WT neurons do not normally express WWOX. Is WWOX being considered a functional neuronal maturation factor? If so, this should be explicitly discussed in the manuscript.

      Third, the authors propose that WWOX deficiency might lead to a delay in neuronal maturation. However, to demonstrate delayed maturation, the study should show that, given additional time, affected organoids can eventually produce late-stage neuronal signatures. Since this additional experiment may be technically challenging and time-consuming, the claim should instead be rephrased as speculative and discussed accordingly in the text.


      Response____:

      *We thank the reviewer for highlighting this point. We will discuss and re-phrase the rationale for restoring WWOX in the neurons and the WWOX deficiency-associated delayed maturation. *

      Comment____:

      Lastly, the study lacks key details necessary for reproducibility in multiple aspects. In addition to details about organoid numbers and batches discussed above, all IF images are shown as insets, making it difficult to assess broader reproducibility within the whole organoid tissue section. Also, whether distinct iPSC clones/sections/organoids were used across IF experiments - which is critical for ensuring reproducibility - is not specified.


      Response____:

      We thank the reviewer for mentioning this problem. We will include the details needed for reproducibility, including the number of batches and organoids.

      Comment____:

      As for details about experimental and bioinformatics methods, the bioinformatics pipeline is not fully described, making it impossible to verify or reproduce the computational analysis. No information is provided regarding batch correction procedures for scRNA-seq data (Lines 695-697) and on how clusters were mapped (lines 695-697) for cell type identification. Legends in Figure 1F, 2K-L, 6B, S10 do not specify what the error bars represent (e.g., standard deviation or standard error). Many catalog numbers for critical cell culture reagents are not provided, which is essential for experimental replication. The Western blot methods lack crucial details.


      Response____:

      *We thank the reviewer for highlighting this point. We acknowledge that our clarity in our methods could be improved, therefore, we will expand the bioinformatics pipeline description, the reagents used, and the details for the Western Blot. *

      Minor comments:

      Comment:

      • One relevant study (PMID: 32581702) that examines WWOX function in rat models and human fetal brains from patients has not been referenced or discussed. Notably, this study characterizes molecular changes associated with WWOX knockdown in human ESC-derived NPCs. Given its direct relevance to the current study, these findings should be acknowledged and integrated into the discussion to provide a more comprehensive understanding of WWOX-related neurodevelopmental alterations. Response____:

      We thank the reviewer for suggesting the work of Iacomino et al. which we are very aware of and shall cite appropriately in our revised version.

      Comment____:

      • For WKO-1C and 2C the exact mutations in exon1 identified by Sanger sequencing are not reported. Also, validation for WWOX protein loss in all the lines used is also missing. Information about cell line genome integrity check are also missing. Response____:

      We thank the reviewer for bringing up these important points. We will provide the exact mutations identified in exon 1 of WKO-1C and WKO-2C as determined by Sanger sequencing and include this information in the revised manuscript. Additionally, we will present data additional data regarding WWOX protein loss in all the cell lines used in the study.

      Comment:

      • Line 116 and 394, reference Steinberg et al is not formatted. Response:

      We apologize for this oversight and the formatting will be fixed.

      Comment:

      • Figure S1A: Localization of WWOX seems to be cytoplasmic and/or membrane-bound in organoids, while staining in IPSCs shows cytoplasmic and nuclear signals. Perphaps an orthogonal valiation with another anti-WWOX antibody would be appropriate to confirm subcellular localization. Response____:

      We thank the reviewer for their comment. WWOX localization was previously confirmed using anti-WWOX HPA050992 (Sigma), as reported in our prior publication (PMID: 34268881). While the images were not included due to a lack of novelty, we acknowledge the importance of confirming the observed patterns. The difference in localization between organoids (cytoplasmic/membrane-bound) and iPSCs (cytoplasmic/nuclear) may be attributed to differences in cell morphology, with RGs in 3D organoid sections exhibiting distinct characteristics compared to iPSCs cultured in 2D (Supplementary Figure S1A). In fact, in 2D cultures of NSCs (Supplementary Figure S4A), WWOX also shows a nuclear localization, similar to iPSCs. We will clarify this point in the manuscript.


      Comment____:

      In Figure 1, authors use week 7 organoids and claim that they are enriched for early born preplate neurons (line 141). However the authors decide to look at SATB2, which is not an eary-born preplate neuron. So while the rationale for using Satb2 is not clear, the reported staining in Figure 1E shows an unusal overlap beetween Sox2 and Satb2 nuclear signals in wt organoids. The authors needs to recheck that the correct antibodies were used in this analysis.

      Response____:

      We thank the reviewer highlighting this. We will better define the rationale for the usage of SATB2 as a marker expressed in many types of young neurons (not specifically preplate neurons), and add DCX as a marker for neurogenesis.

      Comment____:

      Figure 1 Panel F: legend states that n indiactes 3 neurons. Please specifify what n referes to.

      Response____:

      We thank the reviewer for the keen eyes and apologize for this mistake. We will correct the legend and specify that n is indeed referring to organoids and not single neurons.


      Comment____:

      Figure 3J: MYC staining appears to be nuclear in WWOX-KO organoids but more cytoplasmic in SCAR12 organoids. Also in WOREE organoids, both Sox2 and MYC staining appears different from what seen in other panels/ genotypes from the same figure panel.

      Response____:

      • *We thank the reviewer for their comment. Upon repeated staining, we consistently observe this MYC localization across organoids and more. Similarly, the differences in Sox2 and MYC staining in WOREE organoids are reproducible. While these results may seem divergent, they accurately represent the findings. We will, however, review the staining protocols and ensure that representative images are carefully selected to best reflect the data.

      Comment____:

      Figures 3 and related legend: Authors use the term w for weeks but they need to specify whether this refers to gestational weeks or post-conception weeks.

      Response____:

      *We thank the reviewer for pointing this out. We will add in the legend that “w” refers to post-conceptional weeks. *

      Comment____:

      Figure 4: The UMAP in B, E and G seems to be blurred in the bottom parts. Is this an intentional choice? If so, what would be intent? Also, title and legend for E mention metabolic alterations but data presented are not related to metabolic patwhays.

      Response____:

      We thank the reviewer for addressing this. The blurred parts of the UMAP are intentional, we will add a description of why and what it represents.

      Comment____:

      Figure 6. The same exact images from A and C are also reported in Figure S8 and S9 respectively.

      Response____:

      *We thank the reviewer for pointing this out, we will better clarify that figures S8 and S9 are an expansion of the panels shown in Figure 6, showing ROIs per cell line and rather than per genotype. *

      Comment____:

      Figure S1D: WWOX antibody seems to give an extra band at higehr molecular weight. This is also evident from S4B, where the upper band seems overrepresented in KO2. Also, are the healthy parents haploinsufficient for WWOX? what are the levels compared to wt (unrelated) controls?


      Response____:____

      • *We thank the reviewer for raising this point. We will quantify the bands in the WOREE patient samples and compare them to wild-type controls. We would like to clarify that the "upper" band is a nonspecific band, and its overrepresentation in KO2 samples is not indicative of WWOX expression. Additionally, we will address the question of WWOX haploinsufficiency in healthy parents and provide a comparison of WWOX levels to unrelated wild-type controls.

      Comment____:

      Figure S2: In B, what is the difference between top and bottom UMAPs? In C-D, what is NP? Correlation map suggests that the NP clusters 7 and 8 are different from cluster 11. What is the rational for labelling them all NP cluster?

      Response____:

      We would like to thank the reviewer, and we will add a clarification for the differences between top and bottom UMAPs and the rationale behind NP labeling.

      Comment____:

      Figure S6: In the legend, full description of cluster labels are missing. Also legends specifes A-D while the figure contains only A-C.

      Response____:

      We thank the reviewer and will alter the figure and its legend to clarify this.

      Comment:

      Figure S4A: The staining for TUBB3 is very different between KO1 and KO2.

      Response____:

      We thank the reviewer and will examine the pictures and if need be will replace them with more representative pictures.

      Comment____:

      Figure S8: The legend indicates n as 4 organoids but images are not quantified so there is no evidence that these patterns have been replicated in 4 organoids.

      Response____:

      We thank the reviewer for pointing this out. We will add the quantifications of NEUN+/WWOX+.

      Comment____:

      Figure S9: The title is duplicated and not corresponding to the data in the figure. The whole figure is duplicated in Figure S10 (which is wrongly labelled as Figure 10 in the legend).

      Response____:

      We thank the reviewer; we will fix the figures and corresponding titles and legends.

      Comment:

      Line 330: Figure S6 F-H should be corrected in Figure 6 F-H.

      Response____:

      We thank and agree with the observation; we will correct it.

      Comment____:

      Line 353: reference needs to be added for "our earlier findings”.

      Response____:

      We thank the reviewer, and we will re-phrase to clarify.

      Comment____:

      Lines 383 and 392: The authors describe several possible MYC roles but which ones could relevant in this contex is not discussed.

      Response____:

      We thank the reviewer and agree with their observation and would like to clarify that as we are not aware of any relevant literature examining the relationship between WWOX and MYC in non-tumor settings, we refrained from drawing any conclusions in any one direction without further experimental exploration. Nonetheless, we will re-phrase the sentences to draw clearer conclusions.

      Comment____:

      Lines 402 and 403: The authors state that the study "highlights the critical role of Wnt signalling" but they fail to provide evidence that Wnt is functionally involved, as Wnt perturbation experiments are not applied.

      Response____:

      • *We thank the reviewer for their comment. We agree that further clarification is needed regarding the functional involvement of Wnt signaling. While we have previously shown that Wnt is inappropriately activated in WWOX-KO, WOREE, and SCAR12 organoids (PMID: 34268881), and demonstrated Wnt activation in RGs via our scRNA-seq data (Figure 4I), we recognize that direct perturbation experiments would strengthen this aspect. In light of this, we will examine the levels of Wnt target genes in our transcriptomic data to provide more direct evidence of Wnt signaling involvement and its functional relevance in the context of WWOX deficiency.

      Comment:

      Line 473: "at X concentration" needs to be correct to specify the concentration used.

      Response:

      We thank the reviewer for noticing this missing information and we will add the final puromycin concentration (1 mg/ml) .

      Comment____:

      Line 478: The authors state that "inform consent is under approval". Does this mean that the study was conducted before approval was obatined?


      Response____:

      We thank the reviewer for raising this concern. To clarify, approval was obtained prior to the commencement of experimentation. The sentence should read: "Skin biopsies and blood samples were obtained with informed consent, under the approval of the Kaplan Medical Center Helsinki Committee," indicating that the study was conducted in full compliance with ethical requirements, with prior approval from the committee.

      Comment____:

      Line 525: which orbital shaker and which speed was used?

      Response____:

      We thank the reviewer and will add orbital shaker details and speed.

      Comment____:

      Line 537: what is GC in GC/ul?

      Response____:

      We thank the reviewer and clarify that this is the accepted units for viral load. GC is Genome Copies, and this is often used in qPCR assays to estimate the amount of viral genetic material in a sample. It is often used interchangeably with vg/µL (Viral Genome per microliter).

      Comment____:

      Line 629: samples were centrifgues at which speed and for how long?

      Response____:

      We thank the reviewer and will fix to include details about centrifugation.

      Comment____:

      Line 639: "All primer sequence" should be plural.

      Response____:

      We thank the reviewer and will correct the typing mistake.

      Referees cross-commenting

      All four reviews appear fair and complementary to each other. Reviewers have consistently highlighted concerns regarding unclear genomic alterations in patients' iPSCs and experimental reproducibility in organoid cultures, emphasizing the need for further validation of the reported findings and the underlying molecular cascade. Additionally, they have noted some inconsistencies, with Reviewer #2 specifically identifying a major discrepancy in the WWOX-KO phenotypes compared to those previously described by the same team.

      General assessment:

      The strengths of this study lie in its focus on disease phenotypes in a human context and the use of patient-derived iPSC lines, which provide valuable translational relevance. Additionally, the study employs a complementary set of analyses, including functional assays, immunofluorescence (IF), and single-cell RNA sequencing (scRNA-seq), which enhance its depth. However, the study has several critical weaknesses, primarily related to suboptimal experimental design and limited reproducibility. These are discussed in section A and also indicated below:

      • Lack of isogenic controls or patient-derived lines and presence of conflicting data for patient-derived organoids, making genotype-based comparisons for patients' lines less robust; examples of studies using iPSC isogenic controls for dissecting neurodevelopmental disorders are found here (PMID: 35084981; PMID: 26186191).
      • Limited reproducibility, due to a small number of organoids used and the lack of orthogonal validation for key findings.
      • Absence of functional validation for MYC's contribution, making its proposed role unclear. Advance:

      This study builds upon and expands previous efforts by the same team to characterize brain organoid models obtained from patient-derived iPSCs, as well as to explore gene therapy restoration approaches (PMID: 34268881, PMID: 34747138). Some of the bioinformatics analyses appear to have been developed elsewhere, and technically, the study offers only a limited methodological advance.

      Instead, the key advancement of this work is more conceptual: it proposes potential underlying mechanisms of WWOX-related neurodevelopmental disorders. If the study's limitations were addressed, it could provide valuable insights into WWOX's role as a key regulator of radial glia proliferation and differentiation, as well as potential functions in neuronal maturation. These findings would be relatively novel in the context of WWOX-related neurodevelopmental disorders. WWOX has been extensively studied in rodent models, where WWOX -/- mice exhibit growth retardation and brain malformations (PMID: 32000863, PMID: 18487609, PMID: 15026124). Additionally, studies in rats and human fetal cortical tissue from patients (PMID: 32581702) have linked WWOX deficiency to migration defects and cortical cytoarchitectural alterations. Previous work in mice by the same team suggested that neurons are the key population affected, linking WWOX deficiency to hyperexcitability and intractable epilepsy (PMID: 33914858). However, the relevance of radial glia and cell-type specific molecular alterations linked to WWOX mutations have remained poorly defined. Through scRNA-seq, this study offers some insights into cell-type-specific molecular changes, especially in radial glia cells. These changes are linked to MYC fucntion, cell cycle arrest and altered differentiation trajectories. However, these insights remain preliminary due to the study's design limitations.

      Another potential advancement of this study is its exploration of syndrome-specific alterations in WOREE and SCAR12 patients and their rescue through WWOX gene therapy-an aspect that has been difficult to study in animal models and remains largely unexplored. While the brain organoid model offers a promising approach, the true conceptual advance of this study remains uncertain, as its current limitations hinder the ability to draw definitive conclusions.

      Audience: This study could be particularly relevant to a specialized audience, including basic research scientists working in developmental biology and the molecular basis of neurodevelopmental disorders, as well as those interested in translational approaches. Additionally, given WWOX's known roles beyond neurodevelopment and potential involvement of MYC, the findings may also be of interest to cancer biologists.

      Expertise: My expertise lies in iPSCs and brain organoid modeling of neurodevelopmental disorders, with a strong focus on organoid phenotypic analysis, particularly immunofluorescence and transcriptomics. However, I do not have a strong background in bioinformatics and therefore lack sufficient expertise to evaluate the bioinformatic methodologies utilized in the study.

      Response:

      We thank the reviewer for their valuable feedback and for acknowledging the strengths of our study. We agree with the reviewer that additional validation and replication are needed to strengthen our conclusions. We acknowledge the limitations in experimental design, and we are committed to enhancing the reproducibility of our findings. We also appreciate the reviewer's comments on the study's conceptual advancements, which we believe offer new insights into WWOX's role in neurodevelopmental disorders.

      We are confident that with the additional experiments outlined, our study will provide valuable contributions to understanding WWOX-related syndromes. Thank you again for your thoughtful suggestions.

      __ __


      Reviewer 4:

      Summary

      The article deals with WWOX gene deficiency related neural diseases such as WOREE and SCAR12 syndromes. While there is no available drugs for treatment, the authors used organoid approach to study the development of the potential of disease development. The authors utilized neural organoids and single-cell transcriptomics and identified radial glial cells (RGs) as preferentially affected. The RG cells have disrupted cell cycle arrests in the leading G2/M and S phases, along with MYC overexpression and concomitant reduction in neuronal generation. The study also included detecting neural hyperexcitability and restoring defective WWOX gene for functional assessment. The study is important in understanding the function of WWOX and its mutated states, especially in identifying RG in the potential disease progression.

      My concerns are:

      Comment____:

      1.Although organoids are good models for in vitro simulation of disease progression, I am not convinced that RG is the only cell type affected initially.

      Response____:

      We thank the reviewer for their thoughtful comment. We would like to clarify that we do not suggest that only radial glia cells are affected. As mentioned in both the current manuscript and our previous work (Steinberg et al., EMBO Mol. Med. 2021, and Repudi et al., Brain, 2021), other cell populations, including neurons and oligodendrocytes, are also impacted by WWOX deficiency. WWOX is widely expressed in the mature brain, and we are actively investigating whether these effects are cell-autonomous. In this study, we focus on WWOX in RGs due its high expression and possible importance in maintaining RG homeostasis. We will further clarify this point in the revised manuscript.

      Comment____:

      Functional characterization of RG needs further strengthening. I suggest utilizing a proteomic approach to compare the diseased-ongoing RG versus regular RG and identify which proteins are involved for functional characterization. Finally, the functional alterations in the mitochondria due to WWOX deficiency should be checked.

      Response____:

      We thank the reviewer for their suggestion and agree that performing proteomic analysis on RG populations will strengthen our understanding of the underlying mechanism, however, the experiment itself was attempted and proved to be technically challenging at this size, and for now is beyond the scope of this paper.

      Comment____:

      WWOX-deficient radial glia cells are expected not to guide neurons' migration normally during neural development. Please note that neuronal heterotopia occurs frequently in the WWOX deficiency. Neurons tend to exhibit groups of cells coming together in the neocortex. Purified RG cells are used to run versus typical neurons or RG cells. One can expect WWOX-deficient cells to run away from the normal cells, and they may kill each other, leading to compromise. The authors should run the real-time cell migration experiments using normal neurons versus WWOX-deficient radial glia cells and see the behavior of both cell types upon encountering each other. This will provide better insight regarding the deficiency of WWOX in radial glia cells.

      Response____:

      We thank the reviewer for their insightful suggestion regarding the validation of neuronal heterotopia in WWOX-deficient cells through real-time migration experiments. While we recognize the potential value of this approach for investigating the behavior of WWOX-deficient radial glia cells, we believe that such experiments would extend beyond the scope of the current study. However, we are considering them as part of our future research to further explore the impact of WWOX deficiency on cell migration and neuronal positioning. Thank you again for your valuable input.

      Significance

      The study is significant in our understanding the progression of syndromes associated with WWOX deficiency. My suggestions are shown in the above section.

      Response____:

      We thank the reviewer for their thoughtful and constructive feedback. We especially appreciate the suggestions regarding the broader involvement of additional cell types and the importance of exploring radial glia function through real-time migration assays. These insights will help us refine the focus and interpretation of our findings, and we will address the relevant clarifications and improvements in the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The article deals with WWOX gene deficiency related neural diseases such as WOREE and SCAR12 syndromes. While there is no available drugs for treatment, the authors used organoid approach to study the development of the potential of disease development. The authors utilized neural organoids and single-cell transcriptomics and identified radial glial cells (RGs) as preferentially affected. The RG cells have disrupted cell cycle arrests in the leading G2/M and S phases, along with MYC overexpression and concomitant reduction in neuronal generation. The study also included detecting neural hyperexcitability and restoring defective WWOX gene for functional assessment. The study is important in understanding the function of WWOX and its mutated states, especially in identifying RG in the potential disease progression. My concerns are:

      1. Although organoids are good models for in vitro simulation of disease progression, I am not convinced that RG is the only cell type affected initially.
      2. Functional characterization of RG needs further strengthening. I suggest utilizing a proteomic approach to compare the diseased-ongoing RG versus regular RG and identify which proteins are involved for functional characterization. Finally, the functional alterations in the mitochondria due to WWOX deficiency should be checked.
      3. WWOX-deficient radial glia cells are expected not to guide neurons' migration normally during neural development. Please note that neuronal heterotopia occurs frequently in the WWOX deficiency. Neurons tend to exhibit groups of cells coming together in the neocortex. Purified RG cells are used to run versus typical neurons or RG cells. One can expect WWOX-deficient cells to run away from the normal cells, and they may kill each other, leading to compromise. The authors should run the real-time cell migration experiments using normal neurons versus WWOX-deficient radial glia cells and see the behavior of both cell types upon encountering each other. This will provide better insight regarding the deficiency of WWOX in radial glia cells.

      Significance

      The study is significant in our understanding the progression of syndromes associated with WWOX deficiency. My suggestions are shown in the above section.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Steinberg and colleagues describes cellular and molecular changes linked to mutations in WWOX, a gene implicated in rare neurodevelopmental disorders, WOREE and SCAR12 syndromes. By comparing immunofluorescene and single cell trascriptomics of unguided brain organoids from control and WWOX-knockout iPSCs, as well as 2D NSCs and in vivo fetal brain expression datasets, the authors identified radial glia as relevant cell types in which WWOX is expressed and affected by WWOX deficiency. Using immunofluorescence, single cell trascriptomics analysis and western blotting on week 16 organoids, the authors show that WWOX deficiency results in increased abundance of radial glia cells at the expenses of neuronal production. These changes are accompanied by accumulation of cells in G2/M and S phases, overexpression of c-MYC and Wnt activation. In addition to this, the authors characterize unguided brain organoids generated from iPSCs reprogrammed from patients affected by WOREE or SCARE12 syndromes. Using immunofluoresce and single cell trascriptomics, they find that, while RG abundance changes were very modest, patients's iPSC-derived neurons are enriched for signatures related to early development, suggesting delayed differentiation. Finally, the authors use patch clumping, calcium imaging and gene therapy in 16 weeks old organoids derived from control and patients-derived iPSCs, to demonstrate that WWOX restoration normalized hyperexcitability phenotypes in both WOREE and SCAR12 organoids. These results thus provide a proof-of-concept evidence that WWOX restoration in human cells is a valid strategy to correct for hyperexcitability pehnotypes in WWOX related syndromes.

      Major comments:

      The study's main conclusions regarding neurodevelopmental phenotypes linked to WWOX deficiency and genotype-phenotype relationships are based on iPSC-derived brain organoid models analyzed using immunofluorescence, single-cell transcriptomics, and excitability recordings (cell-attached patch clamping, calcium imaging). While the analyses involve a diverse collection of iPSCs and two time points (7 and 16 weeks), the study falls short in providing sufficient experimental details and validation to fully support its conclusions. Additional quantification, replication, and functional validation would be necessary to solidify the study's conclusions. Some of these validations are achievable within a reasonable timeframe, while others would require a more substantial investment of time and resources as detailed below. A key concern is the lack of experimental details and replicability. Number of individual organoids, number of images per organoid for IF, and whether multiple batches were used are only partially provided. While the authors report generating multiple WWOX knockout clones, the legends and methods do not specify whether multiple clones were used across different organoid experiments. The study states that four organoids were used for scRNA-seq, but it is unclear whether this means four organoids per genotype or one organoid per genotype was analyzed. These ambiguities make the claims appear rather preliminary. Another issue is the limited validation of scRNA-seq observations. Since scRNA-seq is often performed on a limited number of organoids, orthogonal validation is crucial to strengthen the findings. For example, changes in radial glia abundance and neuronal production observed in scRNA analyis (Figure 2-5) could be validated using immunofluorescence across genotypes and batches. Currently, IF stainings for Sox2 and TUBB3 are shown only at 7 weeks in Figure 1B, but no quantificative assessment is provided. Also, it is not clear if quantifications provided in Figure 1F refer to multiple organoids or batches. Furthermore, the observations on cell cycle arrest, DNA damage, senescence, metabolic alterations, Wnt activation obtained via scRNA-seq could be further validated on organoid tissues using specific antibodies that the lab used before (e.g. yH2AX antibody in PMID: 34268881) or assays that have been developed elsewhere (some examples are reviewed in PMID: 38759644). As for feasibility, immunofluorescence validation of existing tissues is realistic, requiring validated antibodies and procedures, some additional imaging time and analysis (estimated 1-2 months, with some budget to purchase antibodies and cover imaging time costs). Feasibility of efforts related to validation across organoids and batches depends on the number of organoids used so far and available tissues. Generating new organoids would be indeed more time-consuming (≈ 6 months) and expensive (but extact costs would depend on number of clones, organoids and batches used), but feasible. Another limitation is the lack of functional relevance of MYC alterations. The study confirms increased MYC expression via both scRNA-seq and immunofluorescence in organoid tissues. However, these results remain correlative and demonstrating the functional requirement of MYC overexpression in mediating WWOX-deficiency-related changes would significantly strengthen the study's conclusions. This would require additional differentiation experiments, including MYC overexpression or knockout models, to assess its direct impact. These efforts would represent a major conceptual advance by linking RG effects to MYC function and highlighting MYC-related therapeutic directions. These additonal experiments would require a substantial investment to generate the necessary regents (e.g. WWOX-KO and WT iPSCs with altered MYC levels) and additional time and costs for organoid analysis, mostly by immunofluorescence (estimated 6-8 months). Another issue is the lack of patterning analysis in unguided organoids, which are known to exhibit high variability in regional identity (PMID: 28283582). While the authors acknowledge this limitation to some extent-abstaining from fine-resolution analysis (Lines 173-174)-this variability, combined with the limited number of organoids used, could be a major confounding factor in the phenotypic analyses, even at a broad resolution. Indeed, some of the reported differences across genotypes may stem from variability in organoid patterning rather than true genotype-driven effects. For example, the reduced SATB2 expression in KO and patient-derived organoids from Figure 1E-F could result from impaired cortical patterning rather than a direct effect of WWOX deficiency. Additionally, in Figure 6D and 6E, the fact that WOREE iPSC-derived organoids - but not SCAR12 organodis- show lower levels of both CTIP2 and SATB2, might reflect a shift toward a non-cortical identity rather than a direct WWOX-dependent phenotype. To rule out patterning variability as a contributing factor, the authors should analyze organoid regional identity across genotypes using immunostaining for dorsal and ventral forebrain markers. This would allow a more solid inference of genotype-specific effects on neurodevelopmental phenotypes. Patterning validation can be performed on existing organoid tissues (week 7) using validated antibodies (PMID: 28283582). As such, this analysis is expected to be relatively straightforward and feasible in a few weeks time. If the generation of new organoids is needed, such patterning validation should still be relatively feasible, as week 7 organoids are ideal for assessing regional identity. Analysis of patterning effects should also extend to 2D NSC cultures. In the 2D NSC models derived from WWOX-KO lines (Figure 3L, Figure S4A), the differentiation protocol includes patterning factors that promote ventral fates (SAG and IWP2). Interestingly, the quantification of MYC expression from unguided organoids and 2D NSCs (Figure 3K-L) reveals a major difference in the fraction of MYC-positive cells in WT conditions across the two culture models. A possible changes in the dorsal and ventral patterning of 3D and 2D cultures might explain these differences and implementing immunostaining for patterning markers in 2D would help clarify patterning contributions. There are also some concerns regarding WOREE and SCAR12 phenotypes. First, the genotypes of the patient-derived iPSCs are not clearly defined, making it difficult to establish clear genotype-phenotype relationships. The study uses iPSCs from four different patients (2 WOREE, 2 SCAR12), some of which were validated in a previous study (PMID: 34268881). However, it remains unclear how they were validated, and detailed genomic alterations of the four patients are not explicitly reported. Additionally, it is uncertain whether all variants result in a full loss of WWOX function, as protein loss evidence is only provided for one WOREE patient (Figure S1D). Also, the authors state that SCAR12 should have a milder phenotype (line 168), but it is unclear whether this claim is based on clinical evidence or genomic data from these specific patients. To improve genotype-phenotype comparisons, the authors should consider including a clear schematic summarizing the genomic alterations in all patient-derived lines and their expected disease severity. Second, the experimental design lacks appropriate controls for patient-derived iPSCs. All patient-derived iPSC comparisons are performed against a single reference male iPSC line, which is neither isogenic to WOREE nor SCAR12 iPSCs. This complicates the interpretation of differences between healthy and patient-derived organoids, as well as comparisons between WOREE and SCAR12 phenotypes. Given this design, it is impossible to draw solid conclusions about genotype-phenotype relationships. A more robust approach would involve including multiple healthy controls to account for genetic background variability or using isogenic parental or genetically corrected lines, which would provide a cleaner genetic comparison. A recent study (PMID: 36385170) discusses different study designs that could strengthen this aspect and might be useful for the authors to consult. Third, the study presents seemingly conflicting results regarding WOREE and SCAR12 phenotypes. The authors present immunofluorescence (IF) and scRNA-seq data indicating that changes in radial glia (RG) abudance are not observed in these patient-derived organoids. However, using same methodologies, they indicate that neuronal production is affected, leading to the accumulation of early neuronal signatures in both WOREE and SCAR12 neurons. The study does not explore whether RG signatures might be altered in a way that could contribute to neuronal phenotypes. Also, Figure 1F suggests that while Sox2+ cell counts are not increased in SCAR12 organoids, SATB2 levels are still altered, indicating that Sox2 and SATB2 trends are not tightly coupled across genotypes. Furthermore, Figure 1 and 6 show that while both syndromes exhibit similar hyperexcitability, data in Figure 6 report that only WOREE organoids display reductions in SATB2 and CTIP2 counts and that this can be rescued by WWOX restoration. Some of these discrepancies could stem from patterning variability as discussed above. Also, neuronal firing rate across WOREE and SCAR12 iPSC-derived organoids (Figure 6B) was different at later stages, but was rather comparable at an earlier stages (Figure S1G). The reasons for these differences are not thoroughly discussed. To strengthen the discussion, the authors should address how RG alterations (if any) might contribute to neuronal phenotypes, provide a more detailed comparison between WOREE and SCAR12 organoids and the WWOX-KO model and elaborate on the distinct phenotypes of the two syndromes, including possible explanations for observed functional and molecular discrepancies. Lastly, the conclusions drawn about WWOX restoration via gene therapy are weakened by the lack of replication and validation (see points above). First, the authors claim successful WWOX restoration in neurons, but provide limited evidence that the infected population consists of neurons. NeuN staining (Figure 6A and S10) suggests some neuronal expression, but quantification of WWOX+ NeuN+ / WWOX+ total cells is missing. Given that IF data are already available, this additional quantification could be completed within a few weeks and would significantly strengthen the claim. Second, the rationale for restoring WWOX in neurons is unclear, given that WT neurons do not normally express WWOX. Is WWOX being considered a functional neuronal maturation factor? If so, this should be explicitly discussed in the manuscript. Third, the authors propose that WWOX deficiency might lead to a delay in neuronal maturation. However, to demonstrate delayed maturation, the study should show that, given additional time, affected organoids can eventually produce late-stage neuronal signatures. Since this additional experiment may be technically challenging and time-consuming, the claim should instead be rephrased as speculative and discussed accordingly in the text. Lastly, the study lacks key details necessary for reproducibility in multiple aspects. In addition to details about organoid numbers and batches discussed above, all IF images are shown as insets, making it difficult to assess broader reproducibility within the whole organoid tissue section. Also, whether distinct iPSC clones/sections/organoids were used across IF experiments - which is critical for ensuring reproducibility - is not specified. As for details about experimental and bioinformatics methods, the bioinformatics pipeline is not fully described, making it impossible to verify or reproduce the computational analysis. No information is provided regarding batch correction procedures for scRNA-seq data (Lines 695-697) and on how clusters were mapped (lines 695-697) for cell type identification. Legends in Figure 1F, 2K-L, 6B, S10 do not specify what the error bars represent (e.g., standard deviation or standard error). Many catalog numbers for critical cell culture reagents are not provided, which is essential for experimental replication. The Western blot methods lack crucial details.

      Minor comments:

      • One relevant study (PMID: 32581702) that examines WWOX function in rat models and human fetal brains from patients has not been referenced or discussed. Notably, this study characterizes molecular changes associated with WWOX knockdown in human ESC-derived NPCs. Given its direct relevance to the current study, these findings should be acknowledged and integrated into the discussion to provide a more comprehensive understanding of WWOX-related neurodevelopmental alterations.
      • For WKO-1C and 2C the exact mutations in exon1 identified by Sanger sequencing are not reported. Also, validation for WWOX protein loss in all the lines used is also missing. Information about cell line genome integrity check are also missing.
      • Line 116 and 394, reference Steinberg et al is not formatted.
      • Figure S1A: Localization of WWOX seems to be cytoplasmic and/or membrane-bound in organoids, while staining in IPSCs shows cytoplasmic and nuclear signals. Perphaps an orthogonal valiation with another anti-WWOX antibody would be appropriate to confirm subcellular localization.
      • In Figure 1, authors use week 7 organoids and claim that they are enriched for early born preplate neurons (line 141). However the authors decide to look at SATB2, which is not an eary-born preplate neuron. So while the rationale for using Satb2 is not clear, the reported staining in Figure 1E shows an unusal overlap beetween Sox2 and Satb2 nuclear signals in wt organoids. The authors needs to recheck that the correct antibodies were used in this analysis.
      • Figure 1 Panel F: legend states that n indiactes 3 neurons. Please specifify what n referes to.
      • Figure 3J: MYC staining appears to be nuclear in WWOX-KO organoids but more cytoplasmic in SCAR12 organoids. Also in WOREE organoids, both Sox2 and MYC staining appears different from what seen in other panels/ genotypes from the same figure panel.
      • Figures 3 and related legend: Authors use the term w for weeks but they need to specify whether this refers to gestational weeks or post-conception weeks.
      • Figure 4: The UMAP in B, E and G seems to be blurred in the bottom parts. Is this an intentional choice? If so, what would be intent? Also, title and legend for E mention metabolic alterations but data presented are not related to metabolic patwhays.
      • Figure 6. The same exact images from A and C are also reported in Figure S8 and S9 respectively.
      • Figure S1D: WWOX antibody seems to give an extra band at higehr molecular weight. This is also evident from S4B, where the upper band seems overrepresented in KO2. Also, are the healthy parents haploinsufficient for WWOX? what are the levels compared to wt (unrelated) controls?
      • Figure S2: In B, what is the difference between top and bottom UMAPs? In C-D, what is NP? Correlation map suggests that the NP clusters 7 and 8 are different from cluster 11. What is the rational for labelling them all NP cluster?
      • Figure S6: In the legend, full description of cluster labels are missing. Also legends specifes A-D while the figure contains only A-C.
      • Figure S4A: The staining for TUBB3 is very different between KO1 and KO2.
      • Figure S8: The legend indicates n as 4 organoids but images are not quantified so there is no evidence that these patterns have been replicated in 4 organoids.
      • Figure S9: The title is duplicated and not corresponding to the data in the figure. The whole figure is duplicated in Figure S10 (which is wrongly labelled as Figure 10 in the legend).
      • Line 330: Figure S6 F-H should be corrected in Figure 6 F-H.
      • Line 353: reference needs to be added for "our earlier findings"
      • Lines 383 and 392: The authors describe several possible MYC roles but which ones could relevant in this contex is not discussed.
      • Lines 402 and 403: The authors state that the study "highlights the critical role of Wnt signalling" but they fail to provide evidence that Wnt is functionally involved, as Wnt perturbation experiments are not applied.
      • Line 473: "at X concentration" needs to be correct to specify the concentration used.
      • Line 478: The authors state that "inform consent is under approval". Does this mean that the study was conducted before approval was obatined?
      • Line 525: which orbital shaker and which speed was used?
      • Line 537: what is GC in GC/ul?
      • Line 629: samples were centrifgues at which speed and for how long?
      • Line 639: "All primer sequence" should be plural

      Referees cross-commenting

      All four reviews appear fair and complementary to each other. Reviewers have consistently highlighted concerns regarding unclear genomic alterations in patients' iPSCs and experimental reproducibility in organoid cultures, emphasizing the need for further validation of the reported findings and the underlying molecular cascade. Additionally, they have noted some inconsistencies, with Reviewer #2 specifically identifying a major discrepancy in the WWOX-KO phenotypes compared to those previously described by the same team.

      Significance

      General assessment:

      The strengths of this study lie in its focus on disease phenotypes in a human context and the use of patient-derived iPSC lines, which provide valuable translational relevance. Additionally, the study employs a complementary set of analyses, including functional assays, immunofluorescence (IF), and single-cell RNA sequencing (scRNA-seq), which enhance its depth. However, the study has several critical weaknesses, primarily related to suboptimal experimental design and limited reproducibility. These are discussed in section A and also indicated below:

      • Lack of isogenic controls or patient-derived lines and presence of conflicting data for patient-derived organoids, making genotype-based comparisons for patients' lines less robust; examples of studies using iPSC isogenic controls for dissecting neurodevelopmental disorders are found here (PMID: 35084981; PMID: 26186191).
      • Limited reproducibility, due to a small number of organoids used and the lack of orthogonal validation for key findings.
      • Absence of functional validation for MYC's contribution, making its proposed role unclear.

      Advance:

      This study builds upon and expands previous efforts by the same team to characterize brain organoid models obtained from patient-derived iPSCs, as well as to explore gene therapy restoration approaches (PMID: 34268881, PMID: 34747138). Some of the bioinformatics analyses appear to have been developed elsewhere, and technically, the study offers only a limited methodological advance. Instead, the key advancement of this work is more conceptual: it proposes potential underlying mechanisms of WWOX-related neurodevelopmental disorders. If the study's limitations were addressed, it could provide valuable insights into WWOX's role as a key regulator of radial glia proliferation and differentiation, as well as potential functions in neuronal maturation. These findings would be relatively novel in the context of WWOX-related neurodevelopmental disorders. WWOX has been extensively studied in rodent models, where WWOX -/- mice exhibit growth retardation and brain malformations (PMID: 32000863, PMID: 18487609, PMID: 15026124). Additionally, studies in rats and human fetal cortical tissue from patients (PMID: 32581702) have linked WWOX deficiency to migration defects and cortical cytoarchitectural alterations. Previous work in mice by the same team suggested that neurons are the key population affected, linking WWOX deficiency to hyperexcitability and intractable epilepsy (PMID: 33914858). However, the relevance of radial glia and cell-type specific molecular alterations linked to WWOX mutations have remained poorly defined. Through scRNA-seq, this study offers some insights into cell-type-specific molecular changes, especially in radial glia cells. These changes are linked to MYC fucntion, cell cycle arrest and altered differentiation trajectories. However, these insights remain preliminary due to the study's design limitations. Another potential advancement of this study is its exploration of syndrome-specific alterations in WOREE and SCAR12 patients and their rescue through WWOX gene therapy-an aspect that has been difficult to study in animal models and remains largely unexplored. While the brain organoid model offers a promising approach, the true conceptual advance of this study remains uncertain, as its current limitations hinder the ability to draw definitive conclusions.

      Audience: This study could be particularly relevant to a specialized audience, including basic research scientists working in developmental biology and the molecular basis of neurodevelopmental disorders, as well as those interested in translational approaches. Additionally, given WWOX's known roles beyond neurodevelopment and potential involvement of MYC, the findings may also be of interest to cancer biologists.

      Expertise: My expertise lies in iPSCs and brain organoid modeling of neurodevelopmental disorders, with a strong focus on organoid phenotypic analysis, particularly immunofluorescence and transcriptomics. However, I do not have a strong background in bioinformatics and therefore lack sufficient expertise to evaluate the bioinformatic methodologies utilized in the study.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Steinberg et al aim to elucidate the role of WWOX in human neurogenesis and model WOREE and SCAR12 syndromes which are rare neurodevelopmental disorders. They chose to investigate its function in human brain organoids after generating WWOX KO and patient-derived iPSC lines. Their major finding is that radial glial cells, the main neural progenitor population during corticogenesis, are affected. Via single-cell-RNA-sequencing, they try to decipher the perturbed molecular mechanisms identifying MYC, a proto-oncogene, as a major player. At the end of their study, they proceed to gene therapy restoration and suggest that this could become a potential therapeutic intervention for WOREE and SCAR12 syndromes. The study aims to elucidate major cellular and molecular mechanisms that modulate neurodevelopment and neurodevelopmental disorders. Although sc-RNA-seq could potentially be of great interest and unravel major mechanisms, the authors do not follow this part, but only discuss potential future avenues. Here are some suggestions that could be useful to the authros.

      Major comments:

      • A big part of the paper focuses on generating the iPSCs and characterizing the generated brain organoids and gene restoration of the phenotype via restoration of the WWOX gene expression (Fig.1, Fig.6, Fig.S1, Fig.S8, Fig.S10 and potentially Fig.S9 - this figure is not included) however, this has already been done by the same authors (first and last authors) in a previous publication. What are the differences in the line that have been generated in previous publication (Steinberg et al 2021, EMBO Mol. Med.)? If there are differences, the authors should make a thought comparison and explain why they generated different lines. If there is no difference, the authors should reduce to minimum this part and place it to supplementary.
      • Fig.1E: in the pictures shown, the majority of the Satb2+ cells are colocalized with SOX2. Although a small portion of neurons have been shown from many studies that in brain organoids are co-localized to SOX2, in the pictures depicted this percentage is big. Also in ctrl condition the VZ-CP like areas are not easily recognized. The authors should check if this co-localization is a more general phenotype and if not choose more representative pictures.
      • Information about the number of organoids per batch used in each figure is not included. This needs to be added for each experiment. Data (at least the majority of them) should be collected from brain organoids from at least two batches.
      • The expression of WWOX in cortical development has been shown in the previous publication. Although sc-RNA data are validating the previous data and are adding more information, these data should be put as supplementary. Besides, in Fig.3G where authors aim to compare WWOX expression to MYC that fits nicely with their results depicting MYC as the most affected gene in KO and mutant line, when one looks at the WWOX expression only it seems that its expression is higher in CP that VZ. This is contrary to the conclusion that WWOX is mainly characterizes RGs. Why is that? Authors should at least discuss this.
      • In this study, authors show that progenitors are reduced in WWOX-KO organoids, however in the previous publication SOX2 population is not majorly affected. Why are there such differences? Given that RGs are the main population affected as authors propose in this study, these differences must be at least discussed. Similar comments regarding neurons: in previous publication there is a minimal reduction of neurons in WWOX-KO brain organoids, while here authors describe major differences.
      • Data from sc-RNA-seq analysis highlighting MYC as major differentially regulated gene are very interesting and seem to be key to the molecular pathway affected as authors suggest. Authors also validate this with immunostainings in brain orgnaoids. However, in Fig.3J MYC expression in ctrl is not depicted, even though in the respective graph it seems that 20% of SOX2+ cells co-express MYC. Please choose a more representative picture.
      • One of the main findings in this study is the cell cycle changes observed in WWOX-KO and mutant organoids. Given that the major novelty of the publication is the cellular and molecular mechanism implicated in WOREE and SCAR12 syndromes, authors should perform additional experiments towards this direction. One suggestion would be to perform stainings in brain organoids using markers of the different cell cycle phases (eg. KI67, cyclin a, BrdU/EdU, ph3). Also treatment of organoids with different BrdU/EdU chase experiments would be important so as to measure exactly the length of each cell cycle phase.
      • Regarding the molecular cascade, is WWOX directly affecting MYC of Wnt genes? Do they have information on upstream and downstream factors in the affected molecular pathway?
      • Restoration of phenotype via reinsertion of WWOX gene has already been done in the previous publications by the same authors. But what about MYC? Is MYC manipulation able to rescue the phenotype?
      • Finally, MYC association to ribosome biogenesis as mentioned by the authors in discussion is very interesting. The authors should consider investigating this direction, as it will be a great addition to the mechanisms that regulate WOREE and SCAR12 syndromes which is the main focus of this study.

      Minor comments:

      • Line 115: authors say that the data they discuss are found in Fig.S2A, maybe they mean Fig.S1A?
      • Fig.S9 is missing, in the current version this Fig is the same with Fig.S10. Please change it.

      Significance

      This study is the continuation of a previous publication the authors have published. The topic is very interesting and novel especially in modelling neurodevelopmental disorders in a human context, however, given that the main phenotype has already been published, the authors should include more effort in the molecular cascade. Clinical interventions if the molecular cascade is described would be of great importance to the field

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript "WWOX deficiency impairs neurogenesis and neuronal function in human organoids" by Aqeilan and co-workers provides impressive set of studies, mostly utilizing cerebral organoids (CO), gaining insights into the roles of the gene WWOX in neuronal development and molecular etiology of WOREE and SCAR12, two devastating rare diseases originating from mutations in WWOX. Further, therapeutic modalities through the neuron-specific gene therapy are investigated using the WWOX k/o and WOREE and SCAR12 patient-derived COs. Among the major findings of this work one can highlight the identification of the main source of WWOX-expressing cells as radial glia (RG) cells; the discovery of the massive upregulation of Myc upon loss/decrease in WWOX expression in RGs; and the strong neuronal under-differentiation induced upon WWOX k/o and mutations. Regarding the latter finding, the authors report massive increase in RGs and concomitant drop in neuronal cells in WWOX k/o COs. In contrast, in WOREE and SCAR12 patient-derived COs, a more subtle under-differentiation is seen. Specifically, while WOREE but not SCAR12 patient-derived COs also show a certain increase in the RG proportion, both types of patient-derived COs demonstrate higher proportions of "young" neuronal cells as compared to wild-type COs. Thus, a picture can be drawn whereas complete loss of WWOX leads to strong under-differentiation mostly manifested as expansion of RGs and hence under-production of neuronal cells, while hypomorphic loss-of-function of WWOX in WOREE and in missense mutations in SCAR12 lead to the later defect in neuronal cell maturation. Overall, I find the work highly interesting, but I would like the authors to address one major issue and several minor ones. The major issue is related to the overall model the authors seem to build based on their data - or at least the overall model the reader may get from the paper. This model suggests that the loss / decrease in WWOX levels in RGs leads to Myc overexpression, that in turn affects the cell cycle and prevents neuronal differentiation. This model is highly attractive, but is probably incomplete, in the sense that it does not fully recapitulate the complicated picture. Indeed, all three types of mutated WWOX COs (WWOX k/o, WOREE patient-derived organoids, and SCAR12 patient-derived organoids) demonstrate strong - but equal levels of Myc upregulation. Yet the under-differentiation in each of these three types is different, as described above, and the disease manifestations among WOREE vs. SCAR12 patients are also different. Thus, another player (in addition to Myc) must be at place, that is differentially affected by the partial null mutations in WOREE and missense mutations in SCAR12. This point - ideally to be addressed experimentally - should be at least faced directly by the authors in the Discussion. Perhaps they can already point to such additional players based on their transcriptomics analysis.

      The minor issues are as follows.

      1. It would be useful if a table (perhaps supplementary) describing the details of the WWOX mutations in all the COs models studied in this paper were presented.
      2. For the new WOREE individual with complex genetics in WWOX: it is not clear why any WWOX protein is still present in this patient in Fig. S1D (please give an explanation or speculation); it is not clear which tissue was used for the Western blot in Fig. S1D; the data in Fig. S1D need to be quantified.
      3. Western blot, quantified, should be performed on all COs under study, to compare the WWOX expression levels. Please also change the immunofluorescence shown in Fig. 1B (e.g. show WWOX in a different color), as the figure provided shows WWOX poorly in wild-type CO, and it is not clear how much it is removed in the mutant organoids. Why should there be no signal in the SCAR12 COs?

      Significance

      The manuscript "WWOX deficiency impairs neurogenesis and neuronal function in human organoids" by Aqeilan and co-workers provides impressive set of studies, mostly utilizing cerebral organoids (CO), gaining insights into the roles of the gene WWOX in neuronal development and molecular etiology of WOREE and SCAR12, two devastating rare diseases originating from mutations in WWOX. Further, therapeutic modalities through the neuron-specific gene therapy are investigated using the WWOX k/o and WOREE and SCAR12 patient-derived COs. Among the major findings of this work one can highlight the identification of the main source of WWOX-expressing cells as radial glia (RG) cells; the discovery of the massive upregulation of Myc upon loss/decrease in WWOX expression in RGs; and the strong neuronal under-differentiation induced upon WWOX k/o and mutations. Regarding the latter finding, the authors report massive increase in RGs and concomitant drop in neuronal cells in WWOX k/o COs. In contrast, in WOREE and SCAR12 patient-derived COs, a more subtle under-differentiation is seen. Specifically, while WOREE but not SCAR12 patient-derived COs also show a certain increase in the RG proportion, both types of patient-derived COs demonstrate higher proportions of "young" neuronal cells as compared to wild-type COs. Thus, a picture can be drawn whereas complete loss of WWOX leads to strong under-differentiation mostly manifested as expansion of RGs and hence under-production of neuronal cells, while hypomorphic loss-of-function of WWOX in WOREE and in missense mutations in SCAR12 lead to the later defect in neuronal cell maturation. Overall, I find the work highly interesting, but I would like the authors to address one major issue and several minor ones.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      __Evidence, reproducibility and clarity __

      The manuscript explores mild physiological and metabolic disturbances in patient-derived fibroblasts lacking G6Pase expression, suggesting that these cells retain a "distinctive disease phenotype" of GSD1a. The manuscript is well written with well-designed experiments. However, it remains unclear whether these phenotypes genuinely reflect the pathology of GSD1a-relevant tissues. The authors did not validate these findings in a liver-specific G6pc knockout mouse model, raising concerns about the study's relevance to GSD1a. Additionally, the lack of sufficient in vivo evidence undermines the therapeutic potential of GHF201 for this disease. Overall, the study lacks a few key pieces of evidence to completely justify its conclusions at both fundamental and experimental levels.

      __Reply:__We thank the reviewer for this general comment which gives us the opportunity to better explain the scope of our work. The purpose and focus of this work are not to test the pathological relevance of skin fibroblasts to GSD1a pathology. We do not claim that skin fibroblasts are involved in GSD1a pathogenesis. It is also not a developmental work claiming to uncover GSD1a pathogenic axis throughout embryonic development. As a matter of fact, since skin fibroblasts originate from the mesoderm embryonic germ layer and hepatocytes develop from the endoderm embryonic germ layer, it would even be unlikely that the pathological phenotype found in skin fibroblasts directly contributes to GSD1a pathology in model mice or in patients. Indeed, we are not aware of any dermatological contribution to GSD1a pathology in patients. However, our results suggest that in addition to the established and mutated organ (liver in the liver-specific G6pc knockout mouse model), other, relatively less studied, patho-mechanisms in distal tissues may also contribute to GSD1a pathology. Notably, this work is also not testing a therapeutic modality for GSD1a. Our work uses GSD1a disease models as a tool for demonstrating, or reviving, the concept of epigenomic landscape (Waddington, 1957): Different cell phenotypes, such as healthy and diseased, are established by innate metabolic differences between their respective cell environments, which impose epigenetic changes generating these different phenotypes. In this respect, our manuscript has a similar message to the one in the recently published paper Korenfeld et al (2024) Nucleic Acids Res 53:gkae1161. doi: 10.1093/nar/gkae1161: The Kornfeld et al paper shows that intermittent fasting generates an epigenetic footprint in PPARα-binding enhancers that is "remembered" by hepatocytes leading to stronger transcriptional response to imposed fasting by up-regulation of ketogenic pathways. In the same way, the diseased GSD1a status imposes metabolic changes, as detailed here, leading to permanent epigenetic changes, also described here, which are "remembered" by GSD1a fibroblasts and play a major role in the transcription of pathogenic genes in these patient's cells. This in turn is how the diseased state is preserved, even in cells not expressing the G6Pase mutant, which is the direct cause of the disease. We added this perspective to the Discussion to better highlight the key takeaway from our manuscript.Naturally, research such as ours with a claim on biological memory would involve ex vivo experiments where tissues are isolated from their in-situ environments and tested for preservation of the original in situ phenotype. The few in vivo experiments we performed (Fig. 5) are mainly aimed at demonstrating that not only the phenotype, but also therapy response is "remembered" ex vivo: In the same way that the G6PC-loss-of-function liver responded positively to GHF201 therapy in situ, ex vivo cells not expressing G6PC also responded positively to the same therapy. This observation only demonstrates further support for "memorization" of the disease phenotype by cell types not expressing the mutant: Both the diseased phenotype and response to therapy were preserved ex vivo.Lastly, while interesting, validation of our findings in vivo (as suggested by the reviewer) is not related to the scope of this manuscript. Such experiments, using the liver-targeted G6pc knockout mouse model, are the follow-up story, which is related to the origin of inductive signals that cause the curious and novel phenotype mechanism in GSD1a fibroblasts described in this manuscript. The scope and volume of such research constitute a novel manuscript.

      Since dietary restriction is the only management strategy for GSD1a, the authors should clarify whether the patient fibroblast donors were on a dietary regimen and for how long. Given that fibroblasts do not express G6Pase, it is possible that the observed phenotype could be influenced by the patient's diet history.

      __Reply:__We thank the reviewer for this important comment, we agree that it is important to note the dietary regimen assigned to the cohort of patients described in this study. We added an explanation to the manuscript on patient's diets as shown below.Briefly, all patients besides patient 6894 were treated with the recommended dietary regimen for GSD1a as explained in Genereviews (Bali et al (2021)). This dietary treatment (now added to the Methods section in the manuscript) allows to maintain normal blood glucose levels, prevent secondary metabolic derangements, and prevent long-term complications. Specifically, this dietary treatment includes- nocturnal nasogastric infusion of a high glucose formula in addition to usual frequent meals during. By constantly maintaining a nearly normal level of blood glucose, this treatment causes a remarkable decrease, although not normalization, of blood lactate, urate and triglyceride levels, as well as bleeding time values. A second layer in the treatment includes the use of uncooked starch in the dietary regimen to allow maintenance of a normal blood glucose levels for long periods of time. Patient 6894 did not tolerate well the uncooked cornstarch and therefore was treated with a tailored dietary treatment planned by metabolic disease specialists and dedicated certified dieticians highly experienced with the management of pediatric and adult patients with GSDs and other inborn errors of metabolism. The biopsies of patients were taken in the range of 3 month to several years from receiving the aforementioned dietary regimen.Importantly, the strict metabolic diet imposed on GSD1a patients might influence the observed phenotype described throughout the manuscript. This concept aligns with our claim that the GSD1a skin cells are affected by the dysregulated metabolism in patients in comparison to healthy individuals. Interestingly, while patient 0762 harbors a mutation in the SI gene in addition to the G6PC mutation and patient 6894 did not receive the same dietary regimen as other patients (as explained above), all patients do show similar disease related phenotypes, perhaps highlighting the role of an early programing process that affected these cells due to the severe metabolic aberrations presented in this disease from birth.One of the main pathological features of GSD1a is glycogen buildup. The authors should compare glycogen levels between healthy controls and GSD1a fibroblasts and provide a dot plot analysis.

      One of the main pathological features of GSD1a is glycogen buildup. The authors should compare glycogen levels between healthy controls and GSD1a fibroblasts and provide a dot plot analysis.

      __Reply:__We thank the reviewer for this important comment. We added glycogen levels of HC to Figure S2A and accordingly also edited the relevant text in the Results section.

      Figure S2A - As mentioned above, the authors should present healthy control vs. patient fibroblast glycogen data. Without this, the rationale for using GHF201 is questionable.

      __Reply:__We thank the reviewer for this important comment. We added glycogen levels of HC to Figure S2A as mentioned above.

      Figure S2B-C - If the authors propose that GHF201 reduces glycogen and increases intracellular glucose in GSD1a fibroblasts, they need direct evidence. Either directly quantifying glycogen levels or even better would be a labeling experiment to confirm that the free intracellular glucose originates from glycogen. Additionally, the reduction in sample size from N=24 in glycogen analysis to N=3 in the glucose assay needs justification.

      __Reply:__We thank the reviewer for this comment. To clarify, the results shown in Figure S2A left are based on PAS assay, directly quantifying glycogen in cells with and without GHF201 treatment. We have now added HC glycogen levels as requested above. Regarding N, this is explained in Methods: In imaging experiments N was determined based on wells from the experiments done in three independent plates following the rationale that each well is independent from the others and reflects a population of hundreds of cells as previously described in (Lazic SE, Clarke-Williams CJ, Munafò MR (2018) What exactly is 'N' in cell culture and animal experiments?. PLOS Biology 16(4):e2005282. https://doi.org/10.1371/journal.pbio.2005282, Gharaba S, Sprecher U, Baransi A, Muchtar N, Weil M. Characterization of fission and fusion mitochondrial dynamics in HD fibroblasts according to patient's severity status. Neurobiol Dis. 2024 Oct 15;201:106667. doi: 10.1016/j.nbd.2024.106667. Epub 2024 Sep 14. PMID: 39284371.). Figure S2A right shows the glucose quantification experiment that we think the reviewer is referring to. Glucose increase is normally concomitant with glycogen reduction and we therefore show these results in support of the glycogen reduction results. These glucose results are part of our metabolomics results done on the same cells (Figure 6), where glucose is one of the metabolites analyzed. This metabolomics analysis was repeated three times; therefore, N is 3. In summary, these results show that GHF201 directly contributes to glycogen reduction in GSD1a fibroblasts and concomitantly increases glucose levels.

      Figure S2B-C- It is not shown how GHF201 increases intracellular glucose? If glycophagy is a possibility, the authors should do an experiment to confirm this.

      __Reply:__Assuming the reviewer's comment is related to Figure S2A right, glucose levels are only shown to validate the glycogen reduction results (also see point 4): When glycogen levels are reduced, especially by inhibition of glycogen synthesis, glucose levels are supposed to concomitantly rise, being spared as an indirect substrate of glycogen synthesis. There is no proof, and as a matter of fact we also do not assume, that the GHF201-mediated reduction in glycogen levels is a result of increased glycophagy: Glycophagy has been described in cell types with high glycogen turnover, e.g., muscle and liver cells, not fibroblasts. Additionally, glycophagy is a glycogen-selective process implicating STBD1 whose expression in skin fibroblasts is negligible (https://www.proteinatlas.org/ENSG00000118804-STBD1/tissue).On the other hand, glycogen in GSD1a does not accumulate in lysosomes. It is built up in the cytoplasm (Hicks et al (2011) Ultrastr Pathol 35: 183-196; Hannah et al (2023) Nat Rev Dis Primers DOI: 10.1038/s41572-023-00456-z). Therefore, we do not believe that GHF201 reduced glycogen by enhancing glycophagy. As we show, GHF201 activated several key catabolic pathways. It is more likely that activation of one of these pathways, the AMPK pathway, inhibited glycogen synthesis via phosphorylation and ensuing inhibition of glycogen synthase. Alternatively, excessive cytoplasmic glycogen might enter lysosomes by bulk autophagy, or microautophagy (not by glycophagy) and GHF201 might induce lysosomal glycogenolysis by alpha glucosidase as an established lysosomal activator (Kakhlon et al (2021)). However, since, as explained, the mechanism of action of GHF201 is not the topic of this manuscript and therefore we did not dwell more into that.

      Figure 2- How can GSD1a fibroblasts have significantly reduced OCR (Fig. 2B) but increased mitochondrial ATP production (Fig. 2H)?

      __Reply:__We thank the reviewer for highlighting this important topic. OCR, measured in Fig. 2B, is an indirect measure of ATP production. Therefore, changes in OCR only measure the capacity of the mitochondria to produce ATP, and not the direct quantity of ATP. Other factors might influence ATP production, e.g., substrate availability and the activity of other metabolic pathways. On the other hand, the ATP Rate Assay (Figure 2h), provides a real-time direct measurement of ATP levels incorporating coupling efficiency and P/O ratio assumptions. Therefore, these two measurements do not necessarily match. We will add this information to the relevant segment in the text to clarify why OCR is reduced and mitochondrial ATP production increased in GSD1a cells.

      Why do GSD1a fibroblasts show reduced glycolytic ATP (Figure 2h) despite increased glycolysis and glycolytic capacity (Fig 2J-K)?

      __Reply:__We thank the reviewer for highlighting this important topic. ECAR measures medium acidification and thus reflects the production of lactic acid, which is a byproduct of glycolysis. However, medium acidification is also influenced by other factors that can acidify the extracellular environment, especially CO2 production which can originate from the intramitochondrial Krebs cycle which produces reductive substrates for mitochondrial respiration, or OCR. Moreover, the buffering capacity of the Seahorse mito stress assay medium might mask changes in lactic acid production, leading to an underestimation of glycolytic activity. On the other hand, glycolytic ATP production measured by the ATP rate assay directly quantifies the rate of ATP production from glycolysis. Notably, there is a major difference between ECAR and the ATP rate assay: The ATP rate assay is less sensitive to variations in buffering capacity than ECAR measurements. This is because the ATP rate assay relies on inhibitor-driven changes in OCR and ECAR, rather than absolute pH values.Teleologically, as indicated, the increased ECAR in GSD1a cells represents a known compensatory response to deficient ATP production which is stimulation of glycolysis (Figure 2i). To test the success of this known compensatory attempt, we applied the real-time ATP rate assay, but as explained they do not report the same entities. We will add this information to the relevant segment in the text to clarify how reduced glycolytic ATP can be co-observed with increased glycolytic capacity.

      The authors should clarify how many healthy control and patient fibroblast lines were compared per experiment. Given the wide age range, the unexpectedly small error bars raise concerns about variability and statistical robustness.

      Reply:__We thank the reviewer for raising this topic. Number of samples per experiment is reported in the Methods section. As for the age range, patients age was matched to healthy controls to account for age differences and experiments were performed under similar passages range. This procedure allowed us to control for technical differences between samples that might arise due to different passages and ages. Importantly, the cohort of samples used in this manuscript included GSD1a patients with different ages further implying the strength of the observed disease phenotype found in patients' cells which exists regardless of the different age and gender of patients. The HC samples were chosen to match age and gender and passages were used in the recommended range (L. Hayflick,The limited in vitro lifetime of human diploid cell strains,Experimental Cell Research,Volume 37, Issue 3,1965,Pages 614-636, änzelmann S, Beier F, Gusmao EG, Koch CM, Hummel S, Charapitsa I, Joussen S, Benes V, Brümmendorf TH, Reid G, Costa IG, Wagner W. Replicative senescence is associated with nuclear reorganization and with DNA methylation at specific transcription factor binding sites. Clin Epigenetics. 2015 Mar 4;7(1):19. doi: 10.1186/s13148-015-0057-5. PMID: 25763115; PMCID: PMC4356053., Magalhães, S.; Almeida, I.; Pereira, C.D.; Rebelo, S.; Goodfellow, B.J.; Nunes, A. The Long-Term Culture of Human Fibroblasts Reveals a Spectroscopic Signature of Senescence. Int. J. Mol. Sci. __2022, 23, 5830. https://doi.org/10.3390/ijms23105830). Finally, for the error bars, assuming the reviewer is addressing this for all experiments, this means that results are consistent across each compared group and reflects robustness of the results. Further, to ensure statistical robustness we used bootstrapping, 95% confidence intervals and other statistical methodologies that were designed to increase the validity of the conclusions drawn from different experiments.

      Figure 5- The study should include Tamoxifen-untreated mice as a control to properly assess the efficacy of GHF201 in regulating glucose-6-P and glycogen levels.

      __Reply:__GHF201 reduced liver glucose-6-phosphate (G6P) with p-/-* mice livers and their normalization by GHF201.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      General comments: the authors propose a very intriguing concept, that metabolic abnormalities trigger epigenetic changes in tissues distal from the disease site, even in cells in which the affected gene is not expressed. This is demonstrated in primary fibroblasts from patients with Glycogen Storage Disease type 1a (GSD1a). The authors provide a large amount of data to support the compelling concept of "Disease-Associated Programming", a term that they have coined to describe this effect. The level of novelty is very high and so is the impact of the study, since the above may apply to many different pathological conditions. Although, the study is well performed and employs multiple approaches and analyses to address the raised hypothesis, there are some limitations and concerns that need to be addressed by the authors.

      __Reply:__We thank the reviewer for this comment and will address each comment raised.

      The different phenotypic characteristics are only demonstrated in skin fibroblasts which is not sufficient to support the conclusions made in the Discussion about the general applicability of the proposed disease-induced, metabolite-driven epigenetic programming to all cells and tissues. The authors should discuss this as a limitation of the study and general conclusions should be formulated with more caution.

      __Reply:__We concur with this comment and accept that this is a general limitation of the study. We added a reservation clause at the beginning of the Discussion section.

      The authors describe a range of alterations in patients' fibroblasts as compared to healthy control fibroblasts. However, they draw parallels to the liver which is the organ primarily affected by GSD1a, stating that tissues other than the liver such as skin fibroblasts phenocopy the liver pathology (Discussion). Extrapolation of the findings to the liver is also made in the section "ATAC-seq, RNA-seq and EPIC methylation data integration". Here, the authors comment on the finding that identified genes are associated with tumour formation and draw parallels to hepatocellular carcinoma which is an important co-morbidity of GSD1a. These correlations, although interesting, should be presented as indications and not as "strong links". A major difference between fibroblasts and liver cells in the case of GSD1a is the massive accumulation of glycogen in the liver. This is a major metabolic feature which largely defines the disease's pathology. In addition to the similarities in the pathological features between the liver and other tissues such as fibroblasts, the authors should highlight this major difference and discuss their findings within this context.

      __Reply:__We thank the reviewer for this important comment. We have toned down the language correlating the regulation of gene expression between fibroblasts and liver in GSD1a. We have also alluded to the key metabolic difference between fibroblasts and liver - glycogen levels and turnover - in the second paragraph of the Discussion. We are aware that if our deep analyses were conducted on a different tissue with different basal metabolism the results might have been different. However, the GSD1a-pathogenic findings in fibroblasts suggest that they might also contribute to pathology in situ, perhaps by modulating the expression of functionally redundant genes.

      For basically all experiments performed in the study the authors follow the approach of culturing cells for 48 hours under serum and glucose starvation, followed be 24-hour cultivation in complete medium. This was practiced in a previous study by the authors (PMID: 34486811) to enhance the levels of glycogen in skin fibroblasts of patients with Adult Polyglucosan Body Disease. For the current study the selection of this treatment protocol is not sufficiently justified. Although, differences are described between patients' fibroblasts and controls under these conditions, it would have been interesting to address the reported parameters also at standard culturing conditions. This might be too much to ask for the purposes of this revision, but the authors may provide a better justification for the selection of the above treatment protocol and discuss whether the described phenotypic features are constitutive abnormalities present at all times or are induced by the metabolic stress imposed to the cells through this treatment.

      __Reply:__We thank the reviewer for pointing this important topic. Previously, we used the 72 h condition (48 h starvation followed by 24 h glucose supplementation) to attain two goals: generation of glycogen burden by excessive glucose re-uptake after glucose starvation and induction of basal autophagy by serum starvation so as to sensitize detection of the action of the autophagic activator GHF201 on a background of already induced autophagy. As stated, this 72 h condition was used previously in other GSD cell models (Kakhlon et al (2021) - GSDIV, Mishra et al (2024) - GSDIII, GSDII - in preparation), so we decided to use it in this work as well to enable cross-GSD comparison of GHF201 efficacy in GSD cell models. Moreover, as shown in Figure 1, the largest differences between HC and GSD1a fibroblasts, especially in lysosomal and mitochondrial features, were observed at the 72 h time condition. We therefore used this condition in all other fibroblasts experiments presented in this manuscript. Our ultimate aim was to test whether the metabolic reprograming induced in situ by the patients' diseased state before culturing generates stable epigenetic modifications withstanding seclusion from the original in situ environment. Thus, using the non-physiological 72 h condition, after the fibroblasts were cultured in full media remote from the in situ environment, can only confirm the stability and environment-independence of these metabolically-driven epigenetic modulations. We now provide this justification at the beginning of the Results section.

      In the Figures, the authors provide comparisons between controls and patient fibroblasts (+/- GHF201). Although the authors provide the respective p values in all figures, it is not clear which differences are considered significant and which are not. Since some of the indicated p values are > 0.0. The authors should indicate which of these changes are significant or non-significant and these should be presented and discussed accordingly in the text.

      __Reply:__We thank the reviewer for highlighting this important topic. We will add this information to the methods segment. Throughout the manuscript, p https://doi.org/10.1080/00031305.2018.1529624, Cumming, G. (2013). The New Statistics: Why and How. Psychological Science, 25(1), 7 29. https://doi.org/10.1177/0956797613504966 (Original work published 2014)). Along with the p values we presented all data points in each comparison and added bootstrap mediated 95 % confidence intervals as well. Since our sample size was small, we chose to focus on effect sizes, to use a higher p value threshold and to implement various advanced methodologies that allowed us to find important biological patterns.

      In Figure S2A, the authors show a reduction of glycogen levels in GSD1a fibroblasts following treatment with GHF201. Glycogen accumulation is central to this study, since a) is considered by the authors "a disease marker which is reversed by GHF201" - this is demonstrated in the liver of L.G6pc-/- mice and, according to the authors, replicated in the fibroblasts, b) as suggested by the authors it is the biochemical aberration that drives epigenetic modifications generating "disease memory". It is therefore important to appreciate whether GSD1a cells display pathologically increased levels of glycogen. This is also pertinent to the lack of G6PC expression in fibroblasts. The authors should include in Fig. S2A glycogen measurements of HC control fibroblasts cultured under the same conditions to compare with the levels present in GSD1a cells.

      __Reply:__We thank the reviewer for highlighting this issue. We added glycogen levels of HC to Figure 2SA as requested. Expectedly, glycogen levels are similar between HC and GSD1a fibroblasts because neither wild type G6PC1 in HC, or mutated G6PC1 in GSD1a fibroblasts is expressed. We have now corrected the manuscript text suggesting that glycogen is accumulated in GSD1a fibroblasts and rephrased the text to express the more versatile state where epigenetic modulation could be mediated by different metabolic perturbations according to the expression profile: G6PC1 mutant expressers (notably liver and kidney cells) could inhibit p-AMPK by glycogen accumulation, while non-expressers could inhibit p-AMPK by lowering NAD+. Text changes related to this new concept are found in the Results section "Exploring epigenetics as a phenotypic driver in GSD1a fibroblasts by ATAC-seq analysis" and in the Discussion section "Metabolic-driven, disease-associated programming of cell memory."

      Comparisons between protein levels (AMPK/pAMPK, Sirt1, TFEB, p62 ane PGC1a) are made on the basis of fluorescence intensity in immunostained cells. These results need to be supported by relevant western blot images to exclude that binding of the antibodies to unspecific sites contributes to the measured fluorescence.

      __Reply:__We thank the reviewer for this comment allowing us to clarify the reasoning behind the selected methods for the main markers identification. Throughout the manuscript we employed both Western blot and immunofluorescence experiments. We believe that immunofluorescence present as a more robust and efficient method for the following reasons: i. It allows to focus on proteins in their native state; ii. Immunofluorescence allows to observe proteins in relation to their location in the cells (for example TFs in nuclei area); iii. Immunofluorescence allows to focus on each cell and exclude cells which are dead, stressed or with a low viability characteristic; iv. Immunofluorescence allows to generate much more data. For the following reasons, the main proteins explored in this work we used immunofluorescence, in each immunofluorescence experiment we added a control for the secondary antibody alone, verifying the signal is related to the antibodies only. This information can be added if requested. Importantly, some of the antibodies used were recommended for immunofluorescence and not for Western blot. As the reviewer requested, we now provide western blot results for proteins that produced a signal with the antibodies in Western blots, all markers mentioned except TFEB were added to Figure S3 d.

      The authors demonstrate that treatment of GSD1a fibroblasts with histone deacetylase inhibitors reverses some of the phenotypic alterations. Given that GHF201 also improves these phenotypic differences it would be interesting to address whether GHF201 has any effect on histone acetylation.

      Reply: We strongly agree with this comment and have therfore tested for the effect of GHF201 on H3K27 acetylation levels as shown in Fiugre 3f and on the deacetylase -SIRT-1 as shown in Figure 3e, Figure S3d and representative images in Figure S2b.

      The authors report reduced levels of the transcription factors PGC1α and TFEB in GSD1a fibroblasts. Does this correlate with lower levels of expression of PGC1α and TFEB target genes in the RNA-seq experiments?

      Reply:

      We thank the reviewer for raising this topic, since there were thousands of differentially expressed genes and we cannot mention all we focused on the most important ones that comprise key pathways we wanted to highlight as described in the Results section. We have now linked in the Results section examples of PGC1α and TFEB target genes that were reduced due to lower levels of these transcription factors in GSD1a, as compared to HC cells. Importantly, a full list of the genes from the RNA-seq experiment can be found in Table S3. Genes regulated by TFEB contain the CLEAR (Coordinated Lysosomal Expression and Regulation) motif. Two notable genes regulated by CLEAR binding TFs such as TFEB, which are very important biologically, are cathepsin L and S (Figure 6A right) both of which were reduced in GSD1a and are now elaborated in the Results section referring to Figure 6a right. Additionally, Table S3 shows differentially expressed genes in GSD1a cells where there are many other lysosomal related genes that are downmodulated in GSD1a, we now added another important example, ATP6V0D2 to the Discussion as the reviewer suggested. As for PGC1alpha, a notable gene whose expression is up-modulated by PGC1alpha, which is down-modulated in GSD1a, is ALDH1A1 (Figure 6a right). In addition, we have now added PPARG and its coactivators alpha and beta to the discussion as requested by the reviewer, these genes are shown in Table S3 and are downmodulated in GSD1a. Finally, the transcriptional effect of PGC1alpha and TFEB is also mentioned in the Discussion within the cell phenotyping section, where we describe the deep impact of dysregulation of NAD+/NADH-Sirt-1-TFEB regulatory axis on the cell phenotype at all the levels described in the manuscript.

      Please revise the following sentences as the statements made are not adequately supported by the provided data a. "This NAD+/NADH increase correlated with reduced cytotoxicity and increased cell confluence (Figure 3d) suggesting that NAD+ availability prevails over ATP availability as an effector of cell thriving in GSD1a cells."

      __Reply:__If one ranks treatments according to NAD+/NADH (Figure 3c) and according to cytotoxicity (Figure 3d left) and cell confluence (Figure 3d right), then the mentioned correlation can be supported. ATP availability is compromised by gramicidin, yet gramicidin, which also increased NAD+/NADH, reduced cytotoxicity and enhanced cell confluence.

      b. "....in further support that respiration-dependent NAD+ availability mediate GHF201's corrective effect in GSD1a cells."

      __Reply:__Our data (Figure 3c) show that GHF201 increased NAD+/NADH both alone and with gramicidin.

      Please indicate on the densitometry graph of Fig. 10b the treatment (HDACi), for better visibility.

      __Reply:__We agree and have corrected the Figure as requested.

      The reference list (n=160) is probably too long for a research article.

      __Reply:__The number of references reflect the length and depth of the manuscript and we believe that each reference merits its place. We agree that the number of references is large but we are not sure which criteria to use to exclude some references and to reduce them to a more acceptable number that we assume would be determined by the publishing journal.

      The study is of high novelty and impact, as it proposes a so far undescribed biological mechanism contributing to disease pathology that could apply for general pathological conditions. Although this is a compelling concept, it is only demonstrated in skin fibroblasts which limits its applicability at an organismal level.

      __Reply:__We thank the reviewer for this comment and for raising the important comments that allowed us to improve our manuscript, please see our reply to point 1.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      General comments: the authors propose a vey intriguing concept, that metabolic abnormalities trigger epigenetic changes in tissues distal from the disease site, even in cells in which the affected gene is not expressed. This is demonstrated in primary fibroblasts from patients with Glycogen Storage Disease type 1a (GSD1a). The authors provide a large amount of data to support the compelling concept of "Disease-Associated Programming", a term that they have coined to describe this effect. The level of novelty is very high and so is the impact of the study, since the above may apply to many different pathological conditions. Although, the study is well performed and employs multiple approaches and analyses to address the raised hypothesis, there are some limitations and concerns that need to be addressed by the authors.

      1. The different phenotypic characteristics are only demonstrated in skin fibroblasts which is not sufficient to support the conclusions made in the Discussion about the general applicability of the proposed disease-induced, metabolite-driven epigenetic programming to all cells and tissues. The authors should discuss this as a limitation of the study and general conclusions should be formulated with more caution.
      2. The authors describe a range of alterations in patients' fibroblasts as compared to healthy control fibroblasts. However, they draw parallels to the liver which is the organ primarily affected by GSD1a, stating that tissues other than the liver such as skin fibroblasts phenocopy the liver pathology (Discussion). Extrapolation of the findings to the liver is also made in the section "ATAC-seq, RNA-seq and EPIC methylation data integration". Here, the authors comment on the finding that identified genes are associated with tumour formation and draw parallels to hepatocellular carcinoma which is an important co-morbidity of GSD1a. These correlations, although interesting, should be presented as indications and not as "strong links". A major difference between fibroblasts and liver cells in the case of GSD1a is the massive accumulation of glycogen in the liver. This is a major metabolic feature which largely defines the disease's pathology. In addition to the similarities in the pathological features between the liver and other tissues such as fibroblasts, the authors should highlight this major difference and discuss their findings within this context.
      3. For basically all experiments performed in the study the authors follow the approach of culturing cells for 48 hours under serum and glucose starvation, followed be 24-hour cultivation in complete medium. This was practiced in a previous study by the authors (PMID: 34486811) to enhance the levels of glycogen in skin fibroblasts of patients with Adult Polyglucosan Body Disease. For the current study the selection of this treatment protocol is not sufficiently justified. Although, differences are described between patients' fibroblasts and controls under these conditions, it would have been interesting to address the reported parameters also at standard culturing conditions. This might be too much to ask for the purposes of this revision, but the authors may provide a better justification for the selection of the above treatment protocol and discuss whether the described phenotypic features are constitutive abnormalities present at all times or are induced by the metabolic stress imposed to the cells through this treatment.
      4. In the Figures, the authors provide comparisons between controls and patient fibroblasts (+/- GHF201). Although the authors provide the respective p values in all figures, it is not clear which differences are considered significant and which are not. Since some of the indicated p values are > 0.0. The authors should indicate which of these changes are significant or non-significant and these should be presented and discussed accordingly in the text.
      5. In Figure S2A, the authors show a reduction of glycogen levels in GSD1a fibroblasts following treatment with GHF201. Glycogen accumulation is central to this study, since a) is considered by the authors "a disease marker which is reversed by GHF201" - this is demonstrated in the liver of L.G6pc-/- mice and, according to the authors, replicated in the fibroblasts, b) as suggested by the authors it is the biochemical aberration that drives epigenetic modifications generating "disease memory". It is therefore important to appreciate whether GSD1a cells display pathologically increased levels of glycogen. This is also pertinent to the lack of G6PC expression in fibroblasts. The authors should include in Fig. S2A glycogen measurements of HC control fibroblasts cultured under the same conditions to compare with the levels present in GSD1a cells.
      6. Comparisons between protein levels (AMPK/pAMPK, Sirt1, TFEB, p62 ane PGC1a) are made on the basis of fluorescence intensity in immunostained cells. These results need to be supported by relevant western blot images to exclude that binding of the antibodies to unspecific sites contributes to the measured fluorescence.
      7. The authors demonstrate that treatment of GSD1a fibroblasts with histone deacetylase inhibitors reverses some of the phenotypic alterations. Given that GHF201 also improves these phenotypic differences it would be interesting to address whether GHF201 has any effect on histone acetylation.
      8. The authors report reduced levels of the transcription factors PGC1α and TFEB in GSD1a fibroblasts. Does this correlate with lower levels of expression of PGC1α and TFEB target genes in the RNA-seq experiments?

      Minor points

      1. Please revise the following sentences as the statements made are not adequately supported by the provided data

      a. "This NAD+/NADH increase correlated with reduced cytotoxicity and increased cell confluence (Figure 3d) suggesting that NAD+ availability prevails over ATP availability as an effector of cell thriving in GSD1a cells."

      b. "....in further support that respiration-dependent NAD+ availability mediate GHF201's corrective effect in GSD1a cells." 2. Please indicate on the densitometry graph of Fig. 10b the treatment (HDACi), for better visibility. 3. The reference list (n=160) is probably too long for a research article.

      Significance

      The study is of high novelty and impact, as it proposes a so far undescribed biological mechanism contributing to disease pathology that could apply for general pathological conditions.

      Although this is a compelling concept, it is only demonstrated in skin fibroblasts which limits its applicability at an organismal level.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Major Comments:

      1. Since dietary restriction is the only management strategy for GSD1a, the authors should clarify whether the patient fibroblast donors were on a dietary regimen and for how long. Given that fibroblasts do not express G6Pase, it is possible that the observed phenotype could be influenced by the patient's diet history.
      2. One of the main pathological features of GSD1a is glycogen buildup. The authors should compare glycogen levels between healthy controls and GSD1a fibroblasts and provide a dot plot analysis.
      3. Figure S2A - As mentioned above, the authors should present healthy control vs. patient fibroblast glycogen data. Without this, the rationale for using GHF201 is questionable.
      4. Figure S2B-C - If the authors propose that GHF201 reduces glycogen and increases intracellular glucose in GSD1a fibroblasts, they need direct evidence. Either directly quantifying glycogen levels or even better would be a labeling experiment to confirm that the free intracellular glucose originates from glycogen. Additionally, the reduction in sample size from N=24 in glycogen analysis to N=3 in the glucose assay needs justification.
      5. Figure S2B-C- It is not shown how GHF201 increases intracellular glucose? If glycophagy is a possibility, the authors should do an experiemnt to confirm this.
      6. Figure 2- How can GSD1a fibroblasts have significantly reduced OCR (Fig. 2B) but increased mitochondrial ATP production (Fig. 2H)?
      7. Why do GSD1a fibroblasts show reduced glycolytic ATP (Fig. 2H) despite increased glycolysis and glycolytic capacity (Fig. 2J-K)? The authors should clarify how many healthy control and patient fibroblast lines were compared per experiment. Given the wide age range, the unexpectedly small error bars raise concerns about variability and statistical robustness.
      8. Figure 5- The study should include Tamoxifen-untreated mice as a control to properly assess the efficacy of GHF201 in regulating glucose-6-P and glycogen levels.
      9. Fig. 5B-C - The authors should explain how GHF201 reduces glucose-6-P levels. Additionally, they should demonstrate whether GHF201 activates lysosomal pathways and induces autophagy in the liver of G6pc knockout mice, as claimed in the fibroblast experiments.

      Significance

      The manuscript explores mild physiological and metabolic disturbances in patient-derived fibroblasts lacking G6Pase expression, suggesting that these cells retain a "distinctive disease phenotype" of GSD1a. The manuscript is well written with well designed experiments. However, it remains unclear whether these phenotypes genuinely reflect the pathology of GSD1a-relevant tissues. The authors did not validate these findings in a liver-specific G6pc knockout mouse model, raising concerns about the study's relevance to GSD1a. Additionally, the lack of sufficient in vivo evidence undermines the therapeutic potential of GHF201 for this disease. Overall, the study lacks a few key pieces of evidence to completely justify its conclusions at both fundamental and experimental levels.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We* thank all three Reviewers for appreciating our work and for sharing constructive feedback to further enhance the quality of our work. It is really gratifying to read that the Reviewers believe that this work will be of interest to broad audience and will be suitable for a high profile journal. Further, the experiments suggested by the reviewers will add value to the work and will substantiate our findings. It is important to highlight that we have already performed most of the suggested experiments except a couple of experiments that we have plan to carry out during full revision. Please find below the details of experiments performed and planned to address the reviewers comments. *

      2. Description of the planned revisions

      Reviewer #1

      Comment 6. In Figure 6A, B, does the Orai3 western blot show any of the heavier bands seen in the ubiquitination IP if you show the whole blot? It should.

      Reviewer #2

      Comment 5. Fig. 6A and 6B. Show the full Orai3 and Ubiquitin WBs. As presented the figure current just shows that there are ubiquitin proteins in Orai3 pull down, not that Orai3 is ubiquitinated.

      Reviewer #3

      Comment 3. In the scheme in Fig. 10, the authors highlight that Orai3 is ubiquitinated. Do they have any idea where the site of action of ubiquitination in Orai3 is located?

      Response: We thank the Reviewer 1, 2 and 3 regarding their query on the co-immunoprecipitation assays performed for studying Orai3 ubiquitination. The reviewers are asking for ubiquitination status of Orai3 and the potential sites for Orai3 ubiquitination. To address these comments, we are planning to perform co-immunoprecipitation assays with mutated Orai3 with mutations of potential ubiquitination sites. We have already performed bioinformatic analysis and it revealed presence of three potential ubiquitination sites on Orai3: K2 (present on N-terminal region), K274 and K279 (present on C-terminal region). We would mutate these lysine residues on Orai3 protein via site-directed mutagenesis and check the Orai3 ubiquitination status. These experiments will answer the question raised by Reviewers and strengthen the Orai3 ubiquitination data.

      Please refer to below diagrammatic illustration showing potential ubiquitination sites on Orai3:

      Reviewer #2

      Comment 7. Also, all the imaging and pull down do not prove conclusively direct interaction between MARCH8 and Orai3, they rather show that the proteins are in the same complex. Although it is unlikely best for the text to be moderated accordingly.

      Response: We understand the concern raised by Reviewer 2 regarding direct or indirect interaction of MARCH8 and Orai3. Hence, we are planning to perform co-immunoprecipitation assays in which we delete the MARCH8 interacting domain in Orai3 protein and check the for direct interaction of these proteins. Bioinformatic analysis and literature survey have highlighted two possible MARCH8 interacting domains in Orai3. The first domain is present in 2nd loop region, present between the 2nd and 3rd transmembrane domains at the LMVXXXL (AA113-120) motif and the second domain is present at the GXXXG (AA235-239) motif, present in the 3rd loop region of Orai3. We will remove these domains from Orai3 protein individually and check its effect on MARCH8 interaction. These experiments will provide conclusive evidence of direct interaction between Orai3 and MARCH8.

      Please refer to below diagrammatic illustration displaying potential MARCH8 binding sites on Orai3:

      3. Description of the revisions that have already been incorporated in the transferred manuscript


      Reviewer #1

      Comment 1. The observation that both transcriptional regulation and protein degradation of Orai3 is regulated downstream of one transcription factor is not, in and of itself, entirely surprising. All proteolytic components are transcriptionally regulated and this phenomenon is likely relatively common. However, what I do think is both impressive and important is that the authors have characterized both components of the pathway within a disease context. While I am not going to search the literature for how often transcription and proteolysis are co-regulated for other proteins, it is the case for many short-lived proteins and perhaps many others. As such, discussion throughout the abstract and introduction that co-regulation of these processes is unprecedented should be removed.

      Response: We thank the Reviewer for thinking that our work is both impressive and important. Further, we understand the Reviewer’s point that transcription and proteolysis may be co-regulated for other proteins. However, our extensive literature search did not resulted in such scenarios. Therefore, to best of our knowledge, we are revealing for the first time that same transcription factor regulates both transcription and protein degradation of the same target in a context dependent manner in a single study. In case, Reviewer would still recommend to modify the text in abstract and introduction, we would do it.

      Comment 2. In discussing figure 1, the authors switch from claiming to be studying NFATc binding to studying NFAT expression. This use of 2 different naming conventions is certain to confuse readers; the authors should use the approved current naming system in referring to NFAT isoforms. In which case NFAT2 is NFATc1.

      Response: We would like to thank the Reviewer for highlighting this point. We have effectively addressed this comment by changing the nomenclature of NFAT2 to NFATc1 throughout the manuscript text and figures.

      Comment 3. The ChIP analyses in figures 1H and 7D are important findings, however, there is missing information. Typically, ChIP is used to validate putative binding sites; as such, one would expect 3 separate qPCR reactions for Orai3, not one. It is also important to note that qPCR products should be uniform in size and under 100 bp; here, the product size is not stated. Finally, demonstrating that an antibody targeting ANY other NFAT isoform fails to pull down whatever product this is would increase confidence considerably.

      Also, the gold standard for validating ChIP is to mutate the sites and eliminate binding. The "silver" standard would be to mutate them in your luciferase vector and demonstrate that NFATc1 no longer stimulates luciferase expression. Since neither of these was done, the ChIP data provided should not be considered formally validated.

      Response: We thank the Reviewer for raising this highly relevant concern. In this revised manuscript, we have addressed this comment by performing several additional experiments. The new data provided in the revised manuscript corroborates our earlier results. Indeed, this data has notably strengthen our work.

      In the revised manuscript, we performed ChIP assay where we increased the number of sonication cycles to 35 so as to make sheared chromatin of around 100 bp. Next, we designed primers to amplify individual NFATc1 binding sites on Orai3 promoter, but due to close proximity of the NFATc1 binding sites, we could design two primer sets. The primer first set to amplify the -1017 bp binding site and the second set to amplify the -990 and -920 bp. Further, as suggested by the Reviewer, we performed immunoprecipitation with the four isoforms of NFAT. Our results show that only NFATc1 pulldown shows significant enrichment of Orai3 promoter with both the primer sets as compared to the IP mock samples and other NFAT isoforms (Figure 1J). Hence, our data reveals that only NFATc1 binds to these predicted sites on the Orai3 promoter and it doesn’t show a preference among these binding sites.

      Further, as suggested by the Reviewer, we mutated the Orai3 promoter in luciferase vector with deletions of the individual NFATc1 binding sites and also cloned a truncated Orai3 promoter with no NFATc1 binding sites into the luciferase vector. The luciferase assays with these mutant and truncated promoters show that upon co-expression of NFATc1, the luciferase activity of the mutant Orai3 promoter with deletion of individual NFATc1 binding site is significantly reduced in comparison to wild type Orai3 promoter. Furthermore, the maximum decrease in luciferase activity was seen with the truncated Orai3 promoter with no NFATc1 binding sites (Figure 1I). These results show that NFATc1 binds to the predicted binding sites on Orai3 promoter. Taken together, the additional ChIP assays with the four isoforms of NFAT and luciferase assays with mutated & truncated Orai3 promoters validates the transcriptional regulation of Orai3 by NFATc1.

      Comment 4. In figures 2 and 3, only one cell line is used to represent each of 3 conditions of pancreatic cancer. That is insufficient to make generalized conclusions; some aspects of this figure (expression and stability, not function) should be extended to 2 to 3 cell lines/condition. TCGA data validating this point would also be helpful.

      Response: We really appreciate the feedback given by Reviewer 1. To strengthen our manuscript, we have addressed this comment by performing experiments in 2 cell lines/condition of pancreatic cancer. This new data in the revised manuscript provides substantial evidence for the dichotomous regulation of Orai3 by NFATc1.

      In the revised manuscript, we carried out NFATc1 overexpression and NFAT inhibition via VIVIT studies in three additional cell lines: BXPC-3 (non-metastatic), ASPC-1 (invasive) and SW1990 (metastatic). The results in these cell-lines support our earlier findings as both overexpression of NFATc1 and VIVIT mediated NFAT inhibition leads to transcriptional upregulation of Orai3 in BXPC-3 (non-metastatic) (Figure S3A, D), ASPC-1 (invasive) (Figure S3G, J) and SW1990 (metastatic) (Figure S3M, P). These results are similar to our earlier data from MiaPaCa-2 (non-metastatic), PANC-1 (invasive) and CFPAC-1 (metastatic) cells. Further, NFATc1 overexpression leads to an increase in Orai3 protein levels in BXPC-3 (non-metastatic) (Figure S3B, C) and a decrease in Orai3 protein levels in ASPC-1 (invasive) (Figure S3H, I) and SW1990 (metastatic) (Figure S3N, O). Moreover, VIVIT transfection leads to a decrease in Orai3 protein levels in BXPC-3 (non-metastatic) (Figure S3E, F) and an increase in Orai3 protein levels in ASPC-1 (invasive) (Figure S3K, L) and SW1990 (metastatic) (Figure S3Q, R). The findings in these cell lines recapitulates the data obtained earlier from MiaPaCa-2 (non-metastatic), PANC-1 (invasive) and CFPAC-1 (metastatic) cell lines. Therefore, this new data supports our conclusion regarding the dichotomous regulation of Orai3 by NFATc1 across the three conditions of pancreatic cancer.

      Comment 5. Upon finding that NFAT inhibition stimulates Orai3 transcription (same as O/E), the authors essentially conclude that this confirms regulation of Orai3 by NFAT and that there must be compensation. This is not supported by any data; the use of siRNA validates that Orai3 has some dependence on NFATc1 for transcription, but the nature of this relationship is not adequately explained.

      Response: We thank the Reviewer for asking this question. In our manuscript, we performed NFATc1 inhibition studies using VIVIT and siRNA-mediated NFATc1 knockdown. Both of these assays show increase in Orai3 mRNA levels in all non-metastatic, invasive and metastatic pancreatic cancer cell lines. To understand if the increase in Orai3 mRNA levels is via transcriptional regulation, we performed luciferase assay which showed that VIVIT mediated NFAT inhibition leads to increase in luciferase activity suggesting the binding of other transcription factors on the Orai3 promoter. To corroborate this hypothesis, in our revised manuscript, we performed luciferase assay in wild type Orai3 promoter and truncated Orai3 promoter with no NFATc1 binding sites. NFAT inhibition via VIVIT transfection led to an increase in luciferase activity in both wild type and truncated Orai3 promoter (Figure S2A). Hence, removal of NFATc1 binding sites had no significant effect on luciferase activity suggesting that apart from NFATc1, other endogenous transcription factors are involved in regulating Orai3 transcription. We have not identified all the transcription factors that can modulate Orai3 upon NFAT inhibition as it is beyond the scope of this study. We sincerely hope the Reviewer 1 would be satisfied with this additional data.

      Reviewer #2

      Comment 1. Figure 1 all overexpression no evidence of endogenous NFAT2 regulating Orai3. I realize there may be limitations on available NFAT isoform specific antibodies so it is not essential to directly show this but a comment to that effect in the paper would be useful.

      Response: We apologize to the Reviewer for not highlighting the NFAT2 (NFATc1) loss of function data effectively. Actually, in the __Figure 3 __and __Supplementary Figure 2 __of the original manuscript, we showed VIVIT mediated NFAT inhibition and siRNA induced NFATc1 silencing data to provide the evidence that endogenous NFATc1 regulates Orai3.

      Comment 2. Figure 1F. Show RNA levels of Orai3 following overexpression of the other NFAT isoforms.

      Response: As suggested by the Reviewer, in the revised manuscript, we overexpressed the four NFAT isoforms: NFATc2, NFATc1, NFATc4 & NFATc3 and checked Orai3 mRNA levels. qRT-PCR analysis shows that overexpression of NFATc1 results in the highest and significant increase in Orai3 mRNA levels compared to the empty vector and other NFAT isoforms (Figure 1F). This data corroborates the western blot data of NFAT isoforms overexpression highlighting the transcriptional regulation of Orai3 by NFATc1.

      Comment 3. Fig. S3D, E. For both MARCH3 and 8 higher expression levels correlate with better survival whereas in the text it is stated that this is the case only for MARCH8. Please correct.

      Response: The survival analysis of pancreatic cancer patients with low MARCH3 and MARCH8 levels shows that around 30% of patients with low MARCH3 levels survived for 5.5 years, whereas in case of MARCH8 30% of patients with high MARCH8 levels survived for >7.5 years. Hence high MARCH8 expression in pancreatic cancer patients provided significant survival advantage compared to high MARCH3 levels. Therefore, in the text, we meant that compared to MARCH3, higher MARCH8 levels correlate with better survival. As suggested by the Reviewer, we have modified the text to make this point clearer.

      Comment 4. For the 2APB stimulation experiments there is a large variation in the level of the response between experiments even for the same cell type. For example, compare the level of the 2APB-stimulated Orai3 influx between Fig. 4H and 5C on the MiaPaCa-2 cells. Also there doesn't seem to be a correlation between the levels of Orai3 protein from WB and the 2APB stimulated entry among the different cell lines. This needs to be addressed and differences explained.

      Response: We understand the concern raised by Reviewer 2 regarding calcium imaging experiments in MiaPaCa-2 cell line. Therefore, in the revised manuscript, we repeated calcium imaging experiments in MiaPaCa-2 and updated the representative traces as well as quantitative analysis (Figure 2D, E, 3D, E, 4H, I, S2L, M). Further, we have discussed this point in the text of the manuscript.

      Comment 6. Fig. 6C and 6D. Show the line in 6C from which the intensity profile in 6D was generated. Also give the details of the imaging setup in methods: size of the pinhole, imaging mode, etc. The colocalization is not very convincing.

      Response: As recommended by the Reviewer, in the revised manuscript, we have indicated the region used for intensity profile generation by drawing a line in the representative image (Figure 6D). Further, we have updated the methodology of colocalization microscopy with details of the size of the pinhole and imaging mode.

      Comment 8. May be worth showing that overexpression of MARCH8 in the metastatic cell lines decreases their migration and metastasis as the argument is that these cells need high Orai3 but not too high. So, it would be predicted that overexpression of MARCH8 should lower Orai3 levels enough to prevent their metastasis.

      Response: We would like to thank the Reviewer for this highly relevant suggestion. In our revised manuscript, we carried out transwell migration assays with MARCH8 overexpression as well as MARCH8 knockdown in CFPAC-1 (metastatic) cells. Our data shows that stable lentiviral knockdown of MARCH8 increased the number of migrated CFPAC-1 cells compared to shNT CFPAC-1 cells while MARCH8 overexpression decreased the number of migrated CFPAC-1 cells compared to empty vector control cells (Figure 9F, G). Therefore, as pointed out by the Reviewer, MARCH8 overexpression lowers Orai3 levels in metastatic pancreatic cancer cells and hinders their metastatic potential.

      Comment 9. Fig. 10. Show higher levels of Orai3 protein in the metastatic side.

      Response: As suggested, we have updated the summary figure (Figure 10) showing higher Orai3 protein levels in the metastatic side.

      Comment 10. Please show all full WBs in the supplementary data.

      Response: As recommended by the Reviewer, we have provided all full western blots in a supplementary file (Supplementary File 1).

      Reviewer #3


      Comment 1. The authors show that MARCH8 physically associates with Orai3 using Co-IP and Co-localization studies. For the co-localization studies the authors should still provide a quantitative analysis. Furthermore, can the authors detect FRET between March and Orai3? Can you please state the labels used in the co-localization experiments also in the figure legend.

      Response: As suggested by Reviewer 3, in the revised manuscript, we have provided quantitative analysis of Orai3 and MARCH8 co-localization. Further, we have stated the labels used in the co-localization experiment in the figure legend of the revised manuscript. Unfortunately, we could not perform FRET assay between Orai3 and MARCH8 due to limited resources. Instead, as discussed in the planned revisions section, we are planning to perform co-immunoprecipitation assay with mutated Orai3 protein in which the MARCH8 interacting domains are deleted to investigate direct interaction of Orai3 and MARCH8. We believe that Reviewer 3 will be satisfied with this experiment.

      Comment 2. In the abstract it is only getting clear at the end that pancreatic cancer cells are used. It would be great if the authors could introduce this fact already more at the beginning of the abstract.

      Response: As recommended by the Reviewer, in the revised manuscript, we have introduced the use of pancreatic cancer cells at the beginning of the abstract.

      Comment 4. In other cancer types recent reports suggest a co-expression of Orai1 and Orai3 and even the formation of heteromers. Does only Orai3 or also Orai1 play a role in pancreatic cancer cells? Could there we difference in degradation when Orai3 forms homomers or heteromers with Orai1.

      Response: We thank the reviewer for asking this interesting question. There is only one report on Orai1’s role in pancreatic cancer. It was suggested that Orai1 can contribute to apoptotic resistance of pancreatic cancer cells (Kondratska et al. BBA-Molecular Cell Research, 2014). However, only one cell line i.e. PANC-1 was used in this study. While our earlier work and other studies have demonstrated that Orai3 drives pancreatic cancer metastasis (Arora et al. Cancers, 2021) and proliferation (Dubois et al. BBA-Molecular Cell Research, 2021) respectively. Therefore, emerging literature suggests that both Orai1 and Orai3 can contribute to different aspects of pancreatic cancer progression. But whether Orai1 and Orai3 form heteromers in pancreatic cancer cells remains unexplored. Further, we believe that the degradation machinery and the underlying molecular mechanisms would be analogous for both Orai3 homomers and heteromers. Nonetheless, the rate of degradation may differ for Orai3 homomers and heteromers as literature suggests that usually proteins are more stable in large heteromeric protein complexes.